Augmenting Paraphrase Generation with Syntax Information using Graph Convolutional Networks

Xiaoqiang Chi; Yang Xiang

doi:10.20944/preprints202103.0754.v1

Submitted:

30 March 2021

Posted:

31 March 2021

You are already at the latest version

Abstract

Paraphrase generation is an important yet challenging task in NLP. Neural network-based approaches have achieved remarkable success in sequence-to-sequence(seq2seq) learning. Previous paraphrase generation work generally ignores syntactic information regardless of its availability, with the assumption that neural nets could learn such linguistic knowledge implicitly. In this work we make an endeavor to probe into the efficacy of explicit syntactic information for the task of paraphrase generation. Syntactic information can appear in the form of dependency trees which could be easily acquired from off-the-shelf syntactic parsers. Such tree structures could be conveniently encoded via graph convolutional networks(GCNs) to obtain more meaningful sentence representations, which could improve generated paraphrases. Through extensive experiments on four paraphrase datasets with different sizes and genres, we demonstrate the utility of syntactic information in neural paraphrase generation under the framework of seq2seq modeling. Specifically, our GCN-enhanced models consistently outperform their syntax-agnostic counterparts in multiple evaluation metrics.

Keywords:

paraphrase generation

;

syntax information

;

Graph Convolutional Network

;

sequence-to-sequence

Subject:

Computer Science and Mathematics - Computer Vision and Graphics

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

Augmenting Paraphrase Generation with Syntax Information using Graph Convolutional Networks

Abstract

Keywords:

Subject:

MDPI Initiatives

Important Links

Subscribe