Submitted:
05 April 2024
Posted:
08 April 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
| Enhanced Example 1 | Enhanced Example 2 | |
|---|---|---|
| Question | How responsive is the screen in ABC model? | Can the sound quality of the phone rival that of a home theater system? |
| Reference Answer | The display responsiveness of my device is subpar | Absolutely, the sound clarity and depth is exceptional |
| Duplicate Q&A (Enriched) | 1) Between XYZ and ABC, which offers a better display experience? Definitely, ABC. | 1) How does the audio fidelity compare to MNO? PQR offers unparalleled sound quality. |
| Reviews | 1) The ABC model performs well overall, but the display is not its strongest suit | 1) The audio output is disappointingly mediocre. |
| 2) Display performance of ABC leaves much to be desired | 2) Lacks the expected audio quality. | |
| Specifications | 1) Display enhancements include...Slim Bezel: 2.05mm, Aspect Ratio: | 1) Audio enhancements feature...Echo Cancellation: Dual-microphone system |
| 2) Color Depth: 16.7 Million | ||
| 3) Screen Size: 6.22 inches |
- Introduction of a novel, comprehensive answer generation framework, CEAGS, that adeptly leverages three pivotal sources of information: product reviews, analogous queries, and detailed specifications, to inform its responses.
- A detailed exploration of the challenges inherent in synthesizing information from these diverse sources, namely the filtration of irrelevant data and the resolution of sentiment ambiguities, and the strategic methodologies employed by CEAGS to address these issues.
- The implementation of a cutting-edge, two-stage process within CEAGS that initially assesses relevance and clarifies sentiment, followed by a sophisticated mechanism for generating coherent and contextually rich answers.
- An exhaustive evaluation of CEAGS, demonstrating its superiority over existing models through comprehensive metrics such as BLEU and ROUGE, and corroborated by human assessment, showcasing an overall accuracy enhancement of 77.88%.
2. Related Work
3. Preliminary
4. Methodology
4.1. Relevancy Detection
4.2. Enhanced Ambiguity Resolution
4.3. Advanced Natural Language Answer Synthesis
4.4. Comprehensive Integration within CEAGS: Elevating E-Commerce Answer Generation
5. Experiments
5.1. Dataset and Model Training
5.2. Baseline Models and Evaluation Metrics
- Seq2Seq Enhanced [41] - An advanced implementation of the sequence-to-sequence model, augmented with attention mechanisms for improved context understanding. This model processes concatenated queries and candidates to produce coherent responses.
- HSSC-q Enhanced - A variant of the HSSC model [52] tailored for question answering, which leverages sentiment analysis alongside answer generation for sentiment-coherent outputs.
- T5-QA Enhanced [33] - A fine-tuned version of the T5 model, specially adapted for answering E-commerce queries. This model exemplifies the fusion of extensive pre-training with task-specific tuning for optimal answer generation.
5.3. Implementation Details
5.3.1. Enhancing Relevancy Prediction
5.3.2. Refining Ambiguity Prediction
5.3.3. Advancing Answer Generation
5.4. Experimental Findings
5.4.1. Evaluating Relevancy Prediction Performance
5.4.2. Answer Generation Pipeline Effectiveness
5.4.3. Insights from Human Evaluation
6. Conclusion and Future Directions
References
- Zijun Sun, Chun Fan, Qinghong Han, Xiaofei Sun, Yuxian Meng, et al. Self-explaining Structures Improve NLP Models. 2020. URL https://arxiv.org/abs/2012.01786.
- Hao Fei, Meishan Zhang, and Donghong Ji. Cross-lingual semantic role labeling with high-quality translated training corpus. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 7014–7026, 2020a.
- A. Golubev and N. Loukachevitch. Transfer Learning for Improving results on Russian Sentiment Datasets. In Computational Linguistics and Intellectual Technologies: Proceedings of the International Conference “Dialog”, pages 268–277, 2021.
- Shengqiong Wu, Hao Fei, Fei Li, Meishan Zhang, Yijiang Liu, Chong Teng, and Donghong Ji. Mastering the explicit opinion-role interaction: Syntax-aided neural transition system for unified opinion role labeling. In Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, pages 11513–11521, 2022.
- Wenxuan Shi, Fei Li, Jingye Li, Hao Fei, and Donghong Ji. Effective token graph modeling using a novel labeling strategy for structured sentiment analysis. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 4232–4241, 2022.
- Hao Fei, Yue Zhang, Yafeng Ren, and Donghong Ji. Latent emotion memory for multi-label emotion classification. In Proceedings of the AAAI Conference on Artificial Intelligence, pages 7692–7699, 2020b.
- Fengqi Wang, Fei Li, Hao Fei, Jingye Li, Shengqiong Wu, Fangfang Su, Wenxuan Shi, Donghong Ji, and Bo Cai. Entity-centered cross-document relation extraction. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pages 9871–9881, 2022.
- Ling Zhuang, Hao Fei, and Po Hu. Knowledge-enhanced event relation extraction via event ontology prompt. Inf. Fusion, 100:101919, 2023.
- Jiangang Bai, Yujing Wang, Yiren Chen, Yaming Yang, Jing Bai, Jing Yu, and Yunhai Tong. Syntax-bert: Improving pre-trained transformers with syntax trees. arXiv preprint arXiv:2103.04350, 2021.
- Hao Fei, Yafeng Ren, and Donghong Ji. Retrofitting structure-aware transformer language model for end tasks. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, pages 2151–2161, 2020c.
- Guang Qiu, Bing Liu, Jiajun Bu, and Chun Chen. Opinion word expansion and target extraction through double propagation. Computational linguistics, 2011; 27.
- Hao Fei, Shengqiong Wu, Jingye Li, Bobo Li, Fei Li, Libo Qin, Meishan Zhang, Min Zhang, and Tat-Seng Chua. Lasuie: Unifying information extraction with latent adaptive structure-aware generative language model. In Proceedings of the Advances in Neural Information Processing Systems, NeurIPS 2022, pages 15460–15475, 2022a.
- Hao Fei, Yafeng Ren, Yue Zhang, Donghong Ji, and Xiaohui Liang. Enriching contextualized language model from knowledge graph for biomedical information extraction. Briefings in Bioinformatics, 2021.
- Shengqiong Wu, Hao Fei, Wei Ji, and Tat-Seng Chua. Cross2StrA: Unpaired cross-lingual image captioning with cross-lingual cross-modal structure-pivoted alignment. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 2593–2608, 2023a.
- Yang Deng, Wenxuan Zhang, and Wai Lam. Opinion-aware answer generation for review-driven question answering in e-commerce, 2020.
- Shiqian Chen, Chenliang Li, Feng Ji, Wei Zhou, and Haiqing Chen. Review-driven answer generation for product-related questions in e-commerce. WSDM ’19, page 411–419, New York, NY, USA, 2019. Association for Computing Machinery. ISBN 9781450359405. [CrossRef]
- Shengqiong Wu, Hao Fei, Leigang Qu, Wei Ji, and Tat-Seng Chua. Next-gpt: Any-to-any multimodal llm. CoRR, abs/2309.05519, 2023b.
- Shen Gao, Zhaochun Ren, Yihong Eric Zhao, Dongyan Zhao, Dawei Yin, and Rui Yan. Product-aware answer generation in e-commerce question-answering. CoRR, abs/1901.07696, 2019. URL http://arxiv.org/abs/1901.07696.
- Shengqiong Wu, Hao Fei, Yafeng Ren, Donghong Ji, and Jingye Li. Learn from syntax: Improving pair-wise aspect and opinion terms extraction with rich syntactic knowledge. In Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, pages 3957–3963, 2021.
- Bobo Li, Hao Fei, Lizi Liao, Yu Zhao, Chong Teng, Tat-Seng Chua, Donghong Ji, and Fei Li. Revisiting disentanglement and fusion on modality and context in conversational multimodal emotion recognition. In Proceedings of the 31st ACM International Conference on Multimedia, MM, pages 5923–5934, 2023a.
- Hao Fei, Qian Liu, Meishan Zhang, Min Zhang, and Tat-Seng Chua. Scene graph as pivoting: Inference-time image-free unsupervised multimodal machine translation with visual scene hallucination. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 5980–5994, 2023a.
- Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, and Illia Polosukhin. Attention is all you need. CoRR, abs/1706.03762, 2017. URL http://arxiv.org/abs/1706.03762.
- Jingye Li, Kang Xu, Fei Li, Hao Fei, Yafeng Ren, and Donghong Ji. MRN: A locally and globally mention-based reasoning network for document-level relation extraction. In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, pages 1359–1370, 2021.
- Hao Fei, Shengqiong Wu, Yafeng Ren, and Meishan Zhang. Matching structure for dual learning. In Proceedings of the International Conference on Machine Learning, ICML, pages 6373–6391, 2022b.
- Hu Cao, Jingye Li, Fangfang Su, Fei Li, Hao Fei, Shengqiong Wu, Bobo Li, Liang Zhao, and Donghong Ji. OneEE: A one-stage framework for fast overlapping and nested event extraction. In Proceedings of the 29th International Conference on Computational Linguistics, pages 1953–1964, 2022.
- Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. BERT: pre-training of deep bidirectional transformers for language understanding. CoRR, abs/1810.04805, 2018. URL http://arxiv.org/abs/1810.04805.
- Zhilin Yang, Zihang Dai, Yiming Yang, Jaime G. Carbonell, Ruslan Salakhutdinov, and Quoc V. Le. Xlnet: Generalized autoregressive pretraining for language understanding. CoRR, abs/1906.08237, 2019. URL http://arxiv.org/abs/1906.08237.
- Shengqiong Wu, Hao Fei, Yixin Cao, Lidong Bing, and Tat-Seng Chua. Information screening whilst exploiting! multimodal relation extraction with feature denoising and multimodal topic modeling. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 14734–14751, 2023c.
- Zhenzhong Lan, Mingda Chen, Sebastian Goodman, Kevin Gimpel, Piyush Sharma, and Radu Soricut. ALBERT: A lite BERT for self-supervised learning of language representations. CoRR, abs/1909.11942, 2019. URL http://arxiv.org/abs/1909.11942.
- Hao Fei, Fei Li, Bobo Li, and Donghong Ji. Encoder-decoder based unified semantic role labeling with label-aware syntax. In Proceedings of the AAAI Conference on Artificial Intelligence, pages 12794–12802, 2021b.
- Bobo Li, Hao Fei, Fei Li, Yuhan Wu, Jinsong Zhang, Shengqiong Wu, Jingye Li, Yijiang Liu, Lizi Liao, Tat-Seng Chua, and Donghong Ji. DiaASQ: A benchmark of conversational aspect-based sentiment quadruple analysis. In Findings of the Association for Computational Linguistics: ACL 2023, pages 13449–13467, 2023b.
- Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, and Veselin Stoyanov. Roberta: A robustly optimized BERT pretraining approach. CoRR, abs/1907.11692, 2019. URL http://arxiv.org/abs/1907.11692.
- Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, and Peter J. Liu. Exploring the limits of transfer learning with a unified text-to-text transformer. CoRR, abs/1910.10683, 2019. URL http://arxiv.org/abs/1910.10683.
- Jianfei Yu, Minghui Qiu, Jing Jiang, Jun Huang, Shuangyong Song, Wei Chu, and Haiqing Chen. Modelling domain relationships for transfer learning on retrieval-based question answering systems in e-commerce. CoRR, abs/1711.08726, 2017. URL http://arxiv.org/abs/1711.08726.
- Lei Cui, Shaohan Huang, Furu Wei, Chuanqi Tan, Chaoqun Duan, and Ming Zhou. SuperAgent: A customer service chatbot for E-commerce websites. In Proceedings of ACL 2017, System Demonstrations, pages 97–102, Vancouver, Canada, 17. Association for Computational Linguistics. URL https://www.aclweb.org/anthology/P17-4017. 20 July.
- Happy Mittal, Aniket Chakrabarti, Belhassen Bayar, Animesh Anant Sharma, and Nikhil Rasiwasia. Distantly supervised transformers for e-commerce product qa, 2021.
- Hao Fei, Shengqiong Wu, Yafeng Ren, Fei Li, and Donghong Ji. Better combine them together! integrating syntactic constituency and dependency representations for semantic role labeling. In Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, pages 549–559, 2021c.
- Shengqiong Wu, Hao Fei, Hanwang Zhang, and Tat-Seng Chua. Imagine that! abstract-to-intricate text-to-image synthesis with scene graph hallucination diffusion. Advances in Neural Information Processing Systems, 36, 2024.
- Hao Fei, Shengqiong Wu, Wei Ji, Hanwang Zhang, and Tat-Seng Chua. Empowering dynamics-aware text-to-video diffusion with large language models. arXiv preprint arXiv:2308.13812, 2023b.
- Leigang Qu, Shengqiong Wu, Hao Fei, Liqiang Nie, and Tat-Seng Chua. Layoutllm-t2i: Eliciting layout guidance from llm for text-to-image generation. In Proceedings of the 31st ACM International Conference on Multimedia, pages 643–654, 2023.
- Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. Neural machine translation by jointly learning to align and translate, 2016.
- Hao Fei, Fei Li, Chenliang Li, Shengqiong Wu, Jingye Li, and Donghong Ji. Inheriting the wisdom of predecessors: A multiplex cascade framework for unified aspect-based sentiment analysis. In Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, IJCAI, pages 4096–4103, 2022c.
- Hao Fei, Yafeng Ren, and Donghong Ji. Boundaries and edges rethinking: An end-to-end neural model for overlapping entity relation extraction. Information Processing & Management, 57(6):102311, 2020d.
- Jingye Li, Hao Fei, Jiang Liu, Shengqiong Wu, Meishan Zhang, Chong Teng, Donghong Ji, and Fei Li. Unified named entity recognition as word-word relation classification. In Proceedings of the AAAI Conference on Artificial Intelligence, pages 10965–10973, 2022.
- Julian McAuley and Alex Yang. Addressing complex and subjective product-related queries with customer reviews. In Proceedings of the 25th International Conference on World Wide Web, WWW ’16, page 625–635, Republic and Canton of Geneva, CHE, 2016. International World Wide Web Conferences Steering Committee. ISBN 9781450341431. [CrossRef]
- Hao Fei, Tat-Seng Chua, Chenliang Li, Donghong Ji, Meishan Zhang, and Yafeng Ren. On the robustness of aspect-based sentiment analysis: Rethinking model, data, and training. ACM Transactions on Information Systems, 41(2):50:1–50:32, 2023c.
- Yu Zhao, Hao Fei, Yixin Cao, Bobo Li, Meishan Zhang, Jianguo Wei, Min Zhang, and Tat-Seng Chua. Constructing holistic spatio-temporal scene graph for video semantic role labeling. In Proceedings of the 31st ACM International Conference on Multimedia, MM, pages 5281–5291, 2023a.
- Hao Fei, Yafeng Ren, Yue Zhang, and Donghong Ji. Nonautoregressive encoder-decoder neural framework for end-to-end aspect-based sentiment triplet extraction. IEEE Transactions on Neural Networks and Learning Systems, 5544.
- Yu Zhao, Hao Fei, Wei Ji, Jianguo Wei, Meishan Zhang, Min Zhang, and Tat-Seng Chua. Generating visual spatial description via holistic 3D scene understanding. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 7960–7977, 2023b.
- Daria Dzendzik, Carl Vogel, and Jennifer Foster. Is it dish washer safe? automatically answering “yes/no” questions using customer reviews. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop, pages 1–6, Minneapolis, Minnesota, June 2019. Association for Computational Linguistics. [CrossRef]
- Hao Fei, Bobo Li, Qian Liu, Lidong Bing, Fei Li, and Tat-Seng Chua. Reasoning implicit sentiment with chain-of-thought prompting. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pages 1171–1182, 2023e.
- Shuming Ma, Xu Sun, Junyang Lin, and Xuancheng Ren. A hierarchical end-to-end model for jointly improving text summarization and sentiment classification. CoRR, abs/1805.01089, 2018. URL http://arxiv.org/abs/1805.01089.
- Thomas Wolf, Lysandre Debut, Victor Sanh, Julien Chaumond, Clement Delangue, Anthony Moi, Pierric Cistac, Tim Rault, Rémi Louf, Morgan Funtowicz, Joe Davison, Sam Shleifer, Patrick von Platen, Clara Ma, Yacine Jernite, Julien Plu, Canwen Xu, Teven Le Scao, Sylvain Gugger, Mariama Drame, Quentin Lhoest, and Alexander M. Rush. Huggingface’s transformers: State-of-the-art natural language processing, 2020.
- Adam Paszke, Sam Gross, Soumith Chintala, Gregory Chanan, Edward Yang, Zachary DeVito, Zeming Lin, Alban Desmaison, Luca Antiga, and Adam Lerer. Automatic differentiation in pytorch. 2017.
| Training Set | Validation Set | |
|---|---|---|
| Number of Queries | 1638 | 362 |
| Total Information Candidates | 15122 | 3268 |
| Relevant Candidates | 8670 | 1736 |
| Average Relevance of Specifications | 0.308 | 0.253 |
| Average Relevance of Q&A | 0.668 | 0.634 |
| Average Relevance of Reviews | 0.626 | 0.573 |
| Training Set | Validation Set | |
|---|---|---|
| Total Number of Queries | 217086 | 11153 |
| Number of WH (Who, What, Where, etc.) Questions | 66075 | 3459 |
| Average Candidates per Query | 9.750 | 9.486 |
| Average Number of Specifications per Query | 2.381 | 2.381 |
| Average Number of Reviews per Query | 2.637 | 2.344 |
| Average Number of Duplicate Questions per Query | 4.732 | 4.762 |
| Dataset | Model |
|---|---|
| Relevancy Dataset (D1) | RoBERTa-Answers |
| BERT-Answers | |
| RoBERTa-QA Enhanced | |
| BERT-QA Enhanced | |
| Answer Generation Dataset (D2) | Seq2Seq Enhanced |
| HSSC-q Enhanced | |
| T5-QA Enhanced |
| Model | Accuracy | Precision | Recall | F1-Score |
|---|---|---|---|---|
| BERT-base | 0.635 | 0.637 | 0.996 | 0.777 |
| RoBERTa-Answers | 0.708 | 0.767 | 0.7778 | 0.772 |
| BERT-Answers | 0.749 | 0.806 | 0.797 | 0.802 |
| RoBERTa-QA Enhanced | 0.764 | 0.832 | 0.789 | 0.810 |
| BERT-QA Enhanced | 0.838 | 0.873 | 0.872 | 0.873 |
| Binary Questions | WH Questions | |||||||
|---|---|---|---|---|---|---|---|---|
| R1 | R2 | RL | B1 | R1 | R2 | RL | B1 | |
| T5-Base | 9.74 | 1.89 | 9.18 | 0.22 | 8.14 | 2.20 | 7.80 | 0.77 |
| Seq2Seq Enhanced | 22.87 | 6.57 | 22.09 | 1.50 | 14.50 | 3.74 | 13.77 | 0.10 |
| HSSC-q Enhanced | 24.19 | 8.65 | 23.46 | 1.91 | 15.43 | 4.85 | 14.68 | 0.90 |
| T5-QA Enhanced | 31.27 | 12.85 | 29.65 | 5.48 | 22.69 | 8.77 | 20.71 | 3.21 |
| CEAGS (Relevancy Only) | 31.17 | 12.79 | 29.58 | 5.41 | 22.48 | 8.47 | 20.60 | 3.18 |
| CEAGS (Full Pipeline) | 30.47 | 12.16 | 28.88 | 5.18 | 22.64 | 8.73 | 20.85 | 3.28 |
| Query: Is this device equipped with gorilla glass protection? |
|---|
| Reference Answer: No, additional protection is advised |
| T5-Base: Does it have gorilla glass? |
| Seq2Seq Enhanced: No, it comes with gorilla glass |
| HSSC-q Enhanced: Yes, it features gorilla glass 3 |
| T5-QA Enhanced: No, it lacks gorilla glass. |
| CEAGS (Relevancy): No gorilla glass is included. |
| CEAGS (Full Pipeline): No, gorilla glass is not a feature. |
| Query: How does the ABC game perform in terms of graphics and gameplay? |
| Reference Answer: HD mode runs smoothly, but HDR is too demanding for this model |
| T5-Base: Performance is lacking |
| Seq2Seq Enhanced:ABC can be played with basic settings |
| HSSC-q Enhanced: HD gameplay is possible on ABC |
| T5-QA Enhanced: Smooth gameplay and high-quality graphics are noted on ABC. |
| CEAGS (Relevancy):ABC provides excellent performance on high settings. |
| CEAGS (Full Pipeline):ABC achieves fluid gameplay in HD graphics. |
| Query: How is the sound quality and are there any heating issues? |
| Reference Answer: Sound is impressive with no heating concerns |
| T5-Base: Device overheats |
| Seq2Seq Enhanced: Good sound quality |
| HSSC-q Enhanced: High sound quality |
| T5-QA Enhanced: Sound quality is satisfactory, though heating can be an issue |
| CEAGS (Relevancy): Excellent sound, minimal heating detected |
| CEAGS (Full Pipeline): Sound quality is exceptional with no heating problems |
| Binary Questions | WH Questions | |||
|---|---|---|---|---|
| Correct vs. Context | Correct vs. Reference | Correct vs. Context | Correct vs. Reference | |
| T5-QA Enhanced | 0.919 | 0.628 | 0.833 | 0.537 |
| CEAGS (Full Pipeline) | 0.943 | 0.845 | 0.869 | 0.656 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).