Communication Arts: Faculty Publications
Developing a GPT-Based text Extraction Model for Cancer Information
Document Type
Conference Proceeding
Publication Date
1-18-2024
Publication Title
Proceedings of the 14th International Conference on Cloud Computing, Data Science and Engineering, Confluence 2024
DOI
10.1109/Confluence60223.2024.10463424
ISBN
9798350344837
Abstract
By employing Aristotle's rhetoric as the theoretical framework, the present study aims to develop a model that automatically extracts the three key components of persuasive strategies-ethos (authority), pathos (emotional appeal), and logos (logic)-from answers to pertinent cancer questions on Quora, a social question and answer platform. Furthermore, we apply the model to discrete groups of the most upvoted and random (non-upvoted) answers to compare differences in the three persuasive components. The dataset consists of a total of 103 questions and their corresponding answers, including both upvoted and random answers. It was employed for preliminary findings, comprising a total of 33 questions and answers, with answers to 19 questions used as training data and answers to 14 questions used as test data. We annotated sentences in the answers according to the three types of rhetoric employed. We then fine-tuned models based on Generative Pretrained Transformers (GPT) to classify the phrases, achieving an average F1 score of 0.84. Paired sample t-tests confirmed our research hypotheses regarding ethos and logos, while our hypothesis about pathos was not confirmed. Results suggest that ethos and logos are effective in communicating cancer information to consumers, but that pathos is not.
Recommended Citation
Yi, Yong Jeong, Jaemin Jo, Beom Jun Bae, Hyunwoo Moon, June Yoon, Sanghyuk Lee.
2024.
"Developing a GPT-Based text Extraction Model for Cancer Information."
Proceedings of the 14th International Conference on Cloud Computing, Data Science and Engineering, Confluence 2024: 165-169: Institute of Electrical and Electronics Engineers Inc..
doi: 10.1109/Confluence60223.2024.10463424 isbn: 9798350344837
https://digitalcommons.georgiasouthern.edu/comm-arts-facpubs/86
Copyright
This work is archived and distributed under the repository's Standard Copyright and Reuse License (opens in new tab). End users may copy, store, and distribute this work without restriction. For all other uses, permission must be obtained from the copyright owners or their authorized agents.
Comments
Georgia Southern University faculty member, Beom Jun Bae, co-authored, "Developing a GPT-Based text Extraction Model for Cancer Information."