Supplementary data for the paper 'Can ChatGPT be used to predict citation counts, readership, and social media interaction? An exploration among 2222 scientific abstracts'

doi:10.4121/710585da-ed2e-4d36-b8e4-ad02c3af1e65.v1
The doi above is for this specific version of this dataset, which is currently the latest. Newer versions may be published in the future. For a link that will always point to the latest version, please use
doi: 10.4121/710585da-ed2e-4d36-b8e4-ad02c3af1e65
Datacite citation style:
de Winter, Joost (2024): Supplementary data for the paper 'Can ChatGPT be used to predict citation counts, readership, and social media interaction? An exploration among 2222 scientific abstracts'. Version 1. 4TU.ResearchData. dataset. https://doi.org/10.4121/710585da-ed2e-4d36-b8e4-ad02c3af1e65.v1
Other citation styles (APA, Harvard, MLA, Vancouver, Chicago, IEEE) available at Datacite
Dataset

This study explores the potential of ChatGPT, a large language model, in scientometrics by assessing its ability to predict citation counts, Mendeley readers, and social media engagement. In this study, 2222 abstracts from PLOS ONE articles published during the initial months of 2022 were analyzed using ChatGPT-4, which used a set of 60 criteria to assess each abstract. Using a principal component analysis, three components were identified: Quality and Reliability, Accessibility and Understandability, and Novelty and Engagement. The Accessibility and Understandability of the abstracts correlated with higher Mendeley readership, while Novelty and Engagement and Accessibility and Understandability were linked to citation counts (Dimensions, Scopus, Google Scholar) and social media attention. Quality and Reliability showed minimal correlation with citation and altmetrics outcomes. Finally, it was found that the predictive correlations of ChatGPT-based assessments surpassed traditional readability metrics. The findings highlight the potential of large language models in scientometrics and possibly pave the way for AI-assisted peer review.

history
  • 2024-01-05 first online, published, posted
publisher
4TU.ResearchData
format
scripts/m; data/mat; data/xlsx; instructions/docx; prompt/txt
organizations
TU Delft, Faculty of Mechanical Engineering, Department of Cognitive Robotics

DATA

files (1)