LLMs outperform outsourced human coders on complex textual analysis
- PMID: 41249236
- PMCID: PMC12623721
- DOI: 10.1038/s41598-025-23798-y
LLMs outperform outsourced human coders on complex textual analysis
Abstract
This paper evaluates the effectiveness of large language models (LLMs) in extracting complex information from text data. Using a corpus of Spanish news articles, we compare how accurately various LLMs and outsourced human coders reproduce expert annotations on five natural language processing tasks, ranging from named entity recognition to identifying nuanced political criticism in news articles. We find that LLMs consistently outperform outsourced human coders, particularly in tasks requiring deep contextual understanding. These findings suggest that current LLM technology offers researchers without programming expertise a cost-effective alternative for sophisticated text analysis.
© 2025. The Author(s).
Conflict of interest statement
Declarations. Competing interests: The authors declare no competing interests.
Figures
References
-
- Barberá, P., Boydstun, A. E., Linn, S., McMahon, R. & Nagler, J. Automated text classification of news articles: A practical guide. Political Anal.29, 19–42. 10.1017/pan.2020.8 (2021). - DOI
-
- Gentzkow, M., Kelly, B. & Taddy, M. Text as data. J. Econ. Lit.57, 535–74. 10.1257/jel.20181020 (2019). - DOI
-
- Song, H. et al. In validations we trust? the impact of imperfect human annotations as a gold standard on the quality of validation of automated content analysis. Political Commun.37, 550–572. 10.1080/10584609.2020.1723752 (2020). - DOI
-
- Grimmer, J. & Stewart, B. M. Text as data: The promise and pitfalls of automatic content analysis methods for political texts. Political Anal.21, 267–297. 10.1093/pan/mps028 (2013). - DOI
MeSH terms
LinkOut - more resources
Full Text Sources
