Using Large Language Models for Studying Public Opinion

Project Description

The recent development and large-scale proliferation of large language models (LLMs), such as OpenAI’s GPT or Meta’s Llama, have spurred discussions about the extent to which these language models can be used for research in the social and behavioral sciences. This includes augmenting survey data collection and analysis. Research has started to examine to what extent LLM-generated “synthetic samples” could complement or replace traditional surveys, considering their training data potentially reflects attitudes and behaviors prevalent in the population. However, several contextual factors related to the relationship between the respective target population and LLM training data might limit such applications. In this project, we investigate the extent to which LLMs can estimate public opinion in countries with different digital, social, political, and linguistic settings. By examining the prediction of voting behavior using LLMs in new contexts, our studies contribute to the growing body of research about the conditions under which LLMs can be leveraged for studying public opinion.

Current Application: Predicting the 2024 European Elections with GPT

Contact Person

Dr. Leah von der Heyde

Send an email

More

Project Team

Name	Email
von der Heyde, Leah	leah.vonderheyde@gesis.org
Haensch, Anna-Carolina	anna-carolina.haensch@stat.uni-muenchen.de
Wenz, Alexander
Ma, Bolei	bolei.ma@lmu.de

Publications

von der Heyde, L., Haensch, A.-C., Weiß, B., & Daikeler, J. (in press). Using Large Language Models for Coding German Open-Ended Survey Responses on Survey Motivation. Survey Research Methods.
Preprint available at: https://doi.org/10.48550/ARXIV.2506.14634
von der Heyde, L., Haensch, A.-C., & Wenz, A. (2025). Vox Populi, Vox AI? Using Large Language Models to Estimate German Vote Choice. Social Science Computer Review, 0(0). https://doi.org/10.1177/08944393251337014
von der Heyde, L., Haensch, A., Wenz, A., & Ma. B. (2024). United in Diversity? Contextual Biases in LLM-Based Predictions of the 2024 European Parliament Elections. https://arxiv.org/abs/2409.09045
Bolei Ma, Xinpeng Wang, Tiancheng Hu, Anna-Carolina Haensch, Michael A. Hedderich, Barbara Plank, and Frauke Kreuter. (2024). The Potential and Challenges of Evaluating Attitudes, Opinions, and Values in Large Language Models. In Findings of the Association for Computational Linguistics: EMNLP 2024, pages 8783–8805, Miami, Florida, USA. Association for Computational Linguistics.
Bolei Ma, Berk Yoztyurk, Anna-Carolina Haensch, Xinpeng Wang, Markus Herklotz, Frauke Kreuter, Barbara Plank, and Matthias Aßenmacher. 2025. Algorithmic Fidelity of Large Language Models in Generating Synthetic German Public Opinions: A Case Study. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 1785–1809, Vienna, Austria. Association for Computational Linguistics.

Using Large Language Models for Studying Public Opinion

Project Description

Contact Person

Project Team

Publications

What are you looking for?