Decoding moral responses in AI: A quantitative analysis of large language models

Hui, Bryant Pui Hung; Lau, Chiu Lun; Sun, Rui; LC, RAY; Hendra, Latisha Besariani; Kogan, Aleksandr; Hui, Bryant Pui Hung; Lau, Chiu Lun; Sun, Rui; LC, RAY; Hendra, Latisha Besariani; Kogan, Aleksandr

doi:https://doi.org/10.1016/j.chbr.2025.100854

Data de publicació

2025-12

URI https://hdl.handle.net/20.500.14342/6288

DOI

https://doi.org/10.1016/j.chbr.2025.100854

ISSN

2451-9588

Resum

Despite the proliferation of powerful large language models (LLMs), there remains a need for systematic, quantitative comparisons of their responses to moral dilemmas. While LLMs lack intrinsic capacities for moral evaluation, they can generate texts indistinguishable from human responses—a feature that raises serious moral consequences, particularly in advice-giving contexts. This study builds on advances in AI ethics by systematically comparing the moral response patterns of seven LLMs, including GPT-3, GPT-3.5, GPT-4, GPT-4.1, Claude 3.7 Sonnet, Grok 3, and Gemini 2.5 Pro. Each LLM was presented with a series of moral dilemmas, both personal and impersonal, under three conditions: no rule, a deontological preamble, and a utilitarian preamble. Human and AI-assisted coding were employed to categorize the models’ responses into distinct moral judgments. Logistic regression analyses revealed that LLMs produced patterns consistent with established human biases in moral dilemmas, tending toward more utilitarian moral judgments in impersonal (vs. personal) dilemmas. Despite similar utilitarian moral tendencies under “no rule” and “utilitarian” conditions in most LLMs, the models’ outputs varied significantly under deontological framing, except GPT-4 and Claude 3.7 Sonnet in personal dilemmas. These findings highlight the learning prowess, or “slow thinking,” of LLMs and thus potential ways AI models diverge from human response patterns in morally charged scenarios. Our findings and approach also advocate for the nascent field of “Artificial Intelligence Psychology,” a discipline poised to leverage psychological paradigms for a deeper understanding of AI's outputs and limitations. This insight supports the responsible advancement and application of AI in society.

Tipus de document

Article

Versió del document

Versió publicada

Llengua

Anglès

Paraules clau

Pàgines

14 p.

Publicat per

Elsevier Ltd.

Publicat a

Computers in Human Behavior Reports, Vol. 20, 100854

Mostra el registre complet de l'element

Aquest element apareix en la col·lecció o col·leccions següent(s)

Articles publicats en revistes [343]

Drets

Excepte que s'indiqui una altra cosa, la llicència de l'ítem es descriu com http://creativecommons.org/licenses/by-nc-nd/4.0/