- Gender Representation in Large Language Models: A Cross-Linguistic and Cross-Model Analysis
- Amsterdam University Press
- Source: Computational Communication Research, Volume 8, Issue 2, Jan 2026, pp. 1-39
Abstract
The representation of gender in large language models (LLMs) can reflect and reinforce existing sociocultural inequalities. However, the nature of such gender biases can differ significantly across languages, influenced by linguistic features and a model’s training data. In this study, we investigate gender representation in 24 open-weight LLMs across six linguistically distinct languages (English, German, Russian, Czech, Albanian, and Serbian). Extending beyond binary frameworks, we incorporate nonbinary individuals as response options and examine associations across psychometrically validated stereotype dimensions (agency, communality, dominance, weakness, and giftedness). Our analysis accounts for variations between and within model families and differences in sampling parameters. The results reveal that traditional gender stereotypes persist with varying degrees of strength, while nonbinary associations show substantial cross-linguistic variations. Temperature analysis demonstrates that such associations are deeply embedded in model parameters rather than being artifacts of sampling procedures. These findings suggest that gender bias identification and potential mitigation in LLMs are shaped by both contextual and technical factors. Overall, our findings challenge the notion that gender bias is a simple, measurable construct, highlighting its complex, context-dependent nature across languages, models, and stereotype dimensions. Effective bias mitigation requires interventions at the level of training data, model architecture, or alignment procedures.