Grounding the Comparative Turn in Communications: A Framework for Validating Multilingual Computational Text Analysis

Fabienne Lind; Martijn Schoonvelde; Christian Baden; Alona O. Dolinsky; Christian Pipal; Mariken A.C.G. van der Velden

doi:10.5117/CCR2025.1.13.LIND

E-ISSN: 2665-9085

Authors: Fabienne Lind¹, Martijn Schoonvelde², Christian Baden³, Alona O. Dolinsky⁴, Christian Pipal⁵ & Mariken A.C.G. van der Velden⁶

¹ Department of Communication Science, University of Vienna, Austria
² Faculty of Arts, University of Groningen, The Netherlands
³ Department of Communication and Journalism, The Hebrew University of Jerusalem, Israel
⁴ Department of Communication Science, Vrije Universiteit Amsterdam, The Netherlands
⁵ Department of Communication and Media Research, University of Zurich, Switzerland
⁶ Department of Communication Science, Vrije Universiteit Amsterdam, The Netherlands
Publisher: Amsterdam University Press
Source: Computational Communication Research, Volume 7, Issue 1, Nov 2025, p. 1
DOI: https://doi.org/10.5117/CCR2025.1.13.LIND
Language: English

Abstract

Following the progressing internationalisation of social science research and the computational turn in the field, researchers are increasingly adopting computational text analysis (CTA) methods to compare textual data across multiple cases and languages. In these settings, it is not only the mapping between construct and measures that requires validation, but also the equivalence of this mapping across languages and cases. However, although the validation requirements in multilingual analyses exceed those in monolingual studies, current research shows that validation is often insufficiently and inconsistently addressed in comparative multilingual CTA. To support more robust comparative research, this article presents a framework for validating findings obtained from multilingual textual data. The framework outlines validation strategies for four key stages of a typical multilingual CTA workflow: corpus, input data, process, and output. It directly tackles the challenge of approaching equivalence across contexts and languages in these stages and moves beyond the common practice of identifying problems only at the final stage of research.


2025-11-01

2025-12-05


Article Type: Research Article

Keyword(s): comparative research; computational text analysis; cross-lingual; internationalisation; text as data; validation framework
