
The introduction of Transformers, neural networks built on self-attention mechanisms, revolutionized Natural Language Processing by handling long-range dependencies and capturing context effectively. Models such as BERT and GPT, trained on massive text corpora, are at the forefront of Large Language Models and have found widespread use in text classification. Despite their benchmark performance, real-world applications pose challenges, including the need for substantial labeled data and balanced classes. Few-shot learning approaches, such as the Recognizing Textual Entailment (RTE) framework, have emerged to address these issues. RTE identifies the relationship between a text T and a hypothesis H: T entails H if the meaning of H, as interpreted in the context of T, can be inferred from the meaning of T. This study explores an RTE-based framework for classifying vaccine-related news headlines with only 751 labeled data points distributed unevenly across 10 classes. The study evaluates eight models and procedures. The results highlight that deep transfer learning, which combines language and task knowledge as in Transformers and RTE, enables the development of text classification models with superior performance, effectively addressing data scarcity and class imbalance. This approach provides a valuable protocol for creating new text classification models and delivers an advanced automated model for classifying vaccine-related content.
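As an illustrative sketch only (not the authors' exact configuration), an entailment-based classifier can be built by pairing each headline (the text T) with a hypothesis for every candidate class and scoring entailment with a pretrained NLI model. The example below uses the Hugging Face zero-shot-classification pipeline with the public facebook/bart-large-mnli checkpoint; the headline, hypothesis template, and candidate labels are hypothetical and do not correspond to the paper's 10 classes.

```python
# Sketch of RTE/NLI-based headline classification under the assumptions above.
from transformers import pipeline

# Zero-shot classification via entailment: each candidate label is inserted
# into the hypothesis template and scored against the headline as premise.
classifier = pipeline("zero-shot-classification", model="facebook/bart-large-mnli")

headline = "Regulator approves updated booster for adults over 65"  # hypothetical example
candidate_labels = ["vaccine approval", "side effects", "vaccine hesitancy"]  # hypothetical classes

result = classifier(
    headline,
    candidate_labels=candidate_labels,
    hypothesis_template="This news headline is about {}.",
)

# Labels are returned sorted by entailment score.
for label, score in zip(result["labels"], result["scores"]):
    print(f"{label}: {score:.3f}")
```

In a few-shot setting such as the one described here, the same premise-hypothesis formulation can be fine-tuned on the small labeled set rather than used purely zero-shot.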