Explaining Social Media Engagement: Interpretable Deep and Machine Learning for Twitter Retweets

Understanding content diffusion on social media is still a wide study challenge. There is no simple answer to the question of why some content is more widely spread than others. To contribute to this, this thesis combines transformer-based, gradient boosting, and NN regression models. These are combined with explainability tools to identify linguistic, topical, and socio-psychological features that drive engagement on Twitter. By combining features based on user-level indicators, sentiment analysis, topic probabilities, and LWIC categories, among others, the thesis aims to provide methods with both accuracy and interpretability. The results obtained aim to not only find which features predict virality but also why they matter and how they are related, allowing a more interpretable understanding of content spread. This is implemented on a wide dataset of climate-change related tweets.

Explaining Social Media Engagement: Interpretable Deep and Machine Learning for Twitter Retweets

SANABRIA VANEGAS, CAMILO ANDRES

2024/2025

Abstract

Understanding content diffusion on social media is still a wide study challenge. There is no simple answer to the question of why some content is more widely spread than others. To contribute to this, this thesis combines transformer-based, gradient boosting, and NN regression models. These are combined with explainability tools to identify linguistic, topical, and socio-psychological features that drive engagement on Twitter. By combining features based on user-level indicators, sentiment analysis, topic probabilities, and LWIC categories, among others, the thesis aims to provide methods with both accuracy and interpretability. The results obtained aim to not only find which features predict virality but also why they matter and how they are related, allowing a more interpretable understanding of content spread. This is implemented on a wide dataset of climate-change related tweets.

Scheda

Scheda DC

	Facoltà/Dipartimento
	
				Dipartimento di Matematica "Tullio Levi-Civita" - DM
			
	Corso di studio
	
				DATA SCIENCE  Laurea Magistrale (D.M. 270/2004)
			
	Anno Accademico
	
				2024
			
	Titolo inglese
	
				Explaining Social Media Engagement: Interpretable Deep and Machine Learning for Twitter Retweets
			
	Abstract in italiano
	
				Understanding content diffusion on social media is still a wide study challenge. There is no simple answer to the question of why some content is more widely spread than others. To contribute to this, this thesis combines transformer-based, gradient boosting, and NN regression models. These are combined with explainability tools to identify linguistic, topical, and socio-psychological features that drive engagement on Twitter. By combining features based on user-level indicators, sentiment analysis, topic probabilities, and LWIC categories, among others, the thesis aims to provide methods with both accuracy and interpretability.  The results obtained aim to not only find which features predict virality but also why they matter and  how they are related, allowing a more interpretable understanding of content spread. This is  implemented on a wide dataset of climate-change related tweets.
			
	Parola chiave
	
				Deep Learning
Interpretability
Engagement
Language
Networks
			
	Relatore
	
				ERSEGHE, TOMASO
			
	Appare nelle tipologie:
	
				Lauree magistrali

File in questo prodotto:

File	Dimensione	Formato
thesis_UNIPD_Camilo_A.pdf accesso aperto Dimensione 24.78 MB Formato Adobe PDF Visualizza/Apri	24.78 MB	Adobe PDF	Visualizza/Apri

The text of this website © Università degli studi di Padova. Full Text are published under a non-exclusive license. Metadata are under a CC0 License

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12608/102134