Algorithmic personalization is difficult to approach because it entails studying many different user experiences, with a lot of variables outside of our control. Two common biases are frequent in experiments: relying on corporate service API and using synthetic profiles with small regards of regional and individualized profiling and personalization. In this work, we present the result of the first crowdsourced data collections of YouTube's recommended videos via YouTube Tracking Exposed (YTTREX). Our tool collects evidence of algorithmic personalization via an HTML parser, anonymizing the users. In our experiment we used a BBC video about COVID-19, taking into account 5 regional BBC channels in 5 different languages and we saved the recommended videos that were shown during each session. Each user watched the first five second of the videos, while the extension captured the recommended videos. We took into account the top-20 recommended videos for each completed session, looking for evidence of algorithmic personalization. Our results showed that the vast majority of videos were recommended only once in our experiment. Moreover, we collected evidence that there is a significant difference between the videos we could retrieve using the official API and what we collected with our extension. These findings show that filter bubbles exist and that they need to be investigated with a crowdsourced approach.

YTTREX: crowdsourced analysis of YouTube’s recommender system during COVID-19 pandemic / Sanna, Leonardo; Romano, Salvatore; Corona, Giulia; Agosti, Claudio. - (2020). (Intervento presentato al convegno SIMBig 2020 - 7th International Conference on Information Management and Big Data tenutosi a Online nel 1-3 October 2020).

YTTREX: crowdsourced analysis of YouTube’s recommender system during COVID-19 pandemic

Leonardo Sanna
Formal Analysis
;
2020

Abstract

Algorithmic personalization is difficult to approach because it entails studying many different user experiences, with a lot of variables outside of our control. Two common biases are frequent in experiments: relying on corporate service API and using synthetic profiles with small regards of regional and individualized profiling and personalization. In this work, we present the result of the first crowdsourced data collections of YouTube's recommended videos via YouTube Tracking Exposed (YTTREX). Our tool collects evidence of algorithmic personalization via an HTML parser, anonymizing the users. In our experiment we used a BBC video about COVID-19, taking into account 5 regional BBC channels in 5 different languages and we saved the recommended videos that were shown during each session. Each user watched the first five second of the videos, while the extension captured the recommended videos. We took into account the top-20 recommended videos for each completed session, looking for evidence of algorithmic personalization. Our results showed that the vast majority of videos were recommended only once in our experiment. Moreover, we collected evidence that there is a significant difference between the videos we could retrieve using the official API and what we collected with our extension. These findings show that filter bubbles exist and that they need to be investigated with a crowdsourced approach.
2020
SIMBig 2020 - 7th International Conference on Information Management and Big Data
Online
1-3 October 2020
Sanna, Leonardo; Romano, Salvatore; Corona, Giulia; Agosti, Claudio
YTTREX: crowdsourced analysis of YouTube’s recommender system during COVID-19 pandemic / Sanna, Leonardo; Romano, Salvatore; Corona, Giulia; Agosti, Claudio. - (2020). (Intervento presentato al convegno SIMBig 2020 - 7th International Conference on Information Management and Big Data tenutosi a Online nel 1-3 October 2020).
File in questo prodotto:
File Dimensione Formato  
24-SNMAM.pdf

Open access

Descrizione: Pre-print of the final version, publication in progress
Tipologia: Versione dell'autore revisionata e accettata per la pubblicazione
Dimensione 1.49 MB
Formato Adobe PDF
1.49 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

Licenza Creative Commons
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/1234304
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact