COVID-19 Outbreak through Tweeters’ Words: Monitoring Italian Social Media Communication about COVID-19 with Text Mining and Word Embeddings

Sciandra, A.

doi:10.1109/ISCC50000.2020.9219595

In this paper we aim to analyze the Italian social media communication about COVID-19 through a Twitter dataset collected in two months. The text corpus had been studied in terms of sensitivity to the social changes that are affecting people's lives in this crisis. In addition, the results of a sentiment analysis performed by two lexicons were compared and word embedding vectors were created from the available plain texts. Following we tested the informative effectiveness of word embeddings and compared them to a bag-of-words approach in terms of text classification accuracy. First results showed a certain potential of these textual data in the description of the different phases of the outbreak. However, a different strategy is needed for a more reliable sentiment labeling, as the results proposed by the two lexicons were discordant. Finally, although presenting interesting results in terms of semantic similarity, word embeddings did not show a predictive ability higher than the frequency vectors of the terms.

COVID-19 Outbreak through Tweeters’ Words: Monitoring Italian Social Media Communication about COVID-19 with Text Mining and Word Embeddings / Sciandra, A.. - 2020-:(2020), pp. 1004-1009. (2020 IEEE Symposium on Computers and Communications, ISCC 2020 Rennes (virtual) July 7th, 2020) [10.1109/ISCC50000.2020.9219595].

COVID-19 Outbreak through Tweeters’ Words: Monitoring Italian Social Media Communication about COVID-19 with Text Mining and Word Embeddings

Sciandra A.

2020

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2020
			
	Titolo del Convegno
	
				2020 IEEE Symposium on Computers and Communications, ISCC 2020
			
	Luogo del Convegno
	
				Rennes (virtual)
			
	Data del Convegno
	
				July 7th, 2020
			
	Codice DOI
	
				https://dx.doi.org/10.1109/ISCC50000.2020.9219595
			
	Codice WoS
	
				WOS:000652570900159
			
	Codice Scopus
	
				2-s2.0-85094144501
			
	Serie
	
				PROCEEDINGS - IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS
			
	N° del Volume
	
				2020-
			
	Pagina iniziale
	
				1004
			
	Pagina finale
	
				1009
			
	Tutti gli autori
	
						Sciandra, A.
					
	Citazione
	
				COVID-19 Outbreak through Tweeters’ Words: Monitoring Italian Social Media Communication about COVID-19 with Text Mining and Word Embeddings / Sciandra, A.. - 2020-:(2020), pp. 1004-1009. (2020 IEEE Symposium on Computers and Communications, ISCC 2020 Rennes (virtual) July 7th, 2020) [10.1109/ISCC50000.2020.9219595].
			
	Tipologia
	
				Relazione in Atti di Convegno

File in questo prodotto:

File	Dimensione	Formato
PID6483597.pdf Open access Tipologia: VOR - Versione pubblicata dall'editore Dimensione 450.22 kB Formato Adobe PDF Visualizza/Apri	450.22 kB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris