The amount of information that can be gathered from social networks may be useful in contexts ranging from marketing to intelligence. In this paper, we describe the three main techniques for data acquisition in social networks, the conditions under which they can be applied, and the open problems. We then focus on the main issues that crawlers have to address to get data from social networks, and we propose a novel solution that exploits the cloud computing paradigm for crawling. The proposed crawler is modular by design and relies on a large number of distributed nodes and on the MapReduce framework to speed up the data collection process from large social networks.

Data Acquisition in Social Networks: Issues and Proposals / Canali, Claudia; Colajanni, Michele; Lancellotti, Riccardo. - ELECTRONIC. - (2011), pp. n/a-n/a. (Paper presented at the conference Services and Open Source - SOS’2011 - co-located with EGC 2011, held in Brest, FR, on 25/1/2011).

Data Acquisition in Social Networks: Issues and Proposals

CANALI, Claudia; COLAJANNI, Michele; LANCELLOTTI, Riccardo
2011

Use this identifier to cite or link to this document: https://hdl.handle.net/11380/648455