The amount of information that is possible to gather from social networks may be useful to different contexts ranging from marketing to intelligence. In this paper, we describe the three main techniques for data acquisition in social networks, the conditions under which they can be applied, and the open problems.We then focus on the main issues that crawlers have to address for getting data from social networks, and we propose a novel solution that exploits the cloud computing paradigm for crawling. The proposed crawler is modular by design and relies on a large number of distributed nodes and on the MapReduce framework to speedup the data collection process from large social networks.
Data Acquisition in Social Networks: Issues and Proposals / Canali, Claudia; Colajanni, Michele; Lancellotti, Riccardo. - ELETTRONICO. - (2011), pp. n/a-n/a. (Intervento presentato al convegno Services and Open Source - SOS’2011 - co-located with EGC 2011 tenutosi a Brest, FR nel 25/1/2011).