dc.contributor.advisor | Anshus, Otto | |
dc.contributor.advisor | Bjørndalen, John Markus | |
dc.contributor.advisor | Ha, Phoung Hoai | |
dc.contributor.author | Kreutzer, Tor | |
dc.date.accessioned | 2012-06-13T11:36:00Z | |
dc.date.available | 2012-06-13T11:36:00Z | |
dc.date.issued | 2012-06-11 | |
dc.description.abstract | In recent years, social network providers has become one of the largest industries in the world. These networks created a new arena for sharing information over the Internet, and thus changed the way people interact with each other. Hundreds of millions of social network users are updating statuses and sending messages to each other every day. These interactions produce vast amounts of social data. This data is the core of the social network providers business model, and it is sold to large companies to perform personalized advertisement, brand monitoring and viral marketing. The price of this data can be intimidating, and some might be unable or unwilling to pay for it because of its price. If the data was freely available, research that could benefit from this data would be derived more freely, leading to new knowledge.
This thesis presents Harvest, a collaborative system for retrieving social data. Harvest is a peer-to-peer system consisting of contributing social network users, inspired by public resource computing. Harvest shares social network account-bound resources to retrieve large social data sets. Contribution is achieved by running an application on the contributors computer like other public resource computing system such as the @home systems.
The system implements retrieval of data from Twitter. Experiments on real Twitter data show that the system scales with increased contribution. The data retrieval bandwidth per contributing user is quite low, and the number of contributors needed to achieve a considerably large data retrieval bandwidth is high, but there are no associated financial costs with the system. Harvest would benefit greatly by retrieving data from more sources as this would increase its data retrieval bandwidth, in addition to offer more abundant data. | en |
dc.identifier.uri | https://hdl.handle.net/10037/4248 | |
dc.identifier.urn | URN:NBN:no-uit_munin_3963 | |
dc.language.iso | eng | en |
dc.publisher | Universitetet i Tromsø | en |
dc.publisher | University of Tromsø | en |
dc.rights.accessRights | openAccess | |
dc.rights.holder | Copyright 2012 The Author(s) | |
dc.rights.uri | https://creativecommons.org/licenses/by-nc-sa/3.0 | en_US |
dc.rights | Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0) | en_US |
dc.subject.courseID | INF-3990 | en |
dc.subject | VDP::Mathematics and natural science: 400::Information and communication science: 420::Communication and distributed systems: 423 | en |
dc.subject | VDP::Mathematics and natural science: 400::Information and communication science: 420::System development and system design: 426 | en |
dc.subject | VDP::Matematikk og Naturvitenskap: 400::Informasjons- og kommunikasjonsvitenskap: 420::Kommunikasjon og distribuerte systemer: 423 | en |
dc.subject | VDP::Matematikk og Naturvitenskap: 400::Informasjons- og kommunikasjonsvitenskap: 420::Systemutvikling og -arbeid: 426 | en |
dc.title | Harvest : a collaborative system for distributed retrieval of social data | en |
dc.type | Master thesis | en |
dc.type | Mastergradsoppgave | en |