Text Mining Reddit's Top Posts: The Potential Information In Internet-Based Communities
Author
Fernández Pawlukojc, Alex
Other authors
Font Aragonès, Xavier
Publication date
2017
Document Type
Bachelor thesis
Language
English
Subjects and keywords
UPC subject areas::Computer science::Information systems
Data mining
Text Mining
Mineria de dades
Publisher
Universitat Politècnica de Catalunya
Note
Lately, there has been growing interest in the field of "data mining", which involves,
among other things, processing massive amounts of data for a higher purpose. The
results of such processing usually allow the assessment of interesting variables concerning
large pools of people, such as trending fads or the development of political currents. One such
pool is Reddit, a web content aggregator service that has been gaining popularity around
the world over the last few years. A statistical analysis was performed on the
titles that people gave to the most popular content published on Reddit within two specific
timeframes: August 2013 and August 2016. The results show interesting patterns in
the most popular words and combinations of words, which point towards promising results
should further investigation be undertaken.
Rights
http://creativecommons.org/licenses/by-nc-nd/3.0/es/
Open Access