Open Access
Mining social media communities and content
Tim Finin, Akshay Java +1 more
TLDR
This thesis presents a systematic study of the social media landscape through the combined analysis of its special properties, structure and content, and finds that microblogging provides users with a more immediate form of communication to talk about their daily activities and to seek or share information.
Abstract:
Social media is changing the way people find information, share knowledge and communicate with each other. The important factor contributing to the growth of these technologies is the ability to easily produce “user-generated content”. Blogs, Twitter, Wikipedia, Flickr and YouTube are just a few examples of Web 2.0 tools that are drastically changing the Internet landscape today. These platforms allow users to produce and annotate content and, more importantly, empower them to share information with their social network. Friends can, in turn, comment and interact with the producer of the original content and also with each other. Such social interactions foster communities in online social media systems. User-generated content and the social graph are thus the two essential elements of any social media system.
Given the vast amount of user-generated content being produced each day and the easy access to the social graph, how can we analyze the structure and content of social media data to understand the nature of online communication and collaboration in social applications? This thesis presents a systematic study of the social media landscape through the combined analysis of its special properties, structure and content.
First, we have developed a framework for analyzing social media content effectively. The BlogVox opinion retrieval system is a large-scale blog indexing and content analysis engine. For a given query term, the system retrieves and ranks blog posts expressing sentiments (either positive or negative) towards the query term. Further, we have developed a framework to index and semantically analyze syndicated¹ feeds from news websites. We use a sophisticated natural language processing system, OntoSem [163], to semantically analyze news stories and build a rich fact repository of knowledge extracted from real-time feeds. It enables other applications to benefit from such deep semantic analysis by exporting the text meaning representations in the Semantic Web language OWL.
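The retrieve-and-rank step described above can be illustrated with a minimal sketch. It is not BlogVox's actual scoring, which the abstract does not describe. It assumes a tiny hand-made sentiment lexicon (`POSITIVE`, `NEGATIVE`) and a hypothetical score equal to the number of query-term matches multiplied by the number of opinion words in the post.

```python
import re

# Hypothetical toy lexicon; a real system would use a much larger resource.
POSITIVE = {"love", "great", "excellent"}
NEGATIVE = {"hate", "awful", "terrible"}


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def opinion_score(post, query):
    """Score = (query term matches) * (opinion words), an assumed heuristic."""
    tokens = tokenize(post)
    relevance = tokens.count(query.lower())
    opinion = sum(1 for t in tokens if t in POSITIVE or t in NEGATIVE)
    return relevance * opinion


def rank_posts(posts, query):
    """Return posts that are both on-topic and opinionated, best first."""
    scored = [(opinion_score(p, query), p) for p in posts]
    ranked = sorted(scored, key=lambda sp: -sp[0])  # stable sort keeps input order on ties
    return [p for s, p in ranked if s > 0]


posts = [
    "The iPhone ships in June",
    "iPhone battery is terrible",
    "I love the iPhone, great screen",
    "I hate this awful weather",
]
print(rank_posts(posts, "iphone"))
```

Posts that mention the query without any opinion word, or that are opinionated but off-topic, score zero and are dropped.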
Secondly, we describe novel algorithms that utilize the special structure and properties of social graphs to detect communities in social media. Communities are an essential element of social media systems, and detecting their structure and membership is critical in several real-world applications. Many algorithms for community detection are computationally expensive and generally do not scale well for large networks. In this work we present an approach that benefits from the scale-free distribution of node degrees to extract communities efficiently. Social media sites frequently allow users to provide additional meta-data about the shared resources, usually in the form of tags or folksonomies. We have developed a new community detection algorithm that can combine information from tags and the structural information obtained from the graphs to effectively detect communities. We demonstrate how structure and content analysis in social media can benefit from the availability of rich meta-data and special properties.
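As a rough illustration of combining tag information with link structure, the sketch below scores each link as 1 plus a weighted Jaccard similarity of the two endpoints' tag sets, drops links below a threshold, and reports the remaining connected components as communities. This is a deliberately simple stand-in, not the algorithm developed in the thesis. The function names and the `alpha` and `threshold` parameters are assumptions made for the example.

```python
from collections import defaultdict


def jaccard(a, b):
    """Jaccard similarity of two tag sets (0.0 when both are empty)."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def detect_communities(edges, tags, alpha=1.0, threshold=1.5):
    """Keep links whose combined structure + tag score passes the threshold,
    then return the connected components of what remains."""
    nodes = set(tags)
    for u, v in edges:
        nodes.update((u, v))

    adj = defaultdict(set)
    for u, v in edges:
        # 1.0 for the link itself plus a tag-similarity bonus (assumed weighting).
        score = 1.0 + alpha * jaccard(tags.get(u, set()), tags.get(v, set()))
        if score >= threshold:
            adj[u].add(v)
            adj[v].add(u)

    seen, communities = set(), []
    for start in sorted(nodes):
        if start in seen:
            continue
        seen.add(start)
        stack, component = [start], set()
        while stack:  # depth-first traversal of one component
            node = stack.pop()
            component.add(node)
            for neighbour in adj[node]:
                if neighbour not in seen:
                    seen.add(neighbour)
                    stack.append(neighbour)
        communities.append(component)
    return communities


edges = [("a", "b"), ("b", "c"), ("c", "d"), ("d", "e"), ("e", "f")]
tags = {
    "a": {"ml", "python"}, "b": {"ml"}, "c": {"ml", "nlp"},
    "d": {"food"}, "e": {"food", "travel"}, "f": {"travel"},
}
print(detect_communities(edges, tags))
```

In this toy graph the link between `c` and `d` joins two nodes with no tags in common, so it is cut and the chain splits into two groups.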
Finally, we study social media systems from the user perspective. In the first study we present an analysis of how a large population of users subscribes to and organizes the blog feeds that they read. This study has revealed interesting properties and characteristics of the way we consume information. We are the first to present an approach to what is now known as the “feed distillation” task, which involves finding relevant feeds for a given query term. Based on our understanding of feed subscription patterns, we have built a prototype system that provides recommendations for new feeds to subscribe to and measures the readership-based influence of blogs in different topics.
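The two outputs of such a prototype, per-topic influence and feed recommendations, can be sketched under simple assumptions. The example below assumes subscription data shaped as `{user: {topic: [feeds]}}`, counts distinct readers per feed within each topic as a proxy for influence, and recommends feeds read by users with overlapping subscriptions. The data layout, function names, and scoring are hypothetical and are not taken from the thesis.

```python
from collections import Counter, defaultdict


def topic_influence(subscriptions):
    """Count distinct readers of each feed within each topic."""
    influence = defaultdict(Counter)
    for folders in subscriptions.values():
        for topic, feeds in folders.items():
            for feed in set(feeds):  # a user counts once per feed and topic
                influence[topic][feed] += 1
    return influence


def recommend(subscriptions, user, k=3):
    """Suggest unread feeds, weighted by overlap with other users' subscriptions."""
    mine = {f for feeds in subscriptions[user].values() for f in feeds}
    scores = Counter()
    for other, folders in subscriptions.items():
        if other == user:
            continue
        theirs = {f for feeds in folders.values() for f in feeds}
        overlap = len(mine & theirs)
        if overlap == 0:
            continue
        for feed in theirs - mine:
            scores[feed] += overlap
    # Highest score first; ties broken alphabetically for a stable result.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [feed for feed, _ in ranked[:k]]


subs = {
    "u1": {"tech": ["feedA", "feedB"], "news": ["feedC"]},
    "u2": {"tech": ["feedA", "feedD"]},
    "u3": {"tech": ["feedB", "feedD"], "politics": ["feedC", "feedE"]},
}
print(topic_influence(subs)["tech"])
print(recommend(subs, "u1"))
```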
We are also the first to measure the usage and nature of communities in a relatively new phenomenon called microblogging. Microblogging is a new form of communication in which users can describe their current status in short posts distributed by instant messages, mobile phones, email or the Web. In this study, we present our observations of the microblogging phenomenon and user intentions by studying the content, topological and geographical properties of such communities. We find that microblogging provides users with a more immediate form of communication to talk about their daily activities and to seek or share information.
The course of this research has highlighted several challenges that processing social media data presents. This class of problems requires us to rethink our approach to text mining, community and graph analysis. Comprehensive understanding of social media systems allows us to validate theories from social sciences and psychology, but on a scale much larger than ever imagined. Ultimately this leads to a better understanding of how we communicate and interact with each other today and in the future.
¹ RSS/Atom
Citations
Book Chapter
Data Mining in Social Media
Geoffrey Barbier, Huan Liu +1 more
TL;DR: This chapter introduces the basics of data mining, reviews social media, discusses how to mine social media data, and highlights some illustrative examples with an emphasis on social networking sites and blogs.
Proceedings Article
Degree centrality and eigenvector centrality in twitter
TL;DR: This research applied degree and eigenvector centrality to Twitter data to observe the effect of the centrality value, and shows that there is a significant difference among the 10 most influential users.
Dissertation
Fostering koinonia: a critical evaluation of the value of digital social networks in urban congregations
TL;DR: Digital social networks appear to provide the church with a unique opportunity to foster true koinonia, despite the limitations of computer-mediated communication.
Dissertation
Ethical communication in the professional practice of public relations in Cape Town, South Africa
TL;DR: Dissertation submitted in fulfilment of the requirements for the degree of Master of Technology: PUBLIC RELATIONS MANAGEMENT in the Faculty of Informatics and Design at the Cape Peninsula University of Technology.
Journal Article
Supporting participation in online learning communities with awareness information
Stefan Nilsson, Lars Svensson +1 more
TL;DR: The article discusses why social systems should support the creation, recreation and reinforcement of social norms to better facilitate participation, and four design implications for educational technologies supporting participation.