scispace - formally typeset
Open Access

Mining social media communities and content

Reads0
Chats0
TLDR
This thesis presents a systematic study of the social media landscape through the combined analysis of its special properties, structure and content, and finds that microblogging provides users with a more immediate form of communication to talk about their daily activities and to seek or share information.
Abstract
Social Media is changing the way people find information, share knowledge and communicate with each other The important factor contributing to the growth of these technologies is the ability to easily produce “user-generated content” Blogs, Twitter, Wikipedia, Flickr and YouTube are just a few examples of Web 20 tools that are drastically changing the Internet landscape today These platforms allow users to produce and annotate content and more importantly, empower them to share information with their social network Friends can in turn, comment and interact with the producer of the original content and also with each other Such social interactions foster communities in online, social media systems User-generated content and the social graph are thus the two essential elements of any social media system Given the vast amount of user-generated content being produced each day and the easy access to the social graph, how can we analyze the structure and content of social media data to understand the nature of online communication and collaboration in social applications? This thesis presents a systematic study of the social media landscape through the combined analysis of its special properties, structure and content First, we have developed a framework for analyzing social media content effectively The BlogVox opinion retrieval system is a large scale blog indexing and content analysis engine For a given query term, the system retrieves and ranks blog posts expressing sentiments (either positive or negative) towards the query terms Further, we have developed a framework to index and semantically analyze syndicated1 feeds from news websites We use a sophisticated natural language processing system, OntoSem [163], to semantically analyze news stories and build a rich fact repository of knowledge extracted from real-time feeds It enables other applications to benefit from such deep semantic analysis by exporting the text meaning representations in Semantic Web language, OWL Secondly, we describe novel algorithms that utilize the special structure and properties of social graphs to detect communities in social media Communities are an essential element of social media systems and detecting their structure and membership is critical in several real-world applications Many algorithms for community detection are computationally expensive and generally, do not scale well for large networks In this work we present an approach that benefits from the scale-free distribution of node degrees to extract communities efficiently Social media sites frequently allow users to provide additional meta-data about the shared resources, usually in the form of tags or folksonomies We have developed a new community detection algorithm that can combine information from tags and the structural information obtained from the graphs to effectively detect communities We demonstrate how structure and content analysis in social media can benefit from the availability of rich meta-data and special properties Finally, we study social media systems from the user perspective In the first study we present an analysis of how a large population of users subscribes and organizes the blog feeds that they read This study has revealed interesting properties and characteristics of the way we consume information We are the first to present an approach to what is now known as the “feed distillation” task, which involves finding relevant feeds for a given query term Based on our understanding of feed subscription patterns we have built a prototype system that provides recommendations for new feeds to subscribe and measures the readership-based influence of blogs in different topics We are also the first to measure the usage and nature of communities in a relatively new phenomena called Microblogging Microblogging is a new form of communication in which users can describe their current status in short posts distributed by instant messages, mobile phones, email or the Web In this study, we present our observations of the microblogging phenomena and user intentions by studying the content, topological and geographical properties of such communities We find that microblogging provides users with a more immediate form of communication to talk about their daily activities and to seek or share information The course of this research has highlighted several challenges that processing social media data presents This class of problems requires us to re-think our approach to text mining, community and graph analysis Comprehensive understanding of social media systems allows us to validate theories from social sciences and psychology, but on a scale much larger than ever imagined Ultimately this leads to a better understanding of how we communicate and interact with each other today and in future 1RSS/ATOM

read more

Citations
More filters
Book ChapterDOI

Data Mining in Social Media

TL;DR: This chapter introduces the basics of data mining, reviews social media, discusses how to mine social media data, and highlights some illustrative examples with an emphasis on social networking sites and blogs.
Proceedings ArticleDOI

Degree centrality and eigenvector centrality in twitter

TL;DR: This research applied degree and eigenvector centrality to observe the effect of centrality value for twitter data and shows that there is significant difference among 10 most influential users.
Dissertation

Fostering koinonia : a critical evaluation of the value of digital social networks in urban congregations

TL;DR: Digital social networks appear to provide the church with a unique opportunity to foster true koinonia, despite the limitations of computer-mediated communication.
Dissertation

Ethical communication in the professional practice of public relations in Cape Town, South Africa

TL;DR: Dissertation submitted in fulfilment of the requirements for the degree of Master of Technology: PUBLIC RELATIONS MANAGEMENT in the Faculty of Informatics and Design at the Cape Peninsula University of Technology.
Journal ArticleDOI

Supporting participation in online learning communities with awareness information

TL;DR: The article discusses the importance of why social systems should support the creation, recreation and reinforcement of social norms to better facilitate participation and four design implications of educational technologies supporting participation.
Related Papers (5)
Trending Questions (1)
How are the growing influence of social media and the internet in fostering atheist communities?

The provided paper does not discuss the growing influence of social media and the internet in fostering atheist communities. The paper primarily focuses on analyzing the structure and content of social media data and studying social media systems from a user perspective.