
Showing papers by "Sameep Mehta published in 2015"


Journal ArticleDOI
TL;DR: The framework supports the navigation problem for the blind by combining the advantages of real-time localization technologies so that the user is made aware of the world, a necessity for independent travel.
Abstract: This paper lays the groundwork for assistive navigation using wearable sensors and social sensors to foster situational awareness for the blind. Our system acquires social media messages to gauge the relevant aspects of an event and to create alerts. We propose social semantics that captures the parameters required for querying and reasoning an event-of-interest, such as what, where, who, when, severity, and action from the Internet of things, using an event summarization algorithm. Our approach integrates wearable sensors in the physical world to estimate user location based on metric and landmark localization. Streaming data from the cyber world are employed to provide awareness by summarizing the events around the user based on the situation awareness factor. It is illustrated using disaster and socialization event scenarios. Discovered local events are fed back using sound localization so that the user can actively participate in a social event or get early warning of any hazardous events. A feasibility evaluation of our proposed algorithm included comparing the output of the algorithm to ground truth, a survey with sighted participants about the algorithm output, and a sound localization user interface study with blind-folded sighted participants. Thus, our framework supports the navigation problem for the blind by combining the advantages of our real-time localization technologies so that the user is made aware of the world, a necessity for independent travel.
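The "social semantics" idea above parameterizes an event-of-interest by what, where, who, when, severity, and action, and raises alerts for hazardous events. A minimal sketch of that record and the alert filter, with entirely hypothetical field values and severity scale (the paper does not specify these):

```python
from dataclasses import dataclass

# Hypothetical event record mirroring the paper's social-semantics
# parameters: what, where, who, when, severity, action.
@dataclass
class SocialEvent:
    what: str
    where: str
    who: str
    when: str
    severity: int  # assumed scale: 0 (info) .. 3 (hazard)
    action: str

def alerts(events, min_severity=2):
    """Keep only events severe enough to warrant an alert to the user."""
    return [e for e in events if e.severity >= min_severity]

events = [
    SocialEvent("flooding", "5th Ave", "city alerts", "2015-06-01", 3, "avoid area"),
    SocialEvent("street fair", "Main St", "@localnews", "2015-06-01", 1, "join"),
]
print([e.what for e in alerts(events)])
```

In the full system, events passing this filter would be fed back to the user via sound localization.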

41 citations


Proceedings ArticleDOI
25 Aug 2015
TL;DR: This paper investigates event detection in the context of real-time Twitter streams as observed in real-world crises, and presents a novel approach to address the key challenges: the informal nature of text, and the high-volume and high-velocity characteristics of Twitter streams.
Abstract: The unprecedented use of social media through smartphones and other web-enabled mobile devices has enabled the rapid adoption of platforms like Twitter. Event detection has found many applications on the web, including breaking news identification and summarization. The recent increase in the usage of Twitter during crises has attracted researchers to focus on detecting events in tweets. However, current solutions have focused on static Twitter data. The necessity to detect events in a streaming environment during fast-paced events such as a crisis presents new opportunities and challenges. In this paper, we investigate event detection in the context of real-time Twitter streams as observed in real-world crises. We highlight the key challenges in this problem: the informal nature of text, and the high-volume and high-velocity characteristics of Twitter streams. We present a novel approach to address these challenges using single-pass clustering and the compression distance to efficiently detect events in Twitter streams. Through experiments on large Twitter datasets, we demonstrate that the proposed framework is able to detect events in near real-time and can scale to large and noisy Twitter streams.
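The combination described above — single-pass clustering driven by a compression distance — can be sketched as follows. This is an illustrative reading, not the paper's implementation: the distance here is the standard normalized compression distance via zlib, and the threshold and tweet texts are assumptions.

```python
import zlib

def ncd(x, y):
    """Normalized compression distance: near 0 for similar strings,
    near 1 for unrelated ones."""
    cx = len(zlib.compress(x.encode()))
    cy = len(zlib.compress(y.encode()))
    cxy = len(zlib.compress((x + " " + y).encode()))
    return (cxy - min(cx, cy)) / max(cx, cy)

def single_pass_cluster(tweets, threshold=0.6):
    """Single-pass clustering: each incoming tweet joins the closest
    existing cluster (NCD to the cluster's first tweet) or starts a new
    one. No revisits, so the stream is processed in one pass."""
    clusters = []
    for t in tweets:
        best, best_d = None, threshold
        for c in clusters:
            d = ncd(c[0], t)
            if d < best_d:
                best, best_d = c, d
        if best is not None:
            best.append(t)
        else:
            clusters.append([t])
    return clusters

stream = [
    "breaking earthquake magnitude 6 hits city center buildings damaged",
    "earthquake magnitude 6 hits city center many buildings damaged tonight",
    "new cafe opens downtown serving artisanal coffee and fresh pastries",
]
print(len(single_pass_cluster(stream)))
```

Because compression handles misspellings and word reordering gracefully, NCD copes with the informal text; the single pass addresses the volume and velocity constraints.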

21 citations


Proceedings ArticleDOI
18 May 2015
TL;DR: This paper proposes a framework that uses location data from LBSNs, combining it with map data to associate a set of venue categories with these locations, and shows that this approach improves on the state-of-the-art methods for location prediction.
Abstract: Predicting the next location of a user based on their previous visiting pattern is one of the primary tasks over data from location based social networks (LBSNs) such as Foursquare. Many different aspects of these so-called "check-in" profiles of a user have been used for this task, including spatial and temporal information of check-ins as well as the social network information of the user. Building more sophisticated prediction models by enriching these check-in data with information from other sources is challenging due to the limited data that LBSNs expose because of privacy concerns. In this paper, we propose a framework that uses the location data from LBSNs and combines it with map data to associate a set of venue categories with these locations. For example, if the user is found to be checking in at a mall that has cafes, cinemas and restaurants according to the map, all this information is associated with the check-in. This category information is then leveraged to predict the next check-in location of the user. Our experiments with a publicly available check-in dataset show that this approach improves on the state-of-the-art methods for location prediction.
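The mall example above can be made concrete with a small sketch: map data assigns each venue a category set, a user's history induces a category profile, and candidate next venues are ranked by overlap with that profile. The venue names, categories, and scoring rule are illustrative assumptions, not the paper's model:

```python
from collections import Counter

# Hypothetical map data: venue -> categories found at that location
# (e.g. a mall containing a cafe, a cinema, and restaurants).
VENUE_CATEGORIES = {
    "central_mall": {"cafe", "cinema", "restaurant"},
    "river_cafe": {"cafe"},
    "grand_cinema": {"cinema"},
    "corner_cafe": {"cafe"},
    "city_museum": {"museum"},
}

def category_profile(checkins):
    """Count how often each venue category appears in the user's history."""
    prof = Counter()
    for venue in checkins:
        prof.update(VENUE_CATEGORIES.get(venue, ()))
    return prof

def predict_next(checkins, candidates):
    """Rank candidate venues by overlap with the user's category profile."""
    prof = category_profile(checkins)
    return max(candidates,
               key=lambda v: sum(prof[c] for c in VENUE_CATEGORIES.get(v, ())))

history = ["central_mall", "river_cafe", "river_cafe"]
print(predict_next(history, ["grand_cinema", "corner_cafe", "city_museum"]))
```

A cafe-heavy history steers the prediction toward another cafe even for a venue the user has never visited — the category layer is what generalizes across venues.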

19 citations


Proceedings Article
25 Jul 2015
TL;DR: Using data from the 2012 US presidential elections and the 2013 Philippines General elections, this work provides detailed experiments on methods that use Granger causality to identify topics that were most "causal" for public opinion and which, in turn, give an interpretable insight into the "election topics" that were most important.
Abstract: In recent times, social media has become a popular medium for many election campaigns. It not only allows candidates to reach out to a large section of the electorate, it is also a potent medium for people to express their opinion on the proposed policies and promises of candidates. Analyzing social media data is challenging as the text can be noisy, sparse and even multilingual. In addition, the information may not be completely trustworthy, particularly in the presence of propaganda, promotions and rumors. In this paper we describe our work for analyzing election campaigns using social media data. Using data from the 2012 US presidential elections and the 2013 Philippines General elections, we provide detailed experiments on our methods that use Granger causality to identify topics that were most "causal" for public opinion and which, in turn, give an interpretable insight into the "election topics" that were most important. Our system was deployed by the largest media organization in the Philippines during the 2013 General elections and, using our work, the media house was able to identify and report news stories much faster than competitors and reported higher TRP ratings during the election.
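The Granger-causality test underlying this analysis asks whether past values of a topic's time series improve the prediction of an opinion series beyond what the opinion's own past explains. A self-contained sketch of that idea (a one-lag model with a plain least-squares fit; the series values are made up, and the paper's actual lag structure and significance testing are not reproduced here):

```python
def ols_rss(X, y):
    """Least-squares fit via the normal equations (Gaussian elimination
    with partial pivoting); returns the residual sum of squares."""
    n, k = len(X), len(X[0])
    A = [[sum(X[i][p] * X[i][q] for i in range(n)) for q in range(k)] for p in range(k)]
    c = [sum(X[i][p] * y[i] for i in range(n)) for p in range(k)]
    for p in range(k):
        piv = max(range(p, k), key=lambda r: abs(A[r][p]))
        A[p], A[piv] = A[piv], A[p]
        c[p], c[piv] = c[piv], c[p]
        for r in range(p + 1, k):
            f = A[r][p] / A[p][p]
            for q in range(p, k):
                A[r][q] -= f * A[p][q]
            c[r] -= f * c[p]
    b = [0.0] * k
    for p in range(k - 1, -1, -1):
        b[p] = (c[p] - sum(A[p][q] * b[q] for q in range(p + 1, k))) / A[p][p]
    return sum((y[i] - sum(X[i][q] * b[q] for q in range(k))) ** 2 for i in range(n))

def granger_gain(topic, opinion):
    """Fraction of residual error removed when the lagged topic series is
    added to a one-lag autoregression of the opinion series; larger means
    the topic is more 'causal' for opinion in the Granger sense."""
    y = opinion[1:]
    restricted = [[1.0, opinion[t]] for t in range(len(opinion) - 1)]
    full = [[1.0, opinion[t], topic[t]] for t in range(len(opinion) - 1)]
    rss_r, rss_f = ols_rss(restricted, y), ols_rss(full, y)
    return (rss_r - rss_f) / rss_r

topic = [1, 0, 2, 0, 3, 1, 0, 2, 1, 3, 0, 2]          # topic volume per day
opinion = [0] + topic[:-1]                             # opinion tracks topic with a lag
print(round(granger_gain(topic, opinion), 2))
```

In a real deployment the gain would be converted into an F-statistic and tested for significance; topics with the strongest significant gains are the "election topics" reported above.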

14 citations


Book ChapterDOI
29 Mar 2015
TL;DR: This work proposes a three-phase method for linking web search queries to Wikipedia entities, using an IR-style scoring of entities against the search query to narrow down to a subset of entities that is expanded using hyperlink information in the second phase to a larger set.
Abstract: We consider the problem of linking web search queries to entities from a knowledge base such as Wikipedia. Such linking enables converting a user’s web search session to a footprint in the knowledge base that could be used to enrich the user profile. Traditional methods for entity linking have been directed towards finding entity mentions in text documents such as news reports, each of which is possibly linked to multiple entities, enabling the usage of measures like entity set coherence. Since web search queries are very small text fragments, such criteria that rely on the existence of a multitude of mentions do not work well on them. We propose a three-phase method for linking web search queries to Wikipedia entities. The first phase does IR-style scoring of entities against the search query to narrow down to a subset of entities, which is expanded using hyperlink information in the second phase to a larger set. Lastly, we use a graph traversal approach to identify the top entities to link the query to. Through an empirical evaluation on real-world web search queries, we illustrate that our methods significantly enhance the linking accuracy over state-of-the-art methods.
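The three phases can be sketched end-to-end on toy data. The entity names, term bags, hyperlink structure, and in-link scoring rule below are all illustrative assumptions — in particular, the third phase here is a crude stand-in for the paper's graph traversal:

```python
LINKS = {  # hypothetical entity -> entities it hyperlinks to
    "Apple_Inc.": ["Steve_Jobs", "IPhone"],
    "Apple": ["Fruit"],
    "Steve_Jobs": ["Apple_Inc."],
    "IPhone": ["Apple_Inc."],
    "Fruit": [],
}
TERMS = {  # hypothetical entity -> bag of description terms
    "Apple_Inc.": {"apple", "technology", "company"},
    "Apple": {"apple", "fruit", "tree"},
    "Steve_Jobs": {"apple", "founder"},
    "IPhone": {"apple", "phone"},
    "Fruit": {"fruit", "food"},
}

def phase1_ir(query, k=2):
    """IR-style scoring: rank entities by term overlap with the query,
    keep the top-k as the seed set."""
    q = set(query.lower().split())
    return sorted(TERMS, key=lambda e: -len(q & TERMS[e]))[:k]

def phase2_expand(seeds):
    """Grow the seed set along hyperlinks to recover related entities."""
    return set(seeds) | {t for s in seeds for t in LINKS.get(s, ())}

def phase3_traverse(candidates, top=1):
    """Score each candidate by in-links received from other candidates
    and return the top entities to link the query to."""
    score = {c: sum(c in LINKS.get(o, ()) for o in candidates) for c in candidates}
    return sorted(candidates, key=lambda c: -score[c])[:top]

print(phase3_traverse(phase2_expand(phase1_ir("apple phone company"))))
```

The expansion phase is what compensates for the query's brevity: even if the correct entity scores poorly against the two or three query terms, it can re-enter through hyperlinks from the seeds and win in the graph phase.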

5 citations


Proceedings ArticleDOI
17 Oct 2015
TL;DR: A framework, NCFinder, discovers top-k consistent news-casters directly from Twitter, using news headlines published in online news sources to periodically collect authentic news-tweets, and employs the HITS algorithm on a tripartite graph to score the news-casters on a daily basis.
Abstract: News-casters are Twitter users who periodically pick up interesting news from online news media and spread it to their followers' network. Existing works on Twitter user analysis have only analysed a pre-defined set of users for user modeling, influence analysis and news recommendation. The problem of identifying prominent, trustworthy and consistent news-casters is unaddressed so far. In this paper, we present a framework, NCFinder, to discover top-k consistent news-casters directly from Twitter. NCFinder uses news headlines published in online news sources to periodically collect authentic news-tweets and processes them to discover news-casters, news sources and news concepts. Next, NCFinder builds a tripartite graph among news-casters, news sources and news concepts and employs the HITS algorithm on it to score the news-casters on a daily basis. The daily score profiles of the news-casters collected over a time period are then used to infer the top-k consistent news-casters. We ran NCFinder from 11 Nov. to 24 Nov. 2014 and discovered the top-100 consistent news-casters and their profile information.
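The daily scoring step can be sketched with a plain HITS iteration: news-casters act as hubs pointing at the news sources and news concepts (the authorities) appearing in their tweets. The edges below are invented for illustration, and this flattens the tripartite structure into a single hub/authority split, which is a simplification of NCFinder's graph:

```python
def hits_scores(edges, iters=20):
    """HITS hub scores: news-casters are hubs, the news sources and
    concepts they tweet about are authorities. Scores are L2-normalized
    each iteration until they stabilize."""
    hubs = {u: 1.0 for u in edges}
    auth = {v: 1.0 for vs in edges.values() for v in vs}
    for _ in range(iters):
        auth = {v: sum(hubs[u] for u in edges if v in edges[u]) for v in auth}
        norm = sum(a * a for a in auth.values()) ** 0.5
        auth = {v: a / norm for v, a in auth.items()}
        hubs = {u: sum(auth[v] for v in edges[u]) for u in edges}
        norm = sum(h * h for h in hubs.values()) ** 0.5
        hubs = {u: h / norm for u, h in hubs.items()}
    return hubs

day_edges = {  # hypothetical caster -> {sources, concepts} for one day
    "@caster_a": {"bbc.com", "election"},
    "@caster_b": {"bbc.com", "cnn.com", "election"},
    "@caster_c": {"obscure-blog"},
}
scores = hits_scores(day_edges)
print(max(scores, key=scores.get))
```

Repeating this per day yields the daily score profiles; consistency is then a matter of which casters stay highly ranked across the whole time period rather than spiking once.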

2 citations


Patent
Sameep Mehta, Deepak Padmanabhan
24 Sep 2015
TL;DR: In this article, a computer-implemented method for updating annotator collections using run traces is described, which includes generating one or more alternate versions of annotators selected from a set of multiple document annotators; and outputting an instruction to modify, based on the generated log information for each annotator in the set and each alternate version, at least one document annotator from the set.
Abstract: Methods, systems, and computer program products for updating annotator collections using run traces are provided herein. A computer-implemented method includes generating one or more alternate versions of one or more document annotators selected from a set of multiple document annotators; executing, on one or more document data sets, (i) one or more document annotators from the set of multiple document annotators and (ii) the one or more alternate versions to generate log information for each document annotator in the set and each alternate version of the one or more alternate versions; and outputting an instruction to modify, based on the generated log information for each document annotator in the set and each alternate version, at least one document annotator from the set with at least one alternate version from the one or more alternate versions.
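The patented flow — run a base annotator and its alternate versions over document sets, compare the collected run traces, and emit an instruction to modify the collection — can be sketched as below. This is an interpretive toy: the patent compares log information, whereas this sketch scores traces against a reference output, and every annotator, document, and label here is invented:

```python
def run(annotator, docs):
    """Execute an annotator over the documents, collecting a simple run
    trace (here: the annotation it produces per document)."""
    return [annotator(d) for d in docs]

def pick_replacement(base, alternates, docs, reference):
    """Compare the run traces of the base annotator and its alternate
    versions; output an instruction to swap in an alternate if its trace
    matches the reference output more often."""
    def hits(trace):
        return sum(t == r for t, r in zip(trace, reference))
    base_hits = hits(run(base, docs))
    best = max(alternates, key=lambda a: hits(run(a, docs)))
    if hits(run(best, docs)) > base_hits:
        return f"replace base annotator with {best.__name__}"
    return "keep base annotator"

# Hypothetical annotators: tag whether a document mentions a date.
def base(d):
    return "DATE" if "2015" in d else "NONE"

def alt_broader(d):
    return "DATE" if any(ch.isdigit() for ch in d) else "NONE"

docs = ["filed 24 Sep 2015", "filed 24/09/15", "no dates here"]
reference = ["DATE", "DATE", "NONE"]
print(pick_replacement(base, [alt_broader], docs, reference))
```

The broader alternate catches the `24/09/15` form the base misses, so the traces justify the swap instruction.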

1 citation


Proceedings ArticleDOI
23 Apr 2015
TL;DR: This paper describes a system that integrates data about movies from various sources across the web and populates the TITAN graph database, showing that complex information can be retrieved with simple queries in Gremlin, a graph query language.
Abstract: The development of the Internet in recent years has made it possible to access different information systems anywhere in the world. Information Integration is the merging of information from heterogeneous sources with differing conceptual, contextual and typographical representations. In this paper, we exploit Information Integration techniques for movie data from different sources over the web. Graphs are used to model many complex data objects and their relationships in the real world. In recent years, graphs have become increasingly popular in a variety of domains ranging from Biology, Chemistry, Healthcare systems and computer vision to Business Intelligence and Social Media Analytics. We have developed a system that integrates data about movies from various sources across the web and populates the TITAN graph database. This enables us to show that complex information can be retrieved using simple queries in Gremlin, a graph query language.
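The integration step — merging per-movie records from heterogeneous sources into one graph-like structure that simple traversals can query — can be sketched in plain Python. The source records are invented, and the dictionary stands in for the TITAN property graph the paper actually populates:

```python
# Hypothetical per-title records from two heterogeneous web sources.
source_a = {"Inception": {"director": "Christopher Nolan", "year": 2010}}
source_b = {"Inception": {"actors": ["Leonardo DiCaprio"]}}

def integrate(*sources):
    """Merge per-title records from heterogeneous sources into a single
    property-graph-like structure: movie vertex -> labelled properties."""
    graph = {}
    for src in sources:
        for title, props in src.items():
            graph.setdefault(title, {}).update(props)
    return graph

graph = integrate(source_a, source_b)
# A Gremlin-style traversal such as
#   g.V().has("title", "Inception").values("director")
# reduces here to a lookup on the merged structure:
print(graph["Inception"]["director"])
```

The point of the paper's graph model is exactly this: once sources are merged under shared vertices, queries that would need multi-source joins become short traversals.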

1 citation

