scispace - formally typeset
Search or ask a question
Journal ArticleDOI

A Case Study of Collaboration and Reputation in Social Web Search

TL;DR: This article proposes a reputation model for HeyStaks users that utilise the implicit collaboration events that take place between users as recommendations are made and selected and indicates that incorporating reputation into the recommendation process further improves the relevance of HeyStak recommendations by up to 40%.
Abstract: Although collaborative searching is not supported by mainstream search engines, recent research has highlighted the inherently collaborative nature of many Web search tasks. In this article, we describe HeyStaks, a collaborative Web search framework that is designed to complement mainstream search engines. At search time, HeyStaks learns from the search activities of other users and leverages this information to generate recommendations based on results that others have found relevant for similar searches. The key contribution of this article is to extend the HeyStaks social search model by considering the search expertise, or reputation, of HeyStaks users and using this information to enhance the result recommendation process. In particular, we propose a reputation model for HeyStaks users that utilise the implicit collaboration events that take place between users as recommendations are made and selected. We describe a live-user trial of HeyStaks that demonstrates the relevance of its core recommendations and the ability of the reputation model to further improve recommendation quality. Our findings indicate that incorporating reputation into the recommendation process further improves the relevance of HeyStaks recommendations by up to 40p.

Summary (9 min read)

1. INTRODUCTION

  • The scale of the Web and the heterogeneous nature of its content [Signorini and Gulli 2005] introduces many significant information discovery challenges.
  • Part of the problem rests with the searchers themselves: with an average of only 2-3 terms [Lawrence and Giles 1998; Spink and Jansen 2004], the typical Web search query is often vague with respect to the searcher’s true intentions or information needs [Song et al. 2007].
  • Moreover, searchers sometimes choose query terms that are not well represented in the page that they are seeking and so simply increasing the length of queries will not necessarily improve search performance.
  • This paper focuses on the second idea, that of collaboration.

2 · McNally et al.

  • The traditional world of web search and the information sharing world of social networks, also known as discovery worlds.
  • Only a few years ago, by and large, the majority of people located information of interest through their favourite mainstream search engine.
  • This shift in their information discovery habits has lead to an explosion in the number and variety of new social-search type services — all of which can influence their information discovery activities, bringing the world of web search and social networks even closer together (see Figure 1).
  • A key contribution of this paper is a detailed description of a recent live-user trial of HeyStaks in order to understand the usage and collaboration patterns of users and also the quality of HeyStaks’ social recommendations relative to the organic results of mainstream search engines.

2. BACKGROUND

  • This paper focuses on discussing HeyStaks as a collaborative information retrieval technology, augmented by a reputation system based on the collaborations that implicitly take place between searchers in the HeyStaks social search utility.
  • As such this background section covers recent, relevant work in the two broad areas of collaborative information retrieval and reputation systems.

2.1 Collaborative Information Retrieval

  • Approaches to collaborative information retrieval can be usefully distinguished in terms of two important dimensions, time — synchronous versus asynchronous search — and place — that is, co-located versus remote searchers.
  • Co-located systems offer a collaborative search experience for multiple searchers at a single location, typically sharing a single PC [Amershi and Morris 2008; Smeaton et al. 2008], whereas remote approaches allow searchers to perform their searches at different locations across multiple devices [Morris and Horvitz 2007a; 2007b; Smyth et al. 2009b].
  • The former enjoy the obvious benefit of an increased faculty for direct ACM Transactions on Intelligent Systems and Technology, Vol. 2, No. 3, 09 2001.

4 · McNally et al.

  • Collaboration that is enabled by the face-to-face nature of co-located search, while the latter offer a greater opportunity for collaborative search.
  • Once again, preliminary studies speak to the potential for such an approach to improve overall search productivity and collaboration, at least in specific types of information access tasks.
  • The iBingo system allows a group of users to collaborate on an image search task with each user using a iPod touch device as their primary search/feedback device (although conventional PCs appear to be just as applicable).
  • SearchTogether supports synchronous collaborative search by allowing searchers to invite others to join in specific search tasks, allowing cooperating searchers to synchronously view the results of each others’ searches via a split-screen style results interface.
  • Work by Pickens et al. [2008] describes an approach to collaborative search that is more tightly integrated with the underlying search engine resource so that the operation of the search engine is itself influenced by the activities of collaborating searchers.

2.2 Reputation Systems

  • Recently there has been considerable interest in reputation systems to provide a mechanism to evaluate user reputation and inter-user trust across a growing number of social web and e-commerce applications [Jøsang and Golbeck 2009; O’Donovan and Smyth 2005; 2006; Sabater and Sierra 2005; Resnick and Zeckhauser 2002; Resnick et al. 2000].
  • This work is, in part, motivated by the idea that an understanding of user reputation can serve as the basis for strategies to guard against malicious users [Lazzari 2010; Hoffman et al.
  • Here, the authors present a brief review of the work that has been undertaken in this regard.
  • Jøsang et al. [2007] confirms this, stating such systems require manual curation and protection from malicious users.
  • Unlike conventional reputation systems like eBay’s, reputation is not calculated by examining feedback received directly from users.

6 · McNally et al.

  • Collaborative filtering algorithm is modified to add a user-user trust score to compliment the normal profile or item-based similarity score, so that recommendation partners are chosen from those users that are not only similar to the target user, but who have also had a positive recommendation history with that user.
  • Using this metric average prediction error is improved by 22%.
  • Similar to O’Donovan and Smyth [2005], Massa and Avesani [2007] propose a reputation algorithm called MoleTrust that can be used to augment an existing collaborative filtering system.
  • Other recent research has examined reputation systems employed in social networking platforms.
  • Applying reputation globally affords malicious users influence over the entire system, which adds to its vulnerability.

3. HEYSTAKS: A SOCIAL SEARCH UTILITY

  • In designing HeyStaks their primary goal is to provide social Web search enhancements, while at the same time allowing searchers to continue to use their favourite search engine.
  • First, it allows users to create search staks, as a type of folder for their search experiences at search time, and the creator can invite initial members by providing their email addresses.
  • As shown in Figure 2, HeyStaks takes the form of two basic components: a clientside browser toolbar and a back-end server.
  • A Case Study of Collaboration and Reputation in Social Web Search · 7 list.
  • In the following sections the authors review how HeyStaks captures search activities within search staks and how this search knowledge is used to generate and filter result recommendations at search time; more detailed technical details can be found in [Smyth et al. 2009a; 2009b].

3.1 Profiling Stak Pages

  • In HeyStaks each search stak (S) serves as a profile of the search activities of the stak members.
  • A number of primary actions are facilitated, for example: —Selections (or Click-thrus) – that is, a user selects a search result (whether organic or recommended).
  • It is also a weak indicator of relevance because users will frequently select pages that turn out to be ACM Transactions on Intelligent Systems and Technology, Vol. 2, No. 3, 09 2001.
  • (1) In this way, each result page is associated with a set of term data (query terms and/or tag terms) and a set of usage data (the selection, tag, share, and voting count).
  • At search time, recommendations are produced in a number of stages: first, relevant results are retrieved and ranked from the stak index; next, these recommendation candidates are filtered based on the usage evidence to eliminate noisy recommendations; and, finally, the remaining results are added to the Google result-list according to a set of recommendation rules.

3.2 Retrieval & Ranking

  • Briefly, there are two types of recommendation candidates: primary recommendations are results that come from the active stak St; whereas secondary recommendations come from other staks in the searcher’s stak-list.
  • To generate these recommendation candidates, the HeyStaks server uses the current query qt as a probe into each stak index, Si, to identify a set of relevant stak results R(Si, qt).
  • Each candidate result, r, is assigned a relevance score using a TF*IDF -based retrieval function as per Equation 2, which serves as the basis for an initial recommendation ranking.
  • (2) Staks are inevitably noisy, in the sense that they will frequently contain results that are not on topic.
  • The precise details of this model are beyond the scope of this paper but suffice it to say that any results which do not meet the necessary evidence thresholds are eliminated from further consideration; further detail can be found in [Smyth et al. 2009a; 2009b].

3.3 Summary Discussion

  • HeyStaks is designed to help users to collaborate during Web search tasks and, importantly, it succeeds in integrating collaborative recommendation techniques with mainstream search engines.
  • In the next section the authors introduce their user reputation model, which is based on the collaboration events that inherently occur between users who share their search experiences.
  • In turn, the authors show how this model can be employed to further enhance the quality of recommendations provided by HeyStaks by using reputation to influence the ranking of recommended results.

4.1 From Activities to Reputation

  • It seems natural that the reputation of searchers should be linked to the search knowledge that they contribute to HeyStaks.
  • Each activity on the part of users causes the creation of new search knowledge.
  • If the target page is new to a stak, then its selection, sharing, voting, or tagging will cause it to be added to the stak for the first time.
  • Under the heading of “more search knowledge is better than less search knowledge” it might make sense to model reputation as a direct function of the sheer volume of activity that a given searcher engages in.
  • On the contrary, one of the major concerns in any social recommender is the potential ACM Transactions on Intelligent Systems and Technology, Vol. 2, No. 3, 09 2001.

4.2 Reputation as Collaboration

  • The long-term value of HeyStaks as a social search service depends critically on the ability of users to benefit from its quality search knowledge and if, for example, all of the best search experiences are tied up in private staks and never shared, then this long-term value will be greatly diminished.
  • The key idea is that, ultimately, the quality of shared search knowledge can be estimated by looking at the frequency of search collaborations within HeyStaks.
  • In other words, the producer created search knowledge that was deemed to be relevant enough to be recommended and useful enough for the consumer to act upon it.
  • This collaboration-based model of reputation is incentivizing users not just to create search knowledge of high quality but also to share it with others.

4.3 A Computational Model of Reputation

  • The conferral of reputation by a single consumer on a single producer (Figure 4(a)) is the simplest case of their reputation model.
  • A specific producer may have been the first to select the result in a given stak, but subsequent users may have selected it for different queries, or they may have voted on it or tagged it or shared it with others independently of its other producers.
  • In this way reputation is shared equally among its k contributing producers; see Figure 5 for an example of how user reputation can evolve over time.
  • Given the formulation of the reputation model, some protection against malicious activity is inherently provided because users only benefit if their results are recommended and selected by other users.
  • The problem is that the current reputation model distributes reputation equally among all producers.

14 · McNally et al.

  • That a producer should receive more reputation if many of their past contributions have been consumed by other users but the should receive less reputation if most of their contributions have not been consumed.
  • More formally, for a producer pi, let nt(pi, t− 1) be the total number of distinct results that this user has added to the stak in question prior to time t; remember that pi refers to a single user and a specific stak.
  • Further, let nr(pi, t − 1) be the number of these results that have been subsequently recommended and consumed by other users.
  • Accordingly, if a producer has a high consumption ratio it means that many of their contributions have been consumed by other users, suggesting that the producer has consistently added useful content to the stak.

4.4 Reputation and Result Recommendation

  • In the previous sections the authors have described a reputation model for users.
  • In this section the authors describe how this reputation information can be used to produce better recommendations at search time.
  • One option is to simply add the reputation scores of the producers.
  • In their work the authors have found a third option to work best.
  • Thus, as long as at least some of the producers are considered reputable then this result will receive a high reputation score, even if many of the producers have low reputation scores.

5. EVALUATION

  • Live-user trial of HeyStaks, designed to evaluate the utility of HeyStaks’ brand of collaborative search in factfinding, information discovery tasks.the authors.
  • In addition the authors also have the opportunity to evaluate the potential benefits of their new reputation model when it comes to boosting the relevance of HeyStaks’ default promotions.
  • It is worth highlighting that this present evaluation complements earlier evaluations of HeyStaks such as that carried out by Smyth et al. [2009b].
  • These earlier evaluations had the benefit of being open-ended trials, following users during routine search tasks, but were limited in their ability to evaluate the relevance of HeyStaks recommendations.
  • Instead, these earlier evaluations reported on typical usage by HeyStaks users, focusing on stak creation and sharing behaviour.

5.1 Dataset and Methodology

  • The authors experiment involves 64 first-year undergraduate university students with varying degrees of search expertise.
  • It was highly unlikely that students would be able to answer any significant number of these questions from their own general knowledge and so the purpose of this experiment was to look at how the students used HeyStaks and Google to help them answer these questions.
  • The solitary staks served as a straightforward benchmark to evaluate the search effectiveness of individual users on a non-collaborative search setting, whereas the different sizes of shared staks provided an opportunity to examine the effectiveness of collaborative search across a range of different group sizes.
  • During the 60 minute trial a total of 3,124 queries and 1,998 result activities (selections, tagging, voting, popouts) were logged, and 724 unique results were selected.

18 · McNally et al.

  • —not relevant (i.e. the result page content had no relevance with respect to a question); —partially relevant (i.e. the result page contains an implicit reference to the answer or to a part of the answer to a question); —relevant (i.e. the result page contains an answer to a question).
  • Figure 6(b) shows a relevance breakdown of the result pages logged during the course of the trial.
  • 66% of result pages acted on were categorised as being not relevant with respect to the questions posed, while only 14% were deemed relevant.
  • These findings demonstrate the difficulty of the questions presented as mentioned above.
  • The authors will return to this relevance information later in this section when they use it to evaluate the relevance of HeyStaks recommendations.

5.2 Research Questions

  • Using this trial data the authors can explore a number of important questions pertaining to the benefits, or otherwise, of social web search and the value of reputation during result recommendation.
  • To answer this question the authors can look at the outcome of their quiz as the core search task.

5.3 Quiz Performance

  • To begin with it is worth looking at the overall performance of students during the quiz as a basic outcome measure for this search task.
  • These results point to benefit of the sharing and collaboration during this search task.
  • By comparison, the median values across shared staks are between 5.5 and 8 questions attempted and between 4 to 7 questions correctly answered.
  • In general the influence of stak size is less clear in terms of these measures of overall performance.
  • It is likely that the search expertise of individual users is playing a role here and as such a simple measure such as stak size is unlikely to be a powerful predictor of overall performance given the variation in expertise that likely exists between between the individual members of a stak.

5.4 Search Queries & Result Activities

  • The authors have presented evidence above to show how the members of their shared staks perform better than solitary searchers in their search task.
  • The authors key hypothesis is that this is due, at least in part, to the benefits of the type of search collaboration that HeyStaks is designed to facilitate.
  • Specifically, the authors posit that the members of shared staks will benefit from relevant results, promoted due to the activities of other stak members, results that might otherwise be difficult to find.
  • The authors will look in more detail at these promotions in the next section but first it is useful to look at the level of granular search activity across the different search staks.

20 · McNally et al.

  • The authors can view the number of queries submitted by a searcher as a proxy for their search effort and the number of activities (result selections, tagging, etc.) they generate to be an indicator of relevance for the results returned for these queries.
  • Now the authors can see a very significant difference between the activities per query for the solitary searchers (approximately 0.4 activities per query) and the collaborating searchers in the shared staks (approximately 0.6 – 0.8 activities per query).
  • A Case Study of Collaboration and Reputation in Social Web Search · 21 significantly from results that are, apparently at least, more relevant than those experienced by the latter.
  • In the case of the former, on average they correctly answer 0.044 questions per query, but for the latter this ratio increases to 0.15.

5.5 Recommendations & Relevance

  • Given that the members of shared search staks seem to be enjoying improved search productivity when compared to their solitary counterparts, the authors now turn their attention to likely source of this improvement: the recommendations that are generated by HeyStaks.
  • To begin with it is worth looking at how often HeyStaks is able to recommend results to the members of the different staks.
  • This is presented in Figure 10 as the percentage of queries that result in at least one HeyStaks recommendation.
  • As expected, larger staks mean more recommendations, because there are more search experiences to act as a source of recommendations.
  • For the solitary staks, the authors find that only 16% of the queries lead to recommendations, and, by definition, these recommendations are due to the solitary searcher submitting queries that are similar to those they have used previously.

22 · McNally et al.

  • The 5-person stak, nearly 40% of queries lead to recommendations, growing to over 62% for the largest 25-person stak.
  • Comparing the graphs for the recommended results versus the organic results the authors can see a significant relevance benefit for the former.
  • Similarly, the authors find that, on average, 41% of the organic result activities are for not relevant results compared to only 21% for the recommended result activities.
  • Relevance ratio = ar anr (9) To better quantify this relevance benefit the authors can compute a relevance ratio for organic and recommended results as per Equation 9.
  • This stak is the best performer (e.g. more questions answered correctly per user), most likely because its members are better searchers to begin with.

5.6 Searcher Reputation

  • The results of the previous section highlight the potential benefits of the HeyStaks form of collaborative web search in the context of the target search task.
  • Recommended results turned out to be significantly more relevant, according to their independent relevance metric, than conventional organic results.
  • Even absent overtly malicious users, recommendation quality can degrade if prolific, but inexperienced, searchers contribute large quantities of irrelevant results to a stak.
  • Clearly there is a diverse range of reputation scores across all of these users.
  • Figure 14 plots the reputation score of a user versus the number of distinct results contributed to collaboration events by that user.

26 · McNally et al.

  • Members of the 9-person stak, which has a very similar median reputation score (14.5) despite having ten fewer members compared to the 19-person stak (14.9).
  • The authors know from their earlier performance results that the users in the 9-person stak perform particularly well, both in terms of their quiz performance (e.g. median questions correct per queries submitted) and the relevance of their search results.
  • It is also interesting to look at how reputation builds during the course of the trial.
  • To examine this the authors note the number of users with non-zero reputation score at 5-minute intervals during the trial; they do this retrospectively by analysing the collaboration logs.
  • The authors see a consistent reputation profile across the 4 staks with reputation beginning to accumulate from an early stage, albeit more slowly, as expected, for the 5-person stak.

5.7 Reputation for Recommendation Ranking

  • As discussed previously the motivation for incorporating a reputation model into the HeyStaks recommendation engine is to provide a way for searcher expertise to influence recommendation.
  • In principle, by increasing the reputation threshold in this way the authors should experience an improvement in recommendation quality, but at the same time it will reduce recommendation coverage — the number of recommendations that can be made — because none of the recommendations for certain queries will exceed the threshold.
  • The results of this experiment are presented in Figure 17(b) as the relative benefit (percentage increase in relevance ratio) of reputation-based ranking, in comparison to the default HeyStaks recommendation ranking, for different values of the reputation weight (w), from 0 to 1, and for 3 different reputation thresholds (0, 0.3, and 0.5).
  • As the reputation weight is increased, initially the authors see a rapid increase in its relative benefit score but as the reputation weight exceeds 0.6 they see relative benefit fall back as it begins to over-influence the recommendation rankings.
  • At least in this experiment, most likely because of the limits ACM Transactions on Intelligent Systems and Technology, Vol. 2, No. 3, 09 2001.

5.8 Limitations & Results Summary

  • In this evaluation the authors have described the results of a live-user trial of HeyStaks.
  • It is, for example, just one of the many reasons why users avail of search engines and there is clearly an opportunity for further work in order to broaden their evaluation to cover more open-ended search and discovery tasks; preliminary results for these open-ended style evaluations have been presented elsewhere in Smyth et al. [2009b].
  • Given these trial limitations, the outcome of their evaluation has been very positive.
  • These recommendations effectively amplified the relevance of results selected by search leaders and benefitted search followers accordingly.
  • Keane et al. [2008]) and, as such, it is generally accepted that if one can produce rankings where top-ranked results are more relevant, then these rankings are likely to meet with a better user response.

6. CONCLUSIONS

  • Many of their information needs are being met by sharing through social networks as much as they are through queries to search engines.
  • As web search evolves there is a significant opportunity for search engines ACM Transactions on Intelligent Systems and Technology, Vol. 2, No. 3, 09 2001.

Did you find this useful? Give us your feedback

Figures (17)

Content maybe subject to copyright    Report

Provided by the author(s) and University College Dublin Library in accordance with publisher
policies. Please cite the published version when available.
Title A Case Study of Collaboration and Reputation in Social Web Search.
Authors(s) McNally, Kevin; O'Mahony, Michael P.; Coyle, Maurice; Briggs, Peter; Smyth, Barry
Publication date 2011-10
Publication information ACM Transactions on Intelligent Systems and Technology, 3 (1):
Publisher ACM
Item record/more information http://hdl.handle.net/10197/3913
Publisher's statement © ACM, 2011 This is the author's version of the work. It is posted here by permission of
ACM for your personal use. Not for redistribution. The definitive version was published in
ACM Transactions on Intelligent Systems and Technology (TIST), Volume 3 Issue 1,
October 2011 http://dx.doi.org/10.1145/2036264.2036268
Publisher's version (DOI) 10.1145/2036264.2036268
Downloaded 2022-08-09T21:48:23Z
The UCD community has made this article openly available. Please share how this access
benefits you. Your story matters! (@ucd_oa)
© Some rights reserved. For more information, please see the item record link above.

A Case Study of Collaboration and Reputation in
Social Web Search
KEVIN MCNALLY
MICHAEL P. O’MAHONY
BARRY SMYTH
PETER BRIGGS
and
MAURICE COYLE
CLARITY: Centre For Web Sensor Technologies, University College Dublin,
Dublin, Ireland
1. INTRODUCTION
The scale of the Web and the heterogeneous nature of its content [Signorini and
Gulli 2005] introduces many significant information discovery challenges. For all
of the recent developments in search engine technologies, modern search engines
continue to struggle when it comes to providing users with fast and efficient access
to information. For example, recent studies have highlighted how even today’s
leading search engines fail to satisfy 50% of user queries [Smyth et al. 2005]. Part
of the problem rests with the searchers themselves: with an average of only 2-3
terms [Lawrence and Giles 1998; Spink and Jansen 2004], the typical Web search
query is often vague with respect to the searcher’s true intentions or information
needs [Song et al. 2007]. Moreover, searchers sometimes choose query terms that
are not well represented in the page that they are seeking and so simply increasing
the length of queries will not necessarily improve search performance.
Two promising and powerful new ideas in web search are personalization and
collaboration. Personalization questions the one-size-fits-all nature of mainstream
web search two different users with the same query will receive the same result-
list, despite their different preferences and argues that web search needs to
become more personalized so that the implicit needs and preferences of searchers
can be accommodated [Chang et al. 2000; Chirita et al. 2004; Granka et al. 2004;
Speretta and Gauch 2005; Asnicar and Tasso 1997; Ma et al. 2007; Makris et al.
2007; Zhou et al. 2006; Chirita et al. 2005; Pretschner and Gauch 1999; Shen et al.
2005; Budzik and Hammond 2000; Finkelstein et al. 2001].
This paper focuses on the second idea, that of collaboration. In the main, web
search takes the form of an isolated interaction between lone searcher and search
engine. Recently, however, there has been considerable interest in the potential
for web search to evolve to become a more social activity [Morris et al. 2010;
Golovchinsky et al. 2009; Evans et al. 2010; Evans and Chi 2009], whereby the
search efforts of a user might be influenced by their social graph or the searches of
others, potentially leading to a more collaborative model of search. In the broadest
sense the idea of social search is one that tries to unify two distinctive information
ACM Transactions on Intelligent Systems and Technology, Vol. 2, No. 3, 09 2001, Pages 111–0??.

2 · McNally et al.
discovery worlds: the traditional world of web search and the information sharing
world of social networks. Only a few years ago, by and large, the majority of people
located information of interest through their favourite mainstream search engine.
But recently there has been a very noticeable change in how many web users satisfy
their information needs. For example, recent statistics from Twitter claim that its
users are explicitly searching tweet content 24 billion times per month
1
as compared
to approximately 88 billion queries per month for Google. Similarly, at the time
of writing, Facebook’s own statistics highlight how its users are sharing upwards
of 30 billion items of content every month.
2
Many of these items of content would
have previously been located through mainstream search engines. Instead, today,
they are being accessed via our social networks and, in terms of raw volume of
information seeking activity, the social networks are now beginning to compete
with mainstream search engines.
This shift in our information discovery habits has lead to an explosion in the
number and variety of new social-search type services all of which can influence
our information discovery activities, bringing the world of web search and social
networks even closer together (see Figure 1). In this context, social search can mean
many things to many people. For some, social search is all about searching the real-
time web (blogs and micro-blogs) `a la the likes of InfoAxe, OneRiot, and Topsy.
For others, social search is about indexing and filtering web content according to
the online activities or opinions of users; see, for example, Mahalo (curated search
categories), Scour (content indexed and filtered by real-time conversations) or the
now-ended Wikia Search. For yet others, social search is about social bookmarking
services (e.g. Delicious, XMarks, Twine), people search (e.g. Pipl, Nayms, Spock),
or social news services (e.g., Digg, Reddit, Mixx).
Our aim is to make mainstream search engines more collaborative and to help
people during routine search tasks by harnessing the recent search experiences of
their friends and colleagues via their social networks. The focus of this paper is the
HeyStaks search service (www.heystaks.com), which adds a layer of collaboration on
top of mainstream search engines: so users continue to search as normal but benefit
from a more collaborative/social search experience. The core HeyStaks system has
been described in detail elsewhere [Smyth et al. 2009a; 2009b] and so we shall only
review the HeyStaks approach in this paper. However, a key contribution of this
paper is a detailed description of a recent live-user trial of HeyStaks in order to
understand the usage and collaboration patterns of users and also the quality of
HeyStaks’ social recommendations relative to the organic results of mainstream
search engines. In addition, a second contribution of this paper is a novel enhanced
reputation model for HeyStaks, which has been developed in order to evaluate the
reputation of individual searchers based on their search contributions. We go on to
demonstrate how this reputation model can be used to further improve the quality
of the HeyStaks recommendations, by prioritising those that originate from more
reputable users.
1
http://www.boygeniusreport.com/2010/07/07/twitter-handling-24-billion-search-queries-per-
month/
2
http://www.facebook.com/press/info.php?statistics
ACM Transactions on Intelligent Systems and Technology, Vol. 2, No. 3, 09 2001.

A Case Study of Collaboration and Reputation in Social Web Search · 3
Fig. 1. Social search attempts to bridge the traditional, query-based world of web search with
the information sharing world of social networks. A variety of social search and sharing services
have emerged to help users harness their social networks in pursuit of more effective information
discovery across a variety of application contexts. This figure lists a number of well-known services,
both start-up and more mature, that have emerged to fill the gap between the mainstream search
world such as Google, Yahoo and Bing, and the major social networks such as Facebook, Twitter
and LinkedIn.
2. BACKGROUND
This paper focuses on discussing HeyStaks as a collaborative information retrieval
technology, augmented by a reputation system based on the collaborations that
implicitly take place between searchers in the HeyStaks social search utility. As
such this background section covers recent, relevant work in the two broad areas of
collaborative information retrieval and reputation systems.
2.1 Collaborative Information Retrieval
Approaches to collaborative information retrieval can be usefully distinguished in
terms of two important dimensions, time synchronous versus asynchronous
search and place that is, co-located versus remote searchers. Co-located
systems offer a collaborative search experience for multiple searchers at a single
location, typically sharing a single PC [Amershi and Morris 2008; Smeaton et al.
2008], whereas remote approaches allow searchers to perform their searches at dif-
ferent locations across multiple devices [Morris and Horvitz 2007a; 2007b; Smyth
et al. 2009b]. The former enjoy the obvious benefit of an increased faculty for direct
ACM Transactions on Intelligent Systems and Technology, Vol. 2, No. 3, 09 2001.

4 · McNally et al.
collaboration that is enabled by the face-to-face nature of co-located search, while
the latter offer a greater opportunity for collaborative search. Alternatively, syn-
chronous approaches are characterised by systems that broadcast a “call to search”,
in which specific participants are requested to engage in a well-defined search task
for a well defined period of time [Smeaton et al. 2008]. In contrast, asynchronous
approaches are characterised by less well-defined, ad-hoc search tasks and provide
for a more open-ended approach to collaboration in which different searchers con-
tribute to an evolving search session over an extended period of time [Morris and
Horvitz 2007a; Boydell and Smyth 2010].
A good example of the co-located, synchronous approach to collaborative web
search is given by the work of Amershi and Morris [2008]. Their CoSearch system
is designed to improve the search experience for co-located users where computing
resources are limited; for example, a group of school children having access to a
single PC. CoSearch is specifically designed to leverage peripheral devices that may
be available (e.g. mobile phones, extra mice etc.) to facilitate distributed control
and division of effort, while maintaining group awareness and communication. The
purpose of CoSearch is to demonstrate the potential for productive collaborative
web search in resource-limited environments. The focus is very much on dividing
the search labour while maintaining communication between searchers, and live
user studies speak to the success of CoSearch in this regard [Amershi and Morris
2008]. The work of Smeaton et al. [2007] is related in spirit to CoSearch but fo-
cuses on image search tasks using a table-top computing environment. Once again,
preliminary studies speak to the potential for such an approach to improve overall
search productivity and collaboration, at least in specific types of information ac-
cess tasks. A variation on these forms of synchronous search activities is presented
by Smeaton et al. [2008], where the use of mobile devices as the primary search
device allows for a remote form of synchronous collaborative search. The iBingo
system allows a group of users to collaborate on an image search task with each
user using a iPod touch device as their primary search/feedback device (although
conventional PCs appear to be just as applicable).
Remote search collaboration (whether asynchronous or synchronous) is the aim of
SearchTogether, which allows groups of searchers to participate in extended shared
search sessions as they search to locate information on particular topics [Morris
and Horvitz 2007a]. The SearchTogether system allows users to create shared
search sessions and invite other users to join in these sessions. Each searcher can
independently search for information on a particular topic, but the system provides
features to allow individual searchers to share what they find with other session
members by recommending and commenting on specific results. SearchTogether
supports synchronous collaborative search by allowing searchers to invite others to
join in specific search tasks, allowing cooperating searchers to synchronously view
the results of each others’ searches via a split-screen style results interface. As
with CoSearch above, one of the key design goals in SearchTogether is to support a
division of labour in complex, open-ended search tasks. In addition, a key feature
of the work is the ability to create a shared awareness among group members by
reducing the overhead of search collaboration at the interface level. SearchTogether
does this by including various features, from integrated messaging, query histories,
ACM Transactions on Intelligent Systems and Technology, Vol. 2, No. 3, 09 2001.

Citations
More filters
Journal ArticleDOI
TL;DR: An overview of recommender systems as well as collaborative filtering methods and algorithms is provided, which explains their evolution, provides an original classification for these systems, identifies areas of future implementation and develops certain areas selected for past, present or future importance.
Abstract: Recommender systems have developed in parallel with the web. They were initially based on demographic, content-based and collaborative filtering. Currently, these systems are incorporating social information. In the future, they will use implicit, local and personal information from the Internet of things. This article provides an overview of recommender systems as well as collaborative filtering methods and algorithms; it also explains their evolution, provides an original classification for these systems, identifies areas of future implementation and develops certain areas selected for past, present or future importance.

2,639 citations

Patent
04 Oct 2012
TL;DR: In this paper, a graph is created of content recommended by members of the social network, where the nodes of the graph include content nodes (for the content) and recommending member nodes (recommended by members who recommended the content).
Abstract: Architecture that provides a data structure to facilitate personalized ranking over recommended content (e.g., documents). The data structure approximates the social distance of the searching user to the content at query time. A graph is created of content recommended by members of the social network, where the nodes of the graph include content nodes (for the content) and recommending member nodes (for members of the social network who recommended the content). If a member recommends content, an edge is created between the member node and the content node. If a member is a "friend" (tagged as related in some way) of another member, an edge is created between the two member nodes. Each node is converted to a lower dimensional feature set. Feature sets of the content are indexed and the feature set of the searching user is utilized to match and rank the search results at query time.

152 citations

Journal ArticleDOI
TL;DR: New recommendation methods using Adaptive Neuro-Fuzzy Inference Systems (ANFIS) and Self-Organizing Map (SOM) clustering are proposed to improve predictive accuracy of criteria CF and experimental results demonstrate that the proposed hybrid methods remarkably improve the accuracy of multi-criteria CF in relation to the previous methods based on multi-Criteria ratings.
Abstract: Recommender systems are software tools and techniques for suggesting items in an automated fashion to users tailored their preferences. Collaborative Filtering (CF) techniques, which attempt to predict what information will meet a user's needs from the neighborhoods of like-minded people, are becoming increasingly popular as ways to overcome the information overload. The multi-criteria based CF presents a possibility to provide accurate recommendations by considering the user preferences in multiple aspects and several methods have been proposed for improving the accuracy of these systems. However, the problem of multi-criteria recommendations with a single and overall rating is still considered an optimization problem. In addition, increasing the accuracy in predicting the appropriate items tailored to the users' preferences is on of the main challenges in these systems. Hence, in this research new recommendation methods using Adaptive Neuro-Fuzzy Inference Systems (ANFIS) and Self-Organizing Map (SOM) clustering are proposed to improve predictive accuracy of criteria CF. In this research, SOM enables us to generate high quality clusters of dataset and ANFIS is used for discovering knowledge (fuzzy rules) from users' ratings in multi-criteria dataset, generating appropriate membership functions (MFs), overall rating prediction and input selection. Using exhaustive search method for input selection, the effective inputs are determined to build the ANFIS models in all generated clusters. Furthermore, new fuzzy-based algorithms, Weighted Fuzzy MC-CF (WFuMC-CF), Fuzzy Euclidean MC-CF (FuEucMC-CF) and Fuzzy Average MC-CF (FuAvgMC-CF), are presented for prediction task in multi-criteria CF. FuEucMC-CF and FuAvgMC-CF algorithms uses the fuzzy-based Euclidian distance and fuzzy-based average similarity, respectively, the WFuMC-CF algorithm uses fuzzy-based user- and item-based prediction in a weighted approach. Experimental results on real-world dataset demonstrate that the proposed hybrid methods remarkably improve the accuracy of multi-criteria CF in relation to the previous methods based on multi-criteria ratings.

144 citations

Journal ArticleDOI
TL;DR: A model for delivering real-time, personalised marketing information concerning the recommended items for online and offline customers is described, using a blend of selling strategies: up-sell, cross-selling, best-in-class- selling, needs-satisfaction-selling and consultative-selling.

75 citations

Journal ArticleDOI
TL;DR: Examination of types of maintenance behaviors enacted on Facebook and how they predict relational escalation of Facebook friendships shows that most relationships go through a gradual rather than an extreme change and that these changes reflect both relational escalation and de-escalation.

56 citations

References
More filters
Journal ArticleDOI
01 Mar 2007
TL;DR: Trust and reputation systems represent a significant trend in decision support for Internet mediated service provision as mentioned in this paper, where the basic idea is to let parties rate each other, for example after the completion of a transaction, and use the aggregated ratings about a given party to derive a trust or reputation score.
Abstract: Trust and reputation systems represent a significant trend in decision support for Internet mediated service provision. The basic idea is to let parties rate each other, for example after the completion of a transaction, and use the aggregated ratings about a given party to derive a trust or reputation score, which can assist other parties in deciding whether or not to transact with that party in the future. A natural side effect is that it also provides an incentive for good behaviour, and therefore tends to have a positive effect on market quality. Reputation systems can be called collaborative sanctioning systems to reflect their collaborative nature, and are related to collaborative filtering systems. Reputation systems are already being used in successful commercial online applications. There is also a rapidly growing literature around trust and reputation systems, but unfortunately this activity is not very coherent. The purpose of this article is to give an overview of existing and proposed systems that can be used to derive measures of trust and reputation for Internet transactions, to analyse the current trends and developments in this area, and to propose a research agenda for trust and reputation systems.

3,493 citations

Journal ArticleDOI
TL;DR: Systems T he Internet offers vast new opportunities to interact with total strangers, but these interactions can be fun, informative, even profitable, but they also involve risk.
Abstract: Systems T he Internet offers vast new opportunities to interact with total strangers. These interactions can be fun, informative, even profitable. But they also involve risk. Is the advice of a self-proclaimed expert at expertcentral.com reliable? Will an unknown dotcom site or eBay seller ship items promptly with appropriate packaging? Will the product be the same one described online? Prior to the Internet, such questions were answered, in part, through personal and corporate reputations. Vendors provided references, Better Business Bureaus tallied complaints, and past personal experience and person-to-person gossip told you on whom you could rely and on whom you could not. Participants’ standing in their communities, including their roles in church and civic organizations, served as a valuable hostage. Internet services operate on a vastly larger scale

2,410 citations


"A Case Study of Collaboration and R..." refers background in this paper

  • ...…in reputation systems to provide a mechanism to evaluate user reputation and interuser trust across a growing number of social Web and e-commerce applications [Jøsang and Golbeck 2009; O Donovan and Smyth 2005, 2006; Resnick and Zeckhauser 2002; Resnick et al. 2000; Sabater and Sierra 2005]....

    [...]

Book ChapterDOI
31 Oct 2002
TL;DR: Examination of a large data set from 1999 reveals several interesting features, including a high correlation between buyer and seller feedback, suggesting that the players reciprocate and retaliate.
Abstract: One of the earliest and best known Internet reputation systems is run by eBay, which gathers comments from buyers and sellers about each other after each transaction. Examination of a large data set from 1999 reveals several interesting features. First, despite incentives to free ride, feedback was provided more than half the time. Second, well beyond reasonable expectation, it was almost always positive. Third, reputation profiles were predictive of future performance, though eBay's net feedback statistic is far from the best predictor available. Fourth, there was a high correlation between buyer and seller feedback, suggesting that the players reciprocate and retaliate.

1,948 citations


"A Case Study of Collaboration and R..." refers background in this paper

  • ...Recently there has been considerable interest in reputation systems to provide a mechanism to evaluate user reputation and inter-user trust across a growing number of social web and e-commerce applications [Jøsang and Golbeck 2009; O’Donovan and Smyth 2005; 2006; Sabater and Sierra 2005; Resnick and Zeckhauser 2002; Resnick et al. 2000]....

    [...]

  • ...…in reputation systems to provide a mechanism to evaluate user reputation and interuser trust across a growing number of social Web and e-commerce applications [Jøsang and Golbeck 2009; O Donovan and Smyth 2005, 2006; Resnick and Zeckhauser 2002; Resnick et al. 2000; Sabater and Sierra 2005]....

    [...]

Journal Article
TL;DR: A new conceptual paradigm for performing search in context is presented, that largely automates the search process, providing even non-professional users with highly relevant results.
Abstract: Keyword-based search engines are in widespread use today as a popular means for Web-based information retrieval Although such systems seem deceptively simple, a considerable amount of skill is required in order to satisfy non-trivial information needs This paper presents a new conceptual paradigm for performing search in context, that largely automates the search process, providing even non-professional users with highly relevant results This paradigm is implemented in practice in the IntelliZap system, where search is initiated from a text query marked by the user in a document she views, and is guided by the text surrounding the marked query in that document (“the context”) The context-driven information retrieval process involves semantic keyword extraction and clustering to automatically generate new, augmented queries The latter are submitted to a host of general and domain-specific search engines Search results are then semantically reranked, using context Experimental results testify that using context to guide search, effectively offers even inexperienced users an advanced search tool on the Web

1,615 citations

Proceedings ArticleDOI
19 Oct 2007
TL;DR: This work proposes to replace the step of finding similar users with the use of a trust metric, an algorithm able to propagate trust over the trust network and to estimate a trust weight that can be used in place of the similarity weight.
Abstract: Recommender Systems based on Collaborative Filtering suggest to users items they might like. However due to data sparsity of the input ratings matrix, the step of finding similar users often fails. We propose to replace this step with the use of a trust metric, an algorithm able to propagate trust over the trust network and to estimate a trust weight that can be used in place of the similarity weight. An empirical evaluation on Epinions.com dataset shows that Recommender Systems that make use of trust information are the most effective in term of accuracy while preserving a good coverage. This is especially evident on users who provided few ratings.

1,137 citations

Frequently Asked Questions (9)
Q1. What have the authors contributed in "A case study of collaboration and reputation in social web search" ?

Title A Case Study of Collaboration and Reputation in Social Web Search. Publisher 's statement © ACM, 2011 This is the author 's version of the work. 

The focus of this paper is the HeyStaks search service (www.heystaks.com), which adds a layer of collaboration on top of mainstream search engines: so users continue to search as normal but benefit from a more collaborative/social search experience. 

The 3 other forms of actions (voting, sharing, tagging) the authors refer to as explicit actions in the sense that they are not part of the normal search process, but rather they are HeyStaks specific actions that the user must chose to use. 

Part of the problem rests with the searchers themselves: with an average of only 2-3 terms [Lawrence and Giles 1998; Spink and Jansen 2004], the typical Web search query is often vague with respect to the searcher’s true intentions or information needs [Song et al. 2007]. 

Each candidate result, r, is assigned a relevance score using a TF*IDF -based retrieval function as per Equation 2, which serves as the basis for an initial recommendation ranking. 

Similar to O’Donovan and Smyth [2005], Massa and Avesani [2007] propose a reputation algorithm called MoleTrust that can be used to augment an existing collaborative filtering system. 

This work is, in part, motivated by the idea that an understanding of user reputation can serve as the basis for strategies to guard against malicious users [Lazzari 2010; Hoffman et al. 

the standardACM Transactions on Intelligent Systems and Technology, Vol. 2, No. 3, 09 2001.collaborative filtering algorithm is modified to add a user-user trust score to compliment the normal profile or item-based similarity score, so that recommendation partners are chosen from those users that are not only similar to the target user, but who have also had a positive recommendation history with that user. 

In addition, a second contribution of this paper is a novel enhanced reputation model for HeyStaks, which has been developed in order to evaluate the reputation of individual searchers based on their search contributions.