scispace - formally typeset
Search or ask a question

Showing papers by "Daqing Zhang published in 2015"


Journal ArticleDOI
01 Jan 2015
TL;DR: A STAP model is proposed that first models the spatial and temporal activity preference separately, and then uses a principle way to combine them for preference inference, and a context-aware fusion framework is put forward to combine the temporal and spatial activity preference models for preferences inference.
Abstract: With the recent surge of location based social networks (LBSNs), activity data of millions of users has become attainable. This data contains not only spatial and temporal stamps of user activity, but also its semantic information. LBSNs can help to understand mobile users’ spatial temporal activity preference (STAP), which can enable a wide range of ubiquitous applications, such as personalized context-aware location recommendation and group-oriented advertisement. However, modeling such user-specific STAP needs to tackle high-dimensional data, i.e., user-location-time-activity quadruples, which is complicated and usually suffers from a data sparsity problem. In order to address this problem, we propose a STAP model. It first models the spatial and temporal activity preference separately, and then uses a principle way to combine them for preference inference. In order to characterize the impact of spatial features on user activity preference, we propose the notion of personal functional region and related parameters to model and infer user spatial activity preference. In order to model the user temporal activity preference with sparse user activity data in LBSNs, we propose to exploit the temporal activity similarity among different users and apply nonnegative tensor factorization to collaboratively infer temporal activity preference. Finally, we put forward a context-aware fusion framework to combine the spatial and temporal activity preference models for preference inference. We evaluate our proposed approach on three real-world datasets collected from New York and Tokyo, and show that our STAP model consistently outperforms the baseline approaches in various settings.

548 citations


Proceedings ArticleDOI
07 Sep 2015
TL;DR: A novel framework called CCS-TA is proposed, combining the state-of-the-art compressive sensing, Bayesian inference, and active learning techniques, to dynamically select a minimum number of sub-areas for sensing task allocation in each sensing cycle, while deducing the missing data of unallocated sub-wereas under a probabilistic data accuracy guarantee.
Abstract: Data quality and budget are two primary concerns in urban-scale mobile crowdsensing applications. In this paper, we leverage the spatial and temporal correlation among the data sensed in different sub-areas to significantly reduce the required number of sensing tasks allocated (corresponding to budget), yet ensuring the data quality. Specifically, we propose a novel framework called CCS-TA, combining the state-of-the-art compressive sensing, Bayesian inference, and active learning techniques, to dynamically select a minimum number of sub-areas for sensing task allocation in each sensing cycle, while deducing the missing data of unallocated sub-areas under a probabilistic data accuracy guarantee. Evaluations on real-life temperature and air quality monitoring datasets show the effectiveness of CCS-TA. In the case of temperature monitoring, CCS-TA allocates 18.0-26.5% fewer tasks than baseline approaches, allocating tasks to only 15.5% of the sub-areas on average while keeping overall sensing error below 0.25°C in 95% of the cycles.

165 citations


Journal ArticleDOI
TL;DR: This paper intends to uncover the efficient and inefficient taxi service strategies based on a large-scale GPS historical database of approximately 7600 taxis over one year in a city in China and predicts the revenue of taxi drivers based on their strategies and achieves a prediction residual as less as 2.35 RMB/h.
Abstract: Taxi service strategies, as the crowd intelligence of massive taxi drivers, are hidden in their historical time-stamped GPS traces. Mining GPS traces to understand the service strategies of skilled taxi drivers can benefit the drivers themselves, passengers, and city planners in a number of ways. This paper intends to uncover the efficient and inefficient taxi service strategies based on a large-scale GPS historical database of approximately 7600 taxis over one year in a city in China. First, we separate the GPS traces of individual taxi drivers and link them with the revenue generated. Second, we investigate the taxi service strategies from three perspectives, namely, passenger-searching strategies, passenger-delivery strategies, and service-region preference. Finally, we represent the taxi service strategies with a feature matrix and evaluate the correlation between service strategies and revenue, informing which strategies are efficient or inefficient. We predict the revenue of taxi drivers based on their strategies and achieve a prediction residual as less as 2.35 RMB/h, The currency unit in China; 1 RMB $\approx$ U.S. $0.17.

152 citations


Journal ArticleDOI
TL;DR: This paper presents FlierMeet, a crowd-powered sensing system for cross-space public information reposting, tagging, and sharing that utilizes various contexts and textual features to group similar reposts and classify them into categories.
Abstract: Community bulletin boards serve an important function for public information sharing in modern society. Posted fliers advertise services, events, and other announcements. However, fliers posted offline suffer from problems such as limited spatial-temporal coverage and inefficient search support. In recent years, with the development of sensor-enhanced mobile devices, mobile crowd sensing (MCS) has been used in a variety of application areas. This paper presents FlierMeet, a crowd- powered sensing system for cross-space public information reposting, tagging, and sharing. The tags learned are useful for flier sharing and preferred information retrieval and suggestion. Specifically, we utilize various contexts (e.g., spatio-temporal info, flier publishing/reposting behaviors, etc.) and textual features to group similar reposts and classify them into categories. We further identify a novel set of crowd-object interaction hints to predict the semantic tags of reposts. To evaluate our system, 38 participants were recruited and 2,035 reposts were captured during an eight-week period. Experiments on this dataset showed that our approach to flier grouping is effective and the proposed features are useful for flier category/semantic tagging.

145 citations


Journal ArticleDOI
TL;DR: This paper proposes a novel framework, leveraging a combination of location-based social network (i.e., LBSN) and taxi GPS digital footprints to achieve personalized, interactive, and traffic-aware trip planning.
Abstract: Planning an itinerary before traveling to a city is one of the most important travel preparation activities. In this paper, we propose a novel framework called TripPlanner , leveraging a combination of location-based social network (i.e., LBSN) and taxi GPS digital footprints to achieve personalized, interactive, and traffic-aware trip planning. First, we construct a dynamic point-of-interest network model by extracting relevant information from crowdsourced LBSN and taxi GPS traces. Then, we propose a two-phase approach for personalized trip planning. In the route search phase , TripPlanner works interactively with users to generate candidate routes with specified venues . In the route augmentation phase , TripPlanner applies heuristic algorithms to add user's preferred venues iteratively to the candidate routes, with the objective of maximizing the route score while satisfying both the venue visiting time and total travel time constraints. To validate the efficiency and effectiveness of the proposed approach, extensive empirical studies were performed on two real-world data sets from the city of San Francisco, which contain more than 391 900 passenger delivery trips generated by 536 taxis in a month and 110 214 check-ins left by 15 680 Foursquare users in six months.

131 citations


Journal ArticleDOI
TL;DR: NationTelescope, a platform that monitors, compares and visualizes large-scale nation-wide user behavior in LBSNs, is proposed and the results show that the platform can not only efficiently capture, compare and visualizenation-wide collective behavior, but also achieve good usability and user experience.

125 citations


Proceedings ArticleDOI
07 Sep 2015
TL;DR: Evaluation of the semi-supervised feature selection method proposed shows that it can be applied to different cities to effectively recommend places with higher potential bike trip demand for placing future bike stations.
Abstract: Bike sharing systems have been deployed in many cities to promote green transportation and a healthy lifestyle. One of the key factors for maximizing the utility of such systems is placing bike stations at locations that can best meet users' trip demand. Traditionally, urban planners rely on dedicated surveys to understand the local bike trip demand, which is costly in time and labor, especially when they need to compare many possible places. In this paper, we formulate the bike station placement issue as a bike trip demand prediction problem. We propose a semi-supervised feature selection method to extract customized features from the highly variant, heterogeneous urban open data to predict bike trip demand. Evaluation using real-world open data from Washington, D.C. and Hangzhou shows that our method can be applied to different cities to effectively recommend places with higher potential bike trip demand for placing future bike stations.

122 citations


Journal ArticleDOI
TL;DR: A novel mobile crowdsensing framework called EMC3 is proposed, which intends to reduce energy consumption of individual user as well as all participants in data transfer caused by task assignment and data collection of MCS tasks, considering the user privacy issue, minimal number of task assignment requirement and sensing area coverage constraint.
Abstract: This paper proposes a novel mobile crowdsensing (MCS) framework called EMC $^3$ , which intends to reduce energyconsumption of individual user as well as all participants in data transfer caused by task assignment and data collection of MCS tasks, considering the user privacy issue, minimal number of task assignment requirement and sensing area coverage constraint.Specifically, EMC $^3$ incorporates novel pace control and decision making mechanisms for task assignment, leveraging participants’current call, historical call records as well as predicted future calls and mobility, in order to ensure the expected number of participants to return sensed results and fully cover the target area, with the objective of assigning a minimal number of tasks. Extensive evaluation with a large-scale real-world dataset shows that EMC $^3$ assigns much less sensing tasks compared to baseline approaches, it can save 43%-68% energy in data transfer compared to the traditional 3G-based scheme.

120 citations


Proceedings ArticleDOI
23 Mar 2015
TL;DR: CrowdTasker operates on top of energy-efficient Piggyback Crowdsensing (PCS) task model, and aims to maximize the coverage quality of the sensing task while satisfying the incentive budget constraint.
Abstract: This paper proposes a novel task allocation framework, CrowdTasker, for mobile crowdsensing. CrowdTasker operates on top of energy-efficient Piggyback Crowdsensing (PCS) task model, and aims to maximize the coverage quality of the sensing task while satisfying the incentive budget constraint. In order to achieve this goal, CrowdTasker first predicts the call and mobility of mobile users based on their historical records. With a flexible incentive model and the prediction results, CrowdTasker then selects a set of users in each sensing cycle for PCS task participation, so that the resulting solution achieves near-maximal coverage quality without exceeding incentive budget. We evaluated CrowdTasker extensively using a large-scale real-world dataset and the results show that CrowdTasker significantly outperformed three baseline approaches by achieving 3%–60% higher coverage quality.

116 citations


Journal ArticleDOI
19 May 2015
TL;DR: EffSense is an energy-efficient and cost-effective data uploading framework, which utilizes adaptive uploading schemes within fixed data uploading cycles, and can reduce 55%-65% energy consumption for DP users, and 48%-52% data cost for NDP users, respectively, compared to traditional uploading schemes.
Abstract: Energy consumption and mobile data cost are two key factors affecting users’ willingness to participate in mobile crowd-sensing tasks. While data-plan (DP) users are mostly concerned with energy consumption, non-data-plan (NDP) users are more sensitive to data cost. Traditional ways of data uploading in mobile crowdsensing tasks often go to two extremes: either in real time or completely offline after the whole task is over. In this paper, we propose effSense—an energy-efficient and cost-effective data uploading framework, which utilizes adaptive uploading schemes within fixed data uploading cycles. In each cycle, effSense empowers the participants with a distributed decision making scheme to choose the appropriate timing and network to upload data. effSense reduces data cost for NDP users by maximally offloading data to Bluetooth/WiFi gateways or DP users encountered; it reduces energy consumption for DP users by piggybacking data on a call or using more energy-efficient networks rather than initiating new 3G connections. By leveraging the predictability of users’ calls and mobility, effSense selects proper uploading strategies for both user types. Our evaluation with the MIT reality mining and Nodobo datasets shows that effSense can reduce 55%–65% energy consumption for DP users, and 48%–52% data cost for NDP users, respectively, compared to traditional uploading schemes.

89 citations


Journal ArticleDOI
TL;DR: NextCell-a novel algorithm that aims to enhance the location prediction by harnessing the social interplay revealed in cellular call records and achieves higher precision and recall than the state-of-the-art schemes at cell tower level in the forthcoming one to six hours.
Abstract: Location prediction based on cellular network traces has recently spurred lots of attention. However, predicting user mobility remains a very challenging task due to the fuzziness of human mobility patterns. Our preliminary study included in this paper shows that there is a strong correlation between the calling patterns and co-cell patterns of users (i.e., co-occurrence in the same cell tower at the same time). Based on this finding, we propose NextCell—a novel algorithm that aims to enhance the location prediction by harnessing the social interplay revealed in cellular call records. Moreover, our proposal removes the assumption held in previous schemes that binds locations of cell towers to concrete physical coordinates, e.g., GPS coordinates. We validate our approach with the MIT Reality Mining dataset that involves 32,579 symbolic cell tower locations and 350,000 hours of continuous activity information. Experimental results show that NextCell achieves higher precision and recall than the state-of-the-art schemes at cell tower level in the forthcoming one to six hours.

Journal ArticleDOI
Haoyi Xiong1, Daqing Zhang1, Leye Wang1, J. Paul Gibson1, Jie Zhu2 
TL;DR: Evaluations with a large-scale real-world phone call dataset show that the proposed EEMC framework outperforms the baseline approaches, and it can reduce overall energy consumption in data transfer by 54--66p when compared to the 3G-based solution.
Abstract: Mobile Crowdsensing (MCS) requires users to be motivated to participate. However, concerns regarding energy consumption and privacy—among other things—may compromise their willingness to join such a crowd. Our preliminary observations and analysis of common MCS applications have shown that the data transfer in MCS applications may incur significant energy consumption due to the 3G connection setup. However, if data are transferred in parallel with a traditional phone call, then such transfer can be done almost “for free”: with only an insignificant additional amount of energy required to piggy-back the data—usually incoming task assignments and outgoing sensor results—on top of the call. Here, we present an Energy-Efficient Mobile Crowdsensing (EEMC) framework where task assignments and sensing results are transferred in parallel with phone calls. The main objective, and the principal contribution of this article, is an MCS task assignment scheme that guarantees that a minimum number of anonymous participants return sensor results within a specified time frame, while also minimizing the waste of energy due to redundant task assignments and considering privacy concerns of participants. Evaluations with a large-scale real-world phone call dataset show that our proposed EEMC framework outperforms the baseline approaches, and it can reduce overall energy consumption in data transfer by 54--66p when compared to the 3G-based solution.

Posted Content
TL;DR: In this paper, the authors characterize the unique features and challenges of Mobile Crowd Sensing and Computing (MCSC) and present early efforts on MCSC to demonstrate the benefits of aggregating heterogeneous crowdsourced data.
Abstract: With the development of mobile sensing and mobile social networking techniques, Mobile Crowd Sensing and Computing (MCSC), which leverages heterogeneous crowdsourced data for large-scale sensing, has become a leading paradigm. Built on top of the participatory sensing vision, MCSC has two characterizing features: (1) it leverages heterogeneous crowdsourced data from two data sources: participatory sensing and participatory social media; and (2) it presents the fusion of human and machine intelligence (HMI) in both the sensing and computing process. This paper characterizes the unique features and challenges of MCSC. We further present early efforts on MCSC to demonstrate the benefits of aggregating heterogeneous crowdsourced data.

Book ChapterDOI
10 Jun 2015
TL;DR: In this paper, a real-time, non-intrusive, and low-cost indoor fall detector, called Anti-Fall, was proposed. And the CSI phase difference over two antennas is identified as the salient feature to reliably segment the fall and fall-like activities, both phase and amplitude information of CSI is then exploited to accurately separate the fall from other fall- like activities.
Abstract: Fall is one of the major health threats and obstacles to independent living for elders, timely and reliable fall detection is crucial for mitigating the effects of falls. In this paper, leveraging the fine-grained Channel State Information (CSI) and multi-antenna setting in commodity WiFi devices, we design and implement a real-time, non-intrusive, and low-cost indoor fall detector, called Anti-Fall. For the first time, the CSI phase difference over two antennas is identified as the salient feature to reliably segment the fall and fall-like activities, both phase and amplitude information of CSI is then exploited to accurately separate the fall from other fall-like activities. Experimental results in two indoor scenarios demonstrate that Anti-Fall consistently outperforms the state-of-the-art approach WiFall, with 10% higher detection rate and 10% less false alarm rate on average.

Journal ArticleDOI
TL;DR: The iBDD method is developed that is able to detect two categories of outlying trajectories in a uniform framework in real-time and can achieve 95% detection rate of disorientation with less than 3% of false positives, based on properly chosen parameters.

Journal ArticleDOI
13 Jan 2015
TL;DR: A framework that supports marketers in improving marketing effectiveness by carefully selecting invitees to sponsored offline events by leveraging location-based social networks is presented and the proposed marketing effect quantitative model is validated with real-world data.
Abstract: Offline event marketing invites people to participate in a sponsored gathering, thus allowing marketers to have face-to-face, direct, and close contact with their current and potential customers. This paper presents a framework that supports marketers in improving marketing effectiveness by carefully selecting invitees to such sponsored offline events by leveraging location-based social networks. In particular, we first transform the participant selection task into a combinatorial optimization problem. Second, we propose a marketing effect quantitative model that considers the distance and overlapping social influence. Third, we introduce algorithms to determine a participant team that can maximize the marketing effect while fulfilling the scale and item coverage constraints. We finally evaluate the effectiveness of the framework and validate the proposed marketing effect of the quantitative model with real-world data.

Proceedings ArticleDOI
07 Sep 2015
TL;DR: This paper proposes a novel Credit Distribution-User Influence Preference (CD-UIP) algorithm to find the most influential and preferable followers as the invitees in EBSNs, and demonstrates the proposed algorithm outperforms the state-of-the-art prediction methods.
Abstract: The newly emerging event-based social networks (EBSNs) extend social interaction from online to offline, providing an appealing platform for people to organize and participate realworld social events. In this paper, we investigate how to select potential participants in EBSNs from an event host's point of view. We formulate the problem as mining influential and preferable invitee set, considering from two complementary aspects. The first aspect concerns users' preference with respect to the event. The second aspect is influence maximization, which aims to influence the largest number of users to participate the event. In particular, we propose a novel Credit Distribution-User Influence Preference (CD-UIP) algorithm to find the most influential and preferable followers as the invitees. We collect a real-world dataset from a popular EBSNs called "Douban Events", and the experimental results on the dataset demonstrate the proposed algorithm outperforms the state-of-the-art prediction methods.

Book ChapterDOI
TL;DR: In this paper, a real-time, non-intrusive, and low-cost indoor fall detector, called Anti-Fall, was proposed. And the CSI phase difference over two antennas is identified as the salient feature to reliably segment the fall and fall-like activities, both phase and amplitude information of CSI is then exploited to accurately separate the fall from other fall- like activities.
Abstract: Fall is one of the major health threats and obstacles to independent living for elders, timely and reliable fall detection is crucial for mitigating the effects of falls. In this paper, leveraging the fine-grained Channel State Information (CSI) and multi-antenna setting in commodity WiFi devices, we design and implement a real-time, non-intrusive, and low-cost indoor fall detector, called Anti-Fall. For the first time, the CSI phase difference over two antennas is identified as the salient feature to reliably segment the fall and fall-like activities, both phase and amplitude information of CSI is then exploited to accurately separate the fall from other fall-like activities. Experimental results in two indoor scenarios demonstrate that Anti-Fall consistently outperforms the state-of-the-art approach WiFall, with 10% higher detection rate and 10% less false alarm rate on average.

Journal ArticleDOI
TL;DR: The content of human and machine intelligence, their complementary roles, and their potential collaboration modes in MCSC are described and several applications to demonstrate the power and usage of human-machine intelligence in MC SC are presented.
Abstract: Mobile crowd sensing and computing (MCSC) is a large-scale sensing and collective knowledge discovery paradigm that fuses human and machine intelligence. This article describes the content of human and machine intelligence, their complementary roles, and their potential collaboration modes in MCSC. The authors also discuss the challenges that arise from using human intelligence in MCSC systems. They further present several applications to demonstrate the power and usage of human-machine intelligence in MCSC.

Proceedings ArticleDOI
01 Aug 2015
TL;DR: This paper resorts to urban open data from bike sharing systems, and proposes a two-phase framework to identify social activities in Urban Activity Centers based on bike sharing open data, and shows that the framework can efficiently identify social Activities in different types of Urban activity Centers and outperforms the baseline approach.
Abstract: Understanding social activities in Urban Activity Centers can benefit both urban authorities and citizens. Traditionally, monitoring large social activities usually incurs significant costs of human labor and time. Fortunately, with the recent booming of urban open data, a wide variety of human digital footprints have become openly accessible, providing us with new opportunities to understand the social dynamics in the cities. In this paper, we resort to urban open data from bike sharing systems, and propose a two-phase framework to identify social activities in Urban Activity Centers based on bike sharing open data. More specifically, we first detect bike usage anomalies from the bike trip data, and then identify the potential social activities from the detected anomalies using a proposed heuristic method by considering both spatial and temporal constraints. We evaluate our framework based on the large-scale real-world dataset collected from the bike sharing system of Washington, D.C. The results show that our framework can efficiently identify social activities in different types of Urban Activity Centers and outperforms the baseline approach. In particular, our framework can identify 89% of the social activities in the major Urban Activity Centers of Washington, D.C.

Journal ArticleDOI
TL;DR: The new term mobile crowd sensing and computing (MCSC) is raised to characterize crowd intelligence extraction from large-scale and heterogeneous usercontributed data and aggregates and fuses the data in the cloud for crowd Intelligence extraction and human-centric service delivery.
Abstract: Participatory sensing [Burke 2006] is an emerging computing paradigm that tasks everyday mobile devices to form participatory sensor networks. It allows the increasing number of mobile phone users to share local knowledge acquired by their sensorenhanced devices, such as monitoring of pollution or noise levels and traffic conditions. The sensing data from volunteer contributors can be further analyzed and processed to form crowd intelligence [Zhang et al. 2011], which can be elaborated into three dimensions: personal awareness, social awareness, and urban awareness. Layered on these concepts, we have raised the new term mobile crowd sensing and computing (MCSC) to characterize crowd intelligence extraction from large-scale and heterogeneous usercontributed data [Guo et al. 2014]. A formal definition of MCSC is as follows: a new sensing paradigm that empowers ordinary citizens to contribute data sensed or generated from their mobile devices, then aggregates and fuses the data in the cloud for crowd intelligence extraction and human-centric service delivery. It has the following three features compared to participatory sensing:

Proceedings ArticleDOI
01 Aug 2015
TL;DR: A novel framework to distinguish PD gait patterns from healthy individuals by accurately extracting gait features that indicate three aspects of movement function, i.e., Stability, symmetry and harmony.
Abstract: Parkinson's Disease (PD) is one of the typical movement disorder diseases, which has a serious impact on the daily lives of elderly people. In this paper, we propose a novel framework for PD gait pattern recognition. The key idea of our approach is to distinguish PD gait patterns from healthy individuals by accurately extracting gait features that indicate three aspects of movement function, i.e., Stability, symmetry and harmony. Concretely, our framework contains three steps: gait phase discrimination, feature extraction and selection and pattern classification. In the first step, we put forward a key event based method to discriminate four gait phases from plantar pressure data. In the second step, based on the gait phases, we extract and select gait features that can indicate stability, symmetry and harmony of movement function. In the third step, we recognize PD gait pattern by employing BP neural network. We evaluate the framework using a real plantar pressure dataset that contains 93 PD patients and 72 healthy individuals. Experimental results demonstrate that our framework outperforms the baseline approach by 32.7% on average in terms of Precision, 42.2% on average in terms of Recall, and 24.0% on average in terms of AUC.

Journal ArticleDOI
TL;DR: This paper proposes a novel multi-parametric sensing system called sleep pattern recognition system (SPRS), equipped with a combination of various non-invasive sensors, that can monitor an elderly user’s sleep behavior and assess the user's sleep pattern automatically via machine learning algorithms.
Abstract: The quality of sleep may be a reflection of an elderly individual's health state, and sleep pattern is an important measurement. Recognition of sleep pattern by itself is a challenge issue, especially for elderly-care community, due to both privacy concerns and technical limitations. We propose a novelmulti-parametric sensing system called sleep pattern recognition system (SPRS). This system, equipped with a combination of various non-invasive sensors, can monitor an elderly user's sleep behavior. It accumulates the detecting data from a pressure sensor matrix and ultra wide band (UWB) tags. Based on these two types of complementary sensing data, SPRS can assess the user's sleep pattern automatically via machine learning algorithms. Compared to existing systems, SPRS operateswithout disrupting the users' sleep. It can be used in normal households with minimal deployment. Results of tests in our real assistive apartment at the Smart Elder-care Lab are also presented in this paper.

Proceedings ArticleDOI
01 Aug 2015
TL;DR: This paper proposes a novel worker selection framework for crowd sensing by proposing a core ontology model to semantically express general factors, based on which task creators can build their own task-specific models efficiently.
Abstract: Worker selection is very crucial for crowd-sensing to ensure high data quality. Existing approaches have two limitations. First, they only take specific factors into account for their motivating application scenarios, but do not provide general models in support of crowd-sensing at large. Second, they select workers only in terms of the requirements defined by the task creator without considering other worker-required factors. To overcome abovementioned limitations, this paper proposes a novel worker selection framework for crowd sensing. Compared to existing work, it mainly has following two characteristics. (1) Multi-scenario. Instead of defining specific factors, we propose a core ontology model to semantically express general factors, based on which task creators can build their own task-specific models efficiently. (2) Multi-view. We propose a two-phase process to select workers by considering factors both from the task creator and worker. We evaluate the effectiveness of the worker selection process by using a questionnaire-generated dataset. Results show that our approach outperforms the baseline method.