scispace - formally typeset
Search or ask a question
Author

Wanli Min

Other affiliations: University of Chicago, IBM
Bio: Wanli Min is an academic researcher from Alibaba Group. The author has contributed to research in topics: Intersection & Operational risk. The author has an hindex of 15, co-authored 45 publications receiving 1234 citations. Previous affiliations of Wanli Min include University of Chicago & IBM.

Papers
More filters
Journal ArticleDOI
Wanli Min1, Laura Wynter1
TL;DR: The method presented provides predictions of speed and volume over 5-min intervals for up to 1 h in advance for real-time road traffic prediction to be both fast and scalable to full urban networks.
Abstract: Real-time road traffic prediction is a fundamental capability needed to make use of advanced, smart transportation technologies. Both from the point of view of network operators as well as from the point of view of travelers wishing real-time route guidance, accurate short-term traffic prediction is a necessary first step. While techniques for short-term traffic prediction have existed for some time, emerging smart transportation technologies require the traffic prediction capability to be both fast and scalable to full urban networks. We present a method that has proven to be able to meet this challenge. The method presented provides predictions of speed and volume over 5-min intervals for up to 1 h in advance.

594 citations

Journal ArticleDOI
TL;DR: In this article, the authors consider asymptotic behavior of partial sums and sample covariances for linear processes whose innovations are dependent, and establish central limit theorems and invariance principles under fairly mild conditions.

109 citations

Journal ArticleDOI
TL;DR: Locality pursuit embedding as discussed by the authors is a linear algorithm that arises by solving a variational problem and produces a linear embedding that respects the local geometrical structure described by the Euclidean distances.

73 citations

Patent
06 Aug 2010
TL;DR: In this paper, a method for modeling thermal distributions in a data center is provided, where the vertical temperature distribution data for each of the locations is plotted as an s curve, and each of these s curves is represented with a set of parameters that characterize the shape of the s curve.
Abstract: Techniques for data center analysis are provided. In one aspect, a method for modeling thermal distributions in a data center is provided. The method includes the following steps. Vertical temperature distribution data is obtained for a plurality of locations throughout the data center. The vertical temperature distribution data for each of the locations is plotted as an s curve, wherein the vertical temperature distribution data reflects physical conditions at each of the locations which is reflected in a shape of the s curve. Each of the s curves is represented with a set of parameters that characterize the shape of the s curve, wherein the s curve representations make up a knowledge base model of predefined s curve types from which thermal distributions and associated physical conditions at the plurality of locations throughout the data center can be analyzed.

59 citations

Journal ArticleDOI
TL;DR: A mobile measurement technology for optimizing the space and energy efficiency of DCs is presented and the combination of these two data types, in conjunction with innovative modeling techniques, provides the basis for extending the MMT concept toward an interactive energy management solution.
Abstract: The combination of rapidly increasing energy use of data centers (DCs), which is triggered by dramatic increases in IT (information technology) demands, and increases in energy costs and limited energy supplies has made the energy efficiency of DCs a central concern from both a cost and a sustainability perspective This paper describes three important technology components that address the energy consumption in DCs First, we present a mobile measurement technology (MMT) for optimizing the space and energy efficiency of DCs The technology encompasses the interworking of an advanced metrology technique for rapid data collection at high spatial resolution and measurement-driven modeling techniques, enabling optimal adjustments of a DC environment within a target thermal envelope Specific example data demonstrating the effectiveness of MMT is shown Second, the static MMT measurements obtained at high spatial resolution are complemented by and integrated with a real-time sensor network The requirements and suitable architectures for wired and wireless sensor solutions are discussed Third, an energy and thermal model analysis for a DC is presented that exploits both the high-spatial-resolution (but static) MMT data and the high-timeresolved (but sparse) sensor data The combination of these two data types (static and dynamic), in conjunction with innovative modeling techniques, provides the basis for extending the MMT concept toward an interactive energy management solution

54 citations


Cited by
More filters
Journal ArticleDOI
Zheng Zhao1, Weihai Chen1, Xingming Wu1, Peter C. Y. Chen, Jingmeng Liu1 
TL;DR: A novel traffic forecast model based on long short-term memory (LSTM) network is proposed, which considers temporal-spatial correlation in traffic system via a two-dimensional network which is composed of many memory units.
Abstract: Short-term traffic forecast is one of the essential issues in intelligent transportation system. Accurate forecast result enables commuters make appropriate travel modes, travel routes, and departure time, which is meaningful in traffic management. To promote the forecast accuracy, a feasible way is to develop a more effective approach for traffic data analysis. The availability of abundant traffic data and computation power emerge in recent years, which motivates us to improve the accuracy of short-term traffic forecast via deep learning approaches. A novel traffic forecast model based on long short-term memory (LSTM) network is proposed. Different from conventional forecast models, the proposed LSTM network considers temporal-spatial correlation in traffic system via a two-dimensional network which is composed of many memory units. A comparison with other representative forecast models validates that the proposed LSTM network can achieve a better performance.

1,204 citations

Journal ArticleDOI
TL;DR: In this article, the authors present a review of the existing literature on short-term traffic forecasting and offer suggestions for future work, focusing on 10 challenging, yet relatively under researched, directions.
Abstract: Since the early 1980s, short-term traffic forecasting has been an integral part of most Intelligent Transportation Systems (ITS) research and applications; most effort has gone into developing methodologies that can be used to model traffic characteristics and produce anticipated traffic conditions. Existing literature is voluminous, and has largely used single point data from motorways and has employed univariate mathematical models to predict traffic volumes or travel times. Recent developments in technology and the widespread use of powerful computers and mathematical models allow researchers an unprecedented opportunity to expand horizons and direct work in 10 challenging, yet relatively under researched, directions. It is these existing challenges that we review in this paper and offer suggestions for future work.

927 citations

Journal ArticleDOI
TL;DR: An in-depth study of the existing literature on data center power modeling, covering more than 200 models, organized in a hierarchical structure with two main branches focusing on hardware-centric and software-centric power models.
Abstract: Data centers are critical, energy-hungry infrastructures that run large-scale Internet-based services. Energy consumption models are pivotal in designing and optimizing energy-efficient operations to curb excessive energy consumption in data centers. In this paper, we survey the state-of-the-art techniques used for energy consumption modeling and prediction for data centers and their components. We conduct an in-depth study of the existing literature on data center power modeling, covering more than 200 models. We organize these models in a hierarchical structure with two main branches focusing on hardware-centric and software-centric power models. Under hardware-centric approaches we start from the digital circuit level and move on to describe higher-level energy consumption models at the hardware component level, server level, data center level, and finally systems of systems level. Under the software-centric approaches we investigate power models developed for operating systems, virtual machines and software applications. This systematic approach allows us to identify multiple issues prevalent in power modeling of different levels of data center systems, including: i) few modeling efforts targeted at power consumption of the entire data center ii) many state-of-the-art power models are based on a few CPU or server metrics, and iii) the effectiveness and accuracy of these power models remain open questions. Based on these observations, we conclude the survey by describing key challenges for future research on constructing effective and accurate data center power models.

741 citations

Journal ArticleDOI
TL;DR: A novel methodology for predicting the spatial distribution of taxi-passengers for a short-term time horizon using streaming data and demonstrates that the proposed framework can provide effective insight into the spatiotemporal distribution of Taxi-passenger demand for a 30-min horizon.
Abstract: Informed driving is increasingly becoming a key feature for increasing the sustainability of taxi companies. The sensors that are installed in each vehicle are providing new opportunities for automatically discovering knowledge, which, in return, delivers information for real-time decision making. Intelligent transportation systems for taxi dispatching and for finding time-saving routes are already exploring these sensing data. This paper introduces a novel methodology for predicting the spatial distribution of taxi-passengers for a short-term time horizon using streaming data. First, the information was aggregated into a histogram time series. Then, three time-series forecasting techniques were combined to originate a prediction. Experimental tests were conducted using the online data that are transmitted by 441 vehicles of a fleet running in the city of Porto, Portugal. The results demonstrated that the proposed framework can provide effective insight into the spatiotemporal distribution of taxi-passenger demand for a 30-min horizon.

602 citations

Journal ArticleDOI
TL;DR: The gradient boosting tree method strategically combines additional trees by correcting mistakes made by its previous base models, therefore, potentially improves prediction accuracy and model interpretability in freeway travel time prediction.
Abstract: Tree based ensemble methods have reached a celebrity status in prediction field. By combining simple regression trees with ‘poor’ performance, they usually produce high prediction accuracy. In contrast to other machine learning methods that have been treated as black-boxes, tree based ensemble methods provide interpretable results, while requiring little data preprocessing, are able to handle different types of predictor variables, and can fit complex nonlinear relationship. These properties make the tree based ensemble methods good candidates for solving travel time prediction problems. However, applications of tree-based ensemble algorithms in traffic prediction area are limited. In this paper, we employ a gradient boosting regression tree method (GBM) to analyze and model freeway travel time to improve the prediction accuracy and model interpretability. The gradient boosting tree method strategically combines additional trees by correcting mistakes made by its previous base models, therefore, potentially improves prediction accuracy. Different parameters’ effect on model performance and correlations of input–output variables are discussed in details by using travel time data provided by INRIX along two freeway sections in Maryland. The proposed method is, then, compared with another popular ensemble method and a bench mark model. Study results indicate that the GBM model has its considerable advantages in freeway travel time prediction.

506 citations