Institution
University of Illinois at Chicago
Education•Chicago, Illinois, United States•
About: University of Illinois at Chicago is a education organization based out in Chicago, Illinois, United States. It is known for research contribution in the topics: Population & Poison control. The organization has 57071 authors who have published 110536 publications receiving 4264936 citations.
Topics: Population, Poison control, Health care, Cancer, Medicine
Papers published on a yearly basis
Papers
More filters
••
10 May 2005TL;DR: Experimental results using a large number of Web pages from diverse domains show that the proposed two-step technique is able to segment data records, align and extract data from them very accurately.
Abstract: This paper studies the problem of extracting data from a Web page that contains several structured data records. The objective is to segment these data records, extract data items/fields from them and put the data in a database table. This problem has been studied by several researchers. However, existing methods still have some serious limitations. The first class of methods is based on machine learning, which requires human labeling of many examples from each Web site that one is interested in extracting data from. The process is time consuming due to the large number of sites and pages on the Web. The second class of algorithms is based on automatic pattern discovery. These methods are either inaccurate or make many assumptions. This paper proposes a new method to perform the task automatically. It consists of two steps, (1) identifying individual data records in a page, and (2) aligning and extracting data items from the identified data records. For step 1, we propose a method based on visual information to segment data records, which is more accurate than existing methods. For step 2, we propose a novel partial alignment technique based on tree matching. Partial alignment means that we align only those data fields in a pair of data records that can be aligned (or matched) with certainty, and make no commitment on the rest of the data fields. This approach enables very accurate alignment of multiple data records. Experimental results using a large number of Web pages from diverse domains show that the proposed two-step technique is able to segment data records, align and extract data from them very accurately.
572 citations
••
TL;DR: Through this comparative examination, four themes emerged, highlighting the private/public balance present in each social networking site, styles of self-presentation in spaces privately public and publicly private, cultivation of taste performances as a mode of sociocultural identification and organization and the formation of tight or loose social settings.
Abstract: This study provided a comparative analysis of three social network sites, the open-to-all Facebook, the professionally oriented LinkedIn and the exclusive, members-only ASmallWorld.The analysis focused on the underlying structure or architecture of these sites, on the premise that it may set the tone for particular types of interaction.Through this comparative examination, four themes emerged, highlighting the private/public balance present in each social networking site, styles of self-presentation in spaces privately public and publicly private, cultivation of taste performances as a mode of sociocultural identification and organization and the formation of tight or loose social settings. Facebook emerged as the architectural equivalent of a glasshouse, with a publicly open structure, looser behavioral norms and an abundance of tools that members use to leave cues for each other. LinkedIn and ASmallWorld produced tighter spaces, which were consistent with the taste ethos of each network and offered less...
572 citations
••
572 citations
••
TL;DR: A survey of heterogeneous information network analysis can be found in this article, where the authors introduce basic concepts of HIN analysis, examine its developments on different data mining tasks, discuss some advanced topics, and point out some future research directions.
Abstract: Most real systems consist of a large number of interacting, multi-typed components, while most contemporary researches model them as homogeneous information networks, without distinguishing different types of objects and links in the networks. Recently, more and more researchers begin to consider these interconnected, multi-typed data as heterogeneous information networks, and develop structural analysis approaches by leveraging the rich semantic meaning of structural types of objects and links in the networks. Compared to widely studied homogeneous information network, the heterogeneous information network contains richer structure and semantic information, which provides plenty of opportunities as well as a lot of challenges for data mining. In this paper, we provide a survey of heterogeneous information network analysis. We will introduce basic concepts of heterogeneous information network analysis, examine its developments on different data mining tasks, discuss some advanced topics, and point out some future research directions.
571 citations
••
TL;DR: It is demonstrated that Sir2's effect is not restricted to controlling mating type expression, but rather that SIR2 functions in a more general way in the genome.
570 citations
Authors
Showing all 57433 results
Name | H-index | Papers | Citations |
---|---|---|---|
Meir J. Stampfer | 277 | 1414 | 283776 |
Frank B. Hu | 250 | 1675 | 253464 |
Lewis C. Cantley | 196 | 748 | 169037 |
Ronald Klein | 194 | 1305 | 149140 |
Anil K. Jain | 183 | 1016 | 192151 |
Yusuke Nakamura | 179 | 2076 | 160313 |
Bruce M. Spiegelman | 179 | 434 | 158009 |
Jie Zhang | 178 | 4857 | 221720 |
D. M. Strom | 176 | 3167 | 194314 |
Yury Gogotsi | 171 | 956 | 144520 |
Todd R. Golub | 164 | 422 | 201457 |
Rodney S. Ruoff | 164 | 666 | 194902 |
Philip A. Wolf | 163 | 459 | 114951 |
Barbara E.K. Klein | 160 | 856 | 93319 |
David Jonathan Hofman | 159 | 1407 | 140442 |