scispace - formally typeset
Search or ask a question
Institution

University of Illinois at Chicago

EducationChicago, Illinois, United States
About: University of Illinois at Chicago is a education organization based out in Chicago, Illinois, United States. It is known for research contribution in the topics: Population & Poison control. The organization has 57071 authors who have published 110536 publications receiving 4264936 citations.


Papers
More filters
Proceedings ArticleDOI
10 May 2005
TL;DR: Experimental results using a large number of Web pages from diverse domains show that the proposed two-step technique is able to segment data records, align and extract data from them very accurately.
Abstract: This paper studies the problem of extracting data from a Web page that contains several structured data records. The objective is to segment these data records, extract data items/fields from them and put the data in a database table. This problem has been studied by several researchers. However, existing methods still have some serious limitations. The first class of methods is based on machine learning, which requires human labeling of many examples from each Web site that one is interested in extracting data from. The process is time consuming due to the large number of sites and pages on the Web. The second class of algorithms is based on automatic pattern discovery. These methods are either inaccurate or make many assumptions. This paper proposes a new method to perform the task automatically. It consists of two steps, (1) identifying individual data records in a page, and (2) aligning and extracting data items from the identified data records. For step 1, we propose a method based on visual information to segment data records, which is more accurate than existing methods. For step 2, we propose a novel partial alignment technique based on tree matching. Partial alignment means that we align only those data fields in a pair of data records that can be aligned (or matched) with certainty, and make no commitment on the rest of the data fields. This approach enables very accurate alignment of multiple data records. Experimental results using a large number of Web pages from diverse domains show that the proposed two-step technique is able to segment data records, align and extract data from them very accurately.

572 citations

Journal ArticleDOI
TL;DR: Through this comparative examination, four themes emerged, highlighting the private/public balance present in each social networking site, styles of self-presentation in spaces privately public and publicly private, cultivation of taste performances as a mode of sociocultural identification and organization and the formation of tight or loose social settings.
Abstract: This study provided a comparative analysis of three social network sites, the open-to-all Facebook, the professionally oriented LinkedIn and the exclusive, members-only ASmallWorld.The analysis focused on the underlying structure or architecture of these sites, on the premise that it may set the tone for particular types of interaction.Through this comparative examination, four themes emerged, highlighting the private/public balance present in each social networking site, styles of self-presentation in spaces privately public and publicly private, cultivation of taste performances as a mode of sociocultural identification and organization and the formation of tight or loose social settings. Facebook emerged as the architectural equivalent of a glasshouse, with a publicly open structure, looser behavioral norms and an abundance of tools that members use to leave cues for each other. LinkedIn and ASmallWorld produced tighter spaces, which were consistent with the taste ethos of each network and offered less...

572 citations

Journal ArticleDOI
TL;DR: A survey of heterogeneous information network analysis can be found in this article, where the authors introduce basic concepts of HIN analysis, examine its developments on different data mining tasks, discuss some advanced topics, and point out some future research directions.
Abstract: Most real systems consist of a large number of interacting, multi-typed components, while most contemporary researches model them as homogeneous information networks, without distinguishing different types of objects and links in the networks. Recently, more and more researchers begin to consider these interconnected, multi-typed data as heterogeneous information networks, and develop structural analysis approaches by leveraging the rich semantic meaning of structural types of objects and links in the networks. Compared to widely studied homogeneous information network, the heterogeneous information network contains richer structure and semantic information, which provides plenty of opportunities as well as a lot of challenges for data mining. In this paper, we provide a survey of heterogeneous information network analysis. We will introduce basic concepts of heterogeneous information network analysis, examine its developments on different data mining tasks, discuss some advanced topics, and point out some future research directions.

571 citations

Journal ArticleDOI
10 Mar 1989-Cell
TL;DR: It is demonstrated that Sir2's effect is not restricted to controlling mating type expression, but rather that SIR2 functions in a more general way in the genome.

570 citations


Authors

Showing all 57433 results

NameH-indexPapersCitations
Meir J. Stampfer2771414283776
Frank B. Hu2501675253464
Lewis C. Cantley196748169037
Ronald Klein1941305149140
Anil K. Jain1831016192151
Yusuke Nakamura1792076160313
Bruce M. Spiegelman179434158009
Jie Zhang1784857221720
D. M. Strom1763167194314
Yury Gogotsi171956144520
Todd R. Golub164422201457
Rodney S. Ruoff164666194902
Philip A. Wolf163459114951
Barbara E.K. Klein16085693319
David Jonathan Hofman1591407140442
Network Information
Related Institutions (5)
University of Minnesota
257.9K papers, 11.9M citations

97% related

University of Pennsylvania
257.6K papers, 14.1M citations

97% related

University of Michigan
342.3K papers, 17.6M citations

96% related

Columbia University
224K papers, 12.8M citations

96% related

Johns Hopkins University
249.2K papers, 14M citations

96% related

Performance
Metrics
No. of papers from the Institution in previous years
YearPapers
2023112
2022582
20215,602
20205,335
20194,825
20184,520