scispace - formally typeset
J

Junghoo Cho

Researcher at University of California, Los Angeles

Publications -  81
Citations -  8584

Junghoo Cho is an academic researcher from University of California, Los Angeles. The author has contributed to research in topics: Web crawler & Web page. The author has an hindex of 35, co-authored 81 publications receiving 8345 citations. Previous affiliations of Junghoo Cho include Stanford University & New York University.

Papers
More filters
Journal ArticleDOI

Efficient crawling through URL ordering

TL;DR: In this paper, the authors study in what order a crawler should visit the URLs it has seen, in order to obtain more "important" pages first, and they show that a good ordering scheme can obtain important pages significantly faster than one without.
Journal ArticleDOI

Searching the Web

TL;DR: An overview of current Web search engine design is offered, introducing a generic search engine architecture and the results of several performance analyses conducted to compare different designs.
Proceedings Article

The Evolution of the Web and Implications for an Incremental Crawler

TL;DR: An architecture for the incremental crawler is proposed, which combines the best design choices, which can improve the ``freshness'' of the collection significantly and bring in new pages in a more timely manner.
Proceedings ArticleDOI

What's new on the web?: the evolution of the web from a search engine perspective

TL;DR: The authors' findings indicate a rapid turnover rate of Web pages, i.e., high rates of birth and death, coupled with an even higher rate ofturnover in the hyperlinks that connect them, which is likely to remain consistent over time.
Proceedings ArticleDOI

Automatic identification of user goals in Web search

TL;DR: This paper presents the results from a human subject study that strongly indicate the feasibility of automatic query-goal identification, and proposes two types of features for the goal-identification task: user-click behavior and anchor-link distribution.