J
Junghoo Cho
Researcher at University of California, Los Angeles
Publications - 81
Citations - 8584
Junghoo Cho is an academic researcher from University of California, Los Angeles. The author has contributed to research in topics: Web crawler & Web page. The author has an hindex of 35, co-authored 81 publications receiving 8345 citations. Previous affiliations of Junghoo Cho include Stanford University & New York University.
Papers
More filters
Journal ArticleDOI
Efficient crawling through URL ordering
TL;DR: In this paper, the authors study in what order a crawler should visit the URLs it has seen, in order to obtain more "important" pages first, and they show that a good ordering scheme can obtain important pages significantly faster than one without.
Journal ArticleDOI
Searching the Web
TL;DR: An overview of current Web search engine design is offered, introducing a generic search engine architecture and the results of several performance analyses conducted to compare different designs.
Proceedings Article
The Evolution of the Web and Implications for an Incremental Crawler
Junghoo Cho,Hector Garcia-Molina +1 more
TL;DR: An architecture for the incremental crawler is proposed, which combines the best design choices, which can improve the ``freshness'' of the collection significantly and bring in new pages in a more timely manner.
Proceedings ArticleDOI
What's new on the web?: the evolution of the web from a search engine perspective
TL;DR: The authors' findings indicate a rapid turnover rate of Web pages, i.e., high rates of birth and death, coupled with an even higher rate ofturnover in the hyperlinks that connect them, which is likely to remain consistent over time.
Proceedings ArticleDOI
Automatic identification of user goals in Web search
TL;DR: This paper presents the results from a human subject study that strongly indicate the feasibility of automatic query-goal identification, and proposes two types of features for the goal-identification task: user-click behavior and anchor-link distribution.