scispace - formally typeset
Search or ask a question
Institution

YMCA University of Science and Technology

EducationFaridabad, India
About: YMCA University of Science and Technology is a education organization based out in Faridabad, India. It is known for research contribution in the topics: Web crawler & Web page. The organization has 299 authors who have published 568 publications receiving 4547 citations.


Papers
More filters
Proceedings ArticleDOI
24 Aug 2013
TL;DR: This paper provides a novel approach to guide the crawler in choosing the right query term to be submitted to any search form interface that has been designed to accept keywords or terms as input to it.
Abstract: The Hidden Web refers to a huge portion of the WWW that holds numerous freely accessible Web databases, hidden behind search form interfaces which can only be accessed through dynamic web pages that are generated in response to the user queries issued at the search form interface. Thus, the core challenge to implement any crawler for the Hidden Web is to routinely surpass these search form interfaces by automatically generating & issuing queries that help discover these dynamic Web pages. The paper provides a novel approach to guide the crawler in choosing the right query term to be submitted to any search form interface that has been designed to accept keywords or terms as input to it. The system is based on the use of classification hierarchies that might have either been manually or automatically constructed. And for the purposes of illustration, we have considered the search form interfaces in the 'Medical' domain, it being one of the most popular domains used by the researchers and the use of a manually generated top-down classification hierarchy in the same domain.

5 citations

Journal ArticleDOI
01 Jan 2017
TL;DR: The architecture of migrating crawler is proposed which is based on URL ordering, URL scheduling and document redundancy elimination mechanism, which aims to work more efficiently than traditional single crawler by providing ordering and scheduling of URLs.
Abstract: In order to manage the vast information available on web, crawler plays a significant role. The working of crawler should be optimized to get maximum and unique information from the World Wide Web. In this paper, architecture of migrating crawler is proposed which is based on URL ordering, URL scheduling and document redundancy elimination mechanism. The proposed ordering technique is based on URL structure, which plays a crucial role in utilizing the web efficiently. Scheduling ensures that URLs should go to optimum agent for downloading. To ensure this, characteristics of both agents and URLs are taken into consideration for scheduling. Duplicate documents are also removed to make the database unique. To reduce matching time, document matching is made on the basis of their Meta information only. The agents of proposed migrating crawler work more efficiently than traditional single crawler by providing ordering and scheduling of URLs. KeywoRDS Duplicate, Hashing, Migrating Crawler, Ordering, Scheduling URL

5 citations

Journal ArticleDOI
TL;DR: A single numerical index has been developed graph theoretic approach for assessment and comparison of vendor in manufacturing organisation through variable permanent function.
Abstract: Graph theoretic approach has been adopted for evaluation and selection of vendor. The authors identified five factors affecting the vendor’s quality on the basis of 57 research papers of vendor selection and also mentioned the factors chosen by different researchers. On the basis of these factors, a model has been developed for vendor selection. The authors also considered 37 research papers on graph theoretic methodology and mentioned the factors used by different researchers. During the application of graph theoretic approach, a digraph of characteristics which contributes to quality of vendor has been developed further the interdependency of attributes as well as their inheritances has been identified and its representation in matrix form has been used for calculation of numerical index of the vendor’s quality through variable permanent function. A single numerical index has been developed graph theoretic approach for assessment and comparison of vendor in manufacturing organisation.

5 citations

Book ChapterDOI
01 Jan 2014
TL;DR: Insight is brought into the various steps, a crawler must perform to access the contents in the Hidden Web to structure the problem area and analyze what aspects have already been covered by previous research and what needs to be done.
Abstract: The Hidden Web is a part of the Web that consists mainly of the information inside databases, i.e., anything behind an interactive electronic form (search interfaces), which cannot be accessed by the conventional Web crawlers [1, 2, 8]. However, there have been well-defined, effective, and efficient methods for accessing Deep Web contents. One of these methods for accessing the Hidden Web employs an approach similar to ‘traditional’ crawling but aims at extracting the data behind the search interfaces or forms residing in databases. The paper brings insight into the various steps, a crawler must perform to access the contents in the Hidden Web. We structure the problem area and analyze what aspects have already been covered by previous research and what needs to be done.

5 citations

Book ChapterDOI
01 Jan 2018
TL;DR: The paper proposes a mobile agent-based solution for solving energy sink-hole problem and aims to extend the network life by reducing redundant data being passed to the nodes near to the sink thereby reducing the load and saving battery life.
Abstract: Repeated and continuous transmission of data to the sink leads to energy loss in all the nodes in case of flat WSN. Especially, depletion of energy is highly acute in case of nodes that are near to the sink. Conventionally known as energy sink-hole problem, it causes early failure of the network even when there is a substantial amount of residual energy left in it. Though the research fraternity has been continuously addressing this problem and even has provided various solutions to deal with it, the use of mobile agents to meet the above-stated problem is still in its infancy. The paper proposes a mobile agent-based solution for solving energy sink-hole problem. The proposed solution aims to extend the network life by reducing redundant data being passed to the nodes near to the sink thereby reducing the load and saving battery life. The algorithm is implemented using aglets and the analytical results show significant improvement in the network lifetime.

5 citations


Authors

Showing all 322 results

NameH-indexPapersCitations
Bharat Bhushan116127662506
Vikas Kumar8985939185
Dinesh Kumar69133324342
M K Arti21491179
Tilak Raj20681541
Parmod Kumar1948895
O.P. Mishra18461242
Neeraj Sharma18961063
Sandeep Grover18821251
Gurpreet Singh171071158
Vinod Chhokar1555526
Rahul Sindhwani1441498
Vineet Jain1434495
Arvind Kumar14118934
Rajesh Attri1341665
Network Information
Related Institutions (5)
Amity University
12.7K papers, 86K citations

86% related

Motilal Nehru National Institute of Technology Allahabad
5K papers, 61.8K citations

84% related

Thapar University
8.5K papers, 130.3K citations

83% related

National Institute of Technology, Durgapur
5.7K papers, 63.4K citations

83% related

National Institute of Technology, Rourkela
10.7K papers, 150.1K citations

82% related

Performance
Metrics
No. of papers from the Institution in previous years
YearPapers
202319
202220
20215
202021
201947
2018104