Results 11 to 20 of about 1,734,941 (332)
Information Retrieval deals with searching and retrieving information within the documents and it also searches the online databases and internet. Web crawler is defined as a program or software which traverses the Web and downloads web documents in a methodical, automated manner.
Sanjeev Kumar Singh+2 more
openaire +1 more source
Summary of web crawler technology research
With the continuous development of network information technology, there is a large amount of unstructured data called big data on the network. Human resources to collect information laborious, so web crawler technology came into being.
Linxuan Yu+5 more
semanticscholar +1 more source
Aplikasi Web Crawler Untuk Web Content Pada Mobile Phone [PDF]
Crawling is the process behind a search engine, which served through the World Wide Web in a structured and with certain ethics. Applications that run the crawling process is called Web Crawler, also called web spider or web robot.
Basori, A. H. (Ahmad)+2 more
core +3 more sources
An architecture for a focused trend parallel web crawler with the application of clickstream analysis [PDF]
The tremendous growth of the Web poses many challenges for all-purpose single-process crawlers including the presence of some irrelevant answers among search results and the coverage and scaling issues regarding the enormous dimension of the World Wide ...
Ahmadi-Abkenari, Fatemeh, Selamat, Ali
core +1 more source
The web is a repository of large amount of data. Information available in the web is organised in the form of pages. Due to the presence of unlimited amount of information, searching and finding out appropriate information from the web is a task which needs expertise.
S. R. Sreeja, Sangita Chaudhari
openaire +2 more sources
Design of a Parallel and Scalable Crawler for the Hidden Web
The WWW contains huge amount of information from different areas. This information may be present virtually in the form of web pages, media, articles (research journals / magazine), blogs etc.
Sonali Gupta, K. Bhatia
semanticscholar +1 more source
SIMHAR - Smart Distributed Web Crawler for the Hidden Web Using SIM+Hash and Redis Server
Developing a distributed web crawler obliges major engineering challenges, all of which are eventually associated to scale. To retain corpus of search engine and a reasonable state of freshness, the crawler must be distributed over multiple computers. In
Sawroop Kaur, Ieee G. GEETHA Member
semanticscholar +1 more source
Web Crawler: Design And Implementation For Extracting Article-Like Contents
The World Wide Web is a large, wealthy, and accessible information system whose users are increasing rapidly nowadays. To retrieve information from the web as per users’ requests, search engines are built to access web pages.
Ngo Le Huy Hien+2 more
semanticscholar +1 more source
The data extraction using distributed crawler inside multi-agent system [PDF]
The paper discusses the use of web crawler technology. We created an application based on standard web crawler. Our application is determined for data extraction.
Dubec, Patrik+4 more
core +2 more sources
- In world wide web, Web Crawler is working as a software agent which helps in web indexing. Web crawler at times called as spider or internet bot explicitly operated by many search engines. In Information Retrieval to colleting an information Web crawler play an important role. For effective searching and web indexing is mostly depends on web crawlers.
openaire +2 more sources