
A Focused Event Crawler with Temporal Intent [PDF]

open access: gold
Applied Sciences, 2023
Temporal intent is an important component of events. It plays an important role in collecting them from the web with focused crawlers. However, traditionally focused crawlers usually only consider factors such as topic keywords, web page content, and ...
Hao Wu, Dongyang Hou
doaj   +4 more sources

A focused crawler based on semantic disambiguation vector space model [PDF]

open access: gold
Complex & Intelligent Systems, 2022
The focused crawler grabs continuously web pages related to the given topic according to priorities of unvisited hyperlinks. In many previous studies, the focused crawlers predict priorities of unvisited hyperlinks based on the text similarity models ...
Wenjun Liu   +8 more
doaj   +3 more sources

An Enhanced Semantic Focused Web Crawler Based on Hybrid String Matching Algorithm [PDF]

open access: diamond
Cybernetics and Information Technologies, 2021
Topic precise crawler is a special purpose web crawler, which downloads appropriate web pages analogous to a particular topic by measuring cosine similarity or semantic similarity score.
Sakunthala Prabha K. S.   +2 more
doaj   +3 more sources

A rule-based obfuscating focused crawler in the audio retrieval domain [PDF]

open access: hybrid
Multim. Tools Appl., 2023
The detection of violations of intellectual properties on multimedia files is a critical problem for the current infrastructure of the Internet, especially within very large document collections.
Marco Montanaro   +3 more
openalex   +2 more sources

Applying particle swarm optimization-based dynamic adaptive hyperlink evaluation to focused crawler for meteorological disasters

open access: yes
Complex & Intelligent Systems, 2023
Traditional semantic-based focused crawlers calculate the topical priority of hyperlink by linearly integrating topical similarity evaluation metrics and empirical weights.
Jingfa Liu   +3 more
doaj   +2 more sources

An Enhanced Focused Web Crawler for Biomedical Topics Using Attention Enhanced Siamese Long Short Term Memory Networks [PDF]

open access: green
Brazilian Archives of Biology and Technology, 2022
The Internet is chosen to be one among the primary source of biomedical information. To retrieve necessary biomedical information, the search engine needs an efficient, focused crawler mechanism.
Joe Dhanith Pal Nesamony Rose Mary   +2 more
doaj   +2 more sources

Research on the Focused Crawler of Mineral Intelligence Service Based on Semantic Similarity

open access: gold
Journal of Physics: Conference Series, 2020
Large-scale general search engines have been unable to meet the needs of “specialized, sophisticated and deep” information in the field of mineral intelligence services. Vertical search engines have emerged at the historic moment, and the focused crawler
Shiqi Deng
openalex   +2 more sources

Focused Crawler Based on Reinforcement Learning and Decaying Epsilon-Greedy Exploration Policy

open access: gold
The International Arab Journal of Information Technology, 2023
In order to serve a diversified user base with a range of purposes, general search engines offer search results for a wide variety of topics and material categories on the internet.
Parisa Begum Kaleel, Shina Sheen
openalex   +2 more sources

ThreatCrawl: A BERT-based Focused Crawler for the Cybersecurity Domain [PDF]

open access: green
arXiv.org, 2023
Publicly available information contains valuable information for Cyber Threat Intelligence (CTI). This can be used to prevent attacks that have already taken place on other systems.
Philipp Kuehn   +3 more
openalex   +3 more sources

A Semantic Focused Web Crawler Based on a Knowledge Representation Schema [PDF]

open access: gold
Applied Sciences, 2020
The Web has become the main source of information in the digital world, expanding to heterogeneous domains and continuously growing. By means of a search engine, users can systematically search over the web for particular information based on a text ...
Julio Hernandez   +2 more
doaj   +2 more sources
