A Focused Event Crawler with Temporal Intent [PDF]
Temporal intent is an important component of events. It plays an important role in collecting them from the web with focused crawlers. However, traditionally focused crawlers usually only consider factors such as topic keywords, web page content, and ...
Hao Wu, Dongyang Hou
doaj +4 more sources
A focused crawler based on semantic disambiguation vector space model [PDF]
The focused crawler grabs continuously web pages related to the given topic according to priorities of unvisited hyperlinks. In many previous studies, the focused crawlers predict priorities of unvisited hyperlinks based on the text similarity models ...
Wenjun Liu +8 more
doaj +3 more sources
An Enhanced Semantic Focused Web Crawler Based on Hybrid String Matching Algorithm [PDF]
Topic precise crawler is a special purpose web crawler, which downloads appropriate web pages analogous to a particular topic by measuring cosine similarity or semantic similarity score.
Sakunthala Prabha K. S. +2 more
doaj +3 more sources
A rule-based obfuscating focused crawler in the audio retrieval domain [PDF]
The detection of violations of intellectual properties on multimedia files is a critical problem for the current infrastructure of the Internet, especially within very large document collections.
Marco Montanaro +3 more
openalex +2 more sources
Traditional semantic-based focused crawlers calculate the topical priority of hyperlink by linearly integrating topical similarity evaluation metrics and empirical weights.
Jingfa Liu +3 more
doaj +2 more sources
An Enhanced Focused Web Crawler for Biomedical Topics Using Attention Enhanced Siamese Long Short Term Memory Networks [PDF]
The Internet is chosen to be one among the primary source of biomedical information. To retrieve necessary biomedical information, the search engine needs an efficient, focused crawler mechanism.
Joe Dhanith Pal Nesamony Rose Mary +2 more
doaj +2 more sources
Research on the Focused Crawler of Mineral Intelligence Service Based on Semantic Similarity
Large-scale general search engines have been unable to meet the needs of “specialized, sophisticated and deep” information in the field of mineral intelligence services. Vertical search engines have emerged at the historic moment, and the focused crawler
Shiqi Deng
openalex +2 more sources
Focused Crawler Based on Reinforcement Learning and Decaying Epsilon-Greedy Exploration Policy
In order to serve a diversified user base with a range of purposes, general search engines offer search results for a wide variety of topics and material categories on the internet.
Parisa Begum Kaleel, Shina Sheen
openalex +2 more sources
ThreatCrawl: A BERT-based Focused Crawler for the Cybersecurity Domain [PDF]
Publicly available information contains valuable information for Cyber Threat Intelligence (CTI). This can be used to prevent attacks that have already taken place on other systems.
Philipp Kuehn +3 more
openalex +3 more sources
A Semantic Focused Web Crawler Based on a Knowledge Representation Schema [PDF]
The Web has become the main source of information in the digital world, expanding to heterogeneous domains and continuously growing. By means of a search engine, users can systematically search over the web for particular information based on a text ...
Julio Hernandez +2 more
doaj +2 more sources

