Results 21 to 30 of about 76,818 (217)
Optimized Focused Web Crawler with Natural Language Processing Based Relevance Measure in Bioinformatics Web Sources [PDF]
In the fast growing of digital technologies, crawlers and search engines face unpredictable challenges. Focused web-crawlers are essential for mining the boundless data available on the internet.
S. R. Mani Sekhar+3 more
openalex +2 more sources
PDD Crawler : A Focused Web Crawler Using Link and Content Analysis for Relevence Prediction
9 pages, SEAS-2014, Dubai, UAE, International Conference 7-8 Nov ...
Prashant Dahiwale+2 more
openalex +4 more sources
IHWC: intelligent hidden web crawler for harvesting data in urban domains
Due to the massive size of the hidden web, searching, retrieving and mining rich and high-quality data can be a daunting task. Moreover, with the presence of forms, data cannot be accessed easily.
Sawroop Kaur+3 more
doaj +1 more source
ThreatCrawl: A BERT-based Focused Crawler for the Cybersecurity Domain [PDF]
Publicly available information contains valuable information for Cyber Threat Intelligence (CTI). This can be used to prevent attacks that have already taken place on other systems. Ideally, only the initial attack succeeds and all subsequent ones are detected and stopped.
Philipp Kuehn+3 more
openalex +3 more sources
Improving Data Collection on Article Clustering by Using Distributed Focused Crawler
Dani Gunawan+2 more
openalex +3 more sources
A Chinese Topic Crawler Focused on Customer Development
AbstractThis paper presents a Chinese topic crawler focused on customer development, in order to meet the needs of users for more accurate and particular Internet information. The concept of meta-search engine is introduced, and the keywords are expanded by the ontology of HowNet.
Tong Wu+6 more
openalex +3 more sources
Bulk materials are necessary for hydropower construction. The bulk materials transportation (BMT) scheme is a guiding document for material supply, and its selection has a significant influence on hydropower construction.
Hao Wang+4 more
doaj +1 more source
Design and Implementation of A Focused Crawler—TargetCrawler [PDF]
Adopting focused crawler to search web sites is the trend of next generation search engines. Design and implementation of a focused crawler - TargetCrawler is introduced in detail, including its overall architecture, main modules, working processes and two key algorithms, duplicate removing algorithm based on the Bloom filter and ranking algorithm ...
Feng Jian, Chen Jing-zhou, Cao Lei
openaire +1 more source
Web Crawler for Indexing Video e-Learning Resources: A YouTube Case Study [PDF]
The main objective of the current paper is to develop and validate an algorithm focused on au-tomatically indexing YouTube e-learning resources about a certain domain of interest.
Bogdan IANCU
doaj +1 more source
Traditional semantic-based focused crawlers calculate the topical priority of hyperlink by linearly integrating topical similarity evaluation metrics and empirical weights.
Jingfa Liu+3 more
doaj +1 more source