Results 21 to 30 of about 76,818 (217)

Optimized Focused Web Crawler with Natural Language Processing Based Relevance Measure in Bioinformatics Web Sources [PDF]

open access: diamondCybernetics and Information Technologies, 2019
In the fast growing of digital technologies, crawlers and search engines face unpredictable challenges. Focused web-crawlers are essential for mining the boundless data available on the internet.
S. R. Mani Sekhar   +3 more
openalex   +2 more sources

PDD Crawler : A Focused Web Crawler Using Link and Content Analysis for Relevence Prediction

open access: bronzeComputer Science & Information Technology ( CS & IT ), 2014
9 pages, SEAS-2014, Dubai, UAE, International Conference 7-8 Nov ...
Prashant Dahiwale   +2 more
openalex   +4 more sources

IHWC: intelligent hidden web crawler for harvesting data in urban domains

open access: yesComplex & Intelligent Systems, 2021
Due to the massive size of the hidden web, searching, retrieving and mining rich and high-quality data can be a daunting task. Moreover, with the presence of forms, data cannot be accessed easily.
Sawroop Kaur   +3 more
doaj   +1 more source

ThreatCrawl: A BERT-based Focused Crawler for the Cybersecurity Domain [PDF]

open access: greenarXiv, 2023
Publicly available information contains valuable information for Cyber Threat Intelligence (CTI). This can be used to prevent attacks that have already taken place on other systems. Ideally, only the initial attack succeeds and all subsequent ones are detected and stopped.
Philipp Kuehn   +3 more
openalex   +3 more sources

A Chinese Topic Crawler Focused on Customer Development

open access: goldProcedia CIRP, 2016
AbstractThis paper presents a Chinese topic crawler focused on customer development, in order to meet the needs of users for more accurate and particular Internet information. The concept of meta-search engine is introduced, and the keywords are expanded by the ontology of HowNet.
Tong Wu   +6 more
openalex   +3 more sources

A Two-Stage Decision-Making Method Based on WebGIS for Bulk Material Transportation of Hydropower Construction

open access: yesEnergies, 2022
Bulk materials are necessary for hydropower construction. The bulk materials transportation (BMT) scheme is a guiding document for material supply, and its selection has a significant influence on hydropower construction.
Hao Wang   +4 more
doaj   +1 more source

Design and Implementation of A Focused Crawler—TargetCrawler [PDF]

open access: yesInternational Journal of Grid and Distributed Computing, 2014
Adopting focused crawler to search web sites is the trend of next generation search engines. Design and implementation of a focused crawler - TargetCrawler is introduced in detail, including its overall architecture, main modules, working processes and two key algorithms, duplicate removing algorithm based on the Bloom filter and ranking algorithm ...
Feng Jian, Chen Jing-zhou, Cao Lei
openaire   +1 more source

Web Crawler for Indexing Video e-Learning Resources: A YouTube Case Study [PDF]

open access: yesInformatică economică, 2019
The main objective of the current paper is to develop and validate an algorithm focused on au-tomatically indexing YouTube e-learning resources about a certain domain of interest.
Bogdan IANCU
doaj   +1 more source

Applying particle swarm optimization-based dynamic adaptive hyperlink evaluation to focused crawler for meteorological disasters

open access: yesComplex & Intelligent Systems, 2023
Traditional semantic-based focused crawlers calculate the topical priority of hyperlink by linearly integrating topical similarity evaluation metrics and empirical weights.
Jingfa Liu   +3 more
doaj   +1 more source

Home - About - Disclaimer - Privacy