State of the Art in Semantic Focused Crawlers [PDF]
Nowadays, the research of focused crawler approaches the field of semantic web, along with the appearance of increasing semantic web documents and the rapid development of ontology mark-up languages. Semantic focused crawlers are a series of focused crawlers enhanced by various semantic web technologies. In this paper, we make a survey in this research
Dong, Hai +2 more
openaire +2 more sources
Discovering Land Cover Web Map Services from the Deep Web with JavaScript Invocation Rules
Automatic discovery of isolated land cover web map services (LCWMSs) can potentially help in sharing land cover data. Currently, various search engine-based and crawler-based approaches have been developed for finding services dispersed throughout the ...
Dongyang Hou, Jun Chen, Hao Wu
doaj +1 more source
Hybrid focused crawling on the Surface and the Dark Web
Focused crawlers enable the automatic discovery of Web resources about a given topic by automatically navigating through the Web link structure and selecting the hyperlinks to follow by estimating their relevance to the topic of interest.
Christos Iliou +4 more
doaj +1 more source
Development of Focused Crawlers for Building Large Punjabi News Corpus
Web crawlers are as old as the Internet and are most commonly used by search engines to visit websites and index them into repositories. They are not limited to search engines but are also widely utilized to build corpora in different domains and ...
Gurjot Singh Mahi, Amandeep Verma
doaj +1 more source
Improving the performance of focused web crawlers
This work addresses issues related to the design and implementation of focused crawlers. Several variants of state-of-the-art crawlers relying on web page content and link information for estimating the relevance of web pages to a given topic are proposed.
Πετρακης Ευριπιδης(http://users.isc.tuc.gr/~epetrakis) +3 more
openaire +3 more sources
Using Web Crawler Technology for Text Analysis of Geo-Events: A Case Study of the Huangyan Island Incident [PDF]
With the social networking and network socialisation have brought more text information and social relationships into our daily lives, the question of whether big data can be fully used to study the phenomenon and discipline of natural sciences has ...
H. Hu, Y. J. Ge
doaj +1 more source
Construction and Application of the Attention Analysis Model of Brand Management Policies of Agricultural Products with Geographical Indications [PDF]
[Purpose/Significance] Geographical indications (GIs) are an important tool for local governments in China to carry out brand building of agricultural products. Brand management is a continuous systematic project involving multiple subjects.
HUO Mengjia, LIU Juan, Huang Jie
doaj +1 more source
Research on post occupancy evaluation of Oze National Park in Japan based on online reviews
With the development of internet, online reviews are user-generated content posted in the web-media era and can extract meaning from the comments through data-mining technology.
Shouni Tang +3 more
doaj +1 more source
Scaling-laws of human broadcast communication enable distinction between human, corporate and robot Twitter users. [PDF]
Human behaviour is highly individual by nature, yet statistical structures are emerging which seem to govern the actions of human beings collectively. Here we search for universal statistical laws dictating the timing of human actions in communication ...
Faisal, A, Tavares, G
core +2 more sources
Crawling the German Health Web: Exploratory Study and Graph Analysis
BackgroundThe internet has become an increasingly important resource for health information. However, with a growing amount of web pages, it is nearly impossible for humans to manually keep track of evolving and continuously changing content in the ...
Zowalla, Richard +2 more
doaj +1 more source

