Results 31 to 40 of about 13,834 (319)
Improving the performance of focused web crawlers
This work addresses issues related to the design and implementation of focused crawlers. Several variants of state-of-the-art crawlers relying on web page content and link information for estimating the relevance of web pages to a given topic are proposed.
Πετρακης Ευριπιδης(http://users.isc.tuc.gr/~epetrakis)+3 more
openaire +3 more sources
Construction and Application of the Attention Analysis Model of Brand Management Policies of Agricultural Products with Geographical Indications [PDF]
[Purpose/Significance] Geographical indications (GIs) are an important tool for local governments in China to carry out brand building of agricultural products. Brand management is a continuous systematic project involving multiple subjects.
HUO Mengjia, LIU Juan, Huang Jie
doaj +1 more source
Automatically assembling a full census of an academic field [PDF]
The composition of the scientific workforce shapes the direction of scientific research, directly through the selection of questions to investigate, and indirectly through its influence on the training of future scientists.
Clauset, Aaron+2 more
core +4 more sources
Using Web Crawler Technology for Text Analysis of Geo-Events: A Case Study of the Huangyan Island Incident [PDF]
With the social networking and network socialisation have brought more text information and social relationships into our daily lives, the question of whether big data can be fully used to study the phenomenon and discipline of natural sciences has ...
H. Hu, Y. J. Ge
doaj +1 more source
Towards Understanding Political Interactions on Instagram [PDF]
Online Social Networks (OSNs) allow personalities and companies to communicate directly with the public, bypassing filters of traditional medias. As people rely on OSNs to stay up-to-date, the political debate has moved online too.
Almeida, Jussara M.+7 more
core +3 more sources
Crawling the German Health Web: Exploratory Study and Graph Analysis
BackgroundThe internet has become an increasingly important resource for health information. However, with a growing amount of web pages, it is nearly impossible for humans to manually keep track of evolving and continuously changing content in the ...
Zowalla, Richard+2 more
doaj +1 more source
Research on post occupancy evaluation of Oze National Park in Japan based on online reviews
With the development of internet, online reviews are user-generated content posted in the web-media era and can extract meaning from the comments through data-mining technology.
Shouni Tang+3 more
doaj +1 more source
Terraces are the major vehicle for agricultural activities in mountainous areas and are an important component of the agro-cultural heritage landscape. This work explores tourists’ perceived attitudes toward, and characteristics of terraced agro-cultural
Xiaopiao Yang+4 more
doaj +1 more source
Researchers in the Digital Humanities and journalists need to monitor, collect and analyze fresh online content regarding current events such as the Ebola outbreak or the Ukraine crisis on demand.
Diligenti M.+4 more
core +1 more source
An Ontology-Based Focused Crawler [PDF]
In this paper we present a novel approach for building a focused crawler. The goal of our crawler is to effectively identify web pages that relate to a set of pre-defined topics and download them regardless of their web topology or connectivity with other popular pages on the web.
openaire +2 more sources