Results 31 to 40 of about 13,834 (319)

Improving the performance of focused web crawlers

open access: yesData & Knowledge Engineering, 2009
This work addresses issues related to the design and implementation of focused crawlers. Several variants of state-of-the-art crawlers relying on web page content and link information for estimating the relevance of web pages to a given topic are proposed.
Πετρακης Ευριπιδης(http://users.isc.tuc.gr/~epetrakis)   +3 more
openaire   +3 more sources

Construction and Application of the Attention Analysis Model of Brand Management Policies of Agricultural Products with Geographical Indications [PDF]

open access: yesNongye tushu qingbao xuebao, 2023
[Purpose/Significance] Geographical indications (GIs) are an important tool for local governments in China to carry out brand building of agricultural products. Brand management is a continuous systematic project involving multiple subjects.
HUO Mengjia, LIU Juan, Huang Jie
doaj   +1 more source

Automatically assembling a full census of an academic field [PDF]

open access: yes, 2018
The composition of the scientific workforce shapes the direction of scientific research, directly through the selection of questions to investigate, and indirectly through its influence on the training of future scientists.
Clauset, Aaron   +2 more
core   +4 more sources

Using Web Crawler Technology for Text Analysis of Geo-Events: A Case Study of the Huangyan Island Incident [PDF]

open access: yesThe International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, 2013
With the social networking and network socialisation have brought more text information and social relationships into our daily lives, the question of whether big data can be fully used to study the phenomenon and discipline of natural sciences has ...
H. Hu, Y. J. Ge
doaj   +1 more source

Towards Understanding Political Interactions on Instagram [PDF]

open access: yes, 2019
Online Social Networks (OSNs) allow personalities and companies to communicate directly with the public, bypassing filters of traditional medias. As people rely on OSNs to stay up-to-date, the political debate has moved online too.
Almeida, Jussara M.   +7 more
core   +3 more sources

Crawling the German Health Web: Exploratory Study and Graph Analysis

open access: yesJournal of Medical Internet Research, 2020
BackgroundThe internet has become an increasingly important resource for health information. However, with a growing amount of web pages, it is nearly impossible for humans to manually keep track of evolving and continuously changing content in the ...
Zowalla, Richard   +2 more
doaj   +1 more source

Research on post occupancy evaluation of Oze National Park in Japan based on online reviews

open access: yesJournal of Asian Architecture and Building Engineering, 2023
With the development of internet, online reviews are user-generated content posted in the web-media era and can extract meaning from the comments through data-mining technology.
Shouni Tang   +3 more
doaj   +1 more source

Tourists’ Perceived Attitudes toward the Famous Terraced Agricultural Cultural Heritage Landscape in China

open access: yesAgriculture, 2022
Terraces are the major vehicle for agricultural activities in mountainous areas and are an important component of the agro-cultural heritage landscape. This work explores tourists’ perceived attitudes toward, and characteristics of terraced agro-cultural
Xiaopiao Yang   +4 more
doaj   +1 more source

iCrawl: Improving the Freshness of Web Collections by Integrating Social Web and Focused Web Crawling

open access: yes, 2016
Researchers in the Digital Humanities and journalists need to monitor, collect and analyze fresh online content regarding current events such as the Ebola outbreak or the Ukraine crisis on demand.
Diligenti M.   +4 more
core   +1 more source

An Ontology-Based Focused Crawler [PDF]

open access: yes, 2008
In this paper we present a novel approach for building a focused crawler. The goal of our crawler is to effectively identify web pages that relate to a set of pre-defined topics and download them regardless of their web topology or connectivity with other popular pages on the web.
openaire   +2 more sources

Home - About - Disclaimer - Privacy