
Using Web Crawler Technology for Text Analysis of Geo-Events: A Case Study of the Huangyan Island Incident [PDF]

open access: yes, The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, 2013
As social networking and network socialisation have brought more text information and social relationships into our daily lives, the question of whether big data can be fully used to study the phenomenon and discipline of natural sciences has ...
H. Hu, Y. J. Ge
doaj   +1 more source

iCrawl: Improving the Freshness of Web Collections by Integrating Social Web and Focused Web Crawling [PDF]

open access: yes, Proceedings of the 15th ACM/IEEE-CS Joint Conference on Digital Libraries (pp. 75--84) (2015), 2016
Researchers in the Digital Humanities and journalists need to monitor, collect and analyze fresh online content regarding current events such as the Ebola outbreak or the Ukraine crisis on demand. However, existing focused crawling approaches only consider topical aspects while ignoring temporal aspects and therefore cannot achieve thematically ...
arxiv   +1 more source

Research on post occupancy evaluation of Oze National Park in Japan based on online reviews

open access: yes, Journal of Asian Architecture and Building Engineering, 2023
With the development of the internet, online reviews have become user-generated content of the web-media era, and meaning can be extracted from these comments through data-mining technology.
Shouni Tang   +3 more
doaj   +1 more source

The Internet with Privacy Policies: Measuring The Web Upon Consent [PDF]

open access: yes, 2021
To protect users' privacy, legislators have regulated the usage of tracking technologies, mandating the acquisition of users' consent before collecting data. Consequently, websites started showing more and more consent management modules -- i.e., Privacy Banners -- the visitors have to interact with to access the website content.
arxiv   +1 more source

Crawling the German Health Web: Exploratory Study and Graph Analysis

open access: yes, Journal of Medical Internet Research, 2020
Background: The internet has become an increasingly important resource for health information. However, with a growing number of web pages, it is nearly impossible for humans to manually keep track of evolving and continuously changing content in the ...
Zowalla, Richard   +2 more
doaj   +1 more source

Tourists’ Perceived Attitudes toward the Famous Terraced Agricultural Cultural Heritage Landscape in China

open access: yes, Agriculture, 2022
Terraces are the major vehicle for agricultural activities in mountainous areas and are an important component of the agro-cultural heritage landscape. This work explores tourists’ perceived attitudes toward, and characteristics of, terraced agro-cultural ...
Xiaopiao Yang   +4 more
doaj   +1 more source

Analysis and Evaluation of the Link and Content Based Focused Treasure-Crawler [PDF]

open access: yes, 2013
Indexing the Web is becoming a laborious task for search engines as the Web exponentially grows in size and distribution. Presently, the most effective known approach to overcome this problem is the use of focused crawlers. A focused crawler applies a proper algorithm in order to detect the pages on the Web that relate to its topic of interest.
arxiv   +1 more source

A Proposed Architecture for Continuous Web Monitoring Through Online Crawling of Blogs [PDF]

open access: yes, International Journal of UbiComp (IJU), Vol.3, No.1, January 2012, 2012
Being informed in time of what is registered in the Web space can greatly help psychologists, marketers and political analysts to familiarize themselves, analyse, make decisions and act correctly based on society's different needs. The great volume of information in the Web space hinders us from continuously investigating the whole space of the Web online ...
arxiv   +1 more source

An Ontology-Based Focused Crawler [PDF]

open access: yes, 2008
In this paper we present a novel approach for building a focused crawler. The goal of our crawler is to effectively identify web pages that relate to a set of pre-defined topics and download them regardless of their web topology or connectivity with other popular pages on the web.
openaire   +2 more sources

Research on Sound Imagery of Electric Shavers Based on Kansei Engineering and Multiple Artificial Neural Networks

open access: yes, Applied Sciences, 2022
The electric shaver market in China reached 26.3 billion RMB by 2021. Nowadays, in addition to functional satisfaction, consumers are increasingly focused on the emotional imagery conveyed by products through multiple senses, and electric shavers are not only ...
Zhe-Hui Lin   +3 more
doaj   +1 more source
