Results 41 to 50 of about 76,818 (217)
Using Web Crawler Technology for Text Analysis of Geo-Events: A Case Study of the Huangyan Island Incident [PDF]
With the social networking and network socialisation have brought more text information and social relationships into our daily lives, the question of whether big data can be fully used to study the phenomenon and discipline of natural sciences has ...
H. Hu, Y. J. Ge
doaj +1 more source
iCrawl: Improving the Freshness of Web Collections by Integrating Social Web and Focused Web Crawling [PDF]
Researchers in the Digital Humanities and journalists need to monitor, collect and analyze fresh online content regarding current events such as the Ebola outbreak or the Ukraine crisis on demand. However, existing focused crawling approaches only consider topical aspects while ignoring temporal aspects and therefore cannot achieve thematically ...
arxiv +1 more source
Research on post occupancy evaluation of Oze National Park in Japan based on online reviews
With the development of internet, online reviews are user-generated content posted in the web-media era and can extract meaning from the comments through data-mining technology.
Shouni Tang+3 more
doaj +1 more source
The Internet with Privacy Policies: Measuring The Web Upon Consent [PDF]
To protect users' privacy, legislators have regulated the usage of tracking technologies, mandating the acquisition of users' consent before collecting data. Consequently, websites started showing more and more consent management modules -- i.e., Privacy Banners -- the visitors have to interact with to access the website content.
arxiv +1 more source
Crawling the German Health Web: Exploratory Study and Graph Analysis
BackgroundThe internet has become an increasingly important resource for health information. However, with a growing amount of web pages, it is nearly impossible for humans to manually keep track of evolving and continuously changing content in the ...
Zowalla, Richard+2 more
doaj +1 more source
Terraces are the major vehicle for agricultural activities in mountainous areas and are an important component of the agro-cultural heritage landscape. This work explores tourists’ perceived attitudes toward, and characteristics of terraced agro-cultural
Xiaopiao Yang+4 more
doaj +1 more source
Analysis and Evaluation of the Link and Content Based Focused Treasure-Crawler [PDF]
Indexing the Web is becoming a laborious task for search engines as the Web exponentially grows in size and distribution. Presently, the most effective known approach to overcome this problem is the use of focused crawlers. A focused crawler applies a proper algorithm in order to detect the pages on the Web that relate to its topic of interest.
arxiv +1 more source
A Proposed Architecture for Continuous Web Monitoring Through Online Crawling of Blogs [PDF]
Getting informed of what is registered in the Web space on time, can greatly help the psychologists, marketers and political analysts to familiarize, analyse, make decision and act correctly based on the society`s different needs. The great volume of information in the Web space hinders us to continuously online investigate the whole space of the Web ...
arxiv +1 more source
An Ontology-Based Focused Crawler [PDF]
In this paper we present a novel approach for building a focused crawler. The goal of our crawler is to effectively identify web pages that relate to a set of pre-defined topics and download them regardless of their web topology or connectivity with other popular pages on the web.
openaire +2 more sources
The electric shaver market in China reach 26.3 billion RMB by 2021. Nowadays, in addition to functional satisfaction, consumers are increasingly focused on the emotional imagery conveyed by products with multiple-senses, and electric shavers are not only
Zhe-Hui Lin+3 more
doaj +1 more source