Presentation of a (Partially) Automated Method for Analyzing the Multimodality of Web Pages
This article presents and discusses a method for the (partially) automated analysis of the multimodality of web pages. The focus lies on analyzing unknown web pages for their multimodality without annotating them
Thomas Jurczyk
doaj +1 more source
Near-Duplicate Web Page Detection: An Efficient Approach Using Clustering, Sentence Feature and Fingerprinting [PDF]
Duplicate and near-duplicate web pages are a chief concern for web search engines. In practice, they require enormous space to store the indexes, ultimately slowing down the serving of results and increasing its cost.
J. Prasanna Kumar, P. Govindarajulu
doaj +1 more source
A Survey Study on Relation Extraction for Web Pages [PDF]
Natural language is language used by humans for communication. Natural Language Processing (NLP) helps machines understand natural language.
Ghada Alsaigh +2 more
doaj +1 more source
Subcontracting accessible websites: what should we watch out for?
This article explains how to manage the subcontracting of an accessible website, especially from within a public body. It establishes criteria to include in the tender specifications, covering the supplier, the website design process, and the tool that will be used ...
Ribera, Mireia
doaj +1 more source
How users assess web pages for information-seeking [PDF]
In this paper, we investigate the criteria used by online searchers when assessing the relevance of web pages for information-seeking tasks. Twenty-four participants were given three tasks each, and indicated the features of web pages which they employed
Barry +40 more
core +5 more sources
File Not Found: Error 404 as an Example of a Spontaneous Web Genre
The development of the Internet has led to the emergence of new digital genres, also known as cybergenres or web genres. The existing research into diverse web pages has revealed that the new medium not only generated changes in traditional genres ...
Grzegorz Cebrat
doaj +1 more source
An Improved Framework for Content- and Link-Based Web-Spam Detection: A Combined Approach
In this modern era, people utilise the web to share information and to deliver services and products. The information seekers use different search engines (SEs) such as Google, Bing, and Yahoo as tools to search for products, services, and information ...
Asim Shahzad +3 more
doaj +1 more source
Automatically Discovering Relevant Images From Web Pages
Web pages contain irrelevant images along with relevant images. The classification of these images is an error-prone process due to the number of design variations of web pages.
Erdinc Uzun +4 more
doaj +1 more source
A Comparison of Techniques for Sampling Web Pages [PDF]
As the World Wide Web is growing rapidly, it is getting increasingly challenging to gather representative information about it. Instead of crawling the web exhaustively one has to resort to other techniques like sampling to determine the properties of ...
Baykan, Eda +4 more
core +6 more sources
Web pages are now written in many different languages, and web crawlers are important for search engines. Language-specific crawlers traverse and collect the relevant web pages by following the successive URLs found on each page. There has been very little research on crawling Myanmar-language web sites.
Yadana Thein, Su Mon Khine
openaire +2 more sources

