CaSePer: An efficient model for personalized web page change detection based on segmentation
Users who visit a web page repeatedly at frequent intervals are more interested in knowing the recent changes that have occurred on the page than the entire contents of the web page.
K.S. Kuppusamy, G. Aghila
doaj +1 more source
Profiling Web Archive Coverage for Top-Level Domain and Content Language [PDF]
The Memento aggregator currently polls every known public web archive when serving a request for an archived web page, even though some web archives focus on only specific domains and ignore the others.
AlSum, Ahmed +3 more
core +3 more sources
Web Page Content Block Identification with Extended Block Properties
Web page segmentation is one of the most influential factors for the automated integration of web page content with other systems. Existing solutions are focused on segmentation but do not provide a more detailed description of the segment including its ...
Kiril Griazev, Simona Ramanauskaitė
doaj +1 more source
HTMLPhish: Enabling Phishing Web Page Detection by Applying Deep Learning Techniques on HTML Analysis [PDF]
Developing and launching phishing attacks now requires little technical skill or cost. This has led to an ever-growing number of phishing attacks on the World Wide Web.
Chen, Yingke, Opara, Chidimma, Wei, Bo
core +2 more sources
An architecture for a focused trend parallel web crawler with the application of clickstream analysis [PDF]
The tremendous growth of the Web poses many challenges for all-purpose single-process crawlers including the presence of some irrelevant answers among search results and the coverage and scaling issues regarding the enormous dimension of the World Wide ...
Ahmadi-Abkenari, Fatemeh, Selamat, Ali
core +1 more source
Web Application Page Element Recognition and Visual Script Generation Based on Machine Vision [PDF]
To provide richer interactive responses, the visual elements of Web applications are becoming more complex and diverse. Traditional DOM-based testing cannot meet the new requirements for testing Web applications. A new generation test
LI Zi-dong, YAO Yi-fei, WANG Wei-wei, ZHAO Rui-lian
doaj +1 more source
Semantic Web and Web Page Clustering Algorithms: A Landscape View
A major development of the semantic web has been the exchange of data between applications in all domains of activity. Based on this vision, different applications in recent days, e.g. in the fields of community web portals, social networking, e-learning,
Ahmed J. Obaid +2 more
doaj +1 more source
Arabic Web page clustering: A review
Clustering is the method employed to group Web pages containing related information into clusters, which facilitates the allocation of relevant information. Clustering performance is mostly dependent on the text features' characteristics.
Hanan M. Alghamdi, Ali Selamat
doaj +1 more source
User Identity Information Aggregation Method for Darknet Web Page [PDF]
The distribution of user identity information dispersed across darknet Web pages exhibits sparse and irregular characteristics, and current mainstream information aggregation techniques cannot be directly applied to this context.
Yuyan WANG, Jiapeng ZHAO, Jinqiao SHI, Liyan SHEN, Hongmeng LIU, Yanyan YANG
doaj +1 more source
Learning Visual Features from Snapshots for Web Search
When applying learning to rank algorithms to Web search, a large number of features are usually designed to capture the relevance signals. Most of these features are computed based on the extracted textual elements, link analysis, and user logs. However,
Cheng, Xueqi +5 more
core +1 more source

