Efficient processing of complex XSD using Hive and Spark [PDF]
The eXtensible Markup Language (XML) files are widely used by the industry due to their flexibility in representing numerous kinds of data. Multiple applications such as financial records, social networks, and mobile networks use complex XML schemas with
Diana Martinez-Mosquera +2 more
doaj +3 more sources
Human Behavior Analysis Using Intelligent Big Data Analytics [PDF]
Intelligent big data analysis is an evolving pattern in the age of big data science and artificial intelligence (AI). Analysis of organized data has been very successful, but analyzing human behavior using social media data becomes challenging.
Muhammad Usman Tariq +5 more
doaj +2 more sources
Enhancing monitoring of suspicious activities with AI-based and big data fusion [PDF]
This study provides an AI-based detection tool for the surveillance of suspicious activities using data fusion. The system leverages time, location, and specific data pertaining to individuals, objects, and vehicles associated with the agency.
Surapol Vorapatratorn
doaj +3 more sources
Materialized View Selection Based on Adaptive Genetic Algorithm and Its Implementation with Apache Hive [PDF]
Frequently accessed views in data warehouses are usually materialized in order to accelerate the speed of querying big data. However, the view materialization itself incurs huge costs.
Dongjin Yu +3 more
doaj +3 more sources
UniqueNOSD: a novel framework for NoSQL over SQL databases [PDF]
To date, most large corporations still have their core solutions on relational databases but only use non-relational (i.e. NoSQL) database management systems (DBMS) for their non-core systems that favour availability and scalability through partitioning ...
Abdulrauf A. Gidado, C. I. Ezeife
doaj +2 more sources
Analysis of data processing efficiency with use of Apache Hive and Apache Pig in Hadoop environment
The aim of this paper is the analysis of data processing efficiency with use of Apache Hive and Apache Pig in Hadoop environment. The analysis was based on comparison between both mentioned tools with use of large data set, represented by 28 million ...
Mikołaj Skrzypczyński, Piotr Muryjas
doaj +2 more sources
A method for integrating GIS and big data platforms [PDF]
Geographic Information System (GIS) has been played an important role in many applications of our daily life since 1970. Recently, with the rapid development of new technologies, earth’s data increases explosively. Many studies have been proposed to
Hong Le
doaj +1 more source
The research of social processes at the university using big data [PDF]
The volume of information in the 21st century is growing at a rapid pace. Big data technologies are used to process modern information. This article discusses the use of big data technologies to implement monitoring of social processes.
Hacimahmud Abdullayev Vugar +2 more
doaj +1 more source
SDE based Unified Scheme for Developing Entropy Prediction Models for OSS [PDF]
Today, so as to meet the user's requirement, modification of software is necessarily required. But at the same time, to incorporate these modifications and requirements there are enormous changes which are made to the coding of the software and over a ...
Deepika +3 more
doaj +1 more source
Efficient data replay mechanism of sensor stream data based on concurrent buffer pool
Analyzing historical time series data in the form of replay is very important. When replaying massive sensor data, long processing times predominantly occur, that significantly affects the performance of the whole system.
Feng Ye +5 more
doaj +1 more source

