NeuroPigPen: A Scalable Toolkit for Processing Electrophysiological Signal Data in Neuroscience Applications Using Apache Pig [PDF]
Recent advances in neurological imaging and sensing technologies have led to a rapid increase in the volume, rate of generation, and variety of neuroscience data. This "neuroscience Big data" represents a significant opportunity for the biomedical research community to design experiments using data with greater timescale, large number of ...
Satya S Sahoo +2 more
exaly +5 more sources
Analysis of data processing efficiency with use of Apache Hive and Apache Pig in Hadoop environment
The aim of this paper is to analyze data processing efficiency using Apache Hive and Apache Pig in a Hadoop environment. The analysis compares the two tools on a large data set, represented by 28 million ...
Mikołaj Skrzypczyński, Piotr Muryjas
doaj +2 more sources
An Overview of Apache Pig and Apache Hive
As technology has advanced, data has been growing at an alarming rate. The most prominent driver of this growth is social media, which generates a tremendous amount of data known as Big Data. Big Data is a term used for data sets that are extremely large in size as well as complicated to store and process ...
Saiyam Arora +3 more
exaly +2 more sources
The research of social processes at the university using big data [PDF]
The volume of information in the 21st century is growing at a rapid pace. Big data technologies are used to process modern information. This article discusses the use of big data technologies to implement monitoring of social processes.
Hacimahmud Abdullayev Vugar +2 more
doaj +1 more source
SDE based Unified Scheme for Developing Entropy Prediction Models for OSS [PDF]
Today, software must be modified to meet users' requirements. At the same time, incorporating these modifications and requirements entails enormous changes to the software's code, and over a ...
Deepika +3 more
doaj +1 more source
Big data infrastructure is a technology that provides the ability to store, process, analyze, and visualize large data. The choice of tools and applications is one of the challenges in building big data infrastructure.
Shafiyah Shafiyah +2 more
doaj +1 more source
Meteorological systems, such as the World Meteorological Organization's global information system, need to store different types of images, data, and files.
Marco Antonio Almeida Pazmiño +2 more
doaj +1 more source
Apache Calcite: A Foundational Framework for Optimized Query Processing Over Heterogeneous Data Sources [PDF]
Apache Calcite is a foundational software framework that provides query processing, optimization, and query language support to many popular open-source data processing systems such as Apache Hive, Apache Storm, Apache Flink, Druid, and MapD.
Begoli, Edmon +4 more
core +2 more sources
Integración de herramientas para la toma de decisiones en la congestión vehicular [Integration of tools for decision-making in traffic congestion]
The purpose of this study is to present an analysis of the use and integration of technological tools that support decision-making in traffic congestion situations.
Nelson Ivan Herrera-Herrera +2 more
doaj +1 more source
Resilient store: a heuristic-based data format selector for intermediate results [PDF]
The final publication is available at link.springer.com. Large-scale data analysis is an important activity in many organizations that typically requires the deployment of data-intensive workflows.
Abelló Gamazo, Alberto +5 more
core +1 more source

