NeuroPigPen: A Scalable Toolkit for Processing Electrophysiological Signal Data in Neuroscience Applications Using Apache Pig [PDF]

open access: yes, Frontiers in Neuroinformatics, 2016
The recent advances in neurological imaging and sensing technologies have led to a rapid increase in the volume, rate of data generation, and variety of neuroscience data. This "neuroscience Big data" represents a significant opportunity for the biomedical research community to design experiments using data with greater timescale, large number of ...
Satya S Sahoo   +2 more
exaly   +5 more sources

Analysis of data processing efficiency with use of Apache Hive and Apache Pig in Hadoop environment

open access: yes, Journal of Computer Sciences Institute
The aim of this paper is to analyze data processing efficiency when using Apache Hive and Apache Pig in a Hadoop environment. The analysis compares the two tools on a large data set, represented by 28 million ...
Mikołaj Skrzypczyński, Piotr Muryjas
doaj   +2 more sources

An Overview of Apache Pig and Apache Hive

open access: yes, International Journal of Scientific Research in Computer Science Engineering and Information Technology, 2019
As technology has advanced, data has been growing at an alarming rate. The most prominent driver of this growth is social media, which gives rise to a tremendous amount of data called Big Data. Big Data is a term used for data sets that are extremely large in size as well as complicated to store and process ...
Saiyam Arora   +3 more
exaly   +2 more sources

The research of social processes at the university using big data [PDF]

open access: yes, MATEC Web of Conferences, 2021
The volume of information in the 21st century is growing at a rapid pace. Big data technologies are used to process modern information. This article discusses the use of big data technologies to implement monitoring of social processes.
Hacimahmud Abdullayev Vugar   +2 more
doaj   +1 more source

SDE based Unified Scheme for Developing Entropy Prediction Models for OSS [PDF]

open access: yes, International Journal of Mathematical, Engineering and Management Sciences, 2021
Today, software must be modified to meet users' requirements. At the same time, incorporating these modifications and requirements involves enormous changes to the software's code and over a ...
Deepika   +3 more
doaj   +1 more source

Big Data Infrastructure Design Optimizes Using Hadoop Technologies Based on Application Performance Analysis

open access: yes, Sistemasi: Jurnal Sistem Informasi, 2022
Big data infrastructure is a technology that provides the ability to store, process, analyze, and visualize large data sets. The tools and applications used are among the challenges when building big data infrastructure.
Shafiyah Shafiyah   +2 more
doaj   +1 more source

Frameworks para la gestión, el almacenamiento y la preparación de grandes volúmenes de datos Big Data [Frameworks for the management, storage, and preparation of large volumes of Big Data]

open access: yes, Tecnología, Ciencia y Educación, 2015
Meteorological systems, such as the World Meteorological Organization's Global Information System, need to store different types of images, data, and files.
Marco Antonio Almeida Pazmiño   +2 more
doaj   +1 more source

Apache Calcite: A Foundational Framework for Optimized Query Processing Over Heterogeneous Data Sources [PDF]

open access: yes, 2018
Apache Calcite is a foundational software framework that provides query processing, optimization, and query language support to many popular open-source data processing systems such as Apache Hive, Apache Storm, Apache Flink, Druid, and MapD.
Begoli, Edmon   +4 more
core   +2 more sources

Integración de herramientas para la toma de decisiones en la congestión vehicular [Integration of tools for decision-making in traffic congestion]

open access: yes, Dyna, 2018
This study aims to present an analysis of the use and integration of technological tools that help in making decisions in situations of traffic congestion.
Nelson Ivan Herrera-Herrera   +2 more
doaj   +1 more source

Resilient store: a heuristic-based data format selector for intermediate results [PDF]

open access: yes, 2016
The final publication is available at link.springer.com. Large-scale data analysis is an important activity in many organizations that typically requires the deployment of data-intensive workflows.
Abelló Gamazo, Alberto   +5 more
core   +1 more source
