Results 21 to 30 of about 5,534 (158)
Apache Calcite: A Foundational Framework for Optimized Query Processing Over Heterogeneous Data Sources [PDF]
Apache Calcite is a foundational software framework that provides query processing, optimization, and query language support to many popular open-source data processing systems such as Apache Hive, Apache Storm, Apache Flink, Druid, and MapD.
Begoli, Edmon +4 more
core +2 more sources
A canonical model for seasonal climate prediction using Big Data
This article addresses the elaboration of a canonical model, involving methods, techniques, metrics, tools, and Big Data, applied to the knowledge of seasonal climate prediction, aiming at greater dynamics, speed, conciseness, and scalability.
M. P. Ramos +5 more
doaj +1 more source
Analysis of Apache Logs Using Hadoop and Hive [PDF]
In this paper we consider an analysis of Apache web logs using Cloudera Hadoop distribution and Hive for querying the data in the web logs. We used public available web logs from NASA Kennedy Space Center server. HDFS (Hadoop distributed file system) was used as a logs container.
Velinov, Aleksandar, Zdravev, Zoran
openaire +1 more source
Our analysis shows that compared to intramuscular injection epinephrine, continuous intravenous (CIV) infusion of epinephrine for the treatment of anaphylaxis has fewer adverse events, improves symptoms, and is relatively easy to administer under ready conditions. CIV infusion of epinephrine may also reduce the incidence of biphasic reactions.
Kenji Fujizuka +3 more
wiley +1 more source
An Overview of Apache Pig and Apache Hive
Ever since the enhancement of technology has taken place, the data is growing at an alarming rate. The most prominent factor of data growth is the “Social Media”, leads to the origination of a tremendous amount of data called Big Data. Big Data is a term used for data sets that are extremely large in size as well as complicated to store and process ...
Saiyam Arora +3 more
openaire +1 more source
This article first established a university network education system model based on physical failure repair behavior at the big data infrastructure layer and then examined in depth the complex common causes of multiple data failures in the big data environment caused by a single physical machine failure, all based on the principle of mobile edge ...
Min Zhu, Xin Ning
wiley +1 more source
The so‐called multimodal information refers to the information from different information sources on different or the same side of the same description target. These pieces of information are different in terms of storage structure, representation, semantic connotation, credibility, and emphasis, but there is a certain inevitable connection between ...
Guimei Yang, Fusheng Zhu
wiley +1 more source
A Tale of Two Data-Intensive Paradigms: Applications, Abstractions, and Architectures [PDF]
Scientific problems that depend on processing large amounts of data require overcoming challenges in multiple areas: managing large-scale data distribution, co-placement and scheduling of data with compute resources, and storing and transferring large ...
Fox, Geoffrey C. +4 more
core +1 more source
ShenZhen transportation system (SZTS): a novel big data benchmark suite [PDF]
Data analytics is at the core of the supply chain for both products and services in modern economies and societies. Big data workloads, however, are placing unprecedented demands on computing technologies, calling for a deep understanding and ...
Bei, Zhengdong +5 more
core +2 more sources
Real-time Twitter data analysis using Hadoop ecosystem
In the era of the Internet, social media has become an integral part of modern society. People use social media to share their opinions and to have an up-to-date knowledge about the current trends on a daily basis.
Anisha P. Rodrigues +1 more
doaj +1 more source

