Results 231 to 240 of about 9,555 (241)
Some of the next articles are maybe not open access.
Optimization Analysis of Hadoop
2016Hadoop is a distributed data processing platform supporting MapReduce parallel computing framework. In order to deal with general problems, there is always a need of accelerating Hadoop under certain circumstance such as Hive jobs. By outputting current time to logs at specially selected points, we traced the workflow of a typical MapReduce job ...
Hongzhi Wang, Shengfei Shi, Jinglun Li
openaire +2 more sources
2016
In this chapter, you will learn the basics of some other Hadoop ecosystem tools such as Zookeeper, Cascading, Presto, Tez, and Spark.
openaire +2 more sources
In this chapter, you will learn the basics of some other Hadoop ecosystem tools such as Zookeeper, Cascading, Presto, Tez, and Spark.
openaire +2 more sources
Hadoop MapReduce Performance Optimization Analysis by Calibrating Hadoop Parameters
The Journal of Korean Institute of Information Technology, 2021Keungyeup Ji, Youngmi Kwon
openaire +1 more source
2018
Hadoop is one of the most popular distributed big data platforms in the world. Besides computing power, its storage subsystem capability is also a key factor in its overall performance. In particular, there are many intermediate file exchanges for MapReduce.
openaire +2 more sources
Hadoop is one of the most popular distributed big data platforms in the world. Besides computing power, its storage subsystem capability is also a key factor in its overall performance. In particular, there are many intermediate file exchanges for MapReduce.
openaire +2 more sources
2016
Apache Hadoop is the de facto framework for processing large data sets. Apache Hadoop is a distributed software application that runs across several (up to hundreds and thousands) of nodes across a cluster. Apache Hadoop comprises of two main components: Hadoop Distributed File System (HDFS) and MapReduce.
openaire +2 more sources
Apache Hadoop is the de facto framework for processing large data sets. Apache Hadoop is a distributed software application that runs across several (up to hundreds and thousands) of nodes across a cluster. Apache Hadoop comprises of two main components: Hadoop Distributed File System (HDFS) and MapReduce.
openaire +2 more sources
2014
We live in a very insecure world. Starting with the key to your home’s front door to those all-important virtual keys, your passwords, everything needs to be secured—and well. In the world of Big Data where humungous amounts of data are processed, transformed, and stored, it’s all the more important to secure your data.
openaire +2 more sources
We live in a very insecure world. Starting with the key to your home’s front door to those all-important virtual keys, your passwords, everything needs to be secured—and well. In the world of Big Data where humungous amounts of data are processed, transformed, and stored, it’s all the more important to secure your data.
openaire +2 more sources