Some of the following articles may not be open access.
2016
In this chapter we consider situations in which a single host computer is inadequate because the data volume or processing demand exceeds the capacity of the host. A popular solution distributes the data and computations across a network of computers or a short-lived network created for the task (a cluster).
Brian Steele +2 more
openaire +1 more source
2018
As the name indicates, this chapter explains the additional tools that ship with the Hadoop distribution: Hadoop Streaming, Hadoop Archives, DistCp, Rumen, GridMix, and Scheduler Load Simulator. Hadoop Streaming is a utility that lets the user supply any executable or script as both mapper and reducer ...
openaire +1 more source
2019
Chapter 2 showed us how to use PolyBase to integrate SQL Server with Azure Blob Storage. In this chapter, we will integrate with the original external data source: Hadoop. In the first part of this chapter, we will take a peek at an already-built Hadoop cluster.
openaire +1 more source
A comprehensive bibliometric analysis of Apache Hadoop from 2008 to 2020
International Journal of Intelligent Computing and Cybernetics, 2023. Jianpeng Zhang, Mingwei Lin
exaly
Performance optimization of computing task scheduling based on the Hadoop big data platform
Neural Computing and Applications, 2022. Xinhong Hei, Hei Xinhong
exaly
Investigating the performance of Hadoop and Spark platforms on machine learning algorithms
Journal of Supercomputing, 2020. Ali Mostafaeipour +2 more
exaly
Investigating Automatic Parameter Tuning for SQL-on-Hadoop Systems
Big Data Research, 2021. Edson Ramiro Lucas Filho +2 more
exaly
COSHH: A classification and optimization based scheduler for heterogeneous Hadoop systems
Future Generation Computer Systems, 2014. Douglas G Down
exaly

