Results 191 to 200 of about 11,273 (224)
Some of the next articles are maybe not open access.

Hadoop and MapReduce

2016
In this chapter we consider situations in which a single host computer is inadequate because the data volume or processing demand exceeds the capacity of the host. A popular solution distributes the data and computations across a network of computers or a short-lived network created for the task (a cluster).
Brian Steele   +2 more
openaire   +1 more source

Hadoop Tools

2018
As the name indicates, this chapter explains the various additional tools provided by Hadoop. The additional tools provided by Hadoop distribution are Hadoop Streaming, Hadoop Archives, DistCp, Rumen, GridMix, and Scheduler Load Simulator. Hadoop Streaming is a utility that allows the user to have any executable or script for both mapper and reducer ...
openaire   +1 more source

Connecting to Hadoop

2019
Chapter 2 showed us how to use PolyBase to integrate SQL Server with Azure Blob Storage. In this chapter, we will integrate to the original external data source: Hadoop. In the first part of this chapter, we will take a peek at an already-built Hadoop cluster.
openaire   +1 more source

A comprehensive bibliometric analysis of Apache Hadoop from 2008 to 2020

International Journal of Intelligent Computing and Cybernetics, 2023
Jianpeng Zhang, Mingwei Lin
exaly  

Performance optimization of computing task scheduling based on the Hadoop big data platform

Neural Computing and Applications, 2022
Xinhong Hei, Hei Xinhong
exaly  

Investigating the performance of Hadoop and Spark platforms on machine learning algorithms

Journal of Supercomputing, 2020
Ali Mostafaeipour   +2 more
exaly  

Investigating Automatic Parameter Tuning for SQL-on-Hadoop Systems

Big Data Research, 2021
Edson Ramiro Lucas Filho   +2 more
exaly  

Enhancements in Hadoop

2021
Jawwad Ahmed Shamsi   +1 more
openaire   +1 more source

Home - About - Disclaimer - Privacy