CnC-Hadoop

Proceedings of the 8th ACM International Conference on Computing Frontiers, 2011
The information-technology platform is being radically transformed with the widespread adoption of the cloud computing model supported by data centers containing large numbers of multicore servers. While cloud computing platforms can potentially enable a rich variety of distributed applications, the need to exploit multiscale parallelism at the inter ...
Riyaz Haque   +2 more

Kepler + Hadoop

Proceedings of the 4th Workshop on Workflows in Support of Large-Scale Science, 2009
MapReduce provides a parallel and scalable programming model for data-intensive business and scientific applications. MapReduce and its de facto open source project, called Hadoop, support parallel processing on large datasets with capabilities including automatic data partitioning and distribution, load balancing, and fault tolerance management ...
Jianwu Wang   +2 more

Hadoop's adolescence

Proceedings of the VLDB Endowment, 2013
We analyze Hadoop workloads from three different research clusters from a user-centric perspective. The goal is to better understand data scientists' use of the system and how well the use of the system matches its design. Our analysis suggests that Hadoop usage is still in its adolescence. We see underuse of Hadoop features, extensions, and tools.
Kai Ren   +3 more

Hadoop

2018
K. G. Srinivasa   +2 more

ST-Hadoop

Proceedings of the 2017 ACM International Conference on Management of Data, 2017
This paper presents ST-Hadoop, the first full-fledged open-source MapReduce framework with native support for spatio-temporal data. ST-Hadoop is a comprehensive extension to Hadoop and SpatialHadoop that injects spatio-temporal data awareness inside each of their layers, mainly the language, indexing, and operations layers.

Beyond Hadoop

Communications of the ACM, 2013
The leading open source system for processing big data continues to evolve, but new approaches with added features are on the rise.

HJ-Hadoop

Proceedings of the 2013 companion publication for conference on Systems, programming, & applications: software for humanity, 2013
We introduce HabaneroJava-Hadoop, an extension to the Hadoop MapReduce system that is optimized for multi-core machines. HJ-Hadoop exploits intra-JVM parallelism that increases memory efficiency of each node. Results show a significant improvement in the amount of data each MapReduce job could process and load balance across cores for certain ...

Hadoop Tools

2018
As the name indicates, this chapter explains the various additional tools provided by Hadoop. The additional tools provided by the Hadoop distribution are Hadoop Streaming, Hadoop Archives, DistCp, Rumen, GridMix, and Scheduler Load Simulator. Hadoop Streaming is a utility that allows the user to have any executable or script for both mapper and reducer ...

[garbled non-English title] Mahout [...] Hadoop

2013
This study is a guide to recommender systems as they are implemented through the open source programs Hadoop and Mahout. Hadoop is a program written in Java that implements Map-Reduce processes. On its own it cannot be considered a recommender system, but with the help of the other studied software, Mahout, it is possible to make use of the distributed ...