Results 191 to 200 of about 54,414 (244)
Some of the next articles are maybe not open access.
Proceedings of the VLDB Endowment, 2013
We analyze Hadoop workloads from three di?erent research clusters from a user-centric perspective. The goal is to better understand data scientists' use of the system and how well the use of the system matches its design. Our analysis suggests that Hadoop usage is still in its adolescence. We see underuse of Hadoop features, extensions, and tools.
Kai Ren +3 more
openaire +1 more source
We analyze Hadoop workloads from three di?erent research clusters from a user-centric perspective. The goal is to better understand data scientists' use of the system and how well the use of the system matches its design. Our analysis suggests that Hadoop usage is still in its adolescence. We see underuse of Hadoop features, extensions, and tools.
Kai Ren +3 more
openaire +1 more source
Proceedings of the 2017 ACM International Conference on Management of Data, 2017
This paper presents ST-Hadoop; the first full-fledged open-source MapReduce framework with a native support for spatio-temporal data. ST-Hadoop is a comprehensive extension to Hadoop and SpatialHadoop that injects spatio-temporal data awareness inside each of their layers, mainly, language, indexing, and operations layers.
openaire +1 more source
This paper presents ST-Hadoop; the first full-fledged open-source MapReduce framework with a native support for spatio-temporal data. ST-Hadoop is a comprehensive extension to Hadoop and SpatialHadoop that injects spatio-temporal data awareness inside each of their layers, mainly, language, indexing, and operations layers.
openaire +1 more source
The Hadoop Distributed File System
IEEE Conference on Mass Storage Systems and Technologies, 2010Konstantin Shvachko +3 more
semanticscholar +1 more source
Communications of the ACM, 2013
The leading open source system for processing big data continues to evolve, but new approaches with added features are on the rise.
openaire +1 more source
The leading open source system for processing big data continues to evolve, but new approaches with added features are on the rise.
openaire +1 more source
Proceedings of the 2013 companion publication for conference on Systems, programming, & applications: software for humanity, 2013
We introduces HabaneroJava-Hadoop, an extension to the HadoopMapReduce system that is optimized for multi-core machines. HJ-Hadoop exploits intra-JVM parallelism that increases memory efficiency of each node. Results show a significant improvement in the amount of data each MapReduce job could process and load balance across cores for certain ...
openaire +1 more source
We introduces HabaneroJava-Hadoop, an extension to the HadoopMapReduce system that is optimized for multi-core machines. HJ-Hadoop exploits intra-JVM parallelism that increases memory efficiency of each node. Results show a significant improvement in the amount of data each MapReduce job could process and load balance across cores for certain ...
openaire +1 more source
2018
As the name indicates, this chapter explains the various additional tools provided by Hadoop. The additional tools provided by Hadoop distribution are Hadoop Streaming, Hadoop Archives, DistCp, Rumen, GridMix, and Scheduler Load Simulator. Hadoop Streaming is a utility that allows the user to have any executable or script for both mapper and reducer ...
openaire +1 more source
As the name indicates, this chapter explains the various additional tools provided by Hadoop. The additional tools provided by Hadoop distribution are Hadoop Streaming, Hadoop Archives, DistCp, Rumen, GridMix, and Scheduler Load Simulator. Hadoop Streaming is a utility that allows the user to have any executable or script for both mapper and reducer ...
openaire +1 more source
������������������ ������������������ ���� �������������������� Mahout ������ Hadoop
2013This study is a guide for recommender systems as they are implemented through the open source programs, Hadoop and Mahout. Hadoop is a program written in Java and implements Map-Reduce processes. On its own cannot be considered a recommender system, but with the help of the other studied software, Mahout, it is possible to make use of the distributed ...
openaire +1 more source

