Results 21 to 30 of about 54,414 (244)
A Comprehensive Survey for Hadoop Distributed File System
Recently, data volumes and internet usage have grown rapidly, giving rise to big data. To address the resulting problems, many software frameworks are used to improve the performance of distributed systems.
Karwan Jameel Merceedi +1 more
semanticscholar +1 more source
LogM: Log Analysis for Multiple Components of Hadoop Platform
The Hadoop platform provides a powerful software framework for distributed storage and processing of massive amounts of data. It is at the heart of big data processing and has found numerous applications in diverse areas, ranging from environmental ...
Yuxia Xie, Kai Yang, Pan Luo
semanticscholar +1 more source
In this paper, we discuss some challenges regarding the Hadoop framework. One of the main ones is the computing performance of Hadoop MapReduce jobs in terms of CPU, memory, and hard disk I/O. The networking side of a Hadoop cluster is another challenge,
Ali Khaleel, Hamed Al-Raweshidy
doaj +1 more source
A Comparative Study of Scheduling Algorithm Performance for Real-Time Twitter Data on Hadoop
Hadoop is an open-source, Java-based software framework. It consists of two main components: MapReduce and the Hadoop Distributed File System (HDFS).
Sidik Prabowo, Maman Abdurohman
doaj +1 more source
'Big Data' describes technologies to store, distribute, manage, and analyze large, high-velocity datasets. Big data can be structured, unstructured, or semi-structured, which conventional data management methods cannot handle. Data is generated from many different sources and can arrive in the system at varying rates. In order
Yash Patel, Prof. Manish Joshi
openaire +1 more source
BlockHDFS: Blockchain-integrated Hadoop distributed file system for secure provenance traceability
Hadoop Distributed File System (HDFS) is one of the most widely used distributed file systems in Big Data analysis for frameworks such as Hadoop. It is used to manage large volumes of data on low-cost commodity hardware.
Viraaji Mothukuri +4 more
semanticscholar +1 more source
CloudDOE: a user-friendly tool for deploying Hadoop clouds and analyzing high-throughput sequencing data with MapReduce. [PDF]
Background: Explosive growth of next-generation sequencing data has resulted in ultra-large-scale data sets and ensuing computational problems. Cloud computing provides an on-demand and scalable environment for large-scale data analysis.
Wei-Chun Chung +9 more
doaj +1 more source
Research on intelligent medical big data system based on Hadoop and blockchain
In order to improve the intelligence of the medical system, this paper designs and implements a secure medical big data ecosystem on top of the Hadoop big data platform.
Xiangfeng Zhang, Yanmei Wang
semanticscholar +1 more source
OEHadoop: Accelerate Hadoop Applications by Co-Designing Hadoop With Data Center Network
Big data applications in Hadoop usually cause heavy bandwidth demand and network bottleneck in the current data center network (DCN). On one hand, the design of DCN does not take the traffic demand and the traffic patterns of Hadoop applications into ...
Yinan Tang +7 more
doaj +1 more source
An Intrusive Analyzer for Hadoop Systems Based on Wireless Sensor Networks
Owing to the acceleration of IoT- (Internet of Things-) based wireless sensor networks, cloud-computing services using Big Data are rapidly growing. In order to manage and analyze Big Data efficiently, Hadoop frameworks have been used in a variety of ...
Byoung-Jin Bae +4 more
doaj +1 more source

