Results 171 to 180 of about 54,414 (244)
Some of the next articles are maybe not open access.

An Efficient Data Duplication System based on Hadoop Distributed File System

International Congress on Information and Communication Technology, 2020
HDFS [Hadoop Distributed File System] a part of Apache Hadoop to store large data set consistently. HDFS is used for process Massive-Scale Data in parallel and it ensures accessibility of facts by replicating data to different nodes.
D. Veeraiah, J. Rao
semanticscholar   +1 more source

A Dynamic and Failure-Aware Task Scheduling Framework for Hadoop

IEEE Transactions on Cloud Computing, 2020
Hadoop has become a popular framework for processing data-intensive applications in cloud environments. A core constituent of Hadoop is the scheduler, which is responsible for scheduling and monitoring the jobs and tasks, and rescheduling them in case of
Mbarka Soualhia   +2 more
exaly   +2 more sources

A Critical Analysis of Apache Hadoop and Spark for Big Data Processing

2021 6th International Conference on Signal Processing, Computing and Control (ISPCC), 2021
The emergence of big data processing platforms that can work globally in an integrated manner and process the huge datasets efficiently has become very significant.
Piyush Sewal, Hari Singh
semanticscholar   +1 more source

Hadoop++

Proceedings of the VLDB Endowment, 2010
MapReduce is a computing paradigm that has gained a lot of attention in recent years from industry and research. Unlike parallel DBMSs, MapReduce allows non-expert users to run complex analytical tasks over very large data sets on very large clusters and clouds.
Jens Dittrich   +5 more
openaire   +1 more source

Apache Hadoop YARN: yet another resource negotiator

ACM Symposium on Cloud Computing, 2013
The initial design of Apache Hadoop [1] was tightly focused on running massive, MapReduce jobs to process a web crawl. For increasingly diverse companies, Hadoop has become the data and computational agorá---the de facto place where data and ...
Vinod Kumar Vavilapalli   +15 more
semanticscholar   +1 more source

Big data approach for sentiment analysis of twitter data using Hadoop framework and deep learning

2020 International Conference on Emerging Trends in Information Technology and Engineering (ic-ETITE), 2020
Sentiment analysis acquired a great area of attention in the microblogging websites and analysis of sentiment is a practice of categorization and identification of opinions that are articulated as speech, text, database sources and tweets to detect if ...
Mudassir Khan, Aadarsh Malviya
semanticscholar   +1 more source

Sports performance prediction model based on integrated learning algorithm and cloud computing Hadoop platform

Microprocessors and microsystems, 2020
This article discusses the classification and research performance information properties. It also discusses construction and application of the Hadoop cloud computing platform.
Haiyun Zhu, Xu Yizhe
semanticscholar   +1 more source

V-Hadoop: Virtualized Hadoop using containers

2016 IEEE 15th International Symposium on Network Computing and Applications (NCA), 2016
MapReduce is a popular programming model used to process large amounts of data by exploiting parallelism. Open-source implementations of MapReduce such as Hadoop are generally best suited for large, homogeneous clusters of commodity machines. However, many businesses cannot afford to invest in such infrastructure and others are reluctant to use cloud ...
Srihari Radhakrishnan   +2 more
openaire   +1 more source

Hadoop-MCC: Efficient Multiple Compound Comparison Algorithm Using Hadoop

Combinatorial Chemistry & High Throughput Screening, 2018
Aim and Objective: In the past decade, the drug design technologies have been improved enormously. The computer-aided drug design (CADD) has played an important role in analysis and prediction in drug development, which makes the procedure more economical and efficient.
Guan-Jie, Hua   +2 more
openaire   +2 more sources

Performance Analysis of Distributed Computing Frameworks for Big Data Analytics: Hadoop Vs Spark

Journal of Computacion y Sistemas, 2020
In the last one decade, the tremendous growth in data emphasizes big data storage and management issues with the highest priorities. For providing better support to software developers for dealing with big data problems, new programming platforms are ...
Shwet Ketu, P. K. Mishra, Sonali Agarwal
semanticscholar   +1 more source

Home - About - Disclaimer - Privacy