Results 11 to 20 of about 54,414 (244)

Attribute based honey encryption algorithm for securing big data: Hadoop distributed file system perspective [PDF]

open access: yesPeerJ Computer Science, 2020
Hadoop has become a promising platform to reliably process and store big data. It provides flexible and low cost services to huge data through Hadoop Distributed File System (HDFS) storage.
Gayatri Kapil   +5 more
doaj   +3 more sources

MapReduce scheduling algorithms in Hadoop: a systematic study

open access: yesJournal of Cloud Computing, 2023
Hadoop is a framework for storing and processing huge volumes of data on clusters. It uses Hadoop Distributed File System (HDFS) for storing data and uses MapReduce to process that data.
Soudabeh Hedayati   +5 more
semanticscholar   +1 more source

Block Storage Optimization and Parallel Data Processing and Analysis of Product Big Data Based on the Hadoop Platform

open access: yesMathematical Problems in Engineering, 2021
The traditional distributed database storage architecture has the problems of low efficiency and storage capacity in managing data resources of seafood products. We reviewed various storage and retrieval technologies for the big data resources.
Yajun Wang   +4 more
semanticscholar   +1 more source

Big Data Technology Fusion Back Propagation Neural Network in Product Innovation Design Method

open access: yesIET Networks, Accepted Article., 2022
This research uses big data technology to combine the process of product innovation design method, which has certain significance for the formation of intelligent and systematic product innovation design method. Meanwhile, while predicting the results of all products innovative design methods, it can improve the product's predictive innovative design ...
Ren Li, Qiang Zeng
wiley   +1 more source

A Dynamic Repository Approach for Small File Management With Fast Access Time on Hadoop Cluster: Hash Based Extended Hadoop Archive

open access: yesIEEE Access, 2022
Small file processing in Hadoop is one of the challenging task. The performance of the Hadoop is quite good when dealing with large files because they require lesser metadata and consume less memory. But while dealing with enormous amount of small files,
Vijay Shankar Sharma   +4 more
semanticscholar   +1 more source

New Scheduling Algorithms for Improving Performance and Resource Utilization in Hadoop YARN Clusters

open access: yesIEEE Transactions on Cloud Computing, 2021
The MapReduce framework has become the defacto scheme for scalable semi-structured and un-structured data processing in recent years. The Hadoop ecosystem has evolved into its second generation, Hadoop YARN, which adopts fine-grained resource management ...
Yi Yao   +4 more
semanticscholar   +1 more source

Performance optimization of computing task scheduling based on the Hadoop big data platform

open access: yesNeural computing & applications (Print), 2022
Hadoop, a distributed computing framework that can efficiently process large-scale datasets, has been used by an increasing number of organizations as the basic computing framework to build cloud computing platforms. Improving its execution efficiency is
Yang Li, Xinhong Hei
semanticscholar   +1 more source

A comprehensive performance analysis of Apache Hadoop and Apache Spark for large scale data sets using HiBench

open access: yesJournal of Big Data, 2020
Big Data analytics for storing, processing, and analyzing large-scale datasets has become an essential tool for the industry. The advent of distributed computing frameworks such as Hadoop and Spark offers efficient solutions to analyze vast amounts of ...
N. Ahmed   +3 more
semanticscholar   +1 more source

Fuzzy high-utility pattern mining in parallel and distributed Hadoop framework

open access: yesInformation Sciences, 2021
Over the past decade, high-utility itemset mining (HUIM) has received widespread attention that can emphasize more critical information than was previously possible using frequent itemset mining (FIM). Unfortunately, HUIM is very similar to FIM since the
J. Wu   +4 more
semanticscholar   +1 more source

Processing Big Data with Apache Hadoop in the Current Challenging Era of COVID-19

open access: yesBig Data and Cognitive Computing, 2021
Big data have become a global strategic issue, as increasingly large amounts of unstructured data challenge the IT infrastructure of global organizations and threaten their capacity for strategic forecasting As experienced in former massive information ...
Otmane Azeroual, Renaud Fabre
semanticscholar   +1 more source

Home - About - Disclaimer - Privacy