Attribute based honey encryption algorithm for securing big data: Hadoop distributed file system perspective [PDF]
Hadoop has become a promising platform to reliably process and store big data. It provides flexible, low-cost services for huge volumes of data through Hadoop Distributed File System (HDFS) storage.
Gayatri Kapil +5 more
doaj +3 more sources
MapReduce scheduling algorithms in Hadoop: a systematic study
Hadoop is a framework for storing and processing huge volumes of data on clusters. It uses Hadoop Distributed File System (HDFS) for storing data and uses MapReduce to process that data.
Soudabeh Hedayati +5 more
semanticscholar +1 more source
The traditional distributed database storage architecture suffers from low efficiency and limited storage capacity when managing data resources for seafood products. We reviewed various storage and retrieval technologies for big data resources.
Yajun Wang +4 more
semanticscholar +1 more source
Big Data Technology Fusion Back Propagation Neural Network in Product Innovation Design Method
This research combines big data technology with the product innovation design process, which has certain significance for forming an intelligent and systematic product innovation design method. In addition, while predicting the results of all product innovation design methods, it can improve the product's predictive innovative design ...
Ren Li, Qiang Zeng
wiley +1 more source
Small file processing in Hadoop is a challenging task. Hadoop performs well when dealing with large files because they require less metadata and consume less memory. But while dealing with an enormous number of small files,
Vijay Shankar Sharma +4 more
semanticscholar +1 more source
New Scheduling Algorithms for Improving Performance and Resource Utilization in Hadoop YARN Clusters
The MapReduce framework has become the de facto scheme for scalable semi-structured and unstructured data processing in recent years. The Hadoop ecosystem has evolved into its second generation, Hadoop YARN, which adopts fine-grained resource management ...
Yi Yao +4 more
semanticscholar +1 more source
Performance optimization of computing task scheduling based on the Hadoop big data platform
Hadoop, a distributed computing framework that can efficiently process large-scale datasets, has been used by an increasing number of organizations as the basic computing framework to build cloud computing platforms. Improving its execution efficiency is
Yang Li, Xinhong Hei
semanticscholar +1 more source
Big Data analytics for storing, processing, and analyzing large-scale datasets has become an essential tool for the industry. The advent of distributed computing frameworks such as Hadoop and Spark offers efficient solutions to analyze vast amounts of ...
N. Ahmed +3 more
semanticscholar +1 more source
Fuzzy high-utility pattern mining in parallel and distributed Hadoop framework
Over the past decade, high-utility itemset mining (HUIM) has received widespread attention because it can emphasize more critical information than was previously possible using frequent itemset mining (FIM). Unfortunately, HUIM is very similar to FIM since the
J. Wu +4 more
semanticscholar +1 more source
Processing Big Data with Apache Hadoop in the Current Challenging Era of COVID-19
Big data have become a global strategic issue, as increasingly large amounts of unstructured data challenge the IT infrastructure of global organizations and threaten their capacity for strategic forecasting. As experienced in former massive information ...
Otmane Azeroual, Renaud Fabre
semanticscholar +1 more source

