Results 41 to 50 of about 9,135 (230)
Effective storage, processing and analyzing of power device condition monitoring data faces enormous challenges. A framework is proposed that can support both MapReduce and Graph for massive monitoring data analysis at the same time based on Aliyun ...
Hongtao Shen, Peng Tao, Pei Zhao, Hao Ma
doaj +1 more source
Clustering‐based recommendation method with enhanced grasshopper optimisation algorithm
Abstract In the era of big data, personalised recommendation systems are essential for enhancing user engagement and driving business growth. However, traditional recommendation algorithms, such as collaborative filtering, face significant challenges due to data sparsity, algorithm scalability, and the difficulty of adapting to dynamic user preferences.
Zihao Zhao+9 more
wiley +1 more source
Secure Grouping and Aggregation with MapReduce [PDF]
MapReduce programming paradigm allows to process big data sets in parallel on a large cluster. We focus on a scenario where the data owner outsources her data on an honest-but-curious server. Our aim is to evaluate grouping and aggregation with SUM, COUNT, AVG, MIN, and MAX operations for an authorized user. For each of these five operations, we assume
Ciucanu, Radu+3 more
openaire +4 more sources
An Alternative C++ based HPC system for Hadoop MapReduce [PDF]
MapReduce is a technique used to vastly improve distributed processing of data and can massively speed up computation. Hadoop and its MapReduce relies on JVM and Java which is expensive on memory. High Performance Computing based MapReduce framework could be used that can perform more memory-efficiently and faster than the standard MapReduce.
arxiv
Interpretable decision-tree induction in a big data parallel framework
When running data-mining algorithms on big data platforms, a parallel, distributed framework, such asMAPREDUCE, may be used. However, in a parallel framework, each individual model fits the data allocated to its own computing node without necessarily ...
Weinberg Abraham Itzhak, Last Mark
doaj +1 more source
D 3 -MapReduce: Towards MapReduce for Distributed and Dynamic Data Sets [PDF]
International ...
He, Haiwu+12 more
openaire +5 more sources
As the need for large-scale data analysis is rapidly increasing, Hadoop, or the platform that realizes large-scale data processing, and MapReduce, or the internal computational model of Hadoop, are receiving great attention. This paper reviews the basic concepts of Hadoop and MapReduce necessary for data analysts who are familiar with statistical ...
Joong-Ho Won+3 more
openaire +2 more sources
Autonomous Intelligent Monitoring of Photovoltaic Systems: An In‐Depth Multidisciplinary Review
This article examines the autonomous monitoring and analysis of PV plants, highlighting key barriers and research directions for smart monitoring of PV systems. The discussed topics cover UAV choice and related standards, camera varieties, AI applications in monitoring, datasets, and data enhancement methods, as well as data exchange between UAVs and ...
M. Aghaei+12 more
wiley +1 more source
Grid computing is an emerging technology that enabled the heterogeneous collection of data and provisioning of services to the users. Due to the high amount of incoming heterogeneous request, grid computing needs an efficient scheduling to reduce execution time and satisfy service level agreement (SLA) and quality of service (QoS) requirements.
Gangasandra Mahadevaiah Kiran+1 more
wiley +1 more source
Big Data Analytics for Healthcare Industry: Impact, Applications, and Tools
In recent years, huge amounts of structured, unstructured, and semi-structured data have been generated by various institutions around the world and, collectively, this heterogeneous data is referred to as big data.
Sunil Kumar, Maninder Singh
doaj +1 more source