Results 41 to 50 of about 36,780 (203)
Decoupled Strategy for Imbalanced Workloads in MapReduce Frameworks
In this work, we consider the integration of MPI one-sided communication and non-blocking I/O in HPC-centric MapReduce frameworks. Using a decoupled strategy, we aim to overlap the Map and Reduce phases of the algorithm by allowing processes to ...
Brabazon, Keeran +5 more
core +1 more source
Multithread Approximation: An OpenMP Constructor
ABSTRACT This study introduces an OpenMP construct designed to simplify and unify the integration of approximate computing techniques into shared‐memory parallel programs. Approximate Computing leverages the inherent error tolerance of many applications to trade computational accuracy for gains in performance and energy efficiency.
João Briganti de Oliveira +2 more
wiley +1 more source
Fast clustering using MapReduce [PDF]
Clustering problems have numerous applications and are becoming more challenging as the size of the data increases. In this paper, we consider designing clustering algorithms that can be used in MapReduce, the most popular programming environment for processing large datasets. We focus on the practical and popular clustering problems, $k$-center and $k$
Ene, Alina +2 more
openaire +2 more sources
A lightweight MapReduce framework for secure processing with SGX
MapReduce is a programming model used extensively for parallel data processing in distributed environments. A wide range of algorithms were implemented using MapReduce, from simple tasks like sorting and searching up to complex clustering and machine ...
Felber, Pascal +4 more
core +1 more source
Neural Network Models for Solar Irradiance Forecasting in Polluted Areas: A Comparative Study
Pollution‐aware hybrid ensemble model is proposed to forecast solar irradiance across eight diverse cities. The model integrates MLP, RNN, and NARX to handle varying atmospheric pollution levels. The model outperforms state‐of‐the‐art methods with enhanced accuracy and interpretability on standard solar irradiance data set.
Mujtaba Ali +6 more
wiley +1 more source
Pre-Processing and Modeling Tools for Bigdata
Modeling tools and operators help the user / developer to identify the processing field on the top of the sequence and to send into the computing module only the data related to the requested result.
Hashem Hadi, Ranc Daniel
doaj +1 more source
Comparative Study Parallel Join Algorithms for MapReduce environment
There are the following techniques that are used to analyze massive amounts of data: MapReduce paradigm, parallel DBMSs, column-wise store, and various combinations of these approaches. We focus in a MapReduce environment.
A. Yu. Pigul
doaj +1 more source
MapReduce is Good Enough? If All You Have is a Hammer, Throw Away Everything That's Not a Nail! [PDF]
Hadoop is currently the large-scale data analysis "hammer" of choice, but there exist classes of algorithms that aren't "nails", in the sense that they are not particularly amenable to the MapReduce programming model.
Lin, Jimmy
core +1 more source
Exploring the Power of Machine Learning in Analysing Protein–Protein Sequences
Figure 2 depicts the structure of a peptide bond formed between amino acids to form a polypeptide chain. ABSTRACT Proteins are fundamental biological macromolecules responsible for regulating nearly all cellular processes, and their functions are largely determined by the underlying amino acid sequences.
Anindya Nag +8 more
wiley +1 more source
A successful deployment of Industry 5.0 is significantly dependent on the synergetic integration of several advanced technologies such as big data processing, Artificial Intelligence (AI) integration, and several effective digitization techniques that ...
Arnab Mitra
doaj +1 more source

