Results 41 to 50 of about 41,447 (233)

Efficient Prefetching and Client-Side Caching Algorithms for Improving the Performance of Read Operations in Distributed File Systems

open access: yesIEEE Access, 2022
Modern web applications are deployed in cloud computing systems because they support unlimited storage and computing power. One of the main back-end storage components of this cloud computing system is the distributed file system which allows massive ...
Anusha Nalajala   +2 more
doaj   +1 more source

Parsing MetaMap Files in Hadoop [PDF]

open access: yes, 2017
The UMLS::Association CUICollector module identifies UMLS Concept Unique Identifier bigrams and their frequencies in a biomedical text corpus. CUICollector was re-implemented in Hadoop MapReduce to improve algorithm speed, flexibility, and scalability ...
Cano, Alberto   +2 more
core   +1 more source

Advancing Organic Chemistry Using High‐Throughput Experimentation

open access: yesAngewandte Chemie, Volume 137, Issue 40, September 26, 2025.
This review outlines major advances in the design, execution, analysis, and data management phases of high‐throughput experimentation (HTE). The limitations and potential opportunities of applying modern HTE to organic synthesis are highlighted. Abstract High‐throughput experimentation (HTE), the miniaturization and parallelization of reactions, is a ...
Reem Nsouli   +2 more
wiley   +2 more sources

Data locality in Hadoop [PDF]

open access: yes, 2016
Current market tendencies show the need of storing and processing rapidly growing amounts of data. Therefore, it implies the demand for distributed storage and data processing systems.
Kaluzka, Justyna
core   +1 more source

Garbage collection auto-tuning for Java MapReduce on Multi-Cores [PDF]

open access: yes, 2011
MapReduce has been widely accepted as a simple programming pattern that can form the basis for efficient, large-scale, distributed data processing. The success of the MapReduce pattern has led to a variety of implementations for different computational ...
Allen E.   +10 more
core   +1 more source

An Overview of Deep Learning Techniques for Big Data IoT Applications

open access: yesInternational Journal of Communication Systems, Volume 39, Issue 4, 10 March 2026.
Reviews deep learning integration with cloud, fog, and edge computing in IoT architectures. Examines model suitability across IoT applications, key challenges, and emerging trends Provides a comparative analysis to guide future deep learning research in IoT environments.
Gagandeep Kaur   +2 more
wiley   +1 more source

Hadoop Configuration Tuning With Ensemble Modeling and Metaheuristic Optimization

open access: yesIEEE Access, 2018
MapReduce is a popular programming model for big data processing. Although the distributed processing framework Hadoop greatly reduced the development complexity of MapReduce applications, fine tuning of the Hadoop systems for optimal performance remains
Xingcheng Hua   +2 more
doaj   +1 more source

Big Data Architectures and Concepts

open access: yesJournal of Innovation Information Technology and Application, 2023
Nowadays, the processing of big data has become a major preoccupation for businesses, not only for storage and processing but also for operational requirements such as speed, maintaining performance with scalability, reliability, availability, security ...
Audrey Tembo Welo   +3 more
doaj   +1 more source

Parallel detrended fluctuation analysis for fast event detection on massive PMU data [PDF]

open access: yes, 2015
("(c) 2015 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other users, including reprinting/ republishing this material for advertising or promotional purposes, creating new collective works for resale or ...
Ashton, PM   +5 more
core   +1 more source

A Multi‐Layered Analysis of Energy Consumption in Spark

open access: yesConcurrency and Computation: Practice and Experience, Volume 38, Issue 3, February 2026.
ABSTRACT Although energy has become a major concern in data processing systems, it is usually hard to get a deep understanding of how performance and energy consumption relate to each other when planning how to configure a computing environment to execute a specific data‐oriented workload.
Nestor D. O. Volpini   +2 more
wiley   +1 more source

Home - About - Disclaimer - Privacy