Omphale: Streamlining the Communication for Jobs in a Multi Processor System on Chip [PDF]
Our Multi Processor System on Chip (MPSoC) template provides processing tiles that are connected via a network on chip. A processing tile contains a processing unit and a Scratch Pad Memory (SPM).
Bekooij, M.J.G. +3 more
core +2 more sources
Hybrid parallel programming with tasks
This technical report is an introduction to using a hybrid parallel programming model that combines MPI with OmpSs or OpenMP dependent tasks. This model allows both computation and communication to be expressed using a coarse-grained dataflow approach, which helps to remove most of the unnecessary ordering constraints and intranode synchronisation ...
J. Mark Bull, Jiehong Yu
openaire +2 more sources
The Integration of Task and Data Parallel Skeletons [PDF]
We describe a skeletal parallel programming library which integrates task and data parallel constructs within an API for C++. Traditional skeletal requirements for higher orderness and polymorphism are achieved through exploitation of operator overloading and templates, while the underlying parallelism is provided by MPI.
Kuchen, H. (Herbert), Cole, M. (Murray)
openaire +3 more sources
DBEFT: A Dependency-Ratio Bundling Earliest Finish Time Algorithm for Heterogeneous Computing
Performance-effective task scheduling algorithms are essential for taking advantage of heterogeneous multiprocessors in heterogeneous computing environments.
Tao Li +6 more
doaj +1 more source
The Dynamic Partial Reconfiguration function of reconfigurable devices permits tasks to be performed simultaneously on a single device. Nevertheless, task placement and resource management problems emerge with the parallelism of reconfigurable devices ...
Tingyu Zhou +4 more
doaj +1 more source
Divided we stand: Parallel distributed stack memory management [PDF]
We present an overview of the stack-based memory management techniques that we used in our non-deterministic and-parallel Prolog systems: &-Prolog and DASWAM.
Hermenegildo, Manuel V., Shen, Kish
core +1 more source
Multi-task Parallel Algorithm for DSRC [PDF]
A multi-task parallel algorithm is applied to accelerate DNA Sequence Reads Compressor (DSRC), a program specialized for compressing DNA sequencing data files in FASTQ format. The compression process is first divided into two parallel tasks: a data processing task and a data input task.
Wang Ting +3 more
openaire +1 more source
Response errors explain the failure of independent-channels models of perception of temporal order
Independent-channels models of perception of temporal order (also referred to as threshold models or perceptual latency models) have been ruled out because two formal properties of these models (monotonicity and parallelism) are not borne out by data ...
Miguel A García-Pérez +1 more
doaj +1 more source
Enhancement of GPU-accelerated smoothed particle hydrodynamics (SPH) method with dynamic parallelism
An innovative GPU programming architecture leveraging CUDA Dynamic Parallelism (CDP) is introduced in this study, aiming to enhance the computational efficiency of Smoothed Particle Hydrodynamics (SPH) simulations.
Liwen Xue, Shenglong Gu, Songdong Shao
doaj +1 more source
Towards Task-Parallel Reductions in OpenMP
Reductions represent a common algorithmic pattern in many scientific applications. OpenMP has always supported them on parallel and worksharing constructs. OpenMP 3.0’s tasking constructs enable new parallelization opportunities through the annotation of irregular algorithms.
Alex Duran +14 more
openaire +4 more sources