Towards Transactional Memory for OpenMP
The OpenMP specification lacks a composable shared memory concurrency mechanism: the current OpenMP concurrency mechanisms, such as OMP critical, locks, or atomics, do not support composition. In this paper, we motivate the need for transactional memory (TM) in OpenMP.
Wong, Michael +5 more
openaire +2 more sources
The HPCG benchmark: analysis, shared memory preliminary improvements and evaluation on an Arm-based platform
The High-Performance Conjugate Gradient (HPCG) benchmark complements the LINPACK benchmark in the performance evaluation coverage of large High-Performance Computing (HPC) systems. Due to its lower arithmetic intensity and higher memory pressure, HPCG is
Casas, Marc +4 more
core +1 more source
Sequence alignment is an important tool for describing the relationships between DNA sequences. Many sequence alignment algorithms exist, differing in efficiency, in their models of the sequences, and in the relationship between sequences.
D. D. Shrimankar, S. R. Sathe
doaj +1 more source
OpenMP [14] is the dominant programming model for shared-memory parallelism in C, C++ and Fortran due to its easy-to-use directive-based style, portability and broad support by compiler vendors. Compute-intensive application regions are increasingly being accelerated using devices such as GPUs and DSPs, and a programming model with similar ...
Beyer, J C +3 more
openaire +2 more sources
Setting up environment for OpenMP, description of the basic program environment and running of basic programs. Outline: What is OpenMP, Timeline, Main Terminology, OpenMP Programming Model, Main Components, Parallel Construct, Work-sharing Constructs (sections, single, workshare), Data Clauses (default, shared, private, ...)
openaire +1 more source
OpenMP tasking analysis for programmers
As of 2008, the OpenMP 3.0 standard includes task support allowing programmers to exploit irregular parallelism. Although several compilers are providing support for this new feature there has not been extensive investigation into the real possibilities of this extension.
Teruel, Xavier +7 more
openaire +3 more sources
This work is devoted to the numerical simulation of natural gas flow in unconventional reservoirs in the presence of adsorption and slippage effects.
Grazione de Souza +3 more
doaj +1 more source
Running scientific codes on amazon EC2: a performance analysis of five high-end instances
Amazon Web Services (AWS) is a well-known public Infrastructure-as-a-Service (IaaS) provider whose Elastic Computing Cloud (EC2) offering includes some instances, known as cluster instances, aimed at High-Performance Computing (HPC) applications.
Roberto R. Expósito +4 more
doaj
Unified Schemes for Directive-Based GPU Offloading
GPU is the dominant accelerator device due to its high performance and energy efficiency. Directive-based GPU offloading using OpenACC or OpenMP target is a convenient way to port existing codes originally developed for multicore CPUs.
Yohei Miki, Toshihiro Hanawa
doaj +1 more source
Analysis of OpenMP 4.5 Offloading in Implementations: Correctness and Overhead
The OpenMP language features have been evolving to meet the rapid development in hardware platforms. This journal focuses on evaluating implementations of OpenMP 4.5 target offload features in compilers such as Clang, XL and GCC that are an integral part
J. M. Diaz +5 more
semanticscholar +1 more source

