OpenMP application experiences: Porting to accelerated nodes
As recent enhancements to the OpenMP specification become available in its implementations, there is a need to share the results of experimentation in order to better understand the OpenMP implementation’s behavior in practice, to identify pitfalls, and ...
Seonmyeong Bak +23 more
semanticscholar +1 more source
This chapter introduces the design of the OpenMP runtime and its key components, the offloading library and the tasking runtime library. Starting from the execution model introduced in the previous chapters, we first abstractly describe the main interactions among the main actors involved in program execution.
Marongiu A., Tagliavini G., Quinones E.
openaire +2 more sources
An accelerated framework for high-resolution X-ray holographic reconstruction. [PDF]
HiHolo, a high‐performance CUDA‐MPI software framework for X‐ray holographic reconstruction, achieves performance improvement over existing solutions while introducing three enhanced iterative algorithms that effectively reduce artifacts and improve spatial resolution in propagation‐based phase contrast imaging. X‐ray propagation‐based phase contrast ...
Hu J +5 more
europepmc +2 more sources
Performance analysis of CUDA, OpenACC and OpenMP programming models on TESLA V100 GPU
Graphics processors are widely used as accelerators in modern supercomputers. The ability to parallelize efficiently and to work at a low level allows scientists to greatly boost the performance of their codes.
M. Khalilov, Alexey Timoveev
semanticscholar +1 more source
The article presents a software implementation of an efficient, fast parallel computational algorithm for solving the Cauchy problem for a nonlinear differential equation of variable fractional order.
Tverdyi, D.A. +3 more
doaj +1 more source
Venous Vessel Size Imaging Derived From A Breath-Hold Task. [PDF]
In this study, we propose the utilization of a spin‐ and gradient‐echo echo‐planar imaging approach, in conjunction with a numerical simulation, for the quantitative assessment of venous vessel size measurements during a breath‐hold task. Abstract: Vessel size imaging (VSI) to provide a measure of vessel radius in the brain has been demonstrated using ...
Zhang K +9 more
europepmc +2 more sources
Developing Efficient Discrete Simulations on Multicore and GPU Architectures [PDF]
In this paper we show how to efficiently implement parallel discrete simulations on multicore and GPU architectures through a real example of an application: a cellular automata model of laser dynamics.
Cagigas Muñiz, Daniel +4 more
core +1 more source
Development of a hybrid parallel algorithm (MPI + OpenMP) for solving the Poisson equation
This article presents the development of a hybrid parallel algorithm for solving the Dirichlet problem for the two-dimensional Poisson equation. MPI and OpenMP were chosen as the parallelization technologies.
Y. G. Kenzhebek +2 more
doaj +1 more source
Many real-world systems are profitably described as complex networks that grow over time. Preferential attachment and node fitness are two simple growth mechanisms that not only explain certain structural properties commonly observed in real-world ...
Thong Pham +2 more
doaj +1 more source
Experiences in porting mini-applications to OpenACC and OpenMP on heterogeneous systems [PDF]
This article studies mini-applications—Minisweep, GenASiS, GPP, and FF—that use computational methods commonly encountered in HPC. We have ported these applications to develop OpenACC and OpenMP versions, and evaluated their performance on Titan (Cray ...
Budiardja, RD +5 more
core +2 more sources

