Results 11 to 20 of about 42,875 (250)

Advising OpenMP Parallelization via a Graph-Based Approach with Transformers [PDF]

open access: yesInternational Workshop on OpenMP, 2023
There is an ever-present need for shared memory parallelization schemes to exploit the full potential of multi-core architectures. The most common parallelization API addressing this need today is OpenMP.
Tal Kadosh   +5 more
semanticscholar   +1 more source

Quantifying OpenMP: Statistical Insights into Usage and Adoption [PDF]

open access: yesIEEE Conference on High Performance Extreme Computing, 2023
In high-performance computing (HPC), the demand for efficient parallel programming models has grown dramatically since the end of Dennard Scaling and the subsequent move to multi-core CPUs.
Tal Kadosh   +4 more
semanticscholar   +1 more source

Learning to Parallelize with OpenMP by Augmented Heterogeneous AST Representation [PDF]

open access: yesConference on Machine Learning and Systems, 2023
Detecting parallelizable code regions is a challenging task, even for experienced developers. Numerous recent studies have explored the use of machine learning for code analysis and program synthesis, including parallelization, in light of the success of
Le Chen   +4 more
semanticscholar   +1 more source

The OpenMP Cluster Programming Model [PDF]

open access: yesICPP Workshops, 2022
Despite the various research initiatives and proposed programming models, efficient solutions for parallel programming in HPC clusters still rely on a complex combination of different programming models (e.g., OpenMP and MPI), languages (e.g., C++ and ...
H. Yviquel   +11 more
semanticscholar   +1 more source

Taskgraph: A Low Contention OpenMP Tasking Framework [PDF]

open access: yesIEEE Transactions on Parallel and Distributed Systems, 2022
OpenMP is the de-facto standard for shared memory systems in High-Performance Computing (HPC). It includes a tasking model that offers a high-level of abstraction to effectively exploit structured (loop-based) and highly dynamic unstructured (task-based)
Chen Yu, Sara Royuela, Eduardo Quiñones
semanticscholar   +1 more source

ECP SOLLVE: Validation and Verification Testsuite Status Update and Compiler Insight for OpenMP [PDF]

open access: yesInternational Workshop on Performance, Portability and Productivity in HPC, 2022
The OpenMP language continues to evolve with every new specification release, as does the need to validate and verify the new features that have been implemented by the different vendors.
Thomas Huber   +11 more
semanticscholar   +1 more source

ASH: A Multi-Scale, Multi-Theory Modeling Program. [PDF]

open access: yesJ Comput Chem
ASH is a Python‐based computational chemistry software for multi‐scale and multi‐theory computations (including QM/MM and ONIOM) and MD simulations. The program is designed for flexible workflows of molecular and biomolecular systems, allowing geometry optimizations, numerical frequencies, high‐level thermochemistry, MD and free‐energy simulations, NEB
Bjornsson R.
europepmc   +2 more sources

Source-to-Source Automatic Differentiation of OpenMP Parallel Loops [PDF]

open access: yesACM Transactions on Mathematical Software, 2021
This article presents our work toward correct and efficient automatic differentiation of OpenMP parallel worksharing loops in forward and reverse mode. Automatic differentiation is a method to obtain gradients of numerical programs, which are crucial in ...
Jan Hückelheim, Laurent Hascoët
semanticscholar   +1 more source

Efficient Execution of OpenMP on GPUs

open access: yesIEEE/ACM International Symposium on Code Generation and Optimization, 2022
-OpenMP is the preferred choice for CPU parallelism in High-Performance-Computing (HPC) applications written in C, C++, or Fortran. As HPC systems became heterogeneous, OpenMP introduced support for accelerator offloading via the target directive.
Joseph Huber   +7 more
semanticscholar   +1 more source

Co-Designing an OpenMP GPU Runtime and Optimizations for Near-Zero Overhead Execution

open access: yesIEEE International Parallel and Distributed Processing Symposium, 2022
GPU accelerators are ubiquitous in modern HPC systems. To program them, users have the choice between vendor-specific, native programming models, such as CUDA, which provide simple parallelism semantics with minimal runtime support, or portable ...
Johannes Doerfert   +6 more
semanticscholar   +1 more source

Home - About - Disclaimer - Privacy