An Architecture for a Tri-Programming Model-Based Parallel Hybrid Testing Tool
As the development of high-performance computing (HPC) is growing, exascale computing is on the horizon. Therefore, it is imperative to develop parallel systems, such as graphics processing units (GPUs) and programming models, that can effectively ...
Saeed Musaad Altalhi +6 more
doaj +1 more source
MILC Code Performance on High End CPU and GPU Supercomputer Clusters [PDF]
With recent developments in parallel supercomputing architecture, many core, multi-core, and GPU processors are now commonplace, resulting in more levels of parallelism, memory hierarchy, and programming complexity.
DeTar, Carleton +3 more
core +2 more sources
The article presents a study of the computational efficiency of a hybrid parallel algorithm that implements a nonlocal implicit finite-difference scheme (IFDS) for the numerical solution of the problem of radon volumetric activity (RVA) dynamics.
Твёрдый, Д.А.
doaj +1 more source
Parallel Simulations of Dynamic Earthquake Rupture along Geometrically Complex Faults on CMP Systems
Chip multiprocessors (CMP) are widely used for high performance computing and are being configured in a hierarchical manner to compose a CMP compute node in a CMP system.
Xingfu Wu, Benchun Duan, Valerie Taylor
doaj +1 more source
Characterizing task-based OpenMP programs. [PDF]
Programmers struggle to understand performance of task-based OpenMP programs since profiling tools only report thread-based performance. Performance tuning also requires task-based performance in order to balance per-task memory hierarchy utilization ...
Ananya Muddukrishna +2 more
doaj +1 more source
To extract performance from supercomputers, programmers in the High Performance Computing (HPC) community are often required to use a combination of frameworks to take advantage of the multiple levels of parallelism. However, over the years, efforts have
Christian Asch +2 more
doaj +1 more source
CoreTSAR: Task Scheduling for Accelerator-aware Runtimes [PDF]
Heterogeneous supercomputers that incorporate computational accelerators such as GPUs are increasingly popular due to their high peak performance, energy efficiency and comparatively low cost.
de Supinski, Bronis R. +3 more
core +1 more source
Implementing implicit OpenMP data sharing on GPUs
OpenMP is a shared memory programming model which supports the offloading of target regions to accelerators such as NVIDIA GPUs. The implementation in Clang/LLVM aims to deliver a generic GPU compilation toolchain that supports both the native CUDA C/C++
Bataev, Alexey +8 more
core +1 more source
Abstract figure legend: Left atrial myocytes have TAT‐associated faster Ca²⁺ release but are more prone to maladaptation at higher frequencies due to weaker peripheral SR Ca²⁺ uptake and smaller trigger Ca²⁺ current. Abstract: Changes in heart rate affect Ca²⁺ signalling and contractility in ventricular muscle, but the effects on atrial Ca²⁺ signalling ...
Joon‐Chul Kim +6 more
wiley +1 more source
Seismic Numerical Simulation Based on Half-precision Floating-point Number Optimization and OpenMP
Studying the propagation law of seismic wavefields is the basis of seismic exploration. The finite difference method is commonly used to solve the wave equation for seismic forward modeling, which has advantages of low memory usage, fast computation ...
Wenge LIU +4 more
doaj +1 more source

