Parallel Solution and Optimization of Large-Scale Sparse Linear System in GRAPES Dynamic Framework
The Helmholtz equation is the core of the dynamic framework of the Global and Regional Assimilation Prediction System (GRAPES) for numerical weather forecasting. Solving this equation essentially reduces to solving a large-scale sparse linear system, but
ZHANG Kun, JIA Jinfang, YAN Wenxin, HUANG Jianqiang, WANG Xiaoying
doaj +1 more source
New execution model for CAPE using multiple threads on multicore clusters
Based on its simplicity and user‐friendly characteristics, OpenMP has become the standard model for programming on shared‐memory architectures. Checkpointing‐aided parallel execution (CAPE) is an approach that utilizes the discontinuous incremental ...
Xuan Huyen Do +3 more
doaj +1 more source
Optimizing Iterative Data-Flow Scientific Applications Using Directed Cyclic Graphs
Data-flow programming models have become a popular choice for writing parallel applications as an alternative to traditional work-sharing parallelism. They are better suited to write applications with irregular parallelism that can present load imbalance.
David Alvarez, Vicenc Beltran
doaj +1 more source
A Review of Lightweight Thread Approaches for High Performance Computing
High-level, directive-based solutions are becoming the programming models (PMs) of multi/many-core architectures. Several solutions relying on operating system (OS) threads work perfectly with a moderate number of cores.
Balaji, Pavan +5 more
core +1 more source
Spectral analytical method of recognition of inexact repeats in character sequences
A theoretical basis and an algorithmic implementation are proposed for a spectral-analytical method of recognizing repeats in character sequences. The theoretical justification is based on the theorem on equivalent representation of the character sequence by ...
A. N. Pankratov +4 more
doaj +1 more source
An accelerated framework for high-resolution X-ray holographic reconstruction.
HiHolo, a high‐performance CUDA‐MPI software framework for X‐ray holographic reconstruction, achieves performance improvement over existing solutions while introducing three enhanced iterative algorithms that effectively reduce artifacts and improve spatial resolution in propagation‐based phase contrast imaging. X‐ray propagation‐based phase contrast ...
Hu J +5 more
europepmc +2 more sources
ORC-OpenMP: An OpenMP Compiler Based on ORC
This paper introduces a translation and optimization framework for OpenMP, based on the classification of OpenMP translation types. An open source OpenMP compiler that implements this framework is also introduced as a high performance research platform for Linux/IA-64.
Yongjian Chen +3 more
openaire +1 more source
On the Benefits of Tasking with OpenMP
Tasking promises a model to program parallel applications that provides intuitive semantics. In the case of tasks with dependences, it also promises better load balancing by removing global synchronizations (barriers), and potential for improved locality. Still, the adoption of tasking in production HPC codes has been slow.
Alejandro Rico +5 more
openaire +2 more sources
Hints on the Multicore CPU Design for Personal Computers
Multi-core processors are created by combining multiple units, each called a core, containing a processing unit, registers, and a cache. This study provides a detailed analysis of the performance of multicore CPUs for personal computers.
İren Ecem +2 more
doaj +1 more source
GPU Parallelization of a Hybrid Pseudospectral Geophysical Turbulence Framework Using CUDA
An existing hybrid MPI-OpenMP scheme is augmented with a CUDA-based fine grain parallelization approach for multidimensional distributed Fourier transforms, in a well-characterized pseudospectral fluid turbulence code.
Duane Rosenberg +3 more
doaj +1 more source

