Parallel Solution and Optimization of Large-Scale Sparse Linear System in GRAPES Dynamic Framework [PDF]

open access: yes, Jisuanji gongcheng, 2022
The Helmholtz equation is the core of dynamic framework of Global and Regional Assimilation Prediction System (GRAPES) for numerical weather forecast. This equation can essentially be transformed into the solution of a large-scale sparse linear system, but
ZHANG Kun, JIA Jinfang, YAN Wenxin, HUANG Jianqiang, WANG Xiaoying
doaj   +1 more source

New execution model for CAPE using multiple threads on multicore clusters

open access: yes, ETRI Journal, 2021
Based on its simplicity and user‐friendly characteristics, OpenMP has become the standard model for programming on shared‐memory architectures. Checkpointing‐aided parallel execution (CAPE) is an approach that utilizes the discontinuous incremental ...
Xuan Huyen Do   +3 more
doaj   +1 more source

Optimizing Iterative Data-Flow Scientific Applications Using Directed Cyclic Graphs

open access: yes, IEEE Access, 2023
Data-flow programming models have become a popular choice for writing parallel applications as an alternative to traditional work-sharing parallelism. They are better suited to write applications with irregular parallelism that can present load imbalance.
David Alvarez, Vicenc Beltran
doaj   +1 more source

A Review of Lightweight Thread Approaches for High Performance Computing [PDF]

open access: yes, 2016
High-level, directive-based solutions are becoming the programming models (PMs) of the multi/many-core architectures. Several solutions relying on operating system (OS) threads perfectly work with a moderate number of cores.
Balaji, Pavan   +5 more
core   +1 more source

Spectral analytical method of recognition of inexact repeats in character sequences

open access: yes, Труды Института системного программирования РАН, 2018
Proposed are theoretical basis and algorithmic implementation of spectral-analytical method of recognition of repeats in character sequences. The theoretical justification is based on the theorem on equivalent representation of the character sequence by ...
A. N. Pankratov   +4 more
doaj   +1 more source

An accelerated framework for high-resolution X-ray holographic reconstruction. [PDF]

open access: yes, J Synchrotron Radiat
HiHolo, a high‐performance CUDA‐MPI software framework for X‐ray holographic reconstruction, achieves performance improvement over existing solutions while introducing three enhanced iterative algorithms that effectively reduce artifacts and improve spatial resolution in propagation‐based phase contrast imaging. X‐ray propagation‐based phase contrast ...
Hu J   +5 more
europepmc   +2 more sources

ORC-OpenMP: An OpenMP Compiler Based on ORC [PDF]

open access: yes, 2004
This paper introduces a translation and optimization framework for OpenMP, based on the classification of OpenMP translation types. An open source OpenMP compiler that implements this framework is also introduced as a high-performance research platform for Linux/IA-64.
Yongjian Chen   +3 more
openaire   +1 more source

On the Benefits of Tasking with OpenMP [PDF]

open access: yes, 2019
Tasking promises a model to program parallel applications that provides intuitive semantics. In the case of tasks with dependences, it also promises better load balancing by removing global synchronizations (barriers), and potential for improved locality. Still, the adoption of tasking in production HPC codes has been slow.
Alejandro Rico   +5 more
openaire   +2 more sources

Hints on the Multicore CPU Design for Personal Computers

open access: yes, Acta Electrotechnica et Informatica, 2022
Multi-core processors are created by combining multiple units, each called the core, containing the processing unit, registers and a cache. This study provides a detailed analysis on the performances of multicore CPUs for personal computers.
İren Ecem   +2 more
doaj   +1 more source

GPU Parallelization of a Hybrid Pseudospectral Geophysical Turbulence Framework Using CUDA

open access: yes, Atmosphere, 2020
An existing hybrid MPI-OpenMP scheme is augmented with a CUDA-based fine grain parallelization approach for multidimensional distributed Fourier transforms, in a well-characterized pseudospectral fluid turbulence code.
Duane Rosenberg   +3 more
doaj   +1 more source
