Research on Optimization of Heterogeneous Stencil Computing Based on CPU and GPU [PDF]
As a type of algorithm that uses fixed pattern templates, stencil computing is widely employed in image processing, computational fluid dynamics simulations, and other fields. However, existing stencil computing approaches exhibit problems such as weak ...
LI Bo, HUANG Dongqiang, JIA Jinfang, WU Li, WANG Xiaoying, HUANG Jianqiang
doaj +1 more source
Parallel Compilation Optimization Method for Sunway High Performance Multi-Core Processors [PDF]
In the Sunway high performance multi-core server, the automatic parallelization compiling system produces OpenMP programs that are not sufficiently optimized to identify and assert parallelism in the program. Moreover, the program uses a simple fork-join ...
ZHOU Yonghao, XU Jinlong, LI Bin, QIAN Hong, NIE Kai
doaj +1 more source
An alternative C++-based HPC system for Hadoop MapReduce
MapReduce (MR) is a technique used to vastly improve distributed data processing and can massively speed up computation. Hadoop and MR rely on the memory-intensive JVM and Java. An MR framework based on High-Performance Computing (HPC) could be used, which is
Srinivasakumar Vignesh +3 more
doaj +1 more source
Using OpenMP for HEP framework algorithm scheduling [PDF]
The OpenMP standard is the primary mechanism used at high performance computing facilities to allow intra-process parallelization. In contrast, many HEP specific software packages (such as CMSSW, GaudiHive, and ROOT) make use of Intel’s Threading ...
Jones Christopher, Gartung Patrick
doaj +1 more source
Influence of shortest path algorithms on energy consumption of multi-core processors
Modern multi-core processors, operating systems and applied software are being designed towards energy efficiency, which significantly reduces energy consumption.
A. A. Prihozhy, O. N. Karasik
doaj +1 more source
OpenMP Taskloop Dependences [PDF]
This research has received funding from the European Union’s Horizon 2020/EuroHPC research and innovation programme under grant agreement No 955606 (DEEP-SEA); and the support of the Spanish Ministry of Science and Innovation (Computacion de Altas Prestaciones VIII: PID2019-107255GB).
Maroñas Bravo, Marcos +2 more
openaire +1 more source
High performance 2D convolution utilizing the AVX512 on a multi-core architecture [PDF]
Convolution is a time consuming operation, especially for signal and image processing, which led us to develop an efficient implementation of 2D convolution for a multi-core architecture utilizing AVX512 intrinsics and OpenMP.
Isamail Masamae, Panyayot Chaikan
doaj +1 more source
Comparative Analysis of OpenMP and MPI Parallel Computing Implementations in Team Sort Algorithm
Tim Sort is a sorting algorithm that combines the Merge Sort and Binary Insertion Sort algorithms. Parallel computing is a processing technique in which a computation is divided into several parts that are carried out simultaneously. The application
Eko Dwi Nugroho +4 more
doaj +1 more source
The results of a study of the performance of problem-oriented programs developed using the MPI and OpenMP parallel programming technologies which implement the numerical solution of problems in the framework of two models that are actively used in ...
Maxim Bashashin, Elena Zemlyanaya
doaj +1 more source
This chapter introduces the design of the OpenMP runtime and its key components, the offloading library and the tasking runtime library. Starting from the execution model introduced in the previous chapters, we first abstractly describe the main interactions among the main actors involved in program execution.
Marongiu A., Tagliavini G., Quinones E.
openaire +2 more sources

