HSTREAM: A directive-based language extension for heterogeneous stream computing
Big data streaming applications require utilization of heterogeneous parallel computing systems, which may comprise multiple multi-core CPUs and many-core accelerating devices such as NVIDIA GPUs and Intel Xeon Phis.
Memeti, Suejb, Pllana, Sabri
core +1 more source
The HPCG benchmark: analysis, shared memory preliminary improvements and evaluation on an Arm-based platform [PDF]
The High-Performance Conjugate Gradient (HPCG) benchmark complements the LINPACK benchmark in the performance evaluation coverage of large High-Performance Computing (HPC) systems. Due to its lower arithmetic intensity and higher memory pressure, HPCG is
Casas, Marc +4 more
core +1 more source
Sequence alignment is an important tool for describing the relationships between DNA sequences. Many sequence alignment algorithms exist, differing in efficiency, in their models of the sequences, and in the relationship between sequences.
D. D. Shrimankar, S. R. Sathe
doaj +1 more source
This work is dedicated to the numerical simulation of natural gas flow in unconventional reservoirs in the presence of adsorption and slippage effects.
Grazione de Souza +3 more
doaj +1 more source
Running scientific codes on amazon EC2: a performance analysis of five high-end instances
Amazon Web Services (AWS) is a well-known public Infrastructure-as-a-Service (IaaS) provider whose Elastic Computing Cloud (EC2) offering includes some instances, known as cluster instances, aimed at High-Performance Computing (HPC) applications.
Roberto R. Expósito +4 more
doaj
Unified Schemes for Directive-Based GPU Offloading
GPU is the dominant accelerator device due to its high performance and energy efficiency. Directive-based GPU offloading using OpenACC or OpenMP target is a convenient way to port existing codes originally developed for multicore CPUs.
Yohei Miki, Toshihiro Hanawa
doaj +1 more source
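As a rough illustration of the directive-based offloading described in the entry above, the following is a minimal sketch of a SAXPY loop annotated for either OpenACC or OpenMP target. The function name `saxpy` and the compile-time switch on `_OPENACC` are assumptions made for this example, not the scheme proposed by the authors. A compiler without offloading support simply ignores the pragmas and runs the loop on the host.

```c
#include <stddef.h>

/* Hypothetical example: y[i] = a * x[i] + y[i].
   One loop body, two directive front ends selected at compile time.
   _OPENACC is predefined by OpenACC compilers; otherwise the OpenMP
   target directive is used (or ignored if OpenMP is disabled). */
void saxpy(size_t n, float a, const float *x, float *y)
{
#if defined(_OPENACC)
    #pragma acc parallel loop copyin(x[0:n]) copy(y[0:n])
#else
    #pragma omp target teams distribute parallel for \
        map(to: x[0:n]) map(tofrom: y[0:n])
#endif
    for (size_t i = 0; i < n; ++i) {
        y[i] = a * x[i] + y[i];
    }
}
```

The data clauses (`copyin`/`copy` and `map(to:)`/`map(tofrom:)`) state which arrays are only read on the device and which must be copied back, which is the main thing a port from a multicore CPU loop has to add.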
Performance Evaluation of a Hybrid Programming Model for RSDFT on T2K Open Supercomputer
Non-uniform memory access (NUMA) systems, where each processor has its own memory, have been a popular platform in high-end computing. While some early studies had reported that a flat-MPI programming model outperformed an OpenMP/MPI hybrid programming ...
Miwako Tsuji, Mitsuhisa Sato
doaj +1 more source
Multilevel Parallelization of AutoDock 4.2
Background: Virtual (computational) screening is an increasingly important tool for drug discovery. AutoDock is a popular open-source application for performing molecular docking, the prediction of ligand-receptor interactions.
Norgan Andrew P +4 more
doaj +1 more source
Implementing OpenMP 4.0 for the NVIDIA PTX architecture in GCC compiler
The paper describes the approach used in implementing OpenMP offloading to NVIDIA accelerators in GCC. Offloading refers to a new capability in OpenMP 4.0 specification update that allows the programmer to specify regions of code that should be executed ...
A. V. Monakov, V. A. Ivanishin
doaj +1 more source
The memory model of OpenMP has been widely misunderstood since the first OpenMP specification was published in 1997 (Fortran 1.0). The proposed OpenMP specification (version 2.5) includes a memory model section to address this issue. This section unifies and clarifies the text about the use of memory in all previous specifications, and relates the ...
Hoeflinger, J P, de Supinski, B R
openaire +2 more sources
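To make the memory-model point in the entry above concrete, here is a minimal sketch, under stated assumptions, of how a barrier makes one thread's writes visible to another. The function name `neighbour_exchange`, the `MAX_THREADS` bound, and the serial fallbacks are hypothetical and chosen only for this example. It relies on the rule that an OpenMP barrier implies a flush.

```c
#include <stddef.h>

#ifdef _OPENMP
#include <omp.h>
#else
/* Serial fallbacks so the sketch also builds without OpenMP. */
static int omp_get_thread_num(void)  { return 0; }
static int omp_get_num_threads(void) { return 1; }
#endif

#define MAX_THREADS 64  /* assumed upper bound for this example */

/* Each thread writes its own slot, then reads its left neighbour's slot.
   The barrier implies a flush, so every write made before it is visible
   to all threads after it. Returns 1 if every thread saw the expected
   neighbour value, 0 otherwise. */
int neighbour_exchange(void)
{
    int slot[MAX_THREADS] = {0};
    int matches = 0;  /* threads whose read was correct */
    int team = 0;     /* threads that took part */

    #pragma omp parallel num_threads(4) shared(slot) reduction(+:matches, team)
    {
        int nt = omp_get_num_threads();
        int me = omp_get_thread_num();

        slot[me] = me + 1;      /* write to shared memory */

        #pragma omp barrier     /* implied flush: publishes all writes */

        int left = (me + nt - 1) % nt;
        if (slot[left] == left + 1) {
            matches += 1;       /* neighbour's write was visible */
        }
        team += 1;
    }
    return matches == team;
}
```

Without the barrier, a thread could read its neighbour's slot before the neighbour has written it, since nothing would order the two accesses.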

