New execution model for CAPE using multiple threads on multicore clusters
Based on its simplicity and user‐friendly characteristics, OpenMP has become the standard model for programming on shared‐memory architectures. Checkpointing‐aided parallel execution (CAPE) is an approach that utilizes the discontinuous incremental ...
Xuan Huyen Do +3 more
doaj +1 more source
Optimizing Iterative Data-Flow Scientific Applications Using Directed Cyclic Graphs
Data-flow programming models have become a popular choice for writing parallel applications as an alternative to traditional work-sharing parallelism. They are better suited to write applications with irregular parallelism that can present load imbalance.
David Alvarez, Vicenc Beltran
doaj +1 more source
Spectral analytical method of recognition of inexact repeats in character sequences
A theoretical basis and an algorithmic implementation are proposed for a spectral-analytical method of recognizing repeats in character sequences. The theoretical justification is based on the theorem on equivalent representation of the character sequence by ...
A. N. Pankratov +4 more
doaj +1 more source
MILC Code Performance on High End CPU and GPU Supercomputer Clusters [PDF]
With recent developments in parallel supercomputing architecture, many-core, multi-core, and GPU processors are now commonplace, resulting in more levels of parallelism, memory hierarchy, and programming complexity.
DeTar, Carleton +3 more
core +2 more sources
With the increasing diversity of heterogeneous architecture in the HPC industry, porting a legacy application to run on different architectures is a tough challenge. In this paper, we present OpenMP Advisor, a first of its kind compiler tool that enables code offloading to a GPU with OpenMP using Machine Learning. Although the tool is currently limited
Mishra, Alok +3 more
openaire +2 more sources
Hints on the Multicore CPU Design for Personal Computers
Multi-core processors are created by combining multiple units, each called a core, containing a processing unit, registers, and a cache. This study provides a detailed analysis of the performance of multicore CPUs for personal computers.
İren Ecem +2 more
doaj +1 more source
Hierarchical Parallelisation of Functional Renormalisation Group Calculations -- hp-fRG [PDF]
The functional renormalisation group (fRG) has evolved into a versatile tool in condensed matter theory for studying important aspects of correlated electron systems.
Rohe, Daniel
core +2 more sources
GPU Parallelization of a Hybrid Pseudospectral Geophysical Turbulence Framework Using CUDA
An existing hybrid MPI-OpenMP scheme is augmented with a CUDA-based fine grain parallelization approach for multidimensional distributed Fourier transforms, in a well-characterized pseudospectral fluid turbulence code.
Duane Rosenberg +3 more
doaj +1 more source
The Glasgow Parallel Reduction Machine: Programming Shared-memory Many-core Systems using Parallel Task Composition [PDF]
We present the Glasgow Parallel Reduction Machine (GPRM), a novel, flexible framework for parallel task-composition based many-core programming. We allow the programmer to structure programs into task code, written as C++ classes, and communication code,
Tousimojarad, Ashkan +1 more
core +3 more sources
Experimental computer tomograph
Computed tomography is one of the most important medical instruments, allowing the non-invasive visualization of cross sections that are free from superpositions. Since 2000 an experimental computer tomograph of the third generation for the purpose
Heinemann D., Keller A., Jannek D.
doaj +1 more source

