Results 21 to 30 of about 15,151 (210)
An investigation of the performance portability of OpenCL [PDF]
This paper reports on the development of an MPI/OpenCL implementation of LU, an application-level benchmark from the NAS Parallel Benchmark Suite.
Hammond, Simon D. +5 more
core +1 more source
Computing OpenSURF on OpenCL and General Purpose GPU
Speeded-Up Robust Feature (SURF) algorithm is widely used for image feature detecting and matching in computer vision area. Open Computing Language (OpenCL) is a framework for writing programs that execute across heterogeneous platforms consisting of ...
Wanglong Yan +3 more
doaj +1 more source
Parallel waveform extraction algorithms for the Cherenkov Telescope Array Real-Time Analysis [PDF]
The Cherenkov Telescope Array (CTA) is the next generation observatory for the study of very high-energy gamma rays from about 20 GeV up to 300 TeV. Thanks to the large effective area and field of view, the CTA observatory will be characterized by an ...
Aboudan, Alessio +6 more
core +2 more sources
Implementation of the Lattice Boltzmann Method on Heterogeneous Hardware and Platforms using OpenCL
The Lattice Boltzmann method (LBM) has become an alternative method for computational fluid dynamics with a wide range of applications. Besides its numerical stability and accuracy, one of the major advantages of LBM is its relatively easy ...
TEKIC, P. M. +2 more
doaj +1 more source
A Python package for fast GPU-based proton pencil beam dose calculation. [PDF]
Abstract Purpose Open‐source GPU‐based Monte Carlo (MC) proton dose calculation algorithms provide high speed and unparalleled accuracy but can be complex to integrate with new applications and remain slower than GPU‐based pencil beam (PB) methods, which sacrifice some physical accuracy for sub‐second plan calculation.
Bhattacharya M +4 more
europepmc +2 more sources
Comprehensive Evaluation of OpenCL-Based CNN Implementations for FPGAs [PDF]
Deep learning has significantly advanced the state of the art in artificial intelligence, gaining wide popularity from both industry and academia.
Kadetotad, Deepak +5 more
core +1 more source
Fast Algorithm Based on Parallel Computing for Sample Entropy Calculation
Sample entropy is a widely used method for assessing the irregularity of physiological signals, but it has a high computational complexity, which prevents its application for time-sensitive scenes.
Xinzheng Dong +4 more
doaj +1 more source
SkelCL: enhancing OpenCL for high-level programming of multi-GPU systems [PDF]
Application development for modern high-performance systems with Graphics Processing Units (GPUs) currently relies on low-level programming approaches like CUDA and OpenCL, which leads to complex, lengthy and error-prone programs.
Gorlatch, Sergei, Steuwer, Michel
core +2 more sources
Computation in engineering and science can often benefit from acceleration due to lengthy calculation times for certain classes of numerical models.
Junjie Gu, Attila Michael Zsaki
doaj +1 more source
Extending OmpSs for OpenCL kernel co-execution in heterogeneous systems [PDF]
© 2017 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes,creating new ...
Ayguadé Parra, Eduard +7 more
core +1 more source

