Results 161 to 170 of about 15,151 (210)
Some of the next articles are maybe not open access.
2016
Usage of General Purpose Graphics Processing Units (GPGPUs) in high-performance computing is increasing as heterogeneous systems continue to become dominant. CUDA had been the programming environment for nearly all such NVIDIA GPU based GPGPU applications.
Mayank Bhura +2 more
openaire +1 more source
Usage of General Purpose Graphics Processing Units (GPGPUs) in high-performance computing is increasing as heterogeneous systems continue to become dominant. CUDA had been the programming environment for nearly all such NVIDIA GPU based GPGPU applications.
Mayank Bhura +2 more
openaire +1 more source
2009 IEEE Hot Chips 21 Symposium (HCS), 2009
Optimization is a balancing act Things to consider: - Register usage / number of wavefronts in flight - ALU to memory access rat io ■ Sometimes better re-compute something - Workgroup size a multiple of 64 - Global size at least 2560 for a single ...
openaire +1 more source
Optimization is a balancing act Things to consider: - Register usage / number of wavefronts in flight - ALU to memory access rat io ■ Sometimes better re-compute something - Workgroup size a multiple of 64 - Global size at least 2560 for a single ...
openaire +1 more source
Optimized fast Walsh-Hadamard transform on OpenCL-GPU and OpenCL-CPU
2016 Sixth International Conference on Image Processing Theory, Tools and Applications (IPTA), 2016The Walsh-Hadamard transform plays a major role in many image and video coding algorithms. In one hand, its intensive use in these algorithms makes its acceleration a challenge, in order to speed-up the algorithm execution. On the other hand, the available fast implementations are not efficient across different platforms. In this work, a parallel-based
Pedro M. M. Pereira +4 more
openaire +1 more source
MASA‐OpenCL: Parallel pruned comparison of long DNA sequences with OpenCL
Concurrency and Computation: Practice and Experience, 2018SummaryBiological sequence comparison is often used as an auxiliary task in the analysis of genetic material. Pairwise comparison algorithms like Smith‐Waterman evaluate two strings representing sequences of proteins, DNA or RNA to obtain optimal alignment between them.
Marco Antonio C. de Figueiredo +4 more
openaire +1 more source
2018
This chapter focuses on OpenCL, which is the most popular Graphics Processing Unit (GPU) programming language, excluding Compute-Unified Device Architecture (CUDA). It examines how OpenCL simplifies writing multiplatform parallel programs. OpenCL was released in 2009 by the Khronos Group as a framework for writing parallel programs on many different ...
Chase Conklin, Tolga Soyata
openaire +1 more source
This chapter focuses on OpenCL, which is the most popular Graphics Processing Unit (GPU) programming language, excluding Compute-Unified Device Architecture (CUDA). It examines how OpenCL simplifies writing multiplatform parallel programs. OpenCL was released in 2009 by the Khronos Group as a framework for writing parallel programs on many different ...
Chase Conklin, Tolga Soyata
openaire +1 more source
2019
?????????????????? ?????????????? ?? ???????????????????? ???????????????????? ???????????? ?????????????????? ?????????????????????? ?????????????? ??????'???????????? ?? ?????????????????????????? ???????????????????????? ???????????????????? ???????????????????????????? ????????????. ?????????????????????? ???????????????????? ?????????????? ????????
openaire +1 more source
?????????????????? ?????????????? ?? ???????????????????? ???????????????????? ???????????? ?????????????????? ?????????????????????? ?????????????? ??????'???????????? ?? ?????????????????????????? ???????????????????????? ???????????????????? ???????????????????????????? ????????????. ?????????????????????? ???????????????????? ?????????????? ????????
openaire +1 more source
2021
Recently, there has been growing interest in custom and reconfigurable hardware. However, commercial hardware platforms have limitations for research, because they are optimized for specific use-cases or have proprietary parts. To address this problem, the Systems Group at ETH built Enzian, a new computing platform tailored for research and for ...
openaire +1 more source
Recently, there has been growing interest in custom and reconfigurable hardware. However, commercial hardware platforms have limitations for research, because they are optimized for specific use-cases or have proprietary parts. To address this problem, the Systems Group at ETH built Enzian, a new computing platform tailored for research and for ...
openaire +1 more source
2009 IEEE Hot Chips 21 Symposium (HCS), 2009
• OpenCL 1.0 Embedded Profile is a subset of the full profile • Not an “ES” specification of its own • Easier programming of heterogeneous multi-processor • Fast multiprocessor code without portability hassle • Speedups and energy efficiency possible via parallelism • But scheduling can be ...
openaire +1 more source
• OpenCL 1.0 Embedded Profile is a subset of the full profile • Not an “ES” specification of its own • Easier programming of heterogeneous multi-processor • Fast multiprocessor code without portability hassle • Speedups and energy efficiency possible via parallelism • But scheduling can be ...
openaire +1 more source
Proceedings of the International Workshop on OpenCL, 2020
Aksel Alpay, Vincent Heuveline
openaire +1 more source
Aksel Alpay, Vincent Heuveline
openaire +1 more source

