Results 11 to 20 of about 22,124 (260)
DC-SIMD : Dynamic communication for SIMD processors [PDF]
SIMD (single instruction multiple data)-type processors have been found very efficient in image processing applications, because their repetitive structure is able to exploit the huge amount of data-level parallelism in pixel-type operations, operating at a relatively low energy consumption rate.
Frijns, R.M.W. +3 more
openaire +2 more sources
Maximizing SIMD resource utilization in GPGPUs with SIMD lane permutation [PDF]
Current GPUs maintain high programmability by abstracting the SIMD nature of the hardware as independent concurrent threads of control with hardware responsible for generating predicate masks to utilize the SIMD hardware for different flows of control.
Minsoo Rhu, Mattan Erez
openaire +2 more sources
SIMD@OpenMP : a programming model approach to leverage SIMD features [PDF]
SIMD instruction sets are a key feature in current general purpose and high performance architectures. SIMD instructions apply in parallel the same operation to a group of data, commonly known as vector. A single SIMD/vector instruction can, thus, replace a sequence of scalar instructions. Consequently, the number of instructions can be greatly reduced
Caballero de Gea, Diego Luis
openaire +5 more sources
Revisiting SIMD Programming [PDF]
Massively parallel SIMD array architectures are making their way into embedded processors. In these architectures, a number of identical processing elements having small private storage and using asynchronous I/O for accessing large shared memory executes the same instruction in lockstep.
Anton Lokhmotov +4 more
openaire +2 more sources
Preterm birth, socioeconomic status, and white matter development across childhood [PDF]
Preterm birth and socioeconomic status (SES) are associated with brain development in early life, but the contribution of each over time is uncertain. We examined the effects of gestational age (GA) and SES on white matter microstructure in the neonatal ...
Katie Mckinnon +26 more
doaj +2 more sources
Minotaur: A SIMD-Oriented Synthesizing Superoptimizer [PDF]
A superoptimizing compiler—one that performs a meaningful search of the program space as part of the optimization process—can find optimization opportunities that are missed by even the best existing optimizing compilers.
Zhengyang Liu, Stefan Mada, J. Regehr
semanticscholar +1 more source
Performance comparison of CPU and GPGPU calculations using three simple case studies [PDF]
In this work, we have prepared and analyzed three case studies comparing CPU and GPGPU calculations. After briefly introducing the topic of parallel programming by means of contemporary CPU and GPGPU technologies, we provide an overview of selected ...
Branislav Lipovsky, Slavomir Simonak
doaj +1 more source
SIMD-ified R-tree Query Processing and Optimization [PDF]
The introduction of Single Instruction Multiple Data (SIMD) instructions in mainstream CPUs has enabled modern database engines to leverage data parallelism by performing more computation with a single instruction, resulting in a reduced number of ...
Yeasir Rayhan, W. Aref
semanticscholar +1 more source
SIMD-Matcher: A SIMD-based Arbitrary Matching Framework [PDF]
Packet classification methods rely upon matching packet content/header against pre-defined rules, which are generated by network applications and their configurations. With the rapid development of network technology and the fast-growing network applications, users seek more enhanced, secure, and diverse network services.
Ping Wang 0043 +3 more
openaire +1 more source
From Merging Frameworks to Merging Stars: Experiences using HPX, Kokkos and SIMD Types [PDF]
Octo-Tiger, a large-scale 3D AMR code for the merger of stars, uses a combination of HPX, Kokkos and explicit SIMD types, aiming to achieve performance-portability for a broad range of heterogeneous hardware.
Gregor Daiß +4 more
semanticscholar +1 more source

