MIMD Programs Execution Support on SIMD Machines: A Holistic Survey
The Single Instruction Multiple Data (SIMD) architecture, supported by various high-performance computing platforms, efficiently utilizes data-level parallelism.
Dheya Mustafa +3 more
doaj +1 more source
High-Performance Computing (HPC) systems are designed for large-scale processing and complex dataset analysis leveraging scalability, efficiency, and parallelism, often integrating specialized hardware structures such as Vector Processing Units (VPUs ...
Marcello Barbirotta +5 more
doaj +1 more source
Exploiting deep learning accelerators for neuromorphic workloads
Spiking neural networks (SNNs) have achieved orders of magnitude improvement in terms of energy consumption and latency when performing inference with deep learning workloads.
Pao-Sheng Vincent Sun +6 more
doaj +1 more source
Employing register channels for the exploitation of instruction level parallelism [PDF]
A multiprocessor system capable of exploiting fine-grained parallelism must support efficient synchronization and data passing mechanisms. This paper demonstrates the use of shared register channels as the communication mechanism among processors in a multiprocessor chip.
openaire +3 more sources
P-CORE: Exploring RISC-V Packed-SIMD Extension for CNNs
In today’s technological landscape, embedded and IoT devices face escalating demands for performance and power efficiency in inference tasks employing Convolutional Neural Networks (CNNs).
Muhammad Ali +3 more
doaj +1 more source
Power estimation on functional level for programmable processors [PDF]
This paper presents various approaches to power dissipation estimation for programmable processors and, with regard to their transferability to modern processor architectures such as Very Long Instruction Word (VLIW ...
M. Schneider, H. Blume, T. G. Noll
doaj
Radiation schemes are critical components of Earth system models that need to be both efficient and accurate. Despite the use of approximations such as 1D radiative transfer, radiation can account for a large share of the runtime of expensive climate ...
Peter Ukkonen, Robin J. Hogan
doaj +1 more source
Exploiting Thread-Level and Instruction-Level Parallelism to Cluster Mass Spectrometry Data using Multicore Architectures. [PDF]
Saeed F +3 more
europepmc +1 more source
Fast noisy long read alignment with multi-level parallelism. [PDF]
Xia Z +6 more
europepmc +1 more source
Instruction-Level Parallelism and Parallelizing Compilation (Dagstuhl Seminar 99161) [PDF]
Arvind, D. K. +4 more
openaire +3 more sources

