Results 191 to 200 of about 22,124 (260)
Some of the next articles are maybe not open access.

CHOPPER: A Compiler Infrastructure for Programmable Bit-serial SIMD Processing Using Memory in DRAM

International Symposium on High-Performance Computer Architecture, 2023
Increasing interests in Bit-serial SIMD Processing-Using-DRAM (PUD) architectures amplify the needs for a compiler to automate code generation, credited to their ultra-wide SIMD width and reduction of data movements.
Xiangjun Peng   +2 more
semanticscholar   +1 more source

ALBUS: A method for efficiently processing SpMV using SIMD and Load balancing

Future generations computer systems, 2021
SpMV (Sparse matrix–vector multiplication) is widely used in many fields. Improving the performance of SpMV has been the pursuit of many researchers. Parallel SpMV using multi-core processors has been a standard parallel method used by researchers.
Haodong Bian   +4 more
semanticscholar   +1 more source

Simodense: a RISC-V softcore optimised for exploring custom SIMD instructions

International Conference on Field-Programmable Logic and Applications, 2021
Simodense is a high-performance open-source RISC-V (RV32IM) softcore, optimised for exploring custom SIMD instructions. In order to maximise SIMD instruction performance, the design’s memory system is optimised for streaming bandwidth, such as very wide ...
Philippos Papaphilippou   +2 more
semanticscholar   +1 more source

Improving SIMD Parallelism via Dynamic Binary Translation

open access: yesTransactions on Embedded Computing Systems, 2018
Recent trends in SIMD architecture have tended toward longer vector lengths, and more enhanced SIMD features have been introduced in newer vector instruction sets.
Ding-Yong Hong   +2 more
exaly   +2 more sources

FESIA: A Fast and SIMD-Efficient Set Intersection Approach on Modern CPUs

IEEE International Conference on Data Engineering, 2020
Set intersection is an important operation and widely used in both database and graph analytics applications. However, existing state-of-the-art set intersection methods only consider the size of input sets and fail to optimize for the case in which the ...
Jiyuan Zhang   +3 more
semanticscholar   +1 more source

SIMD Bpriori Algorithms

2010 Fourth UKSim European Symposium on Computer Modeling and Simulation, 2010
Finding meaningful patterns is one of the most investigated fields of computational biology. Generalized Center String (GCS) problem is one of the problems that were established and Bpriori Algorithms have been proposed to solve GCS. In this paper we present parallel Bpriori Algorithms based on existing approaches.
F. S. Halataei, H. Haj Seyyed Javadi
openaire   +1 more source

Flex-PE: Flexible and SIMD Multiprecision Processing Element for AI Workloads

IEEE Transactions on Very Large Scale Integration (VLSI) Systems
The rapid evolution of artificial intelligence (AI) models, from deep neural networks (DNNs) to transformers/large-language models (LLMs), demands flexible hardware solutions to meet diverse execution needs across edge and cloud platforms.
Mukul Lokhande   +2 more
semanticscholar   +1 more source

SIMD Image Resampling

IEEE Transactions on Computers, 1982
Due to advances in VLSI technology, large scale arrays of microprocessors forming parallel processing systems have become feasible. The use of such a microprocessor array operating in the SIMD (single instruction stream-multiple data stream) mode to perform image resampling is explored.
Michael R. Warpenburg, Leah J. Siegel
openaire   +1 more source

Compiling rewriting onto SIMD and MIMD/SIMD machines

1994
We present compilation techniques for Simple Maude, a declarative programming language based on Rewriting Logic which supports term, graph, and object-oriented rewriting. We show how to compile various constructs of Simple Maude onto SIMD and MIMD/SIMD massively parallel architectures, and in particular onto the Rewrite Rule Machine (RRM), a special ...
Patrick Lincoln   +3 more
openaire   +1 more source

OpenMP in VASP: Threading and SIMD

International Journal of Quantum Chemistry, 2019
The Vienna Ab initio Simulation Package (VASP) is a widely used electronic structure code that originally exploits process-level parallelism through the Message Passing Interface (MPI) for work distribution within and across nodes. Architectural changes
Florian Wende   +5 more
semanticscholar   +1 more source

Home - About - Disclaimer - Privacy