Results 191 to 200 of about 22,124 (260)
Some of the next articles are maybe not open access.
CHOPPER: A Compiler Infrastructure for Programmable Bit-serial SIMD Processing Using Memory in DRAM
International Symposium on High-Performance Computer Architecture, 2023Increasing interests in Bit-serial SIMD Processing-Using-DRAM (PUD) architectures amplify the needs for a compiler to automate code generation, credited to their ultra-wide SIMD width and reduction of data movements.
Xiangjun Peng +2 more
semanticscholar +1 more source
ALBUS: A method for efficiently processing SpMV using SIMD and Load balancing
Future generations computer systems, 2021SpMV (Sparse matrix–vector multiplication) is widely used in many fields. Improving the performance of SpMV has been the pursuit of many researchers. Parallel SpMV using multi-core processors has been a standard parallel method used by researchers.
Haodong Bian +4 more
semanticscholar +1 more source
Simodense: a RISC-V softcore optimised for exploring custom SIMD instructions
International Conference on Field-Programmable Logic and Applications, 2021Simodense is a high-performance open-source RISC-V (RV32IM) softcore, optimised for exploring custom SIMD instructions. In order to maximise SIMD instruction performance, the design’s memory system is optimised for streaming bandwidth, such as very wide ...
Philippos Papaphilippou +2 more
semanticscholar +1 more source
Improving SIMD Parallelism via Dynamic Binary Translation
Recent trends in SIMD architecture have tended toward longer vector lengths, and more enhanced SIMD features have been introduced in newer vector instruction sets.
Ding-Yong Hong +2 more
exaly +2 more sources
FESIA: A Fast and SIMD-Efficient Set Intersection Approach on Modern CPUs
IEEE International Conference on Data Engineering, 2020Set intersection is an important operation and widely used in both database and graph analytics applications. However, existing state-of-the-art set intersection methods only consider the size of input sets and fail to optimize for the case in which the ...
Jiyuan Zhang +3 more
semanticscholar +1 more source
2010 Fourth UKSim European Symposium on Computer Modeling and Simulation, 2010
Finding meaningful patterns is one of the most investigated fields of computational biology. Generalized Center String (GCS) problem is one of the problems that were established and Bpriori Algorithms have been proposed to solve GCS. In this paper we present parallel Bpriori Algorithms based on existing approaches.
F. S. Halataei, H. Haj Seyyed Javadi
openaire +1 more source
Finding meaningful patterns is one of the most investigated fields of computational biology. Generalized Center String (GCS) problem is one of the problems that were established and Bpriori Algorithms have been proposed to solve GCS. In this paper we present parallel Bpriori Algorithms based on existing approaches.
F. S. Halataei, H. Haj Seyyed Javadi
openaire +1 more source
Flex-PE: Flexible and SIMD Multiprecision Processing Element for AI Workloads
IEEE Transactions on Very Large Scale Integration (VLSI) SystemsThe rapid evolution of artificial intelligence (AI) models, from deep neural networks (DNNs) to transformers/large-language models (LLMs), demands flexible hardware solutions to meet diverse execution needs across edge and cloud platforms.
Mukul Lokhande +2 more
semanticscholar +1 more source
IEEE Transactions on Computers, 1982
Due to advances in VLSI technology, large scale arrays of microprocessors forming parallel processing systems have become feasible. The use of such a microprocessor array operating in the SIMD (single instruction stream-multiple data stream) mode to perform image resampling is explored.
Michael R. Warpenburg, Leah J. Siegel
openaire +1 more source
Due to advances in VLSI technology, large scale arrays of microprocessors forming parallel processing systems have become feasible. The use of such a microprocessor array operating in the SIMD (single instruction stream-multiple data stream) mode to perform image resampling is explored.
Michael R. Warpenburg, Leah J. Siegel
openaire +1 more source
Compiling rewriting onto SIMD and MIMD/SIMD machines
1994We present compilation techniques for Simple Maude, a declarative programming language based on Rewriting Logic which supports term, graph, and object-oriented rewriting. We show how to compile various constructs of Simple Maude onto SIMD and MIMD/SIMD massively parallel architectures, and in particular onto the Rewrite Rule Machine (RRM), a special ...
Patrick Lincoln +3 more
openaire +1 more source
OpenMP in VASP: Threading and SIMD
International Journal of Quantum Chemistry, 2019The Vienna Ab initio Simulation Package (VASP) is a widely used electronic structure code that originally exploits process-level parallelism through the Message Passing Interface (MPI) for work distribution within and across nodes. Architectural changes
Florian Wende +5 more
semanticscholar +1 more source

