Optimization of Digital Signal Transformation Functions in Multicluster VLIW DSP [PDF]
According to the characteristics of BWDSP100 processor’s architecture,this paper presents several practical ways to improve the performance of digital signal transformation functions in Digital Signal Processor(DSP) function library,including using ...
ZHEN Yang,GU Naijie,YE Hong
doaj +1 more source
Realization and Application of Shenwei Simultaneous Multithreading Functional Simulator [PDF]
imultaneous Multithreading(SMT) allows irrelevant instructions execute synchronously from multiple threads.It achieves the combination of Thread-level Parallelism(TLP) and Instruction-level Parallelism(ILP),enhances the performance of processor further ...
CHEN Weijian,GUO Yong,YIN Fei
doaj +1 more source
A Single Instruction Multiple Data Vectorization Reduction Method [PDF]
Single Instruction Multiple Data(SIMD) aims at exploiting the data-level parallelism of multimedia and scientific calculation.The true dependence caused by reduction operation hinders exploring data-level parallelism.But different architecture and ...
HAN Lin,GAO Wei,WANG Dong,WANG Pengxiang,LI Yingying
doaj +1 more source
Design of Deep Learning VLIW Processor for Image Recognition
In order to adapt the application demands of high resolution images recognition and efficient processing of localization in aviation and aerospace fields, and to solve the problem of insufficient parallelism in existing researches, an extensible ...
doaj +1 more source
An astute LVQ approach using neural network for the prediction of conditional branches in pipeline processor [PDF]
Nowadays, microprocessors use the deep pipeline to execute multiple instructions per cycle. The frequency and behavior of conditional instructions mainly affect the performance of instruction-level parallelism.
Sweety Nain, Prachi Chaudhary
doaj +1 more source
Instruction scheduling for instruction level parallel processors [PDF]
Nearly all personal computer and workstation processors, and virtually all high-performance embedded processor cores, now embody instruction level parallel (ILP) processing in the form of superscalar or very long instruction word (VLIW) architectures.
P. Faraboschi, J.A. Fisher, C. Young
openaire +1 more source
LLVM RISC-V RV32X Graphics Extension Support and Characteristics Analysis of Graphics Programs
In recent years, virtual reality technology has become the dominant means of human-computer interaction, with computer graphics rendering technology being a crucial component in realizing virtual reality experiences.
Peng Wang, Zhi-Bin Yu
doaj +1 more source
Automatically Extracting GPU Instruction-Level Parallelism
James A. Jablin
openalex +2 more sources
Optimized Realization of Sobel Edge Detection Algorithm for FT-M7002 [PDF]
Edge detection is a robust image analysis method used in image processing and computer vision.The Sobel operator is widely used in edge detection and image processing.With the development of domestic FT series high-performance Digital Signal Processors ...
FAN Mingliang, GUO Zihan, CHAI Xiaonan, SHANG Jiandong
doaj +1 more source
Instruction-Level Parallelism and Parallelizing Compilation (Dagstuhl Seminar 99161)
D. K. Arvind +4 more
openalex +4 more sources

