Instruction-level parallelism - Open Access .click


Realizing the Calculation of a Fully Normalized Associated Legendre Function Based on an FPGA [PDF]

Sensors
A large number of fully normalized associated Legendre function (fnALF) calculations are required to compute Earth’s gravity field elements using ultra-high-order gravity field coefficient models.
Yuxiang Fang, Qingbin Wang, Yichao Yang
doaj +2 more sources

ConvAix: An Application-Specific Instruction-Set Processor for the Efficient Acceleration of CNNs

IEEE Open Journal of Circuits and Systems, 2021
ConvAix is an application-specific instruction-set processor (ASIP) that enables the energy-efficient processing of convolutional neural networks (CNNs) while retaining substantial flexibility through its instruction-set architecture (ISA) based design ...
Andreas Bytyn, Rainer Leupers, Gerd Ascheid +2 more
doaj +1 more source

An astute LVQ approach using neural network for the prediction of conditional branches in pipeline processor [PDF]

EAI Endorsed Transactions on Scalable Information Systems, 2021
Modern microprocessors use deep pipelines to execute multiple instructions per cycle. The frequency and behavior of conditional instructions strongly affect the performance of instruction-level parallelism.
Sweety Nain, Prachi Chaudhary
doaj +1 more source

COMPARISON OF INSTRUCTION SCHEDULING AND REGISTER ALLOCATION FOR MIPS AND HPL-PD ARCHITECTURE FOR EXPLOITATION OF INSTRUCTION LEVEL PARALLELISM [PDF]

Engineering Heritage Journal, 2018
Integrated approaches to instruction scheduling and register allocation have been a promising area of research for code generation and compiler optimization.
Rajendra Kumar
doaj +1 more source

Design of Deep Learning VLIW Processor for Image Recognition

Xibei Gongye Daxue Xuebao, 2020
To meet the application demands of high-resolution image recognition and efficient localization processing in the aviation and aerospace fields, and to address the insufficient parallelism of existing research, an extensible ...

doaj +1 more source

Optimized Realization of Sobel Edge Detection Algorithm for FT-M7002 [PDF]

Jisuanji gongcheng, 2022
Edge detection is a robust image analysis method used in image processing and computer vision. The Sobel operator is widely used in edge detection and image processing. With the development of domestic FT series high-performance Digital Signal Processors ...
FAN Mingliang, GUO Zihan, CHAI Xiaonan, SHANG Jiandong
doaj +1 more source

LLVM RISC-V RV32X Graphics Extension Support and Characteristics Analysis of Graphics Programs

IEEE Access, 2023
In recent years, virtual reality technology has become the dominant means of human-computer interaction, with computer graphics rendering technology being a crucial component in realizing virtual reality experiences.
Peng Wang, Zhi-Bin Yu
doaj +1 more source

A Highly-Efficient and Tightly-Connected Many-Core Overlay Architecture

IEEE Access, 2021
Advances in CPU (Central Processing Unit) architecture alternate between generalization and specialization. In the past decade, general performance has been enhanced while addressing new brick walls that include power, memory, and ...
Riadh Ben Abdelhamid, Yoshiki Yamaguchi, Taisuke Boku +2 more
doaj +1 more source

Compiler-Directed Parallelism Scaling Framework for Performance Constrained Energy Optimization

IEEE Access, 2020
The evolution of semiconductor manufacturing technology has led to rising leakage current and the end of Dennard scaling. In the dark silicon era, an aggressive power-gating scheme with quantitative management of power-gated hardware resources is ...
Yung-Cheng Ma
doaj +1 more source

Exploring Various Levels of Parallelism in High-Performance CRC Algorithms

IEEE Access, 2019
Modern processors have increased the capabilities of instruction-level parallelism (ILP) and thread-level parallelism (TLP). These resources, however, typically exhibit poor utilization on conventional cyclic redundancy check (CRC) algorithms.
Mucong Chi, Dazhong He, Jun Liu
doaj +1 more source
