Instruction-level parallelism - Open Access .click

Results 41 to 50 of about 189,345 (270)

Boosting Parallel Applications Performance on Applying DIM Technique in a Multiprocessing Environment

International Journal of Reconfigurable Computing, 2011
Limits of instruction-level parallelism and higher transistor density sustain the increasing need for multiprocessor systems: they are rapidly taking over both general-purpose and embedded processor domains.
Mateus B. Rutzig +7 more
doaj +1 more source

An integration of autonomic computing with multicore systems for performance optimization in Industrial Internet of Things

IET Communications, EarlyView., 2022
Abstract The goal of this work is to investigate how the self‐awareness characteristic of autonomic computing, paired with existing performance optimization rules, may be used in applications to minimise multi‐core processor performance concerns. The suggested self‐awareness technique can assist applications in self‐execution while also assisting other
Surendra Kumar Shukla +8 more
wiley +1 more source

Instruction Fetch Policy for SMT Processors with Different Allocations of Floating-point and Integer Resources [PDF]

Jisuanji gongcheng, 2017
In Simultaneous Multithreading(SMT) processors,different threads have different demands for floating-point and integer resources.How to allocate shared resources among threads is the key point to improve the whole performance for SMT processors.Aiming at
JIANG Shengjian,HU Xiangdong,YANG Jianxin
doaj +1 more source

Solving Large Nonlinear Systems of First-Order Ordinary Differential Equations With Hierarchical Structure Using Multi-GPGPUs and an Adaptive Runge Kutta ODE Solver

IEEE Access, 2013
The adaptive Runge-Kutta (ARK) method on multi-general-purpose graphical processing units (GPUs) is used for solving large nonlinear systems of first-order ordinary differential equations (ODEs) with over ~ 10 000 variables describing a large genetic ...
Ahmad Al-Omari +3 more
doaj +1 more source

Late-Stage Optimization of Modern ILP Processor Cores via FPGA Simulation

Applied Sciences, 2022
Late-stage (post-RTL implementation) optimization is important in achieving target performance for realistic processor design. However, several challenges remain for modern out-of-order ILP (instruction-level-parallelism) processors, such as simulation ...
Mengqiao Lan +6 more
doaj +1 more source

RGCA: A Reliable GPU Cluster Architecture for Large-Scale Internet of Things Computing Based on Effective Performance-Energy Optimization

Sensors, 2017
This paper aims to develop a low-cost, high-performance and high-reliability computing system to process large-scale data using common data mining algorithms in the Internet of Things (IoT) computing environment.
Yuling Fang +4 more
doaj +1 more source

Instruction-level parallel processing: History, overview, and perspective [PDF]

The Journal of Supercomputing, 1993
Instruction-level parallelism (ILP) is a family of processor and compiler design techniques that speed up execution by causing individual machine operations to execute in parallel. Although ILP has appeared in the highest performance uniprocessors for the past 30 years, the 1980s saw it become a much more significant force in computer design.
B. Ramakrishna Rau, Joseph A. Fisher
openaire +1 more source

Accelerating Nested Conditionals on CGRA With Tag-Based Full Predication Method

IEEE Access, 2020
CGRA (Coarse-grained Reconfigurable Architecture) has been widely considered as one of the most promising computing architectures to exploit spatial parallelism.
Jiang Sha, Wenbo Song, Yu Gong, Yingying Zhao +3 more
doaj +1 more source

A Pipeline-Based ODE Solving Framework

IEEE Access
The traditional parallel solving methods of ordinary differential equations (ODE) are mainly classified into task-parallelism, data-parallelism, and instruction-level parallelism.
Ruixia Cao, Shangjun Hou, Lin Ma
doaj +1 more source

Vectorization of Program Code Containing Low Probability Regions in Computational Geometry Problems

Современные информационные технологии и IT-образование, 2022
Improving application performance is an important practical task for supercomputer calculations. Along with parallelization of calculations between cluster nodes (for example, using MPI tools), as well as multithreaded programming (for example, using ...
Alexey Rybakov
doaj +1 more source

programming language
task parallelism
computer architecture

operating system
engineering
data parallelism

compiler
instruction set
mathematics