Results 11 to 20 of about 248 (147)
PDIP: Priority Directed Instruction Prefetching
Bhargav Reddy Godala +6 more
openalex +2 more sources
The Entangling Instruction Prefetcher [PDF]
Prefetching instructions is a fundamental technique for designing high-performance computers. There are three key properties to consider when designing an efficient and effective prefetcher: timeliness, coverage, and accuracy. Timeliness is an essential property, as bringing instructions too early increases the risk of the instructions being evicted ...
Alberto Ros, Alexandra Jimborean
openaire +2 more sources
Prefetching for the Kilo-Instruction Processor
The large latency of memory accesses in modern computer systems is a key obstacle to achieving high processor utilization. Techniques to reduce or tolerate large memory latencies become essential for achieving high processor utilization. Prefetch is one of the most widely studied mechanisms at literature.
Matherey Bracamonte Nunez
openalex +2 more sources
Neighborhood prefetching on multiprocessors using instruction history [PDF]
D.M. Koppelman
openalex +2 more sources
MANA: Microarchitecting an Instruction Prefetcher
24 pages with 15 ...
Ansari, Ali +3 more
openaire +2 more sources
Hardware Support for Prescient Instruction Prefetch [PDF]
This paper proposes and evaluates hardware mechanisms for supporting prescient instruction prefetch — an approach to improving single-threaded application performance by using helper threads to perform instruction prefetch. We demonstrate the need for enabling store-to-load communication and selective instruction execution when directly pre-executing ...
T.M. Aamodt +4 more
openaire +1 more source
Thread level parallelism (TLP) is a common approach to achieve parallelism where Instruction level parallelism (ILP) is insufficient. Hardware multithreading is a prevalent approach in the micro-architecture layer for tolerating long events such as ...
Hananya Ribo, Shlomo Greenberg
doaj +1 more source
Optimizing Lattice Basis Reduction Algorithm on ARM V8 Processors
The LLL (Lenstra–Lenstra–Lovász) algorithm is an important method for lattice basis reduction and has broad applications in computer algebra, cryptography, number theory, and combinatorial optimization.
Ronghui Cao +6 more
doaj +1 more source
A Cost-Effective Entangling Prefetcher for Instructions
Prefetching instructions in the instruction cache is a fundamental technique for designing high-performance computers. There are three key properties to consider when designing an efficient and effective prefetcher: timeliness, coverage, and accuracy.
Ros, Alberto, Jimborean, Alexandra
openaire +2 more sources
A cycle-level recovery method for embedded processor against HT tamper. [PDF]
Zhou W, Ye KH, Yuan S, Li L.
europepmc +1 more source

