Results 61 to 70 of about 4,058 (161)

Validation of hardware events for successful performance pattern identification in High Performance Computing

open access: yes, 2017
Hardware performance monitoring (HPM) is a crucial ingredient of performance analysis tools. While there are interfaces like LIKWID, PAPI or the kernel interface perf\_event which provide HPM access with some additional features, many higher level tools ...
Eitzinger, Jan   +3 more
core   +1 more source

An ACL2 proof of write invalidate cache coherence [PDF]

open access: yes, 1998
As a pedagogical exercise in ACL2, we formalize and prove the correctness of a write invalidate cache scheme. In our formalization, an arbitrary number of processors, each with its own local cache, interact with a global memory via a bus which is snooped by the caches.
openaire   +1 more source

Eliminating invalidation in coherent-cache parallel graph reduction

open access: yes, 1994
Parallel functional programs based on the graph reduction execution model display considerable locality of reference, favouring the use of large cache lines in the implementation of the shared heap on a shared-memory multiprocessor. They also display a very high rate of synchronisation, making conventional weakly-consistent coherency protocols ...
Andrew J. Bennett, Paul H. J. Kelly
openaire   +2 more sources

Accelerating sequential programs using FastFlow and self-offloading

open access: yes, 2010
FastFlow is a programming environment specifically targeting cache-coherent shared-memory multi-cores. FastFlow is implemented as a stack of C++ template libraries built on top of lock-free (fence-free) synchronization mechanisms.
Aldinucci, Marco   +4 more
core  

Single-Producer/Single-Consumer Queues on Shared Cache Multi-Core Systems

open access: yes, 2010
Using efficient point-to-point communication channels is critical for implementing fine grained parallel program on modern shared cache multi-core architectures.
Torquati, Massimo
core  

Home - About - Disclaimer - Privacy