A Simple Cache Emulator for Evaluating Cache Behavior for SMP Systems
Every modern CPU uses a complex memory hierarchy, which consists of multiple cache memory levels. It is very difficult to predict the behavior of this hierarchy for a given program (for details see [1, 2]).
I. Šimeček
doaj
A Survey of Techniques For Improving Energy Efficiency in Embedded Computing Systems
Recent technological advances have greatly improved the performance and features of embedded systems. With the number of mobile devices alone now nearly equal to the population of Earth, embedded systems have truly become ubiquitous. These trends,
Mittal, Sparsh
core +1 more source
Leveraging machine learning and accelerometry to classify animal behaviours with uncertainty
Abstract Animal‐worn sensors have revolutionised the study of animal behaviour and ecology. Accelerometers, which measure changes in acceleration across planes of movement, are increasingly being used in conjunction with machine learning models to classify animal behaviours across taxa and research questions.
Medha Agarwal +4 more
wiley +1 more source
In this paper, the authors present GEMM-ArchProfiler, a simulation framework for evaluating General Matrix Multiplication performance in convolutional neural networks.
Binu Ayyappan, G. Santhosh Kumar
doaj +1 more source
In order to facilitate flexible network service virtualization and migration, network functions (NFs) are increasingly executed by software modules as so-called “softwarized NFs” on General-Purpose Computing (GPC) platforms and ...
Prateek Shantharama +2 more
doaj +1 more source
Fused Collapsing for Wide BVH Construction
Abstract We propose a novel approach for constructing wide bounding volume hierarchies on the GPU by integrating a simple bottom‐up collapsing procedure within an existing binary bottom‐up BVH builder. Our approach directly constructs a wide BVH without traversing a temporary binary BVH as done by previous approaches and achieves 1.4–1.6× lower ...
Wilhem Barbier, Mathias Paulin
wiley +1 more source
CPU and cache efficient management of memory-resident databases
Memory-Resident Database Management Systems (MRDBMS) have to be optimized for two resources: CPU cycles and memory bandwidth. To optimize for bandwidth in mixed OLTP/OLAP scenarios, the hybrid or Partially Decomposed Storage Model (PDSM) has been proposed.
Pirk, H. +7 more
openaire +3 more sources
Encoding Occupancy in Memory Location for Efficient and Compact High‐Resolution Voxel Structures
We encode information about geometric structure into the pointers of a sparse voxel directed acyclic graph (SVDAG). Each pointer carries information about the structure of the node it points to. Our encoding improves ray tracing performance and reduces model size in memory.
Jaina Modisett, Markus Billeter
wiley +1 more source
An improved neighbor list algorithm is proposed to reduce unnecessary interatomic distance calculations in molecular simulations. It combines the advantages of the Verlet table and cell linked list algorithms by using a cell decomposition approach to ...
Allen +23 more
core +1 more source
ABSTRACT Task‐based programming interfaces introduce a paradigm in which computations are decomposed into fine‐grained units of work known as “tasks”. StarPU is a runtime system originally developed to support task‐based parallelism on on‐premise heterogeneous architectures by abstracting low‐level hardware details and efficiently managing resource ...
Vanderlei Munhoz +5 more
wiley +1 more source

