Extending the Nested Parallel Model to the Nested Dataflow Model with Provably Efficient Schedulers
The nested parallel (a.k.a. fork-join) model is widely used for writing parallel programs. However, its two composition constructs, i.e., "$\parallel$" (parallel) and "$;$" (serial), are insufficient for expressing "partial dependencies" or "partial ...
Dinh, David +2 more
core +1 more source
Metalens Array for Complex‐Valued Optical Discrete Fourier Transform
A single‐layer metalens‐array device performs complex‐to‐complex discrete Fourier transforms (DFTs) optically with O(N) digital time complexity. An integrated interferometric phase retrieval and error mitigation scheme enables accurate, compact, and low‐power optical computing. The approach provides a versatile framework for implementing arbitrary complex‐
Randy Stefan Tanuwijaya +6 more
wiley +1 more source
Hardware acceleration of number theoretic transform for zk‐SNARK
An FPGA‐based hardware accelerator with a multi‐level pipeline is designed to support the large‐bitwidth and large‐scale NTT tasks in zk‐SNARK. It can be flexibly scaled to FPGAs of different sizes and has been integrated into a heterogeneous acceleration system with the help of HLS and OpenCL.
Haixu Zhao +6 more
wiley +1 more source
Divided we stand: Parallel distributed stack memory management [PDF]
We present an overview of the stack-based memory management techniques that we used in our non-deterministic and-parallel Prolog systems: &-Prolog and DASWAM.
Hermenegildo, Manuel V., Shen, Kish
core +1 more source
TVB C++: A Fast and Flexible Back‐End for The Virtual Brain
TVB C++ is a streamlined, fast C++ back‐end for The Virtual Brain (TVB), designed to be as flexible as TVB while running faster. It is also fully compatible with TVB, so Python bindings can be created easily. Users can configure TVB C++ to execute the same code with enhanced performance and parallelism.
Ignacio Martín +7 more
wiley +1 more source
DBEFT: A Dependency-Ratio Bundling Earliest Finish Time Algorithm for Heterogeneous Computing
Performance-effective task scheduling algorithms are essential for taking advantage of heterogeneous multiprocessors in heterogeneous computing environments.
Tao Li +6 more
doaj +1 more source
The Dynamic Partial Reconfiguration function of reconfigurable devices permits tasks to be performed simultaneously on a single device. Nevertheless, task placement and resource management problems emerge with the parallelism of reconfigurable devices ...
Tingyu Zhou +4 more
doaj +1 more source
A Comparison of some recent Task-based Parallel Programming Models [PDF]
The need for parallel programming models that are simple to use and at the same time efficient for current and future parallel platforms has led to recent attention to task-based models such as Cilk++, Intel TBB and the task concept in OpenMP version 3.0.
Brorsson, Mats +2 more
core +1 more source
Automatic Parallelization: Executing Sequential Programs on a Task-Based Parallel Runtime
There are billions of lines of sequential code in today's software that do not benefit from the parallelism available in modern multicore architectures.
Cabral, Bruno +3 more
core +1 more source
Task parallel assembly language for uncompromising parallelism [PDF]
Mike Rainey +6 more
openalex +1 more source

