An asynchronous and task-based implementation of peridynamics utilizing HPX—the C++ standard library for parallelism and concurrency [PDF]
On modern supercomputers, asynchronous many task systems are emerging to address the new architecture of computational nodes. Through this shift of increasing cores per node, a new programming model with focus on handling of the fine-grain parallelism ...
Patrick Diehl+4 more
semanticscholar +1 more source
Heterogeneous Implementation of a Voronoi Cell-Based SVP Solver
This paper presents a new, heterogeneous CPU+GPU attacks against lattice-based (postquantum) cryptosystems based on the Shortest Vector Problem (SVP), a central problem in lattice-based cryptanalysis.
Gabriel Falcao+3 more
doaj +1 more source
When based on phylogenetic proposals, biogeographic historic narratives have a great interest for hypothesizing paths of origin of the current biodiversity.
Cristian Román-Palacios+2 more
doaj +1 more source
Integrated Model, Batch, and Domain Parallelism in Training Neural Networks [PDF]
We propose a new integrated method of exploiting model, batch and domain parallelism for the training of deep neural networks (DNNs) on large distributed-memory computers using minibatch stochastic gradient descent (SGD). Our goal is to find an efficient
A. Gholami+4 more
semanticscholar +1 more source
Autogenous self‐healing of concrete: Experimental design and test methodsA review
Concrete exhibits an intrinsic ability to heal cracks, defined as “autogenous self‐healing.” However, despite code restrictions, autogenous self‐healing of concrete shows limited effectiveness in practice. This indicates the need for further research to provide engineers with reliable design rules.
Daniel Lahmann+2 more
wiley +1 more source
Research on parallelization clustering algorithm for power communication big data
With the development of power communication technology, a large number of distributed power communication subsystems and massive power communication data have been generated.
Zeng Ying, Li Xingnan, Liu Xinzhan
doaj +1 more source
Efficient computation of oriented vertex and arc colorings of special digraphs [PDF]
In this paper we study the oriented vertex and arc coloring problem on edge series-parallel digraphs (esp-digraphs) which are related to the well known series-parallel graphs. Series-parallel graphs are graphs with two distinguished vertices called terminals, formed recursively by parallel and series composition.
arxiv
Hardware acceleration of number theoretic transform for zk‐SNARK
An FPGA‐based hardware accelerator with a multi‐level pipeline is designed to support the large‐bitwidth and large‐scale NTT tasks in zk‐SNARK. It can be flexibly scaled to different scales of FPGAs and has been equipped in the heterogeneous acceleration system with the help of HLS and OpenCL.
Haixu Zhao+6 more
wiley +1 more source
Parallelism in Randomized Incremental Algorithms [PDF]
In this article, we show that many sequential randomized incremental algorithms are in fact parallel. We consider algorithms for several problems, including Delaunay triangulation, linear programming, closest pair, smallest enclosing disk, least-element ...
G. Blelloch+3 more
semanticscholar +1 more source
A Scheduling Method of Moldable Parallel Tasks Considering Speedup and System Load on the Cloud
The moldable parallel task (MPT) is a kind of parallel task that their sub-tasks hold the resources exclusively, which has been widely used in different areas. Our paper focuses on the scheduling of moldable tasks when every sub-task supports time-slice.
Jianmin Li, Ying Zhong, Xin Zhang
doaj +1 more source