Message progression in parallel computing - to thread or not to thread? [PDF]
Message progression schemes that enable communication and computation to be overlapped have the potential to improve the performance of parallel applications. With currently available high-performance networks there are several options for making progress: manual progression, use of a progress thread, and communication offload. In this paper we analyze
Torsten Hoefler, Andrew Lumsdaine
semanticscholar +4 more sources
Thread-safe lattice Boltzmann for high-performance computing on GPUs [PDF]
We present thread-safe, highly-optimized lattice Boltzmann implementations, specifically aimed at exploiting the high memory bandwidth of GPU-based architectures. At variance with standard approaches to LB coding, the proposed strategy, based on the reconstruction of the post-collision distribution via Hermite projection, enforces data locality and ...
Andrea Montessori+7 more
semanticscholar +10 more sources
A Review of Lightweight Thread Approaches for High Performance Computing [PDF]
High-level, directive-based solutions are becoming the programming models (PMs) of multi/many-core architectures. Several solutions relying on operating system (OS) threads work perfectly well with a moderate number of cores. However, exascale systems will spawn hundreds of thousands of threads in order to exploit their massive parallel architectures ...
Adrián Castelló+5 more
semanticscholar +6 more sources
A C++17 thread pool for high-performance scientific computing [PDF]
We present a modern C++17-compatible thread pool implementation, built from scratch with high-performance scientific computing in mind. The thread pool is implemented as a single lightweight and self-contained class, and does not have any dependencies ...
Barak Shoshany
doaj +5 more sources
Detection of Screw Threads in Computed Tomography 3D Density Fields [PDF]
Measurements and inspection in production must be rapid, robust and automated. In this paper a new method is proposed to automatically detect screw threads in 3D density fields obtained from computed tomography measurement devices. The described method can be used to automate many operations during screw thread inspection process and ...
Sergey Kosarevsky, Viktor Latypov
arxiv +7 more sources
A new computational method for threaded connection stiffness [PDF]
For designing bolted connections in machinery applications, it is necessary to estimate the stiffness of the threaded connection. This work provides a new method for computing the stiffness of engaged screw in bolted connections according to the load ...
Dongmei Zhang, Shiqiao Gao, Xiao Xu
doaj +2 more sources
svds-C: A multi-thread C code for computing truncated singular value decomposition [PDF]
This article presents svds-C, an open-source and high-performance C program for accurately and robustly computing truncated SVD, e.g. computing several largest singular values and corresponding singular vectors.
Xu Feng, Wenjian Yu, Yuyang Xie
doaj +3 more sources
Computing Method of Bolted Joint Critical Loosening Load with Flexible Thread [PDF]
Bolted joint loosening is one of the important modes of bolted joint failure. In the project, the bolted joints are often subjected to alternating lateral dynamic loads, and the bolts tend to loosen very easily as the bolt axis is perpendicular to ...
Jiang Wen-qiang+4 more
openalex +2 more sources
A Micro Threading Based Concurrency Model for Parallel Computing [PDF]
The continuing launch of various multi-core processors popularizes parallel computing as a way of gaining higher performance. However, it also puts pressure on software developers to make full use of the multiplicity of resources and obtain the required benefits, because parallel programs are tightly coupled to specific hardware.
Qiang Yang, Chris Jesshope, Jian Fu
openalex +4 more sources
Analysis of Threading Libraries for High Performance Computing [PDF]
With the appearance of multi-/many-core machines, applications and runtime systems evolved in order to exploit the new on-node concurrency that brought new software paradigms. POSIX threads (Pthreads) was widely adopted for that purpose and remains the most used threading solution on current hardware.
Adrián Castelló+5 more
openalex +7 more sources