Results 1 to 10 of about 3,321,514 (360)

Message progression in parallel computing - to thread or not to thread? [PDF]

open access: green2008 IEEE International Conference on Cluster Computing, 2008
Message progression schemes that enable communication and computation to be overlapped have the potential to improve the performance of parallel applications. With currently available high-performance networks there are several options for making progress: manual progression, use of a progress thread, and communication offload. In this paper we analyze
Torsten Hoefler, Andrew Lumsdaine
semanticscholar   +4 more sources

Thread-safe lattice Boltzmann for high-performance computing on GPUs [PDF]

open access: greenJournal of Computational Science, 2023
We present thread-safe, highly-optimized lattice Boltzmann implementations, specifically aimed at exploiting the high memory bandwidth of GPU-based architectures. At variance with standard approaches to LB coding, the proposed strategy, based on the reconstruction of the post-collision distribution via Hermite projection, enforces data locality and ...
Andrea Montessori   +7 more
semanticscholar   +10 more sources

A Review of Lightweight Thread Approaches for High Performance Computing [PDF]

open access: green2016 IEEE International Conference on Cluster Computing (CLUSTER), 2016
High-level, directive-based solutions are becoming the programming models (PMs) of the multi/many-core architectures. Several solutions relying on operating system (OS) threads perfectly work with a moderate number of cores. However, exascale systems will spawn hundreds of thousands of threads in order to exploit their massive parallel architectures ...
Adrián Castelló   +5 more
semanticscholar   +6 more sources

A C++17 thread pool for high-performance scientific computing [PDF]

open access: goldSoftwareX
We present a modern C++17-compatible thread pool implementation, built from scratch with high-performance scientific computing in mind. The thread pool is implemented as a single lightweight and self-contained class, and does not have any dependencies ...
Barak Shoshany
doaj   +5 more sources

Detection of Screw Threads in Computed Tomography 3D Density Fields [PDF]

open access: goldMeasurement Science Review, 2013
Abstract Measurements and inspection in production must be rapid, robust and automated. In this paper a new method is proposed to automatically detect screw threads in 3D density fields obtained from computed tomography measurement devices. The described method can be used to automate many operations during screw thread inspection process and ...
Sergey Kosarevsky, Viktor Latypov
arxiv   +7 more sources

A new computational method for threaded connection stiffness [PDF]

open access: yesAdvances in Mechanical Engineering, 2016
For designing bolted connections in machinery applications, it is necessary to estimate the stiffness of the threaded connection. This work provides a new method for computing the stiffness of engaged screw in bolted connections according to the load ...
Dongmei Zhang, Shiqiao Gao, Xiao Xu
doaj   +2 more sources

svds-C: A multi-thread C code for computing truncated singular value decomposition [PDF]

open access: goldSoftwareX
This article presents svds-C, an open-source and high-performance C program for accurately and robustly computing truncated SVD, e.g. computing several largest singular values and corresponding singular vectors.
Xu Feng, Wenjian Yu, Yuyang Xie
doaj   +3 more sources

Computing Method of Bolted Joint Critical Loosening Load with Flexible Thread [PDF]

open access: goldJournal of Mechanical Engineering, 2020
: Bolted joint loosening is one of the important modes of bolted joint failure. In the project, the bolted joints are often subjected to alternating lateral dynamic loads, and the bolts tend to loosening very easily as the bolt axis is perpendicular to ...
Jiang Wen-qiang   +4 more
openalex   +2 more sources

A Micro Threading Based Concurrency Model for Parallel Computing [PDF]

open access: green2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum, 2011
The continuing launch of various multi-core processors popularizes parallel computing of gaining higher performance, however, it also exerts pressures on software developers concerning how to make full use of the multiplicity of resources to get the required benefits due to high coupling of parallel programs to specified hardware.
Qiang Yang, Chris Jesshope, Jian Fu
openalex   +4 more sources

Analysis of Threading Libraries for High Performance Computing [PDF]

open access: bronzeIEEE Transactions on Computers, 2020
With the appearance of multi-many core machines, applications and runtime systems evolved in order to exploit the new on-node concurrency that brought new software paradigms. POSIX threads (Pthreads) was widely-adopted for that purpose and it remains as the most used threading solution in current hardware.
Adrián Castelló   +5 more
openalex   +7 more sources

Home - About - Disclaimer - Privacy