Results 81 to 90 of about 31,764 (209)
ABSTRACT Task‐based programming interfaces introduce a paradigm in which computations are decomposed into fine‐grained units of work known as “tasks”. StarPU is a runtime system originally developed to support task‐based parallelism on on‐premise heterogeneous architectures by abstracting low‐level hardware details and efficiently managing resource ...
Vanderlei Munhoz +5 more
wiley +1 more source
Tackling Exascale Software Challenges in Molecular Dynamics Simulations with GROMACS
GROMACS is a widely used package for biomolecular simulation, and over the last two decades it has evolved from small-scale efficiency to advanced heterogeneous acceleration and multi-level parallelism targeting some of the largest supercomputers in the ...
A Arnold +21 more
core +1 more source
The Ongoing Evolution of OpenMP
This paper presents an overview of the past, present and future of the OpenMP application programming interface (API). While the API originally specified a small set of directives that guided shared memory fork-join parallelization of loops and program sections, OpenMP now provides a richer set of directives that capture a wide range of parallelization
Bronis R. de Supinski +7 more
openaire +2 more sources
Multithread Approximation: An OpenMP Constructor
ABSTRACT This study introduces an OpenMP construct designed to simplify and unify the integration of approximate computing techniques into shared‐memory parallel programs. Approximate Computing leverages the inherent error tolerance of many applications to trade computational accuracy for gains in performance and energy efficiency.
João Briganti de Oliveira +2 more
wiley +1 more source
Failure Criterion‐Based Bead Optimization of a Wind Turbine Blade Root Section
ABSTRACT This study presents a bead‐based optimization framework for fiber composite wind turbine blades, employing the Tsai–Wu failure criterion as the structural performance measure. The objective is to design an optimal transition geometry between an existing blade and a root section with 20% smaller diameter, while preserving the original material ...
Philipp Ulrich Haselbach +4 more
wiley +1 more source
A Comparison of some recent Task-based Parallel Programming Models [PDF]
The need for parallel programming models that are simple to use and at the same time efficient for current ant future parallel platforms has led to recent attention to task-based models such as Cilk++, Intel TBB and the task concept in OpenMP version 3.0.
Brorsson, Mats +2 more
core +1 more source
Shared-memory parallelization (SMP) strategies for density matrix renormalization group (DMRG) algorithms enable the treatment of complex systems in solid state physics.
E. Jeckelmann +14 more
core +1 more source
Non‐Linear Reduced Order Modelling of Transonic Potential Flows for Fast Aerodynamic Analysis
ABSTRACT This work presents a physics‐based reduced order modelling (ROM) framework for the efficient simulation of steady transonic potential flows around aerodynamic configurations. The approach leverages proper orthogonal decomposition and a least‐squares Petrov‐Galerkin (LSPG) projection to construct intrusive ROMs for the full potential equation ...
M. Zuñiga +3 more
wiley +1 more source
Benchmarking MILC code with OpenMP and MPI
A trend in high performance computers that is becoming increasingly popular is the use of symmetric multiprocessing (SMP) rather than the older paradigm of MPP.
Gottlieb, Steven, Tamhankar, Sonali
core +1 more source
Abstract Hierarchical agglomerative clustering is a useful analysis technique which allows for a level of stability, interpretability and flexibility not available in other similar techniques such as K‐means, density‐based clustering or positive matrix factorization.
Colin J. Lee +2 more
wiley +1 more source

