The standard formulation of the $K$-means clustering (Lloyd’s method) performs many unnecessary distance calculations. In this paper, we focus on four approaches that use the triangle inequality to avoid unnecessary distance calculations.
W. Kwedlo, Pawel J. Czochanski
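As an illustration of the idea in the abstract above, here is a minimal sketch of one common triangle-inequality test, the center-to-center bound used in Elkan-style acceleration: if d(c, c') >= 2·d(x, c), then d(x, c') >= d(x, c), so the distance from x to c' need not be computed. This is not necessarily any of the four approaches the paper studies. The function names `assign_with_pruning` and `assign_brute_force` and the sample data are hypothetical.

```python
import math


def dist(a, b):
    """Euclidean distance between two equal-length point tuples."""
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(a, b)))


def assign_with_pruning(points, centers):
    """Assign each point to its nearest center, skipping distance
    computations that the triangle inequality proves unnecessary.

    Returns (labels, skipped), where skipped counts the avoided
    point-to-center distance calculations.
    """
    k = len(centers)
    # Center-to-center distances, computed once per assignment pass.
    cc = [[dist(centers[i], centers[j]) for j in range(k)] for i in range(k)]
    labels = []
    skipped = 0
    for x in points:
        best = 0
        best_d = dist(x, centers[0])
        for j in range(1, k):
            # If d(c_best, c_j) >= 2 * d(x, c_best), then
            # d(x, c_j) >= d(x, c_best): center j cannot be closer.
            if cc[best][j] >= 2.0 * best_d:
                skipped += 1
                continue
            d = dist(x, centers[j])
            if d < best_d:
                best, best_d = j, d
        labels.append(best)
    return labels, skipped


def assign_brute_force(points, centers):
    """Reference assignment step that computes every distance."""
    return [
        min(range(len(centers)), key=lambda j: dist(x, centers[j]))
        for x in points
    ]


# Hypothetical sample data: two tight groups plus one point in between.
points = [(0.0, 0.0), (0.1, 0.0), (10.0, 10.0), (10.2, 9.9), (5.0, 0.0)]
centers = [(0.0, 0.0), (10.0, 10.0), (20.0, 0.0)]
labels, skipped = assign_with_pruning(points, centers)
```

The pruned pass should produce the same labels as the brute-force pass while evaluating fewer point-to-center distances; the saving grows when centers are far apart relative to the cluster radii.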
Multilevel Parallelization of AutoDock 4.2
Background: Virtual (computational) screening is an increasingly important tool for drug discovery. AutoDock is a popular open-source application for performing molecular docking, the prediction of ligand-receptor interactions.
Norgan Andrew P +4 more
Performance Evaluation of a Hybrid Programming Model for RSDFT on T2K Open Supercomputer
Non-uniform memory access (NUMA) systems, where each processor has its own memory, have been a popular platform in high-end computing. While some early studies had reported that a flat-MPI programming model outperformed an OpenMP/MPI hybrid programming ...
Miwako Tsuji, Mitsuhisa Sato
Implementing OpenMP 4.0 for the NVIDIA PTX architecture in GCC compiler
The paper describes the approach used in implementing OpenMP offloading to NVIDIA accelerators in GCC. Offloading refers to a new capability in the OpenMP 4.0 specification update that allows the programmer to specify regions of code that should be executed ...
A. V. Monakov, V. A. Ivanishin
The Parallelization and Optimization of the N-Body Problem using OpenMP and OpenMPI
The focus of this research is exploring efficient ways to implement the N-Body problem. The N-Body problem, in the field of physics, is a problem that predicts or simulates the movements of planets and how they interact with each other ...
Carugati, Nicholas J.
Practical parallelization of scientific applications with OpenMP, OpenACC and MPI
Marco Aldinucci +7 more
The recent surge in high-performance computing (HPC) demands, particularly with the advent of Exascale supercomputers, has highlighted the need for robust parallel systems.
Salwa Saad +4 more
Nonlinear Wave Simulation on the Xeon Phi Knights Landing Processor
We consider a standing wave simulation that is interesting from a computational point of view, obtained by solving coupled 2D perturbed Sine-Gordon equations. We make an OpenMP implementation that explores both thread and SIMD levels of parallelism.
Hristov Ivan +2 more
Defect Detection and Correction in OpenMP: A Static Analysis and Machine Learning-Based Solution
Concurrency defects such as race conditions, deadlocks, and improper synchronization remain a critical challenge in developing reliable OpenMP-based parallel applications.
Norah A. Al-Johany +4 more
Estimating the Potential Speedup of Computer Vision Applications on Embedded Multiprocessors
Computer vision applications constitute one of the key drivers for embedded multicore architectures. Although the number of available cores is increasing in new architectures, designing an application to maximize the utilization of the platform is still ...
Cleyet-Merle, Sébastien +3 more

