Results 1 to 10 of about 173,867 (186)
Parallel Sequential Monte Carlo for Efficient Density Combination: The DeCo MATLAB Toolbox [PDF]
This paper presents the Matlab package DeCo (Density Combination) which is based on the paper by Billio et al. (2013) where a constructive Bayesian approach is presented for combining predictive densities originating from different models or other ...
Casarin, Roberto +3 more
core +8 more sources
In Mathematical Morphology, the max-tree is a region-based representation that encodes the inclusion relationship of the threshold sets of an image. This tree has proved useful in numerous image processing applications. For the last decade, work has led to improving the construction time of this structure; mixing algorithmic optimizations, parallel and
Blin, Nicolas +4 more
openaire +2 more sources
GPU computing for systems biology [PDF]
The development of detailed, coherent, models of complex biological systems is recognized as a key requirement for integrating the increasing amount of experimental data. In addition, in-silico simulation of bio-chemical models provides an easy way to test different experimental conditions, helping in the discovery of the dynamics that regulate ...
L. Dematte, Prandi, Davide
openaire +4 more sources
The graphics processing unit (GPU) has become an integral part oftoday's mainstream computing systems. Over the past six years, therehas been a marked increase in the performance and capabilities ofGPUs. The modern GPU is not only a powerful graphics engine but also ahighly-parallel programmable processor featuring peak arithmetic andmemory bandwidth ...
Owens, John D +5 more
openaire +3 more sources
Air pollution modelling using a graphics processing unit with CUDA [PDF]
The Graphics Processing Unit (GPU) is a powerful tool for parallel computing. In the past years the performance and capabilities of GPUs have increased, and the Compute Unified Device Architecture (CUDA) - a parallel computing architecture - has been ...
Lagzi, Istvan +3 more
core +2 more sources
Computing Treewidth on the GPU
We present a parallel algorithm for computing the treewidth of a graph on a GPU. We implement this algorithm in OpenCL, and experimentally evaluate its performance. Our algorithm is based on an $O^*(2^{n})$-time algorithm that explores the elimination orderings of the graph using a Held-Karp like dynamic programming approach.
van der Zanden, Tom C. +1 more
openaire +5 more sources
Sieve: Stratified GPU-Compute Workload Sampling
To exploit the ever increasing compute capabilities offered by GPU hardware, GPU-compute workloads have evolved from simple computational kernels to large-scale programs with complex software stacks and numerous kernels. Driving architecture exploration using real workloads hence becomes increasingly challenging, up to the point of becoming intractable
Naderan-Tahan, Mahmood +2 more
openaire +2 more sources
Fast calculation of HELAS amplitudes using graphics processing unit (GPU) [PDF]
We use the graphics processing unit (GPU) for fast calculations of helicity amplitudes of physics processes. As our first attempt, we compute $u\bar{u}\to n\gamma$ ($n=2$ to 8) processes in $pp$ collisions at $\sqrt{s} = 14$TeV by transferring the ...
Hagiwara, K. +4 more
core +1 more source
Importance of Explicit Vectorization for CPU and GPU Software Performance
Much of the current focus in high-performance computing is on multi-threading, multi-computing, and graphics processing unit (GPU) computing. However, vectorization and non-parallel optimization techniques, which can often be employed additionally, are ...
Allen +20 more
core +1 more source
For large-scale graph analytics on the GPU, the irregularity of data access and control flow, and the complexity of programming GPUs, have presented two significant challenges to developing a programmable high-performance graph library.
Davidson, Andrew +10 more
core +2 more sources

