Results 51 to 60 of about 11,256 (247)

Implementing implicit OpenMP data sharing on GPUs

open access: yes, 2017
OpenMP is a shared memory programming model which supports the offloading of target regions to accelerators such as NVIDIA GPUs. The implementation in Clang/LLVM aims to deliver a generic GPU compilation toolchain that supports both the native CUDA C/C++
Bataev, Alexey   +8 more
core   +1 more source

Loop Quasi-Invariant Chunk Motion by peeling with statement composition [PDF]

open access: yes, 2017
Several techniques for analysis and transformations are used in compilers. Among them, the peeling of loops for hoisting quasi-invariants can be used to optimize generated code, or simply ease developers' lives.
Moyen, Jean-Yves   +2 more
core   +2 more sources

More Than 25 Years of CRAN

open access: yesAustralian &New Zealand Journal of Statistics, Volume 68, Issue 1, March 2026.
ABSTRACT We discuss the development of the package repository of CRAN, its design principles and how these are put into practice. We provide insights into how the regular and submission checks are organised and the actions resulting from these checks.
Kurt Hornik, Uwe Ligges
wiley   +1 more source

Dynamic compilation of expressions in SQL queries for PostgreSQL

open access: yesТруды Института системного программирования РАН, 2018
In recent years, as performance and capacity of main and external memory grow, performance of database management systems (DBMSes) on certain kinds of queries is more determined by raw CPU speed.
E. Y. Sharygin   +4 more
doaj   +1 more source

Software Verification and Graph Similarity for Automated Evaluation of Students' Assignments [PDF]

open access: yes, 2012
In this paper we promote introducing software verification and control flow graph similarity measurement in automated evaluation of students' programs. We present a new grading framework that merges results obtained by combination of these two approaches
Kuncak, Viktor   +3 more
core   +2 more sources

Learning to Combine Instructions in LLVM Compiler

open access: yesCoRR, 2022
Instruction combiner (IC) is a critical compiler optimization pass, which replaces a sequence of instructions with an equivalent and optimized instruction sequence at basic block level. There can be thousands of instruction-combining patterns which need to be frequently updated as new coding idioms/applications and novel hardware evolve over time. This
Sandya Mannarswamy, Dibyendu Das
openaire   +2 more sources

Multithread Approximation: An OpenMP Constructor

open access: yesConcurrency and Computation: Practice and Experience, Volume 38, Issue 4, February 2026.
ABSTRACT This study introduces an OpenMP construct designed to simplify and unify the integration of approximate computing techniques into shared‐memory parallel programs. Approximate Computing leverages the inherent error tolerance of many applications to trade computational accuracy for gains in performance and energy efficiency.
João Briganti de Oliveira   +2 more
wiley   +1 more source

LLVM-based Static Analysis Tool Using Type and Effect Systems

open access: yesМоделирование и анализ информационных систем, 2011
The intention of this paper is to describe a static analysis tool under development. The principal idea behind the design of this tool is to use type and effect systems for static analysis of real programs. The tool uses LLVM bitcode files as input, thus
M. A. Belyaev, V. A. Tsesko
doaj  

ThreadMonitor: Low‐Overhead Data Race Detection Using Intel Processor Trace

open access: yesConcurrency and Computation: Practice and Experience, Volume 38, Issue 2, January 2026.
ABSTRACT Data races are among the most difficult multithreading bugs to find, due to their non‐deterministic nature. This and the increasing popularity of multithreaded programming have led to the need for practical automated data race detection. In this context, dynamic data race detectors have received more attention, compared to static tools, owing ...
Farzam Dorostkar   +3 more
wiley   +1 more source

goSLP: Globally Optimized Superword Level Parallelism Framework

open access: yes, 2018
Modern microprocessors are equipped with single instruction multiple data (SIMD) or vector instruction sets which allow compilers to exploit superword level parallelism (SLP), a type of fine-grained parallelism.
Amarasinghe, Saman, Mendis, Charith
core   +1 more source

Home - About - Disclaimer - Privacy