Results 11 to 20 of about 364,302 (257)

A Fast Causal Profiler for Task Parallel Programs [PDF]

open access: yesarXiv, 2017
This paper proposes TASKPROF, a profiler that identifies parallelism bottlenecks in task parallel programs. It leverages the structure of a task parallel execution to perform fine-grained attribution of work to various parts of the program. TASKPROF's use of hardware performance counters to perform fine-grained measurements minimizes perturbation ...
Nagarakatte, Santosh, Yoga, Adarsh
arxiv   +3 more sources

TAPP: DNN Training for Task Allocation through Pipeline Parallelism Based on Distributed Deep Reinforcement Learning

open access: yesApplied Sciences, 2021
The rapid development of artificial intelligence technology has made deep neural networks (DNNs) widely used in various fields. DNNs have been continuously growing in order to improve the accuracy and quality of the models.
Yingchi Mao   +4 more
doaj   +1 more source

Toward Efficient Similarity Search under Edit Distance on Hybrid Architectures

open access: yesInformation, 2022
Edit distance is the most widely used method to quantify similarity between two strings. We investigate the problem of similarity search under edit distance.
Madiha Khalid   +2 more
doaj   +1 more source

Characterizing task-based OpenMP programs. [PDF]

open access: yesPLoS ONE, 2015
Programmers struggle to understand performance of task-based OpenMP programs since profiling tools only report thread-based performance. Performance tuning also requires task-based performance in order to balance per-task memory hierarchy utilization ...
Ananya Muddukrishna   +2 more
doaj   +1 more source

Resolving ambiguous polarity stripping ellipsis structures in Persian

open access: yesGlossa, 2021
Previous studies have shown that English speakers use a range of factors including locality, information structure, and semantic parallelism to interpret clausal ellipsis structures. Yet, the relative importance of each factor is currently underexplored.
Jesse Harris, Vahideh Rasekhi
doaj   +2 more sources

Generalized Task Parallelism [PDF]

open access: yesACM Transactions on Architecture and Code Optimization, 2015
Existing approaches to automatic parallelization produce good results in specific domains. Yet, it is unclear how to integrate their individual strengths to match the demands and opportunities of complex software. This lack of integration has both practical reasons, as integrating those largely differing approaches into one compiler would impose an ...
Sebastian Hack   +4 more
openaire   +2 more sources

Dps-MuSyQ: A Distributed Parallel Processing System for Multi-Source Data Synergized Quantitative Remote Sensing Products Producing

open access: yesIEEE Access, 2020
With the development of earth observation technologies and the construction of earth observation systems, an increasing amount of remote sensing data are being obtained. These provide the datasets required for research on remote sensing monitoring across
Hongyi Li, Ping Tang
doaj   +1 more source

Optimizing Iterative Data-Flow Scientific Applications Using Directed Cyclic Graphs

open access: yesIEEE Access, 2023
Data-flow programming models have become a popular choice for writing parallel applications as an alternative to traditional work-sharing parallelism. They are better suited to write applications with irregular parallelism that can present load imbalance.
David Alvarez, Vicenc Beltran
doaj   +1 more source

Task Parallelism-Aware Deep Neural Network Scheduling on Multiple Hybrid Memory Cube-Based Processing-in-Memory

open access: yesIEEE Access, 2021
Processing-in-memory (PIM) comprises computational logic in the memory domain. It is the most promising solution to alleviate the memory bandwidth problem in deep neural network (DNN) processing.
Young Sik Lee, Tae Hee Han
doaj   +1 more source

Automatic Inference of Task Parallelism in Task-Graph-Based Actor Models

open access: yesIEEE Access, 2018
Automatic inference of task level parallelism is fundamental for ensuring many kinds of safety and liveness properties of parallel applications. For example, two tasks running in parallel may be involved in data races when they have conflicting memory ...
Abu Naser Masud   +2 more
doaj   +1 more source

Home - About - Disclaimer - Privacy