Scheduling Strategies for Mixed Data and Task Parallelism on Heterogeneous Clusters [PDF]
Olivier Beaumont+3 more
openalex +1 more source
High-level data-access analysis for characterisation of (sub)task-level parallelism in Java [PDF]
Richard Stahl+4 more
openalex +1 more source
Parallel Sorted Neighborhood Blocking with MapReduce [PDF]
Cloud infrastructures enable the efficient parallel execution of data-intensive tasks such as entity resolution on large datasets. We investigate challenges and possible solutions of using the MapReduce programming model for parallel entity resolution.
arxiv
On Mapping N-Dimensional Data-Parallelism Efficiently into GPU-Thread-Spaces [PDF]
Niek Janssen, Sven-Bodo Scholz
openalex +1 more source
An Efficient Parallel Data Clustering Algorithm Using Isoperimetric Number of Trees [PDF]
We propose a parallel graph-based data clustering algorithm using CUDA GPU, based on exact clustering of the minimum spanning tree in terms of a minimum isoperimetric criterion. We also provide a comparative performance analysis of our algorithm with other related ones which demonstrates the general superiority of this parallel algorithm over other ...
arxiv
Parallel processing and expert systems [PDF]
Whether it be monitoring the thermal subsystem of Space Station Freedom, or controlling the navigation of the autonomous rover on Mars, NASA missions in the 90's cannot enjoy an increased level of autonomy without the efficient use of expert systems ...
Lau, Sonie, Yan, Jerry C.
core +1 more source
Dynamically tuning level of parallelism in wide area data transfers [PDF]
Esma Yildirim+2 more
openalex +1 more source
Accelerating Large Language Model Training with 4D Parallelism and Memory Consumption Estimator [PDF]
In large language model (LLM) training, several parallelization strategies, including Tensor Parallelism (TP), Pipeline Parallelism (PP), Data Parallelism (DP), as well as Sequence Parallelism (SP) and Context Parallelism (CP), are employed to distribute model parameters, activations, and optimizer states across devices.
arxiv
Hierarchical Place Trees: A Portable Abstraction for Task Parallelism and Data Movement [PDF]
Yonghong Yan+3 more
openalex +1 more source
Feed-forward volume rendering algorithm for moderately parallel MIMD machines [PDF]
Algorithms for direct volume rendering on parallel and vector processors are investigated. Volumes are transformed efficiently on parallel processors by dividing the data into slices and beams of voxels.
Yagel, Roni
core +1 more source