Results 51 to 60 of about 21,441,590 (245)
Exploiting the parallelism in multiprocessor systems is a major challenge in modern computer science. Multicore programming demands a change in the way we design and use fundamental data structures.
Damian Dechev+2 more
doaj +1 more source
UPIR: Toward the Design of Unified Parallel Intermediate Representation for Parallel Programming Models [PDF]
The complexity of heterogeneous computing architectures, as well as the demand for productive and portable parallel application development, have driven the evolution of parallel programming models to become more comprehensive and complex than before.
arxiv
A Parallel Random Forest Algorithm for Big Data in a Spark Cloud Computing Environment [PDF]
With the emergence of the big data age, the issue of how to obtain valuable knowledge from a dataset efficiently and accurately has attracted increasingly attention from both academia and industry. This paper presents a Parallel Random Forest (PRF) algorithm for big data on the Apache Spark platform.
arxiv +1 more source
Collective Communication Performance Evaluation for Distributed Deep Learning Training
In distributed deep learning, the improper use of the collective communication library can lead to a decline in deep learning performance due to increased communication time.
Sookwang Lee, Jaehwan Lee
doaj +1 more source
A case for merging the ILP and DLP paradigms [PDF]
The goal of this paper is to show that instruction level parallelism (ILP) and data-level parallelism (DLP) can be merged in a single architecture to execute vectorizable code at a performance level that can not be achieved using either paradigm on its ...
Espasa Sans, Roger+2 more
core +1 more source
Efficient electro-magnetic analysis of a GPU bitsliced AES implementation
The advent of CUDA-enabled GPU makes it possible to provide cloud applications with high-performance data security services. Unfortunately, recent studies have shown that GPU-based applications are also susceptible to side-channel attacks.
Yiwen Gao, Yongbin Zhou, Wei Cheng
doaj +1 more source
Milimili. Collecting Parallel Data via Crowdsourcing [PDF]
We present a methodology for gathering a parallel corpus through crowdsourcing, which is more cost-effective than hiring professional translators, albeit at the expense of quality. Additionally, we have made available experimental parallel data collected for Chechen-Russian and Fula-English language pairs.
arxiv
Exploring Various Levels of Parallelism in High-Performance CRC Algorithms
Modern processors have increased the capabilities of instruction-level parallelism (ILP) and thread-level parallelism (TLP). These resources, however, typically exhibit poor utilization on conventional cyclic redundancy check (CRC) algorithms.
Mucong Chi, Dazhong He, Jun Liu
doaj +1 more source
Impact of Design Decisions on Performance of Embarrassingly Parallel .NET Database Application
The implementation of parallel applications is always a challenge. It embraces many distinctive design decisions that are to be taken. The paper presents issues of parallel processing with use of .NET applications and popular Database Management Systems (
Piotr Karwaczyński+6 more
doaj +1 more source
A Parallel Multiobjective PSO Weighted Average Clustering Algorithm Based on Apache Spark
Multiobjective clustering algorithm using particle swarm optimization has been applied successfully in some applications. However, existing algorithms are implemented on a single machine and cannot be directly parallelized on a cluster, which makes it ...
Huidong Ling+5 more
doaj +1 more source