Acceleration of Image Classification and Object Tracking by the Intel Neural Compute Stick 2 with Power Efficiency Evaluation on Raspberry Pi 4B. [PDF]
Gao T, Suto J.
europepmc +1 more source
Parallel Lossless Compression of Raw Bayer Images on FPGA-Based High-Speed Camera. [PDF]
Regoršek Ž, Gorkič A, Trost A.
europepmc +1 more source
Multiscale regional calibration network for crowd counting. [PDF]
Yu J, Hu H.
europepmc +1 more source
Efficient nonlinear function approximation in analog resistive crossbars for recurrent neural networks. [PDF]
Yang J+11 more
europepmc +1 more source
Related searches:
Stream computing is often associated with regular, data-intensive applications, and more specifically with the family of cyclo-static data-flow models. The term also refers to bulk-synchronous data parallelism on SIMD architectures. Both interpretations are valid but incomplete: streams underline the formal definition of Kahn process networks, a ...
Albert Cohen
semanticscholar +4 more sources
Isolation for nested task parallelism
Proceedings of the 2013 ACM SIGPLAN international conference on Object oriented programming systems languages & applications, 2013Isolation--the property that a task can access shared data without interference from other tasks--is one of the most basic concerns in parallel programming. Whilethere is a large body of past work on isolated task-parallelism, the integration of isolation, task-parallelism, and nesting of tasks has been a difficult and unresolved challenge. In this pa-
Jisheng Zhao+4 more
semanticscholar +3 more sources
Task-Parallel Programming with Constrained Parallelism
IEEE Conference on High Performance Extreme Computing, 2022Task graph programming model (TGPM) has become central to a wide range of scientific computing applications because it enables top-down optimization of parallelism that governs the macro-scale performance.
Tsung-Wei Huang, L. Hwang
semanticscholar +1 more source
Efficiently Supporting Dynamic Task Parallelism on Heterogeneous Cache-Coherent Systems
International Symposium on Computer Architecture, 2020Manycore processors, with tens to hundreds of tiny cores but no hardware-based cache coherence, can offer tremendous peak throughput on highly parallel programs while being complexity and energy efficient.
Moyang Wang, T. Ta, Lin Cheng, C. Batten
semanticscholar +1 more source
A System for Fast and Scalable Point Cloud Indexing Using Task Parallelism
Smart Tools and Applications in Graphics, 2020We introduce a system for fast, scalable indexing of arbitrarily sized point clouds based on a task-parallel computation model. Points are sorted using Morton indices in order to efficiently distribute sets of related points onto multiple concurrent ...
P. Bormann, Michel Krämer
semanticscholar +1 more source
GPU-based Collaborative Filtering Recommendation System using Task parallelism approach
2018 2nd International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC)I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC), 2018 2nd International Conference on, 2018Collaborative filtering is one among the top most preferred techniques when implementing recommendation systems. In recent times, more interest has turned towards parallel GPU-based implementation of collaborative filtering algorithms.
N. Sivaramakrishnan, V. Subramaniyaswamy
semanticscholar +1 more source