Some of the following articles may not be open access.

Optimizing parallel GEMM routines using auto-tuning with Intel AVX-512

Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region, 2019
This paper presents the optimal implementations of single- and double-precision general matrix-matrix multiplication (GEMM) routines for the Intel Xeon Phi Processor code-named Knights Landing (KNL) and the Intel Xeon Scalable Processors based on an auto-tuning approach with the Intel AVX-512 intrinsic functions.
Raehyun Kim, Jaeyoung Choi, Myungho Lee

Lightweight Deep Learning Applications on AVX-512

2021 IEEE Symposium on Computers and Communications (ISCC), 2021
André Ramos Carneiro   +2 more

Vectorized Parallel Sparse Matrix-Vector Multiplication in PETSc Using AVX-512

Proceedings of the 47th International Conference on Parallel Processing, 2018
Emerging many-core CPU architectures with high degrees of single-instruction, multiple data (SIMD) parallelism promise to enable increasingly ambitious simulations based on partial differential equations (PDEs) via extreme-scale computing. However, such architectures present several challenges to their efficient use.
Hong Zhang +3 more

Finite Field Arithmetic Using AVX-512 For Isogeny-Based Cryptography

Anais do XVIII Simpósio Brasileiro de Segurança da Informação e de Sistemas Computacionais (SBSeg 2018), 2018
Isogeny-based cryptography introduces new candidates to quantum-resistant cryptographic protocols. The cost of finite field arithmetic dominates the cost of isogeny-based cryptosystems. In this work, we apply AVX-512 vector instructions to accelerate the finite field modular multiplication.
Gabriell Orisaka   +2 more

Vectorization of Flat Loops of Arbitrary Structure Using Instructions AVX-512

Lobachevskii Journal of Mathematics, 2021
G I Savin, A A Rybakov, S S Shumilin

Multi-buffer AVX-512 accelerated parallelization of CBCS common encryption mode

Proceedings of the 1st Mile-High Video Conference, 2022
Marcel Cornu   +7 more

Approximate String Searching with AVX2 and AVX-512

2023
Chhabra, Tamanna   +3 more
