Results 151 to 160 of about 53,837 (168)
Some of the next articles are maybe not open access.
Optimizing parallel GEMM routines using auto-tuning with Intel AVX-512
Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region, 2019This paper presents the optimal implementations of single- and double-precision general matrix-matrix multiplication (GEMM) routines for the Intel Xeon Phi Processor code-named Knights Landing (KNL) and the Intel Xeon Scalable Processors based on an auto-tuning approach with the Intel AVX-512 intrinsic functions.
Raehyun Kim, Jaeyoung Choi, Myungho Lee
openaire +1 more source
Lightweight Deep Learning Applications on AVX-512
2021 IEEE Symposium on Computers and Communications (ISCC), 2021André Ramos Carneiro +2 more
openaire +1 more source
Vectorized Parallel Sparse Matrix-Vector Multiplication in PETSc Using AVX-512
Proceedings of the 47th International Conference on Parallel Processing, 2018Emerging many-core CPU architectures with high degrees of single-instruction, multiple data (SIMD) parallelism promise to enable increasingly ambitious simulations based on partial differential equations (PDEs) via extreme-scale computing. However, such architectures present several challenges to their efficient use.
Hong Zhang 0006 +3 more
openaire +1 more source
Finite Field Arithmetic Using AVX-512 For Isogeny-Based Cryptography
Anais do XVIII Simpósio Brasileiro de Segurança da Informação e de Sistemas Computacionais (SBSeg 2018), 2018Isogeny-based cryptography introduces new candidates to quantum-resistant cryptographic protocols. The cost of finite field arithmetic dominates the cost of isogeny-based cryptosystems. In this work, we apply AVX-512 vector instructions to accelerate the finite field modular multiplication.
Gabriell Orisaka +2 more
openaire +1 more source
Vectorization of Flat Loops of Arbitrary Structure Using Instructions AVX-512
Lobachevskii Journal of Mathematics, 2021G I Savin, A A Rybakov, S S Shumilin
exaly
Multi-buffer AVX-512 accelerated parallelization of CBCS common encryption mode
Proceedings of the 1st Mile-High Video Conference, 2022Marcel Cornu +7 more
openaire +1 more source
Approximate String Searching with AVX2 and AVX-512
2023Chhabra, Tamanna +3 more
openaire +1 more source

