NVIDIA Tensor Core Programmability, Performance & Precision [PDF]
The NVIDIA Volta GPU microarchitecture introduces a specialized unit, called "Tensor Core" that performs one matrix-multiply-and-accumulate on 4x4 matrices per clock cycle.
Der Chien, Steven Wei +4 more
core +4 more sources
Numerical behavior of NVIDIA tensor cores [PDF]
We explore the floating-point arithmetic implemented in the NVIDIA tensor cores, which are hardware accelerators for mixed-precision matrix multiplication available on the Volta, Turing, and Ampere microarchitectures.
Massimiliano Fasi +3 more
doaj +5 more sources
Accelerating genomic workflows using NVIDIA Parabricks
Background: As genome sequencing becomes better integrated into scientific research, government policy, and personalized medicine, the primary challenge for researchers is shifting from generating raw data to analyzing these vast datasets.
Kyle A. O’Connell +10 more
doaj +4 more sources
Nvidia Hopper GPU and Grace CPU Highlights
At GTC 2022, Nvidia announced a new product family that aims to cover from small enterprise workloads through exascale high performance computing (HPC) and trillion-parameter AI models.
Anne C Elster
exaly +2 more sources
A Deep Learning Framework Performance Evaluation to Use YOLO in Nvidia Jetson Platform
Deep learning-based object detection technology can efficiently infer results by utilizing graphics processing units (GPU). However, when using general deep learning frameworks in embedded systems and mobile devices, processing functionality is limited.
Dong-Jin Shin, Kim Jeong Joon
exaly +2 more sources
All inputs are required for excellent and proper crop production, especially seed quality. In this way fewer disease and insect issues, increased seedling germination, uniform plant population and maturity, and better responsiveness to fertilizers and ...
Abdullah Beyaz, Zülfi SARIPINAR
exaly +2 more sources
Germanium-on-Silicon Waveguide-Integrated Photodiode with Dual Optical Inputs for Datacenter Applications [PDF]
As the exponential growth in advanced compute workloads drives intra-datacenter interconnects to ever increasing bitrates, optical networking equipment has risen to the challenge by shifting from NRZ signaling to bandwidth efficient modulation methods ...
Itamar-Mano Priel +3 more
doaj +2 more sources
NVIDIA FLARE: Federated Learning from Simulation to Real-World [PDF]
Federated learning (FL) enables building robust and generalizable AI models by leveraging diverse datasets from multiple collaborators without centralizing the data.
H. Roth +22 more
semanticscholar +1 more source
Towards Building AI-CPS with NVIDIA Isaac Sim: An Industrial Benchmark and Case Study for Robotics Manipulation [PDF]
As a representative cyber-physical system (CPS), robotic manipulators have been widely adopted in various academic research and industrial processes, indicating their potential to act as a universal interface between the cyber and the physical worlds ...
Zhehua Zhou +7 more
semanticscholar +1 more source
Accurate and Convenient Energy Measurements for GPUs: A Detailed Study of NVIDIA GPU’s Built-In Power Sensor [PDF]
GPU has emerged as the go-to accelerator for HPC workloads, however its power consumption has become a major limiting factor for further scaling HPC systems. An accurate understanding of GPU power consumption is essential for further improving its energy ...
Zeyu Yang, Karel Adámek, Wesley Armour
semanticscholar +1 more source

