Results 81 to 90 of about 308,238 (274)
Advances in GPU architecture for deep learning and scientific computing [PDF]
The talk will cover the recent NVIDIA product announcements made at the GTC'16 conference, and how the Pascal GPU and NVLink interconnect technologies greatly improve multi-GPU performance and efficiency in deep learning and scientific computing ...
Parienté, Frédéric
core
A Flexible Patch-Based Lattice Boltzmann Parallelization Approach for Heterogeneous GPU-CPU Clusters
Sustaining a large fraction of single GPU performance in parallel computations is considered to be the major problem of GPU-based clusters. In this article, this topic is addressed in the context of a lattice Boltzmann flow solver that is integrated in ...
Aidun +23 more
core +1 more source
A compact handheld GelSight probe reconstructs in vivo 3‐D skin topography with micron‐level precision using a custom elastic gel and a learning‐based surface normal to height map pipeline. The device quantifies wrinkle depth across various body locations and detects changes in wrinkle depth following moisturizer application.
Akhil Padmanabha +12 more
wiley +1 more source
This study evaluates 3D‐printed recombinant spider silk hydrogel eADF4(C16)‐RGD in a rat AV loop model for tissue engineering. Constructs with T17b endothelial progenitor cells showed enhanced vascularization and biodegradation. Results highlight the importance of scaffold design and cellular integration in improving vascular density and overall ...
Claire M. Weinhold +9 more
wiley +1 more source
Importance of Explicit Vectorization for CPU and GPU Software Performance
Much of the current focus in high-performance computing is on multi-threading, multi-computing, and graphics processing unit (GPU) computing. However, vectorization and non-parallel optimization techniques, which can often be employed additionally, are ...
Allen +20 more
core +1 more source
Real‐Time 3D Ultrasound Imaging with an Ultra‐Sparse, Low Power Architecture
This article presents a novel, ultra‐sparse ultrasound architecture that paves the way for wearable real‐time 3D imaging. By integrating a unique convolutional array with chirped data acquisition, the system achieves high‐resolution volumetric scans at a fraction of the power and hardware complexity.
Colin Marcus +9 more
wiley +1 more source
A Bitslice Implementation of Anderson’s Attack on A5/1
The A5/1 keystream generator is a part of Global System for Mobile Communications (GSM) protocol, employed in cellular networks all over the world. Its cryptographic resistance was extensively analyzed in dozens of papers.
Bulavintsev Vadim +3 more
doaj +1 more source
An Efficient Cell List Implementation for Monte Carlo Simulation on GPUs [PDF]
Maximizing the performance potential of the modern day GPU architecture requires judicious utilization of available parallel resources. Although dramatic reductions can often be obtained through straightforward mappings, further performance improvements ...
Hailat, Eyad +4 more
core
Screen gate‐based transistors are presented, enabling tunable analog sigmoid and Gaussian activations. The SA‐transistor improves MRI classification accuracy, while the GA‐transistor supports precise Gaussian kernel tuning for forecasting. Both functions are implemented in a single device, offering compact, energy‐efficient analog AI processing ...
Junhyung Cho +9 more
wiley +1 more source
Magnetic tunnel junctions (MTJs) using MgO tunnel barriers face challenges of high resistance‐area product and low tunnel magnetoresistance (TMR). To discover alternative materials, Literature Enhanced Ab initio Discovery (LEAD) is developed. The LEAD‐predicted materials are theoretically evaluated, showing that MTJs with dusting of ScN or TiN on ...
Sabiq Islam +6 more
wiley +1 more source

