Efficient Memory Management for Large Language Model Serving with PagedAttention [PDF]
High throughput serving of large language models (LLMs) requires batching sufficiently many requests at a time. However, existing systems struggle because the key-value cache (KV cache) memory for each request is huge and grows and shrinks dynamically ...
Woosuk Kwon +8 more
semanticscholar +1 more source
Immunological memory to SARS-CoV-2 assessed for up to 8 months after infection
Variable memory Immune memory against severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) helps to determine protection against reinfection, disease risk, and vaccine efficacy.
J. Dan +20 more
semanticscholar +1 more source
XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model [PDF]
We present XMem, a video object segmentation architecture for long videos with unified feature memory stores inspired by the Atkinson-Shiffrin memory model. Prior work on video object segmentation typically only uses one type of feature memory.
Ho Kei Cheng, A. Schwing
semanticscholar +1 more source
Ultrafast and memory-efficient alignment of short DNA sequences to the human genome
Bowtie is an ultrafast, memory-efficient alignment program for aligning short DNA sequence reads to large genomes. For the human genome, Burrows-Wheeler indexing allows Bowtie to align more than 25 million reads per CPU hour with a memory footprint of ...
Ben Langmead +3 more
semanticscholar +1 more source
Memorizing Normality to Detect Anomaly: Memory-Augmented Deep Autoencoder for Unsupervised Anomaly Detection [PDF]
Deep autoencoder has been extensively used for anomaly detection. Training on the normal data, the autoencoder is expected to produce higher reconstruction error for the abnormal inputs than the normal ones, which is adopted as a criterion for ...
Dong Gong +6 more
semanticscholar +1 more source
The Banks of the Cohomology River [PDF]
We give sharp bounds on the vanishing of the cohomology of a tensor product of vector bundles on the n-dimensional projective space in terms of the vanishing of the cohomology of the factors.
David Eisenbud +3 more
core +1 more source
Conformally equivariant quantization: Existence and uniqueness [PDF]
We prove the existence and the uniqueness of a conformally equivariant symbol calculus and quantization on any conformally flat pseudo-Riemannian manifold $(M,\rg)$.
André Lichnerowicz +5 more
core +5 more sources
SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler
BackgroundThere is a rapidly increasing amount of de novo genome assembly using next-generation sequencing (NGS) short reads; however, several big challenges remain to be overcome in order for this to be efficient and accurate.
Ruibang Luo +29 more
semanticscholar +1 more source
MemNet: A Persistent Memory Network for Image Restoration [PDF]
Recently, very deep convolutional neural networks (CNNs) have been attracting considerable attention in image restoration. However, as the depth grows, the longterm dependency problem is rarely realized for these very deep models, which results in the ...
Ying Tai +3 more
semanticscholar +1 more source
Measure and integral : new foundations after one hundred years [PDF]
The present article aims to describe the main ideas and developments in the theory of measure and integral in the course and at the end of the first century of its ...
Dedicated Memory +1 more
core +2 more sources

