Results 251 to 260 of about 114,287 (303)
Some of the next articles are maybe not open access.

Fit and Prune: Fast and Training-free Visual Token Pruning for Multi-modal Large Language Models

AAAI Conference on Artificial Intelligence
Recent progress in Multimodal Large Language Models (MLLMs) often use large image tokens to compensate the visual shortcoming of MLLMs, which not only exhibits obvious redundancy but also greatly exacerbates the already high computation. Token pruning is
Weihao Ye   +3 more
semanticscholar   +1 more source

Boosting spatial pruning

Proceedings of the 2010 ACM SIGMOD International Conference on Management of data, 2010
Fast query processing of complex objects, e.g. spatial or uncertain objects, depends on efficient spatial pruning of objects' approximations, which are typically minimum bounding rectangles (MBRs). In this paper, we propose a novel effective and efficient criterion to determine the spatial topology between multi-dimensional rectangles.
Tobias Emrich   +4 more
openaire   +1 more source

LLM Pruning and Distillation in Practice: The Minitron Approach

arXiv.org
We present a comprehensive report on compressing the Llama 3.1 8B and Mistral NeMo 12B models to 4B and 8B parameters, respectively, using pruning and distillation.
Sharath Turuvekere Sreenivas   +8 more
semanticscholar   +1 more source

Beyond Text-Visual Attention: Exploiting Visual Cues for Effective Token Pruning in VLMs

IEEE International Conference on Computer Vision
Large Vision-Language Models (LVLMs) generally contain significantly more visual tokens than their textual counterparts, resulting in a considerable computational burden.
Qizhe Zhang   +8 more
semanticscholar   +1 more source

Shortened LLaMA: A Simple Depth Pruning for Large Language Models

arXiv.org
Structured pruning of modern large language models (LLMs) has emerged as a way of decreasing their high computational needs. Width pruning reduces the size of projection weight matrices (e.g., by removing attention heads) while maintaining the number of ...
Bo-Kyeong Kim   +6 more
semanticscholar   +1 more source

Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models

International Conference on Learning Representations
In this work, we investigate whether small language models can determine high-quality subsets of large-scale text datasets that improve the performance of larger language models.
Zachary Ankner   +5 more
semanticscholar   +1 more source

A guide to successful pruning. Pruning shrubs

2014
Discusses different pruning techniques for shrubs.
French, Sue (Sue C.)   +1 more
openaire   +2 more sources

ATP-LLaVA: Adaptive Token Pruning for Large Vision Language Models

Computer Vision and Pattern Recognition
Large Vision Language Models (LVLMs) have achieved significant success across multi-modal tasks. However, the computational cost of processing long visual tokens can be prohibitively expensive on resource-limited devices. Previous methods have identified
Xubing Ye   +4 more
semanticscholar   +1 more source

Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for Large Language Models

International Conference on Machine Learning
Despite the remarkable capabilities, Large Language Models (LLMs) face deployment challenges due to their extensive size. Pruning methods drop a subset of weights to accelerate, but many of them require retraining, which is prohibitively expensive and ...
Peijie Dong   +6 more
semanticscholar   +1 more source

BESA: Pruning Large Language Models with Blockwise Parameter-Efficient Sparsity Allocation

International Conference on Learning Representations
Large language models (LLMs) have demonstrated outstanding performance in various tasks, such as text summarization, text question-answering, and etc.
Peng Xu   +8 more
semanticscholar   +1 more source

Home - About - Disclaimer - Privacy