Results 1 to 10 of about 14,169,777 (290)
Photosynthesis in a Changing Global Climate: Scaling Up and Scaling Down in Crops
Photosynthesis is the major process leading to primary production in the Biosphere. There is a total of 7000bn tons of CO2 in the atmosphere and photosynthesis fixes more than 100bn tons annually.
Marouane Baslam+7 more
doaj +2 more sources
Swin Transformer V2: Scaling Up Capacity and Resolution [PDF]
We present techniques for scaling Swin Transformer [35] up to 3 billion parameters and making it capable of training with images of up to 1,536x1,536 resolution.
Ze Liu+11 more
semanticscholar +1 more source
Intern VL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks [PDF]
The exponential growth of large language models (LLMs) has opened up numerous possibilities for multi-modal AGI systems. However, the progress in vision and vision-language foundation models, which are also critical elements of multi-modal AGI, has not ...
Zhe Chen+14 more
semanticscholar +1 more source
Scaling up GANs for Text-to-Image Synthesis [PDF]
The recent success of text-to-image synthesis has taken the world by storm and captured the general public's imagination. From a technical standpoint, it also marked a drastic change in the favored architecture to design generative image models.
Minguk Kang+6 more
semanticscholar +1 more source
OpenShape: Scaling Up 3D Shape Representation Towards Open-World Understanding [PDF]
We introduce OpenShape, a method for learning multi-modal joint representations of text, image, and point clouds. We adopt the commonly used multi-modal contrastive learning framework for representation alignment, but with a specific focus on scaling up ...
Minghua Liu+8 more
semanticscholar +1 more source
PaLI-X: On Scaling up a Multilingual Vision and Language Model [PDF]
We present the training recipe and results of scaling up PaLI-X, a multilingual vision and language model, both in terms of size of the components and the breadth of its training task mixture.
Xi Chen+42 more
semanticscholar +1 more source
Scaling Up Your Kernels to 31×31: Revisiting Large Kernel Design in CNNs [PDF]
We revisit large kernel design in modern convolutional neural networks (CNNs). Inspired by recent advances in vision transformers (ViTs), in this paper, we demonstrate that using a few large convolutional kernels instead of a stack of small kernels could
Xiaohan Ding+5 more
semanticscholar +1 more source
SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation [PDF]
Expressive human pose and shape estimation (EHPS) unifies body, hands, and face motion capture with numerous applications. Despite encouraging progress, current state-of-the-art methods still depend largely on a confined set of training datasets. In this
Zhongang Cai+12 more
semanticscholar +1 more source
Scaling Up and Distilling Down: Language-Guided Robot Skill Acquisition [PDF]
We present a framework for robot skill acquisition, which 1) efficiently scale up data generation of language-labelled robot data and 2) effectively distills this data down into a robust multi-task language-conditioned visuo-motor policy. For (1), we use
Huy Ha, Peter R. Florence, Shuran Song
semanticscholar +1 more source
SVIT: Scaling up Visual Instruction Tuning [PDF]
Thanks to the emerging of foundation models, the large language and vision models are integrated to acquire the multimodal ability of visual captioning, question answering, etc. Although existing multimodal models present impressive performance of visual
Bo Zhao, Boya Wu, Tiejun Huang
semanticscholar +1 more source