Results 1 to 10 of about 14,169,777 (290)

Photosynthesis in a Changing Global Climate: Scaling Up and Scaling Down in Crops

open access: yesFrontiers in Plant Science, 2020
Photosynthesis is the major process leading to primary production in the Biosphere. There is a total of 7000bn tons of CO2 in the atmosphere and photosynthesis fixes more than 100bn tons annually.
Marouane Baslam   +7 more
doaj   +2 more sources

Swin Transformer V2: Scaling Up Capacity and Resolution [PDF]

open access: yesComputer Vision and Pattern Recognition, 2021
We present techniques for scaling Swin Transformer [35] up to 3 billion parameters and making it capable of training with images of up to 1,536x1,536 resolution.
Ze Liu   +11 more
semanticscholar   +1 more source

Intern VL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks [PDF]

open access: yesComputer Vision and Pattern Recognition, 2023
The exponential growth of large language models (LLMs) has opened up numerous possibilities for multi-modal AGI systems. However, the progress in vision and vision-language foundation models, which are also critical elements of multi-modal AGI, has not ...
Zhe Chen   +14 more
semanticscholar   +1 more source

Scaling up GANs for Text-to-Image Synthesis [PDF]

open access: yesComputer Vision and Pattern Recognition, 2023
The recent success of text-to-image synthesis has taken the world by storm and captured the general public's imagination. From a technical standpoint, it also marked a drastic change in the favored architecture to design generative image models.
Minguk Kang   +6 more
semanticscholar   +1 more source

OpenShape: Scaling Up 3D Shape Representation Towards Open-World Understanding [PDF]

open access: yesNeural Information Processing Systems, 2023
We introduce OpenShape, a method for learning multi-modal joint representations of text, image, and point clouds. We adopt the commonly used multi-modal contrastive learning framework for representation alignment, but with a specific focus on scaling up ...
Minghua Liu   +8 more
semanticscholar   +1 more source

PaLI-X: On Scaling up a Multilingual Vision and Language Model [PDF]

open access: yesarXiv.org, 2023
We present the training recipe and results of scaling up PaLI-X, a multilingual vision and language model, both in terms of size of the components and the breadth of its training task mixture.
Xi Chen   +42 more
semanticscholar   +1 more source

Scaling Up Your Kernels to 31×31: Revisiting Large Kernel Design in CNNs [PDF]

open access: yesComputer Vision and Pattern Recognition, 2022
We revisit large kernel design in modern convolutional neural networks (CNNs). Inspired by recent advances in vision transformers (ViTs), in this paper, we demonstrate that using a few large convolutional kernels instead of a stack of small kernels could
Xiaohan Ding   +5 more
semanticscholar   +1 more source

SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation [PDF]

open access: yesNeural Information Processing Systems, 2023
Expressive human pose and shape estimation (EHPS) unifies body, hands, and face motion capture with numerous applications. Despite encouraging progress, current state-of-the-art methods still depend largely on a confined set of training datasets. In this
Zhongang Cai   +12 more
semanticscholar   +1 more source

Scaling Up and Distilling Down: Language-Guided Robot Skill Acquisition [PDF]

open access: yesConference on Robot Learning, 2023
We present a framework for robot skill acquisition, which 1) efficiently scale up data generation of language-labelled robot data and 2) effectively distills this data down into a robust multi-task language-conditioned visuo-motor policy. For (1), we use
Huy Ha, Peter R. Florence, Shuran Song
semanticscholar   +1 more source

SVIT: Scaling up Visual Instruction Tuning [PDF]

open access: yesarXiv.org, 2023
Thanks to the emerging of foundation models, the large language and vision models are integrated to acquire the multimodal ability of visual captioning, question answering, etc. Although existing multimodal models present impressive performance of visual
Bo Zhao, Boya Wu, Tiejun Huang
semanticscholar   +1 more source

Home - About - Disclaimer - Privacy