Results 1 to 10 of about 11,979,430 (376)

Whole issue

open access: yesLanguage Value, 2019
Table of Contents From the editors Carme Manuel Cuenca Articles  Diasporic dialogues: The role of gender, language, and revision in neo-slave narrative.
Language Value
doaj   +14 more sources

BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models [PDF]

open access: yesInternational Conference on Machine Learning, 2023
The cost of vision-and-language pre-training has become increasingly prohibitive due to end-to-end training of large-scale models. This paper proposes BLIP-2, a generic and efficient pre-training strategy that bootstraps vision-language pre-training from
Junnan Li   +3 more
semanticscholar   +1 more source

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding [PDF]

open access: yesNeural Information Processing Systems, 2022
We present Imagen, a text-to-image diffusion model with an unprecedented degree of photorealism and a deep level of language understanding. Imagen builds on the power of large transformer language models in understanding text and hinges on the strength ...
Chitwan Saharia   +13 more
semanticscholar   +1 more source

InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning [PDF]

open access: yesNeural Information Processing Systems, 2023
Large-scale pre-training and instruction tuning have been successful at creating general-purpose language models with broad competence. However, building general-purpose vision-language models is challenging due to the rich input distributions and task ...
Wenliang Dai   +8 more
semanticscholar   +1 more source

A Survey of Large Language Models [PDF]

open access: yesarXiv.org, 2023
Language is essentially a complex, intricate system of human expressions governed by grammatical rules. It poses a significant challenge to develop capable AI algorithms for comprehending and grasping a language.
Wayne Xin Zhao   +21 more
semanticscholar   +1 more source

MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models [PDF]

open access: yesInternational Conference on Learning Representations, 2023
The recent GPT-4 has demonstrated extraordinary multi-modal abilities, such as directly generating websites from handwritten text and identifying humorous elements within images.
Deyao Zhu   +4 more
semanticscholar   +1 more source

Scaling Instruction-Finetuned Language Models [PDF]

open access: yesJournal of machine learning research, 2022
Finetuning language models on a collection of datasets phrased as instructions has been shown to improve model performance and generalization to unseen tasks.
Hyung Won Chung   +31 more
semanticscholar   +1 more source

PaLM-E: An Embodied Multimodal Language Model [PDF]

open access: yesInternational Conference on Machine Learning, 2023
Large language models excel at a wide range of complex tasks. However, enabling general inference in the real world, e.g., for robotics problems, raises the challenge of grounding.
Danny Driess   +21 more
semanticscholar   +1 more source

A Survey on Evaluation of Large Language Models [PDF]

open access: yesACM Transactions on Intelligent Systems and Technology, 2023
Large language models (LLMs) are gaining increasing popularity in both academia and industry, owing to their unprecedented performance in various applications. As LLMs continue to play a vital role in both research and daily use, their evaluation becomes
Yu-Chu Chang   +15 more
semanticscholar   +1 more source

Lost in the Middle: How Language Models Use Long Contexts [PDF]

open access: yesTransactions of the Association for Computational Linguistics, 2023
While recent language models have the ability to take long contexts as input, relatively little is known about how well they use longer context. We analyze the performance of language models on two tasks that require identifying relevant information in ...
Nelson F. Liu   +6 more
semanticscholar   +1 more source

Home - About - Disclaimer - Privacy