Big Code != Big Vocabulary: Open-Vocabulary Models for Source Code [PDF]
International Conference on Software Engineering, 2020Statistical language modeling techniques have successfully been applied to large source code corpora, yielding a variety of new software development tools, such as tools for code suggestion, improving readability, and API migration. A major issue with these techniques is that code introduces new vocabulary at a far higher rate than natural language, as
Rafael-Michael Karampatsis+4 more
arxiv +3 more sources
Git4Voc: Git-based Versioning for Collaborative Vocabulary Development [PDF]
arXiv, 2016Collaborative vocabulary development in the context of data integration is the process of finding consensus between the experts of the different systems and domains. The complexity of this process is increased with the number of involved people, the variety of the systems to be integrated and the dynamics of their domain. In this paper we advocate that
Auer, Sören+3 more
arxiv +3 more sources
Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [PDF]
Computer Vision and Pattern Recognition, 2023We present ODISE: Open-vocabulary DIffusion-based panoptic SEgmentation, which unifies pre-trained text-image diffusion and discriminative models to perform open-vocabulary panoptic segmentation. Text-to-image diffusion models have the remarkable ability
Jiarui Xu+5 more
semanticscholar +1 more source
Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP [PDF]
Computer Vision and Pattern Recognition, 2022Open-vocabulary semantic segmentation aims to segment an image into semantic regions according to text descriptions, which may not have been seen during training. Recent two-stage methods first generate class-agnostic mask proposals and then leverage pre-
Feng Liang+8 more
semanticscholar +1 more source
OpenMask3D: Open-Vocabulary 3D Instance Segmentation [PDF]
Neural Information Processing Systems, 2023We introduce the task of open-vocabulary 3D instance segmentation. Current approaches for 3D instance segmentation can typically only recognize object categories from a pre-defined closed set of classes that are annotated in the training datasets.
Ayca Takmaz+5 more
semanticscholar +1 more source
Simple Open-Vocabulary Object Detection with Vision Transformers [PDF]
arXiv.org, 2022Combining simple architectures with large-scale pre-training has led to massive improvements in image classification. For object detection, pre-training and scaling approaches are less well established, especially in the long-tailed and open-vocabulary ...
Matthias Minderer+13 more
semanticscholar +1 more source
Transformer Feed-Forward Layers Build Predictions by Promoting Concepts in the Vocabulary Space [PDF]
Conference on Empirical Methods in Natural Language Processing, 2022Transformer-based language models (LMs) are at the core of modern NLP, but their internal prediction construction process is opaque and largely not understood. In this work, we make a substantial step towards unveiling this underlying prediction process,
Mor Geva+3 more
semanticscholar +1 more source
Identifying Indonesian-core vocabulary for teaching English to Indonesian preschool children: a corpus-based research [PDF]
K@ta: A Biannual Publication on the Study of Language and Literature, 2011This corpus-based research focuses on building a corpus of Indonesian children’s storybooks to find the frequent content words in order to identify Indonesian-core vocabulary for teaching English to Indonesian preschool children.
Maryani .
doaj +1 more source
This paper proposes a method of controlling the vortex-induced vibration (VIV) of wind turbine towers by adding continuous trapezoidal straight spoiler plates (TS) onto their outer surface: a fluid–solid coupling model was constructed to simulate the ...
Zheng Li+3 more
doaj +1 more source
Assessment of Speech Development in Senior Preschool Age: The Battery of Neuropsychological Tests and Norms [PDF]
Клиническая и специальная психология, 2021The main goal of the study was to implement subtests of the main neuropsychological test to the development of speech in samples of children 5–7 years old with normative development, and also to collect average indicators for this age in phonemic ...
Veraksa A.N.+3 more
doaj +1 more source