Google Scanned Objects: A High-Quality Dataset of 3D Scanned Household Items [PDF]
Interactive 3D simulations have enabled break-throughs in robotics and computer vision, but simulating the broad diversity of environments needed for deep learning requires large corpora of photo-realistic 3D object models.
Laura Downs +7 more
semanticscholar +1 more source
Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages [PDF]
We introduce the Universal Speech Model (USM), a single large model that performs automatic speech recognition (ASR) across 100+ languages. This is achieved by pre-training the encoder of the model on a large unlabeled multilingual dataset of 12 million (
Yu Zhang +26 more
semanticscholar +1 more source
How Robust is Google's Bard to Adversarial Image Attacks? [PDF]
Multimodal Large Language Models (MLLMs) that integrate text and other modalities (especially vision) have achieved unprecedented performance in various multimodal tasks. However, due to the unsolved adversarial robustness problem of vision models, MLLMs
Yinpeng Dong +8 more
semanticscholar +1 more source
Rigorous evidence identification is essential for systematic reviews and meta‐analyses (evidence syntheses) because the sample selection of relevant studies determines a review's outcome, validity, and explanatory power.
Michael Gusenbauer, Neal R Haddaway
semanticscholar +1 more source
Google’s Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation [PDF]
We propose a simple solution to use a single Neural Machine Translation (NMT) model to translate between multiple languages. Our solution requires no changes to the model architecture from a standard NMT system but instead introduces an artificial token ...
Melvin Johnson +11 more
semanticscholar +1 more source
Google Landmarks Dataset v2 – A Large-Scale Benchmark for Instance-Level Recognition and Retrieval [PDF]
While image retrieval and instance recognition techniques are progressing rapidly, there is a need for challenging datasets to accurately measure their performance -- while posing novel challenges that are relevant for practical applications.
Tobias Weyand +3 more
semanticscholar +1 more source
Google Scholar, Microsoft Academic, Scopus, Dimensions, Web of Science, and OpenCitations’ COCI: a multidisciplinary comparison of coverage via citations [PDF]
New sources of citation data have recently become available, such as Microsoft Academic, Dimensions, and the OpenCitations Index of CrossRef open DOI-to-DOI citations (COCI).
Alberto Mart'in-Mart'in +3 more
semanticscholar +1 more source
A neural encoder for earthquake rate forecasting
Forecasting the timing of earthquakes is a long-standing challenge. Moreover, it is still debated how to formulate this problem in a useful manner, or to compare the predictive power of different models.
Oleg Zlydenko +8 more
doaj +1 more source
DeepNull models non-linear covariate effects to improve phenotypic prediction and association power
GWAS often assume a linear phenotype-covariate relationship which may not hold in practice. Here the authors present DeepNull, in which they apply deep learning to identify and adjust for complex non-linear relationships, improving phenotypic prediction ...
Zachary R. McCaw +7 more
doaj +1 more source
Google Earth Engine: A Global Analysis and Future Trends
The continuous increase in the volume of geospatial data has led to the creation of storage tools and the cloud to process data. Google Earth Engine (GEE) is a cloud-based platform that facilitates geoprocessing, making it a tool of great interest to the
A. Velástegui-Montoya +5 more
semanticscholar +1 more source

