On the role of the corpus callosum in interhemispheric functional connectivity in humans [PDF]
Resting state functional connectivity is defined in terms of temporal correlations between physiologic signals, most commonly studied using functional magnetic resonance imaging. Major features of functional connectivity correspond to structural (axonal)
Jarod L Roland +2 more
exaly +3 more sources
How Close is ChatGPT to Human Experts? Comparison Corpus, Evaluation, and Detection [PDF]
The introduction of ChatGPT has garnered widespread attention in both academic and industrial communities. ChatGPT is able to respond effectively to a wide range of human questions, providing fluent and comprehensive answers that significantly surpass ...
Biyang Guo +7 more
semanticscholar +1 more source
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset [PDF]
As language models grow ever larger, the need for large-scale high-quality text datasets has never been more pressing, especially in multilingual settings.
Hugo Laurenccon +53 more
semanticscholar +1 more source
VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation [PDF]
We introduce VoxPopuli, a large-scale multilingual corpus providing 400K hours of unlabeled speech data in 23 languages. It is the largest open data to date for unsupervised representation learning as well as semi-supervised learning.
Changhan Wang +8 more
semanticscholar +1 more source
Genetic diversity across the mitochondrial genome of eastern oysters (Crassostrea virginica) in the northern Gulf of Mexico [PDF]
The eastern oyster, Crassostrea virginica, is divided into four populations along the western North Atlantic, however, the only published mitochondrial genome sequence was assembled using one individual in Delaware.
Chani R. Rue +11 more
doaj +2 more sources
Documenting Large Webtext Corpora: A Case Study on the Colossal Clean Crawled Corpus [PDF]
Large language models have led to remarkable progress on many NLP tasks, and researchers are turning to ever-larger text corpora to train them. Some of the largest corpora available are made by scraping significant portions of the internet, and are ...
Jesse Dodge +6 more
semanticscholar +1 more source
A Neural Corpus Indexer for Document Retrieval [PDF]
Current state-of-the-art document retrieval solutions mainly follow an index-retrieve paradigm, where the index is hard to be directly optimized for the final retrieval target. In this paper, we aim to show that an end-to-end deep neural network unifying
Yujing Wang +15 more
semanticscholar +1 more source
A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference [PDF]
This paper introduces the Multi-Genre Natural Language Inference (MultiNLI) corpus, a dataset designed for use in the development and evaluation of machine learning models for sentence understanding.
Adina Williams +2 more
semanticscholar +1 more source
Automatic Depression Detection: an Emotional Audio-Textual Corpus and A Gru/Bilstm-Based Model [PDF]
Depression is a global mental health problem, the worst case of which can lead to suicide. An automatic depression detection system provides great help in facilitating depression self-assessment and improving diagnostic accuracy. In this work, we propose
Yingli Shen, Huiyu Yang, Lin Lin
semanticscholar +1 more source
Spatiotemporal patterns in seagrass-epiphyte dynamics for Thalassia testudinum in the northwestern Gulf of Mexico were evaluated through biomass measurements and scanned-image based metrics to investigate the potentially harmful impact of excessive ...
Chi Huang +3 more
doaj +1 more source

