Results 51 to 60 of about 1,383,551 (338)

Speech and crosstalk detection in multichannel audio [PDF]

open access: yes, 2005
The analysis of scenarios in which a number of microphones record the activity of speakers, such as in a round-table meeting, presents a number of computational challenges.
Brown, G.J.   +3 more
core   +3 more sources

Contextualized Translation of Automatically Segmented Speech [PDF]

open access: yesInterspeech 2020, 2020
Direct speech-to-text translation (ST) models are usually trained on corpora segmented at sentence level, but at inference time they are commonly fed with audio split by a voice activity detector (VAD). Since VAD segmentation is not syntax-informed, the resulting segments do not necessarily correspond to well-formed sentences uttered by the speaker but,
Gaido M.   +4 more
openaire   +3 more sources

Automated Morphological Segmentation and Evaluation [PDF]

open access: yes, 2004
In this paper we introduce (i) a new method for morphological segmentation of part of speech labelled German words and (ii) some measures related to the MDL principle for evaluation of morphological segmentations. The segmentation algorithm is capable to
Reichel, Uwe D., Weilhammer, Karl
core   +4 more sources

Segmentation of Rhythmic Units in Word Speech by Japanese Infants and Toddlers

open access: yesFrontiers in Psychology, 2021
When infants and toddlers are confronted with sequences of sounds, they are required to segment the sounds into meaningful units to achieve sufficient understanding. Rhythm has been regarded as a crucial cue for segmentation of speech sounds.
Yeonju Cheong   +2 more
doaj   +1 more source

InfoLink: analysis of Dutch broadcast news and cross-media browsing [PDF]

open access: yes, 2005
In this paper, a cross-media browsing demonstrator named InfoLink is described. InfoLink automatically links the content of Dutch broadcast news videos to related information sources in parallel collections containing text and/or video.
Hessen, Arjan van   +3 more
core   +2 more sources

A Prototype System for Selective Dissemination of Broadcast News in European Portuguese

open access: yesEURASIP Journal on Advances in Signal Processing, 2007
This paper describes ongoing work on selective dissemination of broadcast news. Our pipeline system includes several modules: audio preprocessing, speech recognition, and topic segmentation and indexation.
J. Neto   +4 more
doaj   +2 more sources

Segmentation cues in conversational speech: Robust semantics and fragile phonotactics

open access: yesFrontiers in Psychology, 2012
Multiple cues influence listeners’ segmentation of connected speech into words, but most previous studies have used stimuli elicited in careful readings rather than natural conversation. Discerning word boundaries in conversational speech may differ from
Laurence eWhite   +2 more
doaj   +1 more source

Text Preprocessing for Speech Synthesis [PDF]

open access: yes, 2006
In this paper we describe our text preprocessing modules for English text-to-speech synthesis. These modules comprise rule-based text normalization subsuming sentence segmentation and normalization of non-standard words, statistical part-of-speech ...
Pfitzinger, Hartmut R., Reichel, Uwe D.
core   +3 more sources

Changes in Body Composition in Children and Young People Undergoing Treatment for Acute Lymphoblastic Leukemia: A Systematic Review and Meta‐Analysis

open access: yesPediatric Blood &Cancer, EarlyView.
ABSTRACT Ongoing evidence indicates increased risk of sarcopenic obesity among children and young people (CYP) with acute lymphoblastic leukemia (ALL), often beginning early in treatment, persisting into survivorship. This review evaluates current literature on body composition in CYP with ALL during and after treatment.
Lina A. Zahed   +5 more
wiley   +1 more source

Wavelet transforms for non-uniform speech recognition [PDF]

open access: yes, 1996
An algorithm for nonuniform speech segmentation and its application in speech recognition systems is presented. A method based on the Modulated Gaussian Wavelet Transform based Speech Analyser (MGWTSA) and the subsequent parametrization block is used to ...
Javier, L   +3 more
core   +1 more source

Home - About - Disclaimer - Privacy