Speech recognition technology is an important branch in the field of artificial intelligence, aiming to transform human speech into computer-readable text information.
Xun Chen+3 more
doaj +1 more source
THE RECOGNITION OF SPEECH BY MACHINE [PDF]
"May 1, 1961." "Based on a thesis submitted to the Department of Electrical Engineering, M. I. T. ... 1959, in partial fulfillment of the requirements for the degree of Doctor of Science."
openaire +2 more sources
Abstract Purpose Breast cancer is a neoplastic disease with high prevalence among women. Radiotherapy is one of the principal treatment modalities for this disease, but it poses significant challenges. This study aimed to compare and evaluate the technical and dosimetric performance of conventional C‐arm linac systems and a new design, Halcyon, in the ...
Mustafa Çağlar+8 more
wiley +1 more source
Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions
In recent years, automatic recognition of emotional states in speech under noisy conditions has become an important research topic in emotional speech recognition.
M. Bashirpour, M. Geravanchizadeh
doaj
This paper investigates the performance of a speech recognizer in an interactive voice response system for speech signals coded with a vector quantization technique, namely Multi Switched Split Vector Quantization. The process of recognizing the coded output can be used in voice banking applications. The recognition technique ...
M. Satya Sai Ram+2 more
openaire +1 more source
Design and validation of a novel dosimetry phantom for motion management audits
Abstract Background We present a novel phantom design for conducting end‐to‐end dosimetry audits for respiratory motion management of two anatomical treatment sites. The design enables radiochromic film measurements of the dose administered to the target throughout the respiratory cycle (motion‐included) and the dose delivered to the time‐averaged ...
Alex Burton+5 more
wiley +1 more source
Modified Mel Filter Bank to Compute MFCC of Subsampled Speech [PDF]
Mel Frequency Cepstral Coefficients (MFCCs) are the most widely used features in most speech and speaker recognition applications. In this work, we propose a modified Mel filter bank to extract MFCCs from subsampled speech. We also propose a stronger metric that effectively captures the correlation between MFCCs of original speech and MFCC ...
arxiv
A Python package for fast GPU‐based proton pencil beam dose calculation
Abstract Purpose Open‐source GPU‐based Monte Carlo (MC) proton dose calculation algorithms provide high speed and unparalleled accuracy but can be complex to integrate with new applications and remain slower than GPU‐based pencil beam (PB) methods, which sacrifice some physical accuracy for sub‐second plan calculation.
Mahasweta Bhattacharya+4 more
wiley +1 more source
Multi-thread Parallel Speech Recognition for Mobile Applications [PDF]
This paper describes a server-based, multi-threaded, large-vocabulary automatic speech recognition engine, along with practical application examples for Android OS and HTML5.
Martin Lojka+3 more
doaj
Augmenting Polish Automatic Speech Recognition System With Synthetic Data [PDF]
This paper presents a system developed for submission to Poleval 2024, Task 3: Polish Automatic Speech Recognition Challenge. We describe a Voicebox-based speech synthesis pipeline and use it to augment Conformer and Whisper speech recognition models with synthetic data.
arxiv