
Mandarin Recognition Based on Self-Attention Mechanism with Deep Convolutional Neural Network (DCNN)-Gated Recurrent Unit (GRU)

open access: yes, Big Data and Cognitive Computing
Speech recognition technology is an important branch in the field of artificial intelligence, aiming to transform human speech into computer-readable text information.
Xun Chen   +3 more
doaj   +1 more source

THE RECOGNITION OF SPEECH BY MACHINE [PDF]

open access: yes, 1961
"May 1, 1961." "Based on a thesis submitted to the Department of Electrical Engineering, M. I. T. ... 1959, in partial fulfillment of the requirements for the degree of Doctor of Science." "May 1, 1961."
openaire   +2 more sources

Surface dose analysis and dosimetric comparison of Halcyon versus Truebeam in breast cancer radiotherapy: An OSL dosimetry study

open access: yes, Journal of Applied Clinical Medical Physics, EarlyView.
Abstract. Purpose: Breast cancer is a neoplastic disease with high prevalence among women. Radiotherapy is one of the principal treatment modalities for this disease, but it poses significant challenges. This study aimed to compare and evaluate the technical and dosimetric performance of conventional C‐arm linac systems and a new design, Halcyon, in the ...
Mustafa Çağlar   +8 more
wiley   +1 more source

Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions

open access: yes, Iranian Journal of Electrical and Electronic Engineering, 2016
In recent years, automatic recognition of emotional states in speech under noisy conditions has become an important research topic in emotional speech recognition.
M. Bashirpour, M. Geravanchizadeh
doaj  

Speech Coding And Recognition

open access: yes, 2009
This paper investigates the performance of a speech recognizer in an interactive voice response system for various coded speech signals, coded using a vector quantization technique, namely the Multi Switched Split Vector Quantization technique. The process of recognizing the coded output can be used in voice banking applications. The recognition technique
M. Satya Sai Ram   +2 more
openaire   +1 more source

Design and validation of a novel dosimetry phantom for motion management audits

open access: yes, Journal of Applied Clinical Medical Physics, EarlyView.
Abstract. Background: We present a novel phantom design for conducting end‐to‐end dosimetry audits for respiratory motion management of two anatomical treatment sites. The design enables radiochromic film measurements of the dose administered to the target throughout the respiratory cycle (motion‐included) and the dose delivered to the time‐averaged ...
Alex Burton   +5 more
wiley   +1 more source

Modified Mel Filter Bank to Compute MFCC of Subsampled Speech [PDF]

open access: yes, arXiv, 2014
Mel Frequency Cepstral Coefficients (MFCCs) are the most popularly used speech features in most speech and speaker recognition applications. In this work, we propose a modified Mel filter bank to extract MFCCs from subsampled speech. We also propose a stronger metric which effectively captures the correlation between MFCCs of original speech and MFCC ...
arxiv  

A Python package for fast GPU‐based proton pencil beam dose calculation

open access: yes, Journal of Applied Clinical Medical Physics, EarlyView.
Abstract. Purpose: Open‐source GPU‐based Monte Carlo (MC) proton dose calculation algorithms provide high speed and unparalleled accuracy but can be complex to integrate with new applications and remain slower than GPU‐based pencil beam (PB) methods, which sacrifice some physical accuracy for sub‐second plan calculation.
Mahasweta Bhattacharya   +4 more
wiley   +1 more source

Multi-thread Parallel Speech Recognition for Mobile Applications [PDF]

open access: yes, Journal of Electrical and Electronics Engineering, 2014
This paper describes a server-based solution for a multi-threaded, large-vocabulary automatic speech recognition engine, along with practical application examples for Android OS and HTML5.
LOJKA Martin   +3 more
doaj  

Augmenting Polish Automatic Speech Recognition System With Synthetic Data [PDF]

open access: yes, arXiv
This paper presents a system developed for submission to Poleval 2024, Task 3: Polish Automatic Speech Recognition Challenge. We describe a Voicebox-based speech synthesis pipeline and use it to augment Conformer and Whisper speech recognition models with synthetic data.
arxiv  
