Results 1 to 10 of about 263,653 (195)
Attention-Based Scene Text Detection on Dual Feature Fusion
The segmentation-based scene text detection algorithm has advantages in scene text detection scenarios with arbitrary shape and extreme aspect ratio, depending on its pixel-level description and fine post-processing.
Yuze Li+3 more
doaj +1 more source
Semi-Supervised Learning for Robust Emotional Speech Synthesis with Limited Data
Emotional speech synthesis is an important branch of human–computer interaction technology that aims to generate emotionally expressive and comprehensible speech based on the input text.
Jialin Zhang+3 more
doaj +1 more source
In sentiment analysis, biased user reviews can have a detrimental impact on a company’s evaluation. Therefore, identifying such users can be highly beneficial as their reviews are not based on reality but on their characteristics rooted in their ...
Shangwu Hou+2 more
doaj +1 more source
Globally Guided Confidence Enhancement Network for Image-Text Matching
Image-text matching is a crucial aspect of multi-modal intelligence. The main challenge in this area is accurately measuring the relevance between the image and text, using evidence obtained through matching.
Xin Dai+2 more
doaj +1 more source
A Robust Method: Arbitrary Shape Text Detection Combining Semantic and Position Information
There is a growing interest in scene text detection for arbitrary shapes. The effectiveness of text detection has also evolved from horizontal text detection to the ability to perform text detection in multiple directions and arbitrary shapes.
Zhenchao Wang+3 more
doaj +1 more source
VisdaNet: Visual Distillation and Attention Network for Multimodal Sentiment Classification
Sentiment classification is a key task in exploring people’s opinions; improved sentiment classification can help individuals make better decisions. Social media users are increasingly using both images and text to express their opinions and share their ...
Shangwu Hou+2 more
doaj +1 more source
DMS-YOLOv5: A Decoupled Multi-Scale YOLOv5 Method for Small Object Detection
Small objects detection is a challenging task in computer vision due to the limited semantic information that can be extracted and the susceptibility to background interference.
Tianyu Gao+2 more
doaj +1 more source
How Multilingual is Multilingual BERT? [PDF]
In this paper, we show that Multilingual BERT (M-BERT), released by Devlin et al. (2018) as a single language model pre-trained from monolingual corpora in 104 languages, is surprisingly good at zero-shot cross-lingual model transfer, in which task-specific annotations in one language are used to fine-tune the model for evaluation in another language ...
Eva Schlinger, Telmo Pires, Dan Garrette
openaire +3 more sources
Recently, the performance of end-to-end speech recognition has been further improved based on the proposed Conformer framework, which has also been widely used in the field of speech recognition.
Ting Guo, Nurmemet Yolwas, Wushour Slamu
doaj +1 more source
A Method Improves Speech Recognition with Contrastive Learning in Low-Resource Languages
Building an effective automatic speech recognition system typically requires a large amount of high-quality labeled data; However, this can be challenging for low-resource languages.
Lixu Sun, Nurmemet Yolwas, Lina Jiang
doaj +1 more source