Results 41 to 50 of about 55,011 (275)
Radical-Enhanced Chinese Character Embedding
We present a method to leverage radical for learning Chinese character embedding. Radical is a semantic and phonetic component of Chinese character. It plays an important role as characters with the same radical usually have similar semantic meaning and ...
Ji, Zhenzhou +5 more
core +1 more source
Adversarial Multi-Criteria Learning for Chinese Word Segmentation
Different linguistic perspectives causes many diverse segmentation criteria for Chinese word segmentation (CWS). Most existing methods focus on improve the performance for each single criterion.
Chen, Xinchi +3 more
core +1 more source
Chinese word segmentation at Peking University [PDF]
Word segmentation is the first step in Chinese information processing, and the performance of the segmenter, therefore, has a direct and great influence on the processing steps that follow. Different segmenters will give different results when handling issues like word boundary.
Duan Huiming +3 more
openaire +1 more source
Neural Chinese Word Segmentation with Lexicon and Unlabeled Data via Posterior Regularization
Existing methods for CWS usually rely on a large number of labeled sentences to train word segmentation models, which are expensive and time-consuming to annotate.
Cai Deng +23 more
core +1 more source
Neural Word Segmentation Learning for Chinese [PDF]
Most previous approaches to Chinese word segmentation formalize this problem as a character-based sequence labeling task where only contextual information within fixed sized local windows and simple interactions between adjacent tags can be captured. In this paper, we propose a novel neural framework which thoroughly eliminates context windows and can ...
Cai, Deng, Zhao, Hai
openaire +2 more sources
Chinese NER Using Dynamic Meta-Embeddings
Named entity recognition (NER) is one of the basic techniques in natural language processing tasks. Chinese NER is complicated and difficult which remains a major challenge.
Naixin Zhang +4 more
doaj +1 more source
A Levenshtein distance-based method for word segmentation in corpus augmentation of geoscience texts
For geoscience text, rich domain corpora have become the basis of improving the model performance in word segmentation. However, the lack of domain-specific corpus with annotation labelled has become a major obstacle to professional information mining in
Jinqu Zhang +6 more
doaj +1 more source
Mostly-Unsupervised Statistical Segmentation of Japanese Kanji Sequences
Given the lack of word delimiters in written Japanese, word segmentation is generally considered a crucial first step in processing Japanese texts.
Ando, Rie Kubota, Lee, Lillian
core +2 more sources
ABSTRACT Objective To explore how cerebral hypoxia and Normal‐Appearing White Matter (NAWM) integrity affect MS lesion burden and clinical course. Methods Seventy‐nine MS patients, including 13 clinically isolated syndrome (CIS) patients and 66 relapsing–remitting multiple sclerosis (RRMS) patients, and 44 healthy controls (HCs) were recruited from ...
Xinli Wang +8 more
wiley +1 more source
ABSTRACT Objective High‐resolution MRI enables detailed assessment of intracranial vessel wall pathology in moyamoya vasculopathy. We aimed to classify adult moyamoya vasculopathy etiologies using high‐resolution MRI and to examine subtype‐specific associations between high‐resolution MRI features and ischemic infarction.
Guangsong Han +8 more
wiley +1 more source

