Chinese word segmentation - Open Access .click

Results 41 to 50 of about 62,130 (288)

Chinese word segmentation as LMR tagging [PDF]

Proceedings of the second SIGHAN workshop on Chinese language processing -, 2003
In this paper we present Chinese word segmentation algorithms based on the so-called LMR tagging. Our LMR taggers are implemented with the Maximum Entropy Markov Model and we then use Transformation-Based Learning to combine the results of the two LMR taggers that scan the input in opposite directions. Our system achieves F-scores of 95.9% and 91.6% on
Nianwen Xue, Libin Shen
openaire +1 more source

Coupling Distant Annotation and Adversarial Training for Cross-Domain Chinese Word Segmentation [PDF]

Annual Meeting of the Association for Computational Linguistics, 2020
Fully supervised neural approaches have achieved significant progress in the task of Chinese word segmentation (CWS). Nevertheless, the performance of supervised models always drops gravely if the domain shifts due to the distribution gap across domains ...
Ning Ding +6 more
semanticscholar +1 more source

Multi-Grained Chinese Word Segmentation [PDF]

Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017
Traditionally, word segmentation (WS) adopts the single-grained formalism, where a sentence corresponds to a single word sequence. However, Sproat et al. (1997) show that the inter-native-speaker consistency ratio over Chinese word boundaries is only 76%, indicating single-grained WS (SWS) imposes unnecessary challenges on both manual annotation and ...
Chen Gong 0004 +3 more
openaire +1 more source

A benchmark dataset and case study for Chinese medical question intent classification

BMC Medical Informatics and Decision Making, 2020
Background To provide satisfying answers, medical QA system has to understand the intentions of the users’ questions precisely. For medical intent classification, it requires high-quality datasets to train a deep-learning approach in a supervised way ...
Nan Chen +4 more
doaj +1 more source

The role of format familiarity and word frequency in Chinese reading

Journal of Eye Movement Research, 2023
For Chinese readers, reading from left to right is the norm, while reading from right to left is unfamiliar. This study comprises two experiments investigating how format familiarity and word frequency affect reading by Chinese people.
Mingjing Chen, Jiamei Lu
doaj +1 more source

Synthetic Word Parsing Improves Chinese Word Segmentation [PDF]

Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers), 2015
We present a novel solution to improve the performance of Chinese word segmentation (CWS) using a synthetic word parser. The parser analyses the internal structure of words, and attempts to convert out-of-vocabulary words (OOVs) into in-vocabulary fine-grained sub-words.
Fei Cheng 0002, Kevin Duh, Yuji Matsumoto 0001 +2 more
openaire +1 more source

The Trade-Off Between Format Familiarity and Word-Segmentation Facilitation in Chinese Reading

Frontiers in Psychology, 2021
In alphabetic writing systems (such as English), the spaces between words mark the word boundaries, and the basic unit of reading is distinguished during visual-level processing.
Mingjing Chen +4 more
doaj +1 more source

Analysing the Methods of Dzongkha Word Segmentation

Applied Computer Systems, 2017
In both Chinese and Dzongkha languages, the greatest challenge is to identify the word boundaries because there are no word delimiters as it is in English and other Western languages.
Dhungyel Parshu Ram, Grundspeņķis Jānis +1 more
doaj +1 more source

Word-Context Character Embeddings for Chinese Word Segmentation [PDF]

Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017
Neural parsers have benefited from automatically labeled data via dependency-context word embeddings. We investigate training character embeddings on a word-based context in a similar way, showing that the simple method improves state-of-the-art neural word segmentation models significantly, beating tri-training baselines for leveraging auto-segmented ...
Hao Zhou 0012 +5 more
openaire +1 more source

Introduction to CKIP Chinese word segmentation system for the first international Chinese Word Segmentation Bakeoff [PDF]

Proceedings of the second SIGHAN workshop on Chinese language processing -, 2003
In this paper, we roughly described the procedures of our segmentation system, including the methods for resolving segmentation ambiguities and identifying unknown words. The CKIP group of Academia Sinica participated in testing on open and closed tracks of Beijing University (PK) and Hong Kong Cityu (HK).
Wei-Yun Ma, Keh-Jiann Chen
openaire +1 more source

natural language processing
word segmentation
fos: computer and information sciences

computation and language cs.cl
computer science - computation and language
medicine

4. education
eye movements
chinese reading