Results 31 to 40 of about 55,011 (275)

Chinese Word Segmentation for Agriculture

open access: yesJournal of Software, 2013
Based on the Hash mechanism, a new algorithm is presented, the algorithm can realize search、update、deletion and addition operations for dictionary.  According to the characteristics of Chinese characters GB code, by preserving the GB code of first word in entry, this method effectively improves the utilization rate of the storage space.
Kui Fang   +3 more
openaire   +1 more source

Chinese word segmentation as LMR tagging [PDF]

open access: yesProceedings of the second SIGHAN workshop on Chinese language processing -, 2003
In this paper we present Chinese word segmentation algorithms based on the so-called LMR tagging. Our LMR taggers are implemented with the Maximum Entropy Markov Model and we then use Transformation-Based Learning to combine the results of the two LMR taggers that scan the input in opposite directions. Our system achieves F-scores of 95.9% and 91.6% on
Nianwen Xue, Libin Shen
openaire   +1 more source

A benchmark dataset and case study for Chinese medical question intent classification

open access: yesBMC Medical Informatics and Decision Making, 2020
Background To provide satisfying answers, medical QA system has to understand the intentions of the users’ questions precisely. For medical intent classification, it requires high-quality datasets to train a deep-learning approach in a supervised way ...
Nan Chen   +4 more
doaj   +1 more source

The role of format familiarity and word frequency in Chinese reading

open access: yesJournal of Eye Movement Research, 2023
For Chinese readers, reading from left to right is the norm, while reading from right to left is unfamiliar. This study comprises two experiments investigating how format familiarity and word frequency affect reading by Chinese people.
Mingjing Chen, Jiamei Lu
doaj   +1 more source

Orthographic input and phonological representations in learners of Chinese as a foreign language. [PDF]

open access: yes, 2006
This paper provides evidence that the second language orthographic input affects the mental representations of L2 phonology in instructed beginner L2 learners. Previous research has shown that orthographic representations affect monolinguals' performance
Bassetti, Benedetta
core   +1 more source

The Trade-Off Between Format Familiarity and Word-Segmentation Facilitation in Chinese Reading

open access: yesFrontiers in Psychology, 2021
In alphabetic writing systems (such as English), the spaces between words mark the word boundaries, and the basic unit of reading is distinguished during visual-level processing.
Mingjing Chen   +4 more
doaj   +1 more source

Combining classifiers for Chinese word segmentation [PDF]

open access: yesProceeding of the first SIGHAN workshop on Chinese language processing -, 2002
In this paper we report results of a supervised machine-learning approach to Chinese word segmentation. First, a maximum entropy tagger is trained on manually annotated data to automatically labels the characters with tags that indicate the position of character within a word.
Nianwen Xue, Susan P. Converse
openaire   +1 more source

Analysing the Methods of Dzongkha Word Segmentation

open access: yesApplied Computer Systems, 2017
In both Chinese and Dzongkha languages, the greatest challenge is to identify the word boundaries because there are no word delimiters as it is in English and other Western languages.
Dhungyel Parshu Ram   +1 more
doaj   +1 more source

Fast and Accurate Neural Word Segmentation for Chinese

open access: yes, 2017
Neural models with minimal feature engineering have achieved competitive performance against traditional methods for the task of Chinese word segmentation.
Cai, Deng   +5 more
core   +1 more source

Comparing Neural- and N-Gram-Based Language Models for Word Segmentation [PDF]

open access: yes, 2018
Word segmentation is the task of inserting or deleting word boundary characters in order to separate character sequences that correspond to words in some language.
Doval, Yerai, Gómez-Rodríguez, Carlos
core   +2 more sources

Home - About - Disclaimer - Privacy