BERTCWS: unsupervised multi-granular Chinese word segmentation based on a BERT method for the geoscience domain

open access: yes, Annals of GIS, 2023
Unlike alphabet-based languages such as English, the Chinese language has no explicit word boundaries. Segmentation, particularly for the Chinese language, is a fundamental step towards Chinese text processing, information retrieval, and knowledge ...
Qinjun Qiu, Zhong Xie, Kai Ma, Miao Tian
doaj   +3 more sources

Capsules Based Chinese Word Segmentation for Ancient Chinese Medical Books

open access: yes, IEEE Access, 2018
Neural network models are widely used for the Chinese word segmentation task. The recently proposed capsule architecture addresses some defects of convolutional neural networks. In this paper, we first introduce the capsule architecture to Chinese ...
Si Li   +5 more
doaj   +3 more sources

Adaptive Chinese word segmentation

open access: yes, Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics - ACL '04, 2004
This paper presents a Chinese word segmentation system which can adapt to different domains and standards. We first present a statistical framework where domain-specific words are identified in a unified approach to word segmentation based on linear models. We explore several features and describe how to create training data by sampling.
Jianfeng Gao   +6 more
openaire   +1 more source

Synthetic Word Parsing Improves Chinese Word Segmentation

open access: yes, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers), 2015
We present a novel solution to improve the performance of Chinese word segmentation (CWS) using a synthetic word parser. The parser analyses the internal structure of words, and attempts to convert out-of-vocabulary words (OOVs) into in-vocabulary fine-grained sub-words.
Fei Cheng, Kevin Duh, Yuji Matsumoto
openaire   +1 more source

Segmenting Chinese Texts into Words for Semantic Network Analysis

open access: yes, Journal of Contemporary Eastern Asia, 2017
Unlike most languages, written Chinese has no spaces between words. Word segmentation must be performed before semantic network analysis can be conducted.
James A. Danowski
doaj   +1 more source

An Algorithm Rapidly Segmenting Chinese Sentences into Individual Words

open access: yes, MATEC Web of Conferences, 2019
This paper proposes an improved Trie tree structure. The tree node records the position information of the characters participating in the word formation, and the child node uses the hash search mechanism.
Xiong Zhibin
doaj   +1 more source
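
The snippet above mentions a Trie whose child nodes are found by hash search. As a rough illustration only, and not the paper's improved structure (the per-character position information it describes is not modelled here), the following minimal sketch stores each node's children in a Python dict and uses the trie for forward maximum matching. The names `TrieNode`, `Trie`, `longest_match`, and `segment` are hypothetical and chosen for this example.

```python
class TrieNode:
    """Trie node; children are kept in a dict, so child lookup is a hash search."""

    def __init__(self):
        self.children = {}    # character -> TrieNode
        self.is_word = False  # True if the path from the root spells a dictionary word


class Trie:
    def __init__(self, words=()):
        self.root = TrieNode()
        for word in words:
            self.insert(word)

    def insert(self, word):
        node = self.root
        for ch in word:
            node = node.children.setdefault(ch, TrieNode())
        node.is_word = True

    def longest_match(self, text, start):
        """Return the end index of the longest dictionary word beginning at
        `start`, or `start + 1` if no dictionary word begins there."""
        node = self.root
        end = start + 1  # fall back to a single character
        for i in range(start, len(text)):
            node = node.children.get(text[i])
            if node is None:
                break
            if node.is_word:
                end = i + 1
        return end


def segment(text, trie):
    """Greedy forward maximum matching over the trie."""
    words = []
    i = 0
    while i < len(text):
        j = trie.longest_match(text, i)
        words.append(text[i:j])
        i = j
    return words
```

With a toy dictionary of 研究, 研究生, 生命, and 起源, `segment("研究生命起源", Trie([...]))` yields 研究生 / 命 / 起源. The greedy longest match picks 研究生 first and leaves 命 as a single character, which shows why purely dictionary-based matching can mis-segment ambiguous strings.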

A Graph-based Model for Joint Chinese Word Segmentation and Dependency Parsing

open access: yes, Transactions of the Association for Computational Linguistics, 2020
Chinese word segmentation and dependency parsing are two fundamental tasks for Chinese natural language processing. Dependency parsing is defined at the word level.
Hang Yan, Xipeng Qiu, Xuanjing Huang
doaj   +1 more source

Research on performance variations of classifiers with the influence of pre-processing methods for Chinese short text classification.

open access: yes, PLoS ONE, 2023
Text pre-processing is an important component of Chinese text classification. At present, however, most of the studies on this topic focus on exploring the influence of pre-processing methods on a few text classification algorithms using English text ...
Dezheng Zhang   +3 more
doaj   +1 more source

The Extended Simple View of Reading in Adult Learners of Chinese as a Second Language

open access: yes, Frontiers in Psychology, 2022
The Simple View of Reading (SVR) holds that reading comprehension is the product of decoding and listening comprehension, and this conclusion has been supported by studies on school-aged native and nonnative speakers.
Meiling Hao   +5 more
doaj   +1 more source

Chinese Word Segmentation for Terrorism-Related Contents

open access: yes, 2008
To analyze security- and terrorism-related content in Chinese, it is important to perform word segmentation on Chinese documents. There are many previous studies on Chinese word segmentation. The two major approaches are statistics-based and dictionary-based.
F. Wang, M. C. L. Chau, D. Wei, D. Zeng
openaire   +3 more sources