Results 21 to 30 of about 1,814 (232)
LM-Combiner: A Contextual Rewriting Model for Chinese Grammatical Error Correction
Accepted to COLING ...
Yixuan Wang +4 more
openaire +4 more sources
FlaCGEC: A Chinese Grammatical Error Correction Dataset with Fine-grained Linguistic Annotation
Chinese Grammatical Error Correction (CGEC) has been attracting growing attention from researchers recently. In spite of the fact that multiple CGEC datasets have been developed to support the research, these datasets lack the ability to provide a deep linguistic topology of grammar errors, which is critical for interpreting and diagnosing CGEC ...
Hanyue Du +6 more
openaire +4 more sources
Large language models (LLMs) have demonstrated exceptional error detection capabilities and can correct sentences with high fluency in grammatical error correction (GEC) tasks. However, when correcting Chinese academic papers, LLMs face significant challenges of over-correction. To delve deeper into this issue, we explore the underlying reasons. On one
Zixiao Kong +5 more
openaire +2 more sources
Chinese grammatical error correction based on knowledge distillation
In view of the poor robustness of existing Chinese grammatical error correction models on attack test sets and large model parameters, this paper uses the method of knowledge distillation to compress model parameters and improve the anti-attack ability of the model.
Peng Xia 0005 +4 more
openaire +2 more sources
FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction
Long paper, accepted at the Findings of EMNLP ...
Lvxiaowei Xu +4 more
openaire +2 more sources
Computational Grammars can be adapted to detect ungrammatical sentences, effectively transforming them into error detection (or correction) systems. In this paper we provide a theoretical account of how to adapt implemented HPSG grammars for grammatical ...
Morgado da Costa, Luis, Bond, Francis
core +1 more source
Whole word masking (WWM), which masks all subwords corresponding to a word at once, makes a better English BERT model. For the Chinese language, however, there is no subword because each token is an atomic character. The meaning of a word in Chinese is different in that a word is a compositional unit consisting of multiple characters.
Yong Dai 0001 +7 more
openaire +2 more sources
A new evaluation method: evaluation data and metrics for Chinese grammatical error correction
Abstract As a fundamental task in natural language processing (NLP), Chinese Grammatical Error Correction (CGEC) [1β3] has gradually received widespread attention and become a research hotspot. However, one obvious deficiency of the existing CGEC evaluation systems is that the evaluation values of the same error correction models are signif ...
Nankai Lin +4 more
openaire +1 more source
MaskGEC: Improving Neural Grammatical Error Correction via Dynamic Masking
Grammatical error correction (GEC) is a promising natural language processing (NLP) application, whose goal is to change the sentences with grammatical errors into the correct ones.
Wang, Houfeng, Zhao, Zewei
core +1 more source
Linguistic Rules-Based Corpus Generation for Native Chinese Grammatical Error Correction
Long paper, accepted at the Findings of EMNLP ...
Shirong Ma +11 more
openaire +2 more sources

