Automatic annotation of auxiliary words usage in rule-based Chinese language

Journal of Computer Applications ›› 2011, Vol. 31 ›› Issue (12): 3271-3274.

• Database technology •


HAN Ying-jie, ZAN Hong-ying, ZHANG Kun-li, CAI Yu-mei

College of Information Engineering, Zhengzhou University, Zhengzhou Henan 450001, China

Received: 2011-06-27 Revised: 2011-08-07 Online: 2011-12-12 Published: 2011-12-01
Contact: HAN Ying-jie

Chinese title (translated): Rule-based automatic recognition of the usage of common auxiliary words in modern Chinese

韩英杰,昝红英,张坤丽,柴玉梅

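Supported by:
National Natural Science Foundation of China; Open Project Fund of the Key Laboratory of Computational Linguistics (Peking University), Ministry of Education; Henan Province Outstanding Youth Fund for Science and Technology Innovation Talents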

Abstract

Abstract: Existing research results on auxiliary words are difficult to apply directly to automatic annotation in natural language processing. Building on an auxiliary word knowledge base, a rule-based method is used to annotate the usage of auxiliary words automatically. A comparison of the test results shows that refining the rules, extending them, and adjusting their matching order can effectively improve precision and recall. The approach also helps improve the quality of Chinese corpora, deepen the level of processing, and reduce manual annotation work.

Key words: auxiliary words, knowledge base, usage, rule, automatic annotation

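Abstract (translated from the Chinese): Existing research results on auxiliary words are difficult to apply directly to machine recognition in natural language processing. Building on a "three-in-one" auxiliary word knowledge base that combines a modern Chinese dictionary, a rule base, and a corpus, a rule-based method is used to automatically recognize the usage of common auxiliary words in modern Chinese. A comparison of the experimental results before and after rule optimization shows that refining, extending, and reordering the usage rules effectively improves the precision and recall of usage recognition, reduces the manual annotation workload, and improves the quality of large-scale corpora.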

Key words (translated from the Chinese): auxiliary word, knowledge base, usage, rule, automatic recognition
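The abstract's observation that the matching order of rules affects precision and recall can be illustrated with a small sketch. The code below is not the authors' system: the rule format, the usage labels (`de_general`, `de_sentence_final`), the regular expressions, and the function name `annotate` are all hypothetical, chosen only to show how a first-match-wins rule list behaves when a general rule is placed before a more specific one.

```python
import re
from typing import List, Optional, Tuple

# A rule pairs a (hypothetical) usage label with a regex applied to a
# space-segmented sentence. This format is an assumption for illustration.
Rule = Tuple[str, re.Pattern]


def annotate(sentence: str, rules: List[Rule]) -> Optional[str]:
    """Return the label of the first rule whose pattern matches, else None."""
    for label, pattern in rules:
        if pattern.search(sentence):
            return label
    return None


# Toy rules for the auxiliary word 的; labels and patterns are illustrative only.
GENERAL = ("de_general", re.compile(r"\S+ 的"))
SENTENCE_FINAL = ("de_sentence_final", re.compile(r"的 ?[。！？]?$"))

general_first = [GENERAL, SENTENCE_FINAL]    # the general rule shadows the specific one
specific_first = [SENTENCE_FINAL, GENERAL]   # reordered: the specific rule is tried first

sentence = "他 会 来 的 。"
print(annotate(sentence, general_first))     # de_general
print(annotate(sentence, specific_first))    # de_sentence_final
print(annotate("我 的 书", specific_first))   # de_general
```

With the general rule first, the sentence-final rule never fires, so every occurrence receives the general label. Placing the narrower rule first lets it claim its cases while the general rule still covers the rest, which is one plausible reading of why reordering could change precision and recall.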

HAN Ying-jie, ZAN Hong-ying, ZHANG Kun-li, CAI Yu-mei. Automatic annotation of auxiliary words usage in rule-based Chinese language[J]. Journal of Computer Applications, 2011, 31(12): 3271-3274.

韩英杰, 昝红英, 张坤丽, 柴玉梅. 基于规则的现代汉语常用助词用法自动识别[J]. 计算机应用, 2011, 31(12): 3271-3274.

