摘要: This paper describes our approach for the Chinese clinical named entity recognition (CNER) task organized
by the 2020 China Conference on Knowledge Graph and Semantic Computing (CCKS) competition. In this
task, we need to identify the entity boundary and category labels of six entities from Chinese electronic
medical record (EMR). We constructed a hybrid system composed of a semi-supervised noisy label learning
model based on adversarial training and a rule post-processing module. The core idea of the hybrid system
is to reduce the impact of data noise by optimizing the model results. Besides, we used post-processing rules
to correct three cases of redundant labeling, missing labeling, and wrong labeling in the model prediction
results. Our method proposed in this paper achieved strict criteria of 0.9156 and relax criteria of 0.9660 on
the final test set, ranking first.
-
分类:
计算机科学
>>
计算机科学的集成理论
-
引用:
ChinaXiv:202211.00389
(或此版本
ChinaXiv:202211.00389V1)
DOI:10.1162/dint_a_00099
CSTR:32003.36.ChinaXiv.202211.00389.V1
-
科创链TXID:
44367c1c-3d16-49a2-ba3a-0335e5b0743e
- 推荐引用方式:
Zhucong, Li,Zhen, Gan,Baoli, Zhang,Yubo, Chen,Jing, Wan,Kang, Liu,Jun, Zhao,Shengping, Liu.Semi-Supervised Noisy Label Learning for Chinese Clinical Named Entity Recognition.中国科学院科技论文预发布平台.[DOI:10.1162/dint_a_00099]
(点此复制)