Semi-Supervised Noisy Label Learning for Chinese Clinical Named Entity Recognition postprint

Author: Zhucong, Li ^1,2 Zhen, Gan ^1,3 Baoli, Zhang ¹ Yubo, Chen ^1,2 Jing, Wan ³ Kang, Liu ^1,2 Jun, Zhao ^1,2 Shengping, Liu ⁴
Institute:

1. National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China

2. School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing 100049, China

3. Beijing University of Chemical Technology, Beijing 100029, China

4. UNISOUND AI Technology Co., Ltd., Beijing 100096, China
Correspondent： Yubo, Chen Email:yubo.chen@nlpr.ia.ac.cn Jun, Zhao Email:jzhao@nlpr.ia.ac.cn
Submit Time:2022-11-27 19:09:59

Abstract: This paper describes our approach for the Chinese clinical named entity recognition (CNER) task organized by the 2020 China Conference on Knowledge Graph and Semantic Computing (CCKS) competition. In this task, we need to identify the entity boundary and category labels of six entities from Chinese electronic medical record (EMR). We constructed a hybrid system composed of a semi-supervised noisy label learning model based on adversarial training and a rule post-processing module. The core idea of the hybrid system is to reduce the impact of data noise by optimizing the model results. Besides, we used post-processing rules to correct three cases of redundant labeling, missing labeling, and wrong labeling in the model prediction results. Our method proposed in this paper achieved strict criteria of 0.9156 and relax criteria of 0.9660 on the final test set, ranking first.

Named entity recognition Electronic medical record Noisy label learning Semi-supervised Adversarial training

Subject: Computer Science >> Integration Theory of Computer Science
Cite as: ChinaXiv:202211.00389 (or this version ChinaXiv:202211.00389V1)
DOI:10.1162/dint_a_00099
CSTR:32003.36.ChinaXiv.202211.00389.V1
TXID： 44367c1c-3d16-49a2-ba3a-0335e5b0743e
Recommended references： Zhucong, Li,Zhen, Gan,Baoli, Zhang,Yubo, Chen,Jing, Wan,Kang, Liu,Jun, Zhao,Shengping, Liu.Semi-Supervised Noisy Label Learning for Chinese Clinical Named Entity Recognition.中国科学院科技论文预发布平台.[DOI:10.1162/dint_a_00099] (Click&Copy)

Version History

[V1]

2022-11-27 19:09:59

ChinaXiv:202211.00389V1

Download

Related Paper

1. MDPO: Multi-Granularity Direct Preference Optimization for Mathematical Reasoning	2025-06-10
2. Semantic structures within natural language and their cognitive functions	2025-06-03
3. What surface characteristics truly affect thermal contact resistance -- An interpretability study based on deep learning and convolutional neural networks	2025-04-11
4. The Thermal Contact Resistance Dataset and the Artificial Intelligence-Driven Prediction of Thermal Contact Resistance in Multi-material Systems	2025-04-11
5. Individual-to-Individual EEG Conversion Using Swin Transformer	2025-03-01
6. Level-Navi Agent: A Framework and benchmark for Chinese Web Search Agents	2024-12-25
7. FairSort: Learning to Fair Rank for PersonalizedRecommendations in Two-Sided Platforms	2024-12-03
8. Animating the Past: Reconstruct Trilobite via Video Generation	2024-11-12
9. DiFSD: Ego-Centric Fully Sparse Paradigm with Uncertainty Denoising and Iterative Refinement for Efficient End-to-End Self-Driving	2024-09-14
10. SteganoDDPM: A high-quality image steganography self-learning method using diffusion model	2024-04-23


Public comments Anonymous comments Send only to author