Application of Efficient Data Cleaning Using Text Clustering for Semistructured Medical Reports to Large-Scale Stool Examination Reports: Methodology Study
Application of Efficient Data Cleaning Using Text Clustering for Semistructured Medical Reports to Large-Scale Stool Examination Reports: Methodology Study
Hyunki Woo
1
* , BS ;
Kyunga Kim
1, 2
* , PhD ;
KyeongMin Cha
1
, MS ;
Jin-Young Lee
3
, MD, PhD ;
Hansong Mun
3
, MD, PhD ;
Soo Jin Cho
3
, MD ;
Ji In Chung
3
, MD, PhD ;
Jeung Hui Pyo
3
, MD ;
Kun-Chul Lee
4
, PhD ;
Mira Kang
1, 3
, MD, PhD
1
Department of Digital Health, Samsung Advanced Institute for Health Sciences & Technology, Sungkyunkwan University, Seoul, Republic of Korea
2
Statistics and Data Center, Research Institute for Future Medicine, Samsung Medical Center, Seoul, Republic of Korea
3
Center for Health Promotion, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, Republic of Korea
4
Jason TG, Seoul, Republic of Korea
*these authors contributed equally
Corresponding Author:
-
Mira Kang, MD, PhD
-
Center for Health Promotion
-
Samsung Medical Center
-
Sungkyunkwan University School of Medicine
-
81 Irwon-ro, Gangnam-gu
-
Seoul, 06351
-
Republic of Korea
-
Phone:
82 2-3410-3882
-
Fax: 82 2-3410-0054
-
Email: mira90.kang@samsung.com