Published in Vol 21, No 1 (2019): January

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/10013.
Application of Efficient Data Cleaning Using Text Clustering for Semistructured Medical Reports to Large-Scale Stool Examination Reports: Methodology Study

Hyunki Woo 1*, BS; Kyunga Kim 1,2*, PhD; KyeongMin Cha 1, MS; Jin-Young Lee 3, MD, PhD; Hansong Mun 3, MD, PhD; Soo Jin Cho 3, MD; Ji In Chung 3, MD, PhD; Jeung Hui Pyo 3, MD; Kun-Chul Lee 4, PhD; Mira Kang 1,3, MD, PhD

1 Department of Digital Health, Samsung Advanced Institute for Health Sciences & Technology, Sungkyunkwan University, Seoul, Republic of Korea

2 Statistics and Data Center, Research Institute for Future Medicine, Samsung Medical Center, Seoul, Republic of Korea

3 Center for Health Promotion, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, Republic of Korea

4 Jason TG, Seoul, Republic of Korea

*These authors contributed equally.

Corresponding Author:

  • Mira Kang, MD, PhD
  • Center for Health Promotion
  • Samsung Medical Center
  • Sungkyunkwan University School of Medicine
  • 81 Irwon-ro, Gangnam-gu
  • Seoul, 06351
  • Republic of Korea
  • Phone: 82 2-3410-3882
  • Fax: 82 2-3410-0054
  • Email: mira90.kang@samsung.com