TY  - JOUR
AU  - Frias, Mario
AU  - Moyano, Jose M
AU  - Rivero-Juarez, Antonio
AU  - Luna, Jose M
AU  - Camacho, Ángela
AU  - Fardoun, Habib M
AU  - Machuca, Isabel
AU  - Al-Twijri, Mohamed
AU  - Rivero, Antonio
AU  - Ventura, Sebastian
PY  - 2021
DA  - 2021/2/24
TI  - Classification Accuracy of Hepatitis C Virus Infection Outcome: Data Mining Approach
JO  - J Med Internet Res
SP  - e18766
VL  - 23
IS  - 2
KW  - HIV/HCV
KW  - data mining
KW  - PART
KW  - ensemble
KW  - classification accuracy
AB  - Background: The dataset from genes used to predict hepatitis C virus outcome was evaluated in a previous study using a conventional statistical methodology. Objective: The aim of this study was to reanalyze this same dataset using the data mining approach in order to find models that improve the classification accuracy of the genes studied. Methods: We built predictive models using different subsets of factors, selected according to their importance in predicting patient classification. We then evaluated each independent model and also a combination of them, leading to a better predictive model. Results: Our data mining approach identified genetic patterns that escaped detection using conventional statistics. More specifically, the partial decision trees and ensemble models increased the classification accuracy of hepatitis C virus outcome compared with conventional methods. Conclusions: Data mining can be used more extensively in biomedicine, facilitating knowledge building and management of human diseases.
SN  - 1438-8871
UR  - https://www.jmir.org/2021/2/e18766
UR  - https://doi.org/10.2196/18766
UR  - http://www.ncbi.nlm.nih.gov/pubmed/33624609
DO  - 10.2196/18766
ID  - info:doi/10.2196/18766
ER  - 