Assessing Data Independence in Sequential Ultrasound Imaging for Machine Learning-Based Liver Disease Diagnosis
iacs CAI

Computing and Algorithm Insight

Computing and Algorithm Insight serves as a dynamic platform for researchers, scholars, and industry professionals,...

Publishing Model

Open Access
This journal published by Integra Academic Press

Abstract

Machine learning applications in medical imaging demand sufficient and independent data for effective training and testing. However, sequential imaging data, such as time-series images, often exhibit inherent correlations, leading to data interdependence. This study investigates statistical methods to assess the independence of sequential B-mode liver ultrasound images. We analyzed 1,180 ultrasound images containing 5,903 regions of interest from patients with liver fibrosis, steatosis, or normal liver conditions. Texture features extracted from these images were used to train machine learning models for computer-aided diagnosis. The models achieved strong performance, with logistic regression yielding an AUC of 0.928 for binary classification and random forest achieving an AUC of 0.917 for multiclass classification. To evaluate data independence, we applied Jensen Shannon (JS) divergence, which revealed that images from normal livers were independent, while those from diseased livers showed interdependence. These findings highlight the importance of testing for data independence in sequential medical imaging to ensure model generalizability and prevent data leakage. Such statistical tests can guide the use of same-subject images in training machine learning models for real-world medical imaging applications.

Keywords: Medical Imaging Quantitative Ultrasound Machine Learning Data Independence Liver Disease Diagnosis


References

Brattain, L.J.; Telfer, B.A.; Dhyani, M.; Grajo, J.R.; Samir, A.E. Machine learning for medical ultrasound: Status, methods, and future opportunities. Abdominal Radiology, 2018, 43, 786–799.

Aung, Y.Y.M.; Wong, D.C.S.; Ting, D.S.W. The promise of artificial intelligence: A review of the opportunities and challenges of artificial intelligence in healthcare. British Medical Bulletin, 2021, 139, ldab016.

Willemink, M.J.; Koszek, W.A.; Hardell, C.; Wu, J.; Fleischmann, D.; Harvey, H.; Folio, L.R.; Summers, R.M.; Rubin, D.L.; Lungren, M.P. Preparing Medical Imaging Data for Machine Learning. Radiology, 2020, 295, 4–15.

Vabalas, A.; Gowen, E.; Poliakoff, E.; Casson, A.J. Machine learning algorithm validation with a limited sample size. PLoS ONE, 2019, 14, e0224365.

Teng, D.; Li, F.; Zhang, W. Using comprehensive machine-learning models to classify complex morphological characters. Ecology and Evolution, 2021, 11, 10421–10431.

An, C.; Park, Y.W.; Ahn, S.S.; Han, K.; Kim, H.; Lee, S.K. Radiomics machine learning study with a small sample size: Single random training-test set split may lead to unreliable results. PLoS ONE, 2021, 16, e0256152.

Lehne, M.; Sass, J.; Essenwanger, A.; Schepers, J.; Thun, S. Why digital medicine depends on interoperability. NPJ Digital Medicine, 2019, 2, 79.

Undar, M.; Krishnapuram, B.; Bi, J.; Rao, R.B. Learning Classifiers When the Training Data Is Not IID. IJCAI International Joint Conference, 2007, 2007, 756–761.

Darrell, T.; Kloft, M.; Pontil, M.; Rätsch, G.; Rodner, E. Machine Learning with Interdependent and Non-identically Distributed Data (Dagstuhl Seminar 15152). Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik, 2015, 5, 4.

Tillman, R. Structure learning with independent non-identically distributed data. ACM International Conference Proceeding Series, 2009, 382, 131.

Kagadis, G.C.; Drazinos, P.; Gatos, I.; Tsantis, S.; Papadimitroulas, P.; Spiliopoulos, S.; Karnabatidis, D.; Theotokas, I.; Zoumpoulis, P.; Hazle, J.D. Deep learning networks on chronic liver disease assessment with fine-tuning of shear wave elastography image sequences. Physics in Medicine & Biology, 2020, 65, 215027.

Carrer, L.; Donini, E.; Marinelli, D.; Zanetti, M.; Mento, F.; Torri, E.; Smargiassi, A.; Inchingolo, R.; Soldati, G.; Demi, L.; et al. Automatic Pleural Line Extraction and COVID-19 Scoring From Lung Ultrasound Data. IEEE Transactions on Ultrasonics, Ferroelectrics, and Frequency Control, 2020, 67, 2207–2217.

Faes, L.; Wagner, S.K.; Fu, D.J.; Liu, X.; Korot, E.; Ledsam, J.R.; Back, T.; Chopra, R.; Pontikos, N.; Kern, C.; et al. Automated deep learning design for medical image classification by health-care professionals with no coding experience: A feasibility study. Lancet Digital Health, 2019, 1, e232–e242.

Nirschl, J.J.; Janowczyk, A.; Peyster, E.G.; Frank, R.; Margulies, K.B.; Feldman, M.D.; Madabhushi, A. A deep-learning classifier identifies patients with clinical heart failure using whole-slide images of H&E tissue. PLoS ONE, 2018, 13, e0192726.

Qu, W.; Balki, I.; Mendez, M.; Valen, J.; Levman, J.; Tyrrell, P.N. Assessing and mitigating the effects of class imbalance in machine learning with application to X-ray imaging. International Journal of Computer Assisted Radiology and Surgery, 2020, 15, 2041–2048.

Pheng, H.S.; Shamsuddin, S.M.; Leng, W.Y. Kullback Leibler divergence for image quantitative evaluation. AIP Conference Proceedings, 2016, 1750, 020003.

Nielsen, F. On the Jensen-Shannon Symmetrization of Distances Relying on Abstract Means. Entropy, 2019, 21, 485.

Okada, K.; Periaswamy, S.; Bi, J. Stratified regularity measures with Jensen-Shannon divergence. In Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, Anchorage, AK, USA, 23–28 June 2008; pp. 1–8.

Ré, M.A.; Azad, R.K. Generalization of entropy based divergence measures for symbolic sequence analysis. PLoS ONE, 2014, 9, e93532.

D’Souza, J.C.; Sultan, L.R.; Hunt, S.J.; Schultz, S.M.; Brice, A.K.; Wood, A.K.W.; Sehgal, C.M. B-mode ultrasound for the assessment of hepatic fibrosis: A quantitative multiparametric analysis for a radiomics approach. Scientific Reports, 2019, 9, 8708.

Sultan, L.R.; Chen, Y.T.; Cary, T.W.; Ashi, K.; Sehgal, C.M. Quantitative pleural line characterization outperforms traditional lung texture ultrasound features in detection of COVID-19. Journal of the American College of Emergency Physicians Open, 2021, 2, e12418.

Tripathi, A.; Goswami, T.; Trivedi, S.K.; Sharma, R.D. A multi class random forest (MCRF) model for classification of small plant peptides. International Journal of Information Management Data Insights, 2021, 1, 100029.

Frank, E.; Hall, M.A.; Witten, I.H. The WEKA Workbench. In Online Appendix for “Data Mining: Practical Machine Learning Tools and Techniques”, 4th ed.; Morgan Kaufmann: Burlington, MA, USA, 2016.

Streiner, D.L. Unicorns Do Exist: A Tutorial on ‘Proving’ the Null Hypothesis. Canadian Journal of Psychiatry, 2003, 48, 756–761.

Chauhan, A.; Sultan, L.R.; Furth, E.E.; Jones, L.P.; Khungar, V.; Sehgal, C.M. Diagnostic accuracy of hepatorenal index in the detection and grading of hepatic steatosis. Journal of Clinical Ultrasound, 2016, 44, 580–586.

Layer, G.; Zuna, I.; Lorenz, A.; Zerban, H.; Haberkorn, U.; Bannasch, P.; van Kaick, G.; Räth, U. Computerized ultrasound B-scan texture analysis of experimental diffuse parenchymal liver disease: Correlation with histopathology and tissue composition. Journal of Clinical Ultrasound, 1991, 19, 193–201.

Meszlenyi, R.J.; Hermann, P.; Buza, K.; Gal, V.; Vidnyanszky, Z. Resting state fMRI functional connectivity analysis using dynamic time warping. Frontiers in Neuroscience, 2017, 11, 75.

Salvador, S.; Chan, P. FastDTW: Toward Accurate Dynamic Time Warping in Linear Time and Space. Intelligent Data Analysis, 2007, 11, 561–580.