Machine learning with a reduced dimensionality representation of comprehensive Pentacam tomography parameters to identify subclinical keratoconus

Ke Cao; Karin Verspoor; Elsie Chan; Mark Daniell; Srujana Sahebjada; Paul N Baird

doi:10.1016/j.compbiomed.2021.104884

Comput Biol Med. 2021 Nov:138:104884. doi: 10.1016/j.compbiomed.2021.104884. Epub 2021 Sep 28.

Authors

Ke Cao¹, Karin Verspoor², Elsie Chan³, Mark Daniell³, Srujana Sahebjada¹, Paul N Baird⁴

Affiliations

¹ Centre for Eye Research Australia, Melbourne, Victoria, Australia; Department of Surgery, Ophthalmology, The University of Melbourne, Melbourne, Victoria, Australia.
² School of Computing Technologies, RMIT University, Melbourne, Australia; School of Computing and Information Systems, The University of Melbourne, Melbourne, Australia.
³ Centre for Eye Research Australia, Melbourne, Victoria, Australia; Department of Surgery, Ophthalmology, The University of Melbourne, Melbourne, Victoria, Australia; Royal Victorian Eye and Ear Hospital, Melbourne, Victoria, Australia.
⁴ Department of Surgery, Ophthalmology, The University of Melbourne, Melbourne, Victoria, Australia. Electronic address: pbaird@unimelb.edu.au.

PMID: 34607273

Abstract

Purpose: To investigate the performance of a machine learning model that identifies subclinical keratoconus (KC) using a reduced dimensionality parameter space derived from the complete set of Pentacam parameters.

Methods: All 1692 available parameters were obtained from the Pentacam imaging machine for 145 subclinical KC eyes and 122 control eyes. We applied principal component analysis (PCA) to the complete Pentacam dataset to reduce its parameter dimensionality. We then evaluated the performance of the random forest algorithm with increasing numbers of components to identify the optimal number for distinguishing subclinical KC eyes from control eyes.
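The workflow described in the Methods can be sketched roughly as follows. This is a minimal illustration in Python with scikit-learn, not the authors' implementation: the data are synthetic random numbers shaped like the study's dataset (267 eyes by 1692 parameters), and the standardization step, the 5-fold cross-validation, the candidate component counts, the artificial group shift, and the function name `accuracy_by_component_count` are all assumptions made for the example. Note that a full PCA on such a matrix in scikit-learn returns at most as many components as there are samples, here 145 + 122 = 267.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)

# Synthetic stand-in for the Pentacam export (NOT real data):
# 145 subclinical KC eyes + 122 control eyes, 1692 parameters each.
n_kc, n_control, n_params = 145, 122, 1692
y = np.array([1] * n_kc + [0] * n_control)
X = rng.normal(size=(n_kc + n_control, n_params))
X[y == 1, :50] += 0.8  # hypothetical group shift so there is signal to learn

# Full PCA: the number of components is capped by the number of eyes.
full_pca = PCA().fit(StandardScaler().fit_transform(X))
cumulative_variance = np.cumsum(full_pca.explained_variance_ratio_)


def accuracy_by_component_count(X, y, counts, seed=0):
    """Cross-validated random forest accuracy for each number of components."""
    results = {}
    for k in counts:
        # Scaling and PCA sit inside the pipeline so they are refit on
        # each training fold and never see the held-out eyes.
        model = make_pipeline(
            StandardScaler(),
            PCA(n_components=k, random_state=seed),
            RandomForestClassifier(n_estimators=100, random_state=seed),
        )
        fold_scores = cross_val_score(model, X, y, cv=5, scoring="accuracy")
        results[k] = fold_scores.mean()
    return results


scores = accuracy_by_component_count(X, y, counts=[2, 5, 10, 15])
best_k = max(scores, key=scores.get)
print("components from full PCA:", full_pca.n_components_)
print("mean accuracy by component count:", scores)
print("best component count:", best_k)
```

Placing the scaler and PCA inside the pipeline is a design choice of this sketch; the abstract does not state how the authors separated dimensionality reduction from model evaluation.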

Results: The dimensionality of the complete set of 1692 Pentacam parameters was reduced to 267 principal components using PCA. A subsequently selected subset of 15 of these principal components explained over 85% of the variance of the original Pentacam-derived parameters and was used as input to train a random forest machine learning model, which achieved the best accuracy of 98% in detecting subclinical KC eyes. The model also reached a high sensitivity of 97% in identifying subclinical KC eyes and a specificity of 98% in recognizing control eyes.
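For readers less familiar with the three reported metrics, the short sketch below shows how accuracy, sensitivity, and specificity follow from a confusion matrix, with subclinical KC treated as the positive class. The counts are hypothetical values chosen for easy arithmetic and are not the study's results; the class and function names are likewise illustrative.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    tp: int  # subclinical KC eyes correctly flagged
    fn: int  # subclinical KC eyes missed
    tn: int  # control eyes correctly recognized
    fp: int  # control eyes wrongly flagged


def sensitivity(c: ConfusionCounts) -> float:
    """Fraction of subclinical KC eyes that the model identifies."""
    return c.tp / (c.tp + c.fn)


def specificity(c: ConfusionCounts) -> float:
    """Fraction of control eyes that the model recognizes as controls."""
    return c.tn / (c.tn + c.fp)


def accuracy(c: ConfusionCounts) -> float:
    """Fraction of all eyes classified correctly."""
    return (c.tp + c.tn) / (c.tp + c.fn + c.tn + c.fp)


# Hypothetical counts for illustration only (not taken from the paper).
counts = ConfusionCounts(tp=40, fn=10, tn=45, fp=5)
print(f"sensitivity={sensitivity(counts):.2f}")
print(f"specificity={specificity(counts):.2f}")
print(f"accuracy={accuracy(counts):.2f}")
```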

Conclusions: A random forest-based model trained on a modest number of components derived from a reduced dimensionality representation of the complete Pentacam system parameters identified subclinical KC with high accuracy.

Keywords: Artificial intelligence; Dimensionality reduction; Keratoconus; Machine learning.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Cornea / diagnostic imaging
Corneal Topography
Humans
Keratoconus* / diagnostic imaging
Machine Learning
ROC Curve
Tomography