An enriched approach to combining high-dimensional genomic and low-dimensional phenotypic data

J Biopharm Stat. 2024 Apr 5:1-7. doi: 10.1080/10543406.2024.2330203. Online ahead of print.

Abstract

We describe an approach for combining and analyzing high-dimensional genomic and low-dimensional phenotypic data. The approach applies a scheme of weights to the variables rather than to the observations and, hence, permits incorporation of the information provided by the low-dimensional data source. It can also be incorporated into commonly used downstream techniques, such as random forest or penalized regression. Finally, simulated lupus studies involving genetic and clinical data illustrate the overall idea and show that the proposed enriched penalized method can select significant genetic variables while keeping several important clinical variables in the final model.
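
The abstract does not specify the weighting scheme, so the following is only a minimal sketch of one way variable-level weights can enter a penalized regression: a lasso in which each variable has its own penalty weight, with clinical variables given weight 0 so that they are never shrunk out of the model. The function names (`soft_threshold`, `weighted_lasso`), the toy data, and the choice of weights are all assumptions made for illustration and are not taken from the paper.

```python
def soft_threshold(z, t):
    """Soft-thresholding operator used in lasso coordinate descent."""
    if z > t:
        return z - t
    if z < -t:
        return z + t
    return 0.0


def weighted_lasso(X, y, penalty_weights, lam, n_iter=200):
    """Minimise (1/2n)||y - Xb||^2 + lam * sum_j w_j * |b_j| by coordinate descent.

    X is a list of rows with centred columns (no intercept is fitted).
    penalty_weights[j] == 0 leaves variable j unpenalised (assumed scheme).
    """
    n, p = len(X), len(X[0])
    cols = [[row[j] for row in X] for j in range(p)]
    col_sq = [sum(v * v for v in col) / n for col in cols]
    beta = [0.0] * p
    resid = list(y)
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue  # constant column carries no information
            # Covariance of column j with the residual, with b_j's own contribution added back.
            rho = sum(c * r for c, r in zip(cols[j], resid)) / n + col_sq[j] * beta[j]
            new_b = soft_threshold(rho, lam * penalty_weights[j]) / col_sq[j]
            delta = new_b - beta[j]
            if delta != 0.0:
                resid = [r - c * delta for r, c in zip(resid, cols[j])]
                beta[j] = new_b
    return beta


# Hypothetical toy data: column 0 plays the role of a clinical variable,
# columns 1 and 2 play the role of genetic variables; y = 2*x0 + 1*x1.
X = [[-1, -1, 1], [1, -1, -1], [-1, 0, 0], [1, 0, 0], [-1, 1, -1], [1, 1, 1]]
y = [-3, 1, -2, 2, -1, 3]

enriched = weighted_lasso(X, y, penalty_weights=[0.0, 1.0, 1.0], lam=2.5)
uniform = weighted_lasso(X, y, penalty_weights=[1.0, 1.0, 1.0], lam=2.5)
print(enriched)  # [2.0, 0.0, 0.0]  clinical variable retained
print(uniform)   # [0.0, 0.0, 0.0]  everything shrunk to zero
```

With the same overall penalty level, the uniform lasso drops the clinical variable along with the genetic ones, whereas the variable-weighted version keeps it. Intermediate weights between 0 and 1 would shrink a variable less aggressively rather than exempting it entirely.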

Keywords: Model selection; dimension reduction; penalized regression; precision medicine.