Longitudinal Reading Measures and Genome Imputation in the National Child Development Study: Prospects for Future Reading Research

Twin Res Hum Genet. 2023 Feb;26(1):10-20. doi: 10.1017/thg.2023.2. Epub 2023 Mar 10.

Abstract

Reading difficulties are prevalent worldwide, including in economically developed countries, and are associated with low academic achievement and unemployment. Longitudinal studies have identified several early childhood predictors of reading ability, but studies frequently lack genotype data that would enable testing of predictors with heritable influences. The National Child Development Study (NCDS) is a UK birth cohort study containing direct reading skill variables at every data collection wave from age 7 years through to adulthood with a subsample (final n = 6431) for whom modern genotype data are available. It is one of the longest running UK cohort studies for which genotyped data are currently available and is a rich dataset with excellent potential for future phenotypic and gene-by-environment interaction studies in reading. Here, we carry out imputation of the genotype data to the Haplotype Reference Panel, an updated reference panel that offers greater imputation quality. Guiding phenotype choice, we report a principal components analysis of nine reading variables, yielding a composite measure of reading ability in the genotyped sample. We include recommendations for use of composite scores and the most reliable variables for use during childhood when conducting longitudinal, genetically sensitive analyses of reading ability.

Keywords: Reading; cohort study; genotype imputation; longitudinal measures; principal components analysis.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Child Development*
  • Child, Preschool
  • Cognition*
  • Cohort Studies
  • Genotype
  • Humans
  • Phenotype
  • Polymorphism, Single Nucleotide