ImmunoTyper-SR: A computational approach for genotyping immunoglobulin heavy chain variable genes using short-read data

Cell Syst. 2022 Oct 19;13(10):808-816.e5. doi: 10.1016/j.cels.2022.08.008.

Abstract

The human immunoglobulin heavy chain (IGH) locus on chromosome 14 includes more than 40 functional copies of the variable gene (IGHV), which are critical for the structure of antibodies that identify and neutralize pathogenic invaders as part of the adaptive immune system. Because of its highly repetitive sequence composition, the IGH locus has been particularly difficult to assemble or genotype using standard short-read sequencing technologies. Here, we introduce ImmunoTyper-SR, an algorithmic tool for genotyping and copy number variation (CNV) analysis of germline IGHV genes from Illumina whole-genome sequencing (WGS) data, using a combinatorial optimization formulation that resolves ambiguous read mappings. We validated ImmunoTyper-SR on 12 individuals whose IGHV allele composition had been independently validated, and by assessing concordance between WGS replicates from nine individuals. We then applied ImmunoTyper-SR to 585 COVID-19 patients to investigate associations between IGHV alleles and anti-type I IFN autoantibodies, which were previously associated with COVID-19 severity.
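
As a rough illustration of what "resolving ambiguous read mappings" by combinatorial optimization can mean, the toy sketch below assigns each read to exactly one of its candidate alleles so that per-allele read counts sit as close as possible to an integer multiple of an expected per-copy depth. It is not the ImmunoTyper-SR formulation: the abstract does not give the model, and this sketch uses brute-force enumeration rather than an integer linear program. The function `genotype`, the `depth` and `max_copies` parameters, and the read and allele names are all hypothetical.

```python
from itertools import product


def genotype(reads, depth, max_copies=2):
    """Toy brute-force genotyper (illustrative assumption, not the published method).

    reads:      dict mapping read id -> list of candidate alleles it maps to
    depth:      assumed expected number of reads per allele copy
    max_copies: assumed upper bound on copies per allele
    Returns (cost, copies per allele, read -> allele assignment).
    """
    read_ids = sorted(reads)
    alleles = sorted({a for cands in reads.values() for a in cands})
    best = None
    # Enumerate every way of assigning each read to one of its candidates.
    for choice in product(*(reads[r] for r in read_ids)):
        counts = {a: 0 for a in alleles}
        for a in choice:
            counts[a] += 1
        copies, cost = {}, 0
        for a in alleles:
            # Pick the copy number whose expected depth best explains the count.
            c = min(range(max_copies + 1),
                    key=lambda k: abs(counts[a] - k * depth))
            copies[a] = c
            cost += abs(counts[a] - c * depth)
        if best is None or cost < best[0]:
            best = (cost, copies, dict(zip(read_ids, choice)))
    return best


# Hypothetical data: r3 and r7 each map equally well to two alleles.
reads = {
    "r1": ["A"], "r2": ["A"], "r3": ["A", "B"],
    "r4": ["B"], "r5": ["B"], "r6": ["B"],
    "r7": ["B", "C"], "r8": ["C"], "r9": ["C"],
}
cost, copies, assignment = genotype(reads, depth=3)
print(cost, copies, assignment["r3"], assignment["r7"])
```

In this toy input the two ambiguous reads are placed on alleles A and C, which gives each allele exactly one copy's worth of reads. Enumeration grows exponentially with the number of ambiguous reads, which is why real tools would hand such a problem to an optimization solver instead.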

Keywords: ILP; WGS; algorithms; computational biology; genomics; genotyping; immunogenomics; immunoglobulin; next generation sequencing; optimization.

Publication types

  • Research Support, N.I.H., Intramural

MeSH terms

  • Autoantibodies / genetics
  • COVID-19* / genetics
  • Genotype
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Immunoglobulin Heavy Chains / genetics
  • Immunoglobulin Variable Region* / genetics

Substances

  • Immunoglobulin Variable Region
  • Immunoglobulin Heavy Chains
  • Autoantibodies