Robust stratification of breast cancer subtypes using differential patterns of transcript isoform expression

PLoS Genet. 2017 Mar 6;13(3):e1006589. doi: 10.1371/journal.pgen.1006589. eCollection 2017 Mar.

Abstract

Breast cancer, the second leading cause of cancer death of women worldwide, is a heterogenous disease with multiple different subtypes. These subtypes carry important implications for prognosis and therapy. Interestingly, it is known that these different subtypes not only have different biological behaviors, but also have distinct gene expression profiles. However, it has not been rigorously explored whether particular transcriptional isoforms are also differentially expressed among breast cancer subtypes, or whether transcript isoforms from the same sets of genes can be used to differentiate subtypes. To address these questions, we analyzed the patterns of transcript isoform expression using a small set of RNA-sequencing data for eleven Estrogen Receptor positive (ER+) subtype and fourteen triple negative (TN) subtype tumors. We identified specific sets of isoforms that distinguish these tumor subtypes with higher fidelity than standard mRNA expression profiles. We found that alternate promoter usage, alternative splicing, and alternate 3'UTR usage are differentially regulated in breast cancer subtypes. Profiling of isoform expression in a second, independent cohort of 68 tumors confirmed that expression of splice isoforms differentiates breast cancer subtypes. Furthermore, analysis of RNAseq data from 594 cases from the TCGA cohort confirmed the ability of isoform usage to distinguish breast cancer subtypes. Also using our expression data, we identified several RNA processing factors that were differentially expressed between tumor subtypes and/or regulated by estrogen receptor, including YBX1, YBX2, MAGOH, MAGOHB, and PCBP2. RNAi knock-down of these RNA processing factors in MCF7 cells altered isoform expression. These results indicate that global dysregulation of splicing in breast cancer occurs in a subtype-specific and reproducible manner and is driven by specific differentially expressed RNA processing factors.

MeSH terms

  • 3' Untranslated Regions
  • Adult
  • Aged
  • Aged, 80 and over
  • Alternative Splicing
  • Breast Neoplasms / genetics*
  • Breast Neoplasms / metabolism*
  • Cohort Studies
  • Estrogen Receptor alpha / genetics
  • Female
  • Gene Expression Profiling
  • Gene Expression Regulation, Neoplastic*
  • Genome, Human
  • Humans
  • MCF-7 Cells
  • Middle Aged
  • Prognosis
  • Protein Isoforms / genetics
  • RNA, Messenger / metabolism
  • Sequence Analysis, RNA
  • Triple Negative Breast Neoplasms / diagnosis*
  • Triple Negative Breast Neoplasms / genetics*

Substances

  • 3' Untranslated Regions
  • ESR1 protein, human
  • Estrogen Receptor alpha
  • Protein Isoforms
  • RNA, Messenger