Overrepresentation of transcription factor families in the genesets underlying breast cancer subtypes

BMC Genomics. 2012 May 22:13:199. doi: 10.1186/1471-2164-13-199.

Abstract

Background: The human genome contains a large number of cis-regulatory DNA elements that direct both spatial and temporal gene expression patterns. Previous studies have shown that, based on their mRNA expression, breast tumors can be divided into five subgroups (Luminal A, Luminal B, Basal, ErbB2(+) and Normal-like), each with a distinct molecular portrait. Whole-genome gene expression analyses of independent sets of breast tumors have repeatedly confirmed the robustness of this classification. Furthermore, breast tumors carrying a TP53 mutation show a distinct gene expression profile that is strongly associated with these molecular portraits. The mRNA expression of 552 genes, which varied considerably among different tumors but little between two samples of the same tumor, has been shown to be sufficient to separate these tumor subgroups.

Results: We analyzed in silico the transcriptional regulation of the genes defining the subgroups at three levels: 1. We studied the pathways in which the genes distinguishing the breast cancer subgroups may be jointly involved, including upstream regulators (1st and 2nd level of regulation) as well as downstream targets of these genes. 2. We analyzed the promoter regions of these genes (-500 bp to +100 bp relative to the transcription start site) for canonical transcription factor (TF) binding sites using Genomatix. 3. We examined the actual expression levels of the identified TFs and how they correlate with the overrepresentation of their binding sites in the separate groups. We report that the promoter composition of the genes that most strongly predict the patient subgroups is distinct. The class-predictive genes showed clearly different degrees of overrepresentation of TF families in their promoter sequences.

Conclusion: The study suggests that transcription factors responsible for the observed expression pattern in breast cancers may lead us to important biological pathways.
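As an illustration of the promoter step described in the Results, the following is a minimal Python sketch of two ideas: deriving a -500/+100 bp window around a transcription start site, and scoring overrepresentation of a TF family with a one-sided hypergeometric test. It is not the Genomatix procedure used in the study, whose statistics are not described in the abstract. The function names, the 1-based coordinate convention, and the choice of the hypergeometric test are assumptions made for this example only.

```python
from math import comb


def promoter_window(tss, strand, upstream=500, downstream=100):
    """Return (start, end) genomic coordinates of a promoter window.

    Assumption: coordinates are 1-based and inclusive, and "upstream"
    follows the direction of transcription, so the window is mirrored
    for genes on the minus strand.
    """
    if strand == "+":
        return tss - upstream, tss + downstream
    if strand == "-":
        return tss - downstream, tss + upstream
    raise ValueError("strand must be '+' or '-'")


def overrepresentation_p(hits_in_set, set_size, hits_in_background, background_size):
    """One-sided hypergeometric p-value P(X >= hits_in_set).

    Hypothetical setup: out of `background_size` promoters,
    `hits_in_background` contain a site for a given TF family; we draw
    `set_size` promoters (the subgroup-defining genes) and observe
    `hits_in_set` of them with a site.
    """
    upper = min(hits_in_background, set_size)
    total = comb(background_size, set_size)
    tail = sum(
        comb(hits_in_background, i)
        * comb(background_size - hits_in_background, set_size - i)
        for i in range(hits_in_set, upper + 1)
    )
    return tail / total
```

For example, `promoter_window(1000, "-")` places the 500 bp upstream portion at higher coordinates than the start site, because transcription on the minus strand runs toward lower coordinates. A small p-value from `overrepresentation_p` would indicate that a TF family's sites occur in the gene set more often than expected from the background, under the assumptions stated above.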

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Binding Sites
  • Breast Neoplasms / genetics*
  • Breast Neoplasms / metabolism
  • Breast Neoplasms / pathology
  • Female
  • Gene Expression Regulation, Neoplastic
  • Humans
  • Principal Component Analysis
  • Promoter Regions, Genetic
  • RNA, Messenger / metabolism
  • Receptor, ErbB-2 / genetics
  • Receptor, ErbB-2 / metabolism
  • Software
  • Transcription Factors / genetics*
  • Transcription Factors / metabolism
  • Tumor Suppressor Protein p53 / genetics
  • Tumor Suppressor Protein p53 / metabolism

Substances

  • RNA, Messenger
  • Transcription Factors
  • Tumor Suppressor Protein p53
  • Receptor, ErbB-2