Introduction to TCGA

The Cancer Genome Atlas (TCGA) is a publicly available resource of omics data covering more than 20,000 primary cancer and matched normal samples across 33 cancer types. It is a joint effort between the National Cancer Institute (NCI) and the National Human Genome Research Institute (NHGRI). See the links below for more information:

  • The NIH’s introduction to TCGA, and TCGA’s Wikipedia page.
  • A description of the cancers studied by the consortium.
  • The NIH’s Genomic Data Commons (GDC) data portal, where you can access TCGA data.
  • A list of the data types available in TCGA.

Literature

Below are some papers you can read for a better understanding of TCGA:

Data Analysis

See this page for a list of computational tools developed by researchers using TCGA data.

See this page for a user’s guide for accessing the GDC data portal.
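Besides the web portal, the GDC also exposes a public REST API. The following is a minimal sketch, not an official client, of how the TCGA projects could be listed with Python's standard library. The helper names (build_tcga_projects_url, fetch_tcga_projects) are hypothetical and chosen for illustration; the endpoint, filter syntax, and field names are assumed to match the current GDC API documentation and should be checked against it.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Base URL of the public GDC REST API (assumed current; see the GDC API docs).
GDC_API = "https://api.gdc.cancer.gov"


def build_tcga_projects_url(size=100):
    """Build a query URL for the /projects endpoint restricted to TCGA.

    Hypothetical helper: it only assembles the URL and makes no network call.
    """
    # GDC filters are JSON objects of the form {op, content: {field, value}}.
    filters = {
        "op": "in",
        "content": {"field": "program.name", "value": ["TCGA"]},
    }
    params = {
        "filters": json.dumps(filters),
        "fields": "project_id,name,primary_site",
        "format": "JSON",
        "size": str(size),
    }
    return f"{GDC_API}/projects?{urlencode(params)}"


def fetch_tcga_projects(size=100):
    """Fetch the TCGA project records (requires network access)."""
    with urlopen(build_tcga_projects_url(size), timeout=30) as resp:
        payload = json.load(resp)
    # Matching records are expected under data.hits in the response.
    return payload["data"]["hits"]
```

Calling fetch_tcga_projects() should return one record per TCGA project, each with the requested fields. Building the URL in a separate function makes it possible to inspect the query before sending it.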

recount2

recount2 is a resource that provides access to more than 70,000 uniformly processed human RNA-seq samples from TCGA, GTEx, and the Sequence Read Archive (SRA). It can be accessed through the recount2 website and the recount Bioconductor package. See below for some helpful resources: