Introduction into GTEx


This is an SOP on getting started with Genotype-Tissue Expression (GTEx). Here you can find resources, tools, and workflows that will help get you started with GTEx.


GTEx is a data resource and tissue bank established to study the relationships between variants and gene expression across several tissue types and different people.

It was originally created to study the relationship between genetic variants and gene expression in multiple human samples across tissues. It has whole genome sequencing, RNA-sequencing, SNP arrays, and whole exome data from 960 deceased adults with multiple different tissue types harvested from those individuals. From each individual, GTEx has obtained around 50 tissue samples.

The goals of GTEx are to increase understanding of how changes in genes contribute to common human diseases and to identify the pieces of DNA modulating genes.



Resources


Here, you can find resources that will help in giving an overview of what GTEx is.

Common Fund’s Program Snapshot of GTEx

  • An overall introduction to GTEx
  • Gives information regarding the entire program

The Genotype-Tissue Expression (GTEx) project

  • The paper that gives an introduction into GTEx

GTEx Portal

  • A user-friendly GUI that allows you to interact with GTEx data
  • Includes:
    • Gene expression
    • QTLs
    • Histology images

GTEx Training Videos

  • Videos that give an introduction to GTEx data and the portal

GTEx Publications Page

  • A list of nearly all of GTEx publications

GTEx Pilot Phase publications

  • Pilot Publications on Nature Collections
  • All of the manuscripts have been posted until 2017.

GTEx FAQs

  • FAQ page for GTEx
  • Answers that could be helpful during your analysis



Tools


Here are R packages and tools that can be used with GTEx data.

UCSCXenaTools

This is an R Package from UCSC Xena platform, from Cancer Multi-omics to Single-cell RNA-sequencing. The public omics data from UCSC Xena are supported through multiple turn-key Xena Hubs, which are a collection of UCSC-hosted public databases such as TCGA, ICGC, TARGET, GTEx, CCLE, and others. To access this package and the vignette that corresponds to it, click here.

recount

This is an R package that utilizes processed data from TCGA, SRA, and GTEx. Using this package, you can easily access RNA-seq data (e.g.: raw counts) from numerous sources in the R environment to perform downstream analysis with complementary packages (e.g.: DESeq2). Also, you can visualize base-level genome coverage data and perform analyses at multiple feature levels. To access this package, click here. To access the R Shiny app, recount2, click here.

TCGAbiolinks

This is an R package that was originally developed to perform integrative analysis with Genomic Data Commons data, including the data from The Cancer Genome Atlas. New functionalities allow for normal samples from the GTEx project to be used with this package.To access the the paper that gives an overview of using GTEx data with TCGAbiolinks, click here. To access the package, click here.

There is a workshop that has an example of using the TCGAbiolinks package to access GTEx data. You can find this example in the link here. In particular, look at the section titled GTEx gene expression data.



Workflows


Here are workflows that can be used with GTEx data.

Working with Big Data in Genomics

Here is an alternative introduction to working with GTEx data in R. The goal of this workflow is to teach you how to work with large, complex data sets and genomic data in general. To access this workflow, click here.

GTEx Tutorials

Here are GTEx tutorials in R. This will help you in selecting samples, loading expression data, and understanding cis-eQTL data. To access the tutorials, click here.

  • Note: The hashmap package is not working, so that tutorial is obsolete.

recountWorkflow

Here is a workflow for recount. It explains in detail how to use the recount package and how to integrate it with other Bioconductor packages for several analyses that can be carried out with the recount2 resource. To access the workflow, click here.