Select Data

Table
Expression Histogram

Select number of bins

Show expression for:

Distribution Summary

About the Browser

This browser allows the user to search for a gene by the HUGO gene symbol and dynamically returns information about its expression. Gene expression data were generated by Lappalainen et al. for a subset of the 1000 Genomes Project population samples.

The Table tab of the Browser page displays a table containing gene expression information for the selected samples with links to repository samples. This table can be downloaded by the user by clicking the 'Download' button beneath the table. The Expression Histogram tab of the Browser page displays an interactive histogram of the gene expression data across the selected samples. Individual samples can be highlighted on the histogram by typing or selecting their sample ID in the dropdown. The About page contains information about data collection and processing. Please Note: We strongly recommend experimental validation to confirm any gene expression profiles that are critical for your research.

Definitions:

Fragments per kilobase per million (FPKM)
Transcripts per kilobase per million (TPM)
Counts per million (CPM)
Reads per kilobase per million (RPKM)
Variance stabilizing transformation (VST)

Data Processing:

Preprocessed TPM and FPKM data were downloaded directly from EBI Expression Atlas and integrated into the application. Please see the experiment supplementary information for details about FPKM and TPM processing.

For CPM, RPKM, and VST normalized data, raw gene counts and annotation information were downloaded from Array Express for the E-GEUV-1 dataset. For more information about the generation of the raw count data see the E-GEUV Array Express page for details and links to the associated publications. Raw counts were summed across biological replicates and then normalized using TMM normalization in edgeR. CPM and RPKM normalized values were generated using the cpm() and rpkm() functions in edgeR, respectively. Gene length information required in the RPKM calculation was downloaded from GENCODE v12. VST normalized data were calculated on the summed count matrix using the DESeq2 package.

All analysis scripts and source code is available on the Coriell Research GitHub page.