Expression

Last updated: 25 days ago (view history), Time to read: 4 mins

Overview

This facet comprises three tabs, allowing users to explore the expression landscape of 3,432 RNA-Seq fresh frozen tumor samples (1,389 blood tumors, 888 solid tumors, and 1,155 brain tumors) using a t-SNE plot (Figure 1), gene expression violin plots organized by subtype for a gene of interest (Figure 2), gene expression overlayed on the t-SNE, or collectively within a data matrix.

expression t sne initial screen expression t sne sample view

Figure 1: t-SNE for Blood, Brain, and Solid Samples. Mouse over data points to access metadata details for each sample. Visualization powered by D3.

expression gene violin plots

Figure 2: Gene Expression for MYCN. Gene expression violin plots for each sample, filtered by the gene of interest. Visualization powered by Plotly.

Note

  • All samples use the hg38 reference genome.
  • Full metadata can be accessed through our manifest.

Features for the t-SNE Plot

Feature Description
Subtype Categorization Subtypes are color-coded, and a subset is labeled on the plot. These can be turned off in the 3 dot menu.
Sample Summary Clicking a data point opens a drawer with metadata and sample details.
Filters Filters are categorized by Tumor Sample, Patient Phenotype, and Sample Preparation.
Sample Search Search by individual or bulk (comma-separated) sample IDs. CompBio IDs must be exact.
Lasso Tool Select a region on the plot to retrieve a list of samples for further investigation.
Pan/Zoom Zoom in or pan to examine specific regions of the plot. This will disable subtype labels.

Warning Filtering by the sunburst will auto-populate the Root and Subtype filters. These can be manually edited but will not update the sunburst.


Features for Gene Expression

Feature Description
Gene Sandbox Violin plots for the gene of interest, filtered by root and subtypes.
Plotly Functions Pan and zoom features on the right side of the gene sandbox do not affect filter components.
Median Sort Sort the gene expression sandboxes by median expression across or within individual groups.
Outlier Toggle Toggle off data points to keep outliers intact for the cohort currently being filtered.

For data normalization details, refer to our Methods and Data page.

Gene Expression Overlay on t-SNE

Users can overlay gene expression on the t-SNE plot by selecting genes of interest. Count data is normalized using Median of Ratios (MoR). More details can be found on the Methods and Data page.


Features for the Data Matrix

The data matrix displays all filtered data with sortable headers for easier exploration.


Filters Explained

Tumor Sample

Filter Description
Sample ID Search by individual or bulk St. Jude CompBio IDs (comma-separated). Allows multi-select.
Subtype Root Custom-select a root to prompt applicable subtypes. Heme is defaulted upon loading the facet unless the sunburst is employed.
Subtype Custom-select subtypes to view on the plot. Parent node selection enables or disables child nodes.
Subtype Biomarker Multi-select subtype biomarkers to apply on the plot. General genes like “CTNNB1” are not accepted; users must select biomarkers from dropdown.
Sample Type Multi-select dropdown for sample types.

Patient Phenotype

Filter Description
Sex Multi-select dropdown for biological sex.
Age at Diagnosis Adjustable scale or manual input for age in years.
Race Multi-select dropdown for race.
Ethnicity Multi-select dropdown for ethnicity.

Sample Preparation

Filter Description
Library Selection Protocol Multi-select dropdown for library protocol types.
Preservative Multi-select dropdown for sample preservative types.

Warning Some fields may have a “Not Available” option for samples where the data wasn’t recorded (e.g., Race, Ethnicity, Sex).

Tip For a subset of this data, refer to Figure 4f of McLeod et al.


To see how the data was calculated and normalized, visit our Methods and Data page.