Getting Started

Last updated: 4 years ago (view history), Time to read: 4 mins

Use the search box on the top to find a protein by HUGO gene symbol, for example “TP53”.

image0

The TP53 protein is rendered as a color-banded ruler, with the numerical axis the representing number of amino acid residues from N-terminus to C-terminus. This example shows the protein translated from the preferred isoform of TP53 gene (NM_000546), shown on the top left. Click the gene symbol to view all isoforms for TP53:

image5

To view a different protein isoform, either click on the diagram of the isoform in above panel, or search by the isoform’s RefSeq accession number instead of gene symbol.

Protein domain

The source of protein domain annotation is primarily the NCBI CDD database, which has been supplemented by in-house curation efforts. In ProteinPaint, domains are represented by color bands, with varying colors indicating different types of domains. The list of domain types is shown in the legend according to their order of appearance on the protein. For manually curated domains, a PubMed link may be available to reference the source publication.

image3

Hovering the cursor over the protein will display a tooltip showing any domains under the cursor (in some cases there can be more than one):

image2

As a convenience, ProteinPaint offers an option to hide selected domains. To hide a domain type, click on one of the colored boxes in the legend to hide all occurrences of the selected type of domain from display. An “x” sign will appear inside the box to indicate a domain has been hidden. Following example shows TP53 protein with three types of domain hidden:

image6

image7

Hidden domains will be excluded from screenshots. Click the box marked by “X” to toggle the domain visible again.

Exon junction

When hovering the cursor over the protein, the tooltip indicates the amino acid residue under the cursor, and the associated exon number:

image12

Dashed lines represent exon junctions. The following example shows an exon junction is overlaps with Alanine 307, which is marked by exon numbers 8 and 9:

image11

To show additional details about A307, click on this position and ProteinPaint will zoom in to show that the exon junction falls inside codon A307, between the first and second nucleotides. The codon nucleotides are shown in the top row, and amino acid residues are shown in the middle. Alternating background shades help distinguish adjacent codons. During zoom-in, the genomic coordinates of nucleotides will also be shown in tooltip:

image4

The exon junctions are aligned to nucleotide positions by default. This alignment can be configured or turned to hidden using a drop-down menu in the legend:

image10

For example, by choosing the option “align to amino acid”, the exon junction inside A307 will be aligned to the center of the A307 residue:

image9

After zooming-in, clicking on either side of the protein will scroll left or right. To zoom out and view entire protein again, click the “Show all” button on the top:

image8

Summary

image1