Alignment Insertion/Deletion Display Options
The Genome Browser offers several ways to highlight gaps in alignments of query sequences (usually
transcripts) to the genome. Gaps result from sequences in the genome or query (or both) that cannot
be aligned.
Legend
-
double horizontal line (=): both the genome and query have unalignable sequence between regions of
aligned sequence, a double-sided insertion. If this option is not selected, it will display as a
single horizontal line.
-
orange lines or purple lines:
unalignable query sequence, orange for the middle of a sequence and purple for the beginning or
end.
-
green lines: poly-A tail or poly-T head that does not align to the
genome.
Details
When zoomed out past the base level, the browser chooses one color to represent many bases. The
priority of display, from most important to least important, is:
different mRNA base/nonsynonymous codon coloring (if enabled)
or different item bases (if enabled), unalignable query
sequence (orange or purple), an insertion in both genome and query (double horizontal line), and a
poly-A tail (green). The browser will not display genomic/mRNA
codon coloring when viewing large regions of the genome.
Interpretation of Display
Gaps are usually due to a deletion or insertion in one or both sequences, or, infrequently, a
problem in the sequencing. Often, the genome will have a large "insertion" relative to a
query (single horizontal line) that is actually an intron. Double-sided insertions (double
horizontal line) are unusual and may indicate an assembly error, sequencing error, or
polymorphism.
Unalignable query sequence in the middle of a query (orange) implies extra bases in the transcript
sequence or missing bases in the genome sequence. Insertions at the beginning or end of a query
(purple), implies a partial alignment of the query. For instance, a very short sequence next to
large intron gap may be incorrectly aligned. Unalignable query sequence may also be due to
polymorphisms.
Poly-T heads result from queries that are the reverse complement of the genomic sequence. Poly-A
tails and poly-T heads (green) of mRNAs usually can not be aligned to the genome; this is a special
case of an unalignable query sequence.
For information about mRNA codon and base coloring, click
here. For information about EST base coloring, click
here.