Description
This track shows alignments of D. hydei (DhydGB1) to the
D. hydei (DhydRefSeq1) genome using NCBI megablast. Repeats in
the genome assemblies were masked by WindowMasker prior to
generating the alignments.
The chain track displays boxes joined together by either single or double
lines. The boxes represent aligning regions. Single lines indicate gaps that
are largely due to a deletion in the D. hydei assembly or an insertion
in the D. hydei assembly. Double lines represent more complex gaps that
involve substantial sequence in both species. This may result from inversions,
overlapping deletions, an abundance of local mutation, or an unsequenced gap in
one species. In cases where multiple chains align over a particular region of
the D. hydei genome, the chains with single-lined gaps are often due to
processed pseudogenes, while chains with double-lined gaps are more often due to
paralogs and unprocessed pseudogenes.
In the "pack" and "full" display modes, the individual feature names indicate
the chromosome, strand, and location (in thousands) of the match for each
matching alignment.
Display Conventions and Configuration
By default, the chains to chromosome-based assemblies are colored based on which
chromosome they map to in the aligning organism. To turn off the coloring, check
the "off" button next to: Color track based on chromosome.
To display only the chains of one chromosome in the aligning organism, enter the
name of that chromosome (e.g. chr4) in box next to: Filter by chromosome.
Methods
The two genome assemblies were aligned to each other using Discontiguous
megablast. The resulting alignments were converted into axt format. The axt
alignments were fed into axtChain, which organizes all alignments between a
single D. hydei chromosome and a single D. hydei chromosome
into a group and creates a kd-tree out of the gapless subsections (blocks) of
the alignments. A dynamic program was then run over the kd-trees to find the
maximally scoring chains of these blocks. Chains scoring below a threshold were
discarded; the remaining chains are displayed in this track.
References
Morgulis A, Coulouris G, Raytselis Y, Madden TL, Agarwala R, Schäffer
AA. Database indexing for
production MegaBLAST searches. Bioinformatics. 2008 Aug
15;24(16):1757-64. doi: 10.1093/bioinformatics/btn322.
Morgulis A, Gertz EM, Schäffer AA, Agarwala R. WindowMasker: window-based
masker for sequenced genomes. Bioinformatics. 2006 Jan
15;22(2):134-41. doi: 10.1093/bioinformatics/bti774.
Kent WJ, Baertsch R, Hinrichs A, Miller W, Haussler D.
Evolution's cauldron:
duplication, deletion, and rearrangement in the mouse and human
genomes. Proc Natl Acad Sci U S A. 2003 Sep 30;100(20):11484-9. doi:
10.1073/pnas.1932072100.
|