Schema for Simple Repeats - Simple Tandem Repeats by TRF
Database: hub_32_GCF_009829145.1
Primary Table: hub_32_simpleRepeat
Data last updated: 2022-12-19
Big Bed File: https://hgdownload.soe.ucsc.edu/hubs/GCF/009/829/145/GCF_009829145.1/bbi/GCF_009829145.1_bChiLan1.pri.simpleRepeat.bb
Item Count: 247,189
Format description: Describes the Simple Tandem Repeats
field          example      description
chrom          NC_045637.1  Reference sequence chromosome or scaffold
chromStart     104232931    Start position in chromosome
chromEnd       104233067    End position in chromosome
name           GGAGG        Simple Repeats tag name
period         5            Length of repeat unit
copyNum        27.2         Mean number of copies of repeat
consensusSize  5            Length of consensus sequence
perMatch       100          Percentage Match
perIndel       0            Percentage Indel
score          272          Alignment Score = 2*match-7*mismatch-7*indel; minscore=50
A              19           Percent of A's in repeat unit
C              0            Percent of C's in repeat unit
G              80           Percent of G's in repeat unit
T              0            Percent of T's in repeat unit
entropy        0.72         Entropy
sequence       GGAGG        Sequence of repeat unit element
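
As an illustrative check of the score and copyNum fields (a sketch, not TRF's own implementation), the example record above can be reproduced in Python. Two assumptions are made here: coordinates are BED-style half-open, so the repeat span is chromEnd - chromStart, and in a perfect repeat (perMatch 100, perIndel 0) every base of the span counts as one match. The function name alignment_score is hypothetical.

```python
MIN_SCORE = 50  # minscore stated in the score field description


def alignment_score(matches: int, mismatches: int = 0, indels: int = 0) -> int:
    """Score as stated in the schema: 2*match - 7*mismatch - 7*indel."""
    return 2 * matches - 7 * mismatches - 7 * indels


# Values from the example column of the schema table.
chrom_start, chrom_end, period = 104232931, 104233067, 5

# Assumption: half-open coordinates, so the span is end minus start.
span = chrom_end - chrom_start           # 136 bases
copy_num = span / period                 # 27.2 copies of the 5-base unit

# Assumption: a perfect repeat contributes one match per base of the span.
score = alignment_score(matches=span)    # 272

print(span, copy_num, score, score >= MIN_SCORE)
```

Under these assumptions the computed values agree with the example record (copyNum 27.2, score 272). For imperfect repeats the match, mismatch, and indel counts are not stored in the table, so the score cannot be recomputed from these fields alone.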

Sample Rows
 
chrom        chromStart  chromEnd   name              period  copyNum  consensusSize  perMatch  perIndel  score  A   C   G   T    entropy  sequence
NC_045637.1  104232931   104233067  GGAGG             5       27.2     5              100       0         272    19  0   80  0    0.72     GGAGG
NC_045637.1  104238205   104238264  TTCCA             5       11.8     5              100       0         118    18  40  0   40   1.51     TTCCA
NC_045637.1  104241193   104241234  AAAGTGAAAAAATGTA  19      2.2      19             95        0         73     63  2   19  14   1.41     AAAGTGAAAAAATGTAGAA
NC_045637.1  104244028   104244083  T                 1       55.0     1              88        0         83     0   5   0   94   0.31     T
NC_045637.1  104244031   104244083  TTTTTTTTTTC       11      4.8      11             97        2         97     0   5   0   94   0.32     TTTTTTTTTTC
NC_045637.1  104264181   104264222  TTTGGTTTGGTTT     13      3.2      13             86        6         57     0   2   26  70   0.99     TTTGGTTTGGTTT
NC_045637.1  104270591   104270624  TAGCAGCAATATTAGT  17      1.9      17             93        0         57     33  12  21  33   1.90     TAGCAGCAATATTAGTT
NC_045637.1  104325309   104325377  TGTTTT            6       11.7     6              79        6         59     0   0   16  83   0.64     TGTTTT
NC_045637.1  104325312   104325357  TTTTGTTTTTTTGTGG  16      2.8      16             86        10        56     0   0   20  80   0.72     TTTTGTTTTTTTGTGG
NC_045637.1  104325346   104325374  T                 1       28.0     1              100       0         56     0   0   0   100  0.00     T
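
A minimal Python sketch for reading rows like these, assuming they arrive as tab-separated text in the schema's column order (for example, after converting the bigBed file to BED text). The helper names parse_row and composition_entropy are hypothetical. The entropy helper further assumes that the entropy column is the Shannon entropy, in bits, of the A/C/G/T composition; because the stored percentages are rounded, it can only approximate the reported value.

```python
import math

# Column names in the order given by the schema table.
COLUMNS = ["chrom", "chromStart", "chromEnd", "name", "period", "copyNum",
           "consensusSize", "perMatch", "perIndel", "score",
           "A", "C", "G", "T", "entropy", "sequence"]
INT_COLS = {"chromStart", "chromEnd", "period", "consensusSize",
            "perMatch", "perIndel", "score", "A", "C", "G", "T"}
FLOAT_COLS = {"copyNum", "entropy"}


def parse_row(line: str) -> dict:
    """Split one tab-separated row and convert numeric columns."""
    values = line.rstrip("\n").split("\t")
    if len(values) != len(COLUMNS):
        raise ValueError(f"expected {len(COLUMNS)} columns, got {len(values)}")
    row = {}
    for col, val in zip(COLUMNS, values):
        if col in INT_COLS:
            row[col] = int(val)
        elif col in FLOAT_COLS:
            row[col] = float(val)
        else:
            row[col] = val
    return row


def composition_entropy(row: dict) -> float:
    """Shannon entropy (bits) of the base composition; an assumed definition."""
    percents = [row[base] for base in "ACGT"]
    total = sum(percents)
    if total == 0:
        return 0.0
    # Normalize because rounded percentages may not sum to exactly 100.
    return -sum((p / total) * math.log2(p / total) for p in percents if p > 0)


# First sample row, written out with explicit tab separators.
first = parse_row("NC_045637.1\t104232931\t104233067\tGGAGG\t5\t27.2\t5\t100"
                  "\t0\t272\t19\t0\t80\t0\t0.72\tGGAGG")
print(first["name"], first["chromEnd"] - first["chromStart"])
print(round(composition_entropy(first), 2), "vs reported", first["entropy"])
```

Note that the name column appears to hold at most the first 16 bases of the repeat unit, so the full unit should be read from the sequence column.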

Simple Repeats (hub_32_simpleRepeat) Track Description
 

Description

This track displays simple tandem repeats (possibly imperfect) on the Chiroxiphia lanceolata genome assembly GCF_009829145.1_bChiLan1.pri (GCF_009829145.1, 03 Jan 2020). The repeats were located by Tandem Repeats Finder (TRF), a program specialized for this purpose. These repeats can occur within coding regions of genes and may be quite polymorphic. Repeat expansions are sometimes associated with specific diseases.

The track contains 247,189 items covering 17,825,287 bases of the 1,089,631,598-base assembly (1.64% coverage).
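
The coverage figure follows directly from the two base counts. The short Python check below is only a worked version of that arithmetic; the function name percent_coverage is hypothetical.

```python
def percent_coverage(covered_bases: int, assembly_size: int) -> float:
    """Share of the assembly covered by track items, as a percentage."""
    return 100.0 * covered_bases / assembly_size


covered = 17_825_287       # bases covered by the track
assembly = 1_089_631_598   # total assembly size in bases

coverage = percent_coverage(covered, assembly)
print(f"{coverage:.2f}%")  # prints 1.64%
```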

Methods

For more information about the TRF program, see Benson (1999).

Credits

TRF was written by Gary Benson.

References

Benson G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 1999 Jan 15;27(2):573-80. PMID: 9862982; PMC: PMC148217
