Hg38 4-way Genome size statistics

From genomewiki
Revision as of 23:38, 21 April 2017 by Hiram (talk | contribs) (initial contents)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigationJump to search

see also

statistics collected from UCSC 2bit files

  • The external link in the first column goes to the FTP location where the genome sequence was obtained
  • The external links from the UCSC database name go to the UCSC genome-preview browser for that genome
  • N bases == unknown bases in the assembly marked by "N"
  • non-N bases == called bases in the assembly
  • total size == all bases in assembly, N and non-N
  • The % masked is masking from Repeat Masker or Window Masker plus TRF/Simple Repeats, whichever is greater, usually WindowMasker for these genomes
  • The common name external links go to information pages for this genome
  • The coverage numbers are taken directly from information README files from the sequencing center
  • The N50 size is calculated directly from the UCSC chrom.sizes file which is the size of the chromosomes as seen in the UCSC genome browser

do not edit, automatically generated table

order
count/
ftp src
ucsc
db name/
genome-preview
chrom
count
total
size
non-N
bases
N base
count
% masked common name/
src info
coverage N50
chrom.sizes
alignment type % of hg38
matched
(chainLink table)
chain
minScore
chain
linearGap
01hg38 455 3,209,286,1053,049,315,783 159,970,322% 52.10 Human20X 145138636 referencereference 3000 medium
02galVar1 179,514 3,187,660,5722,802,837,430 384,823,142% 38.66 Malayan flying lemur55X 245189 Net57.036% 3000 medium
03tupChi1 50,750 2,846,580,2352,706,389,135 140,191,100% 40.02 Chinese tree shrew80X 3670124 Net45.256% 3000 medium
04mm10 66 2,730,871,7742,652,783,500 78,088,274% 45.18 Mouse20X 130694993 SyntenicNet31.653% 3000 medium