Minimal Browser Installation
Minimal Browser Installation
Usually a browser installation wants to be a subset of genomes compared to the entire UCSC Genome Browser
Instead of the entire rsync of everything mentioned in the Mirror Instructions , a subset of data can be downloaded.
A minimal browser database needs six tables:
- grp
- chromInfo
- trackDb
- hgFindSpec
- gold
- gap
The gateway page needs the hgcentral database to function. The hgcentral database can by copied directly from the MySQL data files from the ftp server ftp://hgdownload.cse.ucsc.edu/mysql/hgcentral or loaded from the SQL text file at http://hgdownload.cse.ucsc.edu/admin/hgcentral.sql
Currently the gateway page expects the a human database to exist in order to function without difficulty. But you do not need to download it: Simply run this mysql command to select another default database:
update hgcentral.defaultDb set name="YOURDEFAULTDATABASE_e.g._mm8" where genome="Human";
For the /gbdb/ data area, at a minimum you will need the .2bit file or the nib files for the assembly. This is either:
/gbdb/<database>/<database>.2bit or /gbdb/<database>/nib/*.nib
Various tracks use other files in this directory. If you don't care about all the tracks, you won't need other files here.
For the genbank sequences, you can check the gbExtFile table for your database to see exactly which files are used by that assembly in /gbdb/genbank/
Extract the "path" column from that table and use that list in a --files-from specification for your rsync.
Partial Mirrors
See this page: Browser Mirrors (they should probably be fused into one?)
See also
Building a new genome database
User notes
I made hgBlat work on my local browser installation by putting the full hostnames into hgcentral.blatservers, e.g. 'blat4' was replaced by the output of `blat4.cse.ucsc.edu`. I wonder if it wouldn't be a good idea to mention this in the mirroring instructions somewhere. --- max
Before you start using our blat servers, you need to verify with us that you have permission. We can't have everyone with a mirror site simply use our blat servers, the load would take them down for everyone. See also: Kent Informatics for a commercial blat license.
A nice command from Paul McKenna: UPDATE blatServers SET host=concat(host,’.cse.ucsc.edu’); Max 15:11, 3 February 2007 (PST)