Index of /full/2004-11-01

Name                                 Last modified      Size
Parent Directory                                        -
DATESTAMP                            31-Jan-2005 11:13  16
README                               31-Jan-2005 11:16  5.8K
go_200411-assocdb-data.gz            31-Jan-2005 11:14  107M
go_200411-assocdb-summary.txt.gz     31-Jan-2005 11:14  856
go_200411-assocdb-tables.tar.gz      31-Jan-2005 11:14  102M
go_200411-assocdb.xml.gz             31-Jan-2005 11:14  8.6M
go_200411-seqdb-data.gz              31-Jan-2005 11:15  187M
go_200411-seqdb-summary.txt.gz       31-Jan-2005 11:15  879
go_200411-seqdb-tables.tar.gz        31-Jan-2005 11:15  180M
go_200411-seqdblite-data.gz          31-Jan-2005 11:15  109M
go_200411-seqdblite-summary.txt.gz   31-Jan-2005 11:15  823
go_200411-seqdblite-tables.tar.gz    31-Jan-2005 11:15  105M
go_200411-seqdblite.fasta.gz         31-Jan-2005 11:16  26M
go_200411-utilities-src.tar.gz       31-Jan-2005 11:16  12M
go_200411.dtd.gz                     31-Jan-2005 11:16  1.5K

For full documentation see


The go_200411 build was generated from a snapshot of the GO taken on October 30, 2004.

This release contains no termdb build - for daily termdb builds, see
the daily section on the GO database site


releases are named


200411 is the release date (the release export usually follows some
time after the monthly release, due to time taken to build)

the DATASET is one of:

* termdb - a database containing just the information on the
GO terms and relationships. These are the tables that are populated:

term                        GO controlled vocab terms
term2term                   relationships between GO terms
term_definition             definitions of terms
dbxref                      external database identifier entities
term_dbxref                 links from terms to other databases
term_synonym                synonyms for terms
graph_path                  transitive closure (all paths) in graph

* assocdb - a database containing both the GO vocabulary and
associations between GO terms and gene products. This database
subsumes termdb. These are the extra tables that are populated:

gene_product                 gene or protein or entity annotated
association                  link between gene product and GO term
evidence                     evidence type and reference for an assoc
gene_product_count           recursive product counts per GO term

* seqdb - a database containing GO terms, gene products and the
sequences associated with these gene products. This db subsumes the
two above. It populates these additional tables:

seq                          biological sequence
gene_product_seq             link between a product and a sequence
seq_dbxref                   external database links for a sequence

NOTE: there are other unpopulated tables - we may or may not decide to
populate these at some point in the future.

* seqdblite - this is the same as seqdb, except all IEA associations
have been removed. The IEA associations provide relatively little
value compared to the curated associations, and they slow querying
down immensely. This is the distribution that AmiGO runs off of. We
are working on optimisations to allow AmiGO to run off of the full
seqdb release.
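
As an illustration of how the graph_path table (the transitive closure)
can be used, here is a minimal sketch of a shell helper that prints a
query for all descendants of a term. The function name descendants_sql
is hypothetical. The column names (term.id, term.acc, term.name,
graph_path.term1_id, graph_path.term2_id, graph_path.distance) are
assumptions that are not stated in this README; check them against the
schema file shipped with this release before relying on the query.

```shell
# Print an SQL query listing every descendant of one GO term.
# ASSUMPTION: graph_path.term1_id is the ancestor, term2_id the
# descendant, and distance is 0 for a term paired with itself.
descendants_sql() {
    # Accept only accessions of the form GO:1234567, so that nothing
    # unexpected is interpolated into the SQL text.
    case $1 in
        GO:[0-9][0-9][0-9][0-9][0-9][0-9][0-9]) ;;
        *) echo "expected an accession like GO:0005634" >&2; return 1 ;;
    esac
    cat <<EOF
SELECT DISTINCT descendant.acc, descendant.name
FROM term
  INNER JOIN graph_path ON (term.id = graph_path.term1_id)
  INNER JOIN term AS descendant ON (descendant.id = graph_path.term2_id)
WHERE term.acc = '$1' AND graph_path.distance <> 0;
EOF
}
```

Once a release has been loaded into a local database (called mygo in
the import steps further down), the output can be piped to the client:
descendants_sql GO:0005634 | mysql mygo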

the TYPE is either

.xml - RDF XML export of the database. This comes as one single
file. Note there is no RDF XML export of seqdb, as we do not include
sequences in the xml yet. We do not include IEA evidence associations
in the xml. We may decide to split this xml file into multiple files
at a later date.

.tables - this is a directory containing the MySQL dump; see below

.sql - SQL CREATE TABLE and INSERT statements for building a local
instance of the database. Equivalent to the .tables TYPE (but slower
to load)
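
The naming scheme can be taken apart mechanically. The sketch below
splits a file name into release, DATASET and the remainder (TYPE plus
compression extensions). It is based only on the file names visible in
the directory listing above, not on any official specification, and the
function name parse_go_file is hypothetical.

```shell
# Split a name such as go_200411-assocdb-tables.tar.gz into
# "RELEASE DATASET REST". Files without a DATASET part
# (e.g. go_200411.dtd.gz) report "none" as the dataset.
parse_go_file() {
    name=${1##*/}              # drop any leading directory
    rest=${name#go_}           # drop the go_ prefix
    release=${rest%%[-.]*}     # text up to the first '-' or '.'
    rest=${rest#"$release"}
    case $rest in
        -*) rest=${rest#-}
            dataset=${rest%%[-.]*}
            kind=${rest#"$dataset"}
            kind=${kind#[-.]} ;;   # drop the separator after DATASET
        *)  dataset=
            kind=${rest#.} ;;
    esac
    printf '%s %s %s\n' "$release" "${dataset:-none}" "$kind"
}
```

For example, parse_go_file go_200411-assocdb-tables.tar.gz prints
"200411 assocdb tables.tar.gz".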


In this distribution, uncompress the file:


Which contains the (MySQL ported) schema used in this release 

You can also look at the HTML marked up version of the schema, or
schema diagrams here:

Go to

  click on "Schema Docs" or "Tables" in the sidebar

To guarantee that your schema, code and database release are in sync
you should use the files from the same release.


The database export was prepared from a mysql db - you should have no
problem importing it:

tar -zxvf go_200411-DATASET-tables.tar.gz
cd <releasedir>
echo "create database mygo" | mysql
cat *.sql | mysql mygo
mysqlimport -L mygo *.txt
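
mysqlimport takes the table name from each data file's name, so every
.txt file needs a table that was created beforehand. A small pre-flight
check along these lines may catch an incomplete unpack before loading.
It assumes one TABLE.sql file per TABLE.txt file, which this README
does not confirm; the function name missing_schemas is hypothetical.

```shell
# Print the name of every table that has a data file (TABLE.txt)
# but no CREATE TABLE file (TABLE.sql) in the given directory.
# ASSUMPTION: the dump pairs each TABLE.txt with a TABLE.sql.
missing_schemas() {
    dir=${1:-.}
    for data in "$dir"/*.txt; do
        [ -e "$data" ] || continue      # glob matched nothing
        table=${data##*/}               # strip the directory
        table=${table%.txt}             # strip the extension
        [ -e "$dir/$table.sql" ] || printf '%s\n' "$table"
    done
}
```

Run it in the release directory before the import; empty output means
every data file has a matching schema file.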

Note: if you are using Windows, you may see warning messages when
loading some tables; to avoid this, load tables this way:

mysql> load data infile
s\\association.txt" into table association lines terminated by '\r\n';

This can be avoided if you disable "TAR file smart CR/LF conversion"
when using WinZip (thanks to Henrik Edgren for the tip).

We are unable to support Windows users - please refer to your MySQL
documentation; if you experience other problems, you may wish to try
posting a question to the go-database mail list to see if other
Windows users have any advice.


Your database is not supported, but we do have some tips below.

Also, the Perl API code is mostly DBMS-neutral; in theory it should
work on non-MySQL setups.


TIP: for converting mysql dumps to postgres, try my2pg

Thanks to Joe Morris of Affymetrix for the tip


see the directory sql/oracle/ in the go_200411-utilities_src software

This is contributed software, and not supported by the go-dev group.


you can access the data using the perl API - see

OR look at the perl API release for the data release:

tar -zxvf go_200411-utilities-src.tar.gz
cd go_200411-utilities_src
cd perl-api
perldoc GO/

Sometimes the Perl API must be in sync with the database, e.g. if the
schema changes in a way that breaks old code.


you can build a local AmiGO installation using the source code and
data included in this distribution. You can load your own data into it
using either the scripts in go-dev/apps/db-loading, or the configure
script and makefile in go-dev/sql


Email the go-database mail list; go to

And click on "Mail List" in the sidebar

Be sure to have read the rest of the database documentation before
asking a question