Path: utzoo!utgpu!news-server.csri.toronto.edu!rpi!dali.cs.montana.edu!caen!sdd.hp.com!spool.mu.edu!uunet!bionet!kristoff From: kristoff@genbank.bio.net (David Kristofferson) Newsgroups: bionet.molbio.genbank Subject: Re: statistics on the number of sequences reported ? Message-ID: Date: 2 May 91 17:14:55 GMT References: Organization: GenBank Online Service Lines: 23 > Are there statistics available on the number of sequences and bp > in every release of the GenBank releases? And in other databases ? > I would prefer them in electronic form. Yes, this information is available in the release notes accompanying each GenBank release along with further statistical breakdowns. You can get it by anonymous FTP from genbank.bio.net [134.172.1.160] in the directory pub/db/gb-rel67/gbrel.txt.Z and use the UNIX uncompress utility. I have put an uncompressed version in pub/doc/gbrel.txt. The file is rather large to use e-mail for its retrieval unfortunately. If all you need are the two numbers that you mentioned, here's the first few lines of the file with that info: GBREL.TXT Genetic Sequence Data Bank 15 March 1991 GenBank(R) Release 67.0 Distribution Tape Release Notes 43903 loci, 55169276 bases, from 53763 reported sequences