Path: utzoo!utgpu!news-server.csri.toronto.edu!bonnie.concordia.ca!uunet!bionet!uwm.edu!cs.utexas.edu!yale!hsdndev!husc6!Frodo.MGH.Harvard.EDU!Cherry From: Cherry@Frodo.MGH.Harvard.EDU (J. Michael Cherry) Newsgroups: bionet.general Subject: Re: Codon usage tables available Message-ID: <5755@husc6.harvard.edu> Date: 19 Feb 91 01:14:38 GMT References: <1991Feb17.203115.7949@bronze.ucs.indiana.edu> Sender: news@husc6.harvard.edu Organization: Molecular Biology, Mass. General Hospital Lines: 29 In article <1991Feb17.203115.7949@bronze.ucs.indiana.edu> gilbertd@cricket.bio.indiana.edu (Don Gilbert) writes: > Mike Cherry (cherry@frodo.mgh.harvard.edu) has produced a set of > codon usage tables for 50 different species from Genbank release > 63, which he as generously made publicly available. These tables > are useable directly with the GCG software programs which ask for > codon tables. You can retrieve these codon usage tables for all the major species reported in GenBank directly from my machine via anonymous ftp: frodo.mgh.harvard.edu (IP number 132.183.190.10). Look in the [.codon] directory, the read.me file lists which files go with which species. I'll be updating these codon usage tables in a month or two, probably after GenBank release 67. I use the features table to get the coding regions then check that each coding region doesn't have more than one stop codon. From these extracted exons I manually eliminate all redundant entries and any sequence that is not wild type. This means I remove mutant genes, pseudogenes, viral and transposon sequences and those that are questionable. Then I remove all but one copy per type of things like immunoglobulins. Mike Cherry cherry@frodo.mgh.harvard.edu Department of Molecular Biology Massachusetts General Hospital, Boston 617-726-5955 (a Macintosh news reader user, this message comes from HyperCard NetNews)