Path: utzoo!utgpu!watserv1!watmath!att!linac!Firewall!caen!zaphod.mps.ohio-state.edu!wuarchive!uwm.edu!bionet!rutgers!mbcl!goldman From: goldman@mbcl.rutgers.edu Newsgroups: bionet.molbio.genbank Subject: Re: Data Exchange (updates revisited) Message-ID: <463.2846214b@mbcl.rutgers.edu> Date: 31 May 91 14:11:23 GMT References: <1991May29.185538.1@hmivax.humgen.upenn.edu> Lines: 51 In article <1991May29.185538.1@hmivax.humgen.upenn.edu>, bailey@hmivax.humgen.upenn.edu writes: > In article <9105281658.AA02439@histone.lanl.gov> on May 28, Paul Gilna writes: > >>All data processed by DDBJ (with the exception of confidential data) >>are passed to GenBank on a regular basis and incorporated >>immediately into the on-line flatfile servers and RDBMS satellites; >>These data are in turn propagated to EMBL and their distribution nodes. > > How rapidly does this integration of data occur? We have been updating local > copies of nucleic acid sequence databases by FTP, and I'm curious whether > there's a sufficient lag to justify maintaining incremental updates to GenBank > and EMBL separately (I gather from recent discussion that there's no access at > present to DDBJ until GenBank gets the data; correct me if I'm wrong). > > If anyone is familiar with the exchange rates or has looked at information > overlap in the updates between releases, I'd be interested to know what one can > expect. I don't have any hard information on this, but we also take both the genbank and the EMBL weekly updates. We use the GCG suite of programs, and I run AccessionNumbers on the Genbank data, and use that so that the "new" data from Genbank and EMBL does not have duplicate entries. From this, I would estimate that (per week) the number of entries that are in EMBL but not yet in Genbank is about a fifth of the total number of entries in the Genbank "new" data. (I just looked at the sizes of the em_new.ref and gb_new.ref files.) I hope this helps.... > > Finally, the concept of confidential data in a database is new to me. How > precisely does this work? Is this unique to DDBJ, or might there be data > quietly floating around in other databases as well? How does one officially > 'spot' references to confidential data? > > Thanks to all. Apologies if any of the above questions are common knowledge to > the rest of the world. > > > Charles Bailey > > !------------------------------------------------------------------------------- > ! Dept. of Human Genetics / Howard Hughes Medical Institute > ! University of Pennsylvania School of Medicine Rm. 430 Clinical Research Bldg. > ! 422 Curie Blvd. Philadelphia, PA 19104 USA Tel. (215) 898-1699 > ! Internet: bailey@hmivax.humgen.upenn.edu (IN 128.91.200.37) > !------------------------------------------------------------------------------- Adrian -- Adrian Goldman | Internet: Goldman@MBCL.Rutgers.Edu Molecular Biology Computing Laboratory | Bitnet: Goldman@BioVAX Waksman Insitute, | Phone: (908) 932-4864 Rutgers University, | Fax: (908) 932-5735 Piscataway, NJ 08855 USA |