Path: utzoo!attcan!uunet!decwrl!shelby!apple!bionet!root From: GenBank-Updates@genbank.bio.net Newsgroups: bionet.molbio.genbank.updates Subject: Database Update Message-ID: Date: 1 Jun 90 12:00:07 GMT Sender: root@genbank.BIO.NET Distribution: bionet Lines: 1490 Approved: lear@genbank.bio.net Checksum: 13934 92 LOCUS ECONANA 1243 bp ds-DNA BCT 01-JUN-1990 DEFINITION Escherichia coli nanA gene encoding N-acetylneuraminate lyase, complete cds. ACCESSION M20207 D00067 N00067 KEYWORDS N-acetylneuraminate lyase; NALase; nanA gene; npl gene. SOURCE E.coli JE1011 DNA, 1.2 kb HindIII-EcoRI fragment DNA containing nanA gene, which is cloned in pNL4, for [1]. E.coli C600 DNA, clone pMK6, for [2]. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1243) AUTHORS Kawakami,B., Kudo,T., Narahashi,Y. and Horikoshi,K. TITLE Nucleotide sequence of the N-acetylneuraminate lyase gene of Escherichia coli JOURNAL Agric. Biol. Chem. 50, 2155-2158 (1986) STANDARD simple automatic FEATURES from to/span description pept 93 986 N-acetylneuraminate lyase BASE COUNT 300 a 309 c 322 g 312 t ORIGIN 1 bp upstream of HindIII site. 1 aagctttctg tatggggtgt tgcttaattg atctggtata acaggtataa aggtatatcg 61 tttatcagac aagcatcact tcagaggtat ttatggcaac gaatttacgt ggcgtaatgg 121 ctgcactcct gactcctttt gaccaacaac aagcactgga taaagcgagt ctgcgtcgcc 181 tggttcagtt caatattcag cagggcatcg acggtttata cgtgggtggt tcgaccggcg 241 aggcctttgt acaaagcctt tccgagcgtg aacaggtact ggaaatcgtc gccgaagagg 301 cgaaaggtaa gattaaactc atcgcccacg tcggttgcgt cagcaccgcc gaaagccaac 361 aacttgcggc atcggctaaa cgttatggct tcgatgccgt ctccgccgtc acgccgttct 421 actatccttt cagctttgaa gaacactgcg atcactatcg ggcaattatt gattcggcgg 481 atggtttgcc gatggtggtg tacaacattc cagccctgag tggggtaaaa ctgaccctgg 541 atcagatcaa cacacttgtt acattgcctg gcgtaggtgc gctgaaacag acctctggcg 601 atctctatca gatggagcag atccgtcgtg aacatcctga tcttgtgctc tataacggtt 661 acgacgaaat cttcgcctct ggtctgctgg cgggcgctga tggtggtatc ggcagtacct 721 acaacatcat gggctggcgc tatcagggga tcgttaaggc gctgaaagaa ggcgatatcc 781 agaccgcgca gaaactgcaa actgaatgca ataaagtcat tgatttactg atcaaaacgg 841 gcgtattccg cggcctgaaa actgtcctcc attatatgga tgtcgtttct gtgccgctgt 901 gccgcaaacc gtttggaccg gtagatgaaa aatatctgcc agaactgaag gcgctggccc 961 agcagttgat gcaagagcgc gggtgagttg tttcccctcg ctcgccccta ccggtgaggg 1021 gaaataaacg catctgtacc ctacaatttt cataccaaag cgtgtgggca tcgcccaccg 1081 cgggagactc acaatgagta ctacaaccca gaatatcccg tggtatcgcc atctcaaccg 1141 tgcacaatgg cgcgcatttt ccgctgcctg gttgggatat ctgcttgacg gttttgattt 1201 cgttttaatc gccctggtac tcaccgaagt acaaggtgaa ttc // LOCUS MACEPO 1462 bp ss-mRNA PRI 01-JUN-1990 DEFINITION Monkey (Cynomolgus) erythropoietin mRNA, complete cds. ACCESSION M18189 M15818 M15819 M18188 KEYWORDS erythropoietin. SOURCE Monkey cDNA to mRNA. ORGANISM Macaca fascicularis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Cercopithecidae; Cercopithecinae. REFERENCE 1 (bases 1 to 1462) AUTHORS Lin,F.-K., Lin,C.-H., Lai,P.-H., Browne,J.K., Egrie,J.C., Smalling,R., Fox,G.M., Chen,K.K., Castro,M. and Suggs,S. TITLE Monkey erythropoietin gene: Cloning, expression and comparison with the human erythropoietin gene JOURNAL Gene 44, 201-209 (1986) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by F.-K.Lin, 04-AUG-1987. FEATURES from to/span description pept 217 795 erythropoietin precursor sigp 217 288 erythropoietin signal peptide matp 289 792 erythropoietin BASE COUNT 297 a 455 c 433 g 277 t ORIGIN Unreported. 1 agagtccctg ggccaccccg gtcgctcgct gcgtgcgtcc cggagccgga ccggggccac 61 cgcgccggct ctgcaccgat cccgcgcccc ctggacagcc gccctctcct ccaggcccgt 121 ggggctggcc ctgccccgct gaacttcccg ggatgaggac tcccggtgtg gtcaccgcgc 181 gcctaggtcg ctgagggacc ccggccaggc gcggagatgg gggtgcacga atgtcctgcc 241 tggctgtggc ttctcctgtc tctcgtgtcg ctccctctgg gcctcccagt cccgggcgcc 301 ccaccacgcc tcatctgtga cagccgagtc ctggagaggt acctcttgga ggccaaggag 361 gccgagaatg tcacgatggg ctgttccgaa agctgcagct tgaatgagaa tatcaccgtc 421 ccagacacca aagttaactt ctatgcctgg aagaggatgg aggtcgggca gcaggctgta 481 gaagtctggc agggcctggc cctgctctca gaagctgtcc tgcggggcca ggccgtgttg 541 gccaactctt cccagccttt cgagcccctg cagctgcaca tggataaagc catcagtggc 601 cttcgcagca tcaccactct gcttcgggcg ctgggagccc aggaagccat ctccctccca 661 gatgcggcct cggctgctcc actccgaacc atcactgctg acactttctg caaactcttc 721 cgagtctact ccaatttcct ccggggaaag ctgaagctgt acacggggga ggcctgcagg 781 agaggggaca gatgaccagg tgcgtccagc tgggcacatc caccacctcc ctcaccaaca 841 ctgcctgtgc cacaccctcc ctcaccactc ccgaacccca tcgaggggct ctcagctaag 901 cgccagcctg tcccatggac actccagtgc cagcaatgac atctcagggg ccagaggaac 961 tgtccagagc acaactctga gatctaagga tgtcgcaggg ccaacttgag ggcccagagc 1021 aggaagcatt cagagagcag ctttaaactc aggagcagag acaatgctgg gaaaacacct 1081 gagctcactc ggccacctgc aaaatttgat gccaggacac gctttggagg caatttacct 1141 gtttttgcac ctaccatcag ggacaggatg acctggagaa cttaggtggc aagctgtgac 1201 ttctccaggc ctcacgggca ctcccttggt ggcaagagcc cccttgacac tgagagaata 1261 ttttgcaatc tgcagcagga aaaattacgg acaggttttg gaggttggag ggtacttgac 1321 aggtgtgtgg ggaagcaggg cggtaggggt ggagctggga tgcgagtgag aaccgtgaag 1381 acaggatggg ggctggcctc tggttctcgt ggggtccaag ctttgtgtat tcttcaatct 1441 cactggcaag aactgaaacc ac // LOCUS AFACFXP1 1751 bp ds-DNA BCT 01-JUN-1990 DEFINITION A.eutrophus phosphoribulokinase P (cfxP) gene, complete cds. ACCESSION M33562 KEYWORDS phosphoribulokinase. SOURCE A.eutrophus (strain H16) DNA. ORGANISM Alcaligenes eutrophus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic rods and cocci; Alcaligenaceae. REFERENCE 1 (bases 1 to 1751) AUTHORS Kossmann,J., Klintworth,R. and Bowien,B. TITLE Sequence analysis of the chromosomal and plasmid genes encoding phosphoribulokinase from Alcaligenes eutrophus JOURNAL Gene 85, 247-252 (1989) STANDARD simple staff_entry FEATURES from to/span description pept < 1 621 phosphoribulokinase F(AA at 1) pept 633 1511 phosphoribulokinase P BASE COUNT 310 a 614 c 542 g 285 t ORIGIN 1 ctgcagcccg gcagcgccca ggtggcggcc ggctacgcgc tctacggtcc caccaccatg 61 ctggtgctga ccgtgggcaa tggcgtcaac ggcttcacgc tcgatcccaa cctgggcgag 121 ttcttcctga cgcaccccca cctgcgggtg ccgcccgata cccaggaatt tgccatcaac 181 gcgtcgaaca gccgcttctg ggaagcgccg gtgcagcgct acatcggcga gtgcatggct 241 ggcaagagcg ggccgcgcgg caaggatttc aatatgcgct ggatcgcatc gatggtggcc 301 gaggcgcacc gcatcctgat gcgcggcggt gtcttcatgt acccgcgcga caccaaggat 361 cccgccaagc cgggccgcct tcgcctgctg tacgaggcca acccgatcgc cttcctgatg 421 gagcaggccg gcgggcgcgc cagcacgggc cggcagacgc tgatgtcggt ggcgccgggc 481 gcgctgcacc agcgtatcgg cgtgatcttc ggctcgcgca atgaagtgga acggatcgag 541 ggctaccaca ccaaccagac cgaccccgac cttccgagtc ccctgttcaa cgagcgcagc 601 ctgttccgcg cgtccgcctg aggtgcctgg ccatgtcaga acgttatccc attatcgcta 661 tcaccgggtc ctcgggcgcc ggtaccacgt cggtgacccg taccttcgag aacatcttct 721 gccgtgaagg cgtgaagtcg gtcgtgatcg aaggcgacag cttccaccgc tacgaccgtg 781 ccgagatgaa ggtcaagatg gccgaggccg aacgcaccgg caatatgaac ttcagccact 841 tcggcgcgga gaacaacctg ttcggcgacc tggagagcct gttccgctcc tacgcggaat 901 ccggcacggg catgcgccgg cgctatctgc acagcacgga ggaggccgcg ccgttcgggc 961 agcagcccgg caccttcacc gcgtgggagc cattgccggc cgataccgac ctgctgttct 1021 acgaaggcct gcacggcggc gtggtcaccg acgaggtcaa tgtcgcccag tatcccaacc 1081 tgctgatcgg cgtggtgccc gtcatcaacc tggagtggat tcagaagctc tggcgcgaca 1141 agaaacagcg cggctactcg accgaggccg tgaccgacac catcctgcgc cgcatgccgg 1201 actacgtgaa ctacatctgc ccgcagttct cgcgcacgca tgtgaacttc cagcgggtgc 1261 cgtgcgtgga cacgtccaac ccctttatct cgcgcgaaat ccccgcgccc gatgaaagca 1321 tggtggtgat ccgctttgcc aacccgaagg ggatcgactt ccagtacctg ctgagcatga 1381 tccacgactc cttcatgtcg cgcgccaaca ccatcgtggt gccgggcggc aagatggaac 1441 tggccatgca gctgatcttc acgcccttcg tgctgcgcat gatggagcgc cgcaagcgcg 1501 ccgcgctgta aggagccgag atgaacgcac ccgaacgcat cgaccccgca gcgcgttgcg 1561 ccaacgcgct gcgcttcctg gccgccgacg cggtggagtt ggcccgctcc ggccaccccg 1621 gcgcccccat gggcatggcc gagatggccg aggtcgtgtg gcggcgccac ctgcgccaca 1681 acccggccaa cccggcctgg cccgaccgtg accgcttcgt gctgtccaac ggccatgcct 1741 ccatgctgca g // LOCUS AFACFXP2 1641 bp ds-DNA BCT 01-JUN-1990 DEFINITION A.eutrophus phosphoribulokinase P (cfxF) gene, complete cds. ACCESSION M33563 KEYWORDS phosphoribulokinase. SOURCE A.eutrophus (strain H16) DNA. ORGANISM Alcaligenes eutrophus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic rods and cocci; Alcaligenaceae. REFERENCE 1 (bases 1 to 1641) AUTHORS Kossmann,J., Klintworth,R. and Bowien,B. TITLE Sequence analysis of the chromosomal and plasmid genes encoding phosphoribulokinase from Alcaligenes eutrophus JOURNAL Gene 85, 247-252 (1989) STANDARD simple staff_entry FEATURES from to/span description pept < 1 480 phosphoribulokinase F pept 492 1370 phosphoribulokinase P BASE COUNT 306 a 555 c 505 g 275 t ORIGIN 1 ctgcaggtgc cggccgatac ccaggaattt gccatcaatg cgtcgaacag ccgcttctgg 61 gaagcgccgg tgcagcgcta catcgccgag tgcatggccg gcaagagcgg gccgcgcggc 121 aaggatttca atatgcgctg gatcgcgtcg atggtggccg aggcgcaccg catcctgatg 181 cgtggcggcg tcttcatgta cccgcgcgac tccaaggatc ccgccaagcc gggccgcctg 241 cgcctgctgt acgaggccaa tccgatcgcc ttcctgatgg agcaggctgg cgggcgcgcc 301 agcacgggcc ggcagacgct gatgtcggtg gcgccgggtg cgctgcacca gcgcattggc 361 gtgatcttcg gctcgcgcaa tgaagtggaa cggatcgagg gctaccacac cgaccagacc 421 gatcccgacc ttccgagtcc cctgttcaac gagcgcagcc tgttccgcgc gtctgcctga 481 ggtgcctggc catgtcagaa cgttatccca ttatcgccat caccggctcc tcgggcgccg 541 gtaccacgtc ggtgacccgt accttcgaga acatcttccg ccgcgagggc gtgaagtcag 601 tggtgatcga aggcgacagc ttccaccgct atgaccgcgc cgagatgaag gtcaagatgg 661 ccgaggccga gcgcaccggc aatatgaact tcagccactt cggcgaggag aacaacctgt 721 tcggcgagct ggagaacctg ttccgctcct atgccgaaac cggcacgggc atgcaccggc 781 actacctgca tagcccggag gaagccgcgc cgttcgggca ggaacccggc accttcacgc 841 agtgggagcc gctgccggcc gataccgatc tgctgttcta cgagggcctg cacggcggcg 901 tggtcaccga ttcggtcaat gtcgcccagt atcccaacct gctgatcggc gtggtccctg 961 tcatcaacct ggaatggatc cagaagctct ggcgcgacaa gaagcagcgc ggctattcga 1021 ccgaggccgt gaccgacacc atcctgcgcc gcatgccgga ctacgtgaat tacatctgtc 1081 cgcagttctc gcgcacgcat gtgaacttcc agcgcgtgcc gtgcgtggat acgtccaacc 1141 cctttatctc gcgcgaaatc cccgcgcccg atgaaagcat ggtggtgatc cgctttgcca 1201 acccgaaggg gatcgacttc cagtacctgc tgagcatgat ccacgactcc ttcatgtcgc 1261 gcgccaacac catcgtggtg cccggcggca agatggagct ggccatgcag ctgatcttca 1321 cgcccttcgt gctgcgcatg atggagcgcc gcaagcgcgc cgcgcagtaa ggactcgacc 1381 gtgaaacgcc gactcgcgag gatctcctga atgaacgcac cagaacgcat cgattccgca 1441 gcgcgctgtg ccaacgcgct gcgcttcctg gccgccgacg cggtggagca ggccaaatcg 1501 ggccaccccg gcgcacccat gggcatggcc gaaatggccg aagtgctgtg gcggcgccac 1561 ctgcgacaca acccggccaa cccggcctgg cccgaccgtg accgcttcgt gctgtccaac 1621 ggccatgcgt ccatgctgca g // LOCUS BIFLLD 1767 bp ds-DNA BCT 01-JUN-1990 DEFINITION B.longum L-lactate dehydrogenase (ldh) gene, complete cds. ACCESSION M33585 KEYWORDS L-lactate dehydrogenase. SOURCE B.longum (strain aM101-2) DNA. ORGANISM Bifidobacterium longum Prokaryota; Bacteria; Irregular, nonsporing, gram-positive rods. REFERENCE 1 (bases 1 to 1767) AUTHORS Minowa,T., Iwata,S., Sakai,H., Masaki,H. and Ohta,T. TITLE Sequence and characteristics of the Bifidobacterium longum gene encoding L-lactate dehydrogenase and the primary structure of the enzyme: A new feature of the allosteric site JOURNAL Gene 85, 161-168 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 555 1517 L-lactate dehydrogenase (EC 1.1.1.27) mRNA 493 > 1517 L-lactate dehydrogenase mRNA BASE COUNT 373 a 555 c 488 g 351 t ORIGIN 1 gtcgacgcgg tcaatgacgt gttggcggac atcgaaggca cggcctcgat tccgcgtatt 61 ctcgtattca acaaggccga tcaggcggac gaggcgactc gtgaacgact cgccgcgctg 121 cagccagatg cgttcatcgt ctccgcctat accggtgagg gattggacga gctgcgtacc 181 gcggtcgaaa gtctgctgcc ggtcccgcat gtgcatgtca acgctctgct gccgtatacc 241 gctggctccc tgatctctcg tgtacgcgaa tacggcaagg tagacaaggt ggagtaccgc 301 gatgatggca tacagcttga agcggacgtt gatgcccatc ttgcgctcag gtggtcgaac 361 agtccattga ctaacgtgat aaacatcaca gtatattcgt gagcgctaac aaccgttgaa 421 aacattacca tacggttgtc aaacagggtg gtgtgccggt agcaaaacgt cttagcgggt 481 ttatagagtg aagacgttag ttacaaggcc tgccattcat cagcagaccg cctttgaaga 541 gaggttcatc catcatggcg gaaactaccg ttaagcccac gaagcttgct gttattggtg 601 ccggtgccgt tggctccacc ctcgccttcg ccgctgccca gcgtggcatc gctcgcgaga 661 tcgtgcttga agacatcgcc aaggagcgcg tggaagccga agtgctcgac atgcagcatg 721 gctccagctt ctacccgacc gtgtccatcg acggttccga cgatcctgag atctgccgcg 781 acgccgacat ggtcgtcatc accgctggtc cgcgtcagaa gccgggtcag tctcgtcttg 841 agctcgttgg cgctaccgtc aacatcctca aggccatcat gccgaacctg gtcaaggtgg 901 ctccgaacgc catctacatg ctcatcacca acccggtcga catcgctacc cacgtggctc 961 agaagctcac cggtctgccc gagaaccaga tcttcggttc cggcaccaac ctggactccg 1021 ctcgtctgcg cttcctgatt gcccagcaga ccggcgtcaa cgtcaagaac gtgcacgcct 1081 acatcgccgg cgagcacggc gactccgaag tcccgctgtg ggagtccgcc accatcggtg 1141 gcgtccccat gtgcgactgg accccgctgc ccggccacga tccgctcgac gccgacaagc 1201 gcgaggagat ccaccaggaa gtcaagaacg ccgcttacaa gatcatcaac ggtaagggtg 1261 ccaccaacta cgccatcggc atgtccggcg tcgacatcat cgaagccgtc ctgcacgaca 1321 ccaaccgcat tctgcccgtg agctccatgc tcaaggactt ccacggcatc tccgacatct 1381 gcatgtccgt gccgaccctc ctcaaccgtc agggcgtcaa caacaccatc aacaccccgg 1441 tctccgacaa ggagctcgcc gctctgaagc gctctgccga gacgctgaag gaaaccgccg 1501 cccagttcgg cttctgataa aaaatcgctg tacggagcgc ctttcgcacc gcagcaagac 1561 tcgacctacc tttgtaggcc ttcaccttgt gcggcacgaa aatcgcacac gtacagcgat 1621 tttttatacc ctgctgaatg ctcccgttgg gagcattttt tattcgtgga gttcgttgga 1681 ggagggggag ctatagccct cgggctcgag ctggaaggtg gtgtggggga ccgagaccgg 1741 gaagtgctcg cgcaggcaat cctgcag // LOCUS BOVP450SCC 1471 bp ds-DNA MAM 01-JUN-1990 DEFINITION Bovine P-450scc gene, exon 1. ACCESSION J05245 KEYWORDS P-450scc. SOURCE Bovine calf thymus DNA. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 1471) AUTHORS Ahlgren,R., Simpson,E.R., Waterman,M.R. and Lund,J. TITLE Characterization of the promoter/regulatory region of the bovine CYP11A (P-450-scc) gene: Basal and cAMP-dependent expression JOURNAL J. Biol. Chem. 265, 3313-3319 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 948 / 1216 P-450scc, exon 1 pre-msg 897 > 1471 P-450scc mRNA and introns BASE COUNT 324 a 367 c 396 g 384 t ORIGIN 1 gcagatgtcc ctggtgattc ctgaaacagg ccctctgttt aaattcttca gcagttagag 61 ggaaggtcaa tttttcccaa ggcttttggg ctttgattgt tttcattttt aaattatctg 121 cattctaaag agatattttg ggtggcagat tttgctctcc tacaggactt tgtctaggag 181 acggctctca ggccagctcc gacgactgtt ccaaagaagt aagggaaagc tagggtttat 241 atcaatcttt ttttttgctg ggagaagggg gatgaacatg tagtcaaaca taaaaagatc 301 actgctaatc ccaaacaaca gacacctcaa gtgaatggtt ttagtgtttt tctatatatg 361 ttgtttagtc actaagtcct gtccgactct tttgcgactc catagactgt agcccaccaa 421 gctcctctgt ccatgggatt tttctaggca agaatactgg agtgggttgc catttccttc 481 tccctgggat cttcctaacc caaggactga acccttgtct cctgcattgc aggtggattt 541 tttaccgact gagccaccag ggaagttatg tgtgcaagaa tccggggtca tggaaatttt 601 cccttagata tacatcgtat ctagggacca gtacaatgca aatgcttcct gtttttcttc 661 atcctgaagt ctcctcaggg tgcattgagg gagggagtcc cctcaggtgg gtgaccacag 721 tggctgacgc ttgatgttgt agaactggaa tgatgggtta cattctttcg tttacagtac 781 tgagtctggg aggagctgtg tgggctggag tcagccggag gaggctgacc gccctgtcag 841 cttctcactt agccttgagc tggtgattat aagctgggtc ccagggtccc agggccagag 901 tcacctgctg cagtacgagc agagacagca gcagctgtgg gggcagcatg ctagcaaggg 961 ggcttcccct ccgttcagcc ctggtcaaag cctgcccacc catcctgagc tcagtggggg 1021 agggctgggg ccaccacagg gtgggcactg gagagggagc tggcatctcc acaaagaccc 1081 ctcgccccta cagtgagatc ccctcccctg gtgacaatgg ctggcttaac ctctaccatt 1141 tctggaggga gaagggctca cagagaatcc actttcgcca catcgagaac ttccagaagt 1201 atggccccat ttacaggtaa gcctggcagg aggattgggg ctggcgggat agggaagcct 1261 gtggtggccc cctccctgaa aggtctgccc tccccttcca ggctctggtt cacctctgac 1321 tttatttctt cctgcctggc ggtggcagga gtagagttaa tgcttcccag acagtgggtt 1381 cacttcccag ccctgaggcc tcaacagtcc ccgggctcta cacccttaga aactttgggg 1441 aggtggggag gcccaagaaa ataagccccg g // LOCUS BOVTGN 725 bp ds-DNA MAM 01-JUN-1990 DEFINITION Bovine minisatellite repeat DNA. ACCESSION J04756 KEYWORDS minisatellite sequence. SOURCE Bovine (strain Holstein-Friesian) blood DNA, clone lambda-gBJ32. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 725) AUTHORS Kashi,Y., Iraqi,F., Tikochinski,Y., Ruzinzki,B., Nave,A., Beckmann,J.S., Friedmann,A., Soller,M. and Gruenbaum,Y. TITLE (TG)n Uncovers a sex-specific hybridization pattern in cattle JOURNAL Genomics 7, 31-36 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Gruenbaum, 22-JAN-1990. FEATURES from to/span description rpt 311 703 minisatellite repeat BASE COUNT 134 a 131 c 226 g 234 t ORIGIN 1 aagcttcaca tcccgagaat tccctcccag cgctcgtggt cccacagagg gctctgctgg 61 acctgcctcg ggtcacatgg caggtctggg gaggacacac ctctccccgg cagagaaatg 121 gccagaagcc aggtctgctc cacacgtgcc ttctcccaat actctctaac tttaaaaaaa 181 ctgccaaaga aaaagcggta cgtaataaca agcgcacaga tacgtaattt ataatggctg 241 acacggttgg cagggaaatg tgttacgcag gaattatgtt tttatttatg tgtgtcctgt 301 tttggagaca gcataagtaa tcatgggtgt gtgtgtgtgt gtgtgtgtgt gttgcctgtc 361 tccagcgtaa gtaatcatgt gtgtgtgtgt gtgtgtgtgt tgcctgtctc cagcgtaagt 421 aatcgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gttgcctgtc tccagagtaa 481 gtaatcatgg gtgtgtgtgt gtgtgtgttg cctgtctcca gcataagtaa tcatgggtgt 541 gtgtgtgtgt gtgtgttgcc tgtctccagc ataagtaatc atgggtgtgt gtgtgtgtgt 601 gtgtgtgttg cctgtctcca gcataagtaa tcatgggggg gtgtgtgtgt gtgtgtgtgt 661 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt tgcctgtctc cagggacttt tgtacagaga 721 agctt // LOCUS CELCDPK 2305 bp ss-mRNA INV 01-JUN-1990 DEFINITION C.elegans cAMP-dependent protein kinase subunit R mRNA, complete cds. ACCESSION J05220 KEYWORDS cAMP-dependent protein kinase. SOURCE C.elegans mRNA. ORGANISM Caenorhabditis elegans Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; Spirurida; Spirurina; Filarioidea; Filariidae. REFERENCE 1 (bases 1 to 2305) AUTHORS Lu,X., Gross,R.E., Bagchi,S. and Rubin,C.S. TITLE Cloning, structure, and expression of the gene for a novel regulatory subunit of cAMP-dependent protein kinase in Caenorhabditis elegans JOURNAL J. Biol. Chem. 265, 3293-3303 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 132 1262 cAMP-dependent protein kinase subunit R mRNA < 1 2305 cAMP-dependent protein kinase subunit R mRNA BASE COUNT 651 a 543 c 417 g 694 t ORIGIN 1 gcggcatccc ccgtcaaaag tctgttacct tccccgccaa gttttcatgt catttcaaac 61 gtgtgttttc tgctattttt ctcgcatttt tgtctctttt gtttgtttaa aacaagaaat 121 gactgaaaat tatgaacaat tattcaggtg acattgtttt tatgtcgggt ggaaacgaag 181 aggatcagtt ggcacaatgc caggcctatg tgcaacgaca caacattcag cagcttgtca 241 aagaagcaat tgttgttctg tgcatacaca aacccgacaa cccggttctc ttcttgaagg 301 atcattttga aaaactcaac gaacaacgag cacaggaagg aggaaatccg gacgctgctg 361 atgatgacga catcattgtg gagccaccaa agagatcagg tggacgcaga accggaatct 421 ctgccgaacc aatcaaggag gacgataccg agtacaagaa ggttgtgatt ccaaaggatg 481 atgctacaag aaggtcgctt gagtcagcca tgaggaagaa cttgctcttt gctcatttgg 541 aggaggatga gcaaaaaacc atgtacgatg ccatgttccc agttgagaag tctgctggag 601 agaccatcat cgaacaggga gaagaaggag ataacttcta tgttattgat aagggaaccg 661 ttgatgtcta cgtcaatcac gagtacgttc tcaccatcaa cgagggtgga tcatttggag 721 aacttgcttt gatctatgga actccacgtg ctgcaaccgt cattgccaag attgacgtga 781 aattgtgggc catcgaccgt ctcacctacc gtcgtatctt gatgggatca gtaacaaaga 841 agagaaagat gtatgacgag ttcctctcca aggttcaaat cttggccgat ctcgaccagt 901 gggagcgcgt taacgtggct gatgctcttg agagatgcga cttcgagcca ggaactcatg 961 tcgtcgagca aggacaaccc ggagacgaat tcttcatcat tctcgaggga gaagccaatg 1021 tgttgcagaa gagaagcgac gacgcaccat tcgacgttgt cggacatctt ggaatgtcgg 1081 actatttcgg agaaatcgct cttcttctcg accgcccacg tgccgccacc gttgttgcca 1141 aaacccatct caagtgcatc aagctggatc gtaaccgctt cgaacgtgtc atgggaccag 1201 ttcgtgagat cctgaagaga gacgtttcaa actacaactc atacgtcaaa ctgatgacct 1261 aatcggatta caatccgttt gccccgcatc ttcctcatca ccgtatcttc ttttccacct 1321 tttcgccaaa gtactcgtaa ctattttttg aaagtatccc aaaaattctt ccttagtttt 1381 tttttagtat tgtatttttt cctgtatttt tttggatttt ctattattct gcatactgtc 1441 tagacctgta accttcttct ctcaccaacc gtgacatctt caagggcccc ccccccaatc 1501 aattcgttcc ctatcttcat ttctagtcag tcgatgtcaa attaaattct aaatgtctaa 1561 atttctaact catagccaaa acaacattca ttgttttaac cagcccaaaa tcctagtcag 1621 aagtttgcag aaaactattt gaaactcctc gtgaaaaagc taaaaaaatg ttcaactaga 1681 atagttttag tttttattcc ctgtgtaaca aaaaaaacag ttttgcatgc ttttctatcc 1741 accctaaaat ttcaacattt tcagtagctt tacagctcaa tctttaccaa ccatttcctt 1801 caattcctat aggttaactc aaagaagaaa cgaacaccat tcttcattaa ttaacattaa 1861 gcagtgaatg ccaaatcttg tctttataat atgtaacata taattttatg agaaataaga 1921 aatgcgcatc cacaacaatc gtcccccccc cccctaccca gcacacattt ttttcttatt 1981 tgttccactt ctgaattctg aactccgtca cccttatcca gcattgcgaa gtctcttatc 2041 tacgccccct tccctcaaca tccaacttct tttttttttc atttttatat atttttcaca 2101 ttcattggaa atccgctgtc atgtcgccat ctcaatgttt tcttgtctct tttttttcat 2161 ttctccacca cattttccac tagatttctg ttatccaaaa tttttgtcgg tttcccccgc 2221 gcacacacac acaaaaaaca gtatgttata aactatctat aatgtaatct tagaggttta 2281 ttggactaat aaaatttgga agttg // LOCUS ECHOGL 1803 bp ds-DNA BCT 01-JUN-1990 DEFINITION E.chrysanthemi oligogalacturonate lysase (ogl) gene, complete cds. ACCESSION M33583 KEYWORDS oligogalacturonate lysase. SOURCE E.chrysanthemi DNA. ORGANISM Erwinia chrysanthemi Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1803) AUTHORS Reverchon,S., Huang,Y., Bourson,C. and Robert-Baudouy,J. TITLE Nucleotide sequences of the Erwinia chrysanthemi ogl and pelE genes negatively regulated by the kdgR gene product JOURNAL Gene 85, 125-134 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 271 1437 oligogalacturonate lysase (ogl) BASE COUNT 480 a 393 c 441 g 489 t ORIGIN 1 agtgaatcaa tacagccgga ggatatttat tctgacgttt ggtctggctg cttattgccg 61 aatctggatc aatatcaggc aaaataatct tgcgtcagcg caaacagtca gtgaaaaatg 121 aaagaatgtt ttataaaata aaaccacgat cacggaaaaa tgaaacgttg tttctacaat 181 accggtataa caaggcgatt ggccgggatg acgggtaatc tttcacctgg cggcgtctga 241 tttttcacga taagcgctaa ggatttacgg atggccaaag gtaaaaagct ttctttttcg 301 ttccatactt accaggattc agtcaccggc accgaagtgg tgcgtctcac tcctcccgat 361 gttatctgcc accgcaacta cttctatcag aagtgttttt ccaatgatgg cagcaagctg 421 ctttttggtg gcgcctttga cgggccgtgg aactactatt tgctggatct gaaaactcag 481 caggcgacgc aactgaccga aggtaccggc gacaatactt ttggtggttt tctgtcacca 541 gatgatgacg cgctttatta tgtaaagaac gttcgtaatt tgatgcgtgt tgacctgaat 601 acactggaag aaaccaatat ttatcaggtg ccggacgact gggtcgggta cggtacctgg 661 gttgccaact ccgactgcac caaaatggtc ggtatcgaga tcaagaaaga ggattggaaa 721 ccactgaccg actggaaaaa attccaggaa ttctacttta ccaatccatg ctgccgtttg 781 attcgtatcg atctgaaaac cggcgaagcc accaccattc tgaaggaaaa ccaatggctg 841 ggtcatccta tttaccgtcc gggtgacgat aatacggtgg ccttctgcca tgaaggtccg 901 catgacctgg ttgatgcgcg tatgtggttc atcaatgaag atggctccaa tatgcgtaag 961 gtaaaagagc atgcgccggg cgaaagctgc actcacgaat tctgggtgcc gaatggttct 1021 gcgctggcct acgtttccta tctgaaaggc agtactaacc gtttcatttg cagcgttgat 1081 ccggtaacgc tggaaaaccg tcagttgact gaaatgccgc cgtgttctca cctgatgagt 1141 aactacgatg gtacgttgat ggtgggagat gggtgtaatg cgccggtgga tgtgaaagat 1201 gacggtggct acaagactga aaacgatccg ttcctgtatg tgttcaatat gaagaccggg 1261 aaacatttcc aggtcgctca acacaacacc tcctgggaag tgctggaggg cgatcgtcag 1321 gtaacgcatc cacatccgtc ctttacgccg gatgacaagc acattctgtt tacgtctgat 1381 gtcgatggta agccggcgtt atatctggca aaagtgcctg attccgtctg gcaataactg 1441 tacctgcaac cgtgcccgcc agggcacgtt ttttcgcttt tacaataaga ttattaattc 1501 ggcagtgcta ttgtgatgga tgatattaat cggtctgtca cttatagtaa tgcccgattg 1561 ccatattcaa agcgttccct ttatactaaa acacgtgttc tatttttttt aaaacaaaaa 1621 atactgagta gggtaaccac aaaaatggct attgcagatt tagataaaca acccgattcc 1681 gtgtcatctg tcttaaaggt ctttggtatt ttgcaggcgt taggcgaaga gcgtgagatt 1741 ggcattactg aactttcaca gcgcgtgatg atgtctaaaa gtactgttta tcgtttcctg 1801 cag // LOCUS ECHPLE 1758 bp ds-DNA BCT 01-JUN-1990 DEFINITION E.chrysanthemi pectate lysase e isoenzyme (pelE) gene, complete cds. ACCESSION M33584 KEYWORDS pectate lysase e isoenzyme. SOURCE E.chrysanthemi DNA. ORGANISM Erwinia chrysanthemi Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1758) AUTHORS Reverchon,S., Huang,Y., Bourson,C. and Robert-Baudouy,J. TITLE Nucleotide sequences of the Erwinia chrysanthemi ogl and pelE genes negatively regulated by the kdgR gene product JOURNAL Gene 85, 125-134 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 251 1465 pectate lysase e (pelE) BASE COUNT 534 a 480 c 379 g 365 t ORIGIN 1 tttaaaactt catttttaat ttaatcataa aaaataaaaa accaagtaac acaaagttac 61 aaatacagtc aatagtttat tttttattaa taaaaaccat tgtcatcatc gtgacaaagt 121 tcacaaaata gacactcaac cgcataaaca ttcgccaaag caaaagatga aatggtattt 181 cgtttttaga cacacatgta acaaatggac accatttgga tcgctcactg agcacacaag 241 gaaaattgcc atgaacaact cacgtatgtc ttccgtttca acacagaaaa caacaggacg 301 ttctgccttg ggaaccaaaa gcgcgctggc cgcgattatc gccaccacca tgatggtctc 361 agtcgcttcc gccgccagtc tgcaaaccac caaagcgaca gaagcggcgt caaccggctg 421 ggcaacgcag agcggcggca ccaccggcgg cgccaaagcg tcatcatcca aaatctacgc 481 ggtgaaaagc atcagcgaat tcaaagcagc actgaacgga accgattcgt cgcccaagat 541 catccaggtc accggagcaa ttgatatcag cggcggtaaa gcctacaccg actttgacga 601 tcagaaagcc cgcagccaga tcagcattcc gtccaacacc accatcatcg gtattggcaa 661 caaaggcaaa ttcaccaacg gttcgctggt ggtgaaaggc gtcagcaacg ttatcctgcg 721 taacctgtac atcgaaacgc cggtggacgt ggcgccgcat tacgaagagg gagatggttg 781 gaacgccgag tgggacgcag tagtgattga cagcacagac cacgtctggg ttgaccatgt 841 caccatcagc gacggcagcc tcaccgacga caaatacacc accaaaaacg gcgaaaaata 901 tgtgcagcat gacggctcgc tggatatcaa gcgcggctcc gactacgtta ccgtttctaa 961 tagccgcttc gagctgcacg acaaaaccat cctgatcggc cacagcgaca acaacggctc 1021 gcaggacgcc ggcaaactgc gcgtcacctt ccacaacaac ctgtttgacc gggtcggcga 1081 acgtaccccg cgcgtacgct tcggcagcgt tcacgcttac aacaacgttt acgttggcga 1141 cgtcaaccac aaagcctatc gctatcagta cagcttcggt atcggcacca gcggtagcct 1201 gctgtctgag tccaacgcgt ttaccattga taacatgaaa aaaatcagcg gccgcgacaa 1261 agaatgcagc gtggtcaagg cgtttaacgg taagatcttc tctgataaag gctcgatcat 1321 caacggcgct tcgtacaatc tgaatggttg cggctttggc ttcagcgcat acagcgccaa 1381 gatcccgtac aaatactcag cccaaaccat caccaccagc ctggcaaaca gcatcagcag 1441 caacgcgggc tacggtaaac tgtaatttcc cgcccagcgt aaaggccact tgcaaaagtg 1501 gcttttattt tcccatattc ccaccactcg tattaccccg atgttctccg aagcgccact 1561 caatacggtt caggttaatc cggccttgcc acaggcgttc atcaagaaat cgaccggttt 1621 cccccatcaa atgcagggac tagcaggaat tagtaagaga cacagattaa taaaattcta 1681 tttaataaaa atcgaatcaa ttgttaactg tttgggttat ttcagataaa aacgcttata 1741 catatagctg aatttaaa // LOCUS ECOFIMA1 66 bp ds-DNA BCT 01-JUN-1990 DEFINITION E.coli type 1 fimbriae (fimA) gene with viral insertion site, partial cds. ACCESSION M33412 KEYWORDS type 1 fimbriae. SOURCE E.coli (strain HB101) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 66) AUTHORS Hedegaard,L. and Klemm,P. TITLE Type 1 fimbriae of Escherichia coli as carriers of heterologous antigenic sequences JOURNAL Gene 85, 115-124 (1989) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 66 type 1 fimbriae (AA at 1) recomb 3 4 E.coli DNA end/ viral DNA start recomb 63 64 viral DNA end/ E.coli DNA start BASE COUNT 12 a 19 c 17 g 18 t ORIGIN 1 gttgatcccc ggacctgcat gatcatcgct caaggaacct ctatgtatcc ctcctgttgg 61 ggggat // LOCUS ECOFIMA2 66 bp ds-DNA BCT 01-JUN-1990 DEFINITION E.coli type 1 fimbriae (fimA) gene and viral insertion site, partial cds. ACCESSION M33413 KEYWORDS type 1 fimbriae. SOURCE E.coli (strain HB101) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 66) AUTHORS Hedegaard,L. and Klemm,P. TITLE Type 1 fimbriae of Escherichia coli as carriers of heterologous antigenic sequences JOURNAL Gene 85, 115-124 (1989) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 66 type 1 fimbriae (AA at 1) recomb 3 4 E.coli DNA end/ viral DNA start recomb 62 63 viral DNA end/ E.coli DNA start BASE COUNT 14 a 17 c 17 g 18 t ORIGIN 1 gttgatccaa acctgcgtgg agatctccag gttctggctc agaaagttgc tcgtacctta 61 ccggat // LOCUS ECOSPDE 2792 bp ds-DNA BCT 01-JUN-1990 DEFINITION E.coli speED operon speE and speD genes encoding S-adenosyl- methionine decarboxylase and spermidine synthase, complete cds. ACCESSION J02804 J04247 KEYWORDS S-adenosylmethionine decarboxylase; speD gene; speE gene; spermidine synthase. SOURCE E.coli K12 DNA, clones pT7-5-1 and LC37-29. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 643 to 2524) AUTHORS Tabor,C.W. and Tabor,H. TITLE The speED operon of Escherichia coli: Formation and processing of a proenzyme form of S-adenosylmethionine decarboxylase JOURNAL J. Biol. Chem. 262, 16037-16040 (1987) STANDARD full staff_review REFERENCE 2 (bases 1 to 642; 2525 to 2792) AUTHORS Xie,Q.-W., Tabor,C.W. and Tabor,H. TITLE Spermidine biosynthesis in Escherichia coli the promoter and the termination regions of the speED operon JOURNAL J. Bacteriol. 171, 4457-4465 (1989) STANDARD full staff_review COMMENT Draft entry and printed copy of sequence for [1] kindly submitted by H.Tabor, 02-DEC-1987 and for [2] 13-MAR-1990. FEATURES from to/span description pept 314 661 unidentified protein pept 767 1633 spermidine synthase /nomgen="speE" pept 1649 2443 S-adenosylmethionine decarboxylase (ttg start codon) /nomgen="speD" signal 2448 2482 hairpin terminator BASE COUNT 714 a 738 c 637 g 703 t ORIGIN 1 aactgaatgc gattacgggc atcggtcgtg agcaaatcag ggatcggtaa cgttgggcgt 61 tctgccgcaa atactgcgcg gctccacagc ggcaaagcga agccacaccc agcgcgacgg 121 aatattttaa gaaatcacga cgttgcatag ttatttcctt attcttaagc aggcgaaaca 181 aaatcaaacg ttgagcatag tccttcccct tacgggaagg tcaagccgca gacacaattt 241 aatcgacaat aatggtcgtc gcctcgctcg cagtgtgcta acgtttatct tctttaagcc 301 cctggtagaa gcaatgaaga cgtttttcag aacagtgtta ttcggcagcc tgatggccgt 361 ctgcgcaaac agttacgcgc tcagcgagtc tgaagccgaa gatatggccg atttaacggc 421 agtttttgtc tttctgaaga acgattgtgg ttaccagaac ttacctaacg ggcaaattcg 481 tcgcgcactg gtctttttcg ctcagcaaaa ccagtgggac ctcagtaatt acgacacctt 541 cgacatgaaa gccctcggtg aagacagcta ccgcgatctc agcggcattg gcattcccgt 601 cgctaaaaaa tgcaaagccc tggcccgcga ttccttaagc ctgcttgcct acgtcaaata 661 atccctgata cctttttgta gaaatattga ccgtgcatcc gcggtcaatg ttagctatta 721 tgttgcgccc tttttttacg ggtgttaaca aaggaggtat caacccatgg ccgaaaaaaa 781 acagtggcat gaaacgctac acgaccagtt tgggcagtac tttgcggtag ataacgttct 841 gtatcatgaa aagaccgatc accaggatct gatcattttt gagaacgctg catttggtcg 901 cgtaatggcg ctggatggcg tagtacaaac caccgagcgc gacgagttta tctatcatga 961 gatgatgacc catgttccgc tactggccca tggtcacgcg aaacatgtgc tgattatcgg 1021 cggcggcgac ggtgccatgc tgcgtgaagt aacccgacat aaaaacgttg agtcaatcac 1081 gatggtggaa atcgatgcgg gtgtcgtatc gttctgccgt cagtatctac ccaaccataa 1141 cgccggtagc tacgacgatc cgcgctttaa gctggtgatc gacgatggcg tcaatttcgt 1201 taatcaaacc agccagacct ttgatgtcat tatctccgac tgcaccgatc ctatcggtcc 1261 cggcgaaagc cttttcactt cggcatttta tgaaggctgc aaacgttgcc tgaatcctgg 1321 cggtatcttc gtcgcacaaa acggcgtctg ctttttacag caggaagaag ccatcgacag 1381 ccatcgcaaa ctcagccatt acttcagcga cgttggcttt tatcaggcgg cgatcccgac 1441 ctattacggc ggtatcatga cttttgcatg ggcgacagat aacgacgcct tacgccatct 1501 ctcaaccgaa attattcagg cgcgttttct cgcctctggc ctgaaatgcc gttattacaa 1561 tccggcaatc catacggcag cttttgcctt acctcagtat ctgcaagacg cactggcttc 1621 acagccgtcc taaggagaag ataagaaatt gaaaaaactg aaactgcatg gctttaataa 1681 tctgaccaaa agtctgagtt tttgtattta cgatatctgc tacgccaaaa ctgccgaaga 1741 gcgcgacggt tatattgctt atatcgatga actctataat gccaaccgtc tgaccgaaat 1801 cctgtcagaa acctgttcca ttatcggggc taatattctt aacatcgccc gccaggatta 1861 cgaaccacag ggtgccagcg tcactattct ggtgagtgaa gaaccggttg acccgaaact 1921 catcgacaaa acagaacacc ccggcccact gccagaaacg gtcgttgccc atcttgataa 1981 aagtcatatt tgcgtacata cctacccgga aagtcatcct gaaggcggtt tatgtacctt 2041 ccgcgccgat attgaagtct ctacctgcgg cgtgatttct ccgctgaagg cgctgaatta 2101 cctgatccac cagcttgagt ccgatatcgt aaccattgat tatcgcgtgc gcggttttac 2161 ccgcgacatt aacggtatga agcactttat cgaccatgag attaattcga ttcagaactt 2221 tatgtctgac gatatgaagg cgctgtatga catggtggat gtgaacgtct atcaggaaaa 2281 tatcttccat accaagatgt tgcttaaaga gttcgacctt aagcactaca tgttccacac 2341 caaaccggaa gacttaaccg acagcgagcg ccaggaaatt accgctgcgc tgtggaaaga 2401 aatgcgcgag atttattacg ggcgcaatat gccagctgtt taacggctct ggcggagctc 2461 ccaggctccg ccagatttat ttacttctgc tgcacgaaat tgcggtaagc cgccacgacc 2521 tgcagaaaat cctcaacgcc gcatagcgac aggctttctt cgtcgtagta gttcatcccc 2581 tcttccattt catcgccagc gaattccagt tgattggcgc gaaccatcac ctcttcaccg 2641 tccatccaca gggtgtattc atgccctgcc cgttgccagg accgttcgct acctttcagt 2701 gcgtgcgctg cttgttccac ttcatcaagc aaggccaggt tttctttcac ctcttcatta 2761 aaccagtgcc cgaccacttc atgccccatg ga // LOCUS HUMBTRO 608 bp ds-DNA PRI 01-JUN-1990 DEFINITION Human beta-tropomyosin gene, exon 1. ACCESSION J05247 KEYWORDS beta-tropomyosin. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 608) AUTHORS Libri,D., Mouly,V., Lemonnier,M. and Fiszman,M.Y. TITLE A nonmuscle tropomyosin is encoded by the smooth/skeletal beta- tropomyosin gene and its RNA is transcribed from an internal promoter JOURNAL J. Biol. Chem. 265, 3471-3473 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 441 / 572 beta-tropomyosin IVS 573 > 608 beta-tropomyosin intron A BASE COUNT 102 a 198 c 223 g 83 t 2 others ORIGIN 1 gtctctgcag cagatgtgtg gggactctgg agagcacccc tggatacact ccatgcagag 61 ctgctcacag cctctgcggc cccttttcct ctcaggaaaa taaacacccc gaccgccttc 121 ccctctgcct gctgggctcg ctgcggcatg cagccctcct ggnntctgtg ccctgctttg 181 gggctgtgcg cccatgctgg ggtcgacagc gtgcctccac ggggcagcca tgagccttat 241 cgggggacga cgtcgctcag cgcagggagc gcggcgggag cgggtcggtg tgcggcagga 301 ggaggaggag gaggaggagc aggaagggat ggaggcaggg cgggagcggc cgccgcctga 361 gcgcttcctg cccgagccag gcggatccca caaagggctc agcggagcgg ccttgctgcg 421 ctccccgccg gccgcaggcc atggccggca tcagctccat cgatgccgtc aagaagaaga 481 tccagagcct gcagcaggtg gctgacgagg cggaggaacg cgccgagcac ctgcagcggg 541 aggccgatgc cgagcggcag gcccgggagc gggtaaggct cctgggggca ccccgcgacc 601 cccagacc // LOCUS HUMGPIIB1 2238 bp ds-DNA PRI 01-JUN-1990 DEFINITION Human platelet glycoprotein IIb (GPIIb) gene, exon 1. ACCESSION M33319 KEYWORDS platelet glycoprotein IIb. SEGMENT 1 of 3 SOURCE Human leukocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2238) AUTHORS Heidenreich,R., Eisman,R., Surrey,S., Delgrosso,K., Bennett,J.S., Schwartz,E. and Poncz,M. TITLE Organization of the gene for platelet glycoprotein IIb JOURNAL Biochemistry 29, 1232-1244 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 1286 + 1473 platelet glycoprotein IIb (GPIIb), exon 1 pre-msg 1254 > 2238 GPIIb mRNA and introns IVS 1474 > 2238 GPIIb intron A BASE COUNT 596 a 554 c 595 g 493 t ORIGIN 1 gtcaacggat cagaaaatag aaatcaaaag gaaaatgtgg ctatggttac ccctagcgga 61 cctcttaaat cttcctgaga acctgctttt ttgggaaggc atgagtgcca gtaagacttg 121 gcactcctcc tcttccgctt accgagagaa aatgactttg cctttctgct caaaactcat 181 cccttcactt tgtcacccta tgtttgcatc ttccatcctt agtgtgtgtt tccatccatc 241 cagtctttca gcaatacacg tactacacat tggactcttg ggtagtctct agggctgtag 301 caaggagcct tgctcccaag ggactcattt acacaatcct gtgaacggac caagagtaaa 361 cagtgtgctc aatgctgtgc ctacgtgtgt tagcccacgc ggccagcctg aggagtcagg 421 gaaggctccc ctaggcaaag cccccaacca gaatcaagtc ttaatggtta aagagctcca 481 tcacccaaaa aggattgagg gcctaccttc aactgaacag ctaatgcata atctcagaaa 541 ctgtgagtca aaattccctg gaataactcc actttatccc caatctcctt gccacctaga 601 ccaaggtcca ttcaccaccc tgtccccagc actgactgca ctgctgtggc cacactaaag 661 cttggctcaa gacggaggag gagtgaggaa gctgctgcac caatatggct ggttgaggcc 721 gcccaaggtc ctagaaggag gaagtgggta aatgccatat ccaaaaagat acagaagcct 781 caggttttat cgggggcagc agcttccttc tccttccccg acctgtggcc aagtcacaaa 841 gcaccacagc tgtacagcca gatgggggaa gggaggagat tagaactgta ggctagagta 901 gacaagtatg gaccagttca caatcacgct atcccaagca gaaagtgatg gtggcttgga 961 ctagcacggt ggtagtagag atggggtaaa gattcaagag acatcattga taggcagaac 1021 caataggaca tggtaataaa ctattctcag gaaaggggag gagtcatggc tttcagccat 1081 gagcatccac cctctgggtg gcctcaccca cttcctggca attctagcca ccatgagtcc 1141 aggggctata gccctttgct ctgcccgttg ctcagcaagt tacttggggt tccagtttga 1201 taagaaaaga cttcctgtgg aggaatctga agggaaggag gaggagctgg cccattcctg 1261 cctgggaggt tgtggaagaa ggaagatggc cagagctttg tgtccactgc aagccctctg 1321 gcttctggag tgggtgctgc tgctcttggg accttgtgct gcccctccag cctgggcctt 1381 gaacctggac ccagtgcagc tcaccttcta tgcaggcccc aatggcagcc agtttggatt 1441 ttcactggac ttccacaagg acagccatgg gaggtgagcc gtaagggaag ttggggtatt 1501 gggagagagc aggacccctc cccatcactg cttctggggg cttcgagttt cccatttgcg 1561 atagcagttg agcaaggtga cttgtggggc ctattcaggt tgatttcttg tcaagaatgt 1621 tggggtccag gggactggct caggtgaagg tataagggca gggcacatgt gggctgatgg 1681 gcactgaaaa ctacagcaag aacaaaggga agacaagagt tgatgcttta ttttttcccc 1741 aagggtcagt tgtatgaacc actccaccct caacaccttg aaatgcagag aggaggccgg 1801 gcgcggtggc tcatgcctgt aatcccagca ctttgggagg ccgaggcggg cagatcacct 1861 gaggtcgaga attcgagacc agcctgacca acatggagaa accccgtctc tactaaaaat 1921 acaaaaaaaa aaaaaaaaaa aagaggccag gcacagtggc tcacacctgc aatcccagca 1981 ctttgggagg cagaggtggg cagatcatga ggtcaggagt tcaagaccag cctggccaat 2041 atggtgaaac cctgtctcta ttaaaaatac aaaaattagc tgagcatggt ggcacactcc 2101 tgtagtccca gctactcggg aggctgaggt aggagaatca cttgaacccg ggaggtggag 2161 gttgcagtga gctgagacta tgccactgca ctcccagccc tggggttgac agagtgacac 2221 tccgtctcaa aaaaaaaa // LOCUS HUMGPIIB2 13204 bp ds-DNA PRI 01-JUN-1990 DEFINITION Human platelet Glycoprotein IIb (GPIIb) gene, exons 2-29. ACCESSION M33320 KEYWORDS platelet Glycoprotein IIb. SEGMENT 2 of 3 SOURCE Human leukocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 13204) AUTHORS Heidenreich,R., Eisman,R., Surrey,S., Delgrosso,K., Bennett,J.S., Schwartz,E. and Poncz,M. TITLE Organization of the gene for platelet glycoprotein IIb JOURNAL Biochemistry 29, 1232-1244 (1990) STANDARD simple staff_entry FEATURES from to/span description pept + 498 619 platelet Glycoprotein IIb (GPIIb), exon 2 709 806 platelet Glycoprotein IIb, exon 3 912 1077 platelet Glycoprotein IIb, exon 4 1293 1342 platelet Glycoprotein IIb, exon 5 1419 1464 platelet Glycoprotein IIb, exon 6 1552 1680 platelet Glycoprotein IIb, exon 7 2042 2089 platelet Glycoprotein IIb, exon 8 2245 2288 platelet Glycoprotein IIb, exon 9 2461 2514 platelet Glycoprotein IIb, exon 10 2653 2705 platelet Glycoprotein IIb, exon 11 2897 3108 platelet Glycoprotein IIb, exon 12 5536 5718 platelet Glycoprotein IIb, exon 13 5952 5997 platelet Glycoprotein IIb, exon 14 6106 6210 platelet Glycoprotein IIb, exon 15 6295 6350 platelet Glycoprotein IIb, exon 16 6443 6594 platelet Glycoprotein IIb, exon 17 6783 6908 platelet Glycoprotein IIb, exon 18 7886 7953 platelet Glycoprotein IIb, exon 19 8087 8234 platelet Glycoprotein IIb, exon 20 8803 8895 platelet Glycoprotein IIb, exon 21 9506 9585 platelet Glycoprotein IIb, exon 22 10202 10282 platelet Glycoprotein IIb, exon 23 10406 10505 platelet Glycoprotein IIb, exon 24 10605 10757 platelet Glycoprotein IIb, exon 25 10874 10999 platelet Glycoprotein IIb, exon 26 11478 11591 platelet Glycoprotein IIb, exon 27 11828 11929 platelet Glycoprotein IIb, exon 28 12117 + 12233 platelet Glycoprotein IIb, exon 29 pre-msg < 1 > 13204 GPIIb mRNA and introns IVS < 1 497 GPIIb intron A IVS 620 708 GPIIb intron B IVS 807 911 GPIIb intron C IVS 1078 1292 GPIIb intron D IVS 1343 1418 GPIIb intron E (no splice consensus) IVS 1465 1551 GPIIb intron F IVS 1681 2041 GPIIb intron G IVS 2090 2244 GPIIb intron H (no splice consensus) IVS 2289 2460 GPIIb intron I IVS 2515 2652 GPIIb intron J IVS 2706 2896 GPIIb intron K IVS 3109 5535 GPIIb intron L IVS 5719 5951 GPIIb intron M IVS 5998 6105 GPIIb intron N IVS 6211 6294 GPIIb intron O IVS 6351 6442 GPIIb intron P IVS 6595 6782 GPIIb intron Q IVS 6909 7885 GPIIb intron R IVS 7954 8086 GPIIb intron S IVS 8235 8802 GPIIb intron T IVS 8896 9505 GPIIb intron U IVS 9586 10201 GPIIb intron V IVS 10283 10405 GPIIb intron W IVS 10506 10604 GPIIb intron X IVS 10758 10873 GPIIb intron Y IVS 11000 11477 GPIIb intron Z IVS 11592 11827 GPIIb intron AA IVS 11930 12116 GPIIb intron BB IVS 12235 > 13204 GPIIb intron CC BASE COUNT 3046 a 3579 c 3857 g 2722 t ORIGIN About 2000 bp after segment 1. 1 ctgcaggtca acggatctgc tagggtcctc ctatcagcac acacactcca gccccacttt 61 agaggtaccc gctaccttcc ctcattaaaa ccagctctca agaggggatc tggtaacagt 121 ctaggcaggc attccaggga gcatgtgaac cgctggttct tgttgcgggt ggaggatgga 181 ggtgttgtac agagtttagg tctttttcag caaagatctc caaaccccgg gtgttcaaaa 241 tcaaaccaaa ggggattata gtcccagctc tactcacaac tcactggtta ctttagccac 301 gagattgccc tcgctgagag tcggtttcac tgtccataag atgaagaagt acatcacggt 361 ggtctgtgag gtgtcattga ggaaagatgg tccagtgccc ccatgccaca tggccttcgg 421 gcagtgctcc cagcgccggc gccagggcct gggatacgct ggaatctgcg cggcgctcac 481 ccagctttcc tatgcagagt ggccatcgtg gtgggcgccc cgcggaccct gggccccagc 541 caggaggaga cgggcggcgt gttcctgtgc ccctggaggg ccgagggcgg ccagtgcccc 601 tcgctgctct ttgacctccg tgagtcccag gcaaggagag caaggttggg gtcagaggga 661 cgtggactgc ccgggcttca gcgccccacc ccttcttgtg ccttccaggt gatgagaccc 721 gaaatgtagg ctcccaaact ttacaaacct tcaaggcccg ccaaggactg ggggcgtcgg 781 tcgtcagctg gagcgacgtc attgtggtgg gccccgcggt acagggcaca gggaacaatc 841 gggggcaggg acactggggc caggaggagc ccaagtctcg cgccccgtcc ccatctgtgg 901 ccctttctca ggcctgcgcc ccctggcagc actggaacgt cctagaaaag actgaggagg 961 ctgagaagac gcccgtaggt agctgctttt tggctcagcc agagagcggc cgccgcgccg 1021 agtactcccc ctgtcgcggg aacaccctga gccgcattta cgtggaaaat gattttagta 1081 agcgccagct acgacctggc cccgcccact cgcgacggct tggccccgcc ccccatcgga 1141 tcccgccccc agcgccgcag cccttgcttt ggatctggcc tcgccccagg gccccgccga 1201 ctcaaggccc cgcccctgtc ccccagccct cctccgggct cgcgcgcgcc tcccttcacc 1261 cctgggctga cccctcctcc ttgtctcctc aggctgggac aagcgttact gtgaagcggg 1321 cttcagctcc gtggtcactc aggcgagtag ggagcaaaag cgcagtgggg gcggctccca 1381 aacagggccc cctctcaccc tcaggacttc ccttccaggc cggagagctg gtgcttgggg 1441 ctcctggcgg ctattatttc ttaggtacgt gcccatccgt acacctccct cccttctcgc 1501 ggccgaagga gaccgctttg ggcttcacac ccgctgtccc tcccgcccta ggtctcctgg 1561 cccaggctcc agttgcggat attttctcga gttaccgccc aggcatcctt ttgtggcacg 1621 tgtcctccca gagcctctcc tttgactcca gcaacccaga gtacttcgac ggctactggg 1681 gtaacaccgc cattccagac ttccagcacc ccgagggtca ccgcccaccg cagacggtca 1741 ggtcctgccc ctgtgggagc ctccatggcc acccctgccg gccaacccac cgcctaagcc 1801 gctcccgccc tccgctcctg cgcttccccg cagaccgccc acctcccatg cgcccaccgc 1861 tcccttccac tgcggactcg tagcgcagcc tggggcaggg cttggcccct cgaaggcctc 1921 cgtttttcca tctgcacaat gcagggctgg ggctgagtgg ccttaatctc ctccttcttt 1981 gccctccgtc ccctctgtgc ttcctcccct ggaaaagact aatttgcgcc cttgtcctca 2041 gggtactcgg tggccgtggg cgagttcgac ggggatctca acactacagg caagaaatcc 2101 acttagggcg ggagttgggt agcccagccc ggggaggagc gccttcctga aatctcccct 2161 atgtagctgg gtgcagaacg gggagcggga agtgggtagg ttctaaggct ctcattccct 2221 gagcctggct ctccctatcg ccagaatatg tcgtcgtgcc ccccacttgg agctggaccc 2281 tgggagcggt aagtgccccc accactgggc ctcccgaagc cccttatccc agttctcagg 2341 ctgacaactc ctgagcgccc cccacccccg ccccgcctcc accaaaccac cctttctcac 2401 ctggagtggg aggttgcttt gggtacaaga atgatgctct cgcctgcgct gtccgtgcag 2461 gtggaaattt tggattccta ctaccagagg ctgcatcggc tgcgcggaga gcaggtgggg 2521 gccaggtccc agtgggcgtg gctgggtgga gggggaactg agacttcaga atatttcatg 2581 ggaggtgagg gcccatttct taaagaggat gcttgtccag cggcgtgaat gatggtgctc 2641 ctcatcttgc agatggcgtc gtattttggg cattcagtgg ctgtcactga cgtcaacggg 2701 gatgggtgag gagggacatg cccccacccc tacccagttg ggtcccaaat taccagagct 2761 gcccctctgt ctccctttcc tagccctagt ctcacgtatc cactggagga acaggagagc 2821 aagggtcgag gagatttggc cctagcccca atatacccct ggtccagtcc catgtaacca 2881 ctcatctggc ccacaggagg catgatctgc tggtgggcgc tccactgtat atggagagcc 2941 gggcagaccg aaaactggcc gaagtggggc gtgtgtattt gttcctgcag ccgcgaggcc 3001 cccacgcgct gggtgccccc agcctcctgc tgactggcac acagctctat gggcgattcg 3061 gctctgccat cgcacccctg ggcgacctcg accgggatgg ctacaatggt gagggaagag 3121 aggagcccta cttgctgcag aggggttaac agccactcaa aaagcatgga gttggcctga 3181 gggcagccag aaccaggatg ggttttaagc atataagtat gtggcttaga cacatggggt 3241 gctgagtgga gagcagatgg gagagttgaa gactaattag gaagtgtttg ccttaatcca 3301 agcaagagac aatgaccacc tggatgtgga ttttggcagt ggagttagag atgggagtga 3361 cttcacagat atttaggact cggattatta ggacttggtg ggagactgga tgtggggcca 3421 ggggagaggt tggagttggg tgcctgtgat ggcctccact gcctggaact caggccgtgc 3481 agcaggtgct ggggagaggc gggagatcag cagttcagct ctggacctgt tgagcttgaa 3541 gggcttgggt gctttaggcg gaaatatcca aagaacagtt gggagtggct ctccccgctt 3601 ccacaagaga gatctgaatg ggagacaggg gtttggggaa agtggatgag gtcccgggac 3661 ctgtgaaata agaggcccag gatagagccc tagggagcaa aagcatttag gtgactccta 3721 caggaggtaa gtctgagaag gagacagagg agtgtccaga gagggaggag ggaacccagg 3781 gggtctgatg gcccgggact caaggaagag catgcgttaa agagcatgca caggaggaag 3841 tgggcgctgc agctcctgct gctgctgcaa gatacaatta ggtggggctg gagaaatatt 3901 catgggcttt agcaagaaga gggtgccagg catggtggct catacctgta atcccagcta 3961 cttgggaaat tgaagcagga gaatctcttg aacccgggaa gtggaggttg cactgagctg 4021 agcttgcgcc actactgcac tccagcctgg gtgacagagc aagactccat ctcaacaaaa 4081 taaaaaaaaa aatagagaaa gaaaggaaga aagaaaaaag aaggggaggt tattggtgac 4141 agtgacataa attgattcag gccaagatag ggtcagaagc cagaatgcaa tggggtaagg 4201 tatgaatgga gatgaaaaat tggatgcagc taatgtagac agctctttca acaggtttgt 4261 ggtaaaaagg aatttgagga atagaaagga aaaaaaaaaa catgtttgac tataagagga 4321 aaaagagaaa aggtgatcac agaaaagaga tgagggtcaa gggaagatta tttcaatgtg 4381 gaagaacatg tagtaggttg aaaatgatgt tgtggggaaa tggggggatg agccagcaga 4441 gagtccctgt gatgcctcag ggggtgggag ggtgactggc ccagtgtcag ggtgaaggaa 4501 ggaaacctct tccagggtca aatggggaaa gggaaaaaga aagttggtgt gggattatag 4561 cataacagtg ggctgcctct cttcctgaag taagagatta cgtcacctgc tgaaggaagt 4621 gtggggggtc tgggagtttg atggaatgga gaaggctaga aatagatgct agatggccag 4681 gcacggtggc tcacacctgg aatcccagca ctttgggagg ccgaggcagg aggatcactg 4741 gagcctagga gtttgacacc agcctggcca acatagggag atctcgtctc cataaaaatt 4801 tttaaaaatt agctgggcat ggtggctata gtctcaactg cttgggaagc tgaggtggga 4861 ggattgcttt agtccagaag gttgaggctg cagtaagcca tggttgcacc actgcacttc 4921 agcctgaatg acaagtgcaa gactgtctta aaataaaaaa tttaaagggc ttgggcacgg 4981 tggctcacac ctgtaatcca gcactttggg agcccaaggt gggcagatca cttgaggtca 5041 ggagttcgag atcagcctgg ccaatgtggt gaaaccccgt ctctactgaa aatacaaaaa 5101 ttagccgggc atggtggtag gcgcctgtaa tcccagctac tgaagaggct gaggcacaag 5161 aatcacttta acgggggagg cagaggttgc agtgagccga gatcgcacca ctgcactcca 5221 gccaggacaa cagagcgaga ctccatctca aaaaaaaaaa aatttagaaa agggaataat 5281 gatgcttaat tttcaggata tattttcctc aatagacagt gagagttgtc actgttttta 5341 taacaatcct acttggcagg tccctctccc acctgattgt taactcctgg agggtagggc 5401 agtgcctcct tcacccacac tttgcacccc tttcctagtc tcctgggatg ttcccagaga 5461 agctcaggaa agttttacag tcatctaggg aggctgaata acaatcagcc acttcctttc 5521 tgttactcct tccagacatt gcagtggctg ccccctacgg gggtcccagt ggccggggcc 5581 aagtgctggt gttcctgggt cagagtgagg ggctgaggtc acgtccctcc caggtcctgg 5641 acagcccctt ccccacaggc tctgcctttg gcttctccct tcgaggtgcc gtagacatcg 5701 atgacaacgg atacccaggt gccctggact gcctccagct agaaatgccc aagaaaggcc 5761 cttggacatt cgctggaagt gccaagagac acggccaggg ctcatgcctg gcctggtgtc 5821 ccactatgga ctgccagagg ggctgggtga aacctccagt gggggaggtg gtgtggggaa 5881 cccctgggaa gatgagatga ggatccccat accctaatcg ccaattctga cccattcctc 5941 gatgtctata gacctgatcg tgggagctta cggggccaac caggtggctg tgtacaggtg 6001 agcactggct ccaggggcgg gatggggaag gtcctgtgcc atcaagagga ggccaggcca 6061 ggaggagcca caatggcaag cctccccatc accctatccc atcagagctc agccagtggt 6121 gaaggcctct gtccagctac tggtgcaaga ttcactgaat cctgctgtga agagctgtgt 6181 cctacctcag accaagacac ccgtgagctg gtgaggaggc agagggcatg ggccttaaag 6241 gatctgggac ctcagaaagg ctccaacccc tgagccccac ttacgtcttt gcagcttcaa 6301 catccagatg tgtgttggag ccactgggca caacattcct cagaagctat gtgagtggca 6361 tgaagggggc aggagggagg tgggcttgga ctcccccgga ggctggccag ggaggtcctg 6421 actcttctgc ttgccctgcc agccctaaat gccgagctgc agctggaccg gcagaagccc 6481 cgccagggcc ggcgggtgct gctgctgggc tctcaacagg caggcaccac cctgaacctg 6541 gatctgggcg gaaagcacag ccccatctgc cacaccacca tggccttcct tcgagtacgc 6601 ccaggcaggg gattggcagg gctgggagag tagaacttac ccactggact tgttcatcta 6661 gccctggggc actgagctgg gtgctgtgag tccgggggtg gtcaggacac aggtgcctac 6721 tggccaggag aaggtgggat gtgtatggta gcaagatggc ctgactcttg cccctgtcct 6781 aggatgaggc agacttccgg gacaagctga gccccattgt gctcagcctc aatgtgtccc 6841 taccgcccac ggaggctgga atggcccctg ctgtcgtgct gcatggagac acccatgtgc 6901 aggagcaggt agggacaggc agggacaggc cagggaggtg caggacccct gatagcaaat 6961 caggattagg gttagtgcca agtcacaatg taaccccaaa accttgatgt cattccaaac 7021 cctaatgaaa acctcaaaat ccagccagtc atggtggctc acacctgtaa tcccagcact 7081 ttgggagacc gaggcaggca gattgcctga ggtcaggagt tagagaccaa cctggccaac 7141 atggtgaaaa cccatctcta ctaaaaatac aaaaaaaatt agccgggtgt ggtgacgcat 7201 gcctgtaatt ccagctactc gggaggctga agcaggagaa tcacttgaac ccaggaggca 7261 gaggttgcag tgagccaaga gtgtgccaca gcactccagc ctgggtgaca gagcaagact 7321 ctgtctcaaa aaaaaaaaaa aaagccaggc gcagtggcct cacgcctgta atcccagcac 7381 tttgggaggc caaggcgggt ggatcacgag gtcaggagat caagaccatc ctggctaaca 7441 cagtgaaacc ccgtctacta aaaatacaaa aaaaaaaaaa aaattagctg ggcgtggtgg 7501 cgggtacctg tagtcccagc tacttgggag gctgaggcag gagaatggcg tgaaccccgg 7561 gggcggacgt tgcagtgagc cgagatagtg ccactgcact ccagcctgga cgacagagcg 7621 agactccgtc tccaaaaata aaaaaacacc tgaaaatccc agtatcccct aagctctgat 7681 gtaaattgac aaaccctgac attgtcccaa acctccaaat ataacccgag ccccgatacc 7741 atctacaaac tccttttcgt cctcagatct tcttactccc taagccccta tgtgaacccc 7801 aagcccactg ttttcctaac cctgatgtaa tccctaaacc tcacacatcc ccaacttacc 7861 cgcacacccc aatgtgcccc tctagacacg aatcgtcctg gactgtgggg aagatgacgt 7921 atgtgtgccc cagcttcagc tcactgccag cgtgtgagga ggcctcccat tctgcccgac 7981 cctggccctt tctgcctatc atacctgctc cacaccttag tcccctcttt tcccacatcc 8041 tgggcccaga cccaggctcc ctggcttcac tcctctttcc ccacaggacg ggctccccgc 8101 tcctagttgg ggcagataat gtcctggagc tgcagatgga cgcagccaac gagggcgagg 8161 gggcctatga agcagagctg gccgtgcacc tgccccaggg cgcccactac atgcgggccc 8221 taagcaatgt cgaggtatgg cccccaccct gggaacagta cccgggacct gggaggcact 8281 ggagccttgg ctctctcatc tccctccctg agagtccctc ttctcttctg ctttgctgtc 8341 aaagatgtaa tttttttttt aatttggagg aggatacttg ctaatggtca gtcagaattc 8401 caaaactcta ttacaaaaac cagaaaaaca aaaaaggttt aggaaccaaa tgttaacagg 8461 aacctctgtt aacatttggt ggatttcctt ccagtctttt tttcaatatt gactcacact 8521 cacataagta tatatttatt ttttatgttg ttaatatagt ttataataat gggggtcata 8581 ctctaatgtt ttgtgttttt tatttccaaa atgaaaatgc ctaaaaagta gtagtgctac 8641 agcaatacac acactagcat gtgacagtcc cttgagcgac cccaccccaa gaaacccccc 8701 cctccctacc ttggcacaca aatctttcca gaccttccaa gggagcttaa atatatatat 8761 atgatgctct gtaatttctt tcttggaact gccttcctga agggctttga gagactcatc 8821 tgtaatcaga agaaggagaa tgagaccagg gtggtgctgt gtgagctggg caaccccatg 8881 aagaagaacg cccaggtgag gctgctgggt cgtggtaccg ggtctccacc aggggctcat 8941 gaataaccag attttagggg tgaggtttta gagccacata gttctgggcc agaatcttgg 9001 tcctcacact ccctttgcca acattgtcct tgggtgagtg actttccctc tctgagcccc 9061 tttaccagtg ggcttccagg taaaatagaa ataataatgg tggcctggtg cggtcgtcac 9121 gcctgtaatc ccagcactct gggaggccag agcgggtgga tcacgaggtc aggagttcaa 9181 gaccagcctg gccaacatag caaaaccccg tctctactaa aaatacaaaa attacccggg 9241 catggtggcg cacgcctata gtcagagcta ctcgggaggt tgaggcagaa aaatcacttg 9301 aacctgggag gtggaggttg cagtgagccg agatcatgcc actgcactcc agcctgggtg 9361 acagagtgag actccgtctc ggaaaaaaaa aaaaagaaaa agaatagtgg tgatcttgga 9421 gggtgaagac tggaggccac attcagggca gggctgtcct aagtggggca cttgggcagt 9481 gaccttggcc ctcctcatct cccagatagg aatcgcgatg ttggtgagcg tggggaatct 9541 ggaagaggct ggggagtctg tgtccttcca gctgcagata cggaggtact gacctggcga 9601 gcgtgcctac ccaccaccct tcccccgtct gacccccgtg cagagcccct caggtccctt 9661 ccatacagaa gggtctttcg aggccaggcg cagtggctca cacctgtaat cccagcacgt 9721 tgcgaggcca aggcagaagg atcactggag gtcaggagtt ggagaccagc ctggccaaca 9781 tggtgaaacc ccatctctac taaaatataa aattagctgg gcatggtggt gcgcacctac 9841 aatcccagct actcgggagg ctgaggcagg agaatagctt gaaccgaacc tgggaggtgg 9901 aggttgcagt gagctgagat tgggccactg cactccagcc ttccagcctg ggcgacagtg 9961 cgagattcta tctcaaaaga aaaaaaaaaa aaggtcttga agaagcctgg ttccctttct 10021 tcctcagaga tttagcgagt cttggagccc tagaggaagt tctttcccag gtctaacttc 10081 agtgtggcat gctctttgta taattagctc tctctgaact ctctaaaatt ctggcctcac 10141 ccccagaaag tcactgggct ggtgtccctg gccctgtttc tcctcatccc ctcccctcta 10201 gcaagaacag ccagaatcca aacagcaaga ttgtgctgct ggacgtgccg gtccgggcag 10261 aggcccaagt ggagctgcga gggtgagagg ccaggggtgg agaagggaga tggcattcag 10321 ggctctaaac tccagggggc gctggggaaa cctcacaggc caatcagggc atcacactct 10381 ctctgggggt cttgggcacc tgcaggaact cctttccagc ctccctggtg gtggcagcag 10441 aagaaggtga gagggagcag aacagcttgg acagctgggg acccaaagtg gagcacacct 10501 atgaggtatt ggggagcctc gcgtccctgg ctggggtgag cgggtcctca gaactccggg 10561 tgaggcgcta agctccccac accctgccac caccacccct tcagctccac aacaatggcc 10621 ctgggactgt gaatggtctt cacctcagca tccaccttcc gggacagtcc cagccctccg 10681 acctgctcta catcctggat atacagcccc aggggggcct tcagtgcttc ccacagcctc 10741 ctgtcaaccc tctcaaggta agagctgggt ggaagaaaga cctgggaagg cggccccaga 10801 ccaaccaccg gggcacctct gtgggctggg gttcggggga gacctgggcc tgaccactcc 10861 tttgcccccc caggtggact gggggctgcc catccccagc ccctccccca ttcacccggc 10921 ccatcacaag cgggatcgca gacagatctt cctgccagag cccgagcagc cctcgaggct 10981 tcaggatcca gttctcgtag tgagcaggct ctctggtctc gggcccggcc tccccgggac 11041 ccacggggca gaggggatgg gaggagggag aggggtccgg gtgtgctgtg ggcctctgtg 11101 ggccacgctt ggtccctggg agcacttcaa gtgaacatgg aggagcatgc tggcttgtgt 11161 ctggggtgag ctgaaagaca cttgcacttt ttaaaagctt cccagtacgt taaggagcat 11221 aaaacaatgc caaagcaagg ttatcataga tctgagcatt gtgcgctggg ggatgaccct 11281 ccctgcatct ctgggactat gtgagcaagc ccgtggaaag acagcatccg aagcttggat 11341 ccaaggccct tcctgatggg aaggccaccg cttcctgaac ccccggcccc ttctgcgttg 11401 ggtcctgggg gtaagggggt gggggatgat ggggtgatgg gccgggacgg ctggggactg 11461 acgatgcttc ccctcagagc tgcgactcgg cgccctgtac tgtggtgcag tgtgacctgc 11521 aggagatggc gcgcgggcag cgggccatgg tcacggtgct ggccttcctg tggctgccca 11581 gcctctacca ggtggggtgg gccgtggtgg ggcggggccg ggccttctgg gccgggacca 11641 ctttgctctg ggaggggcgg ggtttggtgt gggagggcag gaagagaggg aaggcaaggt 11701 ttactttggg ggattgcagt gggattaggt cagaggcagg gcttccccgc cgggtgtggg 11761 acctggactc cgtgcaacca ataggcctct tgtgggtgta aacggctttc aaccccaacc 11821 tgtccagagg cctctggatc agtttgtgct gcagtcgcac gcatggttca acgtgtcctc 11881 cctcccctat gcggtggccc cgctcagcct gccccgaggg gaagctcagg tgagtgtggg 11941 gggatggagc agagaccagt cctgcaggac ccattgtccc ccagtcagtg cccagccaga 12001 aaagtctgag gggtggtacg ggtgggtggc atggctggag gtcaccagcc tgaggtttga 12061 gtctttgtga aaggcaggtg tcaaggtgac tgaggagaca cgtgggtttg ccccaggtgt 12121 ggacacagct gctccgggcc ttggaggaga gggccattcc aatctggtgg gtgctggtgg 12181 gtgtgctggg tggcctgctg ctgctcacca tcctggtcct ggccatgtgg aaggtgaggt 12241 gtgaaggacg gtggagtccc cagcggggca caggcttggc tctgccctgc ctcacaggga 12301 gtcaaggaga gatggtggcc cacccaagtg ggtaatccag ggaccagggg tctatgtctc 12361 cactattaga atgtcattct cgtccagggg ggtggctcac acctgtaatc ccagcacttt 12421 ggcaggcaaa gcgtttagat cacctgaggt caagagttcg agaccagcct ggccaacatg 12481 gtgaaacccc atctctacta aaaataccaa attagccggg cgtgttgaca catgcctgta 12541 atctcagcta ctcgggaggc tgaggcagta gaattgcatg aacccaggag gcggaggttg 12601 cagtgagccg agatcacacc actgcactcc agcttgggca acagagcgag cctccatctc 12661 aaaaaaaaaa caaaaaaata gaatgtcttt ctctagtaga gcaaaaggca aaacaaacac 12721 aaaaatgtca ttctcctggg aacccttcca gacacatacc actggaaagg atagcacctg 12781 aaattctgag gcctttagac acccctgcca ccaaaaagat tcagaggata tagagggtat 12841 agagggtgta agtcctgcct tcaggaattc ctggctggtc tcaaggacaa gatgcacttc 12901 ttcctagccc tgcccttccc cttgagtgag gaagaggcca aggattggtc tagaccctat 12961 tccatacctt cctatgtggc cctggagggt cactcgctcc tctgcacctg gaggagtctc 13021 aagcacactg aagggaagac atggtgcttt tagggaaaac cacgcactag acccacaata 13081 atcaaataca tatcatcata tgctcgagtc atgcagacac aaacttcagt ataagaaaaa 13141 ttccaggctg ggcgttggtg gctcacaccg gtaaaatccc agcactttgg gaggccgagg 13201 tggg // LOCUS HUMGPIIB3 364 bp ds-DNA PRI 01-JUN-1990 DEFINITION Human platelet Glycoprotein IIb (GPIIb) gene, exon 30. ACCESSION M34344 KEYWORDS platelet Glycoprotein IIb. SEGMENT 3 of 3 SOURCE Human leukocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 364) AUTHORS Heidenreich,R., Eisman,R., Surrey,S., Delgrosso,K., Bennett,J.S., Schwartz,E. and Poncz,M. TITLE Organization of the gene for platelet glycoprotein IIb JOURNAL Biochemistry 29, 1232-1244 (1990) STANDARD simple staff_entry FEATURES from to/span description pept + 110 169 platelet Glycoprotein IIb (GPIIb), exon 30 pre-msg < 1 > 364 GPIIb mRNA and introns IVS < 1 109 GPIIb intron CC BASE COUNT 68 a 111 c 98 g 87 t ORIGIN About 600 bp after segment 2. 1 ttctgcgctg gtccagggag gtgctcatat gctagcatac ttcctcacat gtgctctggg 61 gccagcaaat catctgtata ccctgacctt ggcccccgtg tacccccagg tcggcttctt 121 caagcggaac cggccacccc tggaagaaga tgatgaagag ggggagtgat ggtgcagcct 181 acactattct agcaggaggg ttgggcgtgc tacctgcacc gccccttctc caacaagttg 241 cctccaagct ttgggttgga gctgttccat tgggtcctct tggtgtcgtt tccctcccaa 301 cagagctggg ctaccccccc tcctgctgcc taataaagag actgagccct gatgctgagc 361 atgc // LOCUS HUMIMPH 2858 bp ss-mRNA PRI 01-JUN-1990 DEFINITION Human IMP dehydrogenase type 1 mRNA complete cds. ACCESSION J05272 KEYWORDS IMP dehydrogenase. SOURCE Human spleen, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2858) AUTHORS Natsumeda,Y., Ohno,S., Kawasaki,H., Konno,Y., Weber,G. and Suzuki,K. TITLE Two distinct cDNAs for human IMP dehydrogenase JOURNAL J. Biol. Chem. 265, 5292-5295 (1990) STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by Y.Natsumeda, 15-FEB-1990. FEATURES from to/span description pept 601 2145 IMP dehydrogenase type 1 (EC 1.1.1.205) signal 2843 2848 poly-A signal BASE COUNT 545 a 923 c 860 g 530 t ORIGIN 1 tcggaagggg ccaggagaca ctggaaggtc cggacggcag ggaaggggac ggggttcttt 61 ccagtcccac ccgtgtaggg acacctctcc ccctcatccc ccgatgtacc ctcgctgaat 121 ctgggatggg agagacgaac cgagtctagg catctgcgta gcagcgccgg ggagagcggg 181 gagcccaggc ggagcccagt cgactcccgg attcccctgc cccgcccccg gcacgaggcc 241 ccgccccggc gccccgcccc tcctcgggac tcgaccgggc tgcgctcact gcccagccgg 301 ggccccggga gcctccaggc tcgcccgccc tgagctgcgg cctccgcatg gagggccact 361 cactccacca ccgctgcagg gaggcggacg gcgctgttcc ggagcccgga gcccggcaac 421 acccgggaca cgagacggcg gcgcagggct acagcgcccg actgctgcag gccggctacg 481 agcccgagag ccctagattg gacctcgcta cacacccgac gacaccccgt tcagaactat 541 cttcagtggt cttactggca ggtgttggtg tccagatgga tcgccttcgc agggctagcc 601 atggcggact acctgatcag cggcggcacc ggctacgtgc ccgaggatgg gctcaccgcg 661 cagcagctct tcgccagcgc cgacgacctc acctacaacg acttcctgat tctcccagga 721 ttcatagact tcatagctga tgaggtggac ctgacctcag ccctgacccg gaagatcacg 781 ctgaagacgc cactcatctc ctcccccatg gacactgtga cagaggctga catggccatt 841 gccatggctc tgatgggagg tattgggttc attcaccaca actgcacccc agagttccag 901 gccaatgaag tacgcaaggt caagaacttt gaacagggct tcatcacgga ccctgtggtg 961 ctgagcccct cgcacactgt gggcgatgtg ctggaggcca agatgcggca tggcttctct 1021 ggcatcccca tcactgagac gggcaccatg ggcagcaagc tggtgggcat cgtcacctcc 1081 cgagacatcg actttcttgc tgagaaggac cacaccaccc tcctcagtga ggtgatgacg 1141 ccaaggattg aactggtggt ggctccagca ggtgtgacgt tgaaagaggc aaatgagatc 1201 ctgcagcgta gcaagaaagg gaagctgcct atcgtcaatg attgcgatga gctggtggcc 1261 atcatcgccc gcaccgacct gaagaagaat cgagactacc ctctggcctc caaggattcc 1321 cagaagcagc tgctctgtgg ggcagctgtg ggcacccgtg aggatgacaa ataccgtctg 1381 gacctgctga cccaggcggg ggtcgacgtc atagtcttcc actcgtccca agggaattcg 1441 gtgtatcaga tcgccatggt gcattacatc aaacagaagt acccccacct ccaggtgatt 1501 ggggggaacg tggtgacagc agcccaggcc aagaacctga ttgatgctgg tgtggacggg 1561 ctgcgcgtgg gcatgggctg cggctccatc tgcatcaccc aggaagtgat ggcctgtggt 1621 cggccccagg gcactgctgt gtacaaggtg gctgagtatg cccggcgctt tggtgtgccc 1681 atcatagccg atggcggcat ccagaccgtg ggacacgtgg tcaaggccct ggcccttgga 1741 gcctccacag tgatgatggg ctccctgctg gccgccacta cggaggcccc tggcgagtac 1801 ttcttctcag acggggtgcg gctcaagaag taccggggca tgggctcact ggatcccatg 1861 gagaagagca gcagcagcca gaaacgatac ttcagcgagg gggataaagt gaagatcgca 1921 cagggtgtct cgggctccat ccaggacaaa ggatccattc agaagttcgt gccctacctc 1981 atagcaggca tccaacacgg ctgccaggat atcggggccc gcagcctgtc tgtccttcgg 2041 tccatgatgt actcaggaga gctcaagttt gagaagcgga ccatgtcgcc ccagattgag 2101 ggtggtgtcc atggcctgca ctcttacgaa aagcggctgt actgaggaca gcggtggagg 2161 ccgaggtggt ggaggggatg caccccagtg tccacttttg ggcacaggct ccctccataa 2221 ctgagtggtc cacagatttg cactacgggt tctccagctc ctttccaggc agagaggagg 2281 ggaggtcctg aggggactgc tgcccctcac tcggcatccc ctgcagagtc aggactgctc 2341 ccgggggcca ggctgccctg ggaggccccc tccgagacca gccagccagg ctctcaggac 2401 ctgcgctgcc ttaggatctt tcttgctgca gcctgctcca gcctggcccc caccccaggg 2461 gcaggcggcc cctcctggct tctcctgtag ggcacctccc tgcccctagc ctcccagcaa 2521 atggtgctct cctggccctg ctctggccct tcccgggccg tgcccctcag ccatgtggca 2581 cttctgagct cctgacctag gccaagggga ggtctctgcc cccttccccg gccctgggct 2641 acccttgggt cctgctcctc aggccgctcc cctgtccctg gccatgggta ggagactgcc 2701 ctggtcatgg ccgcctgcct gtcattcctg actcaccacc gtccccaggt gaaccattcc 2761 tcccttctcc tcagctgcag tcgaaggctt taactttgca cacttgggat cacagttgcg 2821 tcattgtgta ttaaatactt ggaataaatc aagcaggt // LOCUS MBIFTRX 1260 bp ds-DNA BCT 01-JUN-1990 DEFINITION M.thermoautotrophicum tetrahydromethanopterin formyltransferase (FTR) gene, complete cds. ACCESSION J05173 KEYWORDS tetrahydromethanopterin formyltransferase. SOURCE M.thermoautotrophicum. ORGANISM Methanobacterium thermoautotrophicum Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Methanobacteriales; Methanobacteriaceae. REFERENCE 1 (bases 1 to 1260) AUTHORS DiMarco,A.A., Sment,K.A., Konisky,J. and Wolfe,R.S. TITLE The formylmethanofuran: Tetrahydromethanopterin formyltransferase from Methanobacterium thermoautotrophicum delta-H: Nucleotide sequence and functional expression of the cloned gene JOURNAL J. Biol. Chem. 265, 472-476 (1990) STANDARD simple staff_entry FEATURES from to/span description pept < 1 153 unknown protein (AA at 1) pept 211 1101 tetrahydromethanopterin formyltransferase BASE COUNT 371 a 260 c 305 g 324 t ORIGIN 1 ggatccttgg agggagatga gggagatgag ggtatagtaa atatttttgt tctctggtgc 61 tgtaataacc atatgtttat cacccggcgc ataaccctct tattattcct aggggccctc 121 agattcatgg ttttacagtt atttctaagc tgaattgaaa gtagaaaaat atattaaaca 181 ttctcccagt aaatcaacta ggtgatagtt atggagataa atggtgttga aatagaagac 241 acatttgcag aggcctttgg cataaaggtt tcaagggttc ttgtaaccgc aacaaaaaaa 301 cttgccaaaa tagcagcaac cgaggcaaca ggttacggta cctcagttat agggtgccct 361 gcagaggctg gcatcgactg ctatgttcca cctgaggaga cacctgacgg aagaccaggg 421 tacattataa tgatatgcaa cccatccaag aagagccttg accatgagct ccttgaaagg 481 atagggatgg gtatactgac agcaccaacc accgctgtct tcgatgcact ggatgatgag 541 gatgagaagc tcaacatagg attcaaacta aaattcttcg gtgacggcta cgagaaggaa 601 cttgaaattg atggaagaaa gatccactca atcccaataa tgtcagggga cttcctgatt 661 gaaagccagt tcggaataaa ggatggagtt gccggcggaa acttcttcat aatgggtgac 721 agtcaggcat cagcgctact ggctgcacag gcagctgttg atgccatagc tgcagttgaa 781 ggtactgtaa caccattccc tggtggagtg gttgcttcag gatcaaaggt cggctcaaac 841 aagtacaagt tcctcaacgc ctcaaccaat gagaagatgt gtgtgaccct gaaggacgaa 901 gttgaggaca cccagatacc tgagaacgtc aacggggtct acgagatagt catagatggt 961 gttgatgaag aggccgtaag ggaggccatg aaggaaggta taaaggctgc ctgcacggtt 1021 ccaggcataa taaagataag tgcaggaaac tatggcggta acctgggagc ctacaagata 1081 aaactccatg acctcttctg atttttttat ccatgccctg atcctatcag ggttgacctt 1141 tttttattct atgcagaatt tttccatcat ttcgctttca actatttccc atatctcgtc 1201 tcgcggaggt taacaccgag agtatacccc ttttcctgag gtcatcttca attttcatta // LOCUS PFAROP1 3061 bp ds-DNA INV 01-JUN-1990 DEFINITION P.falciparum rhoptry associated protein 1 (rop1) gene, complete cds. ACCESSION M32853 KEYWORDS rhoptry associated protein 1. SOURCE P.falciparum (strain K1) blood stage DNA, clones lambda rop[1-3]. ORGANISM Plasmodium falciparum Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 3061) AUTHORS Ridley,R.G., Takacs,B., Lahm,H.-W., Delves,C.J., Goman,M., Certa,V., Matile,H., Woollett,G.R. and Scaife,J.G. TITLE Characterisation and sequence of a protective antigen from Plasmodium falciparum JOURNAL Mol. Biochem. Parasitol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer readable sequence for [1] kindly submitted by R.G.Ridley, 15-MAR-1990. FEATURES from to/span description pept 628 2976 rhoptry associated protein 1 (rop1) precursor sigp 628 695 rhoptry associated protein 1 (rop1) signal peptide matp 696 2973 rhoptry associated protein 1 (rop1) BASE COUNT 1266 a 335 c 357 g 1103 t ORIGIN 1 gtatacacac atacaaatat gtaaaaatag acatttcaaa atttttaata taaaataaca 61 tatatatata taaataaata gtaaaattaa ttttttttat ttcatcattt tatatataca 121 tatatatata tatatatata tatatattta tttatttatt tatatacata tataagttat 181 ttatttattt atttcctttt tggttgtatt ttcgtttttt tgattctatt tatatattta 241 tagtaatatt tataattata attattttat ttaatttttt atctttttca ttttatatat 301 taataacttt cttttaaata ttatgatatt aatttttatt ttttagtatt tatataagat 361 tttaaaatag acaaaataaa tataaataaa ttatatatta tatatatata tatatatttt 421 ttttttttta tagttatgca catatatatg tattaatata tattatcttt aaactattag 481 aatatatact ttattattct aaatattatt aatttgtaat actcacatat atatatatat 541 atatatatat atttatttat ttatttattt atttatttat ttatatagta aaaaatcaat 601 taattttttt tttttttttt atatataatg agtttctatt tgggtagctt agtaataata 661 ttccatgtac tcttccgtaa tgtcgctgat ggtataaatg taaacggaga taataattat 721 gggaaaacaa taatcaataa tgatttcaat tttgatgatt acaattattg gacaccaata 781 aataaaaagg aatttttaaa ttcctatgaa gatgaatttt caagtgaatc ctttttagaa 841 aataaatcta gtgttgatga tggaaatata aatttaacag atacaagtac atcaaataaa 901 agttctaaaa aaggacatgg tagaagtaga gtaagatcag catcagctgc tgcaattctt 961 gaagaagatg attcaaaaga tgatatggaa tttaaagctt ctccttcagt tgttaaaaca 1021 tctactccat caggtacaca gacatctggt ttaaaatcat ctagtccatc tagtacaaag 1081 tcatcaagtc catcaaatgt aaaatcagct agtccacatg gtgaatctaa ttcttctgaa 1141 gaaagtacta ctaaatcctc aaagagaagt gcttcggttg caggtattgt aggtgccgac 1201 gaagaagcac ctcctgcacc aaaaaacacc ctcactccat tagaagaatt atatcctact 1261 aatgttaatt tatttaacta taaatattca ttaaacaata tggaagaaaa tatcaatata 1321 cttaaaaacg aaggagattt agttgcacaa aaagaagaat ttgaatatga tgaaaatatg 1381 gaaaaagcta aacaagacaa aaaaaaagca cttgagaaaa taggaaaaca atcagacgaa 1441 gaacctttta tgttttcaga aaataaattt cttgaaaatc aagtaaaaga aagaaatgtt 1501 gctggatcct tttctcgatt tttcagtaaa ttaaatcctt ttaagaaaga tgaagtaata 1561 gaaaaaactg aagtatcaaa gaaaacattt tcaggtatag gttttaatct tactgacaaa 1621 gaagctaaag tattaggtgt aggtgcaacc tatcaagaat atccagaaac catgttatat 1681 aactgtccaa acaattctaa tttgtttgat actatagaat cattacaagg aagaataatt 1741 gatattaaaa aaagagaaag catgatatca acaactttcg aacaacaaaa agaatgttta 1801 aaaaatatgg gtgtacttga tcttgaatta aacgatacac aatgtaaatt tggtacatgt 1861 ataggtagct ttggagaaca tcatcttaga ttatacgaat ttgagaatga cttatttaaa 1921 tttcatccaa atattgatta tttaacttta gctgatggat ataaattaca aaaaaatcat 1981 atatatgaat tatcccatgt aaacttttgc ttattaaatc ctaaaacatt agaagaattt 2041 ttaaaaaaaa aagaaatcaa ggatcttatg ggtggtgatg atcttataaa atataaagaa 2101 aattttgata actttatgag tatatctata acatgccata ttgaatcttt aatatatgat 2161 gatattgaag catctcaaga tattgctgct gtattaaaaa ttgctaaaag taaattacat 2221 gtaataacat caggtttatc atataaagca agaaaattag tatataaaat ttatagtgaa 2281 attcaaaaaa atccagatga actctatgaa aaattaacat ggatttatga taatatctat 2341 atgattaaaa gatattatac tgcatatgct ttagaaggtg tctgttcata tcttgaacat 2401 gataaaagtc aaatgtatac agaattacat atttataaca aaatagtcga ctctgttcgt 2461 tattatagtt catgctttaa aaacgttatt gtttataatg ctatcatttc tggtatacat 2521 gaaaaaataa aacatttctt aaaattagta ccaagacaca actttctttt ggattatcac 2581 tttaattcaa tttttgaaaa agaaattaaa ccagccaaaa aatatagtac ttcacatatt 2641 tattttgatc caactgttgc atcatatgct tattataatt tagatagaag aaccatggtt 2701 actattatta atgattattt cgaagcaaaa aaaaaagaat taaccgttat agtatctcgt 2761 atgaaaacag atatgctcag tcttcaaaat gaagaatcaa aaataccaaa tgacaaaagt 2821 gcaaattcaa aactagctac aagattaatg aaaaaattta aagctgaaat cagagatttc 2881 ttcaaagaaa tgcgtataca atatgctaaa ttaataaaca tacgttacag atctcactta 2941 aagaaaaact actttgcctt caagagatta gattaagaat ataaacttga aaaatatata 3001 taatgtaaaa atatactaaa tatattaata taatatatat atatatatat atatatatat 3061 a // LOCUS STYFLICD 350 bp ds-DNA BCT 01-JUN-1990 DEFINITION S.typhimurium phase-1 flagellin C and D (fliC, fliD) genes, partial cds. ACCESSION M33541 KEYWORDS phase-1 flagellin. SOURCE S.typhimurium DNA. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 350) AUTHORS Inoue,Y.H., Kutsukake,K., Iino,T. and Yamaguchi,S. TITLE Sequence analysis of operator mutants of the phase-1 flagellin- encoding gene, fliC, in Salmonella typhimurium JOURNAL Gene 85, 221-226 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 313 > 350 phase-1 flagellin C pept 56 < 1 (c) phase-1 flagellin D BASE COUNT 120 a 63 c 78 g 89 t ORIGIN 1 aacaactggt ctaacggtaa gtttgatccc acacctaatg atgaaattga agccatgcct 61 tcttcctttt tgattgcaaa cagtagttaa gcgcgttatc ggcaatctgg aggcaaagtt 121 taatgataat tttgcaaaaa taatgcgcgg aataatgatg cataaagcgg ctatttcgcc 181 gcctaagaaa aagatcgggg gaagtgaaaa attttctaaa gttcgaaatt caggtgccga 241 tacaagggtt acggtgagaa accgtgggca acagcccaat aacatcaagt tgtaattgat 301 aaggaaaaga tcatggcaca agtcattaat acaaacagcc tgtcgctgtt // LOCUS SYNCRPBS1 88 bp ds-DNA SYN 01-JUN-1990 DEFINITION Synthetic E.coli cyclic AMP activated promoter DNA. ACCESSION M33414 KEYWORDS Artificial promoter; cyclic AMP activated promoter. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 88) AUTHORS Aiba,H., Hanamura,A. and Tobe,T. TITLE Semisynthetic promoters activated by cyclic AMP receptor protein of Escherichia coli JOURNAL Gene 85, 91-97 (1989) STANDARD simple staff_entry BASE COUNT 24 a 22 c 18 g 24 t ORIGIN 1 aaatgtgatc tagatcacat ttcaagcttg actgactgag atcagcttgc aacctttccg 61 gtcgcggcga taaagacatc ttcaccgt // LOCUS SYNCRPBS2 88 bp ds-DNA SYN 01-JUN-1990 DEFINITION Synthetic E.coli cyclic AMP activated promoter DNA. ACCESSION M33415 KEYWORDS Artificial promoter; cyclic AMP activated promoter. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 88) AUTHORS Aiba,H., Hanamura,A. and Tobe,T. TITLE Semisynthetic promoters activated by cyclic AMP receptor protein of Escherichia coli JOURNAL Gene 85, 91-97 (1989) STANDARD simple staff_entry BASE COUNT 25 a 24 c 18 g 21 t ORIGIN 1 aaatgtgatc tagatcacat ttggaattcc cggggatccg ggccctcttc aaacaggccc 61 agttcgccaa taaaatcacc tgattcag // LOCUS SYNCRPBS3 56 bp ds-DNA SYN 01-JUN-1990 DEFINITION Synthetic E.coli cyclic AMP activated promoter DNA. ACCESSION M33416 KEYWORDS Artificial promoter; cyclic AMP activated promoter. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 56) AUTHORS Aiba,H., Hanamura,A. and Tobe,T. TITLE Semisynthetic promoters activated by cyclic AMP receptor protein of Escherichia coli JOURNAL Gene 85, 91-97 (1989) STANDARD simple staff_entry BASE COUNT 17 a 10 c 14 g 15 t ORIGIN 1 aaatgtgatc tagatcacat ttggagtgcc tgaatgggcg tacagttatg aaaccc // LOCUS SYNCRPBS4 56 bp ds-DNA SYN 01-JUN-1990 DEFINITION Synthetic E.coli cyclic AMP activated promoter DNA. ACCESSION M33417 KEYWORDS Artificial promoter; cyclic AMP activated promoter. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 56) AUTHORS Aiba,H., Hanamura,A. and Tobe,T. TITLE Semisynthetic promoters activated by cyclic AMP receptor protein of Escherichia coli JOURNAL Gene 85, 91-97 (1989) STANDARD simple staff_entry BASE COUNT 20 a 10 c 9 g 17 t ORIGIN 1 aaatgtgatc tagatcacat ttggaattcc aaatgtgatc tacagttatg aaaccc // LOCUS TRFCRE1 3940 bp ds-DNA INV 01-JUN-1990 DEFINITION C.fasciculata retrotransposable element I (CRE 1) gene complete cds. ACCESSION M33009 KEYWORDS integrase; retrotransposable element I; reverse transcriptase. SOURCE C.fasciculata. ORGANISM Crithidia fasciculata Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 3940) AUTHORS Gabriel,A., Yen,T.J., Schwartz,D.C., Smith,C.L., Boeke,J.D., Sollner-Webb,B. and Cleveland,D.W. TITLE Rapidly rearranging retrotransposon within the miniexon gene locus of Crithidia fasciculata JOURNAL Mol. Cell. Biol. 10, 615-624 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Gabriel, 21-MAR-1990. FEATURES from to/span description ORF < 416 3837 ORF (AA at 2) rpt 1 415 mini-exon gene repeat site 386 415 target site duplication site 415 415 mini-exon splice junction rpt 3899 3940 mini-exon gene repeat site 3899 3927 target site duplication BASE COUNT 890 a 1032 c 1330 g 688 t ORIGIN 1 aagcttccgg aaacaaccgg cacaaatttt gaggcggaag cgctgctttt ttttgtgtcc 61 gggggggtgc tccttggggt ccccctgtcc agccccagcc ggtcgcccac cacataggaa 121 tttgcgaagg acccccaaaa atcccggtcc ccggggcgag ttgtcccaac tttttcaaac 181 ctcatgaaga gctagttgcg tcattgaaaa gttcgtgtgc agaaaccccc tcccccacgt 241 ttgtacaatg gaagagttta cgatacaggt tttctcacgg ttttgaggtg ttttttcgaa 301 aaacaaaaaa tatagaggtg tatagcgctt atttttgaca cccccctcaa aacatgctgg 361 gggtataggt ccttccaact aacgctatat aagtatcagt ttctgtactt tattgaacgg 421 cattcggtct agtgggtcca ctcccaacct tctcctcctt ggtctcgggt tcgattccgg 481 tcgggcacga aactctcttc ctctctaaac tccacatata catccacaat aaccactctc 541 ataactgttc tggtgcgggg ccatttcgaa ccatattgac atttggcgca ccaaactcac 601 acatactaac caaaaatggc agagccaccc ggggcgacgg tccgagcgtg gcggagcacc 661 tcaagcgccg gcgggtcgaa ggatctgagc cggtcgtggt ggggtctcgc caggagggtc 721 tctccggtga gtcagcagtc gaagcgattg ttgtggaaag tggcagcgag gctgacgaag 781 aatccacggc gagggggctg ggagcctctc gacaagcgcc aggtgatgac tcgggcgggc 841 cggtgcccag ggcggaggag gacctcgccc cggttgggta cctgtatccc gagaacctgg 901 ccgtcccaca gggggcggcg tgtccggtgg tgggttgtgg gtaccgaccg aacacccggg 961 tgggaccgag actggtggaa cacctgaaca cggtgcaccg ggatatcctg ggcaacattc 1021 ctgtcgacgc ctggcgacgt cagggactcg tccgctgcct acggtgtggc tcggccttga 1081 ccgcgtccgg tcatggtcgg ggtgcacacg gtgggaagtg tggtccgtac agaagcagaa 1141 acgcggccat tagggctcgg acgcagagtt tctttggtgg aatttcgcag aattctgaca 1201 cacttacggc gggaaacggg gtctcagaag ctggtggtct agtggaagtg ggtagggtag 1261 aagacccaac cacagacccg tggtacaggg ttcgaacccc cttgaagcga caaatttacc 1321 gtacggactt ccgggtgtgg cagggcttgg cccggccagt actcctgggg tactctctgg 1381 cggatacagc agggaaggag ggacgccttc tcgccctgct caatctcccc agggaccatc 1441 tggaggtcca ggtaaatgct aagaggggaa tccaaccaca gccggctgaa atcgaggccc 1501 aggttcgaag gaaggtagtg gagttggcag ggataggagc ggtaggacga gcgatggcag 1561 tgatgactcg ggggcggctg gtcgaggtgc ccctggagcg ggtgatggag cagctcgagg 1621 agctccaccc gcaggaggac ccgcggggtt atccggcagc accagatacg agcgaggtgc 1681 tgcgggccaa ggagcagaag gtgcggcggg cgatcgcggc acggatgggg aggggtacgg 1741 cgcccggcct cgatggctgg acgcgggagc tcctcctccc cctcgcagaa gacccggccc 1801 tgctacacga gatcacgtcg gtggtctcgg atatcatgca ggggaaggtg gccgaggtgg 1861 tggcgcggag gctgcggagc agcgccgtca ccccgattcc gaaggacgag gcggggacga 1921 agatacgccc gatcgtgccg gagtcggcct ggctgaagct ggcctcgctg gtggcgatgg 1981 cggagatacc atccagcttc aaggagacct tcaaggggtg gcagtacggg gtctgggggg 2041 acgtcgccaa agcagtggcg aagatccgcc gggacagcga ggagcacgag tacctggtgg 2101 cactcgacgg ggtcaatgca tacaatacga tgagcagggc ccacatcctc caagccgtgt 2161 acgccgagca gcgcctgaag ccgatctggg gggtggtgaa ggtggcgctt ggggggccgg 2221 ggttcctggg agtatacagg gacggctgcc tcaagggcaa cctgtggtcc accaagggaa 2281 tccggcaggg catggtgctg ggccccctcc tgtacgcgac cgggatggca gcggccatcg 2341 ggccggtacg gcagcgtatc cccggggtcc ccgtgacggc ctacatcgac gacatcaccc 2401 tcgcggccag cggggcggag ggagccaggg cagccgaggc atacgcagac gccctcgaga 2461 cggtcggggt ggtcaccaac gccaggaagt cgatggtggt ggggccagaa ggcacccggg 2521 tggggatcgg gggtgtagac ctgccggtgg tggccgaggc ccggatcctg ggggcccact 2581 tccgagcaag ggggacacca gaggcccgta ccatcgagtg gctgcaggcg gccgtcgaga 2641 agtggcgacc catccaccag aagctgcggc aggacatcat cccgaagaac attgcgatga 2701 tgatgacccg catcagcctg gggtccaaga tgaccttcct cctccagacc cactcgccac 2761 aggaactgga gaccgcagcg aagacggcag acgacgaggt cgagcagacc ctccagcacc 2821 ttatggggca ggtagagatc acgccccgag caaggctgct ggcacaactc ccgatcagag 2881 agggggggct gggtctccgg cgaagcagcg agattgcgaa gttcgcacag gcagacgtgg 2941 ggcaaggcga ggcccaccag gcacacacaa aggcactaga tgaagggatc aagcaccagt 3001 tacaaccact cctctcggag tccgaggtgc agatcctgaa gtcgaacgct ggaatggggg 3061 ccgggcgggt gctgacagat agtagcctga ggatcccaga cgtggcagca acaatcgcgc 3121 tgagggagag actcctgctc agggtgctcc cggagggatg cagtgtgtgt gtggggggga 3181 cgcgacgaac taccatgtac acacgtgctc caacataccc accaagcccc ggacccgacg 3241 acacgacggg gtggtggatg agctggtggc cctggccagg aagatggggt acgagcccag 3301 caaggagccg agggcggacg ttgacgagta gggcgaggcc ggacctgtac atcacaggaa 3361 gcctgaagcc ggcggcgacg gatgtaacaa taacctaccc gggcaggcag gcgaggggag 3421 cacactcccg ttgcagcagg cctaccggaa taagatgggg gcctgggagg catgggggaa 3481 cctgcgaggg gtggacatgc agccggtggt cctcgggacg aacgcggaga tacacccgga 3541 gagtgcgaat ggatacgaag gttgacctcg gtcgaagaca aagacaaaat acataccagt 3601 tacaacgagg tgacgggacg aatcgtggag acggtgttgg ttgggaacgt ggagctgttc 3661 aacgcagtga cgaacctggc gctggtcagg gagttgatgt aggaaggtac gggtgggtag 3721 tagtagaata tcgaacggcg actaggaagt tggaccattg ttacttgtta cttgtgattc 3781 tgacgacgac gaaagattgt tccttgttac ttgttaccct gaaatttgat attacttgaa 3841 ttgatatgat atgattgata caaaaaaatt taaaaaaaaa aaaaaaaaaa aaaaaaaaat 3901 ataagtatca gtttctgtac tttattggta taagaagctt // LOCUS SQDKINEA 3693 bp ss-mRNA INV 01-JUN-1990 DEFINITION Squid kinesin heavy chain mRNA, complete cds. ACCESSION J05258 KEYWORDS kinesin; kinesin heavy chain. SOURCE Squid, cDNA to mRNA. ORGANISM Loligo pealii Eukaryota; Animalia; Eumetazoa; Mollusca; Cephalopoda; Coleoidea; Teuthoidea; Myopsida; Loliginidae. REFERENCE 1 (bases 1 to 3693) AUTHORS Kosik,K.S., Orecchio,L.D., Schnapp,B., Inouye,H. and Neve,R.L. TITLE The primary structure and analysis of the squid kinesin heavy chain JOURNAL J. Biol. Chem. 265, 3278-3283 (1990) STANDARD simple staff_review FEATURES from to/span description pept 421 3324 kinesin heavy chain BASE COUNT 1220 a 684 c 822 g 967 t ORIGIN 1 aaaagaaatc tttctaggcg gccataagtc gctgttctac aaataatacc gagatcgtca 61 taacaaaccg cctatttaac acttccgaac aacttttata gaatatttac gactgttttg 121 aatagtatat ttagctgtca ggctgcgatt tgcttccttt tatttattta tttatttatt 181 tatatttttt gtagtagaaa tttacatttt ttttttttgc gggggagacg tcaggatatt 241 attacctact tttgtttgtg cgcgtgtgtt gaagaaacga aagacgttca agctttattg 301 gattagagaa ttgataaaga tcggattaac tgtataataa cgctaaactt taatttattt 361 gggacctagg aaagtcccat ataaacccaa gaatttatat cgtcctcaaa caacgccgcc 421 atggacgtgg cttccgagtg taacattaaa gtcatctgcc gagttagacc ccttaacgaa 481 gccgaggaac gggccggcag caagttcatc cttaaattcc caactgatga ttccatctct 541 atcgctggga aagtgttcgt atttgataag gttttaaaac ctaatgtctc acaagaatat 601 gtgtacaatg ttggtgctaa gcccattgtt gctgatgtac ttagtggttg caatggaaca 661 atttttgctt atggacaaac atccagtggg aaaactcata ctatggaggg tgtgctggac 721 aaaccctcca tgcacggcat cattccacgc attgtacaag acatcttcaa ctacatttat 781 ggaatggacg aaaacttgga gtttcatatc aagatttcat attatgaaat ttacttggac 841 aaaatcaggg atcttttgga tgttaccaag accaacttgg ctgtccatga agacaaaaac 901 agagtcccgt ttgtcaaggg tgccactgaa cgatttgtat ccagtcctga agaagtaatg 961 gaagttattg atgagggtaa aaacaacaga catgtagctg tgacaaacat gaatgaacac 1021 agttcgagaa gtcatagtgt atttttaatc aatgtcaagc aagaaaatgt tgaaacccag 1081 aaaaaattaa gtggtaaact ttacctagta gatcttgctg gtagtgaaaa agtaagcaaa 1141 actggtgcag aaggtgcagt ccttgatgaa gctaaaaaca tcaataaatc tttgtcagct 1201 ctgggaaatg tcatttcagc tcttgctgat ggaaataaat ctcacgtccc atacagagac 1261 agtaaattga cgagaatctt gcaggaaagt ttgggtggta atgcccgaac cactatggtc 1321 atatgttgtt cacctgcctc ttacaatgaa tccgagacca agtctacttt gctttttggc 1381 caaagagcta agactattaa gaatgtggta tcggttaatg aggagcttac agctgatgaa 1441 tggaaacgaa gatatgagaa agaaaaagaa agggttacta agcttaaagc cactatggct 1501 aagcttgagg cagaactaca gagatggaga actgggcaag ctgtatctgt agaagaacaa 1561 gttgatctta aggaagatgt ccctgctgaa tctcctgcca ctagtacaac tagcctggct 1621 ggaggactca tcgcttcaat gaatgagggt gaccgcaccc aactggaaga agagcgtctc 1681 aagctttatc aacagttgga tgacaaggat gatgaaatta acaatcagtc tcagttaatt 1741 gaaaaattga aggagcagat gatggaacaa gaagatttaa ttgcccaatc tcgtcgtgat 1801 tatgaaaatc tacaacaaga tatgtctcga atccaagctg acaatgagtc tgccaaggat 1861 gaagtcaaag aggtcttgca agctcttgag gaactggcca tgaactatga tcagaagtct 1921 caggaggtgg aagacaaaaa taaggagaat gagaatctaa gtgaggaact taaccagaaa 1981 ttgagcacct tgaatagctt acagaatgaa ttggatcagc tgaaggattc atctatgcac 2041 cataggaaac gagttactga catgatgatt aacttgttga aagatcttgg tgatattggt 2101 actattgtag gtggaaatgc tgcagaaact aagccaactg ctggttctgg tgagaaaatt 2161 gaggaggaat tcactgtagc tcgtctgtat attagcaaaa tgaaatctga ggtgaagacg 2221 ttggtgagcc gaaacaacca actggaaaat actcagcagg acaattttaa gaaaattgaa 2281 acacatgaaa aggatttgtc caactgcaaa ttactcattc aacaacatga agccaaaatg 2341 gctagtttac aggaagccat taaagattct gagaacaaga agaggatgtt ggaagacaat 2401 gtggattctc taaatgagga atatgccaaa cttaaagctc aagaacaaat gcatttagct 2461 gctctgtccg aacgggagaa ggagaccagc caagcaagtg aaactcgaga agttcttgag 2521 aaacagatgg aaatgcatcg ggaacaacac caaaaacaac ttcaatcttt acgggatgaa 2581 atttcagaaa agcaggctac tgtggataat cttaaagatg ataaccagag actttctttg 2641 gccctggaga aacttcaggc tgattatgac aaactcaaac aagaggaagt tgaaaaggct 2701 gccaagttgg ctgacttatc attgcagatt gaccgtcggg agcaagccaa acaggatctc 2761 aaaggacttg aagagactgt ggcaaaggag ctgcaaacac tacacaatct tagaaaactt 2821 tttgtacaag atctacaaaa taaggtcaaa aagtcatgca gcaagaccga agaagaagat 2881 gaagatactg gtggtaatgc tgcccagaaa caaaagattt ctttcttaga aaataatctt 2941 gagcagctga ctaaggttca caagcagctt gttcgagaca atgctgactt gcgctgtgag 3001 ttgcccaagc tggagaaaag gttacgtgct actatggagc gagtcaagtc cttggaatct 3061 gctctcaagg atgccaagga aggcgctatg agagacagaa aaaggtacca gcatgaggtg 3121 gatagaatca aggaagctgt ccgccagaag aatctcgctc gccgaggaca tgctgcacaa 3181 atagccaaac caatccggcc tggacaacac cagtctgttt ctcctgcgca ggcagcagct 3241 attcgcggtg gtggtggttt gtcccagaat ggaccaatga ttacatctac gccaattcga 3301 atggcacctg aaagcaaggc ataaatatct agttggaaga ggaattttta ctcccattca 3361 tcaaagaaca acagaactac aaagctttct agtgcattca ttgataattg aaatggagga 3421 tatgtatctc ctcctcctct actattatat ctcctcctcc tacatgaagc tgcattccgt 3481 tttgttagcg tgtaccctgt gtctcgctgc tacagtggag tgtatgtggg ctcttgcata 3541 ccactgcata gtggcagtta ggatggctgg ttacttagct atttaactgc ctttctacct 3601 gcctgcttcc cacctgcctt agtgatggca agtgagtgac tgctctggtg gcaatccaac 3661 ttttgtacaa ttattaaaaa aaaaaaaaaa aaa //