Path: utzoo!attcan!uunet!cs.utexas.edu!usc!apple!bionet!daemon From: GenBank-Updates@genbank.bio.net Newsgroups: bionet.molbio.genbank.updates Subject: Database Update Message-ID: <9004101920.AA29042@life.lanl.gov.LANL.GOV> Date: 10 Apr 90 19:20:16 GMT Sender: daemon@genbank.BIO.NET Distribution: bionet Lines: 875 Approved: lear@genbank.bio.net Checksum: 58339 57 LOCUS BSUHEMAC 3795 bp ds-DNA BCT 15-FEB-1990 DEFINITION B.subtilis delta-aminolevulinate synthase (hemA) uroporphyrinogen I synthase (hemC) genes, complete cds. ACCESSION M32130 KEYWORDS delta-aminolevulinate synthase; uroporphyrinogen I synthase. SOURCE B.subtilis DNA. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 3795) AUTHORS Petricek,M., Rutberg,L., Schroeder,I. and Hederstedt, TITLE Cloning and characterization of the hemA region of the Bacillus subtilis chromosome JOURNAL Unpublished (1990) Univ. of Lund, Sweden STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by I.C.Schroeder, 16-FEB-1990. FEATURES from to/span description pept 232 1599 delta-aminolevulinate synthase (hemA, EC 2.3.1.37) pept 1607 2437 ORF2 pept 2470 3414 uroporphyrinogen I synthase (hemC, EC 4.3.1.8) BASE COUNT 1068 a 689 c 953 g 1085 t ORIGIN 1 atgcatatca ccttcttgtt ttttagagct gatgtgtagt aaatttctgc tgtttttggt 61 attgtcaata ggaatgcttc ttttccctga agctttttct aatatagcat aagaatttta 121 aaatctgttc acattttgtg aaagaaacta tgttataatt attataaata atgagttcta 181 tgttagaatg attataaatt aagattgggt gttgggggtg taattagagc gatgcatata 241 cttgttgtgg gagtagatta taaatccgcc cctattgaga tacgtgaaaa agtaagtttt 301 cagccgaatg agctggcaga agcaatggtg cagctgaaag aagagaaaag cattcttgaa 361 aacatcattg tctcaacctg caaccgcact gaaatttatg cggtagtcga ccagcttcat 421 accggccgtt attatataaa aaagttttta gctgattggt ttcaattaag caaagaagag 481 ctgtcaccgt tcttaacgtt ttatgagagc gatgccgctg ttgagcattt attccgtgta 541 gcctgcggac ttgattctat ggtgattggc gaaacgcaga ttctcggaca ggtacgcgac 601 agctttaaaa cagctcagca agaaaaaacg atcgggacta tttttaatga gctgtttaag 661 caggcagtta cagtgggcaa acggactcac gccgaaacag acattggctc aaatgcggtg 721 tcagtaagct atgctgcagt tgaacttgcc aaaaaaatct tcggaaatct ttcaagcaag 781 cacatattga ttctcggtgc gggaaaaatg ggcgagcttg ctgcggaaaa cctgcacgga 841 cagggaatcg gcaaggtcac tgtcattaac cgaacatact tgaaagcgaa ggagcttgca 901 gaccgttttt caggtgaagc gagaagcttg aatcagcttg aaagcgcgct tgcggaggct 961 gatattttaa tcagttcaac cggtgcaagt gaatttgtcg tgtccaaaga gatgatggaa 1021 aacgcgaata agcttcgcaa gggacgtccg ctgtttatgg tcgacattgc cgtgcctaga 1081 gatcttgatc cggcgctgaa tgatcttgaa ggtgtttttc tttatgatat cgacgatctg 1141 gaaggcattg tagaagcgaa catgaaagag cggagagaaa cagctgaaaa agttgaactg 1201 ttaattgaag aaaccattgt ggaatttaaa caatggatga atacacttgg tgttgtgcct 1261 gttatttctg cattgcgcga aaaggcgctt gccatccagt cagaaacgat ggacagcatt 1321 gagcgtaagc tgcctcactt aagcacaaga gagaaaaaac tgttgaacaa acacaccaaa 1381 agtattatta accaaatgct tcgtgatccg attttaaagg tgaaagagct tgcggcagat 1441 gctgattctg aagaaaagct cgcgttgttt atgcagattt ttgatattga agaagctgcg 1501 ggccgtcaaa tgatgaaaac cgttgaaagc agccagaagg tccactcttt taagaaggct 1561 gaatcaaaag cgggctttag cccacttgta agtgagtgaa agctgaatga ttgatactgc 1621 aatggcaaga cttaatgagg ggacaatcgt catttacgcg ttaagtgtac tcttttattt 1681 tatagatttt cttcaacaca accggaaggc tggaaaaatg gccttctggt tgctttctat 1741 tgtctggact ctgcaaaccg tgtatttggc ctattttatg tgggtgacgg ggcggtttcc 1801 ggtattaaat gtgacagagg cactttattt ttatgcctgg gtgcttgtca cgctgtcact 1861 tgtactgaca aagcttttac gtgttgactt tatcgtgttt tttacaaatg ttataggatt 1921 ttctatgatc gccattcaca cattttcacc gacagagcag cagtcagctg ctttttccgg 1981 gcagcttgta tccgagcttt tggtgattca tattacaatg gcgattcttt catacggcgc 2041 tttttccctt tctttcgttt tttctgtgct atatatgttt caatatcatg tgctgaaaaa 2101 gaaaaagtgg ggaaaatggc tgttgagaat agaagattta tctaagcttg attatatggc 2161 gtatgtttta aatgtcattg gggttccgat gctgctgctg agtttgattc tcggcgtcat 2221 ttgggcgtat gtctcactag aaacgctgta ttggtttgac gccaaagtgc ttggttcgtt 2281 tgtcgtcctg ctgctgtaca gctattatct ttatatcagg ctgattaagg agctgcaagg 2341 aaaggtcgct gcactgtgga atacggcttg ttttctggtg ctgatgatca attatttcct 2401 gcttggaagc ctgtcgcaat tccattggtt cagttaaacg atgtcccaag cagattcggg 2461 aggaaagaaa tgatgagaac gattaaagta ggttccagac ggagcaaact cgctatgact 2521 caaacaaaat gggttattca aaaactgaag gaaatcaatc cttcgtttgc ttttgaaatt 2581 aaagagatcg tgacaaaggg cgaccggatt gtcgatgtta cactctcaaa agtgggtgga 2641 aaagggcttt ttgtcaaaga aattgaacag gcgcttttaa acgaagagat tgatatggca 2701 gtgcacagca tgaaggacat gcctgctgtt ttgcctgaag gccttgtgat cggctgtatt 2761 cctgaacggg aggacccgcg tgatgccctt atttcaaaga atcgcgtaaa gctttcagaa 2821 atgaagaaag gtgctgtcat tggcacaagc agtttaagaa gaagcgcgca gcttttgatt 2881 gagcgccctg accttacaat taaatggatt agaggtaata ttgatacaag acttcaaaag 2941 ctggaaacag aggattatga cgcaattatt ttagcggctg ccggcctttc cagaatgggt 3001 tggaagcaag atgtcgtaac cgaattcctt gagcctgagc gctgtttgcc tgctgtgggg 3061 cagggagccc tggcgattga gtgccgagaa tcggatgaag agctgttggc gttgttttct 3121 cagtttacag atgaatatac aaaacggact gtcttagcgg aacgtgcttt tttaaacgcg 3181 atggagggcg gctgccaggt tccgatcgcg ggctactccg tgttaaatgg acaggatgaa 3241 attgaaatga caggtcttgt cgcttcacct gacggcaaaa tcatttttaa agaaaccgtc 3301 accggaaacg atccggagga agtaggaaag cgctgtgccg ctcttatggc tgacaaagga 3361 gcaaaagatt taattgatcg tgtaaaacgg gagcttgacg aggatggaaa atgattttcc 3421 gttgaaagga aaaacagtgc ttgtcacccg gaataaggca caggcagcat catttcagca 3481 aaaagtggag gcgcttggcg gtaaagcggt tttaacctct ttgattacgt ttcgccgcgc 3541 tttgccgaat gatgttgcgg aacaggtaag agaggatctt gccgcgccag gctggcttgt 3601 ttttacaagt gtgaacgggg cagacttctt tttttcttat ctgaaggaaa atcagcttat 3661 tctccctgcg cataaaaaaa ttgcagccgt cggtgaaaaa accgcgcgcc gtttaaaaat 3721 gcataacgta tcggttgatg tgatgccaca ggagtatatt gctgaacaat tgcgtgacgc 3781 tcttaagcag catgc // LOCUS ECOAFR1 1476 bp ds-DNA BCT 15-FEB-1990 DEFINITION E.coli AF/R1 major pili subunit (afrA) gene, complete cds. ACCESSION M32083 KEYWORDS afrA gene; major pili subunit; surface antigen. SOURCE E.coli (strain RDEC-1) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1127) AUTHORS Wolf,M.K. and Boedeker,E.C. TITLE Cloning of the genes for AF/R1 pili from rabbit enteroadherent Escherichia coli RDEC-1 and DNA sequence of the major structural subunit JOURNAL Infect. Immun. (1990) In press STANDARD full staff_entry REFERENCE 2 (bases 1 to 1476) AUTHORS Wolf,M.K. and Boedeker,E.C. JOURNAL Unpublished (1990) Walter reed Army Inst. of Res. Washington DC STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by M.K. Wolf, 13-FEB-1990. FEATURES from to/span description pept 627 1115 AF/R1 major pili subunit (afrA) site 349 390 region of dyad symmetry signal 493 498 pot. -35 region signal 515 520 pot. -10 region binding 612 616 pot. ribosome binding site BASE COUNT 447 a 278 c 287 g 462 t 2 others ORIGIN 1 bp upstream of EcoRI site. 1 gaattcccta gtgaatgtct gctgggaatc ataaaacaat ctttctgata tatccacaat 61 ttttaggttg gtaaatctta aaagaatagc cgctcgcgtt atcctgctta attgaatgta 121 tttacctaaa gtaacaccta tgttttcttt aaacagtaat tgcagatacc gtctgctgta 181 tccggagtaa tcaacgaggg catttatatc tatagatata ctctctaaat tatcatcaat 241 gtactgtgtt atcgcgttta tcgtaagtgt tttcagcatg tacgtagctc ctatatgtat 301 gtttacgtgt taccccacat catgttaata aaaccccttc tgttttttta gctgattgtg 361 cattgtacac ataccgtgca caattagcta acaacgcaga ccaatatttt ttaaaatacc 421 ccgcgttttc acatgacttg tatctattct cttagagaaa ttaatgcatc tctatcacat 481 catgtgtagt actggacaaa tagtcatggg agcctattac cgaacagcga agatggcata 541 tgttttctta ttaagaaaga ggaaagaata tggcgcactc gttttatctc aattttggta 601 aaaaaaatat atggagaatg tcagaaatga aaaaaacatt tattgcgtct gtaattgtaa 661 taaccataaa tacgggatca gcaattgctg ctcaaggcga tgttcagttc tttggtaccg 721 ttactgcgaa gacctgtgat cttgtcgttg aacacgaggg ggctgtggtc aatatgattc 781 agttgggttc tgtaactaat ggtggaacta atgctggcac cgatatcgga gcaaacaaat 841 cgtttaccct gaagccagca tcaggggtga catgcaatac catcactact gctaaaatgg 901 catggtcttc tcctgcaatg accgttaatg gtattggtaa tctatcaggt aaggctattg 961 atgcccatgt gaagttagtg gcgattaaca gcacgggtaa agttcaaact gataccaacg 1021 cagataagga aattaaagcg ggtcaaaata cagttgatta ctcaattact ggttctggcc 1081 tactgatgaa ggctttaaat ttaaagctca gttaattggc ggtaccattc caggtgactt 1141 cgatagtgct gctgcatatt ccgttgcata caactaatat ttgaatgtaa atccgggaag 1201 cccctccctt cccggattta atatttagaa cagcatattt aactggtgcc cttaactttg 1261 cttaggtgtg aagaggttag cttatgaaat taaaaacatt tcctaaaata tctctactgg 1321 ccctgagtat atggtattct cactccagct tggctgatga acttaatctg gattttatac 1381 agaacgtcag cgttattcca tcaattctga aaagtgacgc aatttacccg gaaggacaat 1441 atatcgttga cgtaaccgta aataaagaac gtatdd // LOCUS ECOCYS 5755 bp ds-DNA BCT 14-FEB-1990 DEFINITION E.coli thiosulfate binding protein (cysP), sulfate permease (cysT, cysW, cysA) and o-acetylserine (thiol)-lyase-B (cysM) genes, complete cds. ACCESSION M32101 KEYWORDS cysA gene; cysM gene; cysP gene; cysT gene; cysW gene; o-acetylserine (thiol)-lyase-B; sulfate permease; thiosulfate binding protein. SOURCE E.coli K12 DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1292 to 5755) AUTHORS Sirko,A., Hryniewicz,M., Hulamicka,D. and Boeck,A. TITLE Sulfate and thiosulfate transport in E.coli K12: Nucleotide sequence and expression of the cysTWAM gene cluster JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry REFERENCE 2 (bases 1 to 1291) AUTHORS Hryniewicz,M., Sirko,A., Palucha,A., Boeck,A. and Hulamicka,D. TITLE Sulfate and thiosulfate transport in E.coli K12: Identification of a gene encoding a novel protein involved in thiosulfate binding JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by D.Halanicka, 14-FEB-1990. FEATURES from to/span description pept 559 1575 thiosulfate binding protein (cysP) pept 1575 2408 sulfate permease (cysT) pept 2408 3283 sulfate permease (cysW) pept 3273 4370 sulfate permease (cysA) pept 4505 5416 o-acetylserine (thiol)-lyase-B (cysM; gtg start codon; EC 4.2.99.8) signal 516 521 -10 region site 5737 5755 right end of mu BASE COUNT 1259 a 1477 c 1682 g 1337 t ORIGIN 52 min on K12 map. 1 gttaacgcca tttgcccggg atacgtgcgc acaccaatgg cggaaagcat tgcccgccag 61 tcgaacccgg aagatccaga gtcggtgctg actgaaatgg cgaaagcaat cccgatgcgt 121 cgcctcgccg atccgctgga agtcggcgaa ctggcggcct tcctcgcatc ggatgaatcc 181 agctatttaa ccggtacaca gaatgtgatt gatggcggca gcacactgcc ggagacggtt 241 agcgtcggta tctgattcac ctctgtttcc tccctgcatt tgtggggagg atttcgtctt 301 gaactaagtt caccaggcta ttttatttgt cattttggcc ccgggcagtg ctcgaaatcc 361 tcacgtacta tgtgtacgct ccggtttctc cgcgctgttc gtgtccaaac tgactgcaac 421 aattacgcct gttgaaccaa gttcttattc ccttttcaac ttccaaatca ccaaacggta 481 tataaaaccg ttactccttt cacgtccgtt ataaatatga tggctattag aaagtcatta 541 aatttataag ggtgcgcaat ggccgttaac ttactgaaaa agaactcact cgcgctggtc 601 gcttctctgc tgctggcggg ccatgtacag gcaacggaac tgctgaacag ttcttatgac 661 gtctcccgcg agctgtttgc cgccctgaat ccgccgtttg agcaacaatg ggcaaaagat 721 aacggcggcg acaaactgac gataaaacaa tctcatgccg ggtcatcaaa acaggcgctg 781 gcgattttac agggcttaaa agccgacgtt gtcacttata accaggtgac cgacgtacaa 841 atcctgcacg ataaaggcaa gctgatcccg gccgactggc agtcgcgcct gccgaataat 901 agctcgccgt tctactccac catgggcttc ctggtgcgta agggtaaccc gaagaatatc 961 cacgattgga acgacctggt gcgctccgac gtgaagctga ttttcccgaa cccgaaaacg 1021 tcgggtaacg cgcgttatac ctatctggcg gcatggggcg cagcggataa agctgacggt 1081 ggtgacaaag gcaaaaccga acagtttatg acccagttcc tgaaaaacgt tgaagtgttc 1141 gatactggcg gtcgtggcgc gaccaccact tttgccgagc gcggcctggg cgatgtgctg 1201 attagcttcg aatcggaagt gaacaacatc cgtaaacagt atgaagcgca gggctttgaa 1261 gtggtgattc cgaaaaccaa cattctggcg gaattcccgg tggcgtgggt tgataaaaac 1321 gtgcaggcca acggtacgga aaaagccgcc aaagcctatc tgaactggct ctatagcccg 1381 caggcgcaaa ccatcatcac cgactattac taccgcgtga ataacccgga ggtgatggac 1441 aaactgaaag acaaattccc gcagaccgag ctgttccgcg tggaagacaa atttggctcc 1501 tggccggaag tgatgaaaac ccacttcacc agcggcggcg agttagacaa gctgttagcg 1561 gcggggcgta actgatgttt gctgtctcct ccagacgcgt gctgccgggc tttaccttaa 1621 gcctcggcac cagtctgctg tttgtgtgcc tgattttgct gctgccgctc tccgcgctgg 1681 tgatgcaact ggcccagatg agctgggcgc agtactggga ggtgatcacc aacccgcagg 1741 tggtcgcggc ctacaaagta acgctgctgt cggcgtttgt ggcatcgatt tttaacggcg 1801 ttttcggtct gctgatggcg tggatcctaa cccgctatcg cttcccaggc cgcacgctgc 1861 ttgatgcgct gatggattta ccctttgcgc tgccaacggc tgtcgccggt ttaacgctgg 1921 cctcgctctt ttccgtaaac ggtttttacg gtgaatggct ggcgaagttt gatatcaaag 1981 tcacctatac atggctgggg attgcggtgg ctatggcctt taccagcatt ccgtttgtgg 2041 tgcgtaccgt gcagccggtg ctggaagagt taggcccgga atatgaagaa gcggcggaaa 2101 cgcttggtgc aacgcgctgg cagagtttct gcaaagtggt gctgccggag ctttctccgg 2161 cgctggtggc gggcgtggcg ctgtcgttta cccgtagtct tggtgaattt ggcgcggtga 2221 tttttatcgc cggaaatatc gcgtggaaga cggaagtgac gtcgctgatg atttttgtgc 2281 gcttacagga gtttgattac ccggcagcga gcgcgattgc ttcggtgatc ctcgcggcat 2341 ctctgctgct gctgttctca attaacactc tgcaaagtcg ctttggtcgg cgtgtggtag 2401 gtcattaatg gcggaagtta cccaattgaa gcgttatgac gcgcgcccga ttaactgggg 2461 caaatggttt ctgattggca tcgggatgct ggtttcggcg ttcatcctgc tggtgccgat 2521 gatttacatc ttcgtgcagg cattcagcaa ggggctgatg ccggttttac agaatctggc 2581 cgatccggac atgctgcacg ccatctggct gacggtgatg atcgcgctga ttgccgtacc 2641 ggtaaacctg gtgttcggca ttctgctggc ctggctggtg acgcgcttta acttccctgg 2701 acgccagtta ctgctgacgc tactggacat tccgtttgcc gtatcgccgg tggttgccgg 2761 tctggtgtat ttgctgttct acggctctaa cggcccgctc ggcggttggc tcgacgagca 2821 taacctgcaa attatgttct cctggccggg aatggtgctg gtcaccatct tcgtgacgtg 2881 tccgtttgtg gtgcgcgaac tggtgccggt gatgttaagc cagggcagcc aggaagacga 2941 agcggcgatt ttgcttggcg cgtccggctg gcagatgttc cgtcgcgtca cattaccgaa 3001 catccgctgg gcgctgcttt atggcgtggt gttgaccaac gcccgcgcaa ttggcgagtt 3061 tggcgcggtg tcggtggttt ccggctcgat tcgcggcgaa accctgtcgc tgccgttaca 3121 gattgaattg ctggagcagg actacaacac cgtcggctcc tttaccgctg cggcgctgtt 3181 aacgctgatg gcgattatca ccctgttttt aaaaagtatg ttgcagtggc gcctggagaa 3241 tcaggaaaaa cgcgcacagc aggaggaaca tcatgagcat tgagattgcc aatattaaga 3301 agtcgtttgg tcgcacccag gtgctgaacg atatctcact ggatattcct tcaggtcaga 3361 tggtcgcgtt gctggggccg tccggttccg ggaaaaccac gctgctgcgc attatcgccg 3421 ggctggagca tcaaaccagc gggcatattc gcttccacgg caccgacgtg agccgcctgc 3481 acgcacgtga tcgtaaagtc ggtttcgtgt tccagcatta cgcgctgttc cgccatatga 3541 cggtgttcga caatatcgct tttggcctga cggtgctgcc gcgtcgcgag cgcccgaatg 3601 ccgcagccat caaagcgaaa gtgacaaaat tgctggaaat ggtccagctt gcccatctgg 3661 cggatcgtta tccggcgcac gtttccggcg gccagaaaca gcgcgtggcg ctggcgcgcg 3721 cgctggctgt ggaaccgcaa attctgctgc ttgatgaacc gtttggcgcg ctggatgcgc 3781 aggtgcgtaa agagctgcgt cgctggctgc gtcaactcca tgaagaacta aaattcacca 3841 gcgtttttgt gacccacgat caggaagaag cgaccgaagt agctgatcgt gtagttgtga 3901 tgagccaggg caatattgaa caggctgacg cgccggatca ggtatggcgc gaaccggcga 3961 cccgttttgt gctcgaattt atgggcgaag tgaaccgcct gcagggaacc attcgcggcg 4021 ggcagttcca tgttggcgcg catcgctggc cgctgggcta cacacctgcg tatcaggggc 4081 cggtggatct cttcctgcgc ccttgggaag tggatatcag ccgccgtacc agcctcgatt 4141 cgccgctgcc ggtacaggta ctggaagcca gcccgaaagg tcactacacc caattagtgg 4201 tgcagccgct ggggtggtac aacgaaccgc tgacggtcgt gatgcatggc gacgatgccc 4261 cgcagcgtgg cgagcgttta ttcgttggtc tgcaacatgc gcggctgtat aacggcgacg 4321 agcgtatcga aacccgcgat gaggaacttg ctctcgcaca aagcgcctga taggttgagt 4381 gaatgttaaa cgcccggagg cgcttcccgc gagtccgggc tttttaatgg caaggtttgt 4441 aacctgtaga cctgataaga cgcgcaagcg tcgcatcagg caacaccacg tatggataga 4501 gatcgtgagt acattagaac aaacaatagg caatacgcct ctggtgaagt tgcagcgaat 4561 ggggccggat aacggcagtg aagtgtggtt aaaactggaa ggcaataacc cggcaggttc 4621 ggtgaaagat cgtgcggcac tttcgatgat cgtcgaggcg gaaaagcgcg gggaaattaa 4681 accgggtgat gtcttaatcg aagccaccag tggtaacacc ggcattgcgc tggcaatgat 4741 tgccgcgctg aaaggctatc gcatgaaatt gctgatgccc gacaacatga gccaggaacg 4801 ccgtgcggcg atgcgtgctt atggtgcgga actgattctt gtcaccaaag agcagggcat 4861 ggaaggtgcg cgcgatctgg cgctggagat ggcgaatcgt ggcgaaggaa agctgctcga 4921 tcagttcaat aatcccgata acccttatgc gcattacacc accactgggc cggaaatctg 4981 gcagcaaacc ggcgggcgca tcactcattt tgtctccagc atggggacga ccggcactat 5041 caccggcgtc tcacgcttta tgcgcgaaca atccaaaccg gtgaccattg tcggcctgca 5101 accggaagag ggcagcagca ttcccggcat tcgccgctgg cctacggaat atctgccggg 5161 gattttcaac gcttctctgg tggatgaggt gctggatatt catcagcgcg atgcggaaaa 5221 caccatgcgc gaactggcgg tgcgggaagg aatattctgt ggcgtcagct ccggcggcgc 5281 ggttgccgga gcactgcggg tggcaaaagc taaccctgac gcggtggtgg tggcgatcat 5341 ctgcgatcgt ggcgatcgct acctttctac cggggtgttt ggggaagagc attttagcca 5401 gggggcgggg atttaaggat taatagcatc ggagactgat gacaaacgca aaactgcctg 5461 atgcgctacg cttatcaggc ctacaaggtt tctgcaatat attgaattag cacgattttg 5521 taggccggat aaggcgttta cgccgcatcc ggcataaaca aagcgcactt ttttaacagt 5581 tgttgctgcc gacaaatgca gtatttaatt ttcgtgagga aacgccgtaa ggtcattgaa 5641 gcggcgcacg aaaaaccgaa agcgtttcac gataaatgcg aaaactttac gtttcgcgct 5701 tcaaatgaaa cagatgtatt aattactact ttttattcat tacatgggga tccag // LOCUS HUMCOLA2I 1994 bp ds-DNA PRI 15-FEB-1990 DEFINITION Human collagen type I alpha-2 (COL1A2) gene, exon 1 (partial). ACCESSION M31886 KEYWORDS collagen. SOURCE Human DNA, clone pCOL-alpha-2-bGH. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1994) AUTHORS Sherwood,A.L., Bottenus,R.E., Martzen,M.R. and Bornstein,P. TITLE Structural and functional analysis of the first intron of the human alpha-2(I) collagen-encoding gene JOURNAL Gene (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by P.Bornstein, 02-FEB-1990. FEATURES from to/span description pre-msg < 1 > 1994 collagen alpha-2 type I mRNA and intron /nomgen="COL1A2" /map="7q21.3-q22.1" /hgml_locus_uid="LP0002V" IVS 156 > 1994 COL1A2 intron A binding 141 154 NF1 binding site binding 1034 1040 AP1 binding site binding 1061 1067 AP1 binding site site 1370 1409 gt-rich region BASE COUNT 580 a 413 c 456 g 545 t ORIGIN 1 gcatgcccgc gcccgccagg tgatacctcc gccggtgacc caggggctct gcgacacaag 61 gagtctgcat gtctaagtgc tagacatgct cagctttgtg gatacgcgga ctttgttgct 121 gcttgcagta accttatgcc tagcaacatg ccaatgtaag tgccttcagc ttgtttgggg 181 gagactgggt agagaggtta gatgggaggg caccctgccc tgaaaaggaa aacctgtaac 241 ctgaattcca ggtacacttg gagggcagac tctcaggcat gtgggaaaac gccggaattg 301 ataagaaaca tggaaattac tttaaaaaat gaaaacataa aagccttgcc aaaagttagg 361 gaacttttcc tctaagttca gagtgagaca gttaactcgg tctggctcct cagcttagta 421 acccccaaag ggagcggaag gtctttttcc ctaaggatga gatattaacg accaatgtgg 481 tggaggaagt caagggcctg caccccacag gccccataac cgcactgatg tccaccttgt 541 aaaacttgag gcctgcgtta gaaagccctt caactgagta atgtaaaact cacctcctaa 601 gagcttttat cttctgggca ttgtaaggct tgtccggagg aggaggatga cgatgctgat 661 atgatgatgg ttataaggcg ccctctggag gaaggaaaat gaaagtacag gggacagggc 721 cttaagcaga tggaatccca attaaagctt ctacggattt atacagatta atgatcagca 781 tttctggttg gagcctttcc cagtggctag tcagtgaacc ctggaaagaa gaatggatgc 841 tacttggagt gggtacattc tgaaaagtaa tataagtgtc tcaattcact ttctagtcat 901 ggaaatggta acatttttta actcaaatct gctctaaatt ttgtttgagc ctgagaatta 961 cccctttgac atgttcccag tgataagcaa acattatgaa cgcagcaagt tgagaaatat 1021 caacattgag atgagactca agagaccggg gtttttccca tgagtctgac accaatttgc 1081 tgcgtgactt tgggcaagtc aaacggcctt ttctaaaatg tgagacagag attaaaggga 1141 ccccaaggcc actttccagc tctaggttcc atggccagac tttcatgtca acagagaatg 1201 aagaagatca gtccgttttc atcttgaaaa tggctgccaa agtgctagac aaagatattg 1261 actagatggg ggatggtatt gtctgaccac acccagtact ccaaaaagtt gttccaccca 1321 cacagcacgg tgtctaccac tgcataattt ctaatgcatt tgtgtgcttg tgtgtgtgtg 1381 tgtgtgtgtg tgtctgtgtg tctgtgtgtc tcttccccct tcattcactt ttagtataca 1441 tactgtggat actaaggagt aattgcagtg aacaaattca cattaccgag ttcatatttt 1501 taatgagatc ttgagagtgg gaggaaagag tcggctccta gagaataaaa tgaaggcaga 1561 cttagggaaa tttgaaggta caaaggcaac ttaccttctg atcaacagcc aaccacagtc 1621 tggaataaat gttatcaaac acacattctt caaaatggtc cgtgtctgag taattaaaag 1681 gcaaatttcc aaaatcataa ggacttccgt taatcaagtc aggcataatt attcttccta 1741 ctgatgacac aatgaagtaa acatatcatt cttgtaattt aacagtaatt ctcgtaaatt 1801 gcccttaaat gtcagtgctg gatgtggtcc accctcctaa attgtgactg ttgcaacaga 1861 tgttctcact tcaaataacg cacttcttgg ccacctaatt aaagcaattt ttggggtgat 1921 tcatcctact gcaagcttgg ccacacttgt atcctgtatt aacctataat ttttgtaccg 1981 taggagaaga attc // LOCUS HUMP120PC 2612 bp ss-mRNA PRI 14-FEB-1990 DEFINITION Human proliferating-cell nucleolar protein P120 mRNA, complete cds. ACCESSION M32110 KEYWORDS proliferating cell nuclear protein. SOURCE Human fetal liver cell line CML and testis, cDNA to mRNA, and lymph node DNA (bases 1 to 30). ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2612) AUTHORS Fonagy,A., Henning,D., Jhiang,S., Haidar,M., Busch,R.K., Larson,R., Valdez,B. and Busch,H. TITLE Cloning of the cDNA and sequence of the human proliferating cell nuclear protein P120 JOURNAL Cancer Communications 1, 243-251 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by H.Busch, 14-FEB-1990. FEATURES from to/span description pept < 1 2612 proliferating cell nuclear protein P120 (AA at 3) BASE COUNT 674 a 701 c 725 g 512 t ORIGIN 346 bp upstream of AvaI site. 1 cacgcgcgac gccaccttct cccatttctg cctgccacag taccatgggg cgcaagttgg 61 accctacgaa ggagaagcgg gggccaggcc gaaaggcccg gaagcagaag ggtgccgaga 121 cagaactcgt cagattcttg cctgcagtaa gtgacgaaaa ttccaagagg ctgtctagtc 181 gtgctcgaaa gagggcagcc aagaggagat tgggctctgt tgaagcccct aagacaaata 241 agtctcctga ggccaaacca tcgcctggaa agctaccaaa agggatctct gcaggagctg 301 tccagacagc tggtaagaag ggaccccagt ccctatttaa tgctcctcga ggcaagaagc 361 gcccagcacc tggcagtgat gaggaagagg aggaggaaga ctctgaagaa gatggtatgg 421 tgaaccacgg ggacctctgg ggctccgagg acgatgctga tacggtagat gactatggag 481 ctgactccaa ctctgaggat gaggaggaag gtgaagcgtt gctgcccatt gaaagagctg 541 ctcggaagca gaaggcccgg gaagctgctg ctgggatcca gtggagtgaa gaggagaccg 601 aggacgagga ggaagagaaa gaagtgaccc ctgagtcagg ccccccaaag gtggaagagg 661 cagatggggg cctgcagatc aatgtggatg aggaaccatt tgtgctgccc cctgctgggg 721 agatggagca ggatgcccag gctccagacc tgcaacgagt tcacaagcgg atccaggata 781 ttgtgggaat tctgcgtgat tttggggctc agcgggagga agggcggtct cgttctgaat 841 acctgaaccg gctcaagaag gatctggcca tttactactc ctatggagac ttcctgcttg 901 gcaagctcat ggacctcttc cctctgtctg agctggtgga gttcttagaa gctaatgagg 961 tgcctcggcc cgtcaccctc cggaccaata ccttgaaaac ccgacgccga gaccttgcac 1021 aggctctaat caatcgtggg gttaacctgg atcccctggg caagtggtca aagactggac 1081 tagtggtgta tgattcttct gtgcccattg gtgctacccc cgagtacctg gctgggcact 1141 acatgctgca gggagcctcc agcatgttgc ccgtcatggc cttggcaccc caggaacatg 1201 agcggatcct ggacatgtgt tgtgcccctg gaggaaagac cagctacatg gcccagctga 1261 tgaagaacac gggtgtgatc cttgccaatg acgccaatgc tgagcggctc aagagtgttg 1321 tgggcaactt gcatcggctg ggagtcacca acaccattat cagccactat gatgggcgcc 1381 agttccccaa ggtggtgggg ggctttgacc gagtactgct ggatgctccc tgcagtggca 1441 ctggggtcat ctccaaggat ccagccgtga agactaacaa ggatgagaag gacatcctgc 1501 gctgtgctca cctccagaag gagttgctcc tgagtgctat tgactctgtc aatgcgacct 1561 ccaagacagg aggctacctg gtttactgca cctgttctat cacagtagaa gagaatgagt 1621 gggtggtaga ctatgctctg aaaaagagga atgtgcgact ggtgcccacg ggcctagact 1681 ttggccagga aggttttacc cgctttcgag aaaggcgctt ccaccccagt ctgcgttcta 1741 cccgacgctt ctaccctcat acccacaata tggatgggtt cttcattgcc aagttcaaga 1801 aattttccaa ttctatccct cagtcccaga caggaaattc tgaaacagcc acacctacaa 1861 atgtagactt gcctcaggtc atccccaagt ctgagaacag cagccagcca gccaagaaag 1921 ccaagggggc tggaaagaca aagcagcagc tgcagaaaca gcaacatccc aagaaggcct 1981 ccttccagaa gctgaatggc atctccaaag gggcagactc agaattgtcc actgtacctt 2041 ctgtcacaaa gacccaagct tcctccagct tccaggatag cagtcagcca gctggaaaag 2101 ccgaagggat cagggagcca aaggtgactg ggaagctaaa gcaacgatca cctaaattac 2161 agtcctccaa gaaagttgct ttcctcaggc agaatgcccc tcccaagggc acagacacac 2221 aaacaccggc tgtgttatcc ccatccaaga ctcaggccac cctgaaacct aaggaccatc 2281 atcagcccct tggaagggcc aagggggttg agaagcagca gttcgcagag cagccttttg 2341 agaaagctgc cttccagaaa cagaatgata cccccaaggg cctcagcctc ccactgtgtc 2401 tcccatccgt tccagccgcc ccccaccagc aaagaggaag aaatctcagt ccaggggcaa 2461 cagccagctg ctgctatctt agatggttga aaactagacg ggtggctcac tgccattgtc 2521 accaggttgg aactcttgcc tctgtgagga tgccttctct actgtgcata cccatgaaat 2581 ttaatacaca ttttaaaacc tctggccact ga // LOCUS MUSH2RIIBP 2204 bp ss-mRNA ROD 10-APR-1990 DEFINITION Mouse MHC class I regulatory element binding protein (H-2RIIBP) mRNA, 3' end. ACCESSION M26804 KEYWORDS MHC class I regulatory element binding protein. SOURCE Mouse liver, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2204) AUTHORS Hamada,K., Gleason,S.L., Levi,B.-Z., Hirschfeld,S., Appella,E. and Ozato,K. TITLE H-2RIIBP, a member of the nuclear hormone receptor superfamily that binds to both the regulatory element of major histocompatibility class I genes and the estrogen response element JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 8289-8293 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by author, 10-AUG-1989. FEATURES from to/span description pept < 1 1341 MHC class I regulatory element binding protein (H-2RIIBP) mRNA < 1 2204 H-2RIIBP mRNA BASE COUNT 436 a 645 c 648 g 475 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattccccc gaagcccaga cagctcctcc ccaaatcccc tttctcaggg gatccgtccg 61 tcttctcctc ctggcccacc tcttacccct tcagcacctc cacctccaat gccacccccg 121 ccactgggct cccccttccc agtcatcagt tcttccatgg ggtcccctgg tctgccccct 181 ccggctcccc caggattctc cgggcctgtc agcagccctc agatcaactc cacagtgtcg 241 ctccctgggg gtgggtctgg cccccctgaa gatgtgaagc caccggtctt aggggtccgg 301 ggcctgcact gtccaccccc tccaggtggt cctggggctg gcaaacggct ctgtgcaatc 361 tgcggggacc gaagctcagg caagcactat ggggtttaca gctgcgaggg ctgcaagggt 421 ttcttcaagc gcaccattcg gaaggacctg acctactcgt gtcgtgataa caaagactgt 481 acagtggaca agcgccagcg gaatcgctgt cagtactgtc gctatcagaa gtgcctggcc 541 actggcatga aaagggaggc ggttcaggag gagcgtcaac gggggaagga caaagacggg 601 gatggagatg gggctggggg agcccctgag gagatgcctg tggacaggat cctggaggca 661 gagcttgctg tggagcagaa gagtgaccaa ggcgttgagg gtcctggggc caccgggggt 721 ggtggcagca gcccaaatga cccagtgact aacatctgcc aggcagctga caaacagctg 781 ttcacactcg ttgagtgggc aaagaggatc ccgcacttct cctccctacc tctggacgat 841 caggtcatac tgctgcgggc aggctggaac gagctcctca ttgcgtcctt ctcccatcgg 901 tccattgatg tccgagatgg catcctcctg gccacgggtc ttcatgtgca cagaaactca 961 gcccattccg caggcgtggg agccatcttt gatcgggtgc tgacagagct agtgtccaaa 1021 atgcgtgaca tgaggatgga caagacagag cttggctgcc tgcgggcaat catactgttt 1081 aatccagacg ccaagggcct ctccaaccct ggagaggtgg agatccttcg ggagaaggtg 1141 tacgcctcac tggagaccta ttgcaagcag aagtaccctg agcagcaggg ccggtttgcc 1201 aagctgctgt tacgtcttcc tgccctccgc tccatcggcc tcaagtgtct ggagcacctg 1261 ttcttcttca agctcattgg cgacaccccc attgacacct tcctcatgga gatgcttgag 1321 gctccccacc agctagcctg agcccagatg cacaccgagt gtcactgagg aggacttgag 1381 cctgggcagg gggcagagcc atgggacagg tgcagagcag gaggggactt gcccagcctg 1441 ccagggatct ggcaacactt agcagggttc gcttggtctc caagtcgaag gggaccccag 1501 atccctgtga ggactttatg tctaccttca gtggccttga gtctctgaat ttgtcggggt 1561 ctcccatggt gcaggtgatt cttcatcctg gctccccagc acaaagcact gccctgcttc 1621 cttctcattt ggcctcactc ccttctgaag agtggaacag agctccccca gaaaggggtg 1681 ttgtggggca ggccccccaa gctgatgatc atgggagcag ggctctgaca gcctttatcc 1741 tctcagactt gacagatggg ggcagaggag ggacctgcct ctgtctcctg tcagccccat 1801 ttccacagtc cctcctgcag tcagactgaa gaataaaggg gtagtgaagg ggctgctgga 1861 ggtggaggaa cccattgctc ttttaatttc ctgtgaggag agactgggag ttagactcaa 1921 agaagtactg tacatcccca ggttgactta aatgtcaggg ctggagatgg catgtgggca 1981 aggaggcccc tcaggtgggc tgtcccaaag ctccctgggc tctgcctcgg gtggccctac 2041 agctcttccc tagtcttaag cacagctagg ctgggagcaa gtggggacat tgatgggggt 2101 ggccagcctg cagagttggg tgctgggctg catggttttt gccctggacc tcttttgggg 2161 gttccctccc atctttcact tgcacataaa gttgctttcc agtt // LOCUS MUSID 927 bp ss-mRNA ROD 15-FEB-1990 DEFINITION Mouse helix-loop-helix DNA binding protein regulator (Id) mRNA, 3' end. ACCESSION M31885 KEYWORDS helix-loop-helix DNA binding protein regulator; helix-loop-helix protein; regulatory protein. SOURCE Mouse (strain DBA2) erythroleukemia cell line MEL, cDNA to mRNA, clone pMH18. REFERENCE 1 (bases 1 to 927) AUTHORS Benezra,R., Davis,R.L., Lockshon,D., Turner,D.L. and Weintraub,H. TITLE The protein Id: A negative regulator of helix-loop DNA binding proteins JOURNAL Cell (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by R.Benezra, 02-FEB-1990 FEATURES from to/span description pept < 1 533 helix-loop-helix protein (Id) (AA at 3) site 261 305 helix 1 site 306 335 loop site 336 383 helix 2 BASE COUNT 192 a 256 c 259 g 220 t ORIGIN 1 attgtacaac ctttctccaa cttcttgttc tcttcccaca ctctgttctc agcctcctcc 61 gctcccctcc gcctgttctc aggatcatga aggtcgccag tggcagtgcc gcagccgctg 121 caggccctag ctgttcgctg aaggcgggca ggacagcggg cgaggtggta cttggtctgt 181 cggagcaaag cgtggccatc tcgcgctgcg ctgggacgcg cctgcccgcc ttgctggacg 241 agcagcaggt gaacgtcctg ctctacgaca tgaacggctg ctactcacgc ctcaaggagc 301 tggtgcccac cctgccccag aaccgcaaag tgagcaaggt ggagatcctg cagcatgtaa 361 tcgactacat cagggacctg cagctggagc tgaactcgga gtctgaagtc gggaccaccg 421 gaggccgggg actgcctgtc cgcgccccgc tcagcaccct gaacggcgag atcagtgcct 481 tggcggccga ggcggcatgt gttccagccg acgatcgcat cttgtgtcgc tgaggcggcg 541 cactgaggga ccagatggac tccagccctt caggaggcaa gaggaaaaaa gtgctctcgg 601 ttccccaggg gatctctggg aaagacacta ccgcagccac cggactcttg gcggatcggt 661 ccagtgggta gagggtttga tcaacagagc ctcaccctct ccacctttca gcctccagag 721 actttgggga gggggttaat caaccccgcg tgtttctgtt ttattgaaaa agcagacatt 781 ttttttaaat ggtcacattt cgtgcttctc ggatttctga ggaaatattt tatattgtat 841 attacaatga tcactggctg aaaatattgt tttacaatag ttctatgggg gtgggttttt 901 tgttgttatt aaacaaacac tttagat // LOCUS MZEPPDK 644 bp ds-DNA PLN 16-FEB-1990 DEFINITION Z.mays pyruvate orthophosphate dikinase (PPDK) gene, 3' end. ACCESSION M32081 KEYWORDS pyruvate orthophosphate dikinase. SOURCE Z.mays mays (strain B73) DNA. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 644) AUTHORS Glackin,C.A. and Grula,J.W. TITLE Organ-specific transcripts of different size and abundance JOURNAL Unpublished (1990) In Press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by C.A.Glacken, 13-FEB-1990. FEATURES from to/span description pept < 1 23 pyruvate orthophosphate dikinase (PPDK) (AA at 3) mRNA < 1 322 PPDK mRNA BASE COUNT 150 a 148 c 148 g 198 t ORIGIN 1 ctgcagctca ggtgcttgtc tgaggggctg cctcctcgtt ggcagcctgc ctgcagctca 61 ggtgcttgtc tgaggggctg cctcctcgtt ggcagcctgc tgttggtgca tgctggtgat 121 taataatact actatgacag agccatatgc tgttggtgca tgctggtgat taataatact 181 actatgacag agccatatgc tctgtgaaga gtattagtag cagcgctcat aaaagctaca 241 gttccatcta tctgtgaaga gtattagtag cagcgctcat aaaagctaca gttccatcta 301 ttttctcagc tatgtaaaac ttccaaactg ttcatgctta aaactgaggg ttttctcagc 361 tatgtaaaac ttccaaactg ttcatgctta aaactgaggg ttttcgtggt gtgagatgtg 421 catgtcgttg ttgaggccat tgctgcacat ttttcgtggt gtgagatgtg catgtcgttg 481 ttgaggccat tgctgcacat tccacctatt gaggccctcc tcaaattaag cctcgaacaa 541 gctgatcatc tccacctatt gaggccctcc tcaaattaag cctcgaacaa gctgatcatc 601 ttttctgaga actctagact cgttttctga gaactctaga ctcg // LOCUS RATCLATP 4350 bp ss-mRNA ROD 15-FEB-1990 DEFINITION Rat ATP citrate-lyase mRNA, complete cds. ACCESSION J05210 KEYWORDS ATP citrate-lyase. SOURCE Rat liver, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 4350) AUTHORS Elshourbagy,N.A., Near,J.C., Metz,P.J., Sathe,G.M., Southan,C., Stickler,J.E., Gross,M., Young,J.F., Wells,T.N.C. and Groot,P.H.E. TITLE Rat ATP citrate-lyase: Molecular cloning and sequencing analysis of a full length cDNA and mRNA abundance as a function of diet, organ, and age JOURNAL J. Biol. Chem. 265, 1430-1435 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by N.A.Elshourbagy, 13-FEB-1990. FEATURES from to/span description pept 73 3375 ATP citrate-lyase mRNA < 1 4350 ATP cytrate-lyase mRNA BASE COUNT 1084 a 1114 c 1136 g 1016 t ORIGIN 1 taagctggtg cttacggaca gagagccaca ctcgggcttt ctcgaagagg taaaccaggt 61 ccctctgcag ccatgtcagc caaggcaatt tcagagcaga ccggcaaaga actcctttac 121 aagtacatct gtaccacctc agccatccag aaccggttca agtatgcccg ggttactccc 181 gacacagact gggcccatct cctgcaggac cacccctggc tgcttagcca gagcttggta 241 gtcaagccgg accagctgat caaacgtcga ggaaagcttg gtctagtcgg ggtcaacctc 301 tctctggatg gagtcaaatc ctggctgaaa cctcgactgg gacatgaggc caccgtcggc 361 aaggccaaag gcttcctcaa gaactttctg attgagccct tcgtccccca cagtcaggcg 421 gaggagttct acgtgtgcat ctatgctacc cgggaaggag actacgtcct gttccaccat 481 gaagggggtg tggatgtggg cgatgtggac accaaagccc agaagctgct tgtgggtgtg 541 gacgagaaac tgaacgctga agacattaag agacacctgt tggtccacgc ccccgaagac 601 aagaaagaaa tcctggccag cttcatctcc ggcctattca atttctacga agatctttac 661 ttcacctacc ttgagatcaa cccccttgtg gtgaccaaag atggtgtcta catccttgac 721 ctggcggcca aggtggacgc cactgctgac tacatctgca aagtcaagtg gggtgatata 781 gagttccctc ccccctttgg gcgtgaggca tacccagagg aagcctacat tgcagacctg 841 gatgccaaaa gtggggcgag cttgaagctg accttgctga accccaaggg gcggatctgg 901 accatggttg ccgggggtgg cgcctctgtc gtgtacagtg ataccatctg tgatcttgga 961 ggtgtcaacg aactggcgaa ttacggggag tactctggtg cccccagtga acaacagacc 1021 tatgactacg ccaagaccat cctctcactt atgactcgag agaagcaccc ggatggcaag 1081 atcctcatca ttggaggcag cattgcaaac ttcaccaacg tggccgccac cttcaagggc 1141 attgtgagag caattcgaga ttaccagggt tccctgaagg agcacgaggt caccatcttt 1201 gttcgaagag gtggcccgaa ctatcaagag ggattacgag tgatgggaga agttgggaag 1261 accactggaa tccccatcca tgtctttggc acagaaactc acatgacggc cattgtgggc 1321 atggcctggg caccggccat tcccaaccag ccacccacag cggctcacac tgccaacttc 1381 ctccttaatg ccagtgggag cacatcgaca ccagcaccca gcaggacagc gtctttttcc 1441 gagtccagag ctgacgaggt ggcccctgca aagaaagcca agccagccat gccccaagat 1501 tcagtcccaa gtccaagatc cctgcaagga aagagtgcca ccctcttcag ccgacatacc 1561 aaggctatcg tatggggcat gcagacccgg gctgtgcaag gcatgctgga ctttgactac 1621 gtgtgctccc gagatgagcc ttcagtggct gctatggtct acccgttcac gggggatcat 1681 aagcagaagt tttactgggg acacaaggaa atcctgatcc ctgtcttcaa gaacatggct 1741 gacgccatga aaaagcatcc ggaggtagac gtgctgatca actttgcatc tctgcgatcg 1801 gcttatgaca gcaccatgga gaccatgaac tatgcacaga tccggaccat agccatcata 1861 gcagaaggca tccctgaggc tctcacacgg aagctcatca agaaggcaga ccagaagggc 1921 gtgaccatca ttgggccagc cacggttggg ggcatcaagc ctggatgctt taagattggg 1981 aatactggtg ggatgctgga caacatcctg gcctccaaac tgtatcgccc aggcagtgtg 2041 gcctacgtct cgcgttcagg aggcatgtct aacgaactca ataatatcat ctctcggacc 2101 acagatggtg tctacgaggg tgttgccatc ggcggggaca ggtaccctgg gtccacattc 2161 atggatcacg tgctgcgtta ccaagacact ccaggagtca agatgattgt agttcttggg 2221 gagatagggg gtacagaaga atataagatc tgccggggca tcaaggaggg ccgcctcacc 2281 aagccagtgg tctgctggtg catcgggacc tgtgccacca tgttctcttc tgaggtccag 2341 tttggccacg ctggggcttg tgccaaccag gcttctgaaa cggcagtagc caagaaccag 2401 gccttgaagg aagcgggagt gtttgtgccc cgaagctttg atgagctcgg agaaatcatt 2461 cagtccgtgt atgaagatct tgtggccaaa ggcgccattg tacctgctca ggaagtgcca 2521 cctccaacag tacccatgga ctactcttgg gccagggagc tgggtttaat ccgaaaacct 2581 gcctcattca tgaccagcat ctgtgacgag cgggggcagg aactcattta tgcgggcatg 2641 cccatcaccg aggtcttcaa ggaagagatg ggcattggtg gtgtcctggg cctcctctgg 2701 ttccagagaa ggttgcccaa gtattcctgc cagttcattg agatgtgtct catggtcacc 2761 gctgatcacg ggccagctgt ctccggggcc cataacacta tcatctgtgc tcgggctggg 2821 aaggacctgg tctccagcct cacctcaggg ctgctcacca ttggggaccg gtttgggggt 2881 gccttggacg cagcagcgaa gatgttcagt aaagcctttg acagcggcat tattcccatg 2941 gagtttgtga acaagatgaa gaaggagggg aaactgatca tgggcatcgg ccatcgagtc 3001 aaatcgataa acaacccaga catgcgagtg cagatcctca aagactttgt caaacagcac 3061 ttccccgcca ccccgctgct cgactatgca ctggaagtgg agaaaatcac cacctcaaag 3121 aagccaaatc ttatcctgaa cgtggatggt ttcatcggcg ttgcgtttgt ggacatgctt 3181 aggaactgtg gctccttcac ccgggaggaa gctgacgagt atgttgacat tggagccctc 3241 aatggcgtct ttgtgctggg aaggagtatg ggcttcatcg ggcactatct tgaccagaag 3301 aggctgaagc aagggctgta tcgtcacccc tgggacgaca tttcctatgt tctcccggaa 3361 cacatgagca tgtaaccgag ccagcagccc taccgtagaa aaaggaagac aaaaactccc 3421 tcctcgacaa tatagcggac agacagctgg aaacagagcc cgttatgggc tgggcctgga 3481 atggaaatag ccattgatgt gcaggcatgg aaagccaaca ccacaggccc attcagtcca 3541 cacagagaag cttagtattt ttttttatat atatatctat atatatataa gcatagaaat 3601 ttaaaaccaa gccaatactt gtgacgtttg cgctgctacc tgctgtatct attacatgga 3661 agactgtaag caagcgctgt cagaataatg ttcttctagg gccttatgat gttgctttct 3721 ttttttaatt agttgaaaat ttatttttcc tctagaacta gtggatccga cttttaagac 3781 ttcaggatac tatctgtttg taggaccact gtctggtatc ccacctccca ctcatcttca 3841 caccacatga agaacactgt attaatctga ttttttagga tctttttttt tttttttgtg 3901 ttatgtgtta agggtttatt tagtatccca ctgaaacgtt ctgtgtttcg gaccaatgtc 3961 tacttatgtc aaggggagga gggttggggc cattgtaccc ttagccatcg tcacacatgt 4021 ggagtagtaa cttaaatgta aagttgtaac atacaagtgt ttaaaatgga aaccgcaaag 4081 caaaaagctg tgaaacgtct cgtgtcttgt gttctctgtg ttcatgcagc tgacttgtct 4141 gttactgaag tgtgggtcca aagactcaca tctgttccgc atctgtaacc cacagagatt 4201 ctggcagctg ccacctcagt ctcttctctg tattatcatg tttggtttaa ataaactaga 4261 tagtaaaaag aattcctgca gcccggggga tccactagtt ctagagcggc gcaccgcggt 4321 ggagctccag cttttgttcc ctttagtgag // LOCUS RATLOX 5351 bp ss-mRNA ROD 15-FEB-1990 DEFINITION Rat aorta lysyl oxidase mRNA, complete cds. ACCESSION J02903 KEYWORDS lysyl oxidase. SOURCE Rat neonatal aorta, cDNA to mRNA, (library of Clontech), clones 7, 13, 8-1 and IIB. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 5351) AUTHORS Trackman,P.C., Pratt,A.M., Wolanski,A., Tang,S.-S., Offner,G.D., Troxler,R.F. and Kagan,H.M. TITLE Cloning of rat aorta lysyl oxidase cDNA: Complete codons and predicted amino acid sequence JOURNAL Biochemistry (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by P.C.Trackman, 07-FEB-1990. FEATURES from to/span description pept 287 1516 lysyl oxidase precursor sigp 287 349 pot. lysyl oxidase signal peptide matp 350 1513 lysyl oxidase site 2252 2274 region of dyad symmetry BASE COUNT 1302 a 1528 c 1245 g 1276 t ORIGIN 192 bp upstream of TaqI site. 1 cttctacttc agacactgtg cgctctcccg gaccgtcgtg cgccgctccc cgtcgccttc 61 caggactggg aaaggggaga ggaggacggt gccacgtccg acggcctcct gggctggggg 121 cagggtctgc cgctcgccct tggcaccagt ccctgcgacc catccccgag cctcgccctc 181 ctcctccctg ctcgaagagg tctccctcct tcgcgggatc tgagtccctg tcttcatttt 241 tctcctagcc acgtccctcc ccgagaaggg acgagccggg agcatcatgc gtttcgcctg 301 gaccgtgctc tttctgggac agctgcagtt ctgtcccctt ctccgctgcg ccccgcaggc 361 cccgcgcgag cctcccgccg cccccggtgc ctggcgccag acaatccaat gggagaacaa 421 cgggcaggtg ttcagtctgt tgagcctggg ggcgcagtac cagcctcagc gacgccgcga 481 ctccagcgcc actgccccga gagccgacgc aacgctgcag cacagccacg cacgcccatt 541 ctgctgctgc gtgacaaccg cactgcctct gcccgtgcga ggactccaag cccatctggg 601 gtcgccgcgg gtcgtccccg gcccgcagcc cgccactggt tccaagttgg tttctcgccg 661 tcgggggccg gcgatggagc ctcaaggcgc gcagaaccgg actgcgtcgc cacagcctcc 721 gcagctcagt aatctgaggc cacccagcca cgtagatcgc atggtggcga cgacccctac 781 aatccctaca agtactccga cgacaacccc tattataact actatgacac ttatgagaga 841 ccggtccggg agcaggcacc gacctggata tggcaccggt tacttccagt acggtctccc 901 ggacctggta cccgatccct actacatcca ggcatccacg tacgtacaaa agatgtctat 961 gtacaacctg agatgcgctg cggaagaaaa ctgcctggcc agttcagcat atagggcgga 1021 tgtcagagac tatgaccaca gggtactgct acgatttcct cagagagtga aaaaccaagg 1081 gacgtctgac ttcttaccaa gccgcccccg ctactcctgg gagtggcaca gctgccacca 1141 acattaccac agcatggatg aattcagcca ctacgacctg ctggatgcca gcacacagag 1201 gagagtggcc gagggccaca aagcaagctt ctgtctggag gacacttcct gtgattatgg 1261 gtaccacaga cgatttgcct gtactgcaca cacacagggg ttgagtcccg gatgttatga 1321 tacttatgca gcagacatag actgccagtg gattgatatt acagatgtac aacccggaaa 1381 ttacattcta aaggtcagtg taaaccccag ctacctggtg cctgaatcag actacagtaa 1441 caatgtcgta cgctgtgaaa ttcgctacac aggacatcac gcctatgcct caggctgcac 1501 catttcaccg tattagaaag aagctcacct tcccaaagga tgaagcagta cctggtgttt 1561 ggacctatga aaaccgtaga ttagcttaag taggaagact tagatatttt aaaaggcaaa 1621 cggaaaaaca acaaagaagg ttttgtttgg actctttcac aacaaatcac ataactggat 1681 tttgagtgtt taaatcagca ttagattggc acattttaaa tacttattca tgttgcttta 1741 tgaagtaatg gtgtttcaat tctgtgggtg catagtgggc tctttcaaag aattctgaat 1801 ttcttacctt cttttgaaat tatagtgcaa aaagaagagg atattttaat gaatgagcca 1861 caatttgaac tgattacttt ctaaattgcc agacccatga gacaatgatg atgggtttgt 1921 atttgcctca acatagattc gctttttaaa aagggtgttc ctattgtata ggcaaaaatg 1981 gatacacttg gtgctgagga agggtcaaat actaactatt gttgtcacga aatataggtc 2041 tacagcagag agatggtgag tatatattca gatagttaca tccctatata aactatgttt 2101 acattttaga tgcttttctt tctgttaatt gcttaatctc actctgactt gaggtacaac 2161 ttctgttttg gaatgaatta gataattcca gattctggtt tgataattgt tgacattccc 2221 ccatgctact ttttctgagg gcagaaacgt ctaatgtgac gactcttcac attaccatta 2281 cgaggataca cagcacagcg aaatcattcc gatgacaggt gtgatagatg gagagctaac 2341 atgcaactgc cgagtgtttc actgttagcc agaactaagt cacttgcccc acacagcaat 2401 tacaccatga atctctaaca tcacaacctt ctttcaaata cccacggact catccatcct 2461 tccatccgtc atccatccat ccgtccgtcc gtccgtcctg actgcctagt gccactgtct 2521 ggctaggcac acccactatc aacctggttc acctgtcatg gcagcctgta cccacccccg 2581 ccacacaccc cgacgctggc ctatagtgca aaggttgtgc gggctggtcc ttcccacaat 2641 gcagtactgt aatccccgtc cctcctggag cccgaattcc ttctacttca gacactgtgc 2701 gctctcccgg accgtcgtgc gccgctcccc gtcgccttcc aggactggga aaggggagag 2761 gaggacggtg ccacgtccga cggcctcctg ggctgggggc agggtctgcc gctcgccctt 2821 ggcaccagtc cctgcgaccc atccccgagc ctcgccctcc tcctccctgc tcgaagaggt 2881 ctccctcctt cgcgggatct gagtccctgt cttcattttt ctcctagcca cgtccctccc 2941 cgagaaggga cgagccggga gcatcatgcg tttcgcctgg accgtgctct ttctgggaca 3001 gctgcagttc tgtccccttc tccgctgcgc cccgcaggcc ccgcgcgagc ctcccgccgc 3061 ccccggtgcc tggcgccaga caatccaatg ggagaacaac gggcaggtgt tcagtctgtt 3121 gagcctgggg gcgcagtacc agcctcagcg acgccgcgac tccagcgcca ctgccccgag 3181 agccgacgca acgctgcagc acagccacgc acgcccattc tgctgctgcg tgacaaccgc 3241 actgcctctg cccgtgcgag gactccaagc ccatctgggg tcgccgcggg tcgtccccgg 3301 cccgcagccc gccactggtt ccaagttggt ttctcgccgt cgggggccgg cgatggagcc 3361 tcaaggcgcg cagaaccgga ctgcgtcgcc acagcctccg cagctcagta atctgaggcc 3421 acccagccac gtagatcgca tggtggcgac gacccctaca atccctacaa gtactccgac 3481 gacaacccct attataacta ctatgacact tatgagagac cggtccggga gcaggcaccg 3541 acctggatat ggcaccggtt acttccagta cggtctcccg gacctggtac ccgatcccta 3601 ctacatccag gcatccacgt acgtacaaaa gatgtctatg tacaacctga gatgcgctgc 3661 ggaagaaaac tgcctggcca gttcagcata tagggcggat gtcagagact atgaccacag 3721 ggtactgcta cgatttcctc agagagtgaa aaaccaaggg acgtctgact tcttaccaag 3781 ccgcccccgc tactcctggg agtggcacag ctgccaccaa cattaccaca gcatggatga 3841 attcagccac tacgacctgc tggatgccag cacacagagg agagtggccg agggccacaa 3901 agcaagcttc tgtctggagg acacttcctg tgattatggg taccacagac gatttgcctg 3961 tactgcacac acacaggggt tgagtcccgg atgttatgat acttatgcag cagacataga 4021 ctgccagtgg attgatatta cagatgtaca acccggaaat tacattctaa aggtcagtgt 4081 aaaccccagc tacctggtgc ctgaatcaga ctacagtaac aatgtcgtac gctgtgaaat 4141 tcgctacaca ggacatcacg cctatgcctc aggctgcacc atttcaccgt attagaaaga 4201 agctcacctt cccaaaggat gaagcagtac ctggtgtttg gacctatgaa aaccgtagat 4261 tagcttaagt aggaagactt agatatttta aaaggcaaac ggaaaaacaa caaagaaggt 4321 tttgtttgga ctctttcaca acaaatcaca taactggatt ttgagtgttt aaatcagcat 4381 tagattggca cattttaaat acttattcat gttgctttat gaagtaatgg tgtttcaatt 4441 ctgtgggtgc atagtgggct ctttcaaaga attctgaatt tcttaccttc ttttgaaatt 4501 atagtgcaaa aagaagagga tattttaatg aatgagccac aatttgaact gattactttc 4561 taaattgcca gacccatgag acaatgatga tgggtttgta tttgcctcaa catagattcg 4621 ctttttaaaa agggtgttcc tattgtatag gcaaaaatgg atacacttgg tgctgaggaa 4681 gggtcaaata ctaactattg ttgtcacgaa atataggtct acagcagaga gatggtgagt 4741 atatattcag atagttacat ccctatataa actatgttta cattttagat gcttttcttt 4801 ctgttaattg cttaatctca ctctgacttg aggtacaact tctgttttgg aatgaattag 4861 ataattccag attctggttt gataattgtt gacattcccc catgctactt tttctgaggg 4921 cagaaacgtc taatgtgacg actcttcaca ttaccattac gaggatacac agcacagcga 4981 aatcattccg atgacaggtg tgatagatgg agagctaaca tgcaactgcc gagtgtttca 5041 ctgttagcca gaactaagtc acttgcccca cacagcaatt acaccatgaa tctctaacat 5101 cacaaccttc tttcaaatac ccacggactc atccatcctt ccatccgtca tccatccatc 5161 cgtccgtccg tccgtcctga ctgcctagtg ccactgtctg gctaggcaca cccactatca 5221 acctggttca cctgtcatgg cagcctgtac ccacccccgc cacacacccc gacgctggcc 5281 tatagtgcaa aggttgtgcg ggctggtcct tcccacaatg cagtactgta atccccgtcc 5341 ctcctggagc c // LOCUS SHFIPAH 2900 bp ds-DNA BCT 16-FEB-1990 DEFINITION S.flexner invasion plasmid antigen (ipaH) gene, complete cds. ACCESSION M32063 KEYWORDS invasion plasmid antigen. SOURCE S.flexner (strain M90T-W), serotype S) DNA, clone pWR390. ORGANISM Shigella flexneri Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 2900) AUTHORS Hartman,A.B., Venkatesan,M.M., Oaks,E.V. and Buysse,J.M. TITLE Sequence and molecular characterization of a multicopy invasion plasmid antigen gene, ipaH, of Shigella flexner JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by A.B.Hartman, 12-FEB-1990. FEATURES from to/span description pept 251 1849 invasion plasmid antigen pept 2277 > 2900 ORF3 signal 139 144 -35 region signal 162 167 -10 region signal 2111 2116 -35 region signal 2132 2137 -10 region BASE COUNT 868 a 657 c 578 g 797 t ORIGIN 1 catagaaaac ctccataaat aaattacaac taacttctgt tatgtgtaaa atggaaacta 61 ttaaaactta atatcggaaa tggtaagtga aatttgtata aatatacaat tttaaatatt 121 tattctcaca aatataaggt tgacctagca ttatgttctc tgtaaataat acacactcat 181 cagtttcttg ctccccctct attaactcaa actcaaccag taatgaacat tatctgagaa 241 tcctgactga atgggaaaag aactcttctc ccgggaagag cgaggcattg cttttaacag 301 actctcccag tgctttcaga atcaagaagc agtattaaat ttatcagacc taaatttgac 361 gtctcttccc gaattaccaa agcatatttc tgctttgatt gtagaaaata ataaattaac 421 atcattgcca aagctgcctg catttcttaa agaacttaat gctgataata acaggctttc 481 tgtgatacca gaacttcctg agtcattaac aactttaagt gttcgttcta atcaactgga 541 aaaccttcct gttttgccaa accatttaac atcattattt gttgaaaata acaggctata 601 taacttaccg gctcttcccg aaaaattgaa atttttacat gtttattata acaggctgac 661 aacattaccc gacttaccgg ataaactgga aattctctgt gctcagcgca ataatctggt 721 tacttttcct caattttctg atagaaacaa tatcagacaa aaggaatatt attttcattt 781 taatcagata accactcttc cggagagttt ttcacaatta gattcaagtt acaggattaa 841 tatttcaggg aatccattgt cgactcgcgt tctgcaatcc ctgcaaagat taacctcttc 901 gccggactac cacggcccgc agatttactt ctccatgagt gacggacaac agaatacact 961 ccatcgcccc ctggctgatg ccgtgacagc atggttcccg gaaaacaaac aatctgatgt 1021 atcacagata tggcatgctt ttgaacatga agagcatgcc aacacctttt ccgcgttcct 1081 tgaccgcctt tccgataccg tctctgcacg caatacctcc ggattccgtg aacaggtcgc 1141 tgcatggctg gaaaaactca gtgcctctgc ggagcttcga cagcagtctt tcgctgttgc 1201 tgctgatgcc actgagagct gtgaggaccg tgtcgcgctc acatggaaca atctccggaa 1261 aaccctcctg gtccatcagg catcagaagg ccttttcgat aatgataccg gcgctctgct 1321 ctccctgggc agggaaatgt tccgcctcga aattctggag gacattgccc gggataaagt 1381 cagaactctc cattttgtgg atgagataga agtctacctg gccttccaga ccatgctcgc 1441 agagaaactt cagctctcca ctgccgtgaa ggaaatgcgt ttctatggcg tgtcgggagt 1501 gacagcaaat gacctccgca ctgccgaagc tatggtcaga agccgtgaag agaatgaatt 1561 tacggactgg ttctccctct ggggaccatg gcatgctgta ctgaagcgta cggaagctga 1621 ccgctgggcg caggcagaag agcagaagta tgagatgctg gagaatgagt actctcagag 1681 ggtggctgac cggctgaaag catcaggtct gagcggtgat gcggatgcgc agagggaagc 1741 cggtgcacag gtgatgcgtg agactgaaca gcagatttac cgtcagctga ctgacgaggt 1801 actggccctg cgattgtctg aaaacggctc acgactgcac cattcataat cacgtcgcat 1861 aagcataaac cgcagaccgg attgactccg gaaaaactgt gacccgatta cggaccttaa 1921 caacaacccg taaatcctcg ctcaataccg gcagggattt acggcgtgca actgactttt 1981 ttgaggggat aaccaaccag atcgtttgct atgggaatat cgagacagta atgagttaaa 2041 tgataaaaat tgtttgaaaa tataggggat aaagatcaat ccaaactgga tgaaagtaga 2101 actggtcaca ttaacatggg tagactgata taacaatcga cggttactgg aaagacagga 2161 acatattcct ccagccggaa tgaaaacgcc gataaagctc taggattgtt tttttaaaga 2221 ctttctcgtt ttatttgcat taatagacca agatatgaat agtgaggggt taataaatga 2281 aaccgatcaa caatcattct ttttttcgtt ccctttgtgg cttatcatgt atatctcgtt 2341 tatcggtaga agaacagtgt accagagatt accaccgcat ctgggatgac tgggctaggg 2401 aaggaacaac aacagaaaat cgcatccagg cggttcgatt attgaaaata tgtctggata 2461 cccgggagcc tgttctcaat ttaagcttac tgaaactacg ttctttacca ccactccctt 2521 tgcatatacg tgaacttaat atttccaaca atgagttaat ctccctacct gaaaattctc 2581 cgcttttgac agaacttcat gtaaatggta acaacttgaa tatactcccg acacttccat 2641 ctcaactgat taagcttaat atttcattca atcgaaattt gtcatgtctg ccatcattac 2701 caccatattt acaatcactc tcggcacgtt ttaatagtct ggagacgtta ccagagcttc 2761 catcaacgct aacaatatta cgtattgaag gtaatcgcct tactgtcttg cctgaattgc 2821 ctcatagact acaagaactc tttgtttccg gcaacagact acaggaacta ccagaatttc 2881 ctcagagctt aaaatatttg // LOCUS VACCSBP 1020 bp ds-DNA VRL 06-DEC-1989 DEFINITION Vaccinia virus cell surface-binding protein gene, complete cds. ACCESSION J05190 KEYWORDS antigen; carbonic anhydrase-related transmembrane protein; cell surface-binding protein; envelope protein. SOURCE Vaccinia virus (wild type WR) DNA. ORGANISM Vaccinia virus Viridae; ds-DNA enveloped viruses; Poxviridae; Orthopoxvirus. REFERENCE 1 (bases 1 to 1020) AUTHORS Maa,J.-S., Rodriguez,J.F. and Esteban,M. TITLE Structural and functional characterization of a cell surface binding protein of vaccinia virus JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Esteban 30-NOV-1989. FEATURES from to/span description pept 61 975 cell surface-binding protein site 822 972 attachment site site 9 19 alpha helix site 103 114 alpha helix site 270 280 alpha helix site 286 292 alpha helix BASE COUNT 354 a 182 c 167 g 317 t ORIGIN Map position HindIII-D. 1 catccattgt aattcccata ctaagagcta tttttaaaca gttatcattt catttttact 61 atgccgcaac aactatctcc tattaatata gaaactaaaa aagcaatttc taacgcgcga 121 ttgaagccgt tagacataca ttataatgag tcgaaaccaa ccactatcca gaacactgga 181 aaactagtaa ggattaattt taaaggagga tatataagtg gagggtttct ccccaatgaa 241 tatgtgttat catcactaca tatatattgg ggaaaggaag acgattatgg atccaatcac 301 ttgatagatg tgtacaaata ctctggagag attaatcttg ttcattggaa taagaaaaaa 361 tatagttctt atgaagaggc aaaaaaacac gatgatggac ttatcattat ttctatattc 421 ttacaagtat tggatcataa aaatgtatat tttcaaaaga tagttaatca attgcattcc 481 attagatccg ccaatacgtc tgcaccgttt gattcagtat tttatctaga caatttgctg 541 cctagtaagt tggattattt tacatatcta ggaacaacta tcaaccactc tgcagacgct 601 gtatggataa tttttccaac gccaataaac attcattctg atcaactatc taaattcaga 661 acactattgt cgtcgtctaa tcatgatgga aaaccgcatt atataacaga gaactataga 721 aatccgtata aattgaacga cgacacgcaa gtatattatt ctggggagat tatacgagca 781 gcaactacct ctccagcgcg cgagaactat tttatgagat ggttgtccga tttgagagag 841 acatgttttt catattatca aaaatatatc gaagagaata aaacattcgc aattattgcc 901 atagtattcg tgtttatact taccgctatt ctctttttta tgagtcgacg atattcgcga 961 gaaaaacaaa actagattcg ataccttgtt gagcctccat tagaacggca gtgacttcgc //