Path: utzoo!attcan!uunet!zephyr.ens.tek.com!uw-beaver!milton!dali.cs.montana.edu!uakari.primate.wisc.edu!zaphod.mps.ohio-state.edu!usc!snorkelwacker!bionet!root From: GenBank-Updates@genbank.bio.net Newsgroups: bionet.molbio.genbank.updates Subject: Database Update Message-ID: Date: 9 Aug 90 12:00:25 GMT Sender: root@genbank.BIO.NET Distribution: bionet Lines: 6403 Approved: lear@genbank.bio.net Checksum: 18105 422 LOCUS INS43AAA 130 bp ds-DNA BCT 09-AUG-1990 DEFINITION Insertion sequence IS2-43. ACCESSION M25093 KEYWORDS RNA polymerase binding site; insertion sequence; insertion sequence IS2. SOURCE Insertion sequence IS2 DNA. ORGANISM Insertion sequence IS2 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 130) AUTHORS Sommer,H., Cullum,J. and Saedler,H. TITLE IS2-43 and IS2-44: New alleles of the insertion sequence IS2 which have promoter activity JOURNAL Mol. Gen. Genet. 175, 53-56 (1979) STANDARD simple staff_entry FEATURES from to/span description BASE COUNT 41 a 23 c 22 g 44 t ORIGIN 1 cctaagacat caatcatctg ttctccaatg actagtctaa aaactagtat taagactatc 61 acttatttaa gtgatatact tatttaagtg atattggttg tctggagatt cagggggcca 121 gtctaatacc // LOCUS PSCIS1IN 146 bp ds-DNA BCT 09-AUG-1990 DEFINITION Plasmid pDG128 insertion element IS1 target region sequence. ACCESSION M25018 KEYWORDS insertion element; insertion element IS1. SOURCE Plasmid pDG128, a derivative of Plasmid pSC101, DNA, clone 128/10R7. ORGANISM Plasmid pSC101 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 146) AUTHORS Sommer,H., Schumacher,B. and Saedler,H. TITLE A new type of IS1-mediated deletion JOURNAL Mol. Gen. Genet. 184, 300-307 (1981) STANDARD simple staff_entry FEATURES from to/span description BASE COUNT 38 a 33 c 34 g 41 t ORIGIN 1 gctgcgaaaa tgccttatct ggcctacaga ttcgatgcga ttcgtaggtc ggataagatg 61 cgcaagcatc gcatccgaca ataagtgccg aatgcgacct acattcacat ggcgcttttt 121 acatctgacg gtttttattg aagtta // LOCUS BRVRNASA 197 bp ss-mRNA VRL 09-AUG-1990 DEFINITION Berne virus ORF5 mRNA, 5'end. ACCESSION M33503 M33501 KEYWORDS core protein. SOURCE Berne virus (strain P138/72) viral RNA. ORGANISM Berne virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Toroviridae. REFERENCE 1 (bases 1 to 197) AUTHORS Snijder,E.J., Horzinek,M.C. and Spaan,W.J.M. TITLE A 3'-coterminal nested set of independently transcribed mRNAs is generated during Berne virus replication JOURNAL J. Virol. 64, 331-338 (1990) STANDARD simple staff_review FEATURES from to/span description pept 137 > 197 ORF5 mRNA 113 > 197 RNA5 BASE COUNT 50 a 27 c 38 g 82 t ORIGIN 1 ttatttcttc ttcctacttt gtggctactt gggttttgtt ggtggtggtt attattttag 61 tatttataat tataagtttt tgtattagta attaagtagg ttagtgagag acactatctt 121 tagagaaaga gccaagatga attctatgct taatccaaat gctgtgccat ttcaaccatc 181 acctcaggtt gttgcat // LOCUS BRVRNASB 179 bp ss-RNA VRL 09-AUG-1990 DEFINITION Berne virus ORF3 mRNA, 5' end. ACCESSION M33502 KEYWORDS core protein. SOURCE Berne virus (strain P138/72) viral RNA. ORGANISM Berne virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Toroviridae. REFERENCE 1 (bases 1 to 179) AUTHORS Snijder,E.J., Horzinek,M.C. and Spaan,W.J.M. TITLE A 3'-coterminal nested set of independently transcribed mRNAs is generated during Berne virus replication JOURNAL J. Virol. 64, 331-338 (1990) STANDARD simple staff_review FEATURES from to/span description pept 153 > 179 ORF3 BASE COUNT 52 a 17 c 34 g 76 t ORIGIN 1 ttataatctt cttcctactt ggattacatg gcttacttta ggttttagtt tgtttagtat 61 agtaataagt ggtattaata ttattttgtt ttttgaaatg aatggtaagg tgaagaaaag 121 ttagtcactt tctttagaag aaggttgcca aaatgtttga gaccaattat tggccattt // LOCUS CHKGLOBA 1204 bp ds-DNA VRT 09-AUG-1990 DEFINITION Chicken pie-alpha-globin gene, fragment H3/H4. ACCESSION M30485 KEYWORDS pie-alpha-globin. SOURCE Chicken AEV transformed erythroblast DNA, fragment H3/H4. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 1204) AUTHORS Broders,F., Zahraoui,A. and Scherrer,K. TITLE The chicken alpha-globin gene domain is transcribed into a 17-kilobase polycistronic RNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 503-507 (1990) STANDARD simple staff_review FEATURES from to/span description mRNA 91 > 1204 pie-alpha-globin mRNA fragment H3/H4 (put.) BASE COUNT 282 a 252 c 263 g 407 t ORIGIN 1 ggatctatct agttgctgca gtcgtttgta tgaaggttgg atccatcctg ttttgtactg 61 gatgactgcc ttcaattcac tggcaatcta ggatcaaatg tgtcctagag aacattcaat 121 atcgcttttt ttctaagctg ttgcaagcca gaatggttac ttttgagctg atctcggtgg 181 agcagttgag ttgttgtaag ttatttctta atggctccag aaaattacat catttaggtg 241 ctataactct ccatttccat cttgtatgcg taattgcatt tcttgaatac ttcagacatt 301 aatttcccgt cctacctgca ggttactggt gtgtattggc tatacagatt acttttccac 361 agatgtaacc ctaggtcttt tgaatataga tcccatctat tgtctgctta gagaccccga 421 taaccctccc gataaatcag agtccatgtt ttttgacagt atatcggtgt gaacatctgg 481 attttagtgc aatatgctag tagcaatctg agtccccgtt tctaagacag agtcatttag 541 tccgagaatg gctgtttaag actccaaatg gcagtcttga gtcttttagt gactgtactc 601 gttcctctac tgagggcagt cttgagtgtt ttagtgactg taccctgtct cttaacttga 661 ccggtctgat agatcttaaa tgacagtcgt ggccgcaatt tcaaatggaa gagctaggag 721 tctcaggaac cgtcgccctt gtttactctt atgtttaccc gttaagccgt catgaaaagg 781 atttttctgt agagaacggt tatatgagtt gtattccatc tagggtcacg gcccctagac 841 caaccaacga cgagtcgatt tgttgtctgg cactttctgt gacttcaagt tttgtggctt 901 tctctattaa ctttccccac aacgtaactg tctaacttag atgttggcgc gagaactaca 961 gtctgaggga cttgtcaaga gctggcacac tcgcctttat gttaaagtgt gtcctttgtc 1021 gatactggta ctaatgctta agctcgagcg ggcccctaga ccaacgacga gtcgatttgt 1081 tgtctggctc tttctgtgac ttcaagtttt gtggctttct ctattaactt tcccacaacg 1141 taactgtcta attagatgtt ggcgcgagaa tacagtctga gggattgtca agagtggact 1201 ggtt // LOCUS CHKGLOBB 582 bp ds-DNA VRT 09-AUG-1990 DEFINITION Chicken pie-alpha-globin gene, fragment H10. ACCESSION M30486 KEYWORDS pie-alpha-globin. SOURCE Chicken AEV transformed erythroblast DNA, fragment H10. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 582) AUTHORS Broders,F., Zahraoui,A. and Scherrer,K. TITLE The chicken alpha-globin gene domain is transcribed into a 17-kilobase polycistronic RNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 503-507 (1990) STANDARD simple staff_review FEATURES from to/span description mRNA 241 > 582 pie-alpha-globin fragment mRNA H10 BASE COUNT 171 a 128 c 108 g 175 t ORIGIN 1 tccaaaaaac ttactctgct tgtaaatgtc gtctcctttt tcggagacaa aaacttgata 61 ccttcttgcc ttgtccgaag tcactttatc ggttatagga cccaagtttt gggccttgct 121 agaaggatac aattccctat gaccgccgta ttttggggta ctcgcattcg cccgacatcg 181 agtggacctc ctttttttct cttgtcgttc gtagaggtta tcgaggtccc cccatatata 241 ataaccctat cgtgagttta gacttcctac aaaaacttct gtcgtttaat gttttcgtac 301 cgtcacggtg actgtccagt aatcaaagtt gtcactgtct aaaaagattc gacaacttcg 361 tcttaccaat gcgaaaactc gactagagac actcgtcaac tcacacattc aataaagaat 421 taccgaggtc ttttaatgta gtgaaatcac gatattgaga ggtaaaggta gaaacatacg 481 cattaaccta aagaacttat gaagtctgta attaaaggac cacaagcaat acgaaagaca 541 atgtatttct tctaacgtcg gataagtatt aggatggacg tc // LOCUS ECOPHOAA 600 bp ds-DNA BCT 09-AUG-1990 DEFINITION E.coli alkaline phosphatase (phoA) gene, 5' end. ACCESSION M33536 KEYWORDS alkaline phosphatase. SOURCE E.coli (strain K-12) cell line BW7710 DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 600) AUTHORS Agrawal,D.K. and Wanner,B.L. TITLE A phoA structural gene mutation that conditionally affects formation of the enzyme bacterial alkaline phosphatase JOURNAL J. Bacteriol. 172, 3180-3190 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.K.Agrawal, 03-APR-1990. The phoA503 mutation does not interfere with export of active enzyme but does interfere with assembly. FEATURES from to/span description pept 283 > 600 alkaline phosphatase precursor (phoA) (EC 3.1.3.1) sigp 283 345 alkaline phosphatase signal peptide matp 346 > 600 alkaline phosphatase variant 413 413 c in wild type; t in phoA503 mutation BASE COUNT 159 a 130 c 151 g 160 t ORIGIN Map position 8.7 minutes; 1 bp upstream of HindIII site. 1 aagctttgga gattatcgtc actgcaatgc ttcgcaatat ggcgcaaaat gaccaacagc 61 ggttgattga tcaggtagag ggggcgctgt acgaggtaaa gcccgatgcc agcattcctg 121 acgacgatac ggagctgctg cgcgattacg taaagaagtt attgaagcat cctcgtcagt 181 aaaaagttaa tcttttcaac agctgtcata aagttgtcac ggccgagact tatagtcgct 241 ttgtttttat tttttaatgt atttgtacat ggagaaaata aagtgaaaca aagcactatt 301 gcactggcac tcttaccgtt actgtttacc cctgtgacaa aagcccggac accagaaatg 361 cctgttctgg aaaaccgggc tgctcagggc gatattactg cacccggcgg tgctcgccgt 421 ttaacgggtg atcagactgc cgctctgcgt gattctctta gcgataaacc tgcaaaaaat 481 attattttgc tgattggcga tgggatgggg gactcggaaa ttactgccgc acgtaattat 541 gccgaaggtg cgggcggctt ttttaaaggt atagatgcct taccgcttac cgggcaatac // LOCUS GCOEARA 1771 bp ds-DNA PLN 09-AUG-1990 DEFINITION G.tikvahiae McLachlan 18S ribosomal RNA gene. ACCESSION M33640 KEYWORDS 18S ribosomal RNA. SOURCE G.tikvahiae McLachlan (isolate Pomquet Harbour-Nova Scotia) DNA. ORGANISM Gracilaria tikvahiae McLachlan Eukaryota; Plantae; Thallobionta; Rhodophycota; Rhodophyceae; Florideophycideae; Gigartinales; Gracilariaceae. REFERENCE 1 (bases 1 to 1771) AUTHORS Liu,Q.-Y., Bird,C.J., Rice,E.L., Murphy,C.A. and Ragan,M.A. TITLE Nucleotide sequence of the 18S ribosomal RNA gene from the red alga Gracilaria tikvahiae mclachlan JOURNAL Unpublished (1990) See COMMENT for author address STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ragan 08-APR-1990. Atlantic Research Lab, National Research Council of Canada, 1411 Oxford Street, Halifax, Nova Scotia CANADA B3H 3Z1 FEATURES from to/span description rRNA 1 1771 18S ribosomal RNA BASE COUNT 445 a 371 c 501 g 454 t ORIGIN 1 ccacctggtt gatcctgcca gtggtatatg cttgtttaaa ggactaagcc atgcaagtgc 61 aagtatgagt gaattgtaca acgaaactgc gaatggctcg gtaaaacagc tataatttct 121 tcggtgctaa atactactcg gatacccgta gtaattctag agctaatacg tgcctccata 181 acgacgcaag tcgtggtaca aattagagat acaagccaac ttgttggtga ttctagattt 241 tttttctgat cgcactcgtt gcgacgcacc gttcaaattt ctgacctatc aactttggat 301 ggtaaggtat tggcttacca tggttgtgac gggtaacgga ccgtgggtgc gggattccgg 361 agagggagcc tgagagacgg ctaccacatc caaggaaggc agcaggcgcg caacttaccc 421 aatccggaca ccgggaggta gtgacaagaa atatcaatag agggcccgat gggttttcta 481 attggaatga gaacaaggta aacagcttat cgaggagcca gcagagggca agtctggtgc 541 cagcagccgc ggtaattcca gctctgtaag cgtataccaa agttgttgca gttaaaacgc 601 tcgtagtcgg attttggcgt ctgacttggg tcgtcctcgc ggacgctctc aggttgggcg 661 cctttgtgga tgggagtcag gtggtgcttc actggatcgc ttggctgccg ccaccgttta 721 ctgtgaaaaa attagagtgt tcaaagcagg cgattgccct gaatacatta gcatggaata 781 atagaatagg acccggtcct attttgttgg tttgtttgaa tcgggtaatg attaagaggg 841 acggttgggg gcattcgtat tccgacgtca gaggtgaaat tcttggattg tcggaagacg 901 aacagctgcg aaagcgtctg ccaaggacgt tttcattgat caagaacgaa agtaagggga 961 tcgaagacga tcagataccg tcgtagtctt tactataaac gatgaggact ggagatcgga 1021 taagactgat atatggctta tccggcatcc ttcgagaaat caaagtgttt gctttctggg 1081 gggagtatgg tcgcaaggct gaaacttaaa ggaattgacg gaagggcatc accgggtgtg 1141 gagcctgcgg cttaatttga ctcaacacgg gaaaacttac caggtcagga catagtaagg 1201 attgacagat tgagagctct ttcttgattc tatggttggt ggtgcatggc cgttcttagt 1261 tggtggagtg atctgtctgg ttaattccgt taacgagcga gacctgggcg tgctagctag 1321 gcgccgttac tatttttggt agcgaggctt gccttcctag acggactgtg ggcgtctagc 1381 ccacggaagc tccaggcaat aacaggtctg agatgccctt agatgtcctg ggccgcacgc 1441 gtgctacact gaacgggtca acgagttagg atatgcgaaa gcatttccca atctctaaat 1501 ccgttcgtga tggggatcga cggttgcaat tttccgtcgt caacgaggaa taccttgtaa 1561 gcgcgggtca tcatcccgcg ctgaatacgt ccctgccctt tgtacacacc gcccgtcgct 1621 cctaccgatt gagtggtccg gtgaggcctt gggagagcta gatgaactga ttattcagat 1681 cttttggctt gaacttggtc aaaccttatc acttagagga aggagaagtc gtaacaaggt 1741 ttccgtaggt gaacctgcag aaggatcaag c // LOCUS HS6MCP 4440 bp ds-DNA VRL 09-AUG-1990 DEFINITION Human herpesvirus type 6 major capsid protein (MCP) gene, complete cds. ACCESSION M33515 KEYWORDS major capsid protein. SOURCE Human herpesvirus type 6 DNA. ORGANISM Human herpesvirus type 6 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 4440) AUTHORS Littler,E., Lawrence,G., Liu,M.-Y., Barrell,B.G. and Arrand,J.R. TITLE Identification, cloning, and expression of the major capsid protein gene of human herpesvirus 6 JOURNAL J. Virol. 64, 714-722 (1990) STANDARD simple staff_review FEATURES from to/span description pept 235 4272 major capsid protein (MCP) BASE COUNT 1422 a 1169 c 785 g 1064 t ORIGIN 1 tatcgtgaac gatatttggc ccggacgttt gaaaaatttt ctctatgatt gactcgatct 61 tttccagaac tacaggcatg gatcgcgcta aacgagtttc ctcgtcgcga gacacttcag 121 cggtcagatc acacgaatct ataaaaactg gaatcgaccg tgcacaagtg gaaccaaaac 181 atgaattaac tattaaagtt tcacaattac cggtgtgctg cataacgccg aaacatggaa 241 aattggcagg cgaccgaaat tttacctaag atcgaagcac ctctaaatat tttcaatgac 301 attaaaacat acacagccga acaacttttt gacaatttgc gaatttattt cggtgacgat 361 ccgagccgtt acaacatcag ttttgaagcc ttactcggaa tctactgcaa caaaatagaa 421 tggattaact ttttcaccac gccgatcgcc gttgcagcga acgtaatccg cttcaatgat 481 gtgagtcgaa tgaccctcgg gaaggttctc ttctttattc aattacctag agtcgctaca 541 ggaaacgacg taactgcttc aaaagaaacc accatcatgg tagccaaaca ctcagaaaaa 601 caccccataa acatatcgtt cgatttgagc gctgcctgtc tggaacatct ggaaaacaca 661 tttaaaaaca cagtcatcga tcagatttta aacatcaatg cgttacatac agtcttaaga 721 tctttaaaga attcagccga ttcgctcgag cgaggtttga ttcacgcatt catgcaaacc 781 ttattgagaa aatctccccc gcaatttatc gtcctgacca tgaatgagaa caaagtacat 841 aataaacaag ctctgagccg agtacagcgc agcaacatgt ttcagagcct gaagaacaga 901 ttgttaacgt cattattttt tttgaacagg aataataata tttcatatat ctatagaatt 961 ctaaacgaca tgatggaatc ggtcacggaa agcattctaa atgatacgaa caactacact 1021 tccaaagaaa acgtccccct agatggtgtt ttattaggac cgatcggctc tatccaaaaa 1081 ctcaccagca tactctccca gtacatctcc acacaagtcg tctccgcccc aatctcatat 1141 ggtcacttta ttatgggcaa agaaaacgca gtgactgcga ttgcataccg tgcaatcatg 1201 gccgatttta ctcaattcac cgtgaacgcc gggacagaac aacaagacac taacaacaaa 1261 tcagaaatct tcgacaaaag ccgcgcgtac gccgacctaa agctgaacac gttgaaattg 1321 ggagataaat tagtcgcatt cgaccaccta cacaaagttt acaaaaacac agacgtcaac 1381 gatccgctag aacagagctt acaactaaca ttctttttcc ctttgggtat ctacataccg 1441 agcgagaccg gtttcagtac aatggaaaca cgtgtgaaat taaacgacac catggaaaac 1501 aacctaccca ccagcgtttt tttccacaat aaagaccaag tcgtgcagcg aattgatttt 1561 gccgacatat taccgtcggt ttgccatccc attgtccacg actcgaccat cgtcgaacga 1621 ctcatgaaaa gcgaaccatt gcctaccggc caccgctttt cccaactatg tcaactaaaa 1681 attacccgag aaaacccagc caggatctta cagaccttat acaacttata cgaaagtcga 1741 caagaagtac ccaaaaacac caacgtctta aaaaacgaat taaacattga agatttttac 1801 aaaccggaca atccaacact gccgaccgaa agacacccct tcttcgatct cacgtatatc 1861 cagaaaaacc gagccacaga agtactctgc acaccaagaa taatgatagg caacatacct 1921 ttaccgttag ctccagtctc tttccacgaa gcccgtacaa atcaaatact ggaacatgca 1981 aagacgaact gccaaaagta cgacttcacc ctcaaaattg tcaccgaaag cttgacgagt 2041 ggctcgtacc cagaattggc ttacgttatc gagaccttag tgcatggaaa caagcatgct 2101 tttatgatcc taaaacaagt aattagccag tgtatttctt attggtttaa catgaaacat 2161 atacttcttt tttgcaacag cttcgagatg atcatgctaa tctctaacca catgggcgac 2221 gaactgatcc cgggagcagc tttcgctcac tacagaaatc ttgtgtcgct aattcgccta 2281 gtgaagagaa caatctctat ctccaacctc aacgagcaac tttgcggcga acctctggtg 2341 aatttcgcca acgcgttgtt cgacggacgt ctgttctgcc cgttcgtcca taccatgccc 2401 agaaacgaca cgaatgcaaa aataacagcg gatgatacac cactgacaca gaacaccgta 2461 agagttagaa attacgaaat atccgatgtg caaagaatga atctaataga ttcaagcgtc 2521 gtctttaccg acaatgacag accatcgaac gaaaccacca tcctgagcga gatattttac 2581 ttctgcgtac tcccggcact atcaaataac aaggcctgtg gcgctggcgt caacgtaaag 2641 gaactagttc tagacttatt ctacacggaa ccgttcatca gtccagatga ttatttccag 2701 gagaatccga ttaccagcga cgttctaatg tctctgatcc gagaaggtat gggccctggc 2761 tacaccgtag ccaacacatc ctgtatcgca aaacagttgt ttaaatcgct aatctacatt 2821 aatgaaaata cgaaaatatt ggaagtggaa gtctccttag atcccgcgca gcgacacggc 2881 aactccgttc attttcaatc actacaacac attctataca acgggctttg cctgatctca 2941 ccgatcacca ccctaagacg gtactatcaa ccaatcccat ttcatcgatt cttctccgac 3001 ccgggaatct gcggcaccat gaatgctgat atccaagttt tcctaaatac atttcctcac 3061 tgtcaaagaa acgacggcgg ttttcctctc ccgcccccat tagcattaga attttataat 3121 tggcaacgaa caccgttttc cgtgtactca gccttctgcc ccaattccct gttgagcatt 3181 atgacgcttg ccgccatgca ctcaaaattg tctcccgttg ccatagcgat ccaaagcaaa 3241 aacaaaatcc atccgggctt tgcggccaca ctagtccgga cggataattt cgacgtcgag 3301 tgcctattat acagttccag agcagccaca tctataattt tagacgatcc cacggtcacc 3361 gcggaagcta aagatatcgc aaccacttac aacttcaccc agcacctaag ttttgtagat 3421 atgggcttag gttttagctc taccaccgcc actgccaatc ttaagcgaat taaatcagat 3481 atggggagca agatacaaaa ccttttctcc gccttcccga tacacgcgtt taccaacgcg 3541 gacataaata cgtggattcg acatcacgtc gggatagaaa aacctaatcc ctccgagagc 3601 gaagcactaa acatcataac gttcggcgga attaacaaaa acccaccctc catactactg 3661 catggtcaac aagctatctg cgaagttata ctgaccccgg ttacgacaaa cattaacttt 3721 ttcaaatcgc cccacaaccc aagaggcagg gaatcatgta tgatgggaac ggacccgcac 3781 aacgaagagg cggctagaaa agcattgtac gaccacaccc aaacagacag cgatacattc 3841 gccgcaacca caaacccttg ggcatctcta ccaggctcct taggcgatat tctatacaac 3901 acggcacaca gagaacaact atgttacaac cccaagacat acagtcccaa cgctcaattt 3961 tttaccgaat ctgacatctt aaaaacaaac aagatgatgt acaaagtgat aagcgaatac 4021 tgcatgaaat cgaactcgtg tttaaacagc gatagcgaaa tacaatactc gtgctctgag 4081 ggcacggata gcttcgtaag cagaccatgc cagttcttac aaaacgctct gcctcttcac 4141 tgttcatcca accaagctct attagagagt cggtctaaaa ccggcaatac gcagatcagc 4201 gaaacccatt attgtaatta cgccatagga gaaaccatac ctttccaact cattatcgaa 4261 tcatccatat aaaatggaaa ccgtctactg cactttcgat cacaaactgt cactttccga 4321 tatcagcacc ctatgcaagc tcatgaacat cgtcataccg atcccagctc accaccatct 4381 aataggtagc ggcaatttag gtctttatcc catcgtctcc tccaacaaag attacgtcca // LOCUS HUMSEXREPB 916 bp ds-DNA PRI 09-AUG-1990 DEFINITION Human sex chromosome repeat, clone pDP330. ACCESSION M33524 KEYWORDS sex chromosome repeat. SOURCE Human cell line OXENII DNA, clone pDP320. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 916) AUTHORS Fisher,E.M.C., Alitalo,T., Luoh S,-W., de la Chapelle,A. and Page,D.C. TITLE Human sex-chromosome-specific repeats within a region of pseudoautosomal/Yq homology JOURNAL Genomics 7, 625-628 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.M.C.Fisher, 03-APR-1990. FEATURES from to/span description site 1 388 low copy flanking sequence rpt 389 916 sex chromosome repeat BASE COUNT 206 a 228 c 229 g 253 t ORIGIN Chromosome Yp. 1 gaattcaggc ctcagtgtat gtctgtaaca caacagacag ggtctgcagg ggtcgaagta 61 ttttgtcatc aaagaggaag gaatgatcat tcatcataaa aggcaagaca tctttggtgc 121 aaggaaaact caagaaaaat accgcagacc atgcaatgag gcactggtcg atggagtgtt 181 gtaaacccgt cttcccagag tggcatgcac atggatccct cagcacatgg gtgacacaca 241 gactatgctt cagcaggtct gtctgggccc aagacacatt gtttctcatc agctcccagg 301 ggatgtcaag gctgcagatc catggatctc actttgcagg acagagactt ggtaatggct 361 tcccagagtt gttacaaaga aatcccaaag actgggcccc ttaaacaaca accttgattc 421 tcacagtcct tgaggctaga agtctgagat caagctatgg ccagggctgg ttcctcctga 481 ggcctctctc cttgggttgt agatgctgtc ttctccctgt gtcctcacag ggttgtccct 541 ctgtgtgtgt ctgtgtcctc atctcctctt cttatgaggt gtcttagtcc atttcaggct 601 gctgtcacag catgccgtag actgggtggc ttatcagcaa cagacattga ttctcccaca 661 gtcctggaag ctggacgtct gagatcaggg tatgggcagg gctgcttcct cctgaggcct 721 ctgtcctggg cttgtagatg ctgtcttctc catgtgtccc catgtggtca tccctctgtg 781 ggtgtgtctg tttcctcatc tgctcttcta atgagatgtc ttagtccatt gcaggctgct 841 atcacagaat accataggct gggtggctta taaaccacag agttttattc ttccacagtc 901 ctggaggctg gaattc // LOCUS HUMSEXRPA 918 bp ds-DNA PRI 09-AUG-1990 DEFINITION Human sex chromosome repeat, clone pDP316. ACCESSION M33523 KEYWORDS sex chromosome repeat. SOURCE Human cell line OXENII DNA, clone pDP316. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 918) AUTHORS Fisher,E.M.C., Alitalo,T., Luoh S,-W., de la Chapelle,A. and Page,D.C. TITLE Human sex-chromosome-specific repeats within a region of pseudoautosomal/Yq homology JOURNAL Genomics 7, 625-628 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.M.C.Fisher, 03-APR-1990. FEATURES from to/span description site 1 388 low copy flanking sequence rpt 389 918 sex chromosome repeat BASE COUNT 207 a 242 c 226 g 243 t ORIGIN Chromosome Yp. 1 gaattcaggc ctcagtgtct gtctgtaacc caacagacgg tgtctgcaga gatcgaagta 61 ttttgtcgtc gaagaggaag gaatgatcat tcatcacaaa aagcaagaca tctttggtgc 121 aaggaaaact cgaggaaaat accgcagacc atgcaatgag gcactggttg acggtgtgtt 181 ataaacccgt cttcccagag tggcatgcac acggatccct caggacatgg gtgacacaca 241 gactatgctt cagcaggtct gtctgggccc aagacacagt gtttctcatc agctcccagg 301 ggatgtcaag gctgcagatc catggatctc actttgcagg acagagactt ggtaatggct 361 tcccagagtt gttacaatgc aatcccaaag actgggcagc ttaaacaaca accttgattc 421 tcccacagtc ctggaagctg gaagtctgag atcaaggtgt gggcagggcg gttcctcctg 481 agtcctctct cctgggcttg tagatgccgt cttctccctg agtccccacg tggtcatccc 541 tctgtgtgcg tctgtgtcct catctcctct tcttatgagg tgtcttagtc catttcaggc 601 tgctgtcaca gcataccata gactgggtgg cttataagca acagacattg attctcccac 661 agccctggag gctggacgtc ttgagatcag gatatgggca aggctgtttc ctcctgaggc 721 ctctgtcctg ggcttgtaga caccatcttc tccctgtgtc cccacgtggt catccctcta 781 tgtgcatgtc tgtgtcctca tctgctcttc ttatgagatg tcttagtcca ttgcaggctg 841 ctatcacaga ataccatagg ctgggtggct tacaaaccac agacttttat tctcccacag 901 tcctggaggc tggaattc // LOCUS IRICAP 2461 bp ds-DNA VRL 09-AUG-1990 DEFINITION Iridescent virus type 1 capsid protein gene, complete cds. ACCESSION M33542 KEYWORDS capsid protein. SOURCE Iridescent virus type 1 DNA. ORGANISM Iridescent virus type 1 Viridae; ds-DNA nonenveloped viruses; Iridoviridae. REFERENCE 1 (bases 1 to 2461) AUTHORS Tajbakhsh,S., Lee,P.E., Watson,D.C. and Seligy,V.L. TITLE Molecular cloning, characterization, and expression of the Tipula iridescent virus capsid gene JOURNAL J. Virol. 64, 125-136 (1990) STANDARD simple staff_review FEATURES from to/span description pept 601 1995 capsid protein mRNA 587 > 2461 capsid protein mRNA ( 5' end +/- 5 bp) BASE COUNT 717 a 462 c 443 g 839 t ORIGIN 1 gaaggtgttg aaagatctac tgaaataggc ttcattagca tttttatttt gtccacaaat 61 tcattatttt taataggctg ttcttcacct ttattcgcat attcaaagta atcgattaaa 121 tttttttgaa tatggacgat atcatccatg aacataaacc aaacttcata atatatagta 181 tggagtaacg ggttaattaa accattgatt ccttttaatt gttttggatt aatgaggttt 241 aaatcatcat aaattttttc tatttttttt aaattttttc gagcaatttt taaatttgat 301 ttaaccaaac aaacttcctc tactttaatt gttacggttg gtacttttaa accattaatt 361 ttatttttag aggaagaaca acgctttatt aaagcgttgg aatccattaa tcgcttgttt 421 tatcataggt tattttttaa ctataaaaaa ataactaaat tactacagtt accaatatgt 481 cggcattagt tctccttcat attttcgtat tttataccct taaatttaac ctaatcaatt 541 tctacattta tttttgggtt caaaattttt agccgaaata ttgctactaa taaattaaac 601 atgtctatgt cctcatcgaa tataacctca gggtttatcg atatcgccac ttttgacgaa 661 atcgaaaaat atatgtatgg cggcccaaca gcaacagcat actttgttag agaaattaga 721 aagtcgactt ggttcactca agtaccagtt ccactatcta gaaatactgg taatgcggct 781 tttggacaag aatggtcggt atctatatca cgtgctggag attatttgtt gcagacctgg 841 ttacgagtca atatcccacc agttactctt agtggtctac ttggtaacac ttactcttta 901 agatggacca aaaatttaat gcataacttg attcgtgaag ccaccattac ctttaatgat 961 ttggttgcag ctcgatttga taactatcat ttggatttct ggtctgcttt caccgtacct 1021 gccagcaaac gcaatgggta tgataacatg attggtaatg tctcttcttt aattaatcca 1081 gttgctccgg gtggtacttt gggtagcgta ggtggtatta accttaatct tccacttcca 1141 tttttcttct ctcgagatac tggtgtagca ctaccaacag ctgctctacc ttacaatgag 1201 atgcaaatca actttaattt cagagattgg catgagcttt tgattttgac taacagtgct 1261 ctagtaccac cagcaagtcc atatgttcca attgttgtag gtactcatat ttcagctgct 1321 ccagttttag gaccagttca agtatgggct aactatgcca tcgtctccaa cgaagaacgt 1381 cgtagaatgg gttgtgccat tcgagacatt ttgattgaac aggttcaaac ggcaccacgt 1441 caaaattatg tacctttgac caatgctagt ccaacatttg atattcgttt ctctcatgca 1501 atcaaagcat tattctttgc tgtacgaaat aaaacatctg cagcagaatg gtcaaattat 1561 gctacttctt ctccagttgt tactggtgca acggttaact acgaaccaac aggttctttt 1621 gaccctattg ccaatacaac attgatttat gagaacacta atcgtttggg tgccatggga 1681 tcagattact tctctttgat taatccattc tatcatgctc caactattcc atcattcatt 1741 ggatatcatt tgtactcata ttctcttcac ttttatgact tggatccgat gggttctacc 1801 aattacggta aactcactaa tgtgtctgtt gtaccccaag ctagtccggc agcaattgcg 1861 gcagcaggag gtactggtgg tcaagcaggt tcagattacc ctcaaaatta tgaatttgtc 1921 atattagctg tcaataataa tattgtcaga atatcaggtg gagaaacacc acaaaattac 1981 atagcagttt gttaaggtaa tttgtaacgc tccacaacag gcggaagtgg tctcgtgaga 2041 gaccgatatt gaggttttat caaccttaat ttgaatcatg aattaacatg atactttggt 2101 accgtctagt cggcttatat gtcgggctaa tggtcttttt tgatcatcaa gtggctataa 2161 gtggtacgtc gacgacagtc gacacctagt ggtttaataa aggtttttta cccaaattaa 2221 actggaacag gcaaggttga tgaaaacggt caaaattcag atagtctcgg gggctatttt 2281 ggacaagacc gtcggtgcag ctaatgcgta agcatcagtg atatcgctat cgactgggtc 2341 atcaatcggt tgtcctatct gactttttaa agtctcagga tggctcaatg tacagtcagc 2401 ccgcagtaag gtgtattccg agctgtcttt gaggataaaa gtaaacttga aaaagaagct 2461 t // LOCUS MUSIGHAAR 363 bp ss-mRNA ROD 09-AUG-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-J region, partial cds. ACCESSION M33679 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain A/J) hybridoma cell line 45-49, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 363) AUTHORS Parhami-Seren,B., Wysocki,L.J., Margolies,M.N. and Sharon,J. TITLE Clustered heavy chain somatic mutations shared by anti p azophenylarsonate antibodies confer enhanced affinity and ablate the cross-reactive idiotype JOURNAL Unpublished (1990) See COMMENT for author address STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by B.Parhami-Seren, 11-APR-1990. Massachusetts General Hospital, Jackson 1402, Blossom Street Receiving, Boston, MA 02114 FEATURES from to/span description pept < 1 > 363 Ig heavy chain V-D-J region (AA at 1) BASE COUNT 98 a 83 c 89 g 93 t ORIGIN 1 gaggttcagc ttcagcagtc tggagctgag ttgatgaggc ctgggtcctc agtgacgatg 61 tcctgcaagg cttccggata tgcaatcaca agctacggtt taaactgggt gaaacagagg 121 cctggacagg gcctggaatg ggttggatat attcatcctg gaaaaggtta tattcactac 181 aatgaaaaat tcaagggcaa gaccacactg actgtagaca aatcctccaa tacagcctac 241 atgcaggtca gaagcctgac atctgaggac tctgcagtct atttctgtgc aagatcgttt 301 tttgacattt acatgtatta ctttgactac tggggccagg gcaccactct cacagtctcc 361 tca // LOCUS MUSIGKABF 324 bp ss-mRNA ROD 09-AUG-1990 DEFINITION Mouse Ig rearranged L-chain mRNA V-J region, partial cds. ACCESSION M33678 KEYWORDS immunoglobulin light chain; joining exon; processed gene; variable region. SOURCE Mouse (strain A/J) hybridoma cell line 45-49, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Parhami-Seren,B., Wysocki,L.J., Margolies,M.N. and Sharon,J. TITLE Clustered heavy chain somatic mutations shared by anti p azophenylarsonate antibodies confer enhanced affinity and ablate the cross-reactive idiotype JOURNAL Unpublished (1990) See COMMENT for author address STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by B.Parhami-Seren, 11-APR-1990. Massachusetts General Hospital, Jackson 1402, Blossom Street Receiving, Boston, MA 02114 FEATURES from to/span description pept < 1 > 324 Ig light-chain V-J region (AA at 1) BASE COUNT 96 a 77 c 73 g 77 t 1 others ORIGIN 1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc 61 atcagntgca gggcaagtca ggacattagc aattatttaa actggtatca gcagaaacca 121 gatggaactg ttaaactcct gatctactac acatcaaaat taaagtcagg agtcccatca 181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccattagtga cctggagcat 241 gaagacattg ccacttactt ttgccaacag ggtaatacgc ttcctcggac gttcggtgga 301 ggcaccaagt tggaaatcaa acgg // LOCUS MUSTCVYAN 2567 bp ds-DNA ROD 09-AUG-1990 DEFINITION Mouse T cell receptor rearranged beta-chain gene, V-2 region, 5' end. ACCESSION M33500 KEYWORDS T cell receptor; beta-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c) DNA, hybridoma B.1.1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2567) AUTHORS Ratanavongsiri,J., Igarashi,S., Mangal,S., Kilgannon,P., Fu,A. and Fotedar,A. TITLE Transcription of the T cell receptor beta-chain gene is controlled by multiple regulatory elements JOURNAL J. Immunol. 144, 1111-1119 (1990) STANDARD simple staff_review FEATURES from to/span description pept 2544 > 2567 T cell receptor beta-chain V-2 region precursor sigp 2544 > 2567 T cell receptor beta-chain signal peptide mRNA 2478 > 2567 T cell receptor beta-chain mRNA BASE COUNT 708 a 560 c 583 g 716 t ORIGIN 1 ctaaagttct tggctactgt tgtgtgcact ttgagtaatg attaagatgc attgggacag 61 ggggtggaga aatgtcccaa ggaggtagcc atgacctcca acactggtcc tgtggaggcc 121 ccgaggagct agctagccat ctgatctgga aacaagaggc ttaacctggc tcagtactga 181 aagctggtca agataagagg gggcaggcag atacctggag gcactgacct tgggaggcag 241 gaaggttagc aagggagata actggagtgt gagagacatt ctgatcccaa tcttgttaga 301 ggattaggct gaagagggtt cagtgtgaag ctcagtaaac tgagaagggc ctaggtttcc 361 ttctcctgga gtctgcttgg ctggacagag cacactgtcc ttagaaaagc aacagagctc 421 tcctggagga gctaggagcc actgacttca gacccaggga atatcttctc taccctcttc 481 cttctggctc ttaaggaggc tcacagggag cttatttagc tttttaagga gatttataga 541 ggctggagga acttgttttt tcaaaagtaa atgctctaga aaaatgaagg ttgaaggtgt 601 tatcaaactt gtgggtcaaa gctaaatgaa aaaaaaaatc aaaagaagga catgtctatt 661 cccaacataa gcagaagact tttattataa atatggtggg agaccatagt cagagacaga 721 gacagctggg aaaggccagc atgaacttga ccctgagcct ggacatctga ggacttgggg 781 gagcaggtgg gaagaaagaa gagagaaaag agagaagagg ggagaccagg agagtaaaga 841 gtagacaaaa ggacagcata gcaaaaatag ctggatttat aggggaaggt agctggggaa 901 aaggcagccc atcccctggg ctggagaagt ttagattaga gggtctgtat tctggccata 961 tcatatacta ggtaggacta aggaatgctg agtgaagctg gcatccaggt ccacaatgac 1021 atgttaaata agaacttcag ttagccattt gctttgggat tgaggcataa taaacgccag 1081 taccccaagc cagctctgtc cacttgtcct cagtaagtga acttaaacag ccaaaccagt 1141 aatctaaata actaactaac taactaacta aatcaatcaa tcaatcaatc aataaaagta 1201 gaaaagattt tttcagtgta aacacattgg taacatggaa aaagatccag agatccagta 1261 aactccctgt gtcagtcttg gggacctgca ggcaagatgg aagtttagag ggccaaggat 1321 aagcaatcta gctcaaagta tggtcctgcc ctgcattgac ccattgccta ggcttgttaa 1381 agctgtgtga aatctctttc caggagatac attcccactc tcgctggtgc ctttcctttc 1441 ttccatgttt tcctggggaa atttctcttt ctttggggtc acttttatca atagcctgct 1501 gttcagattg aaagactgtc tctttagaat gtctttattt ctgccaggtc agttatagaa 1561 agtggcatgt tttcctttat tcaggacaaa actcccattt tgattttctg cttgcattcc 1621 tggagtcaga cagatgagta ttcactgcat acagcctcgt ataaccctgc aaccacctcc 1681 acatgttcac ttaaatggag acattttact ctcttgcaag agcttgaaac tcaaactcag 1741 atctgtgaaa ctataaatcc agtttccttc catccctgct cctggagtga tgaccctgag 1801 actaattatc aataaatgcc tagagcataa gctccagcta gttctctgac ttgctctcaa 1861 cttattatgc cttttattct aacccagctt tagctacatg gctggtttcc tctccttgtc 1921 ttcttacttc agtctcctca gcattacagc tcgaatctct gttctatttc tcaagttcct 1981 ctacctgctg gattatgtcc ttttcctcag tgttccaggc aatctctact tttattctat 2041 cttgagtgac tagttacttc tgctcagctc ccatgattct gacctcctgt gttttgcagg 2101 caaatcttcc atgccctctc ctactatttc ccagaattct ctctattcct gctggatgtc 2161 ccacctactt cctgcatcag ctcattggcc ataagctttt ttattgacag gtgatactta 2221 acacatatca cttccaggaa tatctgttca ccactgagaa gatgcagggg cccagtcact 2281 gcactcagtt ctgtagtgag tgtacaatgt gcatgagtgt ggatgagaga gcattgctca 2341 gaccacagga aagggtgcaa accttcagtt tgaggttttc actttagagg aaagcttagt 2401 cagtttcctg aggaagtcac accctttgga acctcagccc caagacttaa gtttctcgtt 2461 accaccttac tggtttggat tctcttctct tgcctgatgc cctgcatgcc ccacagagat 2521 agagagaacc tgaggtctca gagatgtggc agttttgcat tctgtgc // LOCUS R751TRA 578 bp ds-DNA BCT 09-AUG-1990 DEFINITION Plasmid R751 traJ and traK genes, 5'end. ACCESSION M25422 KEYWORDS inverted repeat; transfer origin region. SOURCE Plasmid R751 (strain HB101, Inc P-beta) DNA. ORGANISM Plasmid R751 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 578) AUTHORS Lanka,E. and Euerste,J.P. TITLE Conjugative transfer of promiscuous IncP plasmids: Interaction of plasmid-encoded products with the transfer origin JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 1771-1775 (1989) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by E.Lanka, 17-JUL-1989. FEATURES from to/span description pept 48 < 1 (c) traJ protein pept 403 > 578 traK protein signal 243 211 promoter PL signal 266 294 promoter PR rpt 49 64 inverted repeat rpt 118 157 inverted repeat rpt 296 331 inverted repeat BASE COUNT 141 a 168 c 163 g 106 t ORIGIN 1 cggccgtgtt ccttttcgtc gttctccatg cctcgcctcg tctctcatgc cggcggtagc 61 cggctgcctc gcagagcagg atgacccgtt gagcgccccc ggcgcgaata agggacagtg 121 aagatagata accggctcgc cggttagcta acttcacaca tcctgcccgc cttacggcgt 181 taataacacc aaggaaagtc tacaccagcc attacgattt atccgcaact atcgcgctat 241 caggccgcaa aagcagcaac ggatatagcg aaacccgcca caatggccca taatgccgct 301 atcgaagcgt gccaatgcac gccgatagcg gactttttgc gtttccgtag cgccgcttag 361 tagcgttaca tttgcgatga gaggattaga tggacgaaca cgatgccaaa gacctacccc 421 gaagagctgg ctgaatgggt gaagggacgg gaagccaaga agccgcgcca ggacaagcac 481 gtggtcgcgt tcctggccgt caagagcgac gttcaagcgg cgctcgatgc gggctatgcg 541 atgaaaacga tctgggagca catgaaggaa accggccg // LOCUS RP4TRAB 571 bp ds-DNA BCT 09-AUG-1990 DEFINITION Plasmid RP4 traJ and traK genes, 5' end. ACCESSION M25423 KEYWORDS inerted repeat; transfer origin region. SOURCE Plasmid RP4 (strain HB101, IncP-alpha) DNA. ORGANISM Plasmid RP4 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 571) AUTHORS Lanka,E. and Euerste,J.P. TITLE Conjugative transfer of promiscuous IncP plasmids: Interaction of plasmid-encoded products with the transfer origin JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 1771-1775 (1989) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by E.Lanka, 17-JUL-1989. FEATURES from to/span description pept 26 < 1 (c) traJ protein pept 394 > 571 traK protein rpt 48 63 inverted repeat rpt 118 157 inverted repeat signal 219 192 promoter PL rpt 281 318 inverted repeat signal 246 272 inverted repeat BASE COUNT 135 a 148 c 181 g 107 t ORIGIN 1 ctggttggct tggtttcatc agccatccgc ttgccctcat ctgttacgcc ggcggtagcc 61 ggccagcctc gcagagcagg attcccgttg agcaccgcca ggtgcgaata agggacagtg 121 aagaaggaac acccgctcgc gggtgggcct acttcaccta tcctgcccgg ctgacgccgt 181 tggatacacc aaggaaagtc tacacgaacc ctttggcaaa atcctgtata tcgtgcgaaa 241 aaggatggat ataccgaaaa aatcgctata atgaccccga agcagggtta tgcagcggaa 301 aagcgctgct tccctgctgt tttgtggaat atctaccgac tggaaacagg caaatgcagg 361 aaattactga actgagggga caggcgagag acgatgccaa agagctacac cgacgagctg 421 gccgagtggg ttgaatcccg cgcggccaag aagcgccggc gtgatgaggc tgcggttgcg 481 ttcctggcgg tgagggcgga tgtcgaggcg gcgttagcgt ccggctatgc gctcgtcacc 541 atttgggagc acatgcggga aacggggaag g // LOCUS STAREPEBR 2389 bp ds-DNA BCT 09-AUG-1990 DEFINITION S.aureus ethidium resistance (ebr) and replication protein (repA) genes, complete cds. ACCESSION M33479 KEYWORDS ethidium resistance protein; replication protein. SOURCE S.aureus plasmid DNA. ORGANISM Staphylococcus aureus Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; Micrococcaceae. REFERENCE 1 (bases 1 to 2389) AUTHORS Liao,J., C,-H., Moghazeh,S.L. and Projan,S.J. TITLE Genetic mapping and nucleotide sequence of pWBG32, an ethidium bromide resistance plasmid naturally occurring in Staphylococcus aureus JOURNAL Unpublished (1990) See COMMENT for author address STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.J.Projan, 30-MAR-1990. Public Health Res Inst, 455 First Avenue, RM 1166, New York, NY 10016 FEATURES from to/span description pept 1153 1476 ethidium resistance protein (ebr) BASE COUNT 796 a 403 c 290 g 900 t ORIGIN 1 ggtcaatatc tttaagataa tctaaatcgc cattttttaa tttatttctt gcgtctttaa 61 ataatccaga ataaacaaga atttgtttcc ctttaagaga tttataaaat gcgtcgaaca 121 ctttctgatt aattaaatag tcactatcct taccagaata tttagccatt tcatataatt 181 ctttattgct attttgctta attttttgaa catgaacttg cgtaatttca gaaattcctg 241 ttacatctcg ccataaattt aaccattctt tttgactaat ataagctttt gtatctttaa 301 aatatgattt attaacggcc atcaaaacat gaaaatgcgg attataatca tcacgctttg 361 agttatacgt tatctctaat tttcttacat aacctttagt gatcgcattt acttttttgc 421 gtttaaacat cttttgaaag gcatgattat aattcttaat ttcactttct aaatgctcat 481 ctgtaacgtt tggtgtcgta agtgtcaaaa agataaattg cttatcttct tcttgcttaa 541 tatattgcat cattaacgat aatcctaatg catcttttct tgctttacgc cacgcacata 601 ccggacaaaa tcgattctta caaggattcg atttatataa tttctttttt tcaaattttt 661 tatccgtcac aaaagacaaa aatgtattac aatttttaac caaatccatt tgatctcccc 721 gatatgacgt tcaataaaat ttttaaatac ttgatttctt tgctttttct cagtatactt 781 ttccatacga taatacacaa aaacaactta gttttctcaa aaactatgca taaaaaagtt 841 gcttttttct ccttttcttt ttttttcgtt tggattagac acctaaaacg atacaatagt 901 atgctagaaa aagcaacttt ttttgtgctt caaaccagtt ataccaatga attgaaaggg 961 ttatacatcg ccgggaatag ttacccttat tatcaagaca agaagaaact cgttttcaac 1021 tcgtttcaaa aacctttcaa aaaccatcaa tccacaaaaa taccacgcga atgacactca 1081 aaatacaaga ctacaattaa aaaatactta gaataaaatt aaataaaata cgaaaattaa 1141 aaggagttaa aaatgcctta tatttattta ataatagcca taagtactga agttattgga 1201 agtgcatttc ttaaatcttc agaaggcttt tcaaaattta taccatcctt aggaacaata 1261 atttcatttg gaatttgttt ctatttttta agtaaaacaa tgcaacacct accactaaat 1321 ataacttatg caacttgggc gggactaggt ttagtcttaa caaccgtagt ctcaataatt 1381 attttcaaag aacaaataaa tctaataact atagtatcta tagttttaat catagtcggc 1441 gtagtttcgt taaacatttt cggaacatcg cattaattgc tttattccaa ttgctttatt 1501 gacgttgagc ctcggaaccc ttaacaatcc caaaacttgt cgaatggtcg gcttaatagc 1561 tcacgctatg ccgacattcg tctgcaagtt tagttaaggg ttcttctcaa catcaataaa 1621 ttttctcggc ataaatgcca tgctataata gatacacgtc ttctcttagc gtttcatagt 1681 attatcctcg tttattatac ttataattat aggggaaggc ttagagctat cattttgata 1741 gctctttatt tttgttcaaa catttattca aaatcagaat gcctttattt tttaatttta 1801 aggggtattt tgaagaatta agggttattt atatagtttt atacctaaaa acttatatcg 1861 gctcttaaaa cgcaaataag agccgaataa aaataattgc ttttcacaaa caaaaatttg 1921 agcaaaacca gtgttgaatt ttttagacac tgcccatcta catgcaaatt taaaaattgg 1981 cataaaaaat gggcaaccat gctggttgaa cgctatagtt cctgcagggg caaaaaagca 2041 taaaaaaacg ctagctttga tgagctaacg ttagttataa aattcagtaa tatgcttttg 2101 taattcaata gattctcttt cttttttagc ttgtcttttt ttaaaacctt ctgaatttct 2161 agaagcctta tatatatcca ttattttttt ataatcaatg tcgtaaccat atttttgtaa 2221 ctcttctaca aaaaacttat cgcaatttaa tatcattttt cttcctcgat ttcgtttatc 2281 atttgatgat ttattttttc tttttcttgt tcagttaaat cataaatttc acttgctaag 2341 tattcttttt gattccaaat ataaaaaatt tgataaatat attcagtcg // LOCUS XANAVR 2100 bp ds-DNA BCT 09-AUG-1990 DEFINITION X.campestris avirulence protein (avrBs1) gene, complete cds. ACCESSION M32142 J03672 KEYWORDS avirulence protein. SOURCE X.campestris (strain E3, race 2, pv. vesicatoria) DNA. ORGANISM Xanthomonas campestris Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 2100) AUTHORS Ronald,P.C. and Staskawicz,B.J. TITLE The avirulence gene avrBs-1 from Xanthomonas campestris pv. vesicatoria encodes a 50-kD protein JOURNAL Mol. Plant Microb. Interact. 1, 191-198 (1988) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by P.Ronald, 15-FEB-1990. FEATURES from to/span description pept 308 622 ORF1 pept 713 2050 ORF2 BASE COUNT 656 a 423 c 505 g 516 t ORIGIN 1 ccattgtcgg cggttatccg ggtacttggc gtacaccaaa caactggggc aatgctggca 61 aatcacgtga cgaagccttg gcagacgagc aacagaggat tcaagcgctt aaatcgcaag 121 agacggtaca tatcttccat cgcaaagatg tcaagagcga acccgcaacc cacgcggggc 181 gacgttaagt aagccactga tttttagcga agaagagctt gtgagagctg cgggcgccaa 241 atatgtacgt ttgacagtga cagatcatct ttcaccacgg gcggacgata ttgatgcgtt 301 tattgcaatg gagcgggaga tggcccatga tgagagactg catgtacatt gtggtatggg 361 cctaggccgt acgacaatat ttattgtcat gcatgacata ctaagaaatg ctgcaatgtt 421 atcgtttgat gatatcatcg aacggcaacg taaatttaat ccagggcgaa gcttggataa 481 taataaagac gtttctgaca aggggcgctc agaatttcgt aatgaacggt cagagttcct 541 tcctctattc tacgagtacg ccaagcaaaa tccaaagggc cagccattgt tatggtccga 601 atggctcgac cacaatgcat aaatcgcaag tacattttcg gctatgacgg acttgtgctc 661 gatgcgctgg cggctttctc gataaatatc aattaatata aatatcgaac taatgtccga 721 catgaaagtt aatttctctt caaaaataat agattcaaca cccagtgaag aggaggtcgc 781 cactcagcaa gatagttata cgaaatctgg actggtggcg ccatcgctcg attcacaagc 841 cttgaaaaaa gcacctagaa aaagagtaat aaaagaaaat atagctgctt tgcacacctc 901 atcgttagag cgagttcatc aaaagaaggt attagttcag aatttagcgc agttgcagag 961 agggttggct aagataaatg gtagagtcga actcgaagag ctaattgatg gattttcagt 1021 caaggaattg ctaataaaaa gaaatccaaa gattgctgaa gagtatggag aaggaaatcc 1081 tttaatgatt cgatctctaa gattttcaaa cccccaagag gtgactagta agcttggggc 1141 ggaaggaaaa acgccagcca aaagagaggt tgatacgatt tgcaataaat ccacgctgca 1201 tgacattgtc atgacgcccg cctcccttgt aaaaaaggaa gtgcggatga acctgatatc 1261 tgaagtccca agggcgaagg ataaacaaaa atacagaggt cttccttcag tcgtatatgg 1321 ccaaagcagc cgccgtagtg aatcagacta tctaacgtct cgaaatggtt tcggcgacgt 1381 gcactctttg aaatccaata acgcatttaa ttccgactac gaaaaaatat gtgggtcgct 1441 tagccatgcc gaaaagttgg ggttaattga aaggaatctt actcccttta taaggcatga 1501 tccagataga atctccaccg actttgttca ctctattgaa gaattggctg aacaccagat 1561 gctattgcaa tcaagaaaac ctgccagtgc tttgcggcat aatgaatatt gcaccaagct 1621 tgaactgtgg gatgctaaag ctatagcagt tggtgaatct cgtgccttgg cggtcgctac 1681 cctgattgaa tttaatttgg agatgttgtc gatagcacaa gagatagatg atgatgggca 1741 caagagtaaa atggtcgccg attttatcga gcgccaacta tcatggcttg gcccacaaac 1801 cgcacttgac agcaagtcaa cgcttgaaag ggtttcagcg gtgaccatac aagaaaggga 1861 atttatcgct aatgagatta gccgatcgtt gcgtcaaggt gtttcacttt gcacttacga 1921 taaagatgaa gcaggaagtc atatccgtga aatgagtttg ttggatttta gggttgaaga 1981 aatcatagag gggataagta tttttatttc ctccaagctt ttacatgtta caaatgcagg 2041 agaagcgtaa gagaagaagt atccgccaca atcgtgcgac ggaccgacgt cctaacgccc // LOCUS YSCSCD25 5055 bp ds-DNA PLN 09-AUG-1990 DEFINITION S.cerevisiae SCD25 gene, complete cds. ACCESSION M26647 M31771 KEYWORDS Ras protein; SCD25 gene; cell division cycle. SOURCE S.cerevisiae (strain OL136) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 2129 to 5055) AUTHORS Boy-Marcotte,E., Damak,F., Camonis,J., Garreau,H. and Jacquet,M. TITLE The C-terminal part of a gene partially homologous to CDC25 gene suppresses the CDC25-5 mutation in Saccharomyces cerevisiae JOURNAL Gene 77, 21-30 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 3880) AUTHORS Damak,F., Boy-Marcotte,E., Le-Roscouet,D., Guilbaud,R. and Jacquet,M. TITLE SCD25, a CDC25 like gene, which contains a RAS activating domain is a dispensable gene of Saccharomyces cerevisiae JOURNAL Unpublished (1990) See COMMENT for author address STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by E.Boy-Marcotte, 02-AUG-1989, for [2] by F.Damak, 01-FEB-1990. Laboratoire IGD, Groupe des laboratoires de biologie cellulaire, Centre universitaire d'Orsay, 91405-Orsay Cedex FEATURES from to/span description pept 128 3880 SCD25 protein pept 4319 > 5055 ORF X BASE COUNT 1638 a 973 c 900 g 1544 t ORIGIN 1 ctgcaggctc gcaaaattta aggttccctt ctacaatagt agtcaaaatt gcttttttgc 61 atataacaaa gtgaaaaaaa aaaatatgag agacatatct aaaagacata tataatctgc 121 caccataatg agttgcactg cgtcatatgc cggcatgaca actccggtga aagataagga 181 aggccacggg attccatgct tacaacctat cgatgtagtg gaatgtacct atcaatattt 241 tacaaaatca cggaataaac tgtctttaag ggtaggcgat ttgatttacg tactcactaa 301 aggttctaat ggctggtggg atggtgttct tatcagacac agcgctaata ataataataa 361 taattcgttg atactagaca gaggttggtt ccccccttct tttacacggt ccattctaaa 421 cgaactacac ggggtgcctg acatcggtaa tgaattggaa atatttcaag cgggtcttaa 481 tcttaaactg gaattatcaa gcaacccagt gatcttatca ttggaagact ttttagactg 541 ctgtcgcgat attgaattca aggaacaact ggcttggtca cctactcccg tccacgaaag 601 gaaaggctgc tgtgagctgc tgtactataa ccaggattta gatgtttatt gtcgcacgtt 661 accatattta ccacaaaatc aagttgaaac cgtgaacgac tattcgtctt ttcctgcaat 721 atcgaagatt gctggtaaaa agatgcctat aacgtcaagc cccgatctgt tctatctcaa 781 tgattgtgat gtcgtctatt ggtatgacct cactcgctta gtgtgtcatt atgttaattt 841 aacagagcgc gacctattgg caaatgaacg ggaaaagttt ctaacttcct tggatttatt 901 aacagctcaa ataacctatg tttatatgct tttcaggaat ctccgtttag ttgaagatag 961 tttcaaaaaa accctcaaaa aactaattta caccttgtct aggttttcaa taaatgcaaa 1021 tatttggttt cattccacat cgtttgaaga aagagaagcc atagcctccc agaaggatcc 1081 agaaagaaga tcccctcttc tacagtcaat cctaggaacc ttccaaaaat ttcattttct 1141 actgcgtcta ctacatttcc tctcaaatcc taacgaactt acaatactgc ctcaattgac 1201 tcctcgattt ttcaaggatt ctttcaatac aatttcatgg aataacccgt ttttgcgtac 1261 agtcttcaac cagcatatgt ccatgacctt accgagacag atgattaaag ccgttgctgg 1321 cgcttcagga attgttgcgg aaaatattga tgaaattcca gcttccaaac agggcacttt 1381 catctcgtca gaaacgtctc accattcacc atcagccccg tttcaaagaa ggagaagagg 1441 taccattttc tctaatgtgt caggaagttc cgatgagtct gacaccatat ggtccaaaag 1501 gaaaaaacca tacccgctaa atgaagaaac tctaagcctt gtaagggcca ggaagaagca 1561 gcttgatggt aaactaaaac aaatgatcaa aagtgctaat gaatatctca gtaacacggc 1621 taatttcaaa atgttgaatt ttgaaatgaa cttcaaaacc tacgaagaag taagcggaac 1681 aattcctata attgatattc tggaaaacct agatttaact atttttctaa acttgagaga 1741 gttgggagat gagaatagag tttttgacga agatgtcttt gacgaagatg tcgctattgg 1801 tgatgaagat aaagagtttt tgaaacactc tttatcatcc ctatcgtata tcttatccga 1861 ctattttaat atgaagcaat attttcatga attgtcgccc acgcatttga cattagagga 1921 tcctttcgtt ttctcgccaa tgcaaaacga cttgcctacc ggttattatg aaccaatgaa 1981 accttcatcc ttgaatttag ataatgccaa ggataagaag aatgggagcc aaaatactga 2041 tatccaagag gaggaagatg aatatgagcc agacccggat agtcttattc tcttccacaa 2101 cctcatcaat caagattctg atttcaatga tctaaagttt tttaatctcg cccacgtttt 2161 taaaaaatcc tgtgatgact attttgatgt gcttaaacta gccattgagt tcgtgaatca 2221 attaattcta gaaagagaga atttgttaaa ttatgctgct agaatgatga aaaacaatat 2281 cacggaattg ctattgcgcg gggaagaagg ctatgggtcc tatgacggcg gtgaaactga 2341 aaaaagtgac acgaatgctg tttatgcaga ttcagatact aaagacaatg acgaatggcg 2401 tgacagccaa gtcaaattac cgaggtattt gcagcgcgag tatgacagtg aactgatttg 2461 gggctctaac aataggatta aaggtggttc taaacacgca ctgatctctt acttgacaga 2521 taatgaaaag aaggacctat ttttcaatat tactttttta atcactttca gaagcatctt 2581 tactacaacg gagtttttaa gctacttgat ctcgcaatat aatttggatc caccagagga 2641 tttgtgcttt gaagaataca atgaatgggt gacgaaaaag cttataccgg ttaaatgtag 2701 ggtggttgag attatgacaa cctttttcaa gcaatattgg ttcccgggct atgatgagcc 2761 cgatcttgcg accctaaatc tggattattt tgcgcaagta gcaatcaagg aaaatataac 2821 aggatctgtg gaattactaa aggaggtcaa tcagaagttt aaactaggta atatacaaga 2881 agcgactgca ccaatgaaaa cgttagatca acagatctgc caggaccatt actcgggcac 2941 tttatactct accacggaat ccattttggc cgtcgatcca gttttatttg ccactcaatt 3001 aacgatacta gagcatgaaa tttattgtga gataaccact tttgattgtt tgcaaaaaat 3061 ttggaagaac aagtatacaa aatcgtatgg ggcttcaccg ggtttgaacg agtttatcag 3121 ttttgccaat aaactgacaa atttcatatc ctactctgtt gtaaaggagg ctgataaaag 3181 taagcgcgcc aagctactct ctcattttat ttttatcgca gaatattgta ggaaattcaa 3241 taacttttct tccatgactg acatcatttc agcattatat tcttcaccaa tttatcgttt 3301 agagaaaacc tggcaggcag ttattcctca aacgagagat ctattgcagt cactgaacaa 3361 gttgatggat cccaagaaaa atttcataaa ttacagaaac gagctgaagt ctttacatag 3421 cgctccctgc gtaccgtttt tcggcgttta tttatctgat ctaaccttta ctgattccgg 3481 aaatccggat tatcttgtct tggaacatgg tttaaagggt gtccatgatg agaagaaata 3541 tataaacttc aacaaaagga gcagacttgt tgatatctta caagagatca tatatttcaa 3601 gaaaacacat tatgatttca ctaaagatcg gacggtaatt gaatgtatat caaattcatt 3661 ggaaaacatc ccccatattg agaaacaata ccaattatca ttaattattg aaccaaaacc 3721 aagaaagaaa gtcgttccga attccaattc gaataataaa tcacaagaaa aatccaggga 3781 tgaccaaacc gatgaaggaa aaacatccac taagaaagac agatttccaa aatttcaatt 3841 acataagaca aagaaaaaag ctcccaaggt ttctaagtaa cggcgccgta tgttcgattt 3901 ccttctctcg gtggattaat tattttgttt gttttctcct gttatattat ttattgatca 3961 ctatagtaaa ctatgtccgt catcaagccc gacggctgct atcccacaat gttgatcgta 4021 ttgtttgcct agtttattat atatttgctt atttatagca taccataata tttaaatgcc 4081 ctcaaatttt tggccgtagc gacatcgcga taattccaat tccctttaaa aaattgcgcc 4141 tgagtataag ttaattcagc cagttctcca aattaaaatc gcatactcct gaacctatca 4201 acagattgtc ctcgcatact tttctatacc aaggtctctt ctgaacatat attagcagtg 4261 gttaatttta aagagatcat aaagaaaatt ttgtctaaaa aagattaata taaagacaat 4321 gtcttcacta gaagtggtag atgggtgccc ctatggatac cgaccatatc cagatagtgg 4381 cacaaatgca ttaaatccat gttttatatc agtaatatcc gcctggcaag ccgtcttttt 4441 cctattgatt ggtagctatc aattgtggaa actttataag aacaataaag taccacccag 4501 atttaagaac tttcctacat taccaagtaa aatcaacagt cgacatctaa cgcatttgac 4561 caatgtttgc tttcagtcca cgcttataat ttgtgaactg gccttggtat cccaatctag 4621 cgatagggtt tatccattta tactaaagaa ggctctgtac ttgaatctcc ttttcaattt 4681 gggtatttct ctccctactc aatacttagc ttattttaaa agtacatttt caatgggcaa 4741 ccagcttttc tattacatgt ttcaaattct tctacagctc ttcttgatat tgcagaggta 4801 ctatcatggt tctagtaacg aaaggcttac tgttattagc ggacaaactg ctatgatttt 4861 agaagtgctc cttcttttca attctgtggc aatttttatt tatgatctat gcatttttga 4921 gccaattaac gaattatctg aatactacaa gaaaaatggg tggtatcccc ccgttcatgt 4981 actatcctat attacattta tctggatgaa caaactgatt gtggaaactt accgtaacaa 5041 gaaaatcaaa gatct // LOCUS ADAMLPA1 630 bp ds-DNA VRL 09-AUG-1990 DEFINITION Simian adenovirus 30 major late promoter region DNA. ACCESSION M31631 KEYWORDS promoter. SEGMENT 1 of 3 SOURCE Mastadenovirus s30 viral DNA. ORGANISM Mastadenovirus s30 Unclassified. REFERENCE 1 (bases 1 to 630) AUTHORS Hsiao,C.L., Woessner,K., Cheng,S.M., Dheer,S.K., Vince,T., Lee,S.G. and Hung,P.P. TITLE Conservation of essential sequences in the major late promoter and tripartite leader of the simian adenovirus type 30 JOURNAL Gene 89, 275-277 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by C.L.Hsiao, 22-JAN-1990. FEATURES from to/span description pre-msg 126 > 630 leader sequence 1 mRNA and intron IVS 167 > 630 leader sequence 1 intron A signal 95 100 TATA box signal 273 292 downstream promoter element site 64 73 upstream regulatory sequence site 210 220 downstream regulatory sequence BASE COUNT 128 a 150 c 214 g 138 t ORIGIN Map position 16.0-16.6. 1 acggtgtgca ggcagaggtc cccgtcctcc gcatccaaaa aggtgattgg cttgtaggtg 61 taagtcacgt gaccttcctt tgggggcggg gggcgataaa agggggcggc gccgtcgtcg 121 ccgtcactgt cctctgcgtc gctgtggacg atcgccagct gctcgggtga gtagaggcgc 181 tcgaaggcgg gcatgacgtc ggcgctgagg gtgtcagttt ctacaaacga ggaggatttg 241 atgttaacct gcccggagcg atgcctttga gaagggcggg gtcgagctgg tcggcaaaaa 301 caattttttt attgtccagc ttagtggcaa aggacccgta gagggcgtag gtcgtaagaa 361 gcttcttgct ttttttccca cagctcgcga ttcaagaggt actcttggcg gttctgccag 421 tactcgggaa gcggaaaccc ctgcgcgtcg gctcggtaag cgcccagcat gtaaaattcg 481 ttaggcgctg acgatgcatt tgattaactg ctgcgtaggc acttgacgcc aggacctgaa 541 ggcggagaaa tccaccggat cggagaactt gtcgaggaag gcgtgtagcc agtcgcagtc 601 gcaaggtaag ctgaggacgg tttccggggg // LOCUS ADAMLPA2 135 bp ds-DNA VRL 09-AUG-1990 DEFINITION Simian adenovirus 30 leader region 2 DNA. ACCESSION M34220 KEYWORDS promoter. SEGMENT 2 of 3 SOURCE Mastadenovirus s30 viral DNA. ORGANISM Mastadenovirus s30 Unclassified. REFERENCE 1 (bases 1 to 135) AUTHORS Hsiao,C.L., Woessner,K., Cheng,S.M., Dheer,S.K., Vince,T., Lee,S.G. and Hung,P.P. TITLE Conservation of essential sequences in the major late promoter and tripartite leader of the simian adenovirus type 30 JOURNAL Gene 89, 275-277 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by C.L.Hsiao, 22-JAN-1990. FEATURES from to/span description IVS < 1 35 leader sequence 2 intron N-1 IVS 108 > 135 leader sequence 2 intron N site 36 107 leader sequence 2 BASE COUNT 28 a 36 c 36 g 35 t ORIGIN About 0.8 kb after segment 1; map postion 26.1-26.6. 1 aggtcgtaag aagcttcttg ctttttttcc cacagctcgc gattcaagag gtactcttgg 61 cggttctgcc agtactcggg aagcggaaac ccctgcgcgt cggctcggta agcgcccagc 121 atgtaaaatt cgtta // LOCUS ADAMLPA3 147 bp ds-DNA VRL 09-AUG-1990 DEFINITION Simian adenovirus 30 leader sequence 3 DNA. ACCESSION M34221 KEYWORDS promoter. SEGMENT 3 of 3 SOURCE Mastadenovirus s30 viral DNA. ORGANISM Mastadenovirus s30 Unclassified. REFERENCE 1 (bases 1 to 147) AUTHORS Hsiao,C.L., Woessner,K., Cheng,S.M., Dheer,S.K., Vince,T., Lee,S.G. and Hung,P.P. TITLE Conservation of essential sequences in the major late promoter and tripartite leader of the simian adenovirus type 30 JOURNAL Gene 89, 275-277 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by C.L.Hsiao, 22-JAN-1990. FEATURES from to/span description IVS < 1 35 leader sequence 3 intron N-1 IVS 123 > 147 leader sequence 3 intron N site 36 122 leader sequence 3 BASE COUNT 34 a 33 c 53 g 27 t ORIGIN About 2.5 kb after segment 2; map position 19.3-19.9. 1 ggcgctgacg atgcatttga ttaactgctg cgtaggcact tgacgccagg acctgaaggc 61 ggagaaatcc accggatcgg agaacttgtc gaggaaggcg tgtagccagt cgcagtcgca 121 aggtaagctg aggacggttt ccggggg // LOCUS TFEMERA 1730 bp ds-DNA BCT 09-AUG-1990 DEFINITION T.ferrooxidans mercuric reductase (merA) gene, complete cds. ACCESSION M32353 KEYWORDS mercuric reductase. SOURCE T.ferrooxidans (strain E-15) DNA, clones pTM31[4,5]. ORGANISM Thiobacillus ferrooxidans Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Colorless sulfur bacteria. REFERENCE 1 (bases 1 to 1730) AUTHORS Inoue,C., Sugawara,K., Shiratori,T., Kusano,T. and Kitagawa,Y. TITLE Nucleotide sequence of the Thiobacillus ferrooxidans chromosomal gene encoding mercuric reductase JOURNAL Gene 84, 47-54 (1989) STANDARD simple staff_review FEATURES from to/span description pept 65 1702 mercuric reductase BASE COUNT 323 a 542 c 574 g 291 t ORIGIN 1 gcgaccgacg gctgcgaaac gcccgccccg cgtagctgag cacatagaca ctttggagga 61 tattatgacc gagaacgcgc ccaccgaact cgctatcact ggcatgacct gcgacggttg 121 cgccgcgcat gtgcgcaaag cactcgaagg cgtgcccggc gtacgcgagg cgcaggtgtc 181 ctacccggat gccacggccc gggtcgtgct ggagggcgag gtgccgatgc agcggctaat 241 caaggcggtg gttgcaagtg gctatggtgt gcatccacgg agcgacggtg cctcctccac 301 aaacgatgga caggagctac acatcgctgt gatcggcacc ggcggagcgg cgatggcgtg 361 cgcattgaag gctgtcgagc ggggcgcgcg cgtgacgctg atcgaacgca gcaccatcgg 421 cggcacctgc gtgaacatcg gttgcgtgcc gtccaagatc atgatccgcg ccgcccatat 481 cgcccacctc cgccgggaaa gcccattcga tggcggcatc caggcggtcg cgccgaccat 541 ccagcgcaca gcgctgctgg tccaacagca ggcccgtgtc gatgaactgc gtcacgccaa 601 gtacgaaggc atcctggacg gcaacccggc catcaccgtt ctgcgcggtg aagcgcgttt 661 caaggacagc cggagtgttg tcgtccattt gaacgatggt ggcgagcgcg tcgtaatgtt 721 cgaccgctgc ctggttgcca cgggcgccag tccggccgtg ccgccgattc ccggcttgaa 781 agacactcct tattggacct ccaccgaagg gctggtcagc gaatcgatcc ccgagcgtct 841 ggccgtgatc ggctcgtcgg tggtggcgct ggaactggcg caagccttcg cccggctcgg 901 cagccatgtg acgatcctgg cgcgcggcac cttgttcctc cgggaagacc cggccatcgg 961 tgaggccatc acggcggcgt ttcgcgccga aggcatcgag gtgctggagc acacccaggc 1021 cagccaggtc gcttatgcgg atggcgaatt tgtgctagcc accgggcacg gcgaactgcg 1081 cgccgataag ctgctggtcg ccactggtcg cgcaccgaac acacgccgcc tgaatctgga 1141 agcggcgggc gtggccatca atgcgcaagg ggccatcgtc atcgaccagg gtatgcgcac 1201 gaacagcccg aacatttacg ccgctggcga ctgcaccgac cagccgcaat tcgtctacgt 1261 ggcggcagcg gccggcaccc gtgcggccat caacatgatg ggcggtagtg cagccctgga 1321 cttgacggcg atgccagccg tggtgttcac cgatccgcaa gtggcgactg tgggttacag 1381 cgcggaagcg catcgcgacg gcatcgaaac cgacagccgc atgacgctcg acaacgtgcc 1441 gcgggcgctc gccaatttca atacacgcgg cttcatcaag ctggtagccg aagtgggcag 1501 tggctcgcta atcggcgtgc aggtggtcgc cccggaagcg ggcgagctga tccagactgc 1561 cgcgctggcg attcgtaacc ggatgacggt acaggaactg gctgaccagt tgtttcccta 1621 cctgacgatg gtcgaagggc tgaagcttgc tgcccagacc ttcaccaggg atgtgaagca 1681 gttgtcctgc tgtgcgggtt gagacggatt gataaaggag tccctgttgc // LOCUS MMTELPMA 830 bp ss-RNA VRL 09-AUG-1990 DEFINITION Mouse mammary tumor virus (MMTV) phorbol myristate acetate induced mRNA, clone 14. ACCESSION M37198 M19737 M19738 M22729 KEYWORDS . SOURCE Mouse mammary tumor virus, cDNA to viral RNA, clone 14, passed in EL4.E1 cells. ORGANISM Mouse mammary tumor virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Mammalian type C oncoviruses. REFERENCE 1 (bases 1 to 202; 692 to 720) AUTHORS Elliott,J.F., Pohajdak,B., Talbot,D.J., Shaw,J. and Paetkau,V. TITLE Phorbol diester-inducible, cyclosporine-suppressible transcription from a novel promoter within the mouse mammary tumor virus env gene JOURNAL J. Virol. 62, 1373-1380 (1988) STANDARD simple staff_review REFERENCE 2 (bases 1 to 830) AUTHORS Paetkau,V.H. JOURNAL Unpublished (1990) STANDARD full staff_review FEATURES from to/span description pept 168 575 PMA-induced transcript mRNA 1 > 830 PMA-induced transcript mRNA revision 1 1 c in [2]; g in [1] revision 39 39 g in [2]; a in [1] site 94 95 intron site revision 119 119 a in [2]; g in [1] site 706 707 site of 491 bp deletion relative to MMTV virus BASE COUNT 233 a 183 c 184 g 230 t ORIGIN 1 cactgccaga tcgcctttaa gaaggacgcc ttctgggagg gagacgagtc tgctcctcca 61 cggtggttgc cttgcgcctt ccctgaccaa ggggtgcctt gcgaagagcc ttgaccaaat 121 gcagtcagat cttaacgtgc ttcttttaaa aaagaaaaaa gggggaaatg ccgcgcctgc 181 agcagaaatg gttgaactcc cgagagtgtc ctacacctag gggagaagca gccaaggggt 241 tgtttcccac caaggacgac ccgtctgcgc acaaacgggt gagcccatca gacaaagaca 301 tattcattct ctgctgcaaa cttggcatag ctctgctttg cctggggcta ttgggggaag 361 ttgcggttcg tgctcgcagg gctctcaccc ttgactcttt taatagctct tctgtgcaag 421 attacaatct aaacaattcg gagaactcga ccttcctcct gaggcaagga ccacagccaa 481 cttcctctta caagccgcat cgattttgtc cttcagaaat agaaataaga atgcttgcta 541 aaaattatat tttaccaata agaccaatcc aataggtaga ttattagtta ctatgttaag 601 aaatgaatca ttatctttta gtactatttt tactcaaatt ctgttgttag aaatgggaat 661 agaaaataga aagagacgct caacctcaat tgaagaacag gtgcaaggat gtgagacaag 721 tagtttcctg acttggtttg gtatcaaatg ttttgatcta agctctgaat gttctattct 781 cctatgttct tttgcaactt atccaaggtc ttatgtaaat ggcttagtaa // LOCUS MUSPBGD1 2663 bp ds-DNA ROD 09-AUG-1990 DEFINITION Mouse porphobilinogen deaminase (PBG deaminase) gene, exon 1. ACCESSION M28663 M29949 J04981 KEYWORDS hydroxymethylbilanesynthase; porphobilinogen deaminase. SEGMENT 1 of 4 SOURCE Mouse (strain C3H) DNA, clone PBGD. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2663) AUTHORS Beaumont,C., Porcher,C., Picat,C., Nordmann,Y. and Grandchamp,B. TITLE The mouse porphobilinogen deaminase gene: Structural organization, sequence, and transcriptional analysis JOURNAL J. Biol. Chem. 264, 14829-14834 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.Grandchamp, 06-OCT-1989. FEATURES from to/span description pept 505 537 porphobilinogen deaminase (housekeeping) exon 1 (EC 4.3.1.8) 2191 2307 porphobilinogen deaminase (housekeeping) exon 2 2457 + 2510 porphobilinogen deaminase (housekeeping) exon 3 pep$ 2475 + 2510 porphobilinogen deaminase (erythroid sp.) exon 1 pre-msg 341 > 2663 PGB deaminase (hk) mRNA and introns IVS 538 2190 PGB deaminase (hk) intron A (no splice consensus) IVS 2308 2456 PGB deaminase (hk) intron B IVS 2511 > 2663 PGB deaminase (hk) intron C pre-msg 2192 > 2663 PGB deaminase (ery.sp.) mRNA and introns IVS 2511 > 2663 PGB deaminase (ery.sp.) intron A binding 1126 1133 NPE binding site binding 312 317 Sp1 binding site binding 329 334 Sp1 binding site site 2101 2107 CACCC box site 2118 2124 CACCC box BASE COUNT 622 a 661 c 707 g 673 t ORIGIN Chromosome 9 1 ccacccccac cccacacaca cacacaaagt aaatagggct ggagagctta gtggttaaga 61 gcactgactg ctctttcaga ggtcctgagt tcaattccca gaaaccacat ggtgctcaca 121 accatctgca atagggtctg atgccctttt ctggtgtgtc taaagaagag agcaatggtg 181 tactcatata cataaaataa ttttttttaa aaagtaaaag ataataaaaa ttgaaaagga 241 aaaaaatctt tttgagttgt tctgtgcagt ggacttgagc gaaaaggctg gctatgtcgc 301 aatcctaatt cccgcccaga ggaaggcacc gccccgttga gggagggcag cggacgtgac 361 gcagagctca gcaggtcctg cagccggagt gaagtgcggg ctcgggcccc atgtgccttc 421 agtcccggcc ggcccaggtc gtcggcttct gcagacacca ggggaccgca gcggcactgc 481 cgcgcctgcg ccctgggcgg agtcatgtcc ggtaacggcg gcgcggccac aaccgcggtg 541 agttctgagc cggtgaccga tgacccgcac ttctcggggc tttctgggtg caacgattgg 601 ccccgggttg ccatgttctc gtcgtctatt ggtcggaata gttagctgtc atttttcccc 661 ccccacacct caaggttttt tttaaagggc cagtaactag gttgccctaa ggcagggaag 721 gagtgatctc gagcagtggg ggcggggttg tgagtggaaa ggtggtccgc cctgggattc 781 catccctgta ggctctggct ggatctctgt tgttcccgac cagtaaagga ttatgcacag 841 acaagatcct tttcacgaag aaggggctga ggcaaatcca gctatctcgg aatacgatcc 901 acttcattca ggggagagca caccccactt cttaaaactg tatacaaaca tcttggaggt 961 tacacgcctt ctcccgttct ccgttatgaa gtcacccagc cttagccacc cacaaaagtc 1021 ctagtagaga cacacctgaa ttgctattgt gagcggggga acccacccct gggccttgtc 1081 atttctggcc tgcctggaaa gttctgaact tgtgggcagg ctgcctgaga taaggctgag 1141 ctgggaagct tgcttatctc ctgcccaggc agtaagcagt agtcttggct atgaaaacat 1201 ttttagagca ctgggttagg gtaggaaggc ctggatttca gcacccactt tctgtctgtt 1261 catagctgtg agatgtttag acagtaattt gaccactctg catctttgct tctgtgacac 1321 gggtggaagt acctaccctg tctaacctag tagggttgtt gcaaggacaa tatgcagaca 1381 ctgctcaaat gctgttctgg gtcaatcaat taaaaaacaa attgtttgaa cttagcaatt 1441 cctttctatg ggctccctgt tgtccgaaat ttctgtgtta tttcaagccc agctaaattg 1501 caaaggctat ctcagagtcg tttgttggag gaatcttcgc agtggagtag actggagtcc 1561 aagagcaagt tttcaccttc agtgaccaag aacttgagtg tctggttata gaagaacctg 1621 tgagatgagg aacctggtgc agggaagggg gacaatctgt acagtgactc ctgtcccctt 1681 tgtatcagac tgcagaaccc agttctacct gcttggccct agacaccttt atccaaggcg 1741 ccttaacaaa agaaagaggt gtgtcctttt gagctcttgg ctctggctta agacaccaga 1801 ggaaacccgt aggcaatgac tgttaggcag tttattcttg tagtcttctg ggacttcttg 1861 aggcatgagg tggcctttaa tttaacaagc ccttgatggg atgatgttcc caaagtcacc 1921 caccaagggc atgaaagggc tgtacattag cttggttgat ttcagtcctt gttaggagta 1981 catcctggtg tctcacccag ggcttagtga ggccttctca agtgcctgag ttgttgtgga 2041 cagtgagctt gttctctagc aatgggaggc ttcagctgtc ctgccccagc ttctgtaggc 2101 cccaccctcc agcagggccc accctcactg tgccgaggct gatgggcctt atcattttgc 2161 ccacctggct gtgtgcagcc ctcccactca gaacctcctt ggccaggctg ggctttgggg 2221 ctcagtgtcc tgttgctgct gccacaacag atcctattac agcttttctt ctggtcttgc 2281 ttctctggat cccgtagagg gcagaaggta ccaaggaaga ttcaaggacc agtcctggga 2341 gtctctcctt cctagcagcc tcacctgcct aggacccggg agtcctctct cctaagcctg 2401 tgatcctagt tctttgaatg aggaaaagat cgtaacctag ggactttctt ctgcaggaag 2461 aaaacggctc aaagatgagg gtgattcgag tgggcacccg taagagccag gtgagtacag 2521 acatagcgcg ttgcctcaag aattgtaatg ctcacgggtc actagtggga accaaaggct 2581 agcatcgagc aaataagagt gtgtgagagt cgatttcatg ggggatggca gctcacttcc 2641 tctgaaaaga gagtctctgg agc // LOCUS MUSPBGD2 2763 bp ds-DNA ROD 09-AUG-1990 DEFINITION Mouse porphobilinogen deaminase (PBG deaminase) gene, exon 2. ACCESSION M28664 M29950 J04981 KEYWORDS hydroxymethylbilanesynthase; porphobilinogen deaminase. SEGMENT 2 of 4 SOURCE Mouse (strain C3H) DNA, clone PBGD. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2763) AUTHORS Beaumont,C., Porcher,C., Picat,C., Nordmann,Y. and Grandchamp,B. TITLE The mouse porphobilinogen deaminase gene: Structural organization, sequence, and transcriptional analysis JOURNAL J. Biol. Chem. 264, 14829-14834 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.Grandchamp, 06-OCT-1989. FEATURES from to/span description pept + 52 124 porphobilinogen deaminase (housekeeping), exon 4 (EC 4.3.1.8) 487 536 porphobilinogen deaminase (housekeeping), exon 5 619 674 porphobilinogen deaminase (housekeeping), exon 6 983 1060 porphobilinogen deaminase (housekeeping), exon 7 1971 2048 porphobilinogen deaminase (housekeeping), exon 8 2143 + 2218 porphobilinogen deaminase (housekeeping), exon 9 pep$ + 52 124 porphobilinogen deaminase (erythroid sp.) exon 2 487 536 porphobilinogen deaminase (erythroid sp.) exon 3 619 674 porphobilinogen deaminase (erythroid sp.) exon 4 983 1060 porphobilinogen deaminase (erythroid sp.) exon 5 1971 2048 porphobilinogen deaminase (erythroid sp.) exon 6 2143 + 2218 porphobilinogen deaminase (erythroid sp.) exon 7 pre-msg < 1 > 2763 PGB deaminase (hk and ery.sp.) mRNA and introns IVS < 1 51 PGB deaminase (hk) intron C; ery.sp. intron A IVS 125 486 PGB deaminase (hk) intron D; ery.sp. intron B IVS 537 618 PGB deaminase (hk) intron E; ery.sp. intron C IVS 675 982 PGB deaminase (hk) intron F; ery.sp. intron D IVS 1061 1970 PGB deaminase (hk) intron G; ery.sp. intron E IVS 2049 2142 PGB deaminase (hk) intron H; ery.sp. intron F IVS 2219 > 2763 PGB deaminase (hk) intron I; ery.sp. intron G BASE COUNT 704 a 605 c 726 g 728 t ORIGIN Chromosome 9; 150 bp upstream of segment 1. 1 gaaaggcagt ggccagggga ggtgagaaac catctgactc tctttcccca gctggctcgc 61 atacagaccg agactgtggt ggcgatgctg aaagccttgt accctggcat acagtttgaa 121 atcagtaagt tttcttgaga ggagtgattg gtagtgaacg ggaagccagt gaaccggagg 181 acagggcatc tctcgtttgc ctgtggtcaa agcctgcctt gtaagactat tctggctgct 241 tgtgaaggga aagaaagatt gtctcctgtg cacatctcct ccagctgccc gggctagcct 301 gacatttcca tactttctgc tttgggttct tttatgagta tgtctgcttt ttctgtcggt 361 gtgtgtatct gagagagtta ggggctgggt cttctatgcc tcagactcca ctgtgaatcc 421 agtcaaggcc tgaacgaggg gtgactcagt aggtgttaat gggtatctga ttgactctct 481 cctcagttgc tatgtccacc acgggagaca agattgttga tactgcactc tctaaggtaa 541 cgccagtcct tgtcccattc ttcttgtccc tctcccacgt gtaaggggtt cactctgagg 601 ctctctcttg cctggcagat tggagagaag agcctgttta ccaaggagct agaaaacgcc 661 ctggaaaaaa acgagtgagt gaggatggag gaatgtggta ccccgagcct agaaccccaa 721 agtggctctc caatattggc aggattgtcg ggttagactg tggagctcac aggctttcac 781 agagaagaga gccttgcctt ggagtagcct aactacctgg ggaatcagac tgccggggga 841 aaggggtaga gtagttgaga agagaccagg tcttagatct taagatgcta tcttcctgaa 901 cggtcaagga tgctggggtg ggtggtggag ataaggtcac ctactcaaag cctctctctg 961 tgcctccccc tgccgtctcc agagtggacc tggttgttca ctccctgaag gatgtgccta 1021 ccatactacc tcctggcttt actattggag ccatctgcaa gtaagcgggg aggacatgca 1081 tgggacggag ggccctgggc aggattaatc ctactgtggg aatctttgag tttttttttt 1141 ttttttttcc atttggaact taaccgctta gccgtctgtt ttgaaggttc tcagacatag 1201 tgtggcagga aagccaattg gttgacttgg ttgactattt agagtttgtg gagttgggct 1261 cagtggcacg gacctgaaat cccagctact gggaggctaa gacaggatca gagattctgg 1321 gccagcctgg gctacagagg gatttgaacc agcctgagga acttagattg tgccttaggg 1381 gcacagaagg ctggcttaca gtggcttagg tggtaaaggc attttttgct gtcaagccaa 1441 tgacctgagt tcagtccgtg gggtgcactt ggtgaaagaa gagggttgaa tcccacaagt 1501 tgtcatctga ctcatgcata catgctgtag aatgtttatg ctcctcatcc ctcaatgaaa 1561 atggaaacaa tcaaggaaat gaaatataaa acctgctggg tggtggtgcg cacgcctata 1621 atcccagcac ttgggaggca gaggcaggtg aattcaacct ggtctacaaa gtgagttcca 1681 ggactataca gagaaaccca gtcttaaaaa caaaacaaaa ctaaacaaca acaacaacaa 1741 caacaaaaaa gaaaaaacaa agaaagaaat ataaaacctt tccaaagaaa ataaaatgaa 1801 tttggcctgg tggctcatgc tataatctca gcattcagag agctgaggca ggagggttat 1861 tgtgagttaa aggctagctg gggtacagag aaaattttag gtcacctggg ctagagttaa 1921 ccctatctcc aaatgctaat acctttattt catcatcatt tgctttgcag acggcaaaac 1981 ccttgtgatg ctgttgtctt tcacccaaag tttattggaa agaccctgga aaccttgcca 2041 gagaaaaggt gagtgggcct agtgtgcggg ggagagaggc ctggacagtg gagaacagtt 2101 ggcagcctgg gttaagttta attctaaact ctctctgagc agtgccgtgg gaaccagctc 2161 tctgaggaga gtggctcagc tacagagaaa gttccccaac ctggaattca agagtattgt 2221 atcctttcag aagaaggagg ggaaaaagag ggaaagaagg accttccgaa gcaagtggtc 2281 catgcggtca gggggtcgtc tttccatctg tccgtccacc cacccaccca cccatccatc 2341 catccatcca cacatccaca gtcctttaat gttttgcttt tttttttttt tcctgagaca 2401 gggtttctct gtgtagcctg gctgtcctgg aactcacttt gtagaccaag ctggcctgca 2461 aagtgagaaa tccgcctgcc tctgcctcct gagtgctggg attaaaggtg cgccaccact 2521 gcctggcacc ctaatgtttt ttaaactcag gcctggcaat gaggacaatt tgcaaaacaa 2581 acatggttcc ttgttctata cagctgacat gttagacaga caggcaggca ctgcagatac 2641 tgaccggtga ccactcctgg tgcagggaca gaggcgcttc tgcttttact ttctgtgctg 2701 ctaagtggtt ttggttttta cagtgaatat gtgatatgtt tcataaaagt aatttttttt 2761 tct // LOCUS MUSPBGD3 800 bp ds-DNA ROD 09-AUG-1990 DEFINITION Mouse porphobilinogen deaminase (PBG deaminase) gene, exon 5. ACCESSION M28665 M29951 J04981 KEYWORDS hydroxymethylbilanesynthase; porphobilinogen deaminase. SEGMENT 3 of 4 SOURCE Mouse (strain C3H) DNA, clone PBGD. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 800) AUTHORS Beaumont,C., Porcher,C., Picat,C., Nordmann,Y. and Grandchamp,B. TITLE The mouse porphobilinogen deaminase gene: Structural organization, sequence, and transcriptional analysis JOURNAL J. Biol. Chem. 264, 14829-14834 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.Grandchamp, 06-OCT-1989. FEATURES from to/span description pept + 402 515 porphobilinogen deaminase (housekeeping), exon 10 (EC 4.3.1.8) 739 + 777 porphobilinogen deaminase (housekeeping), exon 11 pep$ + 402 515 porphobilinogen deaminase (erythroid sp.) exon 8 739 + 777 porphobilinogen deaminase (erythroid sp.) exon 9 pre-msg < 1 > 790 PGB deaminase (hk and ery.sp.) mRNA and introns IVS < 1 401 PGB deaminase (hk) intron I; ery.sp. intron G IVS 516 738 PGB deaminase (hk) intron J; ery.sp. intron H IVS 778 > 790 PGB deaminase (hk) intron K; ery.sp. intron I BASE COUNT 181 a 201 c 182 g 236 t ORIGIN Chromosome 9; 500 bp upstream of segment 2. 1 ctgtacccca gctagccttt aactcacaat aaccctcctg cctcagctct ctgaatgctg 61 agattatagc catgagccac caggccaaat tcattttata tttctttctt tcttttttct 121 tttttgttgt tgttgttgtt gttgtttagt tttgttttgt ttttaagact gggaaactct 181 gtatagtcct ggaactcact ttgtagacca gatttagcct tgaattcatg gagatctgta 241 tctgcctcca gtgctgggat ttaaaggtgt atacaccacc actcaacaaa aacacaacaa 301 aaacaaaagt tttttaaaag ttagctagag gggggaaaag agactgtggg gcagagggtg 361 cactgggtag gtcttgactt ctccttagca acgctccaca gcggggaaac ctcaacaccc 421 gccttcggaa gctggatgag ctgcaggaat tcagtgccat tgtcctggct gtggctggcc 481 tacagcgcat gggctggcag aaccgggtgg gccaggtagg agctgccctg ttctgcttcc 541 cattgaatct gcctctctcc tgccttgatt tcttggtgac cattctgcca acaacactac 601 aaccagaagc ccaggctagg gatattggga ctcattgctg gatttcctac ctgtgccttc 661 cccaggcttc ctagattgca aaccctagct cactgccttt gaacatcccc tatcccacca 721 tcttgtctct ctccacagat tttgcaccca gaggaatgca tgtatgctgt gggtcaggta 781 ggtaggtttg cctggagaga // LOCUS MUSPBGD4 1386 bp ds-DNA ROD 09-AUG-1990 DEFINITION Mouse porphobilinogen deaminase (PBG deaminase) gene, exon 4. ACCESSION M28666 M29952 J04981 KEYWORDS hydroxymethylbilanesynthase; porphobilinogen deaminase. SEGMENT 4 of 4 SOURCE Mouse (strain C3H) DNA, clone PBGD. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1386) AUTHORS Beaumont,C., Porcher,C., Picat,C., Nordmann,Y. and Grandchamp,B. TITLE The mouse porphobilinogen deaminase gene: Structural organization, sequence, and transcriptional analysis JOURNAL J. Biol. Chem. 264, 14829-14834 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.Grandchamp, 06-OCT-1989. FEATURES from to/span description pept + 154 273 porphobilinogen deaminase (housekeeping), exon 12 (EC 4.3.1.8) 459 512 porphobilinogen deaminase (housekeeping), exon 13 618 704 porphobilinogen deaminase (housekeeping), exon 14 783 956 porphobilinogen deaminase (housekeeping), exon 15 pep$ + 154 273 porphobilinogen deaminase (erythroid sp.) exon 10 459 512 porphobilinogen deaminase (erythroid sp.) exon 11 618 704 porphobilinogen deaminase (erythroid sp.) exon 12 783 956 porphobilinogen deaminase (erythroid sp.) exon 13 pre-msg < 1 1279 PGB deaminase (hk and ery.sp.) mRNA and introns IVS < 1 153 PGB deaminase (hk) intron K; ery.sp. intron I IVS 274 458 PGB deaminase (hk) intron L; ery.sp. intron J IVS 513 617 PGB deaminase (hk) intron M; ery.sp. intron K IVS 705 782 PGB deaminase (hk) intron N; ery.sp. intron L BASE COUNT 372 a 305 c 343 g 366 t ORIGIN Chromosome 9; 80 bp upstream of segment 3. 1 atcagtagtt cctgaaacct gttcatacct tgcacctcta tccatcaata atgttaaaga 61 caggtttgtt gttatgcata acccaggaag cagtagaggt gtgtttctca tcttagctct 121 attactagag aagaacagcc tgttgttctt tagggggccc tagccgtgga agtccgagcc 181 aaggaccagg atatcttgga cctagtgagt gtgttgcacg atcctgaaac tctgcttcgc 241 tgcattgctg aaagggcttt tctgaggcac ctggtaagat gggctcctcc catggtgttg 301 tggggaaacc aggaagggca gtagggaggg agatttgtca agtactcagt atgtaatgtt 361 ttgtatgtat ggagaggacc ttgatctggc ctcttgaggt ctgtggtcaa aagtggtgtt 421 aaaggccctt agagctcaaa ggaacaatat cattgcagga aggaggctgc agcgtgcccg 481 tagcagtgca tacagtgata aaggatgggc aagtaagcca gggaaatgga tgaggggagg 541 gactgtcatt tccatgtgca cccaaacatc taagtaactt tctttaaaca tcctggtaca 601 aacattttat ttcctagctg tacctgactg gtggagtatg gagtctagat ggctcagata 661 gcatgcaaga gactatgcag gccaccatcc aggtccctgt tcaggtattg actgggagat 721 gaggaggaat aaatagaact cttgtaatct tcctcttacc aaaattgtaa cctgtcatcc 781 agcaagaaga tggtccagaa gatgacccac aactggttgg aatcactgcc cggaacattc 841 caagaggagc ccagctagct gctgagaacc tgggcatcag cctggccagc ttgctgctca 901 acaaaggagc caagaacatc ctggatgttg cacggcagct taatgatgtg cgctaactgg 961 tctgtagggc acaggaaccc tggctgccac tccagtgcct acttctggct tccaagtgcc 1021 ctgtgctcca tccctagggg tgtgattatc ccaggaaatt gaaccacagg gttgttgaga 1081 cttccacttt ggaagatatg cctcaccttg gggcctccat atctgccttt ccctcagtag 1141 ttgggggctt catctcttta gagaaagtcc atgccaatct ttgaatgtaa ccaataccac 1201 taataaacca gtttagaatg tggttcttct gatagagttg gggaagatat gaataaaccc 1261 aaagcccttt taaacttgaa tgagtctgag acctttctgt tgtaaaacac gctgtgattt 1321 gcctcatgtt ctcaaaaaaa aaaaaaaaaa tcagccttta attcctacag cctgtcttca 1381 gtcgac // LOCUS HUMIBP3 10884 bp ds-DNA PRI 09-AUG-1990 DEFINITION Human insulin-like growth factor-binding protein-3 gene, complete cds. ACCESSION M35878 M35879 M35880 M35881 M35882 M35883 M35884 M35885 M35886 M36121 M36122 J05537 J05538 KEYWORDS insulin-like growth factor-binding protein-3. SOURCE Human leukocyte DNA and, cDNA to mRNA, clone #HL1006d. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 10884) AUTHORS Cubbage,M.L., Suwanichkul,A. and Powell,D.R. TITLE Insulin-like growth factor binding protein-3: Organization of the human chromosomal gene and demonstration of promoter activity JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.R.Powell, 03-JUL-1990. The sequence presented here appears in Figures 2 and 3 of ref. [1]. FEATURES from to/span description pept 2038 2440 insulin-like growth factor-binding protein-3 precursor (IGFBP-3), exon 1 5726 5952 insulin-like growth factor-binding protein-3 precursor, exon 2 6497 6616 insulin-like growth factor-binding protein-3 precursor, exon 3 8212 8337 insulin-like growth factor-binding protein-3 precursor, exon 4 sigp 2038 2118 insulin-like growth factor-binding protein-3 signal peptide, exon 1 matp 2119 2440 insulin-like growth factor-binding protein-3, exon 1 5726 5952 insulin-like growth factor-binding protein-3, exon 2 6497 6616 insulin-like growth factor-binding protein-3, exon 3 8212 8334 insulin-like growth factor-binding protein-3, exon 4 pre-msg 1906 10775 IGFBP-3 mRNA and introns IVS 2441 5725 IGFBP-3 intron A IVS 5953 6496 IGFBP-3 intron B IVS 6617 8211 IGFBP-3 intron C signal 102 1905 bp 3 promoter binding 1808 1821 Sp1 and AP-2 binding domains signal 1876 1881 TATA box site 5728 5736 potential ASN-linked glycosylation site site 5917 5926 potential ASN-linked glycosylation site site 7087 7255 region homologous to IGFBP-3 genomic sequence signal 10751 10756 Poly-A signal BASE COUNT 2796 a 2578 c 2737 g 2773 t ORIGIN 1 ctgcagacct gggacctcaa gaattgcatt tgatgccgaa cccagctcta atttcagagt 61 caaggtctct gcgagtattt aaggaacgga tgtaaacctg ggggattcgt tttgtttcct 121 tcaattttcc aatgaaatca gagatcctgt tcttgggtgt caacgcagat actagaagga 181 ggtgatacaa gagaaaggaa acagcaagcg acgattatgg cacggtttcc tgtaaacaag 241 gttgagtgta gccacagcct gagcactgtg ggagaagagc tcataagaaa atgacggtgc 301 tgggccttcg tcaccccggg gccctccatt gttcttgtct ttggtctctt tttatttgta 361 gaggtccaat tatttattta tttagtacaa gagggaacga aattgatctt tccattctaa 421 aaggagagta tatatgtata aaaggaagct gtatagatat gggggaagag gtggacaggg 481 ggaaaagggg agaggacgag agagagaaag ggagggagag ggacaaggag agacactggg 541 cgagagatcg attaggagag acagaaatga tgaatgaaga ttaacttcac ccaaggcttc 601 gtcgctggag gggaatggag gagctcctga tttgctatta ctactccaaa ctgcaaaggg 661 ctccttcaag tcacctatcc acctcctaag gcaagcgtcc aatttcaaca gcgttcagga 721 aagtctcctc ccgcggaggt ctcaccgctt cccactccac ccccacaaac tctttggaaa 781 agtgccttga aaaatttaat cctcaatcca atcctggacc accagcgtcc tctgttggtc 841 accgaaggag ggggtgcgca gacaaaactg aagaaactcg agtgccagag aaggccgaca 901 ggagttacag cgacctcagc gcgcaattgc gccccgaact ttactgaaaa gtgtttagat 961 tgcagagata agctagaatc ccaacgcatc gagaatacag taatacgaag tcgccttcaa 1021 aaaatgacaa tgaaaattgc ctattaaagg actatttggt taattacgtt tcagcagtgc 1081 ccagtttatt gtctttatta ttcttttgtc gtgggtgtaa actccatttg aaaacataat 1141 cagggagaat acccaagaca agaagaacag ttgtcattta aaatatttga aaagccctgc 1201 cttaaggagc attcgcttgc cggtccactc ttaattgggg acttgcggtg tagcaacacg 1261 tgagagtctt cttgcgttga gaagtaagcc tggaaaggcg aaggccccgg ggcatcttca 1321 gatgcgtatt tgtgggcccc tggggatata aacagcccag cgggtgtaaa ttaaaccccg 1381 cagtgccttg gctccctgag acccaaatgt aagtcagaaa tgtcccaaga cttcgcctgc 1441 caacggaatt aaattttaga aagctccacg aggtacacac gaatgcggag cgctgtatgc 1501 cagtttcccc gacaccggct cgccgcaggg agacctcacc ccgagagcgg aaggggtaag 1561 ggcggcgggg tcaaggagat cgggggtgct gagttggcca ggagtgactg gggtgaccgg 1621 gggtgctgag gtggcctgga gtgccggggt ggccgggcac accttggttc ttgtagacga 1681 caaggtgacg ggctccgggc gtgagcacga ggagcaggtg cccgggcgag tctcgagctg 1741 cacgcccccg agctcggccc cggctgctca gggcgaagca cgggccccgc agccgtgcct 1801 gcgccgaccc gcccccctcc caacccccac tcctgggcgc gcgttccggg gcgtgtcctg 1861 ggccaccccg gcttctatat acgggccggc gcgcccgggc cgcccagatg cgagcactgc 1921 ggctgggcgc tgaggatcag ccgcttcctg cctggattcc acagcttcgc gccgtgtact 1981 gtcgccccat ccctgcgcgc ccagcctgcc aagcagcgtg ccccggttgc aggcgtcatg 2041 cagcgggcgc gacccacgct ctgggccgct gcgctgactc tgctggtgct gctccgcggg 2101 ccgccggtgg cgcgggctgg cgcgagctcg gggggcttgg gtcccgtggt gcgctgcgag 2161 ccgtgcgacg cgcgtgcact ggcccagtgc gcgcctccgc ccgccgtgtg cgcggagctg 2221 gtgcgcgagc cgggctgcgg ctgctgcctg acgtgcgcac tgagcgaggg ccagccgtgc 2281 ggcatctaca ccgagcgctg tggctccggc cttcgctgcc agccgtcgcc cgacgaggcg 2341 cgaccgctgc aggcgctgct ggacggccgc gggctctgcg tcaacgctag tgccgtcagc 2401 cgcctgcgcg cctacctgct gccagcgccg ccagctccag gtgagccgcc cgccaggtgc 2461 gctgcgtgca gcaccgccac tggcgccgaa gggcctgggg gttgctgggt gccgctgcgg 2521 gagactccgc ttttcttctc actggagata atatgtgggg aaactgaagg cgctccggga 2581 aaggtgaagg cggtcgccga gggaccctcc ccagccggcc ctctacttgc tcgattctct 2641 aagtgcagag tacttgtaaa ttgcaaagcg ctttcagtga aaatgggtaa aggtttccgg 2701 agctgagggg agcggtaccg atgtttagct gttggaaaga tcctggacac aggagattct 2761 cctcgccccg cacgggtgca cacggactgc aatcccaggg atgcttgggg atggggggat 2821 ataggcggat ttggaccaag gaaggtgggt aggcacgttg taggaaatag tacctctctt 2881 ttaaaatact gactttgcac agccttttgg tttgcaaagc aatgtctagt cccggtatgt 2941 ccaaaaacaa gtaaagtgga ttcgggtttt gatatcttct gcggttggaa aacctgaagc 3001 tgaaaaagaa gtaacttctt aaggttaccc agcggccaca acagagtgta ggtttgaact 3061 ccgcgtgcca ctttcagtac cataccattc ttacaactcg ggccacccct gcacctgcgc 3121 cgacctcaaa caaacttcca ggtgcgtggt gggtgcgggc aatgtggact aagtcaattt 3181 caatgacacg gcaagggaat tggaatcagt cctaggctgt ctcccttctt aatctgaaat 3241 gggggggggg aatgagatgt tgttaagggg agccccagaa gaggaaaaat gcaaacattt 3301 ggcagagtta ccctcttgct tagccactat cagtatcagg cagacagcga ctctggtaag 3361 ggcatcacat tgttccctta aaaaaaggag cgggggttgt ttaaatggat ttggcagctg 3421 ttctttcaag cattcttagc cagcctcacc tagttatatg agaaataaag ttcctgcctt 3481 gcacagctga aggctgggag aattctcccc atcctaattc ccccaactcc ccaacgatca 3541 cgttggacag atgtcactgg gcaggccccc atctagggct agcaggatga acagtccctt 3601 tataatttat gtagctgtag agttccacgc ccgggtgaag ttattttctg gctcggcaag 3661 gctggctctg ttcacccctg agaaatgctg gattcatgga aaggcaagat gcctgaaaca 3721 tacactggct ctggtcagct gttaaagctg ctggaggcat ttgtctctcg gggcaaagtt 3781 atgtcatttg ccaagtgtcg tacattattg tgcattttgg ggtattcaaa aagtgatctt 3841 agaaatactg atacacatcg tcattcttgg gctttagcaa tcatcatgat taccacctta 3901 gtagcactgt agtataggtt gatgtgagtt ataagattat aaaaagatct aagtgacttc 3961 tagaatctat ttgacaaaaa aaggtaaatt ttcgacagtc aaaagtcaca attatctgtt 4021 gcttaaatag aactgttttg tcttcatgcc ctagtctgca gcccaggcat taagaagaaa 4081 ccaaggaaat ttaagaaatt actcaaggtt cttagaaaag aagtataaat acgtttattt 4141 acatgttctt agagtattta cattcttagt atctctttta tctcagtatt tccttgaaaa 4201 agaaagcaag ctaagattaa aagaaattga aaccaaatcc tcgcaggtag ggacctcctc 4261 tgtgaggctc tgtgctggac cctgggaatg tgtgcttccc aaggtatgaa accccttggg 4321 gaactttaca gcaggacctc agtgagctgt ttggcaggtg aggaaactaa gacccagaga 4381 ggagagggac tttcctaagg ccctggtgag tgacctgcca gtagccactt ccaggggaga 4441 gcagagcatc tgcagccaaa tcattgcagc cccaggtagc tttctagata gactgtggac 4501 cagatgggcc acctgagctc cctgctaggg ttacacatta tagccctgtt tgtgtagtag 4561 agaaatttca tgactctcaa ttgtggactt aagccgatgc ctccagacct tggcatggtc 4621 cacaggccct gggagcatgg gctctgaatg tagcctttga tccccatagc ggtcttacag 4681 cccctccaag ttcattctga agaaggaatg gagtgagaat cctggctgca gatccagtct 4741 tgaatttagt catatactta aaattccaat tcaactgtta acattccagc atccatttta 4801 agcatcagac tttcttcatt tagcactttt tattataaaa gggagatctg ctggaggggg 4861 atttctccta ccccaccccc acccagggaa ggaaaagctc tttggcactt agaagtctga 4921 gccgtgagtg ggactttggc attgtctgca tccatgtgct gctgtgttca cccggggtga 4981 aaaggactca cttaggcagg caccagcaag atgcacaggg tctgtgtaga ccttgagttt 5041 tagagatgta acggggacct agaaaacaag ccaccaacat gcttgcatga ttctgagccc 5101 ctgaggcaaa acgctttgca ggtaataatt cagttttccc atctgagctg gacaccaagc 5161 tcttataagc gtgtttacct ggtagcattg aggacggtac tggtcaacct tggaattccc 5221 ataagggctt gttacaactc agactcgtgc cgccactcca gcgtttccgg agtggagaat 5281 gtgcatttct tccaagtccc cgggctgccg ctgctcccgc gggtgggagg accacacttg 5341 gagttgactg caaaatttct gagccggcgc tgcagcagcc tcccgtggct caggtctgcc 5401 ccctgccggt ggaagatgaa gcatactgcc ttcacctact gaggggcact gaagcgtttg 5461 tctgccttct ttagttgcag ctacttagga agagcacctg tcagattgac tttcaaacag 5521 ataacttctt gaggtagagc aaccaccatg tagtgagtag tatgatggaa taatacttca 5581 tcgaggtatt taaaaaaaaa acctcacttg gattgccaac taatattgtc atttacatgt 5641 gacctggttg caacgttaag atttttacaa gactgtgata gatattgatg actctcatgt 5701 gtttgtctct cttgggcgtt ttaaggaaat gctagtgagt cggaggaaga ccgcagcgcc 5761 ggcagtgtgg agagcccgtc cgtctccagc acgcaccggg tgtctgatcc caagttccac 5821 cccctccatt caaagataat catcatcaag aaagggcatg ctaaagacag ccagcgctac 5881 aaagttgact acgagtctca gagcacagat acccagaact tctcctccga gtccaagcgg 5941 gagacagaat atgtgagagc ttttcctctt gttaaaggag gagggcaaga cctgccaagc 6001 ctgggtactc agagcctctt gagggcaatt cttactcaac aaaccccagc gcctggctga 6061 tgggtgggca acccctagcc cctctgtgcc ctacctctct cctctcctta cataaagaat 6121 attgaccctt ttggagaatc ttatgaggat caagctgaaa taacactctt aaaagcatat 6181 gggatgtcat aaagacctct gcagataatg aaaatattct cataaagata gttttattta 6241 cttcatcctc tatgcttgtt gacctgctat tggttccatg ccagcttctg tgccttactc 6301 tgggaagagc aaaaaggaga cagggagtga tggttagctt attcggggga ctttcgtgct 6361 acatcagaca taaggtatct gaggagcaaa ttacaggtcc cacttttggt agttgtgcag 6421 catcgtaaga tttttaaagc acacattcta gagtaaaaac tgtgactctg ttgctctggt 6481 ccttcctgat ccccagggtc cctgccgtag agaaatggaa gacacactga atcacctgaa 6541 gttcctcaat gtgctgagtc ccaggggtgt acacattccc aactgtgaca agaagggatt 6601 ttataagaaa aagcaggtga gtgaggtcct cagtgtgttt tcttcctctt ctgttgacac 6661 agaggagaaa cccatgtcac cagcgcccag gctcttgtgg ccatagctct aactctgagc 6721 ctgtgcagca ccagtgccca ggacttggtg ccagtctcag gaggtcagac caagggctgc 6781 tttgacttgt tgctctgagt gctgctatat tggccataat cctcaaccct agtgcctttc 6841 caccacccgc ttcccactcc tgtcctttca atggttcacc cacaggcgga caagatgctg 6901 cccagtggca ccctttataa actgcaagtg gacatgttaa cacatttgtt aatgctgcgt 6961 cagggagtga catttcaaac aactattata gtcagtttcc aagaagtgtg acatgaggtc 7021 ataccacaaa aaagcttacc ctgaaatccc acaatcgtcc cctttcctac tgatgccttc 7081 ccgatagtga gcaggttgca atattaagat tttgaaaagg ctgttgctag atgttggtga 7141 ctcgtgtgtc tctgtctccc ttgggctttt caaggaaatg ctagtgagtg gggggatgac 7201 tgcagcatgg ccagcttgga gagcccagcc atccccagca cataccaggt gtctgtcttg 7261 gcgtggaggg gatggaactt gaaatcagac actcggtcca tgctggggat ggccagtctc 7321 tccaaactgg catgtggtct tcctccgagt cactggcatt tccctagaaa gtccaagtga 7381 gaagaaggca tgagagtcat caacatcaaa caacagtctt ttcaaaatct ttatattgca 7441 acatagtccc attcctggaa aaggaatgga gtgagaatcc tggctacaca tcagccccaa 7501 atgtagtcat tgcctaaaat cccaattaac ctgaaaatga tcaaacaaat ttaagatata 7561 gtaatattaa gctgtaataa atatgcttct ataggctttg tgttatgtga tggcactatt 7621 tcaattggct ttctaattgg acaattgata ctatgctatc tacagaattg gcctttggag 7681 acctaagtga gccacagtgg cctcagggtg accatatact aggattcata gcagtggcca 7741 cagtcagaag cctaagcttt cctccattgc cattgctcgt ttataccacg tttctgtcaa 7801 agtcatattc attcaacaaa gtcatactga gaaggtgtca tgtgaggctg gatgtgggct 7861 ccaaagtcat agctgtgaca ttcgcaggca gcgggatgtt ctcagttcca catttggcag 7921 agaagtcagt caagaggttc tacaagggct ggtgtccacc ttatactcct agaaacacaa 7981 aactgccccc acccccgctt tcttggagca ggaagttaca cccacacgca tgcacaggcg 8041 cacactcagc gggcctaggc agcgtggctc ttgtgttgcc ttagctgaaa tttctgttgt 8101 gctttctcag catagcagag tcacgctggc aaaccatcat gcgccctggc caccgacctg 8161 acaccagacc caggagcatt cacttctctg tcttctgttt ctctcccaca gtgtcgccct 8221 tccaaaggca ggaagcgggg cttctgctgg tgtgtggata agtatgggca gcctctccca 8281 ggctacacca ccaaggggaa ggaggacgtg cactgctaca gcatgcagag caagtagacg 8341 cctgccgcaa gggtgagtac tcaggagggg cagcctgggc tccagggcct cactgtcctt 8401 ggaccagcct caggggctgg gcgtggccac tggccttccc caggcttaca gacccaggag 8461 ctgcagctca gggccagaaa gagcaaagca aataggacag agccctcaga agggtgcagg 8521 gagagggaga ccccatcaac ccaaccaaac aagtgtgggg aaggaggccg gccagtgcac 8581 ctcagggaca ctctgcttta tctcagatac ctcacagcac ctaagctatc attcatccac 8641 acacaaagtg aagattttca aagttaggct ttacccgtga gtctggaggt catttatctt 8701 cacagagaac gtttatcgca gactgctaag atacatgttc taattaagat gtgatgtgag 8761 aacgctgaat gctcgttgga gactcagttg aagtgcagct ttttttctgt caaatatata 8821 atgaatattc tgttagtctg tggctaatat aattttaata aagttaattt aaatctgata 8881 gaaaaatgaa attttaaacg ataattttag agaatgctat tatatccagt cttctttttt 8941 cttttaataa atgagggaac tattggggga aaggaataaa tacattttct ttcattttat 9001 taagacaaat ttagtaagca gaagaaattt gcatgtttag ttataagggt ttcttttttc 9061 cttacaagtt ggaaaaaata attctaattt aagggtaact ctttgacaat gaacactgtg 9121 agcagcatct ggtactcgtt gctttgtttg aaaacatgag ttgagacccc agccgcactt 9181 gcagcctagt gccattagcc tgcaggctgt gctggatatc tcagggcaag agtcgagccc 9241 ttttgatttt ggggggatta tttcaatata tttgcttttt ctttttgttt tagttaatgt 9301 ggagctcaaa tatgccttat tttgcacaaa agactgccaa ggacatgacc agcagctggc 9361 tacagcctcg atttatattt ctgtttgtgg tgaactgatt ttttttaaac caaagtttag 9421 aaagaggttt ttgaaatgcc tatggtttct ttgaatggta aacttgagca tcttttcact 9481 ttccagtagt cagcaaagag cagtttgaat tttcttgtcg cttcctatca aaatattcag 9541 agactcgagc acagcaccca gacttcatgc gcccgtggaa tgctcaccac atgttggtcg 9601 aagcggccga ccactgactt tgtgacttag gcggctgtgt tgcctatgta gagaacacgc 9661 ttcaccccca ctccccgtac agtgcgcaca ggctttatcg agaataggaa aacctttaaa 9721 ccccggtcat ccggacatcc caacgcatgc tcctggagct cacagccttc tgtggtgtca 9781 tttctgaaac aagggcgtgg atccctcaac caagaagaat gtttatgtct tcaagtgacc 9841 tgtactgctt ggggactatt ggagaaaata aggtggagtc ctacttgttt aaaaaatatg 9901 tatctaagaa tgttctaggg cactctggga acctataaag gcaggtattt cgggccctcc 9961 tcttcaggaa tcttcctgaa gacatggccc agtcgaaggc ccaggatggc ttttgctgcg 10021 gccccgtggg gtaggaggga cagagagacg ggagagtcag cctccacatt cagaggcatc 10081 acaagtaatg gcacaattct tcggatgact gcagaaaata gtgttttgta gttcaacaac 10141 tcaagacgaa gcttatttct gaggataagc tctttaaagg caaagcttta ttttcatctc 10201 tcatcttttg tcctccttag cacaatgtaa aaaagaatag taatatcaga acaggaagga 10261 ggaatggctt gctggggagc ccatccagga cactgggagc acatagagat tcacccatgt 10321 ttgttgaact tagagtcatt ctcatgcttt tctttataat tcacacatat atgcagagaa 10381 gatatgttct tgttaacatt gtatacaaca tagccccaaa tatagtaaga tctatactag 10441 ataatcctag atgaaatgtt agagatgcta tatgatacaa ctgtggccat gactgaggaa 10501 aggagctcac gcccagagac tgggctgctc tcccggaggc caaacccaag aaggtctggc 10561 aaagtcaggc tcagggagac tctgccctgc tgcagacctc ggtgtggaca cacgctgcat 10621 agagctctcc ttgaaaacag aggggtctca agacattctg cctacctatt agcttttctt 10681 tattttttta actttttggg gggaaaagta tttttgagaa gtttgtcttg caatgtattt 10741 ataaatagta aataaagttt ttaccattaa aaaaatatct ttccctttgt tattgaccat 10801 ctctgggctt tgtatcacta attattttat tttattatat aataattatt ttattaaaat 10861 gttccctgct ttccctttta gcaa // LOCUS PINCABII2 583 bp ss-mRNA PLN 09-AUG-1990 DEFINITION Pinus sylvestris cab II/2 mRNA for chlorophyll a/b-binding protein. ACCESSION M37489 X14507 KEYWORDS Cab gene; chlorophyll a/b-binding protein; thylakoid protein. SOURCE P.sylvestris cotyledones cDNA to mRNA, clone pINE ab 11. ORGANISM Pinus sylvestris Eukaryota; Plantae; Embryobionta; Pinophyta; Pinicae; Pinatae; Pinaceae. REFERENCE 1 (bases 1 to 583) AUTHORS Jansson,S. TITLE ; JOURNAL Unpublished (1989) see COMMENT for author address STANDARD simple automatic REFERENCE 2 (bases 1 to 583) AUTHORS Jansson,S. and Gustafsson,P. TITLE Type I and type II genes for the chlorophyll a/b-binding protein in the gymnosperm Pinus sylvestris (Scots pine): cDNA cloning and sequence analysis JOURNAL Plant Mol. Biol. 14, 287-296 (1990) STANDARD simple automatic COMMENT [1] Author address Jansson,S. Plant Physiology Umea University S-901 87 Umea Sweden FEATURES from to/span description pept < 1 455 chlorophyll a/b-binding protein BASE COUNT 135 a 133 c 171 g 144 t ORIGIN 1 cggagctgtt ggttaaaaac ggggtgaaat ttggggaagc tgtgtggttc aaggccgggg 61 cgcagatatt ctcagaggga ggccttgact acctggggaa ccccaacctg atccacgcgc 121 agagcattct agccatctgg gcctgccagg ttgttctcat gggattgatt gaaggataca 181 gagtgggagg aggacccctt ggagaagggt tggaccctct gtacccaggg gatgccttcg 241 acccactggg gctggccgac gaccccgagg ccaaggcgga gctgaaggtg aaggagatta 301 agaacggtcg gctggccatg ttctccatgt tcggtttctt cgttcaggca atcgtgaccg 361 ggaagggccc cattgaaaat ctctacgacc acttggcgga ccccgttgcc aacaatgcct 421 gggcctacgc caccaatttc gttcctggca agtgaaggtg acggaaaata aaagaggcct 481 gtgatctgtg catcaatcat ttgacagcct tagtgttaat aaaatatgtt ctttcagctg 541 tatgtatttg ttggtgatct tcgttaataa aatattttct ttc // LOCUS RATMHCIAB 1563 bp ss-mRNA ROD 09-AUG-1990 DEFINITION Rat MHC class I cell surface antigen mRNA. ACCESSION M25319 KEYWORDS antigen; cell surface antigen; class I gene; glycoprotein; histocompatibility antigen; major histocompatibility complex. SOURCE Rat cDNA to mRNA, clone pARI.5. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1563) AUTHORS Radojcic,A., Stranick,K.S., Locker,J., Kunz,H.W. and Gill,T.J.III. TITLE Nucleotide sequence of a rat class I cDNA clone JOURNAL Immunogenetics 29, 134-137 (1989) STANDARD full staff_entry COMMENT Draft entry and sequence in computer readable form for [1] kindly provided by J.J.Rushton 24-JAN-1990. FEATURES from to/span description pept < 1 1134 MHC class I antigen (AA at 1) sigp < 1 60 MHC class I antigen signal peptide site 61 330 alpha-1 domain (exon 2) site 381 606 alpha-2 domain (exon 3) site 607 882 alpha-3 domain (exon 4) site 883 1131 transmembrane and cytoplasmic domains (exons 5, 6, 7, and 8) signal 1535 1541 poly-A signal BASE COUNT 324 a 412 c 471 g 356 t ORIGIN 1 gcaccgcgca cgctgctcct gctgttggcg gccgccctgg ccccgaccca gattcacgcg 61 ggctcacact cgctgcggta tttcgacatc accgtgtccc ggcccggcct cggggagccc 121 cggttcatct ctgtcggcta cgtggacgac acggagttcg tgcgctacga cagcgacgca 181 gagaatccga gattcaagcc gcgggtccgg tggatggagc gggaggggcc ggagtattgg 241 gagcggatca cacggatcgc caaggaaagc gagcagattt accgagtggg cctgaggacc 301 ctgcgcggtt actacaacca gagcgagggc ggctctcaca ccatccagag attgtctggc 361 tgtgaggtgg ggtcggacgg gatcctcctc cgcgggtatg agcagttcgc ctacgacggc 421 cgcgattaca tcgccctgaa cgaagacctg aaaacgtggg cggcggcgga ctttgcagca 481 gggatcaccc ggaacaagtt ggagcgggat ggtgaggcag agagactcag ggcctacctt 541 gaaggcggga gcgtggagtg gctccgcaga tacttggagc tcaggaagga gacgctgctg 601 cgctcagaac ccccaaaggc acatgtgacc cttcactcca gacctgaagg tgatgtgacc 661 ctgaggtgct gggccttggg cttctaccct gctgacatat tcctgacctg gctgttgaat 721 ggggaggacc tgacccagga catggaactt gtggagacca ggcctgcagg ggatggaacc 781 ttccagaagt gggcatctgt ggtggtgcct cttgggaagg agcagaatta cacatgccat 841 gtggagcatg aggggctgcc tgagccgctc accctgagat gggagggtcc tccctccgcc 901 aactccaaca cgggaatgtc tgttattctt ggaactgtgg ccatcattgc agttatggcc 961 atcattgcag ctgtggcctt cattggacct gttgtgagga agaggtggat aaaaacagct 1021 tttcttctca caagtggaaa aggaggagac tacacccctg ctccaggcag ggacagctcc 1081 cagagctctg atgtgtctct cccagattgt aaagccatga agacagctgc ttgaggtgaa 1141 ctggatgccg gccgatgtgt tcaggtctct cttgtgacat ccggagccct cggttctctt 1201 tggacaccga tgcctgggat tccctatgat cctatgactt cggtataggg gactatggga 1261 cccggcccaa ccctacacac cgggacccta tccctgcact gtttgtgttt cctttcacag 1321 ccaaccttgc tggttcagcc tgggttgggg cctggacatc tgcatcctat cactcagtgg 1381 tgctttgaac tgcaactcct cacttctaca ctgagaataa gaatctgagt gtgaacttga 1441 ctgttcacat ccttgacaca gtgttgactg ctttttaaat tactggattg agaatactta 1501 gaggttgttt tttgtttttg ttttgttttg ttttaaataa atggcaggtg gagaagcttc 1561 cag // LOCUS HUMINT01 42 bp ss-mRNA PRI 09-AUG-1990 DEFINITION Human leukocyte adhesion glycoprotein p150,95 mRNA, exon 1. ACCESSION M29165 Y00093 KEYWORDS integrin; leukocyte adhesion glycoprotein; protein p150,95. SEGMENT 1 of 7 SOURCE Human cell line HL-60, cDNA to mRNA, clone lambda-X47. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 42) AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A. TITLE cDNA cloning and complete primary structure of the alpha subunit of a leukocyte adhesion glycoprotein JOURNAL EMBO J. 6, 4023-4028 (1987) STANDARD simple automatic FEATURES from to/span description mRNA < 1 > 42 P150,95 mRNA, exon 1 BASE COUNT 9 a 15 c 10 g 8 t ORIGIN 1 bp upstream of EcoRI site; chromosome 16p11-13.1. 1 gaattcctgc cactcttcct gcaacggccc aggagctcag ag // LOCUS HUMINT02 3690 bp ds-DNA PRI 09-AUG-1990 DEFINITION Human leukocyte adhesion protein p150,95 alpha subunit gene, exons 2 - 6. ACCESSION M29482 Y00093 KEYWORDS integrin; leukocyte adhesion glycoprotein; protein p150,95. SEGMENT 2 of 7 SOURCE Human DNA, (library pWE15), clone 30.1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3690, exons only) AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A. TITLE CDNA cloning and complete primary structure of the alpha subunit of a leukocyte adhesion glycoprotein JOURNAL EMBO J. 6, 4023-4028 (1987) STANDARD full staff_entry REFERENCE 2 (bases 1 to 3690; exons and intron/exon boundaries only) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. TITLE Genomic structure of an integrin alpha subunit, the leukocyte p150,95 molecule JOURNAL J. Biol. Chem. 265, 2782-2788 (1990) STANDARD full staff_entry REFERENCE 3 (bases 1 to 3690; exons and intron/exon boundaries) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. TITLE Genomic structure of an integrin alpha subunit, the leukocyte p150,95 molecule JOURNAL J. Biol. Chem. 265, 12750-12752 (1990) STANDARD full staff_entry REFERENCE 4 (bases 1 to 3690; exons and intron/exon boundaries; revises [3]) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. JOURNAL Unpublished (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Garcia-Aguilar, 19-OCT-1989. FEATURES from to/span description pept 1028 1064 integrin alpha subunit precursor, exon 2 (first expressed exon) 1666 1771 integrin alpha subunit precursor, exon 3 2391 2494 integrin alpha subunit precursor, exon 4 2795 2865 integrin alpha subunit precursor, exon 5 3020 + 3131 integrin alpha subunit precursor, exon 6 sigp 1028 1064 integrin alpha subunit signal peptide 1666 1685 integrin alpha subunit signal peptide matp 1686 1771 integrin alpha subunit 2391 2494 integrin alpha subunit 2795 2865 integrin alpha subunit 3020 + 3131 integrin alpha subunit pre-msg < 1 > 3690 P150,95 mRNA and introns IVS < 1 975 P150,95 intron A IVS 1065 1665 P150,95 intron B IVS 1772 2390 P150,95 intron C IVS 2495 2794 P150,95 intron D IVS 2866 3019 P150,95 intron E IVS 3132 > 3690 P150,95 intron F BASE COUNT 764 a 1050 c 1020 g 856 t ORIGIN 1 bp upstream of BamHI site; chromosome 16p11-13.1. 1 ggatcccttg ggcccaggag ttcgaagcag cagtgaacta tgcacccact gcactccagc 61 ctgggtggca gagcaagacc ctgtttctga aattaaaaaa aaaaattgat gtacattagg 121 gggcttccac ggcctgagct gcttcccctt gctttcctcc cagtggccct gaccttgtct 181 cttacaactt cccaccctga ctgtctggtt acccattgct gatttcacac acagaccctc 241 ctgtaccctg cctcatccat gtctggctgc tctgtcatct cccaactttg gttgctttca 301 atgctcagct caagcaccac ctctttcagg aagccttctc agaaagccac accttcacaa 361 cccgggtgag gcaccctgtg gtctctgtgc ttccccctca cagcaatgaa cttgctgttt 421 atacatctgc ctctccactg accccagggc tggtgctttg tggtttatat tttcttcccc 481 acctagcaga gggcttgcat ctccaggctc aaattaggct tcttgaataa atgatgaata 541 aatgagtgaa tgaatgaatg aacaaatact cgctctgtgc tcctcctagg gacccggatc 601 ccccactcct tggcccagac tttccaggtc agagtggagg cctcccacca gggtttcctt 661 taggggtcct gaggggtggg catctgccca aaccccctcc agtctggctg aaatttcaag 721 gtcaaggggt ccttctggca gtcaagggtg agcctgggag gggcagggca gggatttgca 781 tccatctaag caaagggcat caagccaagt catctgatga gagtgactcc ggttgggggg 841 tgggggcgtg tgggagccga gcctgtcctc ggatcagttg cgtactctgc ccgccccctc 901 tgactcatgc tgacaatctt cttccttccc ctggccacct ctctgcccac ttgcttcctc 961 agtaccttgg tccagctctt cctgcaacgg cccaggagct cagagctcca catctgacct 1021 tctagtcatg accaggacca gggcagcact cctcctgttc acaggtgagc ctggacccca 1081 atgaagtagg gctggggacc caggcccaag ggagccaggg ccctgaactg ggggctcagg 1141 ctggggggtt aggatctggg taggaagaga gactcagtca agcctgaggg ggaggcaggc 1201 acatagggtt tgagatttgg agtttgtgga gggagaggat attgatgaac caattttggg 1261 agagttccag agatgctgga agagaggcca gttgtctctg tactgcagag atttttaaaa 1321 taggcagaat gcgccaactt gtgctctgtg gacaggatgc tttggtccgc aagttttcct 1381 ggacgcactc tcatagcgcc cgaggtgcac gttggggaaa gatccttttt agagcctggg 1441 tactgctctg cagaaatgga gaactgcaac tcgatagtgg atggtgggca aggggcatcc 1501 ctggaccctg ggaaggagag aaggggatga gttgggtgtc cagaagaccc aggcaccccg 1561 ggcatcaggc tcggagggga gattgggacg ctggggccgg gggtggaggg cagccaggca 1621 gaaggaagac ccttctccaa agctctcttc ccacctcttt cccagcctta gcaacttctc 1681 taggtttcaa cttggacaca gaggagctga cagccttccg tgtggacagc gctgggtttg 1741 gagacagcgt ggtccagtat gccaactcct ggtgaggccc aggtggtgct cctttggctc 1801 catccatcct ctccctgctc aggccccatc cccccggccc tgccctgtta tttgcaaact 1861 ctcctctctg tctggtgtag cgactgccct ggctaatgaa gatttgcctt gaaggcaggc 1921 acggtctcac agctaacatt tacagagcag taagtgcagt gccaggctca tcacaggtgg 1981 atgctgattt agtccacacg acagcctgtg agtaggaatc agtcgtgcaa caaacactta 2041 tttgtttttt ctttcttttt ttctatacat ttaaaaatat atagagacag ggtctcacta 2101 tgttgcctgg gttggtctca aactcctggg ctcaagcaat cctcccgcct cagcctccca 2161 aagtgctggg attccaggtg tgagccacca cacccagact caacaaatat ttcttgtctc 2221 catacgccag agaatccaac agacagaaat cccttccaca tggactttaa attattaaaa 2281 tccatcttgc agatgaggaa gctgaggctc agggagggaa cgcaaacttg ccggagtggc 2341 agctgtcggc gtccacactc ttacctaaag tgttctttgt ctcctcgcag ggtggtggtt 2401 ggagcccccc aaaagataac agctgccaac caaacgggtg gcctctacca gtgtggctac 2461 agcactggtg cctgtgagcc catcggcctg cagggtgagt caccgcccct cccgggaccc 2521 agggccgggc tcccaggctt ccctgctcca ggggcccgtg gactcccgga gtgtcacttt 2581 cagcttccct gtgtctgaga ccctcaccct cagatatgct tcctggcccc ttaaggcctc 2641 cccgcccatc gcactcccgc agctctgtca agacccgaca gcttccttca ccgtcagacc 2701 tccttgtctc ccaggtggag gtgacccctg cccagctctt ccacagcctt ctctgtaggg 2761 cccgagagtg accatgcaca tatctgtccc acagtgcccc cggaggccgt gaacatgtcc 2821 ctgggcctgt ccctggcgtc taccaccagc ccttcccagc tgctggtgag tggccctggg 2881 tcacaggagg cttctgaggg agggagggag gagccggggc cgccgggggc tgggactctc 2941 ctgtagggtg gaggttccgg catctgaggg tgggaggtac atgccaggga gtgcccccag 3001 cagcccgctg tgtccccagg cctgcggccc caccgtgcac cacgagtgcg ggaggaacat 3061 gtacctcacc ggactctgct tcctcctggg ccccacccag ctcacccaga ggctcccggt 3121 gtccaggcag ggtgagtgtc gggaccacca aggctttgag gagctcacgc acatccaatt 3181 gggggtgcgg tgggctagag acagtcttgc cagagtggat cagaaagaag ggatctggaa 3241 aaagagttac ctcgtgttgc agtggttcct gacgctgctg cccgcacatc ctgccgatcg 3301 ccgcacgctg ccggaccttt cctgtgacct taacctctcc aagcctcagt ttcttcatct 3361 gttggatggg gataataaca cacccagcac tgaaagcaac acaggatgat tcatggccag 3421 gggttagcac agcagctagc accaggcgac acccatgccg gccagctgtt gttattttta 3481 gaggagagga ctattttcat ccaatgggtc ctgggatatg accaattggt ttgtgccgta 3541 gtttaggaaa ggtcagtgaa agtgcagtgt gagcaacgtg tgtgtgtaca tgtgtgtata 3601 tgtatgcatg tgtatacatg tgcacatgca catgtacatg catgtgtgtg catgtatgtg 3661 tgtgtgtgca tgtgcatgca ggttgagacg // LOCUS HUMINT03 4863 bp ds-DNA PRI 09-AUG-1990 DEFINITION Human leukocyte adhesion protein p150,95 alpha subunit gene, exons 7 - 15. ACCESSION M29483 Y00093 KEYWORDS integrin; protein p150,95. SEGMENT 3 of 7 SOURCE Human DNA, (library pWE15), clone 30.1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 294 to 3967; exons only) AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A. TITLE CDNA cloning and complete primary structure of the alpha subunit of a leukocyte adhesion glycoprotein JOURNAL EMBO J. 6, 4023-4028 (1987) STANDARD full staff_entry REFERENCE 2 (bases 1 to 4863) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. TITLE Genomic structure of an integrin alpha subunit, the leukocyte p150,95 molecule JOURNAL J. Biol. Chem. 265, 2782-2788 (1990) STANDARD full staff_entry REFERENCE 3 (bases 1 to 4863; exons and intron/exon boundaries) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. TITLE Genomic structure of an integrin alpha subunit, the leukocyte p150,95 molecule JOURNAL J. Biol. Chem. 265, 12750-12751 (1990) STANDARD full staff_entry REFERENCE 4 (bases 1 to 4863; exons and intron/exon boundaries; revises [3]) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. JOURNAL Unpublished (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Garcia-Aguilar, 19-OCT-1989. FEATURES from to/span description pept + 294 424 integrin alpha subunit precursor, exon 7 511 656 integrin alpha subunit precursor, exon 8 901 1054 integrin alpha subunit precursor, exon 9 1659 1809 integrin alpha subunit precursor, exon 10 2428 2501 integrin alpha subunit precursor, exon 11 2665 2794 integrin alpha subunit precursor, exon 12 3207 3349 integrin alpha subunit precursor, exon 13 3531 3671 integrin alpha subunit precursor, exon 14 3760 + 3969 integrin alpha subunit precursor, exon 15 matp + 294 424 integrin alpha subunit 511 656 integrin alpha subunit 901 1054 integrin alpha subunit 1659 1809 integrin alpha subunit 2428 2501 integrin alpha subunit 2665 2794 integrin alpha subunit 3207 3349 integrin alpha subunit 3531 3671 integrin alpha subunit 3760 + 3969 integrin alpha subunit pre-msg < 1 > 4861 P150,95 mRNA and introns IVS < 1 293 P150,95 intron F IVS 425 510 P150,95 intron G IVS 657 900 P150,95 intron H IVS 1055 1658 P150,95 intron I IVS 1810 2427 P150,95 intron J IVS 2502 2664 P150,95 intron K IVS 2795 3206 P150,95 intron L IVS 3350 3530 P150,95 intron M IVS 3672 3759 P150,95 intron N IVS 3970 > 4863 P150,95 intron O BASE COUNT 947 a 1358 c 1460 g 1097 t 1 others ORIGIN Chromosome 16p11-13.1. 1 acctgtgatc gccccctcgc ctcccaaagt actgggatta cacggtgagc caccacgcct 61 ggctcaatca cagcctcttt aggcaacttt aagagaatga agggccttgt tccaggcaag 121 gggttaggga acgtctgccc ctgatgagga gaggacccag ggtgtggagc ctgactccca 181 tcgccagact aggggcttag ggaggaaggg ttttggagag tgagctcttg caggagccac 241 ggtcctggac tccaggagtg tcacttggag gacggtgcca cctccttccc cagagtgccc 301 aagacaggag caggacattg tgttcctgat cgatggctca ggcagcatct cctcccgcaa 361 ctttgccacg atgatgaact tcgtgagagc tgtgataagc cagttccaga gacccagcac 421 ccaggtgtgc tttgggggag ggaggctgct gggggtgggt gcttggatcc tggtgatagg 481 cctcagccca gccctgtgtg cttctcccag ttttccctga tgcagttctc caacaaattc 541 caaacacact tgactttcga ggaattcagg cgcacgtcaa accccctcag cctgttggct 601 tctgttcacc agctgcaagg gtttacatac acggccaccg ccatccaaaa tgtcgtgtga 661 gtcctgattt cttccaggca cagtcccaaa gcacccaggt cttcccttgg cctcatctga 721 tctccacgag aaggggacag gcagggacca aaatccagcc cgtgataccc ttgccaagct 781 ggggcctctg ggtgggactg gggcctccca aaggaaaagg catcttctaa ttttcacaag 841 ggcaccaggg gctagtgtgg tttggttcac aggcctctaa gacctctcct ttcctgatag 901 gcaccgattg ttccatgcct catatggggc ccgtagggat gccaccaaaa ttctcattgt 961 catcactgat gggaagaaag aaggcgacac gctggattat aaggatgtca tccccatggc 1021 tgatgcagca ggcatcatcc gctatgcaat tggggtaggc ctgggatggc ttcccacttc 1081 tcccacggct tcctctcagg gcaactcccc tttctgtgta tgatgttctt ttctctttga 1141 gacagggtct tgctctatca cccaggaagt ggtgcaatcc tagctcactg cagccttgaa 1201 ctcctgggct ccagtgatcc tcccaccccg cctcccagta gtcgggacca caggtgtgtg 1261 ccatcaagcc tggctatttt ctttttggtt gagatggggt cttgctatgt tgcccaggct 1321 ggtctcaaat tcctggcctt aagcaattct gccaccttgg tctcccaaag gcacagggga 1381 ttacaggcgt gaaccaccgc caacaacatc cctttcaagg atagaaacac cagctctctc 1441 ggctcttact gccttaagga tgaaaactct gccccagact ggagaccatg atgatccttt 1501 ctcctaaact ccctgatgct gtccgggctt cgtgtttctc ctgtgtccac cgggtgtgat 1561 catgttgatc ttgtggggtt attggaagat gttgcaccca gtgcacacag gcacatttga 1621 tttattattt ttactgagtt gatcttttct ggggacaggt tggattagct tttcaaaaca 1681 gaaattcttg gaaagaatta aatgacattg catcgaagcc ctcccaggaa cacatattta 1741 aagtggagga ctttgatgct ctgaaagata ttcaaaccca actgagggag aagatctttc 1801 ccattgaggg tgagtctgaa gggagctctt cgcttgggga atcctcagcc gttaacacct 1861 ttccacttag aacccgaggc tccgtgaaac aggtagacag cgtctcggtt ctcctgcttt 1921 cccgggaccc cgatagccat gtctgtcagc ttgtccccac tgacgtcccc cagcactgtc 1981 agagctgccc caaagtggcc ccagggatgg ccctgctccc cacagagagt gatctcacac 2041 caccaccggc tccactgcag aacaaaagca gtccaggccc aacccaggag acccttccac 2101 ccacaccggg ccctacccag cccacatccc accagccact cactcccctg ggcaaggggc 2161 acacggacac ctggccccct cggtctgctt gtagacctgt ggggggccct gatgaggacc 2221 agatcggtgc tgccatcgct gtccacatcc atggagcaga ggggggcccc gaagtcggag 2281 ctgatctgga ggcagagcct ggtccctgtc acaggcacca gctctccctg tagcctccag 2341 tcttagcttc tcctaaagct gaagtgttct tggacctggc aaagcccgtc tccctccctg 2401 gcactcaagc gtcatgcctt accccaggta cggagaccac aagcagtagc tccttcgaat 2461 tggagatggc acaggagggc ttcagcgctg tgttcacacc tgtgcgtggg gccccttagg 2521 ccgatgatgt gccgtgaggg gagggggggc agggaaggcc agggtgggtg tcaggtgggt 2581 aagaggcgca aggcggaagg catatctctg gtcatgctgt cttcctgctc tcggctctgc 2641 tcagccctgg aatcctttct ccaggatggc cccgttctgg gggctgtggg gagcttcacc 2701 tggtctggag gtgccttcct gtacccccca aatatgagcc ctaccttcat caacatgtct 2761 caggagaatg tggacatgag ggactcttac ctgggtgaga aacagccagg ggttggggac 2821 aggtgggaga tgcactgccc agggtggggt ccagggttct ggggaagggg taggggnatg 2881 ggggctgtgc tgcccagtgt ggggcccagc ttctggggag ggaggatggg cactgtgctg 2941 cccggggtgg gttccagggt tctggggagg gggaatgggg gctgtgctgc ctggggtggg 3001 aatccagggt tctggggaga ggggatgggc gctgtgctgc ctggggtggg ttccagggtt 3061 ctggggagag aggatggggg ctgcattgcc cagggtgggg tccagggttc tggggagggg 3121 agatggtgct gtgctgcccg gggtgggaat ccagggttct ggggaggggg aatgggggcc 3181 tttgtgctga ggcctgggcc cctcaggtta ctccaccgag ctggccctct ggaaaggggt 3241 gcagagcctg gtcctggggg ccccccgcta ccagcacacc gggaaggctg tcatcttcac 3301 ccaggtgtcc aggcaatgga ggatgaaggc cgaagtcacg gggactcagg ttgggcgtga 3361 caggagccac aggccgggaa ttcagggtag gggaggtggc tgggcagaga agaggatgga 3421 ggggctttga gggccttggg ggaggtcctg gtacctgggg agaggtggga cctggcccac 3481 agggctgcct ctggcaggga caggcagcat gacccagctc tgcccttcag atcggctcct 3541 acttcgggcc ctccctctgc tccgtggacg tagacagcga cggcagcacc gacctggtcc 3601 tcatcgggcc cccccattac tacgagcaga cccgaggggc ccaggtgtct gtgtgtccct 3661 tgcccagggg ggtgagtggc tgatgggcct ggtgtgtgtg gggtctggtg tgggtgaggg 3721 gttgcccggg ttgggcctgg cactgttttt tttctgcagt ggagaaggtg gtggtgtgat 3781 gctgttctct acggggagca gggccacccc tggggtcgct ttggggcggc tctgacagtg 3841 ctgggggatg tgaatgggga caagctgaca gacgtggtca tcggggcccc aggagaggag 3901 gagaaccggg gtgctgtcta cctgtttcac ggagtcttgg gacccagcat cagcccctcc 3961 cacagccagg tgaggccgtg tcccatttct gtcactagag cagcctgctt cttgcctctc 4021 ccactctgtc atactggaaa actgtccctt tttacctttt cctacctccc ttgcccagct 4081 ctgagcacct tgtagcagtg gcgtggtctc agctcactgc aacctccgcc tcccaggttc 4141 aagcgattct ctctgcctca gcctccagag tagctgggat tacaggcatg caccaccatg 4201 tccggttatt ttttgtattt tagtagagac acgtttcgcc atgttggcta ggctggtctt 4261 gaactcctga cctcaggtga tctgcctgtc tcggcctccc aaagtgctgg gattataggc 4321 gtgagccgcc atgcccaggc ccctgccagt tttacaaggt acacaggtca ggcacagaaa 4381 acccatttta cagatggaat ctgggacact aggaagacaa gggccttggt ttgttggagg 4441 ttcagagtgg gtccgagatg gtgaaggaac tccggcctcc tgacctctaa cccggtgtgc 4501 agtctcccgg ctccctgctg ctcaccactt aggtccagtc atttcaacct ccctccacct 4561 gcccctctcc tccctggatg ctacatgatt ttattccctt cctgccatca aggtcccacc 4621 aaatgcccat ccctgcagcc tccctccacc ccaagggtag cagggttccc tgagaacgaa 4681 gggctgcctt tcttggcaaa agtcaagaaa gctctgttaa aaaataggca aagggcctgc 4741 tccctggtgg ctcacatctg taattccgac actttgggag gctgaggcag gaggatcact 4801 tgaggccagg agtttcaggc cagctgggca acataggggg accccatctc tagaaaaaat 4861 ttt // LOCUS HUMINT04 2746 bp ds-DNA PRI 09-AUG-1990 DEFINITION Human leukocyte adhesion protein p150,95 alpha subunit gene, exons 16 - 21. ACCESSION M29484 Y00093 KEYWORDS integrin; leukocyte adhesion glycoprotein; protein p150,95. SEGMENT 4 of 7 SOURCE Human DNA, (library pWE15), clone 30.1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 382 to 2672; exons only) AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A. TITLE CDNA cloning and complete primary structure of the alpha subunit of a leukocyte adhesion glycoprotein JOURNAL EMBO J. 6, 4023-4028 (1987) STANDARD full staff_entry REFERENCE 2 (bases 1 to 2746) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. TITLE Genomic structure of an integrin alpha subunit, the leukocyte p150,95 molecule JOURNAL J. Biol. Chem. 265, 2782-2788 (1990) STANDARD full staff_entry REFERENCE 3 (bases 1 to 2746; exons and intron/exon boundaries) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. TITLE Genomic structure of an integrin alpha subunit, the leukocyte p150,95 molecule JOURNAL J. Biol. Chem. 265, 12750-12751 (1990) STANDARD full staff_entry REFERENCE 4 (bases 1 to 2746; exons and intron/exon boundaries; revises [3]) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. JOURNAL Unpublished (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Garcia-Aguilar, 19-OCT-1989. FEATURES from to/span description pept + 382 512 integrin alpha subunit precursor, exon 16 628 791 integrin alpha subunit precursor, exon 17 924 1078 integrin alpha subunit precursor, exon 18 1656 1787 integrin alpha subunit precursor, exon 19 1875 1948 integrin alpha subunit precursor, exon 20 2531 + 2672 integrin alpha subunit matp + 382 512 integrin alpha subunit 628 791 integrin alpha subunit 924 1078 integrin alpha subunit 1656 1787 integrin alpha subunit 1875 1948 integrin alpha subunit 2531 + 2672 integrin alpha subunit pre-msg < 1 > 2746 P150,95 mRNA and introns IVS < 1 381 P150,95 intron O IVS 513 627 P150,95 intron P (no splice consensus) IVS 792 923 P150,95 intron Q IVS 1079 1655 P150,95 intron R IVS 1788 1874 P150,95 intron S IVS 1949 2530 P150,95 intron T IVS 2673 > 2746 P150,95 intron U BASE COUNT 602 a 821 c 746 g 577 t ORIGIN Chromosome 16p11-13.1. 1 gaattcctat cctgagcatg gctaaactct gagctaatag tatcattata gaaagatgag 61 gaaacggagg cacagacaga ttgagtcctt gcccacggcc tcgtggctca tacgtggagg 121 agtcagaatt ggaactagag actgatcgaa tgaatgacac tcgggtcacc aggacacctt 181 cctatctcca ctcttacatc tgtttcttag caatcatctc ccaactccta cctcctcttt 241 tcaggttctt cttggtgaca tctgttacaa ctcacccctt ctctcccttt ccgatggtcc 301 tacctccata ttccccttgt tacttatttc caacttcttc cctagtttcc atcttgattc 361 acccttctct cctctggcca gcggatcgcg ggctcccagc tctcctccag gctgcagtat 421 tttgggcagg cactgagcgg gggtcaagac ctcacccagg atggactggt ggacctggct 481 gtgggggccc ggggccaggt gctcctgctc aggtgagagc agactttctc agaggctccc 541 catgtggtcc taggttcaga tgggggtgcc cacccacgtg gtgctcccac cagcgacggc 601 tgtcctcagc tcggtgctct gcccgcagac cagacctgtg ctctgggtgg gggtgagcat 661 gcagttcata cctgccgaga tccccaggtc tgcgtttgag tgtcgggagc aggtggtctc 721 tgagcagacc ctggtacagt ccaacatctg cctttacatt gacaaacgtt ctaagaacct 781 gcttgggagc cgtgagtccc ctcccctcca acccaggaca ccctgacctc tggagtcccc 841 catcccaggc ccctgtctcc caccctgctc attgtccacc caaggagttc ctgtctcaac 901 gccgtccctg cgaccgccta caggtgacct ccaaagctct gtgaccttgg acctggccct 961 cgaccctggc cgcctgagtc cccgtgccac cttccaggaa acaaagaacc ggagtctgag 1021 ccgagtccga gtcctcgggc tgaaggcaca ctgtgaaaac ttcaacctgc tgctcccggt 1081 gcgtctgggc atgaacgtgg gtggcggccg cgctggggct ggcagaaggc agggcaggga 1141 gagaacaggc tgtgttccgg cctccctgtg gctcagccca gcacaggacc agccatgcag 1201 gacgtgctta ctgcacgtta gccagtgagt gagtgagcga gcaaacaagt gatgagatcg 1261 tctgcaattt ccagggccac acgattggat ttcaggaaag agaattgggc aacctgagag 1321 agctctgggc ttaccttctg gcttttcagg cattcactga cagggttatc gagctgctcc 1381 tggagacagc cttgcctggg ccatgggcat aggtggccaa aacagtcatt gctgatcggg 1441 aggtctgggg gggggaggaa aaaaacaaag acaaacaagg ggagaggaca gagagggtgt 1501 cagggaggca tcctgaaggc ggtgacgctg agcaggctct ggaggaagtg aagcagagcg 1561 ggagctgggc agaggcagga taagaactgc ggatgaggcc gagcgcagct cttaccctcc 1621 ccttaccctc gctccccgcg acgcccgtcc cccagagctg cgtggaggac tctgtgaccc 1681 ccattacctt gcgtctgaac ttcacgctgg tgggcaagcc cctccttgcc ttcagaaacc 1741 tgcggcctat gctggccgcc gatgctcaga gatacttcac ggcctccgtg agtcctggca 1801 ctgggtctcc cagagagggt gcacagcgtg gggcctgggt ctcggagaaa accccccgtt 1861 gccttcccac gcagctaccc tttgagaaga actgtggagc cgaccatatc tgccaggaca 1921 atctcggcat ctccttcagc ttcccagggt gagcgcccca ccttagatgc cctactgccc 1981 cagcctcctt cctggaatct gggactcctg cctctgctct ccctaacatt gtctcatcct 2041 atagtcaaaa cccaggtgtc ttggctgggc acagtggctc actcctgtaa tccagcactt 2101 tgggaggccg aggtgggagg acttttgagg ccaggagtta gggttacgac ctgggcaaca 2161 gagcgacacc catttccaca aaaacaaaac aacaacaaca acaacaacaa caacaacaac 2221 aacaacatca cttgagtgtg gtagagcatg cctatagtcc cagctacttg ggaggctgaa 2281 gcttaaggct tgcttgagct ctggagttgg aggtctgcag tgagccataa tcacaccact 2341 gcactccagc ctgggtgaaa gagcaggact ctgtctctta aaaaaaaaga agaagaagaa 2401 gaagaagaag aagaacccag gggtccgtcc cctgtctatc tcccaaatcc ccacccaccc 2461 cattttatcc cagaccattt ctagcctcag tcacagaatc atcttatcct ttccttcacc 2521 tgatacccag cttgaagtcc ctgctggtgg ggagtaacct ggagctgaac gcagaagtga 2581 tggtgtggaa tgacggggaa gactcctacg gaaccaccat caccttctcc caccccgcag 2641 gactgtccta ccgctacgtg gcagagggcc aggtgcacct ctggggaagg aggaggaggc 2701 agggctgggc gttagcgtag attcccgtgc gggttcagaa cccggg // LOCUS HUMINT05 1006 bp ds-DNA PRI 09-AUG-1990 DEFINITION Human leukocyte adhesion protein p150,95 alpha subunit gene, exons 22 - 24. ACCESSION M29485 Y00093 KEYWORDS integrin; leukocyte adhesion glycoprotein; protein p150,95. SEGMENT 5 of 7 SOURCE Human DNA, (library pWE15), clone 30.1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 202 to 665) AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A. TITLE CDNA cloning and complete primary structure of the alpha subunit of a leukocyte adhesion glycoprotein JOURNAL EMBO J. 6, 4023-4028 (1987) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1006) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. TITLE Genomic structure of an integrin alpha subunit, the leukocyte p150,95 molecule JOURNAL J. Biol. Chem. 265, 2782-2788 (1990) STANDARD full staff_entry REFERENCE 3 (bases 1 to 1006; exons and intron/exon boundaries) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. TITLE Genomic structure of an integrin alpha subunit, the leukocyte p150,95 molecule JOURNAL J. Biol. Chem. 265, 12750-12751 (1990) STANDARD full staff_entry REFERENCE 4 (bases 1 to 1006; exons and intron/exon boundaries; revises [3]) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. JOURNAL Unpublished (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Garcia-Aguilar, 19-OCT-1989. FEATURES from to/span description pept + 203 319 integrin alpha subunit precursor, exon 22 404 483 integrin alpha subunit precursor, exon 23 582 + 665 integrin alpha subunit precursor, exon 24 matp + 203 319 integrin alpha subunit 404 483 integrin alpha subunit 582 + 665 integrin alpha subunit pre-msg < 1 > 1006 P150,95 mRNA and introns IVS < 1 202 P150,95 intron U IVS 320 403 P150,95 intron V IVS 484 581 P150,95 intron W IVS 666 > 1006 P150,95 intron X BASE COUNT 228 a 286 c 229 g 263 t ORIGIN Chromosome 16p11-13.1. 1 ttctatcctg gtgacagagt gagacctggt ctcaaaacaa acaaacaaac aaaatataag 61 cttaaggtgg gctccaggaa gctttatcac tacttcgtgg cgtgtctttg gaatgctgtt 121 atattaggtt ggtgcaaaag taattgggtt tttgccattg ctttcaattt caactaatac 181 tcctctactt tctcatgcct agaaacaagg gcagctgcgt tccctgcacc tgacatgtga 241 cagcgcccca gttgggagcc agggcacctg gagcaccagc tgcagaatca accacctcat 301 cttccgtggc ggcgcccagg tcagcctggc ttctgtcccc tcactgctcc cctgccccac 361 cctgtcttta ctgctctgtg acctctcagt tccttttcct cagatcacct tcttggctac 421 ctttgacgtc tcccccaagg ctgtcctggg agaccggctg cttctgacag ccaatgtgag 481 caggtgagcc gggccatggc caggggcagt gcctcatctc cagcctcaca ccccattctc 541 ctctggggcc tctggcaact gagtctctcc tctttctcca gtgagaacaa cactcccagg 601 accagcaaga ccaccttcca gctggagctc ccggtgaagt atgctgtcta cactgtggtt 661 agcaggtcac aggtacccac tgcaggaaaa agggttcttc tctctgaccc tcaaaaagaa 721 aaaaaaaaaa aaggccttga aacgctgcca cagagggtga gataaggtgt ttgaaagtaa 781 aaggtcaggt gtttcagaag acaccttcct tcagccaatg ccttcctcga atttgctgtg 841 tgccaggcag ggtgctgtgg ttattttcca tacattcatt tgacattcat tgaagattta 901 ctgagccccc attatgtgtg atcaaaccag acatgaaccc tcgccttgtg ggtgtgcctt 961 gctggatgtc tcctgtgttc cactctcact gcactgcatg ctgagt // LOCUS HUMINT06 1904 bp ds-DNA PRI 09-AUG-1990 DEFINITION Human leukocyte adhesion protein p150,95 alpha subunit gene, exons 25 - 30. ACCESSION M29486 Y00093 KEYWORDS integrin; leukocyte adhesion glycoprotein; protein p150,95. SEGMENT 6 of 7 SOURCE Human DNA, (library pWE15), clone 30.1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 86 to 1528; exons only) AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A. TITLE CDNA cloning and complete primary structure of the alpha subunit of a leukocyte adhesion glycoprotein JOURNAL EMBO J. 6, 4023-4028 (1987) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1904) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. TITLE Genomic structure of an integrin alpha subunit, the leukocyte p150,95 molecule JOURNAL J. Biol. Chem. 265, 2782-2788 (1990) STANDARD full staff_entry REFERENCE 3 (bases 1 to 1904; exons and intron/exon boundaries) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. TITLE Genomic structure of an integrin alpha subunit, the leukocyte p150,95 molecule JOURNAL J. Biol. Chem. 265, 12750-12751 (1990) STANDARD full staff_entry REFERENCE 4 (bases 1 to 1904; exons and intron/exon boundaries; revises [3]) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. JOURNAL Unpublished (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Garcia-Aguilar, 19-OCT-1989. FEATURES from to/span description pept + 86 164 integrin alpha subunit precursor, exon 25 280 387 integrin alpha subunit precursor, exon 26 505 588 integrin alpha subunit precursor, exon 27 788 901 integrin alpha subunit precursor, exon 28 1044 1145 integrin alpha subunit precursor, exon 29 1417 + 1527 integrin alpha subunit precursor, exon 30 matp + 86 164 integrin alpha subunit 280 387 integrin alpha subunit 505 588 integrin alpha subunit 788 901 integrin alpha subunit 1044 1145 integrin alpha subunit 1417 + 1527 integrin alpha subunit pre-msg < 1 > 1904 P150,95 mRNA and introns IVS < 1 85 P150,95 intron X IVS 165 279 P150,95 intron Y IVS 388 504 P150,95 intron Z IVS 589 787 P150,95 intron AA IVS 902 1043 P150,95 intron AB IVS 1146 1416 P150,95 intron AC IVS 1528 > 1903 P150,95 intron AD BASE COUNT 381 a 545 c 525 g 453 t ORIGIN Chromosome 16p11-13.1. 1 accacctgtc ctctcatgct ctagccaatg ccttctgcag atgcccatgg tagttcacat 61 ccacttatgc gtcttctctc tccagccacg aacaattcac caaatacctc aacttctcag 121 agtctgagga gaaggaaagc catgtggcca tgcacagata ccaggtcagg tggtggtgta 181 cgcaggaaga ccttgggcat ggggtgggag gctgggtagc cggagactgg ggagggattt 241 ggctttggcg tggctctgcc ctcagtgccc tctgtgcagg tcaataacct gggacagagg 301 gacctgcctg tcagcatcaa cttctgggtg cctgtggagc tgaaccagga ggctgtgtgg 361 atggatgtgg aggtctccct cccccaggta cccaaggact gcatgtggct cctccacgaa 421 tgccctttct acctggattc cttgtgcccc atgtgggtcc ctgatgtccc agctgagaca 481 cttgttctct gcattttccc ccagaaccca tcccttcggt gctcctcaga gaaaatcgcg 541 ggcccagcat ctgacttcct ggcgcacatt cagaagaatc ccgtgctggt gaggagggct 601 ctgggtctgg ccctcactgt aggcccacat cagaggaatt taacccagga gttcatgttc 661 catatccatc ctgctgaagt accctcttgc attcggatat ggccgctgcc ctcaagtcac 721 acgcataatg ctgcctccca ccttcacact catctttctc agccccatgc tatttatctg 781 cccccaggac tgctccattg ctggctgcct gcggttccgc tgtgacgtcc cctccttcag 841 cgtccaggag gagctggatt tcaccctgaa gggcaacctc agctttggct gggtccgcca 901 ggtgtgtggg tgcaacgaca gagcccctgc cccagactca ggcgggacct ggcatgtctg 961 tgcccatctg caagccaggg cacccccaga gctctgagcc tcccccagag ccagttcaac 1021 aggtttcccc cacccctttg cagatattgc agaagaaggt gtcggtcgtg agtgtggctg 1081 aaattacgtt cgacacatcc gtgtactccc agcttccagg acaggaggca tttatgagag 1141 ctcaggtaga gaccatgtgg agggcagcga ccaggcagga aagagggtcc caagggctac 1201 atctgtggtg ctgggtgggg ggtttgcaag ccttggggga ggagggtgaa ggcctctggg 1261 caggatagct gtccctaagg gcacgggtgc tgctgtgtct cacctcttgg agcagggcct 1321 ggggaaggag gggagggagt taaaggttgg ggagcctggg aggagtctgg gatagtagga 1381 ggatgggagt ctctgacagg gtcacttcca cttcagacga caacggtgct ggagaagtac 1441 aaggtccaca accccacccc cctgatcgta ggcagctcca ttgggggtct gttgctgctg 1501 gcactcatca cagcggtact gtacaaagtg agtgttttat gccacccttg acaccaccag 1561 catctggtcc cgctcttttt gcagagtgag aaggagctca ctttgaaggc agaggcacat 1621 tcttactggg tcacttcata tgagaaactg cttcccacct gcaatgtcac cgtgccccag 1681 tggccccctg ctttgtgatt cccaggcttc ctctaatatt tctccctttc tttcctgctc 1741 ttctccatca ttctacgtgt tcctgacagc agattatcat ataaaagcac agacctgggt 1801 tgaatgcgac atcaccacgg gttcttttgt cttgaccata ggccagtgtc tgctccactc 1861 tgggccttga tttccatgtg aggtgatatc acccagctca taga // LOCUS HUMINT07 653 bp ds-DNA PRI 09-AUG-1990 DEFINITION Human leukocyte adhesion protein p150,95 alpha subunit gene, exon 31. ACCESSION M29487 Y00093 KEYWORDS integrin; leukocyte adhesion glycoprotein; protein p150,95. SEGMENT 7 of 7 SOURCE Human DNA, (library pWE15), clone 30.1, and cell line HL-60, cDNA to mRNA, clone lambda-X47. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 44 to 149) AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A. TITLE CDNA cloning and complete primary structure of the alpha subunit of a leukocyte adhesion glycoprotein JOURNAL EMBO J. 6, 4023-4028 (1987) STANDARD simple automatic REFERENCE 2 (bases 1 to 653; revises [1]) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. TITLE Genomic structure of an integrin alpha subunit, the leukocyte p150,95 molecule JOURNAL J. Biol. Chem. 265, 2782-2788 (1990) STANDARD full staff_entry REFERENCE 3 (bases 1 to 653; exons and intron/exon boundaries) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. TITLE Genomic structure of an integrin alpha subunit, the leukocyte p150,95 molecule JOURNAL J. Biol. Chem. 265, 12750-12751 (1990) STANDARD full staff_entry REFERENCE 4 (bases 1 to 653; exons and intron/exon boundaries; revises [3]) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. JOURNAL Unpublished (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by J.Garcia-Aguilar, 19-OCT-1989. FEATURES from to/span description pept + 44 148 integrin alpha subunit precursor, exon 31 matp + 44 145 integrin alpha subunit pre-msg < 1 > 149 P150,95 mRNA and introns IVS < 1 85 P150,95 intron AC BASE COUNT 126 a 195 c 150 g 182 t ORIGIN Chromosome 16p11-13.1. 1 actgaatggg cttcctgagt ttcttcttcg tcctcccccc taggttggct tcttcaagcg 61 tcagtacaag gaaatgatgg aggaggcaaa tggacaaatt gccccagaaa acgggacaca 121 gacccccagc ccgcccagtg agaaatgatc cctctttgcc ttggacttct tctcccgcga 181 ttttccccac ttacttaccc tcacctgtca ggctgacggg gaggaaccac tgcaccaccg 241 agagaggctg ggatgggcct gcttcctgtc tttgggagaa aacgtcttgc ttgggaaggg 301 gcctttgtct tgtcaaggtt ccaactggaa acccttagga cagggtccct gctgtgttcc 361 ccaaaaggac ttgacttgca atttctacct agaaatacat ggacaatacc cccaggcctc 421 agtctccctt ctcccatgag gcacgaatga tctttctttc ctttcctttt tttttttttt 481 cttttctttt tttttttttt tgagacggag tctcgctctg tcacccaggc tggagtgcaa 541 tggcgtgatc tcggctcgct gcaacctccg cctcccgggt tcaagtaatt ctgctgtctc 601 agcctcctgc gtagctggga ctacaggcac acgccacctc gcccggcccg atc // LOCUS PEAHSP177A 772 bp ss-mRNA PLN 09-AUG-1990 DEFINITION Pisum sativum 17.7 kDa heat shock protein (hsp17.7) mRNA, complete cds. ACCESSION M33901 KEYWORDS heat shock protein. SOURCE P.sativum (cv Little Marvel) leaf, cDNA to mRNA. ORGANISM Pisum sativum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceaea. REFERENCE 1 (bases 1 to 772) AUTHORS Lauzon,L.M., Helm,K. and Vierling,E. TITLE A cDNA clone from Pisum sativum encoding a low molecular weight heat shock protein JOURNAL Nucleic Acids Res. 18, 4274-4274 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.Vierling, 01-MAY-1990. University of Arizona Department of Biochemistry Biological Sciences West Building Tucson, AZ 85721 FEATURES from to/span description pept 65 523 17.7 kDa heat shock protein (hsp17.7) BASE COUNT 254 a 127 c 170 g 221 t ORIGIN 1 caaaaatcaa aacgtgcgac aaacacaaaa tcatcccaca aagaaagcaa tggatttcag 61 gctaatggat ttggattctc cactcttcaa cactctccat catataatgg acctcaccga 121 cgacacaacc gagaagaact taaacgctcc aactcgaaca tatgtccgtg acgcaaaggc 181 aatggctgca actccagcgg acgtgaaaga gcatccaaat tcatacgtgt ttatggtgga 241 catgcctggg gtgaaatctg gtgacataaa ggttcaggtg gaagatgaga atgtgctatt 301 gataagtggc gagaggaaga gagaagaaga gaaagaaggt gttaaatatt tgaagatgga 361 aagaaggatt ggtaagttga tgaggaaatt tgtgttacct gagaatgcga atattgaagc 421 tatctctgct atttctcaag atggtgttct tacggttaca gttaataaat tgcctccacc 481 tgaacctaag aaaccaaaaa ctattcaagt taaggttgct tgatcggtgt acgatttcat 541 gtcaacaaat cagaaggaat gtttgtcttt ttagttggtt tgtgtagcaa tggttttgtg 601 tgttttcgcc tagttggccc tatatatgat gatcatcatg cgatgtaatt tgtaacaata 661 tgacatgaat gaattttaat tacttggttt ttctgcttgt aacattgttg cgttgccccc 721 atgataaaat tgagaaactg aagtattaaa gaaaagaaaa tgtttcattt ac // LOCUS PEAHSP179A 700 bp ss-mRNA PLN 09-AUG-1990 DEFINITION Pisum sativum 17.9 kDa heat shock protein (hsp17.9) mRNA, complete cds. ACCESSION M33900 KEYWORDS heat shock protein. SOURCE P.sativum (cv Little Marvel) leaf, cDNA to mRNA. ORGANISM Pisum sativum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceaea. REFERENCE 1 (bases 1 to 700) AUTHORS Lauzon,L.M., Helm,K. and Vierling,E. TITLE A cDNA clone from Pisum sativum encoding a low molecular weight heat shock protein JOURNAL Nucleic Acids Res. 18, 4274-4274 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.Vierling, 01-MAY-1990. University of Arizona Department of Biochemistry Biological Sciences West Building Tucson, AZ 85721 FEATURES from to/span description pept < 1 469 17.9 kDa heat shock protein (hsp17.9) BASE COUNT 209 a 123 c 170 g 198 t ORIGIN 1 gataattcca agagtcttcg gtactggacg aagaaccaat gcattcgatc cattctcatt 61 agatttatgg gacccattcc agaacttcca actcgcaaga tccgccaccg gaaccaccaa 121 cgagacggca gcttttgcca acgctcacat tgactggaag gaaacaccgg aggctcacgt 181 gttcaaggct gatcttcccg gagtgaagaa ggaagaagtg aaagttgaaa tagaagaaga 241 tcgtgtgctc aagataagcg gagagaggaa aactgaaaag gaagacaaga acgacacctg 301 gcaccgtgtt gagcgtagtc aggggagttt cctccgccgt ttcaggttgc cggaaaatgc 361 taaagttgat caggtgaagg ctgctatgga aaacggtgtt cttaccgtta ctgttcctaa 421 agaggaggtt aagaagcctg aagctaagcc cattcagatt acaggatgag ctcttattct 481 tcctatattt tgatgtttgt gtctcttaat aaaatgttaa aataaaacaa ataataattg 541 tgtgtagtcg agttccagct ttaagagatt gagacatgta tggacttggc tattacttaa 601 gtgtagtagt ttgtgagtat tttgttgggt tatgttagtg tgtatgcaaa taactttttt 661 gagtatgtga aagtttcttt tgattaagct gtatttatcc // LOCUS PEAHSP181A 862 bp ss-mRNA PLN 09-AUG-1990 DEFINITION Pisum sativum 18.1 kDa heat shock protein (hsp18.1) mRNA, complete cds. ACCESSION M33899 KEYWORDS heat shock protein. SOURCE P.sativum (cv Little Marvel) leaf, cDNA to mRNA. ORGANISM Pisum sativum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceaea. REFERENCE 1 (bases 1 to 862) AUTHORS Lauzon,L.M., Helm,K. and Vierling,E. TITLE A cDNA clone from Pisum sativum encoding a low molecular weight heat shock protein JOURNAL Nucleic Acids Res. 18, 4274-4274 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.Vierling, 01-MAY-1990. University of Arizona Department of Biochemistry Biological Sciences West Building Tucson, AZ 85721 FEATURES from to/span description pept 12 488 18.1 kDa heat shock protein (hsp18.1) BASE COUNT 278 a 128 c 207 g 249 t ORIGIN 1 ctatatcaaa catgtctctg attccaagtt tctttagtgg ccgaaggagc aatgttttcg 61 atcctttctc cctggacgtc tgggatcctt tgaaggactt tccattttca aattcttcac 121 cttccgcttc attccctcgt gagaatcctg cttttgtgag cacacgagtt gactggaagg 181 aaacaccgga agcgcatgtt ttcaaggctg atcttcctgg gctgaaaaag gaggaagtga 241 aagttgaagt tgaagatgat agggttctac agataagcgg agagagaagc gttgagaaag 301 aagataagaa tgatgaatgg catcgcgtgg aacgtagcag tggaaagttc ttaagaaggt 361 tcagattgcc tgagaatgct aaaatggata aagtgaaagc ttccatggag aacggcgttc 421 tgacagtgac cgttccaaaa gaagagataa agaaggctga ggttaagtct attgagattt 481 ctggttaaac ttagaatgag ctatgttact ctgttgcttt tcttggttat aatgttttcc 541 tttttgtggc gtgtgcaaga aataaatggt catgtaattc tgaaatgtta atgtataaat 601 aaataagtaa acagttgttg ttggttattc agaggtgtta tagtattcat attgtaatgt 661 atcagaatga atcttgagaa aagagctgct ataaatagag cttgaagttt taaataaaaa 721 aaaaggttcc agaaaggaat aaaaaactgg taacagctag cagagagaaa aagctcaaac 781 cactgtgtta aggtgaacag cggaagaaaa tgaagagatg ttcatagccc ttcttcttga 841 gtctctccaa gatggagaat tc // LOCUS PEAHSP227A 795 bp ss-mRNA PLN 09-AUG-1990 DEFINITION Pisum sativum 22.7 kDa heat shock protein (hsp22.7) mRNA, complete cds. ACCESSION M33898 KEYWORDS heat shock protein. SOURCE P.sativum (cv Little Marvel) leaf, cDNA to mRNA. ORGANISM Pisum sativum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceaea. REFERENCE 1 (bases 1 to 795) AUTHORS Lauzon,L.M., Helm,K. and Vierling,E. TITLE A cDNA clone from Pisum sativum encoding a low molecular weight heat shock protein JOURNAL Nucleic Acids Res. 18, 4274-4274 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.Vierling, 01-MAY-1990. University of Arizona Department of Biochemistry Biological Sciences West Building Tucson, AZ 85721 FEATURES from to/span description pept 44 637 22.7 kDa heat shock protein (hsp22.7) BASE COUNT 261 a 122 c 170 g 242 t ORIGIN 1 ccaagttcca aacctcaaga acaaaaaaca cacatttcta agtatgagtc tgaaacctct 61 aaacatgtta ctcgttccat ttcttctgct tattctcgcg gctgattttc ctttgaaagc 121 aaaagcatca ctactaccat tcatagattc tcccaacact ctcttatcgg atctctggtc 181 tgatcgtttc ccagatccgt ttcgcgtctt agaacaaatt ccctatggag ttgagaaaca 241 cgaaccatcc ataacattgt cacatgctag agtagactgg aaggaaactc cagagggaca 301 tgtgataatg gtggacgtgc ctgggttgaa aaaagatgat ataaagatag aagtggaaga 361 gaatagggtg ctaagagtga gtggtgagag gaagaaagaa gaagataaaa aaggagatca 421 ttggcacaga gttgaaagat cttatggaaa gttctggagg cagtttaaat tacctcaaaa 481 tgttgatttg gattctgtca aagctaaaat ggaaaacggt gttcttactt taactcttca 541 taagttgtcg catgataaga ttaaaggtcc tagaatggtt agtattgtgg aagaggatga 601 caaaccatct aagatcgtca atgatgagtt gaaataatta tgtgatttgt actcataaaa 661 atgaaaaatg ttttttcatt gtgttatttg tgaataaagg aatgttacct atgatattgg 721 ttgtttgttg tatgtcaact aaagagtgct gtaaaggctt gttaatttca tagtgaataa 781 cttgttggct tttgt // LOCUS ECOHGRF 140 bp ds-DNA SYN 09-AUG-1990 DEFINITION Synthetic human growth hormone releasing factor (hGRF) gene, complete cds. ACCESSION M26106 KEYWORDS growth hormone releasing factor; somatocrinin. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 140) AUTHORS Cravador,A., Jacobs,P., Van Elsen,A., Lacroix,C., Colau,B., Van Alphen,P., Herzog,A. and Bollen,A. TITLE Total DNA synthesis and cloning in Escherichia coli of a gene coding for the human growth hormone releasing factor JOURNAL Biochimie 67, 829-834 (1985) STANDARD simple staff_review FEATURES from to/span description pept 2 139 synthetic human growth hormone releasing factor (hGRF) BASE COUNT 30 a 35 c 36 g 39 t ORIGIN 1 catgtacgct gacgctatct tcactaactc ttaccgtaaa gttctgggtc agctgtctgc 61 tcgtaaactg ctgcaggaca tcatgtctcg tgagcagggt gaatctaacc aggaacgtgg 121 tgctcgtgct cgtctgtaag // LOCUS HUMACALX 724 bp ss-mRNA PRI 09-AUG-1990 DEFINITION Human calcitonin mRNA, complete cds. ACCESSION M26095 KEYWORDS calcitonin. SOURCE Human cell-line BEN, cDNA to mRNA, clone hBEN-JR2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 724) AUTHORS Craig,R.K., Riley,J.H., Edbrooke,M.R., Broad,P.M., Foord,S.M., Al-Kazwini,S.J., Holman,J.J. and Marshall,I. TITLE Expression and function of the human calcitonin/alpha-CGRP gene in health and disease JOURNAL Biochem. Soc. Symp. 52, 91-105 (1986) STANDARD simple staff_review FEATURES from to/span description pept 35 460 calcitonin precursor sigp 35 109 calcitonin signal peptide matp 287 382 calcitonin matp 383 457 flanking peptide BASE COUNT 163 a 195 c 200 g 166 t ORIGIN 1 ggtgagcccc gagattctgg ctcagagagg tgtcatgggc ttccaaaagt tctccccctt 61 cctggctctc agcatcttgg tcctgttgca ggcaggcagc ctccatgcag caccattcag 121 gtctgccctg gagagcagcc cagcagaccc ggccacgctc agtgaggacg aagcgcgcct 181 cctgctggct gcactggtgc aggactatgt gcagatgaag gccagtgagc tggagcagga 241 gcaagagaga gagggctcca gcctggacag ccccagatct aagcggtgcg gtaatctgag 301 tacttgcatg ctgggcacat acacgcagga cttcaacaag tttcacacgt tcccccaaac 361 tgcaattggg gttggagcac ctggaaagaa aagggatatg tccagcgact tggagagaga 421 ccatcgccct catgttagca tgccccagaa tgccaactaa actcctccct ttccttccta 481 atttcccttc ttgcatcctt cctataactt gatgcatgtg gtttggttcc tctctggtgg 541 ctctttgggc tggtattggt ggctttcctt gtggcagagg atgtctcaaa cttcagatgg 601 gaggaaagag agcaggactc acaggttgga agagaatcac ctgggaaaat accagaaaat 661 gagggccgct ttgagtcccc cagagatgtc atcagagctc ctctgtcctg ctttctgaat 721 gtgc // LOCUS HUMCALARP 234 bp ds-DNA PRI 09-AUG-1990 DEFINITION Human calcitonin gene, exon 5. ACCESSION M26094 KEYWORDS calcitonin. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 234) AUTHORS Craig,R.K., Riley,J.H., Edbrooke,M.R., Broad,P.M., Foord,S.M., Al-Kazwini,S.J., Holman,J.J. and Marshall,I. TITLE Expression and function of the human calcitonin/alpha-CGRP gene in health and disease JOURNAL Biochem. Soc. Symp. 52, 91-105 (1986) STANDARD simple staff_review FEATURES from to/span description pept / 36 195 calcitonin precursor, exon 5 (AA at 37) matp 54 165 alpha-calcitonin related peptide matp 166 192 carboxyl-terminal-flanking peptide (PDN-21) IVS < 1 35 alpha-calcitonin related peptide intron C BASE COUNT 59 a 63 c 61 g 51 t ORIGIN 1 cagatcttct cttctttctc catcctgcaa atcagaatca ttgcccagaa gagagcctgt 61 gacactgcca cctgtgtgac tcatcggctg gcaggcttgc tgagcagatc agggggtgtg 121 gtgaagaaca actttgtgcc caccaatgtg ggttccaaag cctttggcag gcgccgcagg 181 gaccttcaag cctgagcagc tgaacgactc aagaaggtca caataaagct gaac // LOCUS PIPVGB 1883 bp ds-DNA BCT 09-AUG-1990 DEFINITION Plasmid pIP630 (from S. aureus) virginiamycin B hydrolase (vgb) gene, complete cds. ACCESSION M36022 KEYWORDS virginiamycin B hydrolase; virginiamycin-resistance. SOURCE Plasmid pIP630 (from Staphylococcus aureus) DNA. ORGANISM Plasmid pIP630 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 1883) AUTHORS Allignet,J., Loncle,V., Mazodier,P. and El Solh,N. TITLE Nucleotide sequence of a Staphylococcal plasmid gene, vgb, encoding a hydrolase inactivating the B components of virginiamycin-like antibiotics JOURNAL Plasmid 20, 271-275 (1988) STANDARD simple staff_review FEATURES from to/span description pept 641 1540 virginiamycin B hydrolase BASE COUNT 641 a 284 c 375 g 583 t ORIGIN 1 agatctacgg attttcgcca tgccacgaaa ttagcatcat gctagcaagt taaacgaaca 61 ctgacatgat atattagtgg ttagctatat ttttttactt tgcaacagaa ccattattat 121 ggtttcttaa aaaaatacaa tgctttttcg ttccttttta ttcatcttcc aattctttgg 181 catgactgtg tgcattttaa atttgttcag caaatgtgcc gtgtaatgga atacttttta 241 aatactgtgt aatgataatg caaggcacat actaaaagga atcttcgatt ttgttggctt 301 attatttgac ttttcataac aattatctta aggttaaaca aatcaataat cgaaagggtg 361 aaaaaaagca catgatcata taatcctaat tttaaaagaa atcgatattt tggccttggg 421 ttcaatttca aagtggtttt ggaatgaact ctatttgtta tcggcttttt tctgagatag 481 gattaatgta atgtgctttt ttggctttaa aaagaccttt gttatccaaa aagtcttttt 541 aagtgtcctt atccgtgcca cattgcctcc tatctcgaaa aaagagatgg aggctatttt 601 tgttttggaa atttaattta aataaaacgg aggggataga atggaattta aattacaaga 661 attaaatctt actaaccaag atacaggacc atatggtata accgtttcag ataaggggaa 721 agtttggatt acacaacata aagcaaatat gataagttgc atcaatttag atggaaaaat 781 tacagagtac ccactaccga caccagatgc aaaagtcatg tgtttaacta tatcctcaga 841 tggggaagtt tggtttactg agaatgcagc aaacaaaata gggaggatta caaaaaaagg 901 gattattaag gaatatacat tgcctaaccc agattcagca ccctacggta ttacagaagg 961 accaaatgga gatatatggt ttacagaaat gaatggcaac cgtattggac gtattacgga 1021 cgacggtaaa attcgtgaat acgagctgcc taataaagga tcttaccctt cttttatcac 1081 tttgggttct gataatgccc tgtggttcac agaaaatcaa aataatgcta ttggtagaat 1141 tacagaaagt ggggatatta cagagtttaa aattcctaca cctgcatcag gaccagttgg 1201 tattacaaag gggaacgacg atgctttatg gtttgtggaa attatcggta ataagatagg 1261 gcgaataact cctctggggg aaattaccga attcaaaatt ccaacgccaa acgctcgacc 1321 tcatgcaatt actgctggag caggaattga tttatggttt actgaatggg gggctaataa 1381 aataggaagg ctgacaagca ataatataat tgaggaatac ccaattcaaa tcaaaagtgg 1441 tgaaccacat ggcatttgtt tcgatggtga aacaatttgg tttgcaatgg agtgtgacaa 1501 gataggcaaa ttaactctca ttaaggataa tatggagtga gtcttttgaa tttaaacaat 1561 gaccatggac ctgatcccga aaatatttta ccgataaaag ggaatcggaa tcttcaattt 1621 ataaaaccta ctataacgaa cgaaaacatt ttggtggggg aatattctta ttatgatagt 1681 aagcgaggag aatcctttga agatcaagtc ttatatcatt atgaagtgat tggagataag 1741 ttgattatag gaagattttg ttcaattggt cccggaacaa catttattat gaatggtgca 1801 aaccatcgga tggatggatc aacatatcct tttcatctat tcaggatggg ttgggagaag 1861 tatatgcctt ccttaaaaga tct // LOCUS ECOLIVHMGF 8703 bp ds-DNA BCT 09-AUG-1990 DEFINITION E.coli leucine-specific transport (LS-BP; LIV-BP) system (livHMGF) genes, complete cds. ACCESSION J05516 M13166 M10426 M10427 K02178 KEYWORDS heat shock protein; high affinity branched-chain amino acid transport system; htpR gene; isoleucine binding protein; leucine binding protein; leucine binding protein; livJ gene; livK gene; valine binding protein. SOURCE E.coli (K12 strain AE404) isolate W3110 DNA, clone pOX[1,15]. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 90 to 1312) AUTHORS Landick,R., Vaughn,V., Lau,E.T., VanBogelen,R.A., Erickson,J.W. and Neidhardt,F.C. TITLE Nucleotide sequence of the heat shock regulatory gene of E. coli suggests its protein product may be a transcription factor JOURNAL Cell 38, 175-182 (1984) STANDARD full staff_review REFERENCE 2 (bases 1 to 1312; revises [1]) AUTHORS Vaughn,V. JOURNAL Unpublished (1985) Univ Michigan Med School, Ann Arbor MI 48109 STANDARD full staff_review REFERENCE 3 (bases 1407 to 2507 and 3503 to 4609) AUTHORS Landick,R. and Oxender,D.L. TITLE The complete nucleotide sequences of the Escherichia coli LIV-BP and LS-BP genes: Implications for the mechanism of high-affinity branched-chain amino acid transport JOURNAL J. Biol. Chem. 260, 8257-8261 (1985) STANDARD full staff_review REFERENCE 4 (bases 4610 to 5696) AUTHORS Nazos,P.M., Antonucci,T.K., Landick,R. and Oxender,D.L. TITLE Cloning and characterization of livH, the structural gene encoding a component of the leucine transport system in Escherichia coli JOURNAL J. Bacteriol. 166, 565-573 (1986) STANDARD simple staff_review REFERENCE 5 (bases 1136 to 8703) AUTHORS Adams,M.D., Wagner,L.M., Graddis,T.J., Landick,R., Antonucci,T.K., Gibson,A.L. and Oxender,D.L. TITLE Nucleotide sequence and genetic characterization reveal six essential genes for the LIV-I and LS transport systems of Escherichia coli JOURNAL J. Biol. Chem. 265, 11436-11443 (1990) STANDARD full staff_review COMMENT Draft entry and sequence in computer readable form for [2] kindly provided by V.Vaughn, 15-NOV-1985. Draft entry and computer-readable sequence for [1] kindly submitted by M.D.Adams 19-APR-1990, for release after publication. The htpR (also known as "hin") gene product appears homologous to the sigma factor of RNA polymerase, and the two proteins are predicted to have similar secondary structures. In addition, two regions of the predicted htpR product resemble protein-DNA contact points conserved in known DNA-binding proteins. The htpR gene encodes a protein, which appears to be identical in size (33 kd by migration on two dimensional polyacrylamide gel) and isoelectric point with the protein, F33.4, normally present in E.coli but deficient in an htpR mutant. A region homologous to the rpoD gene is located at positions 508-549 [2]. The E.coli LIV-I and LS AA transport systems are high-affinity, periplasmic, binding protein-dependent systems that utilize the leucine-, isoleucine-, valine-binding protein (LIV-BP) and leucine-specific binding protein (LS-BP), respectively. These two binding proteins interact with a common set of membrane proteins to transport branched-chain AAs into the cytoplasm. The two BP genes are encoded in a regulon that also contains the genes for the common membrane protein components. FEATURES from to/span description pept 1407 2510 LIV-BP precursor (livJ) sigp 1407 1475 LIV-BP signal peptide matp 1476 2507 LIV-BP mature protein pept 3503 4612 LS-BP precursor (livK) sigp 3503 3571 LS-BP signal peptide matp 3572 4609 LS-BP mature peptide pept 4660 5586 leucine-specific binding protein (livH) pept 5583 6857 livM product pept 6854 7621 livG product pept 7623 8336 livF product mRNA 1303 > 4612 livJ mRNA [2] /nomgen="livJ" ORF 3078 2566 (c) ORF19 revision 101 102 ct in [2]; cgt in [1] revision 270 270 a in [2]; g in [1] revision 280 280 a in [2]; g in [1] revision 284 284 a in [2]; g in [1] revision 304 304 a in [2]; g in [1] revision 310 310 a in [2]; g in [1] revision 322 324 caa in [2]; agg in [1] revision 328 328 g in [2]; a in [1] revision 1453 1453 c in [5]; g in [3] revision 3832 3832 t in [5]; c in [3] BASE COUNT 2039 a 2176 c 2425 g 2062 t 1 others ORIGIN 76 min on the K12 map. 1 ctgcacggat caacattacg ccacttacgc ctgaataata aaagcgtgta tactctttcc 61 tgcaatgggt tccgtagcag ggaaagagac cccgttgtct cttcccggta tttcatctct 121 atgtcacatt ttgtgcgtaa tttattcaca agcttgcatt gaacttgtgg ataaaatcac 181 ggtctgataa aacagtgaat gataacctcg ttgctcttaa gctctggcac agttgttgct 241 accactgaag cgccagaaga tatcgattga gaggatttga atgactgaca aaatgcaaag 301 tttagcttta gccccagttg gcaacctgga ttcctacatc cgggcagcta acgcgtggcc 361 gatgttgtcg gctgacgagg agcgggcgct ggctgaaaag ctgcattacc atggcgatct 421 ggaagcagct aaaacgctga tcctgtctca cctgcggttt gttgttcata ttgctcgtaa 481 ttatgcgggc tatggcctgc cacaggcgga tttgattcag gaaggtaaca tcggcctgat 541 gaaagcagtg cgccgtttca acccggaagt gggtgtgcgc ctggtctcct tcgccgttca 601 ctggatcaaa gcagagatcc acgaatacgt tctgcgtaac tggcgtatcg tcaaagttgc 661 gaccaccaaa gcgcagcgca aactgttctt caacctgcgt aaaaccaagc agcgtctggg 721 ctggtttaac caggatgaag tcgaaatggt ggcccgtgaa ctgggcgtaa ccagcaaaga 781 cgtacgtgag atggaatcac gtatggcggc acaggacatg acctttgacc tgtcttccga 841 cgacgattcc gacagccagc cgatggctcc ggtgctctat ctgcaggata aatcatctaa 901 ctttgccgac ggcattgaag atgataactg ggaagagcag gcggcaaacc gtctgaccga 961 cgcgatgcag ggtctggacg aacgcagcca ggacatcatc cgtgcgcgct ggctggacga 1021 agacaacaag tccacgttgc aggaactggc tgaccgttac ggcgtttccg ctgagcgtgt 1081 acgccagctg gaaaagaacg cgatgaaaaa attgcgtgct gccattgaag cgtaatttcc 1141 gctattaagc agagaaccct agatgagagt ccggggtttt tgttttttgg gcctctgtaa 1201 taatcaattt cccctccggc aaaacgccaa tccccacgca gattgttaat aaactgtcaa 1261 aatagctatt ccaatatcat aaaaatcggg atatgtttta gcagagtatg ctgctaaagc 1321 acgggtagtc atgcataaaa cgaaataaag tgctgaaaaa caacatcaca acacacgtaa 1381 taaccagaag aatggggatt ctcaggatga acacaaaggg caaagcgtta ctggcaggat 1441 tgatcgcgct ggcattcagc aatatggctc tggcagaaga tattaaagtc gcggtcgtgg 1501 gcgcaatgtc cggtccggtt gcgcagtacg gtgaccagga gtttaccggc gcagagcagg 1561 cggttgcgga tatcaacgct aaaggcggca ttaaaggcaa caaactgcaa atcgcaaaat 1621 atgacgatgc ctgtgatccg aaacaggcgg ttgcggtggc gaacaaagtc gttaacgacg 1681 gcattaaata tgtgattggt cacctctgtt cctcatcaac gcagcctgcg tcggatatct 1741 acgaagacga aggcattttg atgatcaccc cagcggcaac cgcgccggag ctgaccgccc 1801 gtggctatca gctgatcctg cgaaccaccg gcctggattc cgaccaaggg ccgacggctg 1861 ccaaatatat tcttgagaaa gtgaaaccgc agcgtattgc tatcgttcac gacaaacagc 1921 aatacggcga aggtctggcg cgagcggtgc aggacggcct gaagaaaggc aatgcaaacg 1981 tggtgttctt tgatggcatc accgccgggg aaaaagattt ctcaacgctg gtggcgcgtc 2041 tgaaaaaaga gaatatcgac ttcgtttact acggcggtta tcacccggaa atggggcaaa 2101 tcctgcgtca ggcacgcgcg gcagggctga aaactcagtt tatggggccg gaaggtgtgg 2161 ctaacgtttc gctgtctaac attgcgggcg aatcagcgga agggctactg gtgaccaaac 2221 cgaagaacta cgatcaggtt ccggcgaaca aacccattgt tgacgcgatc aaagcgaaaa 2281 aacaggaccc aagtggcgca ttcgtttgga ccacctacgc cgcgctgcaa tctttgcagg 2341 cgggcctcaa tcagtctgac gatccggctg aaatcgccaa atacctgaaa gcgaactccg 2401 tggataccgt aatgggcccg ctgacctggg atgagaaagg cgatctgaaa ggctttgagt 2461 tcggcgtatt tgactggcac gccaacggca cggccaccga tgcgaagtaa tcattaatcg 2521 gcaactttgg gttgccgcca aattgctaat atcgagtacg ttgcttcatg ccggatgcgg 2581 cgtaaacgcc ttatccggcc tacaagatcc aaagaaatca gtaaattgca acacacattg 2641 taggcctgat aagcgtagcg catcaggcaa tacacttttg aaatcggact tgacgattaa 2701 cacttctccc agccgccctg ttgtgccgta aaccccagcg cctgcataaa cgccgtcatc 2761 acaccgcgat cttccacgcc gcagccgcca tccaccagca tgaaacgcca agattgttac 2821 gcaaaacctc ttccagcaga tattgcccca ccgcgacggc gggtgacttc ccgcacgcgc 2881 agggaatcca gtgctccctc ggtgccgctt aaggttgccc gcgcggcgcg agcaggcgct 2941 cgttaaacgc gcggcgtaga tacggtggtt atcgtcaacc tgtaacgagg aaggggaata 3001 ctcncggcca agatcttttg cgaggtcaat ccggtcttgg tcgctaaatt tttctaatcg 3061 aatgatggtc agcttcatgg gtaacccgtg taaatcacaa aagtgtaacc agtgtagcga 3121 aataatttaa tcggaggctt tctctttttt atttcttttg gcaggtgatt aattttttaa 3181 cagcaataat tacaaaatta aaacattaga gaatgaaaaa tgtccagcat aatcccctga 3241 atgatagtga attattccgc ccctttgtgc cgttatttta tgctgacaaa ggcacttttt 3301 tctgtttgtc tatcaataaa ttcggaatat tatctgttct taatcgactg aaaaatgggg 3361 attttaatcg ctattatcac aaaatactgc gctaacccct taatcagaca ggcaaaaaca 3421 gtgcagtata aaaaaagaac agtctgattt gttaacacat aaaaacaaag caacacaaca 3481 tcacgaatgg ggatttttga ctatgaaacg gaatgcgaaa actatcatcg cagggatgat 3541 tgcactggca atttcacaca ccgctatggc tgacgatatt aaagtcgccg ttgtcggcgc 3601 gatgtccggc ccgattgccc agtggggcat aatggaattt aacggcgcgg agcaggcgat 3661 taaagacatt aatgccaaag ggggaattaa gggcgataaa ctggttggcg tggaatatga 3721 cgacgcatgc gacccgaaac aagccgttgc ggtcgccaac aaaatcgtta atgacggcat 3781 taaatacgtt attggtcatc tgtgttcttc ttctacccag cctgcgtcag atatctatga 3841 agacgaaggt attctaatga tctcgccggg agcgaccgcg ccggaactaa cccaacgcgg 3901 ttatcaacac attatgcgta ctgccgggct ggactcttcc caggggccaa cggcggcaaa 3961 atacattctt gagacggtga agccccagcg catcgccatc atccacgaca aacaacagta 4021 tggcgaaggg ctggcgcgtt cggtgcagga cgggctgaaa gcggctaacg ccaacgtcgt 4081 cttcttcgat ggtattaccg ccggggagaa agatttctcc gcgctgatcg cccgcctgaa 4141 aaaagaaaac atcgacttcg tttactacgg cggttactac ccggaaatgg ggcagatgct 4201 gcgccaggcc cgttccgttg gcctgaaaac ccagtttatg gggccggaag gtgtgggtaa 4261 tgcgtcgttg tcgaacattg ccggtgatgc cgccgaaggc atgttggtca ctatgccaaa 4321 acgctatgac caggatccgg caaaccaggg catcgttgat gcgctgaaag cagacaagaa 4381 agatccgtcc gggccttatg tctggatcac ctacgcggcg gtgcaatctc tggcgactgc 4441 ccttgagcgt accggcagcg atgagccgct ggcgctggtg aaagatttaa aagctaacgg 4501 tgcaaacacc gtgattgggc cgctgaactg ggatgaaaaa ggcgatctta agggatttga 4561 ttttggtgtg ttccagtggc acgccgacgg ttcatccacg gcagccaagt gatcatccca 4621 ccgcccgtaa aatgcgggcg ggtttagaaa ggttacctta tgtctgagca gtttttgtat 4681 ttcttgcagc agatgtttaa cggcgtcacg ctgggcagta cctacgcgct gatagccatc 4741 ggctacacca tggtttacgg cattatcggc atgatcaact tcgcccacgg cgaggtttat 4801 atgattggca gctacgtctc atttatgatc atcgccgcgc tgatgatgat gggcattgat 4861 accggctggc tgctggtagc cgcgggattc gtcggcgcaa tcgtcattgc cagcgcctac 4921 ggctggagta tcgaacgggt ggcttaccgc ccggtgcgta actctaagcg cctgattgca 4981 ctcatctctg caatcggtat gtccatcttc ctgcaaaact acgtcagcct gaccgaaggt 5041 tcgcgcgacg tggcgctgcc gagcctgttt aacggtcagt gggtggtggg gcatagcgaa 5101 aacttctctg cctctattac caccatgcag gcggtgatct ggattgttac cttcctcgcc 5161 atgctggcgc tgacgatttt cattcgctat tcccgcatgg gtcgcgcgtg tcgtgcctgc 5221 gcggaagatc tgaaaatggc gagtctgctt ggcattaaca ccgaccgggt gattgcgctg 5281 acctttgtga ttggcgcggc gatggcggcg gtggcgggtg tgctgctcgg tcagttctac 5341 ggcgtcatta acccctacat cggctttatg gccgggatga aagcctttac cgcggcggtg 5401 ctcggtggga ttggcggcat tccgggggcg atgattggcg gcctgattct ggggattgcg 5461 gaggcgctct cttctgccta tctgagtacg gaatataaag atgtggtctc attcgccctg 5521 ccgattctgg tgctgctggt gatgccgacc ggtattctgg gtcgcccgga ggtagagaaa 5581 gtatgaaacc gatgcatatt gcaatggcgc tgctctctgc cgcgatgttc tttgtgctgg 5641 cgggcgtctt tatgggcgtg caactggagc tggatggcac caaactggtg gtcgacacgg 5701 cttcggatgt ccgttggcag tgggtgttta tcggcacggc ggtggtcttt ttcttccagc 5761 ttttgcgacc ggctttccag aaagggttga aaagcgtttc cggaccgaag tttattctgc 5821 ccgccattga tggctccacg gtgaagcaga aactgttcct cgtggcgctg ttggtgcttg 5881 cggtggcgtg gccgtttatg gtttcacgcg ggacggtgga tattgccacc ctgaccatga 5941 tctacattat cctcggtctc gggctgaacg tggttgttgg tctttctggt ctgctggtgc 6001 tggggtacgg cggtttttac gccatcggct tacacttttg cgctgctcaa tcactattac 6061 ggcttgggct tctggacctg cctgccgatt gctggattaa tggcagcggc ggcggcttcc 6121 tgctcggttt tccggtgctg cgtttgcgcg gtgactatct ggcgatcgtt accctcggtt 6181 tcggcgaaat tgtgcgcata ttgctgctca ataacaccga aattaccggc ggcccgaacg 6241 gaatcagtca gatcccgaaa ccgacactct tcggactcga gttcagccgt accgctcgtg 6301 aaggcggctg ggacacgttc agtaatttct ttggcctgaa atacgatccc tccgatcgtg 6361 tcatcttcct ctacctggtg gcgttgctgc tggtggtgct aagcctgttt gtcattaacc 6421 gcctgctgcg gatgccgctg gggcgtgcgt gggaagcgtt gggtgaagat gaaatcgcct 6481 gccgttcgct gggcttaagc ccgcgtcgta tcaagctgac tgcctttacc ataagtgccg 6541 cgtttgccgg ttttgccgga acgctgtttg cggcgcgtca gggctttgtc agcccggaat 6601 ccttcacctt tgccgaatcg gcgtttgtgc tggcgatagt ggtgctcggc ggtatgggct 6661 cgcaatttgc ggtgattctg gcggcaattt tgctggtggt gtcgcgcgag ttgatgcgtg 6721 atttcaacga atacagcatg ttaatgctcg gtggtttgat ggtgctgatg atgatctggc 6781 gtccgcaggg cttgctgccc atgacgcgcc ggcaactgaa gctgaaaaac ggcgcagcga 6841 aaggagagca ggcatgagtc agccattatt atctgttaac ggcctgatga tgcgcttcgg 6901 cggcctgctg gcggtgaaca acgtcaatct tgaactgtac ccgcaggaga tcgtctcgtt 6961 aatcggccct aacggtgccg gaaaaaccac ggtttttaac tgtctgaccg gattctacaa 7021 acccaccggc ggcaccattt tactgcgcga tcagcacctg gaaggtttac cggggcagca 7081 aattgcccgc atgggcgtgg tgcgcacctt ccagcatgtg cgtctgttcc gtgaaatgac 7141 ggtaattgaa aacctgctgg tggcgcagca tcagcaactg aaaaccgggc tgttctctgg 7201 cctgttgaaa acgccatcct tccgtcgcgc ccagagcgaa cggctcgacc gcgccgcgac 7261 ctggcttgag cgcattggtt tgctggaaca cgccaaccgt caggcgagta acctggccta 7321 tggtgaccag cgccgtcttg agattgcccg ctgcatggtg acgcagccgg agattttaat 7381 gctcgacgaa cctgcggcag gtcttaaccc gaaagagacg aaagagctgg atgagctgat 7441 tgccgaactg cgtaatcatc acaacaccac tatcttgttg attgaacacg atatgaagct 7501 ggtgatggga atttcggacc gaatttacgt ggtcaatcag gggacgccgc tggcaaacgg 7561 tagcccggag cagatccgta ataacccgga cgtgatccgt gcctatttag gtgaggcata 7621 agatggaaaa agtcatgttg tcctttgaca aagtcagcgc ccactacggc aaaatccagg 7681 cgctgcatga ggtgagcctg catatcaatc agggcgagat tgtcacgctg attggcgcga 7741 acggggcggg gaaaaccacc ttgctcggca cgttatgcgg cgatcccggt gccaccagcg 7801 ggcgaattgt gtttgatgat aaagacatta ccgactggca gacagcgaaa atcatgcgcg 7861 aagcggtggc gattgtcccg gaagggcgtc gcgtcttctc gcggatgacg gtggaagaga 7921 acctggcgat gggcggtttt tttgctgaac gcgaccagtt ccaggagcgc ataaagtggg 7981 cgtatgagct gtttccacgt ctgcatgagc gccgtattca gcgggcgggc accatgtccg 8041 gcggtgaaca gcagatgctg gcgattggtc gtgcgctgat gagcaacccg cgtttgctac 8101 tgcttgatga gccatcgctc ggtcttgcgc cgattatcat ccagcaaatt ttcgacacca 8161 tcgagcagct gcgcgagcag gggatgacta tctttctcgt cgagcagaac gccaaccagg 8221 ggctaaagct ggcggatcgc ggctacgtgc tggaaaacgg ccatgtagtg ctttccgata 8281 ctggtgatgc gctgctggcg aatgaagcgg tgagaagtgc gtatttaggc gggtaataac 8341 acgttgattg atagggagtc aaaagactcc tttgagacag gtgacaaatg taaaattgcc 8401 tgatgcgctg cgcttatcag gcctactggg tgagtggcaa tatgttgaat ttgcacgatc 8461 ttgtaggcct gataagcgtt taccgcgcat ccggcatgaa acgatgagca atctgtagag 8521 tttgattcag accttctata ttttcccgct tatccgtgcc ccatctccca ttttccctca 8581 cccacgccgt caccgccttg tcatctttct gacaccttac tatcttacaa atgtaacaaa 8641 aaagttattt ttctgtaatt cgagcatgtc atgttacccc gcgagcataa aacgcgtgaa 8701 ttc // LOCUS BOVGOA 472 bp ss-mRNA MAM 09-AUG-1990 DEFINITION B.taurus go-alpha mRNA, 3' end. ACCESSION J02900 KEYWORDS go-alpha. SOURCE B.taurus retina, cDNA to mRNA, clone GO3.1. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (sites) AUTHORS Price,S.R., Murtagh,J.J.Jr., Tsuchiya,M., Serventi,I.M., Van Meurs,K.M., Angus,C.W., Moss,J. and Vaughan,M. TITLE Multiple forms of go-alpha mRNA: Analysis of the 3'-untranslated regions JOURNAL Biochemistry 29, 5069-5076 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 472) AUTHORS Price,S.R., Murtagh,J.J.Jr., Tsuchiya,M., Serventi,I.M., Van Meurs,K.M., Angus,C.W., Moss,J. and Vaughan,M. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by S.R.Price, 12-JUN-1990. FEATURES from to/span description pept < 1 3 go-alpha (AA at 1) BASE COUNT 130 a 133 c 88 g 121 t ORIGIN 1 tgacctcttg tcctgtatag caacctattt ggtaatgatt ccagcactca cagaaaagct 61 tgcacacata cacacacacc ccacccctcc ccactaacaa atgcaagttg gtaaacaaat 121 tccaaaaagg cataacaaac cttatatata tagacaaata tatattaaag ttttttagtc 181 tgtactagaa agagcttcag acagaactga ccaccattcc attgctcatc aatttcctgg 241 gacagcacct gagcgtgcgc ttacgcgcgt acacacacat agacacgcac tgcgatacaa 301 gtcctgattt gggagtccgt ccttttaaaa acagccacat gctttcacgc tctgagaccc 361 acccgtttct gtgagcaggg ggagggcaag gaaagccctg gcctcagtcc agccttttct 421 ctgcttccac ctgctcaggc tgtgtgctct tggttctgtc ctgcacttgt gt // LOCUS CAJCAT 1334 bp ds-DNA BCT 09-AUG-1990 DEFINITION C.coli plasmid C-589 chloramphenicol acetyltransferase (cat) gene, complete cds. ACCESSION M35190 KEYWORDS chloramphenicol acetyltransferase. SOURCE C.coli plasmid C-589 DNA. ORGANISM Campylobacter coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic/microaerophilic, motile, helical/vibrioid bacteria. REFERENCE 1 (bases 1 to 1334) AUTHORS Wang,Y. and Taylor,D.E. TITLE Chloramphenicol resistance in Campylobacter coli, nucleotide sequence, expression and cloning vector construction JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Taylor, 15-JUN-1990. FEATURES from to/span description pept 309 932 chloramphenicol acetyltransferase (cat) mRNA 277 > 932 chloramphenicol acetyltransferase mRNA signal 242 271 promoter binding 297 301 ribosome binding site signal 960 1006 transcriptional termination signal BASE COUNT 433 a 232 c 282 g 387 t ORIGIN 1 attcccacaa cgccggaaac aagccgtgcc acgagcttat aataaaagag ggaagagaag 61 cgtatttttc ctcacttccg gtgaaggata tcgagaaaaa tctaaatgat aacggaattc 121 cgtcgtcggt atcgtatgga gcggacaacg agtaaaagag tgaccgccga gataacccat 181 tgctcggcgg tgttcctttc caagttaatt gcgtgatata gattgaaaag tggatagatt 241 tatgatatag tggatagatt tatgatataa tgagttatca acaaatcgga atttacggag 301 gataaatgat gcaattcaca aagattgata taaataattg gacacgaaaa gagtatttcg 361 accactattt tggcaatacg ccctgcacat atagtatgac ggtaaaactc gatatttcta 421 agttgaaaaa ggatggaaaa aagttatacc caactctttt atatggagtt acaacgatca 481 tcaatcgaca tgaagagttc aggaccgcat tagatgaaaa cggacaggta ggcgtttttt 541 cagaaatgct gccttgctac acagtttttc ataaggaaac tgaaaccttt tcgagtattt 601 ggactgagtt tacagcagac tatactgagt ttcttcagaa ctatcaaaag gatatagacg 661 cttttggtga acgaatggga atgtccgcaa agcctaatcc tccggaaaac actttccctg 721 tttctatgat accgtggaca agctttgaag gctttaactt aaatctaaaa aaaggatatg 781 actatctact gccgatattt acgtttggga agtattatga ggagggcgga aaatactata 841 ttcccttatc gattcaagtg catcatgccg tttgtgacgg ctttcatgtt tgccgttttt 901 tggatgaatt acaagacttg ctgaataaat aaaatcccag tttgtcgcac tgataaaaac 961 cctttaggaa ctaaagggcg cacttctata ctctctgtcg agagtagtgc gtcctgcgga 1021 gcttcattcc cggtcagcgc gcttatcaat atatctatag aatgggcaaa gcataaaaac 1081 ttgcatggac taatgcttga aacccaggac aataacctta tagcttgtaa attctatcat 1141 aattgtggtt tcaaaatcgg ctccgtcgat actatgttat acgccaactt tgaaaacaac 1201 tttgaaaaag ctgttttctg gtatttaagg ttttagaatg caaggaacag tgaattggag 1261 ttcgtcttgt tattaattag cttcttgggg tatctttaaa tactgtagaa agaggaagga 1321 aataataaat ggct // LOCUS CLOCBA 5120 bp ds-DNA BCT 09-AUG-1990 DEFINITION C.acetobutylicum beta-D-galactosidase (cbgA) and beta-D-galactosidase regulatory protein (cbgR) genes, complete cds. ACCESSION M35107 KEYWORDS beta-D-galactosidase; beta-D-galactosidase regulatory protein. SOURCE C.acetobutylicum (strain NCIB2951) DNA. ORGANISM Clostridium acetobutylicum Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1260 to 5120) AUTHORS Hancock,K.R., Rockman,E., Pearce,L., Maddox,I.S. and Scott,D.B. TITLE Clostridium acetobutylicum beta-galactosidase gene, cbgA, is positively regulated in Escherichia coli by a novel regulatory gene, cbgR JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 5120) AUTHORS Scott,D.B., Hancock,K.R., Pearce,L. and Maddox,I.S. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by D.B.Scott, 11-JUN-1990. Author address:D.B.Scott: Molecular Genetics Unit Department of Microbiology and Genetics Massey University Palmerston North, New Zealand E-mail:D.B.Scott@massey.ac.nz FEATURES from to/span description pept 1560 4253 beta-D-galactosidase (cbgA) pept 4500 4805 beta-D-galactosidase regulatory protein (cbgR) BASE COUNT 1921 a 683 c 876 g 1640 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattccttt tcatatatat ctttaatatt tctactggaa tagaagaggt tgctcaatac 61 aaaaaatgct tctttaaaac tatttgaaac tacttctgaa atattttcta gcttactaaa 121 tagagaatta taatttttat catcaaaatt tagaattaca actatgattt cgttttcaat 181 attagcaatt tgtatattat aattgctatt taatccgtct aaagaaaatt ctttgccgat 241 ttctgaaatt gtaaaatcaa taatttcatg gcgtttgcta taattatcat atatttcttt 301 gcgtttaaac caaataagca aaatgattga aaagtaaata tgtatcaaag tagttaaagt 361 caggatcatg tcaaaacctg atataaggcg atttaaggcg ctattagtga gacttaaaga 421 gtttccttct aaagtatttc ttttcatttt tattgaaatc ttttttagag tacttaataa 481 ctcagaagga tttagagaag gttttaaaat ataatcaaca gcaccatttt gaaaagatga 541 tttaacatat tcaaaatcgc tataactact taagatgata attcttatct taggatattt 601 gtcctgcaca aatttagcta attcaacccc atttatttgg ggcattacaa catcagaaat 661 tataatgtca ggaatatcct tttttatcat ttccagagct tcttgaccat tagaagcctg 721 tcctataatt tgaaagcctt ctttttccca atcaatcata tgagttatgc cttgccgcat 781 aataaattca tcatcaacaa ctaaattttt actatattcg ttcaatagta tagcacccct 841 tattctaaaa ttaccacaac atagataaat attgcttaat actattatac cttatagatt 901 tattgtatgt atctgtatac gttacgttaa ttcatctaca aatttatatg agttttggtt 961 gcacttttag agaaaatctt tttgtctatg gtcttattgt cctataatgg tcaaatcatc 1021 tttaccaaag tctcttgatt taaagagata aaaacaccac tgatccatta ttcctcattt 1081 tggtaatgaa cctatgcggt tgaagatatt aatcagatgt ctaaatactt tagaaaaaaa 1141 gacctttact aatatcttca atatttacac ccctattcta aaattaccac aagatagata 1201 aatattgctt aatactgatt ataccttata gattaaaggt tttcaattaa acaataaatt 1261 actttagtaa agtttagtaa aatataattg attttttact aaaaagataa taaaatgaaa 1321 ctataaattt agttaatagc ataaatctaa catcagaaga taggataaat taaagaagta 1381 atgtaattga ttacgaaaca aaatctcata ttaatattag cccataattt ttttattctc 1441 atatatgttt aagtattaat taaatgtgac tttataaaaa ggttgcattt agttaatacg 1501 attaacaact ttaatttaaa aaagcaataa ctctacaaag tgaaagtgag ggggtaagta 1561 tgattaataa taaaccgtca ttagattggc tagaaaatcc ggaaatattt agagttaata 1621 gaatagatgc tcattctgat acttggtttt atgaaaaatt tgaggatgtt aaattagaag 1681 acaccatgcc tcttaagcaa aatttaaatg gaaaatggag attttcatat agtgaaaatt 1741 catcattaag aattaaagag ttttataagg atgagtttga cgtaagttgg attgattata 1801 ttgaagttcc aggtcatatt cagcttcaag gatatgataa atgtcaatat attaatacta 1861 tgtatccttg ggaaggtcac gatgaattaa gaccacctca tatttcaaaa acatataatc 1921 cggtgggaag ctatgtaaca ttttttgaag ttaaagatga actcaaaaat aagcagactt 1981 ttatttcttt tcaaggtgtt gaaacagcat tttacgtatg ggtaaatgga gaatttgtag 2041 gatatagcga agatacattt acaccatcag aatttgatat tactgattat ttaagagagg 2101 gagaaaataa acttgcagtt gaggtttata aaaggagtag cgcaagttgg atagaagatc 2161 aagatttctg gagattttca ggcatcttta gagatgtata tttatatgca gttccagaaa 2221 ctcatgtaaa tgatatattt ataaaaacag atttatatga cgatttcaaa aacgcaaagt 2281 taaatgctga acttaaaatg attggaaatt cagaaacaac agttgaaaca tatttagaag 2341 ataaagaagg aaataaaata gctatatctg aaaagattcc gttctctgat gagttgactt 2401 tatatttaga tgcgcaaaat ataaacctat ggagtgcaga agagcctaac ttatatacac 2461 tttatatttt agtgaataaa aaagatggta atttaattga ggttgtaact caaaagatag 2521 ggtttaggca ctttgaaatg aaggataaaa ttatgtgtct aaaatggaaa cgtattatct 2581 ttaaaggcgt aaaccgtcac gaatttagcg caagacgtgg acgctcaatt acgaaagagg 2641 acatgttgtg ggatattaag ttcttgaaac aacacaatat taatgctgtt agaacatcac 2701 attatccaaa tcaaagttta tggtacagac tttgcgatga atacgggatt tatttaatag 2761 atgaaacaaa tttagaaagc catggttcat ggcaaaagat ggggcagatt gaaccatcat 2821 ggaatgtgcc aggaagtctt ccacagtggc aggcagcagt tttagatcga gcatcatcaa 2881 tggttgaaag agataaaaat catccatctg tacttatttg gtcatgtggt aatgaatcct 2941 atgcgggtga agatatttat cagatgtcta aatactttag aaaaaaagat ccttcacgtt 3001 tagtgcacta tgaaggggta actagatgca gagaatttat gacacgacga catgaaagta 3061 gaatgtatgc aaaggcagca gaaatagaag aatatcttaa tgataatccg aagaaacctt 3121 atatacagct gcgatacatg cactcaatgg gtaactcaac tggtggaatg atgaaataca 3181 cagaacttga agataaatat ttgatgtatc aaggtggatt catttgggat tacggcgatc 3241 aggcgttgta tagaaaactt ccagatggaa aagaagttct agcttatgga ggagacttta 3301 cagatcgtcc aacagactat aatttctctg gaaatggttt gatttatgca gatagaacta 3361 tatcacctaa agcacaggaa gttaagtatc tatatcaaaa cgtaaaatta gaaccagatg 3421 aaaaaggggt gactattaag aatcaaaatc tttttgttaa tactgataaa tatgatttat 3481 actatatcgt tgaaagagat ggaaaactaa taaaagatgg ttatctaaat gtatctgtag 3541 ctccagatga agaaaaatat atagaacttc caataggaaa ttacaatttt cctgaagaaa 3601 ttgtacttac aacctcatta agattagcac aagctacact ttgggcagaa aaaggatatg 3661 aaatagcatt tggacaaaag gttattaaag aaaaatcaga tatgaataat cataattcag 3721 agtctaaaat gaagatcatt catggagatg taaacatagg ggttcacgga aaagatttca 3781 aggctatatt ctctaaacaa gagggaggaa tcgtatcctt gagatataat aataaggagt 3841 ttataacgag aacgccaaaa actttctatt ggagagcaac aacagataat gatagaggaa 3901 atagacatga atttagatgc agtcaatggc tggctgctac tatggggcag aagtatgtgg 3961 atttttcagt tgaggaattt gatgagaaga ttacattata ttatacttat caattgccaa 4021 cagtgccatc tactaatgtt aagataactt atgaagtatc tggagaagga ataattaaag 4081 taaatgttaa gtataaagga gttagcggat tacctgaatt gcctgtacta ggaatggatt 4141 ttaaattatt agccgaattt aattcattta gctggtatgg aatggggcca gaagaaaact 4201 atatagacag atgtgaaggt gcaaaacttg gaatatatga gagtacacaa tagaaaatct 4261 atcaaggtat ttagtaccac aagaatgtgg taacaggata ggaactagat gggtagtagt 4321 taaaaatcat aagaatgaag gtcttaaatt tacttatgtt aaagttccat ttgaatttag 4381 tgttttacca tacagcagca tggaattaga aaattcactt catatagaag aattaccatc 4441 tgttaatttt acacattgtg aatataatag gtaaacaaat gggtgttggc ggagatgcaa 4501 tgctggggag caccatgata cctaaattct gtatagattc aagtaaggat ttagaatata 4561 gttttataat ttctaaaatt atactacgca catatgggaa ctatagatat ccaaaacaaa 4621 acttagactt atgcaataat ttacgaaagg acaggtactc tgttgtttcg gttactaaga 4681 ataagttgag gctttctaac atcataagtt gcaccatttc agcatgctcc cgagacaagc 4741 tcgtgacaag caaaaatgga acaacttatg atgaagaaat gcctgcaaca tattctttaa 4801 tgtaacactg cacaaaagag tacctgtcct ttctgatata gcagattttt caagctataa 4861 gtatatctca cgaaatcata aatattttga ttccgaaaag ctatgaaaat atcgctgaag 4921 gttctaagca gctggttgtg tgcaccttag catgctccaa ctttcagttt gacaagctaa 4981 aatggaacaa tctacagctc aagaaacttt aacagctcat tttcaaatgt tttctacaca 5041 aatatattta tatttctagt gaagatatga aattaaattt ttagcgactt tgtaaatatg 5101 ttaatctaat atacgaattc // LOCUS ECOPNCB 1490 bp ds-DNA BCT 09-AUG-1990 DEFINITION E.coli nicotinic acid phosphoribosyl transferase (pncB) gene, complete cds. ACCESSION J05568 KEYWORDS nicotinic acid phosphoribosyl transferase. SOURCE E.coli (strain K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1490) AUTHORS Wubbolts,G., Terpstra,P., Van Beilen,J.B., Kingma,J., Meesters,H.A.R. and Witholt,B. TITLE Variation of cofactor levels in Escherichia coli: Sequence analysis and expression of the pncB gene encoding nicotinic acid phosphoribosyl transferase JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by P.Terpstra, 31-MAY-1990. Nicotinic acid phosphoribosyl transferase is the first enzyme of the three enzyme Preiss-Handler pathway leading to the synthesis of NAD. The protein sequence shows similarity to orotate phosphoribosyl transferase (pyr5) from Dictyostelium discoideum (acc P09556, SWISS-PROT) FEATURES from to/span description pept 216 1418 nicotinic acid phosphoribosyl transferase (pncB) (EC 2.4.2.11) mRNA 158 1450 nicotinic acid phosphoribosyl transferase mRNA (3' end put.) signal 124 129 -35 region signal 146 151 -10 region rpt 170 185 inverted repeat binding 197 202 ribosome binding site signal 1426 1450 rho-independent transcription termination signal BASE COUNT 348 a 374 c 364 g 404 t ORIGIN 1 tgttgcgtaa tgcgtatgca gaatcttcat cttttcaggt acaaacgcct ttattgctac 61 atttttataa catacagcgc gtaatgccat cgaccagaaa ggtggcatat ggtgtgatcg 121 gggttcaata aattgcgaaa caaggtatac tccagcagtt cctgaagatg tttattgtac 181 taaacgctcc tgtacgagga cgctactgcg cacctatgac acaattcgct tctcctgttc 241 tgcactcgtt gctggataca gatgcttata agttgcatat gcagcaagcc gtgtttcatc 301 actattacga tgtgcatgtc gcggcggagt ttcgttgccg aggtgacgat ctgctgggta 361 tttatgccga tgctattcgt gaacaggttc aggcgatgca gcacctgcgc ctgcaggatg 421 atgaatatca gtggctttct gccctgcctt tctttaaggc cgactatctt aactggttac 481 gcgagttccg ctttaacccg gaacaagtca ccgtgtccaa cgataatggc aagctggata 541 ttcgtttaag cggcccgtgg cgtgaagtca tcctctggga agttcctttg ctggcggtta 601 tcagtgaaat ggtacatcgc tatcgctcac cgcaggccga cgttgcgcaa gccctcgaca 661 cgctggaaag caaattagtc gacttctcgg cgttaaccgc cggtcttgat atgtcgcgct 721 tccatctgat ggattttggc acccgtcgcc gtttttctcg cgaagtacaa gaaaccatcg 781 ttaagcgtct gcaacaggaa tcctggtttg tgggcaccag caactacgat ctggcgcgtc 841 ggctttccct cacgccgatg ggaacacagg cacacgaatg gttccaggca catcagcaaa 901 tcagcccgga tctagccaac agccagcgag ctgcacttgc tgcctggctg gaagagtatc 961 ccgaccaact tggcattgca ttaaccgact gcatcactat ggatgctttc ctgcgtgatt 1021 tcggtgtcga gttcgctagt cggtatcagg gcctgcgtca tgactctggc gacccggttg 1081 aatggggtga aaaagccatt gcacattatg aaaagctggg aattgatcca cagagtaaaa 1141 cgctggtttt ctctgacaat ctggatttac gcaaagcggt tgagctatac cgccacttct 1201 cttcccgcgt gcaattaagt tttggtattg ggactcgcct gacctgcgat atcccccagg 1261 taaaacccct gaatattgtc attaagttgg tagagtgtaa cggtaaaccg gtggcgaaac 1321 tttctgacag ccctggcaaa actatctgcc atgataaagc gtttgttcgg gcgctgcgca 1381 aagcgttcga ccttccgcat attaaaaaag ccagttaata tcatcaggga gctaatcggc 1441 tccctttttt tacctttaat tccgaaatct ttcgctgcat ttgcgaattc // LOCUS NEUCCON13 2728 bp ds-DNA PLN 09-AUG-1990 DEFINITION N.crassa conidiation-specific protein (con-13) gene, complete cds. ACCESSION M35120 KEYWORDS conidiation-specific protein. SOURCE N.crassa (strain 74-OR23-1A) DNA, clone pCon10a. ORGANISM Neurospora crassa Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Sordariales; Sordariaceae. REFERENCE 1 (bases 1 to 2728) AUTHORS Hager,K.M. and Yanofsky,C. TITLE Genes expressed during conidiation in Neurospora crassa: Molecular characterization of con-13 JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.M.Hager, 12-JUN-1990. Author address:K.M.Hager: Dept. of Physiology UCLA Medical School 10833 Le Conte Avenue Los Angeles, CA 90024-1751 E-mail:COTRAN%VXBIO.SPAN@STAR.STANFORD.EDU FEATURES from to/span description pept 1009 1275 conidiation-specific protein (con-13), exon 1 1333 1847 conidiation-specific protein, exon 2 1910 2150 conidiation-specific protein, exon 3 pre-msg 922 2367 con-13 mRNA and introns (alt.) pre-msg 927 2367 con-13 mRNA and introns (alt.) pre-msg 936 2367 con-13 mRNA and intron (alt.) pre-msg 946 2367 con-13 mRNA and intron (alt.) IVS 1276 1332 con-13 intron A IVS 1848 1909 con-13 intron B site 2364 2367 polyadenylation site BASE COUNT 653 a 695 c 720 g 660 t ORIGIN Linkage group IV. 1 gatctcatca tctgaaacgc cgcctgagtc aatgactctt ggcaatcggg ctctgcgtcc 61 ggctagatag acagcgtccc actgatacag acttggtaag ctgccacagt tgccaagttt 121 ttatatcgat tattctttga acttccaagg acagtcttca agggcgcttt ctgtctcagc 181 atcgggagat atgacgcccg tggttcgtat accaatggtt cggcactaag gcgctgcatt 241 tgactcggag atattgacgc ctgccccctt ttgagaggag actgagtgag cgaggcccaa 301 tactatcacc acagttgcgg ttagctgccg agacttatcg gtcaacaccg aaatattggc 361 ccagaagggc aacaaaacgg gctgtcgatg gcttgcaacc attgatatcc ctgattgcca 421 ttcctacact accgcccatt cttcattcaa acctgactct cttactccct ttacagtcta 481 gcagatctgg acgtacctgc atgtaatgcg gccaacgggg ctggtaagct gaacacacca 541 ttcggagcgg ctggcaagtc tgtcatgccc gatcgacagc acatgtacta gactatctta 601 agcctagttc cgtgttcaga aacatccggt ttgattgcga atcaacagta cattgatgtt 661 catccaccgg actctaaacc gatcagctaa ttgttggcgg agcggagttc atcgcgggcg 721 taggaaacaa ggttgatgtt acccgtaaat ggaaatcgtg cttcgctcac ggcgttgctc 781 cgaagtaggg tgaagaggtc cgttggctgt gatggtttgc gctggtgtgt gtcaacgctt 841 agtgatgctg gtgatccaac tccgatccaa atgacaaagc aatgcatata agaaggactg 901 ggcatcacca acagcgcaac ggcggcagac acgaagccct agctcgacaa gcagccttca 961 taccccgacc aaaaagtcac acttgtcgta ccgtaacctc gtcgcaagat gccccaggct 1021 catttcttcg cgttgctgct tgcagccgtt gtaccggccg ttttggcgga cggtcccccg 1081 gaatcgatgg gcgagaagtt cagcggcctc aacgttctgg atgggaacgg cggacttcaa 1141 agtttgaccc cgacacccta caccataagt caatggcctt ggggtactgt acccaagctg 1201 tgctatgaca cgtctgtcaa caacaagtac tgcaacccgt acgatctcga agtatacgat 1261 gtcagataca cggatgtagg taaaagactt gcctcggatt cggaacctgt gcttacctta 1321 acttgacaat agtgccccat tcccaccacc gtctgccgat gcaagaactc acctatggcc 1381 atagacacca ttgcgcagcg tgtcggccaa ctccctgtca aggctcgcca gtataatggc 1441 tatgtgtcca gctttgcggg agacatgtgc tcagcctaca gcgatagctt caacaactac 1501 ttctttggcg actgcggcaa ttccgagtcc gtcttcttcc atgagctcag ccacaacctt 1561 gaccgtcacg ttgcaggggc gtccatcaac gattggtact ccctttcgca agactggaag 1621 gataccgttg ccaaggacac ttgcgtcgca gaccactatt ccaaggccag ctggctcgag 1681 gcatatgccc aggtgggagt catggctgga tacgatgcta cggtacagtc tatctatacc 1741 caaaatgtcg gctgtatggt caatcaggtc aagaaggtgg ttggacagtt gaacagtgtc 1801 tggcgtaaac agcctgggca gatgtgcgat cgttactgga tcaaggagta agtttctttc 1861 aacaagaccc attttcttga tgaccctgtg ctgaccggaa tgtaaacagc accacggttt 1921 gcatgggacc tgatgcggaa gccagtggcc actgtcaagc atccaaagct gatgtcgcgg 1981 cggagtctgg tggtgtaaac ccagtgttgc cggacgggca gcagaagaag cacgacgcct 2041 tggtcaagga gcttcagcgt cacgccgagg ccgcggccgg catttcttcc ggaaaaccgg 2101 cggccgatag aaagaccaag ggtaagaagg gtaccaaatt cagggtctga agcgggaact 2161 atgatcgatt ccaggtcctg ggctctagct gtgagttcag tcagggtgtt gaggaagttg 2221 cgaggcctca gttgtgagcg acgtcatcaa accgtctcct tttgggataa tgataacctt 2281 ttatttctgg ataactggga caggttaggc tgtctttgtc gatagactag gtacgtaaga 2341 attgatttga tgcttgttcg atgcttttaa gttgttgtcg cttgtggttg cgaggtagtc 2401 ggcaggtttg tttggataga cgggagacgc ccactcgcac ccagggcgat gaataacgaa 2461 ggccgatggc tctttccatg tgggaaatac acaagtctgg cattgtccac ttgtttgtct 2521 tcgagcgggg ttacgatttc tgtcaagccc tttgctcctt tcttccgaga acaaaggaag 2581 ttttcgatcc agatcgccaa catccgaaaa gggaggaata gttcgatcga tgtaccttga 2641 cggctcggcc atcgatctga tctgcatttc ccactctgga ttccagggga agggtcatat 2701 gatggaaacg agatcgaaac ccattgag // LOCUS VVUVVHAB 2237 bp ds-DNA BCT 09-AUG-1990 DEFINITION V.vulnificus cytolysin (vvhA) and vvhB gene (pot.), complete cds. ACCESSION M34670 KEYWORDS cytolysin; cytotoxin; hemolysin; toxin. SOURCE V.vulnificus (strain EDL174) DNA, clone pCVD702. ORGANISM Vibrio vulnificus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Vibrionaceae. REFERENCE 1 (bases 1 to 2237) AUTHORS Yamamoto,K., Wright,A.C., Kaper,J.B. and Morris,J.G. TITLE The cytolysin gene of Vibrio vulnificus: Sequence and relationship to Vibrio cholerae El Tor hemolysin JOURNAL Infect. Immun. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.B.Kaper, 29-MAY-1990. FEATURES from to/span description pept 745 2160 cytolysin (vvhA) precursor sigp 745 804 cytolysin signal peptide matp 805 2157 cytolysin pept 237 743 pot. cytolysin (vvhB) signal 55 60 -35 region signal 80 85 -10 region signal 87 92 -35 region signal 110 115 -10 region signal 184 189 -35 region signal 206 211 -10 region signal 2185 2219 transcription termination signal binding 54 69 CRP binding site binding 59 74 Fur binding site binding 185 199 Fur binding site binding 226 231 vvhB ribosome binding site binding 730 735 vvhA ribosome binding site BASE COUNT 639 a 498 c 509 g 591 t ORIGIN 1 tatattagat cacttttaaa acaataatag atcagatatt aatctgttga ttttgtgata 61 atgagccaaa aaatactttt attttattta tatgaaatat tttcaggatt attaataaat 121 agccaacagg attttggtgc atatctattc tcaaggacga accaaacaat ctccatacaa 181 atattaatgt tatggagaaa ataacaataa taacccttac tcgtaatgag gaatctatgc 241 ttaataacaa aaatagaaat gtaggacgcc ttaccctact ctgctgtttg tttgcggcga 301 atacttttgc tgatgttcaa attttgggca gcgaaagtga gctttcacaa accattgccg 361 atcagtacca acaaaatgtc acgctgttta acggccagct aaacagtaat gatgtgttgt 421 atgtcaatgt aggaacagca accgatgacg aaatcactca agcaaaaagt catatcatct 481 ccggtagcac cgtggtgatt gatttgactc aaattgctgg tgacgacgca aggcttgatt 541 ggagccaaaa actcactggt ttaggactgt cagcgcctgt tgtggttacg ggggtttatc 601 aaggcgacgc cttagtcaat gcgattgtca gcgatgtcac cgacgagaat gacaacccaa 661 tcaacgatcc ccaagccgag ttagagagcg ttaaactttc tctcactcat gccctagacc 721 gcttccaatc tgagggaaaa taagatgaaa aaaatgactc tgtttaccct ttctctttta 781 cgtaccgcgg tacaggttgg cgcacaagaa tatgtgccga ttgttgagaa acctatttac 841 atcaccagct caaagattaa gtgtgtgttg cacacaagcg gtgatttcaa cgccacacga 901 gactggtgta atgcgggtgc ttccatcgat gttcgcgtca atgtggcaca aatgcgctcg 961 gtacaatcgg caacgtcaga tggttttact cctgacgcca aaattgtccg tttcaccgtc 1021 gatgccgaca agcctggcac gggtattcat ttggttaacg agctacagca agatcacagc 1081 tggttccaga gttgggcaaa ccgccgcact tacattggtc cattcgccag cagttacgac 1141 ctttgggtga aacccgtttc tggttacaca ccgaaaaaag cccgtgacct accgcagaat 1201 gagaacaaaa actaccaaca ccgcgatact tacggttact ccatcggtat taacggcaaa 1261 gtaggtgcgg aagtgaacaa agacggcccg aaagtgggtg gcgaagtcag tggctcattt 1321 acctacaact actcgaagac cttggtgttt gatacaaaag actatcgcat caacaaccgt 1381 tcatcattga gtgattttga tatttcattc gagcgtgaat ttggggaatg tgatgaactg 1441 cgccgccaag agcttggatg ctatttcacc gccgctcact ggggcagtgg ctgggtattt 1501 gataagacga agttcaaccc tatctcttat tccaacttca aaccgaacta tgacgttttg 1561 tacgaagcgc ccgtgtctga aactggcgta acggattttg agatgggcgt gaaactcaac 1621 tatcgtgcac gctttggtac cgttcttcct tcagcgctgt tttcggttta cggctctgcg 1681 ggctcgtcaa ccaacagcag tactgtgaaa caacgtattc gcatcgactg gaatcaccca 1741 ctgtttgaag cggaacgaca cgttacactg cagtcactga gcaacaacga tctctgcctg 1801 gatgtttatg gtgagaacgg tgacaaaacg gttgcgggtg gttcggttaa cggctggagc 1861 tgtcacggca gttggaacca agtttggggc ctagataaag aagaacgtta tcgtagccga 1921 gtggcatccg atcgttgttt gaccgtaaac gcagacaaaa cgctcacagt cgaacagtgt 1981 ggtgcgaact tagcacagaa atggtattgg gaaggcgata agctcattag ccgctatgtt 2041 gatggcagta atactcgcta ccttctaaac attgttggtg gtcgtaatgt tcaagtaacc 2101 cctgaaaatg aagcaaatca ggcgcgttgg aaacccacat tacaacaagt caaactctag 2161 gctctgttga ccttagcgat atccaaacgc tccctgtata ctagggagcg tttttcttta 2221 ttcgccatct attcgtc // LOCUS TOBCPCG 155844 bp ds-DNA circular ORG 09-AUG-1990 DEFINITION N.tabacum (var. Bright Yellow 4) chloroplast, complete genome. ACCESSION Z00044 KEYWORDS 16S ribosomal RNA; 23S ribosomal RNA; 4.5S ribosomal RNA; 5S ribosomal RNA; ATP synthetase; ATPase; NADH dehydrogenase; RNA polymerase; autonomous replication; carboxylase; chloroplast; complete genome; cytochrome; cytochrome b559; cytochrome b6; cytochrome f; initiation factor; phosphoprotein; ribosomal protein; ribosomal protein L14; ribosomal protein L16; ribosomal protein L2; ribosomal protein L20; ribosomal protein L22; ribosomal protein L23; ribosomal protein L33; ribosomal protein S11; ribosomal protein S12; ribosomal protein S15; ribosomal protein S18; ribosomal protein S19; ribosomal protein S2; ribosomal protein S3; ribosomal protein S7; ribosomal protein S8; ribulose bisphosphate carboxylase; transfer RNA-Ala; transfer RNA-Arg; transfer RNA-Asn; transfer RNA-Asp; transfer RNA-Cys; transfer RNA-Glu; transfer RNA-Gly; transfer RNA-His; transfer RNA-Ile; transfer RNA-Leu; transfer RNA-Lys; transfer RNA-Met; transfer RNA-Phe; transfer RNA-Pro; transfer RNA-Ser; transfer RNA-Thr; transfer RNA-Trp; transfer RNA-Tyr; transfer RNA-Val. SOURCE Nicotiana tabacum (var. Bright Yellow 4) chloroplast DNA, clone pHC79 (IR-A and IR-B). ORGANISM Chloroplast Nicotiana tabacum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae; Nicotiana tabacum. REFERENCE 1 (sites) AUTHORS Shinozaki,K., Ohme,M., Tanaka,M., Wakasugi,T., Hayashida,N., Matsubayashi,T., Zaita,N., Chunwongse,J., Obokata,J., Yamaguchi-Shinozaki,K., Ohto,C., Torazawa,K., Meng,B.Y., Sugita,M., Deno,H., Kamogashira,T., Yamada,K., Kusuda,J., Takaiwa,F., Kato,A., Tohdoh,N., Shimada,H. and Sugiura,M. TITLE The complete nucleotide sequence of tobacco chloroplast genome: Its gene organization and expression JOURNAL EMBO J. 5, 2043-2049 (1986) STANDARD full staff_review REFERENCE 2 (bases 1 to 155844) AUTHORS Sugiura,M. JOURNAL Unpublished (1986) Biology Dept, Nagoya Univ., Nagoya 464, Japan STANDARD full staff_review COMMENT The circular tobacco chloroplast DNA sequence is presented in a linearized form by cutting at the junction between IRA and LSC. The DNA strand which codes for the large subunit of ribulose-1,5-bisphosphate carboxylase is designated as A strand and the complementary strand as B strand. The nucleotide sequence of the B strand is presented. Large single copy region (LSC): 1-86684 (86684 bp) Inverted repeat B (IR-B): 86685-112023 (25339 bp) Small single copy region (SSC): 112024-130505 (18482 bp) Inverted repeat A (IR-A): 130506-155844 (25339 bp) Rps12 consists of three exons. There are two sets of exons 2 and 3. One set is located on the same strand in IR-B 28 kb upstream of exon 1. The oteher set is located on the opposite strand in IR-A 69 kb downstream of exon 1. The tobacco rps12 gene probably consists of three transcription units and requires trans-splicing. The chloroplast DNA segments capable of replication in yeast, ars1 and ars2, are located at positions 112768-113117 and 14570-15088 respectively. Seven open reading frames, (RF236, RF548, RF862, stop codon to stop codon) and (ORF151, ORF90, ORF80, ORF134, start codon to stop codon) are present near the rpoB gene. Four or these ORFs show some homology to portions of the beta'-subunit sequence of E.coli RNA polymerase [1]. Most open reading frames indicated in FEATURES are from start codon to stop codon. The intron boundaries for the ndhA and ndhB are not known and thus the largest possible intron is indicated (from stop codon to stop codon). FEATURES from to/span description tRNA 80 6 (c) His-tRNA (GUG) tRNA 4407 4371 (c) Lys-tRNA (UUU), exon 1 1844 1810 (c) Lys-tRNA (UUU), exon 2 tRNA 7487 7416 (c) Gln-tRNA (UUG) tRNA 8719 8632 (c) Ser-tRNA (GCU) tRNA 9499 9521 Gly-tRNA (UCC), exon 1 10213 10260 Gly-tRNA (UCC), exon 2 tRNA 10430 10501 Arg-tRNA (UCU) tRNA 28783 28854 Cys-tRNA (GCA) tRNA 31999 31926 (c) Asp-tRNA (GUC) tRNA 32191 32108 (c) Tyr-tRNA (GUA) tRNA 32323 32251 (c) Glu-tRNA (UUC) tRNA 33172 33243 Thr-tRNA (GGU) tRNA 37223 37132 (c) Ser-tRNA (UGA) tRNA 38050 38120 Gly-tRNA (GCC) tRNA 38421 38348 (c) fMet-tRNA (CAU) tRNA 47111 47197 Ser-tRNA (GGA) tRNA 48577 48505 (c) Thr-tRNA (UGU) tRNA 49288 49322 Leu-tRNA (UAA), exon 1 49826 49875 Leu-tRNA (UAA), exon 2 tRNA 50232 50304 Phe-tRNA (GAA) tRNA 54390 54353 (c) Val-tRNA (UAC), exon 1 53781 53747 (c) Val-tRNA (UAC), exon 2 tRNA 54581 54653 Met-tRNA (CAU) tRNA 68880 68807 (c) Trp-tRNA (CCA) tRNA 69118 69045 (c) Pro-tRNA (UGG) tRNA 88770 88697 (c) Ile-tRNA (CAU) tRNA 96507 96427 (c) Leu-tRNA (CAA) tRNA 102459 102530 Val-tRNA (GAC) tRNA 104547 104583 Ile-tRNA (GAU), exon 1 105291 105325 Ile-tRNA (GAU), exon 2 tRNA 105390 105427 Ala-tRNA (UGC), exon 1 106137 106171 Ala-tRNA (UGC), exon 2 tRNA 109973 110046 Arg-tRNA (ACG) tRNA 110699 110628 (c) Asn-tRNA (GUU) tRNA 116067 116146 Leu-tRNA (UAG) tRNA 131830 131901 Asn-tRNA (GUU) tRNA 132556 132483 (c) Arg-tRNA (ACG) tRNA 137139 137102 (c) Ala-tRNA (UGC), exon 1 136392 136358 (c) Ala-tRNA (UGC), exon 2 tRNA 137982 137946 (c) Ile-tRNA (GAU), exon 1 137238 137204 (c) Ile-tRNA (GAU), exon 2 tRNA 140070 139999 (c) Val-tRNA (GAC) tRNA 146022 146102 Leu-tRNA (CAA) tRNA 153759 153832 Ile-tRNA (CAU) rRNA 102758 104246 16S rRNA rRNA 106325 109134 23S rRNA rRNA 109236 109338 4.5S rRNA rRNA 109595 109715 5S rRNA rRNA 132934 132814 (c) 5S rRNA rRNA 133293 133191 (c) 4.5S rRNA rRNA 136204 133395 (c) 23S rRNA rRNA 139771 138283 (c) 16S rRNA RNA 32347 31836 (c) Asp-tRNA, Tyr-tRNA, Glu-tRNA RNA precursor pept 1595 534 (c) PSII 32kd protein (psbA) pept 6211 6172 (c) ribosomal protein S16 (rps16), exon 1 5311 5094 (c) ribosomal protein S16 (rps16), exon 2 pept 12148 10625 (c) ATPase alpha subunit (atpA) pept 13452 13308 (c) ATPase I subunit (atpF), exon 1 12612 12203 (c) ATPase I subunit (atpF), exon 2 pept 14099 13854 (c) ATPase III subunit (atpH) pept 16001 15258 (c) ATPase a subunit (atpI) pept 16938 16228 (c) ribosomal protein S2 (rps2) pept 27501 24289 (c) RNA polymerase beta subunit (rpoB) pept 34462 35523 PSII D2 protein (psbD) pept 35471 36892 PSII 44kd protein (psbC) pept 38873 38571 (c) ribosomal protein S14 (rps14) pept 41200 38996 (c) PSI P700 apoprotein A2 (psaB) pept 43478 41226 (c) PSI P700 apoprotein A1 (psaA) pept 48133 47528 (c) ribosomal protein S4 (rps4) pept 55276 54875 (c) ATPase epsilon subunit (atpE) pept 56769 55273 (c) ATPase beta subunit (atpB) pept 57587 59020 RuBisCO large subunit (rbcL) pept 64327 65289 cytochrome f (petA) pept 66860 66741 (c) PSII component (psbF) pept 67121 66870 (c) PSII cytochrome b559 (psbE) pept 70123 70323 ribosomal protein L33 (rpl33) pept 70510 70815 ribosomal protein S18 (rps18) pept 71401 71015 (c) ribosomal protein L20 (rpl20) pept 72326 72213 (c) ribosomal protein S12 A (rps12A), exon 1 100851 100620 (c) ribosomal protein S12 A (rps12A), exon 2 100083 100058 (c) ribosomal protein S12 A (rps12A), exon 3 pept 72326 72213 (c) ribosomal protein S12 B (rps12B), exon 1 141678 141909 ribosomal protein S12 B (rps12B), exon 2 142446 142471 ribosomal protein S12 B (rps12B), exon 3 pept 74950 76476 PSII P680 apoprotein (psbB) pept 77098 77319 PSII 10kd phosphoprotein (psbF) pept 77449 77454 cytochrome b6 (petB), exon 1 78208 78849 cytochrome b6 (petB), exon 2 pept 79845 80264 cytochrome b/f complex subunit 4 (petD) pept 81465 80452 (c) RNA polymerase alpha subunit (rpoA) pept 81947 81531 (c) ribosomal protein S11 (rps11) pept 82465 82175 (c) RF96 pept 83004 82600 (c) ribosomal protein S8 (rps8) pept 83544 83173 (c) ribosomal protein L14 (rpl14) pept 85093 85085 (c) ribosomal protein L16 (rpl16), exon 1 84064 83669 (c) ribosomal protein L16 (rpl16), exon 2 pept 85896 85240 (c) ribosomal protein S3 (rps3) pept 86348 85881 (c) ribosomal protein L22 (rpl22) pept 86680 86402 (c) ribosomal protein S19 (rps19) pept 88231 87841 (c) ribosomal protein L2 (rpl2), exon 1 87174 86741 (c) ribosomal protein L2 (rpl2), exon 2 pept 88531 88250 (c) ribosomal protein L23 (rpl23) pept 100004 99537 (c) ribosomal protein S7 (rps7) pept 125398 125135 (c) ribosomal protein S15 (rps15) pept 142525 142992 ribosomal protein S7 (rps7) pept 153998 154279 ribosomal protein L23 (rpl23) pept 154298 154688 ribosomal protein L2 (rpl2), exon 1 155355 155788 ribosomal protein L2 (rpl2), exon 2 mRNA 1680 441 (c) psbA mRNA (441 +/- 2 bp) mRNA 57025 54637 (c) atpB, atpE mRNA (alt.; 54637 +/- 1 bp) mRNA 57025 54676 (c) atpB, atpE mRNA (alt.; 54676 +/- 2 bp) mRNA 57405 59161 rbcL mRNA IVS 4370 1845 (c) Lys-tRNA intron IVS 6171 5312 (c) rps16 intron IVS 9522 10212 Gly-tRNA intron (no splice consensus) IVS 13307 12613 (c) atpF intron IVS 49323 49825 Leu-tRNA intron (no splice consensus) IVS 54352 53782 (c) Val-tRNA intron IVS 77455 78207 petB intron (no splice consensus) IVS 85084 84065 (c) rpl16 intron IVS 87840 87175 (c) rpl2 intron IVS 98349 97838 (c) ndhB intron IVS 72326 72213 (c) rps12A intron A IVS 141678 141909 rps12A intron B (no splice consensus) IVS 142446 142471 rps12A intron C (no splice consensus) IVS 104584 105290 Ile-tRNA intron (no splice consensus) IVS 105428 106136 Ala-tRNA intron (no splice consensus) IVS 123219 122140 (c) ndhA intron IVS 137101 136393 (c) Ala-tRNA intron IVS 137945 137239 (c) Ile-tRNA intron IVS 144180 144690 ndhB intron (no splice consensus) IVS 154689 155354 rpl2 intron (no splice consensus) rpt 86685 112023 inverted repeat B (IR-B) rpt 130506 155844 inverted repeat A (IR-A) site 1 86684 large single copy region (LSC) site 112024 130505 small single copy region (SSC) ORF 3658 2129 (c) ORF-509A cds ORF 7724 8020 ORF98 cds ORF 19753 17165 (c) RF862 cds ORF 20277 19873 (c) ORF134 cds ORF 20423 20181 (c) ORF80 cds ORF 20646 20374 (c) ORF90 cds ORF 21475 20765 (c) RF236 cds ORF 23127 21481 (c) RF548 cds ORF 24283 23828 (c) ORF151 cds ORF 37558 37241 (c) ORF105 cds ORF 37586 37774 ORF62 cds ORF 44264 44497 ORF77 cds ORF 45394 45146 (c) ORF82 cds ORF 46464 46240 (c) ORF74A cds ORF 48933 49145 ORF70A cds ORF 51457 50981 (c) ORF158 cds ORF 52417 51563 (c) bhpB cds ORF 52659 52297 (c) bhpA cds ORF 59785 61323 ORF512 cds ORF 62630 > 62630 ORF184 ORF 63407 64096 ORF229 ORF 66168 66467 ORF99A ORF 67580 67269 (c) ORF103 ORF 72686 72465 (c) ORF73 ORF 73547 73323 (c) ORF74B ORF 82162 82049 (c) ORF37 ORF 88883 90628 ORF581 ORF 90598 95724 ORF1708 ORF 95815 96078 ORF87 ORF 146472 96057 (c) ORF115 ORF 96116 96394 ORF92 ORF 96553 96792 ORF79 ORF 98889 98350 (c) ndhB, exon 1 97837 97047 (c) ndhB, exon 2 ORF 140581 101948 (c) ORF131 ORF 102099 102311 ORF70B ORF 110820 110593 (c) ORF75 ORF 111025 112077 ORF350 ORF 114198 112066 (c) ndhF ORF 116250 117191 ORF313 ORF 118958 117429 (c) ndhD ORF 119860 119555 (c) ndhE ORF 120383 120084 (c) ORF99B ORF 120612 120196 (c) ORF138 ORF 121512 121009 (c) ORF167 ORF 123840 123217 (c) ndhA, exon 1 122109 121597 (c) ndhA, exon 2 ORF 125023 123842 (c) ORF393 ORF 126482 125796 (c) ORF228 ORF 127561 126740 (c) ORF273 ORF 131501 127767 (c) ORF1244 ORF 131709 131936 ORF75 ORF 140186 140581 ORF131 ORF 140430 140218 (c) ORF70B ORF 145976 145737 (c) ORF79 ORF 146125 146472 ORF115 ORF 146413 146135 (c) ORF92 ORF 146714 146451 (c) ORF87 ORF 151931 146805 (c) ORF1708 ORF 153646 151901 (c) ORF581 ORF 96404 96057 (c) ORF 115 anticdn 45 43 (c) His-tRNA anticodon gtg anticdn 4376 4374 (c) Lys-tRNA anticodon ttt anticdn 7455 7453 (c) Gln-tRNA anticodon ttg anticdn 8685 8683 (c) Ser-tRNA anticodon gct anticdn 10222 10224 Gly-tRNA anticodon tcc anticdn 10463 10465 Arg-tRNA anticodon tct anticdn 28815 28817 Cys-tRNA anticodon gca anticdn 31965 31963 (c) Asp-tRNA anticodon gtc anticdn 32157 32155 (c) Tyr-tRNA anticodon gta anticdn 32289 32287 (c) Glu-tRNA anticodon ttc anticdn 33204 33206 Thr-tRNA anticodon ggt anticdn 37189 37187 (c) Ser-tRNA anticodon tga anticdn 38082 38084 Gly-tRNA anticodon gcc anticdn 38387 38385 (c) fMet-tRNA anticodon cat anticdn 47145 47147 Ser-tRNA anticodon gga anticdn 48544 48542 (c) Thr-tRNA anticodon tgt anticdn 49841 49843 Leu-tRNA anticodon taa anticdn 50265 50267 Phe-tRNA anticodon gaa anticdn 54357 54355 (c) Val-tRNA anticodon tac anticdn 54614 54616 Met-tRNA anticodon cat anticdn 68846 68844 (c) Trp-tRNA anticodon cca anticdn 69084 69082 (c) Pro-tRNA anticodon tgg anticdn 88737 88735 (c) Ile-tRNA anticodon cat anticdn 96474 96472 (c) Leu-tRNA anticodon caa anticdn 102491 102493 Val-tRNA anticodon gac anticdn 104579 104581 Ile-tRNA anticodon gat anticdn 105423 105425 Ala-tRNA anticodon tgc anticdn 110007 110009 Arg-tRNA anticodon acg anticdn 110667 110665 (c) Asn-tRNA anticodon gtt anticdn 116101 116103 Leu-tRNA anticodon tag anticdn 131862 131864 Asn-tRNA anticodon gtt anticdn 132522 132520 (c) Arg-tRNA anticodon acg anticdn 137106 137104 (c) Ala-tRNA anticodon tgc anticdn 137950 137948 (c) Ile-tRNA anticodon gat anticdn 140038 140036 (c) Val-tRNA anticodon gac anticdn 146055 146057 Leu-tRNA anticodon caa anticdn 153792 153794 Ile-tRNA anticodon cat BASE COUNT 47824 a 29991 c 28992 g 49037 t ORIGIN 2692 bp upstream of BamHI site. 1 ttatgggcga acgacgggaa ttgaacccgc gcatggtgga ttcacaatcc actgccttga 61 tccacttggc tacatccgcc ccctcgccta cttacattcc gtttttacat tatttaaatt 121 agaaaacaaa agattcaagt tcgaatatag ctcttctttc ttatttcaat gatattatta 181 tttcaaagat aagagatatt caaagataag agataagaag aagtcaaaat ttgatttttt 241 ttttggaaaa aaaaaatcaa aaagatatag taacattagc aagaagagaa acaagttcta 301 tttcacaatt taaacaaata caaaatcaaa atagaatact caatcatgaa taaatgcaag 361 aaaataacct ctccttcttt ttctataatg taaacaaaaa agtctatgta agtaaaatac 421 tagtaaataa ataaaaagaa aaaaagaaag gagcaatagc accctcttga tagaacaaga 481 aaatgattat tgctcctttc ttttcaaaac ctcctataga ctaggccagg atcttatcca 541 tttgtagatg gagcttcgat agcagctagg tctagaggga agttgtgagc attacgttca 601 tgcataactt ccataccaag gttagcacgg ttaatgatat cagcccaagt attaattaca 661 cggccttgac tgtcaactac agattggttg aaattgaaac catttaggtt gaaagccata 721 gtgctgatac ctaaagcggt aaaccagata cctactacag gccaagcagc taggaagaag 781 tgtaacgaac gagagttgtt gaaactagca tattggaaga tcaatcggcc aaaataacca 841 tgagcggcta cgatgttata agtttcttcc tcttgaccga atctgtaacc ttcattagca 901 gattcatttt ctgtggtttc cctgatcaaa ctagaagtta ccaaggaacc atgcatagca 961 ctgaataggg agccgccgaa tacaccagct acgcctaaca tgtgaaatgg gtgcataagg 1021 atgttgtgct cagcctggaa tacaatcatg aaattgaaag taccagagat tcctagaggc 1081 ataccatcag aaaaacttcc ttgaccaatt gggtagatca agaaaactgc ggtagcagct 1141 gcaacaggag ctgaatatgc aacagcaatc caaggtcgca tacccagacg gaaactaagc 1201 tcccactcac gacccatgta acaagctacg ccaagtaaga agtgtagaac aattagttca 1261 taaggaccac cgttgtataa ccattcatca acggatgccg cttcccagat tgggtaaaaa 1321 tgtaaaccta tagctgcaga agtaggaata atggcaccgg aaataatatt gtttccgtaa 1381 agtagagacc ctgaaacagg ttcacgaata ccatcaatgt ctactggagg agcagcaatg 1441 aaggcaataa taaatacaga agttgccgtc aataaggtag ggatcatcaa aacaccaaac 1501 catccaatgt aaagacggtt ttcagtgcta gttatccagt tacagaagcg accccatagg 1561 ctttcgcttt cgcgtctctc taaaattgca gtcatggtaa aatcttggtt tatttaatca 1621 tcagggactc ccaagcacac tagttttcta caaatcaaaa tagaaaatgg aaggcttttt 1681 attcaacagt ataacatgac ttatatactc gtgtcaacca aggtgtatgt agatctattc 1741 aaatttttaa tgaagttgat tggaaaaata cggacttctc tacagaaaat tagaatttcg 1801 atatgctagt gggttgcccg ggattcgaac ccggaactag tcggatggag tagataagtt 1861 ccttgttaaa taaaataaat gttaatctta aattaaataa acaagtaaag acccctcccc 1921 aagccgtgct tgcatttttc attgcacacg gctttcccta tgtatacatc agttcctttc 1981 ttatagaaat tagaaagact ttaaaaagtt gaatactcag ttgatttacc ccttaattac 2041 tattacaatc aacatttcag aatagtgaaa tttttttatc tcttcatcat ttagaaacaa 2101 atttccattt agaaaatcta agaatgaatc attgataatt cgccagatca ttgatacaaa 2161 aaatatccaa ataccaaatc cgacttctat atactcccca caaactagaa gaagctcgtg 2221 ggaaggtcaa agaaagaact tgttcttccg acgttaagaa ttcttccaat aattccgagc 2281 ccgatctttt caaaaaagtg cgtacagtac ttttgtgttt ccgagctaaa gttctagcac 2341 aagaaagtcg aagtatatac tttattcgat ataaagtctt ttttttggaa gatccgctat 2401 aataatgaaa aagatttctg catatacgcc caaatcggtc aataatatca gaatctgata 2461 aatcggacca aaccggttta ctaatgggat gccctaatac ggtacaaaag tttgctttag 2521 ctaatgatcc aatcaaagga ataattggaa caagggtatc gaacttctta attgcattat 2581 tgattagaaa tgaattttct aacatttgac tacgtaccat tgaaggattt agtcgcacac 2641 ttgaaagata gcccataaag tcacgggaat gattggataa ttggtttata tggatccttc 2701 ctgtgtgaaa gcacagagaa caatgacatt gccaaaaatt gacaaggtaa aatttccatt 2761 tattcatcaa aagaaacgtc ccttttgaag ccagaatgga ttttccttga tacctaacat 2821 aatgcatgaa aggatccttg aataaccata gggtaacctg aaaatcctta gcaaagactt 2881 ctacaagacg ttctattttt ccatagaaat atattcgttc aagaagggct ccaaaagatg 2941 ttgatcgtaa atgagaagat tggttccgta gaaagacgaa agtggattcg cattcatata 3001 cataagaatt atataagaag aagaagaatc tttgattttt ttttgaaaag gagtaaccgg 3061 gcttctttga agtaataaga ctattcaaat tccaaaattc atggagaaag aatcgtaata 3121 aatgtaaaga agaggcatct tttacccaat agcgaagagt ttgaaccaag atttccagat 3181 gaacagggta gggtattagt atatctaata cataatttag atgtgaaaaa ttgtcctcta 3241 aaaaaggaaa tgttgaatga attgatcgta aattataaga tttaaaaatc tttttgcctt 3301 cgaaagaaga taaagaagat attaatcgta gagaaaacgg aatttccaca ataaacgcaa 3361 atccctctga tatcatttga gaatacaaac tcttgttgca ccccaaaaat gaatttttgt 3421 tagaatcatt agtagaaata agaaaatgat tctgttgata cattcgagta attaaacgtt 3481 tcacaattag gaaacttaat ttattgttat aacctggatt ttccaacaaa atcgacctat 3541 ttctatttaa accatgatca tgagcaagtg cataaatata ctcctgaaag ataagtggat 3601 ataggaagtt gtgttgttgc gatctatctg gctgtaaata tctttggatt tcttccattt 3661 gaaattcgat ttgaaccaaa gacggaagat tttgagggtt atcaaatgat acatagtgcg 3721 atacagttaa aacaaagtat tttagtaaga atagatacct tggatacagg taaacttctc 3781 aacggattct ctatcatctc tttttttttt cgtttcgttt aattggtcta tgttatagtg 3841 ttataggata ataagatggt tagaaatcct ttattttttc aacctaatcg ctcttttgac 3901 ttcggaaaaa actttcttta tcaatatact gtttcttcta cacacacatc tccgtaatag 3961 aaaatggtaa tagttaggat tcattaaaaa aatggagaat ccactcatgg gacaagaaac 4021 ccttcccgca tcaggcacta ataaattttt aacgtctaat tagatcggga atcattcaaa 4081 ttaagaacaa aagctcgttg ctttttcttt ctttccctat aatttaattg aagccgcagc 4141 cctatccatt tattcattcg acccaacttt attttgttcc gttccaagaa ttctaacacg 4201 gttttatacc catctaggaa caatgaaata ttctcagaac tttccgttga tacgacatgc 4261 tatttttacc attcattccc tttcaggatc agtcgtggtc ttccaaactt taccgagagt 4321 atggacgaat ccctcacttc atccatatgt gtaaaagaga ctagccgcac ttaaaagccg 4381 agtactctac cgttgagtta gcaacccgaa gaaaatatcg aagaaaaata aataaagaga 4441 ttagacaaga caaccaaaaa ccattgaagg aataaatcta aaaaaataca ttcacatttt 4501 cgaattaatt taaaacataa aataaaacta aatagatcca cttcatttat cacaatgaat 4561 tatatttgtt cgatacactc tgttgtcata aatattgaat agtgaaaaaa aaaaaagaaa 4621 tttcaattga caacaataaa aaatattaaa aaaaaggact tgtgttagat tggcactaca 4681 aatctaatcc aaataaaata gatacaaaaa agtatagatg agagaataaa ttaagtggaa 4741 aacaaaacta caatttattt agatttattt aatccataat ggattcaatc aagttaagtg 4801 agataagcaa acttgatttc ttttttagtt ttagtagagt tccaatgaaa aacggaaaac 4861 cacccaattg aaggaaatgc ccgaattttc tatttcgagg atcaataaaa taagaggttt 4921 tgtcgttata gaacacggaa ttcaatggaa gcaatgataa aaaaatacaa atagaaaagg 4981 aaagggagga aatacaaaaa aatagaagag aaaagtcata caaagttata tacaaatgac 5041 tacccccctt tttgtatttc cttaatttat ttccttaatt gaatttcggt tgattaggat 5101 tgattaggac gaagttcctt aaaaacctcc gccttcttta aaatatcctg aacagttcct 5161 gtaggttgag cccctttttc aaggaaatat aaaatagcag gaacatttaa ataagtttga 5221 ttctttatcg gatcataaaa acccactttc cgaagatctt ttccttctct tcgggatcga 5281 acatcaattg caacgattcg atagacggct cattgggatt gatgtagatg aacaacaccc 5341 cccctagaaa cgtataggaa gctttctcct cgtacggctc gagaaaaatg attgattcga 5401 ggttttatct ctgtatggaa ttctatctaa gaaatgacaa ctgggtccat aaaatgatca 5461 aatcaattaa agatgtaagt cttttttttt cttctttctt cctgaaaatg aaaaagaaac 5521 cattcgtact ctcataactc aagttggata actttcaaac agttcaaagg aaaatctttc 5581 ggcaatttca tttattgagc ggtctttcct ccttttatgt ttgtctcgtt taaaatggat 5641 ttggattctt cagttcgatc cagttattaa gacaataaaa aaggtgtttc cttgttctgg 5701 gatcctttat ctttgtttta ttttaaatca ttgggtttag acattacttc ggtgcttttt 5761 aatcctttca aaatggcagc aacatacccc ttttgcgatt tctatgaaag aatcctacag 5821 acgatggatt cccgcgtgaa acactttgga tcgaaaagtt tgaatcaatt ccaaggaatt 5881 tttgaattgg aaacttgctc gaattggatt ctttcgattt ccataccgaa aatatattta 5941 cgaagttgtt ccaatttttt tattgattgg cattaaccct agactcttgc cccgagaaat 6001 aaattaatac tttctactcg agctccatca tggactattt acattccaag acaacaaaaa 6061 agaggggttc taatgaaaca gaaccaatga tgtcgagcca agagcacctt cattcctaca 6121 taaaatggtg gatgtacaaa tccacaacgg atcctgtcct tcaagtcgca cgttgctttc 6181 taccacatcg tttcaaacga agttttacca taacattcct ctaagaaccg gtctggaatt 6241 gattcaatta tggaatcatg aatagtcatt ggttgggctg atgtataaac accataatct 6301 atactttgtt ctatatctat atactataga gataggtgga taaatatttt tctttagtaa 6361 gaccccatcg ctaatattaa tttatctaac atattaatta atatttaata tataaatata 6421 tatagaaata ataataaata agaataataa taaataagac gaataaatga gttctttttg 6481 attctgcatc ttcacgtgac tcaataggag agattgacct atttcagact tcttcaaata 6541 gcaaagattc cgcttataag gaatgattaa aactatttat atttctaaat ttagaaagtt 6601 cccttttcga catcattatt tgaagaaaat ttgatagtta aagatcactt ttgatcatct 6661 taggaaagaa aaaagataag tctttctttt ttaattgaat catcaacgat ttcaatgatc 6721 taaaatagat aaatacacca aacaacaaat ccaatttttt tttatgagat ggataaaaaa 6781 agattaatat aaggtaagat tttcattctt attctttttt ttttttttca tctgattgat 6841 aaaatccaaa gaatggggag ggtttcgtat ctatcaattc gatcaaatag actgagcaat 6901 tgtcaccgtt tatagatatt gaaatgaatg ccttcccatt actgattaac tcctatctac 6961 cccattctat gggcctgatg cagcataaat caaaagaaaa gaggggggtg tcctagtctt 7021 tttgattttt acgaaatgcg agctgtctag gcacaaagcc aaacaagtcc agattaagtc 7081 aagtttttgc tcctattttt tgatatttta gcctaactca ttgattaaga attaagagac 7141 ttagtgaatt taattagtac caaaaatccc ctcttggcga aaagtcaaga aatccacaaa 7201 aaagaaaatg gaatctaatt aggctaattt aggggataga gaatacgaga tagggaatat 7261 agattctttc gcatctcgat tccgtttttg aaaaaaaaaa atgattcatc gaagaaaaaa 7321 atcagaaaca acaatcacat tccagctaac atttcgattt taaacagaac attgttaaaa 7381 aagcaatcta tattctcata gaatatatat atgttctggg acggaaggat tcgaacctcc 7441 gaatagcggg accaaaaccc gttgccttac cacttggcca cgccccattt agatttctat 7501 tcgatactaa gaaagtatat tgcttgtttt gtttgtttgt caactctagt ccaaatatct 7561 atagaataga ttagattggt actaggattt tgcgatgttt ttggtatgtg tagatataga 7621 attcaactta atttattgat cattacatat aattcaatta agatattgta tgaaaatatg 7681 attttttcga ttctcctttg agaaaaggag gatttttgat tgggtgggtt caaagaaaaa 7741 gaagtatttt ttgtttacct tacttacttt ccctttcctt atatcaataa cgcaatcaaa 7801 atgcaattat ctctccaaga acaaaaagtc tgttatgctt aataccttta gtttgatcgg 7861 tatctgtctt aattcgaccc ttttttcgag tagttttttc ttcggcaaat tgcccgaggc 7921 ctatgctttt ttgaatccaa tcgtagatat tatgccagtc atacctctgt ttttttttct 7981 cttagccttt gtttggcaag ctgctgtaag ttttcgatga gatccttaat aatatcctag 8041 aaaattcatg atttattcga gaaaaattct aaaataaata aaatcagata agctttaccg 8101 tttgaaacct cgattcaaac attgaaattc ttggatagtc acgagaaatc cggcttaact 8161 tatttcctta ttttttgacg ctttcccttc cagtgaaaga ccttattagg ctcctcacaa 8221 tacctaattg tgtatataaa aaaattttgg ttaatgacaa actcttagta gaaaagaatt 8281 tatgaaaatt cttttagaga aagagcttca ttgcttggtg tcaaactagg atatgcggta 8341 gaaaaatgga tgatctattc tctttttttc aaaaaaaatc atcttggaga ttgtgtaatg 8401 cttactctca aactcttcgt ttacacagta gtgatatttt ttgtttctct cttcatcttt 8461 ggattcctat ctaatgatcc cggacgtaat cctggacgtg aagaataaaa taaaaaaggt 8521 ttttccttgc ttgattttcc aattttctta tgatttggtc tattccacac atttaactaa 8581 gaataagaac aaaggatttc gaaatttgaa aaaaaaaaat caagtcatca acggaaagag 8641 agggattcga accctcggta cgattaactc gtacaacgga ttagcaatcc gccgctttag 8701 tccactcagc catctctccc aattgaaaaa gataattact acatgagata gcacataaga 8761 taaaggaaag aatctttctt tctctctttt cttctttcta tattatatag atatgtacaa 8821 cttttatcat caatttcctt tatttcttta tctaaagtaa aggaagggct cagaagagcc 8881 aagaatatca agaaaaataa agaagacctc ttttctttgt cttgattttg ttcgaaagga 8941 ccctcttatt ctcatggcct ggtctggtca gtacccagcc gggcctcttt tgttccaacg 9001 aatttgaatt tgaaaactaa aaagcctgtt atagttgtaa tatttcattt taattgaata 9061 gttaatattc aagcaacaag aaaaaattcc cattttttgt aaaagtaaaa taaaatatat 9121 aaaatagaaa attcgatcaa aataaaagtc tcatttctct ttctgctttt ttattttatg 9181 tttaccacct tactggacta aaaaaaagaa gctttcgagt attccacaat gcatttttat 9241 gttatgattt tagtggtttt gacgagccgt atctctatca aaactcctcc agcaaaagaa 9301 aagataaaac taaattctgt aatttagtta tttaaatgaa ccctcgtttc caaatctcat 9361 caaattggaa tccccccagg aaaaaagatc aacactctaa tttggatgat tctgtgacga 9421 ccctatctta tcctatcttg attaccacaa ttcccctgtt cgacaaaagt tgcatttgta 9481 tacaataatc ggattgtagc gggtatagtt tagtggtaaa agtgtgattc gttctattat 9541 cccttaaata gttaaagggt ccttcggttt gattcgtatt ccgatcaaaa acttgatttc 9601 taaaaaggat ttaatccttt tcctctcaat gacagattcg agaacaaata cacattctcg 9661 tgatttgtat ccaagggtca cttagacatt gaaaaattgg attatgaaat tgcgaaacat 9721 aattttggaa ttggatcaat acttccaatt gaataagtat gaataaagga tccatggatg 9781 aagatagaaa gttgatttct aatcgtaact aaatcttcaa tttcttattt gtaaagaaga 9841 aattgaagca aaatagctat taaacgatga ctttggttta ctagagacat caacatattg 9901 ttttagctcg gtggaaacaa aacccttttc ctcaggatcc tattaaatag aaatagagaa 9961 cgaaataact agaaaggttg ttagaatccc ctcttctaga aggatcatct acaaagctat 10021 tcgttttatc tgtattcaga ccaaaagctg acatagatgt tatgggtaga attctttttt 10081 tttttcgaat tttgttcaca tcttagatct ataaattgac tcatctccat aaaggagccg 10141 aatgaaacca aagtttcatg ttcggttttg aattagagac gttaaaaata atgaatcgtc 10201 gtcgactata acccctagcc ttccaagcta acgatgcggg ttcgattccc gctacccgct 10261 ctatatctat ttattctaaa tattttaatg tattcattaa atcaaattta gtttattagt 10321 attagtacat cattgaatat acaattccaa aaattctttc acatccgatt ctttctgttt 10381 tttttttcaa acaaaaagtt aaaatacgaa aaaaaaatca gaatgaaaag cgtccattgt 10441 ctaatggata ggacagaggt cttctaaacc tttggtatag gttcaaatcc tattggacgc 10501 aatttatttc catatatatt tttttttaga tttcgatagc aagaaagact gtttgaatat 10561 ttgaatccaa gacgcttgat tccttttttt tattaagatt aagacaaaag tgatcaatat 10621 ttctttatgc ttgttcctga agtataaaac ggtccatttg ttcctgaata gcttctttca 10681 aaagggcttc tgcttcctcg gtaaatgtct tggtagaaga tatgatttct tggaactgag 10741 gtttattagt ttttaagtaa gtacgtagct caacaagaaa tttccttacc tgtccaactt 10801 ctaatgaatc aagatagccg tttgttccgg tataaatagt cattatctgc tcttctaccg 10861 tgagaggagc tgattgggat tgtttaagca attcacgtaa tcgttgacct cttgccaatt 10921 gattctgagt agctttatcg agatcagaag caaattgtgc aaaggcttct aattctgcga 10981 attgtgctag ttctaatttt aatttaccag ctacttgttt catggctttt atttgagctg 11041 cggaccccac tctggaaacg gagataccca cattaatagc aggtctgatt ccagaattga 11101 ataggtcggc ggataagaag atttgtccat cagtaatgga aattacatta gtaggaatat 11161 aagccgaaac atctcccgat tgggtttcaa ctattggtaa ggcggtcata cttccttcac 11221 ctaaactaga acttaattta gcggctcttt ccaaaaggcg tgaatgcaaa taaaaaacat 11281 ctcctagata agcttcacga ccgggcggtc ttcgtaatag aagagacatt tggcgataag 11341 cttgcgcttg tttggaggga tcatcataaa tgattaaagt gtgtcgttca cgatacataa 11401 aatattcagc cagagctgct cctgtataag gagcaaggta ttgtaatgta gcaggggaat 11461 ctgccgtttc ggctaccaca atagtgtatt ccatcgctcc cctttcctgt aaagtagtta 11521 cgacctgggc cacagaagat gctttttgcc caatagctac ataaacacat attacatttt 11581 gaccttgttg attgaggatc gtatctgtgg ctactgctgt tttaccggtc tgtctgtccc 11641 caataattaa ttctcgctga ccacgtccta tagggatcat cgaatcaata gcaataagcc 11701 cggtttgaag aggctcatat acggaacggc gcgaaataat acccggggcg gcagattcga 11761 ttaatcgaaa ttcagaagct gaaatttcac ctctaccatc aataggttta gccagggcat 11821 ttataacacg acccaaataa gcctcactca cgggtatctg agcaattctt cccgttgctt 11881 ttacagaact tccttcttgt atcaataaac catcgcccat taatacaaca ccaacattat 11941 ttgattccaa attcagagca atgcctattg taccctcttc aaattcgact aattcacccg 12001 ccattacttc atcaagaccg tgaatacgag caatgccgtc gcctacttga agtacggtac 12061 cggtatttac aatctttact tctctattat attgttcaat acgttcacgg ataatattac 12121 taatttcgtc agctcgaatg gttaccatga ttctttcttt attatttttt gaaagaaaaa 12181 aataatacct acagtagaaa gactaatcag ttatttcttt cattgttccc aacatgccaa 12241 tattggacct aatggtacgt aaatgtaact cgttgttcaa acaactattc agagttccta 12301 gagctcctcg taaggcttgt tggaaaaccc gttgtcggac ttgattaatc gccctttgct 12361 gttcaaactg aatcgtttcg tttttgtaat tttctaattg ttccaaagtc ttataagttg 12421 aattaatcaa attcaatttt tctcgttcta tttcagagta tccattcact cgaaactgct 12481 cggcttcgct ttcgactttc cgtaagcgag aacgagcttt ttcgagttgt tcaatagccc 12541 ctccacgcag ttcttctgaa tttcgaatag tattcaagat cctctgtttt cgattatcta 12601 ataaatcact taatgaaagt agattatttt tccattcctt tccaaaattc cataatccct 12661 tcccgaacca aacatgaatc tttcgattca tttggctctc acgctcaatt acttaaggta 12721 aattctcata tcttttttta tgaatgtaat gagcctatct tctcttcttt gttcatattc 12781 caaaaagata tcgaaactaa tgtaatacca aaatattcgg aggactcttc tgacaaaata 12841 aaaaatatgt aattgtcagc aaagttgttt cttttttttt ttttcaaatc caaaaagctc 12901 ttcttactta gaataggtcg tcgattcagc attagataaa gggggtaaaa tccccgtttt 12961 tacaatttac aataagcggt tcaaatcatt ttatcaatat gagtatccta tatcgataaa 13021 atatttattt tgaaaccacc tctatattaa catagtggta gaaagagtac catgctgcgt 13081 ctagacttca aacagtttgt tttaaccatg ttaatagttc cacattattg gttaatagag 13141 aatcaaaatt gatttaccaa tgaatcgcga aatgctatgg ttcttacata taatttctga 13201 atttattcag aagtaattcg cgagatcatg cacctctctt tcctagttat aacggaaaag 13261 ggtacagctg ggtggtccag cctattcttg aaataaacaa ctcgcacaca ctccctttcc 13321 aaaaaaaatc aatacaccaa gcactacact tagatttatt ggatttgttg ctaaaatatc 13381 ggtattaaac ccgaaactcc cggcagatgg ccagtggccc aaagaaacga aagaatcggt 13441 tacgtttttc atatgatctc ctcttataga tagactaaaa aatcgaacag agttcttttt 13501 gtagcacttc gcccctcttt ttatttattc ttttattttt tctgaaattg agtcaaaaaa 13561 taaaaaatat tcgagttagt tataaattat gaactaacga actagccctt ttattggtta 13621 ttggaacact aacacttact aaaaagagtt tcccttggtc tatgaacggg aaggatgaaa 13681 gcgagtcagt atgctaattc ctcatccgca aatcagccct tcccgtaggt tcttttctca 13741 aagaataaag aattggagga gggaaatctt gatagaattt gaaaaagcaa acgacaagtc 13801 gaaggcaata aaatatgaaa aatgtattta tttttcatat ttctaagcta agattaaaca 13861 aaaggattcg caaataaaag tgctaatgct acaaccagtc cataaattgt taaagcttcc 13921 ataaaagcta gactaagcaa tagcgtacct cgtatttttc cctctgcctc aggctgtctc 13981 gcgataccct ctacagcttg acccgcagca gtcccttgac caactccggg tccaatagaa 14041 gcaagcccta cggccaatcc agccgcaata acggaagcgg cagaaatcag tggattcatg 14101 ataagttcct cgtaccaaaa aaaagaaatg gttaacgata caatcaacca atgagttatg 14161 acttaattat tccctcgcta ggaatcatcc agtcgaagta actaagaact tcggattgaa 14221 gtaataagat tattgaatca tcagaactac ttcgatatat cttttttact ttttagccac 14281 agagtctttg tgaacccata cgactttcgt tcttccattt cttggttcga actgttagtt 14341 gaattatttc ttgatttcat ccgtttattc attcaattca cagtcacaag gggccggaag 14401 gacttctagt ctattagaat cccctagagt agtaaaatta tatctttagt tcatttcata 14461 tataactagc actagtcaat atctaatatc acatatacat gtctttcttc cataacgtaa 14521 accaagcatt catcttagat tcaatcctat tcgagaatca agcgtcgaaa catctagaag 14581 ggttggctta tagttattca attacagata cctccctctc ctaaccgacc ctttctaaaa 14641 tactcaaaaa aatccctttt ttgtaaattc ttttgaacct taccttttct tattattcca 14701 cctagataaa tctaaatgga caaattgatt aggccgaata attccatatg tatagaaata 14761 tcattatttg attgatctaa gttcatgcaa tttattaata aaaatgaata atttatttat 14821 taattattaa tattttggtc aatcgttgaa taaaatcaac tgaaagggaa atcgtttcgc 14881 cctttttaat ttaatttaat tacacgtcgt aaacctatac aacaagaatt ataattattg 14941 acaaaaattc ttatattcaa attgttttaa caatgaatta ataatgagat ggactaagca 15001 atctaaagtg aatattcatt gagacgaagt atgatattaa gtgaaggaaa ggggaatttt 15061 aggaaaaaga tctttttttt ttagatcttt ttccccttac tctttaatat catcgtaatt 15121 tttttgctat cactctagat cgtatataaa atagttgtat atttagattc ccctattcta 15181 ttccctaagt taagtaattc tcttgagcca cccaccatat ttatacattg ctttgggcta 15241 agctaaataa gactatttca atgatggccc tccatggatt cacctatata agccgcggct 15301 aaagttgcaa aaataagagc ttgaatacca cttgtaaata atccaaggag catgacaggt 15361 ataggaacta ctaaaggtac taaagaaaca agaacaacaa ctactaattc atcagctaag 15421 atattcccga aaagtcgaaa actaagtgat aaaggttttg tgaaatcttc taagatgtta 15481 atgggtaaaa ggattggggt tggttgaata tattttccga aataacctaa tccttttttt 15541 gtaagacccg catagaaata tgccactgat gtgagtaaag ccaaagcaac agtagtattt 15601 atatcattcg tgggtgcggc taactcccca tgaggtaatt gtatgatttt ccaaggtaaa 15661 agagctcctg accaattaga aacaaaaata aataaaaaca tagtgccaat aaaaggaacc 15721 cagggcccat attcttcgcc aatttgagtt ttactcacat ctcgaataaa ttcaagaaca 15781 tattcgaaga aattctgacc cccggtcgga atggtttgtg ggttccgaac agctatagtg 15841 gctgaaccta ataagatagc aattacaacc caagaagtaa taagtacttg gccatgtact 15901 tggaaacccc ctatttgcca atagaaatgt tggcctactt ccacaccgga tatatcgtat 15961 aaccccttta gagtattgat ggaacatgat agaacattca tattgccttg ccctctgaaa 16021 aaattgaact ttaaacaaaa ttttttgatt caaccatctc tttgtctact tgaatcggat 16081 attttgaata ccaactaaga tttagaatac taataaatca cataatatcc ccagctattt 16141 ttatctcttt tttgaaattc agaaatagta agcgattcca taagggattt ctgaagtaag 16201 ttatttatct tattatgtta ttattaatca aggatttctt atatagctag aacgaccctc 16261 acaaattgcg aatactaatt tgttaagaat taatcggatt gaggatatgg cgtcatcatt 16321 cgctggaatt gaaatatctg cgagatcggg gtcacaattt gtatcggtta aacaaattgt 16381 tggaattcct aaagtaatac actctcgcag ggccgtatat tcttcgtgct gatcaacgat 16441 gattacaata tcgggtaccc ctgtcatata tttaatcccg cccagatatg tttgcaagcg 16501 agataattgt cttttcaaca tagcagcatc tcttttcggg agacggttga gtctccctgt 16561 tttttgttcc attctcaagt ccctgaactt atgaagtctt gtttcggtag tggaccaatt 16621 cgttaacata ccgccaagcc attttttatt aacataatga caccgggccc ttattgcagc 16681 ccactctact gaatcagctg ctttattttt ggtaccaaca attaagaatt gttttcccct 16741 acttgctgcg tcaaaaacta aatcacaagc ttctgataaa aaacgagcag ttctagtaag 16801 atttgtaata tgaataccct tacgctttgc agaaatataa ggcgccattt taggattcca 16861 tttcctagta ccatgaccaa aatgaactcc tgcctccatc atctcttcca aatttatgtt 16921 ccaatatctt cttgtcattt ctctccacac cccccctttt ttttttattc tttttcaaaa 16981 aaaaaaaaag agacgaggaa ccctgaactg aaataaataa ttgttccgat ggaaccttct 17041 cttctaccgt agattggacg tagatacacg acccaaacca ttattctttt ctattcatta 17101 ttctttttat taccaaagca aataaccata ccaaatgcag atagcgaaag agatgaatcc 17161 gttgttagga atcattaaat cctataaacg attgttcggg tatatcgtgg aaattttttg 17221 aaagacaaga atcaaataat tttttgtggt ggaacaaaat atctctcatc tccccctcga 17281 atagattctt tttttttgtt tccaaaggaa tgttgttatg ttgttttgaa gggtgcacta 17341 atcccttgaa tccggtacca acgggtatca ccccccccaa aacaacgttc tctttcaggc 17401 ctttcaacca atcgatacga ccccggagag ccgcttttgc taaaactcga gcagtttctt 17461 gaaaactcgc ttcagatatg aaactttgag tattgagaga tgctcttgtt attcccaata 17521 agacgactcg gtaacagatc gcttcttcca aagcgcgccc cattcgttct gctcgcaaca 17581 atccaataag ttctccgggt gaaaaaacat tagacattcc atcttctgaa accaacactt 17641 ttgatgttat ttgacgtaca ataatttcta gatgcctatt atgaatctgc accccctggg 17701 agcgataaac ctgttggatt ttattaacca aagagattcg gctttgcgct atagttagct 17761 cagcaccaat caagaatccc caaggaattc caagaattct tgttatacat ttgttccaac 17821 cctcaatcct cttttctaga ttcatggata ttgaatcaac cgaacgcact tctaacacct 17881 gttctacttt tggaagacct tgtgttatat caccagatct cgatttttca tatataaatg 17941 taactaatgt atctccttcg taaagggttt ccccataatg gccatgaaca gttgctccgg 18001 gggtggccaa ataaggctta gctgatcgta tcactatcga atcaacttga acaagtataa 18061 cttgacccga tttgaggggc ggtccatttt tggctataca tacattttca caaataaact 18121 gtccaagact aattatttta gatgtctctt cacaataatt gtgatggaga aaataccaat 18181 tcaaattgaa tggatttaaa ataatgttac gacacggatc gggattaaaa atttttccat 18241 tttcatccat taaataatat ttaaatttaa tcacttgaaa agtctgtttt aaattgtcaa 18301 gttgcaaata gttagttact aagatctgat tatgagttat taaatggtaa gatgaataaa 18361 aattctcaat tggaagggat gttcctaaag ggcccaatga attcctaatt ggaattaggg 18421 gatctttttt aattgatttt tttatcacac tgtgatattt tacatccttg aatggcccca 18481 ttcgagaaca attggctgct gacaaaatta tcaacgactg acattcctta tttctattta 18541 acaacgtatg aatagttcct tgaggttggt taatagattg ttgaattttt gccttggaat 18601 aggaataaat ggaagaaaag gggttgatat tggtacaatc tgatccatta tcagagagca 18661 atcctgaccc cgacggatca ttcctttttc cgatatacga aataggggat ttcactaagt 18721 tgattcttag gaaatgtcga atcaaaccat ttgtccttat ttcaacaaaa gaagcacggg 18781 cttcttcgca agaagaactt tttttgtctt ggttccaatt taatactaaa caagtccgaa 18841 ctaattgaat acttgtgtca gaaattcctc gaatcggttt gccatttcca taaaggatat 18901 aattgacaat tcgaagttgc acattatccc tttcctgcaa tggatccggt ggaaaaaggg 18961 ttgctaaatt tataccgtcc gttatttcat atgtgacgac aggtcgaact aaaacaaaaa 19021 actttttctt gctaggtgta atccgttgga catagatcca atttttcact tttttggatt 19081 ccttggaatt tctttttcct gttcctggtg gtatcaaaac gccggtatgt cgggatatct 19141 tatctgtctc tccaggaaaa tggatatctc cagaaaagat tttaagttca attcgttttt 19201 tttttctctc cacccggacc aacccaccga ctcggcttct tagatttaag gtgatttgtg 19261 tatctacccc aacgatacta ttgttccgta ccattatgga agaagatccg ggcaagatat 19321 gcacctcttc aggaatgaaa aaaaatcgat ctactttcat ttggtatttt ggcctaaatt 19381 ccttgactcc tcgatactca agcaaatcct cttttttgat gactgaatgc gtttctacag 19441 tcccatattt aataatgccc gaactctttc ttctgtatcg aggatcatcg aaataagcaa 19501 gaatactatt tcgacggaaa ataccattta cggggatttc aatcgagata cctgaacagg 19561 gcattagttc attctcgagt tcttgaatcg agtgtagtgg aatgatgaat ttatttcttc 19621 gcctttttga caataaatca gaattctcgt ggagaatagg cgaatatacg agattatact 19681 gaccagcaca tataattcga ttaaggtctg aataatcagg aatcctatct tcttttttac 19741 cagaaaaatc gaactaaata atttctgcct cgcctgatcg ttggttactc gagaggttag 19801 aagtatatct tcgcttgcca gaaagaaaat gcgcattcat ttgatcctga tccttgtgga 19861 tcgaaaggta gactagactg gacctcgagg ccttcctaat aatatccata aatggcttgt 19921 ttttggtaat agatgaacat taccgtatgt aaattcgggt gcatgataga catcggtact 19981 ccagtgcatt tctccgtctg aatcagaata aatatgtttt cgaaccttct ctttaaaatt 20041 caaagtggat attcctgcgc gaatctcagc aattacttgt tctgattcta catattgatc 20101 gttttgaact aaaagcaaac ttttgggtgg aatattcaca ttatgtagaa tatcttcact 20161 ctcaatagtt acatacaagt ctatagaaca tagaaaggcg ggatgcccat gacgtgtacg 20221 tgtcggatga accaaatcct cattgaattt gatttttcca ttagatgggg ctcgcacatg 20281 ttctgcagta ccccccgtga atatctccgg tatgaaaagt tcttaatgtt aattgagtac 20341 ccggttctcc aatcgattga cctgcaataa tacctacagc ttccccaatt caaccaggtc 20401 gccatgagta ggactccggc cataacataa tcgacaaatc caagatgtac tcctacaagt 20461 aaagggagtt cgaatagaga ttggttgtgc ccgaaaggtt atgaatcgat ttacaagtcc 20521 aatgccaatg tcttgatttc tagtggcaat acatcgcgga cccatgtata tatcatctgc 20581 taatacacga ccaattaatg tttggataaa aatcctttcc ggcatcatcc cattccgagg 20641 actcacagaa ataccccggg cggtgcacaa tccgttcgac gtacaacaat gtgttgaact 20701 acttcaacaa gtctgcgagt gagatatcca gcgtctgatg ttcgtacagc agtatccaca 20761 atctttaggg ctcgtagcaa gaaatgatgt attctgttaa agagagtcct tcgcgtaaat 20821 tgctttgaat aggtaaatca atcatttgtc cttgaggatc tgacattaat cctctcatac 20881 ctactaattg atgtacctga gatgcatttc ctctagctcc cgagaaagac attatatgaa 20941 ctggattaaa agggtcagtc atcctaaaat taggattcat ttcttgtcgc aaatattcac 21001 ttgtagcata ccatatttca atggattgac gtaatttttc taccgcgtgt acattcccat 21061 aatgatggtg tttttccaaa atcaaacttt gttgttcagc atcttgaact agccatccct 21121 tagaaggtat tgttaaaaga tcatcaattc ctaatgaaat ggatgtagca gtagcttgtt 21181 ggaaccccag agtttttact tgatccagga tatgtgatgt atatgccatt ccgaagtgat 21241 ctattaatct actaataagt cgtttcatgg cagttccgtt tatcgcttta ttgtgaaaga 21301 ccagattggc ccgttctgcc ataagtacct ccatattccg ctgagtagaa ttcgacaatg 21361 ggtttgagtc ggtgattgta aaacttcctt ttatcgatct tgattcgcgt ataaattccg 21421 gaactatgga cctagctgaa ccggagagcc ccgaagtccc acgggtatca tagaattacg 21481 ttaggtacca gatgaatagg cccgagaaaa cccctgtata gcttcttcga tttctcgata 21541 aagagcaata tgaccaacag tggttcgaat gtatataaaa aggatttgtt tttttagact 21601 tcttactatt agatagtgtc cataaatctc ataaaaagta cctaaagatt catagtgaac 21661 ttcgatggga gtttctcttg aagcaataac gcgttgatct agtcgccacc ggagccacaa 21721 aggactatct aaattgattc gtttctgccg ataagcccca attgcatcat aggaattaga 21781 aaaaaacggt tctttcgtat acttatagtg actattgtca cttctttttt gattttgata 21841 gtttctgcga ttacatggat tatatctatt tacacaaata cctcgatgat ttccgctcgt 21901 taatacatag agtccaataa gcatatcttg cgttggtacg gaaatgggat ccccaatagc 21961 cggagacaaa agattcatat gagaaaacat aagtaaacgg gcctctactt gagcctccaa 22021 ggataaaggt acatgaacag ccatttgatc tccatcaaaa tcagcattga atcccttgca 22081 aactaatgga tgtaaacaaa tagcgcgccc ctccactaaa acgggctgga atgcctgtat 22141 gcctaatcta tgcaaagtgg gtgctctatt cagcaataca ggatgtccct gcataacttc 22201 ttgaaggatt tcccaaacaa tcggctcttt ttctcgaatt ttactcttag caactcctat 22261 gttcgaagca agatgttgtc taattagacc acgaattaca aatgtctgga aaagttctat 22321 tgctatttca cgaggcaatc cacatcgatg taatgaaagt gaaggaccca cgacaatgac 22381 agaacgtcct gaataatcga ctcgtttgcc aagcagagtc tcacgaaatc ttccctcttt 22441 gccttcaatt acatcagaaa atgacttgta aactttatta tgaccgtccc tcattggttg 22501 tccccggatt ccattatcaa gaagtgtatc cacggcttct tgtactaatt tctcctgaca 22561 cattactaat tctcctggcg tagatctact tgttgttaat agatcggtaa gggtattgtt 22621 ccgatagata actcttctat agagttcatt aatatctgag ctcattagtt tacccccatc 22681 tatctgaatg atcggtctca actcaggagg aagaactggt aatagacaca aaaccatcca 22741 ttctggctct atatttgttc gaataaaatg cttagccaat tccacgcgtc taaccaaaaa 22801 gtcctttctt cttccaactt ttcgatcttc ccattcattc cctgtgtgcc cttcttcccc 22861 caattcttcc cattctacca acgaattctc tataataatt cgtaaatcta gatcggctaa 22921 ttgttctcgg atagcacccg cgccagtaga gatttctcga ttgcgaaatg tatcgaaacc 22981 ctgggtagta aaaaaaagtg ggatgctgta tttccaagat tggatttcat attcgaataa 23041 acctcgtaat cgtaagaaag tgggcttttt agttatgggc ctagcaaaag aaaaattggg 23101 ataggattct ataggatctc ccccccttca aaatcggacg tgaaagtttc ctttcatccg 23161 gctcaagtag gtacaccaaa taaggaaagg agttctcgtt ttcaaactct agaaaatccc 23221 aaaataaaaa ggtctactcc ttactcaagt tcccagtgaa gacgaaacaa gatttcagtg 23281 attccgtctt ctattaattc tttattcaaa ttcaattcca acaaataaaa tagaaaattc 23341 ttgagtagtc tacttccctt tgaatgataa atcccttaac tcttaataat taaaggaata 23401 ccttggaacc cataagggat ttacttgtct atatattgtt ccattcgatc ttttaggtcc 23461 cgacttcacc tcgatggtta ggccaccacg cccttaaagt ctatacgcga tagatagact 23521 cctagaacca tgacatattt gcttacttga acataatttc tttccacgaa aagaaaggaa 23581 atgtttcatt ctacaaaata aaaagctttt tttacgatgg tacaaataga aattcctctt 23641 tatttgattt gttacgaaat cgaccataga tcaattccct ttttatttgg gagtattgac 23701 tacaccccaa ttctgagctt catgttactc tttccaagtg cacatgtcag gtccagggca 23761 tcccaattgg attgactggg atgacagttt ctccttccga gtctgtaaaa tcagaatttc 23821 gatcaaatca cacatcgcag tatactaggc cttctaattc tttaagaggt ttatctaaaa 23881 gattcgcaat ataactagga agacgtttta aataccacac atgggttact gggcatgcga 23941 gtttgatata gcccatttga taccttcgta tccgagaatc aacaaattcg accccgcatt 24001 gttcacaaaa tttcgggtct tctttttcat ctccgattac tcgataattt ccacaagcac 24061 aaattccgct ttttatagga ccaaaattct tcacaaaata atccatcttt tccggtttgt 24121 tagttttgta atgaaaagta tagggttttg ttacctctcc aactatctct ccattaggca 24181 ggattttagt ggcccaagca cttatttgtt gaggagaaac tgatccaatt cggagctgtt 24241 gatgtttata tcgatcgatc atagaagaaa aattattatt cattccgatt aagcttcctt 24301 cctattaatc tggaagttct tctcagatac aaggaaatga ttcagttcca gagctaaaga 24361 tcgtagttct cgaacgagca atcgaaaaga ttctggagca tcttcgggat taggtattgt 24421 tcccccaatg atcgtagtac caagtacttc ctggcgagct ctaatatgat ccgatttata 24481 agtaagcatc tcttgtaaaa tatgagcaac cccaaaccct tctagagccc aaacctccat 24541 ttctcctacc cgctgtcccc cctgtttggc tcttcctcta aggggttgtt gtgtaacaag 24601 cgcataatgt ccactggagc gcccatggat tttatcatca acttgatgaa ttaatttcaa 24661 gatataaggc tttcctatta taacgggttg ttcaaaagga ttccccgtcc ttccatcaaa 24721 tattctgctt tttcctggat attcgggttc aaatacccat ggattcgctg tttgcttact 24781 ggcttcatat aattcagaaa acacaagttt tctcgaagct tcttgttcat atctctcatc 24841 aaaaggtgct attcgataat gtctgtctag cagactccct gctaacccta gtgaacattc 24901 aaatatctgt cctacattca ttcgtgaagg tactcctaat gggttaaaga ccatatcaac 24961 ggatcttcca tcttgtaaat aaggcatatc ttgtctaggc aaaattttgg aaatgatacc 25021 tttatttccg tgtcttccag ctactttatc gcctactttg atttcacgtt tctgtaaaat 25081 atatacacga atcgtttcgg gattataact agaaccaccc ctcttctgga tccacctcac 25141 atcaataacc cgacccctgc cacctatagg tagttttaga caagtttctt ttgaagtaga 25201 tacctgaata ccaagtatag ctcgtaacaa tctatcttcc ggggcatacg acgattcttt 25261 cacgacctgg ggtgttaatt tacctactaa aatatcacct gtctctaccc aagatcccag 25321 catcacaatt ccatttttat ctaaattgcg gagtaaatgg gcttctaaat gcggtatttc 25381 attagttact ttttcagggc cttggcttgt cacatgagtc tgaatttcat atttccgtat 25441 gtgaaaagaa gtataaatat cttcatatac caaacgctcg ctaataagta ctgcatcttc 25501 agaattgtaa ccctcccacg gcatataagc tactaatacg tttttcccca aagcaagttc 25561 gccaccaacc gtagcagcac catccgctaa aatttgtccc tttttaatgc atttaccccg 25621 aggaacctgg agtttttgat gcatacaagt atttttattg gaacgttgat atataactaa 25681 tggaatgctt agaatatctc cattacctgc taaaagaatc ttgtcagtat tggtataaac 25741 gacccttccc tcgcgttcgg ctatagcaag agcccccgaa tctagagctg cttgtcgttc 25801 caacccagtt ccaacaatgc atttctcgga gcgagaaaga ggaactgctt gacgttgcat 25861 attagaactc attaaagctc gattcgcatc attatgttcg ataaaaggaa tgagggaagc 25921 tccaatagaa aaatattgaa aaggaaaaat acttcgaaga tgaacctgtt cccatgcaat 25981 agtcaagaat tcttgacggt atcgagctgg aacaacctgt tcttcctgaa tatcctgatt 26041 taaggctaaa gaatttcctg ccgctaccat atagtattca tctctacctg gtgataaata 26101 aagcatccgt accccggttg acctctcaga aatttcataa aaagggcttt ctagagatcc 26161 ccaatgacca atcctcgcat gaattgctaa ggatccaata agtccaacat tgattccttc 26221 agatgtgtca attgggcaaa tacgtccata gtgactagga tggatatctc gtatccgaaa 26281 actagcagtg cgccctgtca gtcctccagg gcccaaataa cttaattttc tcccatgaac 26341 tatttgtgtc aatggattag ttcgatccaa aacttgagat aatgggtgta aaccgaaaaa 26401 ggattcataa gtagttgtta atggagttga ggttaccaaa ttctgaggtg tcggtatcaa 26461 tttatgccga attgctccac atatagtccc ccgaaccaca ttttctaaac gaaccagagc 26521 caatccgaat tgatcttgta aaagatctgc tacagaacga atacgtttat ttttcaaatg 26581 attcatatcg tcaagtgcac ccattccaaa tttcagccca atcaaatgat cggcggctgc 26641 caatatatct cgtggtaaca aaaatgtatt gttctggggt atatcaaggt tcagtcttcg 26701 gttcatattt cgtcgaccaa tccttcctaa ttcacatctt tgttgaaaga atttcttttg 26761 taattcctta cataaggatt cagaaaatac cggatcgcca cctacacaag caaattgttg 26821 ataaaactcc aaaatggcat tttcttttga cccaattttt tttctctcct tatcactcag 26881 aaaagacaaa aaaatttcag gatagcaaac attctctaga atttctctta gattcaaacc 26941 catagctgat gatagaacta gaatagatat tttttgtttc ctacttacac gagcccatat 27001 ccttgctttt ctatcaattt ctaattctga tcttcctccc caatctgata ttatggtgcc 27061 ggtatagacc gaaattccgt tatggtccaa ttctgatcgg taataaatac cgggactttg 27121 caatatttga ttgatcacaa ttctatatat tccattgact atagaagttc ccagggaatt 27181 cattagagga atgtttccga taaaaattgt ttgttcttgc atatccctac tgtttttcca 27241 aattaatccc gcggatacat ataattcaga agaatatgtg agtgattcat acacagcatc 27301 tctttccttt atcaagggtt cgaccaattg atatgtttcc acaaataatt gaaattcaat 27361 ttcttgatct gtatcttcaa tttttggaaa cttataaagt tcttccgtca aaccttgatc 27421 aatgaaccta caaaatcctt caaattgtat ctgattaaat ccaggtattg tagatattcc 27481 ctcatttcca tccccgagca tttttaattt cccatttatc aaaaaatacc actattggtt 27541 cattcttcat ctaattagat agattagata aatgatctag caatgatggc atttctattt 27601 tgtttaccga atcacatgaa attttaccca actccatatc tggaatgtat gaaatacgta 27661 tgaacggagg aagaaagaga attttctact taaattgaat tggaatttat tggaattttc 27721 aacagataca aatggaaaga aattgataaa acatccctag aaacagactt ctgctactta 27781 gacttattaa ttaagttata gaattttgta tagaatatca aaacaaaaat gattccattt 27841 ctaccattat tatgataata cacattccaa cctgcttgaa taccagaaaa ataaatggat 27901 tcgacatttg atcttttcgc tgagataaag gcataaaaat aagaaagaat atatagaatt 27961 agaatcggtt ttttagcatt taaccccctt ttctgttatg gatttcgttg ctaaaaaaat 28021 gatttgtaga gaagagagag attttgttta cggatttttg aatagaatac gattgtgaag 28081 tgtataagaa aagaaggttt gtatggctta accacgtgtg gagatatcta taatatccgt 28141 ctttcttctc ttttattgtt ttattgtcgt tctctgttct attcggggca acccgggttg 28201 tgctctatga aaacagaatt tcaattttct attcaattca aaattcaaat tgaagtatga 28261 tacttttctg atatctgata attctctatc ggaacatata taaataatat ataccgtcta 28321 acaatttctc ttgggggttt acatatactc ataattgttg ttataattaa aattgagaag 28381 gattttttga ttgaaaaaat ccatactgat tagttatata tcaagttgta ttttcttatg 28441 tcattaggaa aacaaaattt ggagattcaa atccaagaat cattcatgca ttctaagtca 28501 atagttaatg gttccgattt tcagaaattt gaattttgga ttttgcgact gaaaatccac 28561 atttgatttt tcaatagaaa ggtaagagaa agctttgaac attatgaatt tggagatcga 28621 aattgaaagg atgaatcaaa cccaatcaaa agggaagaag gattaggatt tctttgactt 28681 ttaggaaaaa ttaaggaaaa cagaactcaa ggtgcaagta caataaaaaa gcagttcagt 28741 aatcctggaa agttttcatc tattttgtat ttgtagcatt ttggcgacat ggccgagtgg 28801 taaggcagag gactgcaaat ccttttttcc ccagttcaaa tccgggtgtc gcctgatcaa 28861 caaaaaactc gaaatctctt cttttcttct gttctgttga tataacccgc cgaatgattc 28921 cccagcagaa gcagagaaag cagactgttg atacttgttt gattctaaac atctggtctg 28981 ggggtttttc taaaaaattg taaatatctt tgcattgcat atttaggctt caaggaaata 29041 ttcgaatgct agaggggcta tcaagacttc gcaattacct tctactacaa atcaaaattt 29101 tctattatta atgcattgta taatgactgg accttgaatt agattggaga gcccgatagg 29161 aaatctaaat agttgtggaa gggggcggaa gatactttat tatatacgag gaactcacga 29221 aaatctctga gtgctcaagc atccaatcaa ttgaaatgag ggtcaacaaa aaaagaatag 29281 gacctattat tcctacatgt tccattagta acattccctt gagatgttac tgcagatttt 29341 gcttgtgttt aatctttccc gattagaaat cctataggaa tttcttataa aatgagcgaa 29401 tttattggat tggtttatta atagtcttcg ttctttttga ctctgcgcca ttgattccac 29461 tattattagt gaggaataac ggaacaattc ctttatattt atagagatag gggacataat 29521 tcatatggat atagtaagtc ttgcttgggc tgctttaatg gtagtcttta ctttttccct 29581 ttcactcgta gtgtggggaa gaagtggact ctaggggtcc tactaattga gttaaggaag 29641 caaactgtat caatatcaat tgctttcgag atcgttctgc aacacgtttt gaacaaaatc 29701 aaaatatctt cattttgaaa ttccattgga ctcgactgga gtaatgtatt ataggaatca 29761 tcctctttca atcaaagagc tatttcaacg attcccatgt ttgtagttcg aaaggaagag 29821 gatcccagga aatttattcg aacctaattc ttccgaaatt ttctattcca atcaacggcc 29881 tcttacaggt gatactgagg agggccggac ccttttttta tttctttctc tctttactgt 29941 tcaaagaaga ggtagttttg ttaagtgtat acgcactttg tatgagaaag aaaggatata 30001 aacatagtgg ttgtctaacg agatactatg cagaataaga tcttcagatg agtcacatat 30061 tgcgcattta ccgctttcga atttttgaaa ttggatttat gctttatcga cttatttcat 30121 atcatggttc aggcgttaaa aatcggtgag gtttactctt ccttttcgat gcccgtggaa 30181 ctactgtcaa tggtttactc aattacttct tgggaatgtt aaaaaaaaga ttactacgtg 30241 attttttgaa tctgcctata tctatcgctt ttccttcatt gatttgattc tttcaataga 30301 taccgagatt cagattggaa atcaaaaatc tagtaattca aactataaga cataagagta 30361 atttagattg atcagaacaa atagatatag caaataaatg gaattggatg ctatgtcaat 30421 cccatatatg gaattgatat tcacatatat caagataata ttgtagattg atctatagat 30481 ccatatcaaa agatccatat caaatgcagc ctctatcttt attttattcc agggggcagc 30541 tttataacta caatctaact aataaatagt atggtagaaa gaaatagatg aatctttctt 30601 tctaccatac tatctatcta ttagaatact gccgattcta gtccatacat tttcatttaa 30661 gacatgaaat tagaatcttt ttcattttat ttcgtcaatt ttggctaaga actcagaagt 30721 caagtttcat tcaaattagt taataattaa tcgttttgac tgactgtttt tacgtaaatg 30781 ataagtagaa aagcggtagg aactagaata aatagtgcag tagcaataaa tgcaagaata 30841 tttacttcca taatctcatc ggttttttac ttcgcaataa ctcgggattt aatcccatag 30901 agatgataaa tctttggcct gtaaattcaa tgaatgaata ttacctctcg atgatcttga 30961 atcggatcaa tatcatgaat aacaatatct gaactatcaa atcaattcgt cgtcgagaat 31021 tgaatagtat aacataggaa gttcttttat ccataccgcc ccaaacttgg attcctgacc 31081 caatccaaaa ttcctttatt tatttatcat tatcattttt tctcatctgt tctttttttc 31141 tctctaatct atctagttcc ttcttgtaca atcatctgat gaagtctcat caaatagctc 31201 ttccacttcc agtggtcaca catagttaca aacccaaaca aacaataaaa gctaaatgga 31261 aaaagaaagg agtttagaac taaactattt ttgacttgga agacaaagaa gtgtgataaa 31321 gatgagaccg tataaaatga atattcatca aattgactat tttccgattt gttctttcgt 31381 cgatgggggc cttaaaacaa aatgaaaaat cggaaaaatg attcattccc ctttctaaga 31441 ggagtaggat ctttcctttc ccctcctttc ttcgtagatt attagccccg ggacacctat 31501 accaaaagct cagtgtgcaa tttgcatgaa atctattttt caacttcaaa ctagtaagtg 31561 aggttccata aatccgtagc cagaaaaata aattgttttt ttttttgttt tttctgggaa 31621 agtattttct tatattaaat tttgtattgg acaagaaagg aattcccctt gtgtatgcgc 31681 gcctcaaaaa ggtatagtac tcgattccat tacatgcatc gggggcaatc gaaaaagcca 31741 gcatttcttg gaatactgac tataatgcta ccaataatcg tactaatcca accgcatatg 31801 tctttctcct accaaaagga aagaaaaaag aaataaggat ttcccctttg ctttgacaat 31861 gaaattctgc ccccggtccc cttcataaaa agggagagat ttattgatat atttattgga 31921 tccatcggga ctgacggggc tcgaacccgc agcttccgcc ttgacagggc ggtgctctga 31981 ccaattgaac tacaatccca gggaaatacg ggatctagca gaaaatttga ttctttttta 32041 tctccggatc gggtatttct gaagtacgaa gggggttata tcatctcatg gcggattggc 32101 gaatttttgg gccgagctgg atttgaacca gcgtagacat attgccaacg aatttacagt 32161 ccgtccccat taaccgctcg ggcatcgacc caagaagaat caattttaga cttattggta 32221 atccatgatc aacttccttt cgtagtaccc tacccccagg ggaattcgaa tccccgctgc 32281 ctccttgaaa gagagatgtc ctaaaccact agacgatggg ggcctgcttg accaaccgcc 32341 atcatactat gatcatagta tgatcagttt tttgaaattg tcaatataat cgaatgattc 32401 tatccgaggg atctttcccc ctttcagaat tgcatagaat ttttttattc gtcattgatg 32461 aattattcat tagaatcgcc attagaaatc tagtagtagt attttttttt ttttggaatt 32521 atttcaattg aatttctttc gattatttta gtttagatta tttagtattt agaattttct 32581 ttttttatta taaataaaaa aaaaattaat aaatacaaaa aatagaaata ataaggaaga 32641 gtaggatttt tgcagggaat gattggtccg tcagaaaagg aaaaaggtgt gaaattctat 32701 ttctttcact ttcatttgat tcattgttaa gacgagatat ccttatctcc ctcccaccaa 32761 gacaggaaat taacaaacga gaaatctagt aagcgggatc aagaagaaaa ttcttttttc 32821 tccaagaatt tagttcagga gacaagtaga atctcttcat tccatgattc gatgaaatat 32881 cttgaatttt atgttgaatt gctaggtgta tgtacatgta tcaatcaagt gaattttgtt 32941 ctggtgggat caattcaata aaagaaaaaa agcaattcga gtcggtcttg aaacaattca 33001 ttgcattttc tcctagactt cctaggtaaa tccattttat tattcaacaa tgagccacta 33061 gacactatgt atctactgca tgtacttatg catatatact tatgtttata atatatgtac 33121 ctatagatat tttatccaca tagtgaataa ttccggaatt aaatcaaaaa ggccctttta 33181 actcagtggt agagtaacgc catggtaagg cgtaagtcat cggttcaaat ccgataaggg 33241 gctttgtaaa actccaatct agtattcata tttgagggga gaattgtatt tttatttgta 33301 ataaaaaaag taactaactg gataatacat tatcattata cttaattatt atacttagtt 33361 ataaagttga acatttgttt agtcaatttt cattattatg aatttctgaa taatgaaaag 33421 tcacttcttg aactcaccga atattcctat tttccattat accaaccaaa tccattcgaa 33481 aggttagaaa tcaacaaaag aaaaagtaag tggacctgac ctattgaatc atgactatat 33541 ccgctattct gatattaaaa ttcgatagag atgaaattgg agcagttgat ttttttttaa 33601 tttcattttt ttgttttgga ttccacaaga atttgtcgat atttccgatt aaatcttctt 33661 gttactagat tttctatagg aaaaattata ggaataaatt gttattcctt tcctctacag 33721 agaaaccttt cttccaagtc acaccataag agccatttat tatctttctt tgattccaga 33781 tcaaagatta atttcatcat taatttctat ctagattata tatctatatt atattaagta 33841 gattgtagat ttcgatgtat atctatcaga tcgtggcttc atgtaccaaa tatttcaata 33901 tcgttgcatc cggtattttt gttttgttcc aacagtgtga tgaagaatag atccgagaaa 33961 gagactttca ttttcagtct cttatttatt ttatttttat tgaattttcg attttctaaa 34021 aggaaaatct aaaaggaaaa atagtagatt atctcttttt ctaacagata aaagaatcta 34081 aaaataaata ttcgatcgaa ctgtcttttt tccttcgatc cgtggaaaga tatactctgg 34141 ggttttagat ttatttatat gaagtatgaa ggaaagggat cgcttggtcc ttgaagagtt 34201 ctttcaaaac aaaggattga ttgaattgtc ttattaggac aattaatggt tcatatgctt 34261 agtcagaagg aataatccaa tggagttcat ggatttacct aggtcagttt atgggctaat 34321 caataaagca tttttatctt cgaaacccat tggaaagggc agtgcaagag aaatcataca 34381 aaaatgatcg aatcttcgga cgccccgaaa aagatatgag gtgctcggaa atggtcgaag 34441 tagttgaata ggaggatcac tatgactata gcccttggta agtttaccaa agacgaaaat 34501 gatttatttg atattatgga tgactggtta cggagggacc gtttcgtttt tgtaggctgg 34561 tccggtctat tgctctttcc ttgtgcctat ttcgctgtag ggggttggtt cacaggtaca 34621 acctttgtaa cttcatggta tacccatgga ttggccagtt cttatttgga aggctgcaat 34681 ttcttaactg ccgcggtttc tactcctgct aatagtttag cacattcgtt gttgttacta 34741 tggggtcctg aagcacaagg agattttact cgttggtgtc aattgggggg tctgtggact 34801 tttgttgctc tccatggagc ttttggccta ataggtttca tgttacgtca attcgagctt 34861 gctcgatctg ttcaattgag accttataat gcaatcgcat tctctggtcc aattgctgtt 34921 tttgtttctg tatttctgat ttatccactg ggtcagtctg gttggttctt tgcacctagt 34981 tttggtgtag cagctatatt tcgattcatc ctcttttttc aagggtttca taattggacg 35041 ttgaacccat ttcatatgat gggagttgcc ggtgtattgg gcgctgcttt gctatgcgcc 35101 attcatggtg ctaccgtaga aaatacttta tttgaagacg gtgatggtgc aaatacattc 35161 cgtgctttta acccaactca agccgaagaa acttattcaa tggtcaccgc taaccgcttt 35221 tggtcccaaa tctttggggt tgctttttcc aataaacgtt ggttacattt ctttatgtta 35281 tttgtaccag taaccggttt atggatgagt gctcttggag tagtcggtct agccctgaac 35341 ctacgtgcct atgacttcgt ttctcaggaa attcgcgcag cggaagatcc tgaatttgag 35401 actttctaca ccaaaaatat tctcttaaac gaaggtattc gcgcttggat ggcggctcaa 35461 gatcagcctc atgaaaacct tatattccct gaggaggttc taccacgtgg aaacgctctt 35521 taatggaact ttagccttag ctggtcgtga ccaagaaacc actggtttcg cttggtgggc 35581 cgggaatgcc cgacttatca atttatccgg taaactacta ggggctcatg tagcccatgc 35641 tggattaatc gtattctggg ccggagcaat gaacctattt gaagtggccc atttcgtacc 35701 agagaagcct atgtatgaac aaggattaat tttacttccc cacctagcta ctctaggttg 35761 gggggtaggc cctgggggag aagttataga cacctttcca tactttgtat ctggagtact 35821 tcatttaatt tcttctgcag tattgggctt tggcggcatt tatcatgcac ttctgggacc 35881 tgagacactt gaagaatctt ttcccttctt tggttatgtc tggaaagatc gaaataaaat 35941 gaccacaatt ttaggtattc acttaatctt gttaggtcta ggtgcttttc ttctagtatt 36001 caaggctctt tattttgggg gcgtatatga tacctgggct ccgggagggg gagatgtaag 36061 aaaaattacc aacttgaccc ttagcccgag tatcatattt ggttatttac taaaatcccc 36121 ttttggaggg gaaggatgga ttgttagtgt ggacgattta gaagatataa tcggaggaca 36181 tgtatggtta ggttccattt gtatacttgg tggaatctgg catatcttaa ccaaaccctt 36241 cgcatgggct cgacgcgcac ttgtatggtc tggagaggct tacttatctt atagtttagg 36301 ggctttatcc gtctttggtt tcattgcttg ttgttttgtc tggttcaata ataccgctta 36361 tcctagtgaa ttttacggac ctactggacc agaagcttct caagctcaag catttacttt 36421 tctagttaga gaccaacgtc ttggggctaa cgtgggatcc gctcaaggac ctactggttt 36481 aggtaaatat ctaatgcgtt ccccgactgg agaagtcatt tttggaggag aaactatgcg 36541 tttttgggat ctgcgtgctc catggttaga gcctctaagg ggtccaaatg ggttagactt 36601 gagtaggttg aaaaaagaca tacaaccttg gcaggaacgg cgttccgcag aatatatgac 36661 tcatgctcct ttaggttctt taaattccgt gggtggtgta gctaccgaga tcaatgcagt 36721 caattatgtc tctcctagaa gttggttagc tacctctcat tttgttctag gattcttctt 36781 cttcgtaggt catttgtggc acgcgggaag ggctcgtgca gctgcagcag gatttgaaaa 36841 aggaattgat cgtgactttg aacctgttct ttccatgacc cctcttaatt gagatgagac 36901 aggagatcca atgcttgaat gaagtaaaaa tcactttgat tcaatcatac atcttggaat 36961 cagcctaagt attccttttt tgtattcctt ttttcttttt ttttttcaat tcattttatc 37021 taatttattt ttctggcttg gctaggtggg atagccgagc cattcccttt tctttcggat 37081 agcaggttgg gcaaaaccac taaagaaaaa aatctattca attagcaaaa aaggagagag 37141 agggattcga accctcgata gttctttgtt aaaactatac cggttttcaa gaccggggct 37201 atcaaccgct cagccatctc tccgaaagac tatttttatt ttattcctcc gaatagaaca 37261 tggccatagg ggtggatacc cccactatct gtactatctg taaaaagatc tcaggtgcga 37321 atccaccggt cgatctatct atccgtatat agatatatga tctagcatgc ccatttgtga 37381 aataaaaaat aaaattccat ttccccccac tccatgtacg aataaagtgc gaaaggggga 37441 gtagtaataa gtcatataga atcaatggat tcatgataaa gtaaaatccc tcgatgacat 37501 attttatcac aattaatatt ttttggctga tagagggatc aaatggtata tagttcattt 37561 gttggtagct tggaggatta aaagcatgac tcttgctttc caattggctg tttttgcatt 37621 aattgctact tcattaatct tattgattag cgtacccgtt gtatttgctt ctcctgatgg 37681 ctggtcaagt aacaaaaatg ttgtattttc tggtacatcc ttatggattg gattagtctt 37741 tctggtgggt atccttaatt ctctcatctc ttgaacctat tcgtcgcaga cccaaaacca 37801 aaatgacccc cctaattttt ctcggttgtg agacacatta aattggaatc taagtcccca 37861 aagaaaacgc aaatcaaata aagaaaacaa aaaaattaga ggggggtcaa acttcttgaa 37921 taaaaagaat acaattaaaa aaataattgg aatcgttccg aagagaatat gtgtcccggc 37981 actgcacaaa aaagatccgg ttatatatca tatatgtggg tacatattgt gtatcaagaa 38041 caaaaaaatg cggatatggt cgaatggtaa aatttctctt tgccaaggag aagatgcggg 38101 ttcgattccc gctatccgcc caagatccaa gataaagtaa ttttattact atttatttat 38161 tatttaattt cataaatagc attaaatata tccttaaatt aaggatttgg tatagttggc 38221 cgtgatagtg tagtgattct atccctcccc tacgttttct ttttccttcc acccccaaaa 38281 agcgaaaggc gggaattaat tactagttaa cagagtcaac cctaaaatag tttggcaaaa 38341 caagatgttg cggagacagg atttgaaccc gtgacctcaa ggttatgagc cttgcgagct 38401 accaaactgc tctaccccgc gccgaagata agaactgaaa actaatagat aaacaaggat 38461 taaatgcgcc cctccaccct atctgtacaa atagaatagc ccatttatac agaatggtaa 38521 aggggcttct atgatcatcg accatagaaa tagaaatgaa gcgttaatcc ttaccaactt 38581 gatcttgttg ctcctggcaa caaacatgca tgaaccattt cacgaagtat gtgtccggat 38641 agtccaaagt ctcgatagtt agctctcggc cttccggtca aaaaacaacg tcgatgaagg 38701 cgtgtaggtg cactattccg tggtggggat tgtaactttc cataaatttc ccatttgtca 38761 ctcaacgacg gaaccttgct tatttctttc tttgaggatc gacgaatcga atgatatttc 38821 tgttccaatt tttgcctctt cttctccctc tgaatcaaac ttttccttgc cataatggtt 38881 gaattcctat tagtatccat gatacaagtc gaatcctaga tgtagaaata gaagaaggtg 38941 gaccccctct ccgtcgaaag aaatgagatt atcgcagata cacacattaa aaatattaac 39001 caaatttgcc cgacgtagag gcaatcaaga aagccgcata agtgaatata taacctacag 39061 aaaagtgagc taatccaacc aatcttgctt gtacaatgga aagggccact ggtttatctc 39121 tccagcgaat caaattggcc aaaggtgtgc gttcatgagc ccatgctaaa gtttcaatca 39181 attcctgcca atatccacgc caagaaatta agaacataaa tccagtagcc caaacaagat 39241 gtccaaataa gaacatccat gcccaaaccg ataaactatt cataccaaaa ggattatatc 39301 cgttgataag ttgtgaagag tttaaccata aataatccct taaccagccc atcaaataag 39361 tggaagattc attaaactgt gaaacgttac cctgccataa tgtgatgtgc ttccaatgcc 39421 aataaaaagt aacccatcca atagtattta acatccaaaa aactgccaaa taaaacgcgt 39481 cccatgccga aatatcacaa gtaccgcctc gtcctgggcc atcgcacgga aaactataac 39541 cgaaatcctt tttatctggc attaacttgg aaccacgtgc atctaaagca ccttttacta 39601 agatcaatgt agttgtatgt aaaccaagag caatagcatg atgaaccaaa aagtctccag 39661 gacctattgt taaaaataat gaattactat tttcattaac agcatttaac caacccggca 39721 accagatgct tcgacccgca ttgaatgctg gaccactcgt tgaagataaa agtacatcga 39781 acccatatga agttttacca tgagcggatt gtatccattg agcaaatata ggttcaatca 39841 agatttgctt ctccggagtg ccaaaggcaa gcatgacatc attatgaaca taaagtccca 39901 gggtatggaa tcccagaaag aggctggccc aacttaaatg agatatgata gcttctttat 39961 gctctaacat tcttgccaat acattatctt cattttgctc cggattgtaa tctctaatga 40021 aaaatatagc tccatgagca aaagctcctg tcatgatgaa tcctgcgata tattggtggt 40081 gggtatataa tgcagcttga gtagtaaagt cttgtgctat gaatgcataa gcaggtaaag 40141 agtacatgtg ttgagctacc aaagaagtaa taacccctaa agaagctaga gcaaggccta 40201 attgaaaatg aagcgaatta ttgattgtgt cataaagacc cttatgtcca cgccccaatc 40261 gtcccccggg gggaatatgt gcatctaaaa ggtctttcat actgtgccca atcccgaaat 40321 tggttctata catatgacca gcaacgagaa aaataaatgc aatagctaaa tggtgatggg 40381 caatatcagt cagccataaa ctttgcgttt gtggatggaa tcccccgaga agagttagaa 40441 tggcagttcc cgccccttgg gcggtaccaa ataaatgact acttgaatcg gggttttgag 40501 cataaagatt ccattgacct gtaaaaagtg ggcctaaccc ttggggatgc ggtaatacat 40561 ctaagaaatt attccaccga acgtactccc ctctggatgc aggaatagca acatgaacta 40621 aatgccctgt ccaagccaag gaacttacgc caaagagtcc tgacaaatga tgattcagac 40681 gagattcggc atttttgaac caggaaacgc tcggtttcca tttcggttgt aggtgtaacc 40741 aacctgctat taaggatatg gcagaaagaa ataatagaaa aagagcacca gtataaagat 40801 cttcattagt gcgtaaaccg attgtatacc accactgata aacaccagaa taagcgatat 40861 tcactgggcc aagagcaccc cctcgagtaa aagcttccac ggccggttga ccaaaatgag 40921 gatcccaaat tgcatgagca ataggtctta catgtaaagg gtcctgtacc cacgactcaa 40981 aatttccttg ccaagctaca tgaaacagat ttccggaagt ccacagaaaa attattgcta 41041 attgaccaaa gtgagaagca aaaatattct gataaagacg ttcctcagta atatcatcat 41101 gactctcgaa gtcatgtgcg gtagcaatac caaaccaaat acgacgagta gtggggtcct 41161 gagctaagcc ttggctaaac cttggaaatc gtaatgccat aatgcttttc aaatcctcct 41221 agccattatc ctactgcaat aattcttgct aagaagaatg cccatgttgt ggcaattcca 41281 cccagaaggt aatgggttac tcctacagca cgtccttgta taatgctcaa ggctctcggc 41341 tgagtagcag gagcaacttt taatttatta tgagcccaaa cgatggattc aataagttct 41401 tgccaataac cacgtccact gaatagaaac attaaactaa aagcccagac aaaatgagca 41461 cctaggaaaa aaaggccata tgcagataat gaagaaccat aagactgaat tacctgggat 41521 gcctgtgccc ataagaaatc gcggagccac ccattaatag taatagaact ttgcgcaaag 41581 tttcctcccg tgatatgagt tactacccct tgatcactta cactgcccca aacatctgac 41641 tgcattttcc aactgaaatg gaatattact accgaaattg cattgtacat ccagaatagt 41701 cctaagaaga catgatccca ggccgatact tgacatgtac cccctcttcc aggtccatca 41761 caaggaaaac gaaaaccaag gtttgcttta tccggtgtca aacgggaact gcgagcaaat 41821 agaacacctt tcaagagtat cagtgccgtc acatgaatcg taaatgcatg aatgtgatgt 41881 accaagaaat ccgcggttcc taatggaata ggcaacaaag ccaccttgcc acccactgcc 41941 actaaatcac caccccccca agttaaactg gtacttgctg ttgcaccagg agccgttgca 42001 ccaggtgcta aagcatgggt gttttgtatc cattgagcaa aaacgggttg taattgtata 42061 gcggtatctg aaaacatatc ttgaggacgc cctaaagcgc tcatggtatc attatgaata 42121 tacaaaccaa aactgtgaaa gcctagaaat atacatgccc agttgagatg ggatatgatt 42181 gcatcacgat gtctaaggac acgatctaat agatcgttgt accgagtagt tggatcataa 42241 tctcttacca taaaaatggc tgcatgcgcg gcagcaccaa ctatgagaaa tccaccaatc 42301 cacatgtgat gtgtgaacaa tgacagttgt gtaccatagt cagtagctag atacggataa 42361 gggggcatgg aatacatatg gtgagctaca acaatggtta aagagcctaa catagctaag 42421 ttaagagata attgagcatg ccatgacgtt gttaggatct catataggcc tttatggccc 42481 tgacctgtaa atggaccttt atgagcttct aaaatatctt ttagtccatg accaataccc 42541 cagttggtcc tatacatgtg acccgctatc aggaaaagaa ttgcaatagc taaatggtga 42601 tgggcaatat cagtcagcca cagaccccca gttactggat ctaatcctcc acgaaaagta 42661 agaaagtccg catattttga ccaattcaag gtgaaaaatg gggttgctcc ctcggcaaaa 42721 ctgggataaa gttgagccaa aagatctcga ttcaagataa attcatgagg aagtggtatc 42781 tctttaggat ctactccagc gtttagaaat tggttaatcg gtaaagatac atgtacttga 42841 tgccccgccc aagagagaga cccaagtcct agtagccctg ccaaatggtg attcagcata 42901 gattctacat cttgaaacca agccaatttt ggcgccgctt tatgataatg aaaccaacca 42961 gcaaaaagca ttaacgctgc aaagaccaat gccccaattg ctgtacaata gagttgtaat 43021 tcactagtta ttccagatgc tcgccaaatc tgaaaaaaac cagaggttat ttgtattcct 43081 cggaaacccc cgcctacgtc accatttaat atttcttggc ccactattgg ccaaaccacc 43141 tgggcactag gcccaatgtg agttggatca cttagccacg cttcataatt agaaaaacga 43201 gcaccgtgga aatacatgcc gctcagccaa agaaagatga tggagagttg accgaaatgt 43261 gcactaaata cttttcgaga gatctcctcc aaatcactgg tatggctatc gaaatcgtga 43321 gcatcagcat gtaggttcca gatccaagtg gtagtatcag gccctttagc tattgttctt 43381 gagaaatgac ccggtctggc ccattcctcg aacgaagttt ttacgggatc cctatctacc 43441 aaaattttaa cttctggttc cggcgaacga ataatcattg agtcctcctc tttccggaca 43501 acacatacaa agagacccgc caacagtcaa ataattagtg aaccttagag atagagagat 43561 atttctataa ttagttcgtt tctcttctat ttttctatct cccatctatc tattttcttt 43621 agttatttac tagagcaatt atgatctgga agtcgatccg gggcaagtgt tcggatctat 43681 tatgacatag ccttgaggcg ctcaacggac cttttaacct tctaaaaacc tttttgggct 43741 ttggattgat ccaaaaacga cttttttgtg caacctagtg tatattcata gaagttatta 43801 gatggagctc tttaattttt tacctagaag attttaatta ctctattcca aatcacgcga 43861 gtagccatta gacattacta agagacatcc ccgctatata tatttagtga ttcgagggtt 43921 tattttatta gttttaataa taagaatttt gtttaattta atataataaa caaagtctat 43981 tttgtactct atctgtgtat ccttttttat tcctaaaaaa tagcagatga aatagaaggc 44041 ttagaaggga gataatgaaa ttatgtgatt gggtcttcca aaagcaaagg aataatccgt 44101 tttttagtta actgatctga tgggtccaac aaacaataaa ttataacaaa tatctaaatt 44161 ctaaataaaa aaatcaaaaa taatagacta agattctaaa taaaggataa taaataaacg 44221 ggatcttctt ttattcgaaa cgtctcgtga tcttcaacca attatgcgct tcaatataat 44281 taccgggagt aagcgctata gcctgtttcc aatactcagc ggcttgatcg aaccaagcct 44341 ctgcaatttc agaatctccc tgttgaatgg cctgttctcc ccggccggaa taggtagttc 44401 aattccttcc cttagaaccg tacttgagaa tttcttacct catacggctc agcagtcaat 44461 tcttttggtg tcccattttg atctatacca tatctaataa aatctaatga gatttctcat 44521 ggatctatcc cagttttagg gttaaccaaa agccaaatag gttaattaca tgagtttcaa 44581 actgaaattt ggatgaataa tccgtttatt tagttttatc ttttttccca ccttcagaag 44641 aataaagcat aggcatttct actagtgtta gaattttatg aaaggtaact atctcggttt 44701 catagataaa tttatataga atctttgaaa aagactttct ttcataagaa agaaaatact 44761 tactatcttt gggatctgat cctacaccgc tgctcaagac tttagtggat cgactctatt 44821 acataagtta attcctaatt tttatttcac atcatgagat aagtatttct tccatcatga 44881 cataagtacg cagttattat tgtatcggcc caaaacctcg ctaattgatc tttacggtgc 44941 ttcctctatc tctatcaatt aaagccttat atccatagaa aaaagttgct aggcattttt 45001 attttttcct attttgactt ctatgaagtt tctttctttg ctacagctga taaaaatcgt 45061 tgttttagac gatgcatatg tagaaagcct atttggttct actagttact ttactagatt 45121 tttctttttt tttttttttt tctttctata gtggagatag tcgcacgtaa tgacagatca 45181 cggccatatt attaaaagct tgtggtaaga atgggtttcg ttctagtgct cgaaaataat 45241 attccaaagc tttcgtatgt tctccattac ttgtgtggat aagccctata ttatagagta 45301 tataacttcg atcataggga tcaatttcta gtcgcatagc ttcataataa ttctgcaaag 45361 cttccgcgta atttccttcg gattgagccg acatccgtta cggtcgtcat tcaattgaaa 45421 gaatctccgt tccagaaccg tacgtgagat tttcacctca tacggctcct cccttatgtg 45481 cataatgaga ataatacata gaatcaaaaa agattcaacg atgaaaatat tctcattatg 45541 aactcagcag ggctagtgtt tttacaagaa atctctagcc aaccttcctg caagagattc 45601 tttcttaaca tcaagcctat tgggactaga tagaaatgat aagataactc caacaatttc 45661 tttgttttta acgcctccta atttccagga attagtcact tcaatagcct tcgatggtta 45721 tacgggtatc caaaggacga acgagatgga tgtttgttgt cccaaccatt cttttagtcc 45781 caagcccgct aaggaaaggg ctgacttaga acaaagtttt cgtgttgttg attcctaggt 45841 gtagtgcttc ttcccctctg ctgcctatta gcgctagtag agtaggattg acccgtaata 45901 cagaacctct aggcgtaacc tttcgcttaa tactagaatc gagaatcgaa acatagcatc 45961 tgaggttgca ttaatcgagg atacacgaca gaaggaattg ttctatttcc aaacttcacc 46021 ttcaaaaagc gtagattttt tcaaaaattt tctcgaatca cgtgtttttc tcctcgtaag 46081 actgagagaa atgactaaat atgaaataaa aaaaaaaaaa gaatcaaatc gcaccatctc 46141 tgtaataggt aaatgcctct ttttctcctg aagttgtcgg aattactcgt aataagatat 46201 tggctacaat tgaaaaggtc ttatcaataa aatttccatt tatccgtgat ctaggcatag 46261 gtagcaatcc attctagaat tcttctcatt acctctcatg ggaaaaagat cccacaaaga 46321 aaagaattgt atagtacgaa ataacataaa aacttctttt ttttttaaga aaaaaacaaa 46381 agatatgaat cctctattcc aattgttcct ttttgacagg aatcgataag aaataagaaa 46441 tatttcaagg cgattcgatt tcatactaat gtagtagtat aggaactatt ccgatttcgg 46501 tgaagttaca aattcgaaga actcgagaaa ttttgattga atcatgatac aaattacaaa 46561 gaagaaaaaa gaccgaataa tcattctatg atgaaaatag aataactgcc aattttgtgt 46621 acataacggg tatacactat acaatcaaat ctaaattttt tttatgaatt tctattctaa 46681 tagaggggta ggtgtttgtt gttgagaact ccaaaaccga aaagtaattt gaaaattttt 46741 ctggtatgga atcatagtct atataattag aattatgatt taagagtatc cattaactat 46801 agtctaaaag atatagacca tcaatcagtt gattcgttct aattcattga attaatccgt 46861 tataaaatat cagaaaaaga aaaagaaggg aacgttgttt tgcaaacatg aatcgaattt 46921 tttttttcac aatttttacg caaaattgta tctttatccc ggagcctcga aggaaagaaa 46981 aatcgttctt tgctttgact ttgatgaaaa attttcagtt aaaatggatt gatcatacct 47041 atccaataat ggaatatgga ttatgactga ctcgctattc actcggtttt tgggtcataa 47101 tcgttatgta ggagagatgg ccgagtggtt gaaggcgtag cattggaact gctatgtagg 47161 cttttgttta ccgagggttc gaatccctct ctttccgtac cttcgcttaa ttcaccaatt 47221 ttactaacaa caagggctca aatagcaatg gataccatta ttccaacagc tagacccttc 47281 tttgatctaa agatatagat tctcaattcc taattgctgt gacgcgtaaa atagaatact 47341 aaaaaataat aataatcaaa atactggaaa gaaaagagta gacaaggaat gaaaatagat 47401 ccttggtcta tgatacaaaa atgggggaaa tccagatcaa actcggattt atcttactta 47461 accttaggtt aatttacttc gcctaaaggg aagaaaattt tccgaaccct cggtttcagt 47521 ctgaggttta agtctgacga gaataatatt ctacgactag caattcattt attttcaaac 47581 cgacccattt actatctatt atttgattga ctaatccttt atattggaat gggtgaaggg 47641 tcaaatggtt tggtaattcc tcatgagggg atgaatcgag agaaatttga atcagagctc 47701 tggatttttg ttcatccttt gccgtaataa tatctcgggg tttgcagcga taactcggta 47761 tatctactat acgaccatta actaaaatat gtcgatggtt aactaattga cgggctgcgg 47821 gaatagttga agccataccc aatcgaaaaa ggatgttatc caaacgcatt tcaagtaatt 47881 gtagtaaaac ttgacctgtt gaccccttgg cttttctggc gatacgaacg tatttaagta 47941 attgtcgttc tgtaagacca taatgaaaac gcaatttttg tttttcttct agacgaatac 48001 gatattgaga ttttttcccg gaacgcgatt ggtttctaag atcacttccg ttcctaggct 48061 ttttattagt tagtcctggt aaagccccca ggcggcgtat ttttttgaaa cgaggtcctc 48121 ggtaacgcga cataaagact ccttattctt atttcttatt tagtatttcg aattaattct 48181 tatttctatt tattttattt tttattgaat tttattttac agaataaacc taaactaaaa 48241 ctaaactgaa tctaaatgaa gcgaagttta ctgaaatagt gtacttgtac tattactata 48301 aagaaaagaa gaatgggatg aattggataa atatacagac ccccttctat tatatatata 48361 atcctttccc gacataattg gaagttccta taataaattg atagcttttg gaaaaggaag 48421 aaggcgctat ttcaatattc tttgatttca aaggaacatt atcaatcatc taaaaaatgg 48481 aataaaaaaa aaagaatagg gaaaagccgg ctatcggaat cgaaccgatg accatcgcat 48541 tacaaatgcg atgctctaac ctctgagcta agcgggccca cataacagaa atcttatatg 48601 catagtaatt gactaaacta ttggaattgg aatcttagtt attaactatt caatattata 48661 ttgaatattc tagaacataa ggattaatat agcgatatag aatttcgatt tatcacaatt 48721 ctaataacaa ttctaatact aatattatta aatagtgatt gtaaatattg ttaatattct 48781 ttttttttca ttttccattt gaatggtaaa tgttcttttt catttctttt tttgtcattt 48841 gaaatccttt tgatttttta ttacagttct atattttatt ctatatcata tatatctctc 48901 attctatatt tatttcaaat tctaattgtt taatggaatg gttagttata actaatgaga 48961 cattcctccg ctttcaggcg aaagtgaaga taaaaaaaaa gaatcgaccg ttcaagtatt 49021 ccaaattgaa tggcaaaatg gcaggaagag agacatatag atggggtata tatccatcta 49081 tattgaattg cggattccga aatgataaaa tcatttttga ttggacaaaa aaaggtctcc 49141 tatagaagat agttaagaaa atcaaagagg agaaaacacg ttttcgagat aggaatcggt 49201 atctaatgaa ttcaatggtt ccagtataaa tgaaagaaaa agaaaaagga atgacatcac 49261 aacgagatcc taatctcaaa aagaaagggg gatatggcga aatcggtaga cgctacggac 49321 ttaattggat tgagccttgg tatggaaact tactaagtga tcactttcaa attcagagaa 49381 accctggaat taacaaaaat gggcaatcct gagccaaatc ctgttttccg aaaacaaaca 49441 aaggttcaga aaaaaaggat aggtgcagag actcaatgga agctattcta acaaatggag 49501 ttaaatgcgt tggtagagga atctttacat cgaaacttca gaaagaaaaa gaatgaagtg 49561 aaggataaac gtatatacat acgtattgaa tactatatca aaatcaaatg attaatgatg 49621 acccgaatct gtattttttc tataaaaaat agaagaattg gtgtgaatcg attctacatt 49681 gaagaaagaa tcgaatattc attgatcaaa ccattcactc catagtctga tagatctttt 49741 gaagaactga ttaatcggac gagaataaag atagagtccc gttctacatg tcaataccgg 49801 caacaatgaa atttatcgta agaggaaaat ccgtcgactt taaaaatcgt gagggttcaa 49861 gtccctctat ccccaaaaag actatttcac tccccaacta tttatccgac cccctttcct 49921 tagcggttcc aaattcctta tctttctcat tcactctatt cttttagaaa tggatttgag 49981 cgtaaatggc tttctcttat cacaagtctt gtgatatata tgatacacat agaaatgaac 50041 gtctttgagc aaggaatccc tagttgaatg attccctatc aatatcatta ctcatactga 50101 aacttacaaa gtcatctttt tgaagatcga agaaattccc cggctttgag aaaattttta 50161 atctactttt gtccttgtaa ttgacataga ccccagttct ctaataaaat gaggatacta 50221 cattgggaat agccgggata gctcagttgg tagagcagag gactgaaaat cctcgtgtca 50281 ccagttcaaa tctggttcct ggcacatgat taatttgtat gggtctctct tccctcgaat 50341 taatttctaa ttaattgata tgaatcaaca tacatattct tttagagtct agattagaat 50401 aatagcttta tccagtttgg cgagatatac cccatctatg ttctagatgg gtagagtttc 50461 ttagataaag tatctaaaag aattggattc tatctcctct tttttttctc ctctcgttca 50521 accgaatttg aatacgtaat acatattcga aaggttcaat tggttaattg ttgaaaggct 50581 caaaagtcga atccgaatct aggggggttg aaatagacaa gattcagctc agatccaaag 50641 aaatagaatc cgatattctc tcatttcttt gtcttttctt tcatattcga tttcttcatt 50701 ccggatttct ccattccttc ctatatgcct ttctagaacc catctaagta atgtgcgcag 50761 tacaaagttc atgatgcaga actcatttgg ttcatcctat tggtgtgacc catccgaaat 50821 aagtatcttc caaataaatg tgagaattcc aatgaatccc taattgtctt tttttgttag 50881 cctatcgata attccctaaa ttagacctgc ttaatctaga acagaacgtg caatccttga 50941 atatctgaaa ttgtctaagt ggaaatagct ttcttatcat tcaatgagca tcttgtattt 51001 cataaaaatt gggggcaata taatccttac gtaagggcca tcctatccaa ctttcaggca 51061 ttaagatacg tttcaagcgt ggatgattat cataagagat tcccaacata tcatatgatt 51121 ctcgttcttg aaaatccaca cttttccaaa cccagaaaac agacggaatt ctaggattcc 51181 tcctggaggc aaatactttt atgcatacct cctctggttg atccacacca tcctctattc 51241 tcgtaagatg atacacacta gctaacagcc cgccaggcgc tacatcatag gcacattgag 51301 agcggagata gttgtaccca tatacataaa aaatgacagc aatggaatgc caatcctcgg 51361 gctttatttg taaagtctct attccttggt aatcaaagcc caaagatcta tgaattagcc 51421 catgcttgac tagccaagca gacaaacgac cctgcatctt ttttatctct cccgcatttt 51481 tatttatata agtatttcac atttacgatg aaatttctga aaattgaccc accacttttt 51541 attctggaca aaggaatcct gtctaattca ctaattcggg ggaagatact gaatttttgt 51601 atttgaaaaa gatttccgta gggatctctg aagtagatgg gggttgataa agaactcttt 51661 gatcataatt tcccgtatga atactgtgtt gaacatgaaa cttgtgattg gtagtaaaac 51721 accgattcgc tcgttgagac ctaattcgat cttcatagag ttctcgagat attttcttac 51781 gaagttttgt tatagcatct ataaccgctt ccggtttagg tgggcaacct ggcaaatata 51841 catctacagg aattagctta tcgactcccc gaacagtact ataagaatcg gtactgaaca 51901 tcccgcctgt aattgtacag gctcccatag caataacata ttttggttca ggcatttgct 51961 catataatct cactaaagag ggggccattt tcattgttac tgttccggct gttaaaatta 52021 gatccgcttg tctaggactc gatcttggta ctagtccata acgatcaaag tcgaagcgtg 52081 agcctattag tgaagcaaat tcaatgaagc aacaactggt accatagaga agcggccata 52141 aactagagag tcttgaccaa tttgaaagat catttaatgt agttgaaata actgaatttt 52201 gggttgttcg atcaagtaaa ggaaactgaa tggaattcat aactgtctca atcttatttt 52261 ttccgttttt ctttttattg tctgaatatt caggagctaa gaccattcca atgccccctt 52321 tcgccatgca taaactaaac caataattaa gataagcacg aaaatgaaag cttctataaa 52381 tacagataca cccaatacgt cgaaactcat tgcccatgga taaagaaaaa ccgtttcaac 52441 atcaaaaaca acaaaaacta gagcaaacat ataataacgg attcgaaatt gtaaccaagc 52501 atcgcccatt ggttctatac ccgactcata agtagaaagt ttctccggcc ctttgctaat 52561 cggggctaac actccggaaa ttaaaaatgc caaaatagga acaaggatag atattattag 52621 aaatgcccaa aaaaaatcat attcgtaaag cagaaacata aacgcactcc tatgaacgtg 52681 gaaaatatac cggattcgat tggtcgattc gaattggaat tgtcaagtca tccataacta 52741 tttagtcaaa acaagaattc attttgatcg aaccgtctag tttgctttgt ttattggttt 52801 attgtagggc atatctcatt gcaagattca tcgactggaa tccgatttta tttccattat 52861 acttatttcc attttattta gttagtagaa ccttctaact atatattact cttatacaaa 52921 ttctcttgtt tctcttgttt tcatccagga ttttctctaa agacggggaa ttctaaatta 52981 attacttatc ttatttcttc tttaattaga aattctttaa agatttctat ttttttctat 53041 aaatagaatc aggaggtctt ttttcttatt ttttcttagt gatttagaat agaacaagta 53101 atcaaataga agagaatgta taggaatttc catctcaaga tttagaagat cttgtgttgg 53161 tatattcctt attattatta tttaataata gtattagggt tcgaatccag gtgacggggt 53221 ttttcttggt tgaatacaga aaaagaggac tggccttttt cgtgttgtgc ttcgctaggt 53281 cgaggtaagt aaggtatacg aaggaaaagc ctatttgaca atgaaagtga ccaaaggtat 53341 tcgtttttca aaaaacttta gcttgtacac aaatacagca ggcccttcct aaatccatgt 53401 gaattcctct tcgtagtttt tcatttcacc aggcccgtga aatgatttga cttccacaac 53461 tcaataagat tggggatatc aaaagaaagg gagtctcact aattctttta ttgtggatat 53521 gaatatgtaa ttcgcctccg aagattaatg acgaaaggtt ggtttcttta tccgcaattg 53581 aaaaaatcaa tatcgattgg atccgttgat atgcattttt tctttcatct gcttaaacga 53641 ttgccgtgag taaacttata ggaataattg gatttcactt agttacaagc aagaaataat 53701 aatgaagaaa tgaaaattat agaatttttt ggattttgca tttttatagg gctatacgga 53761 ctcgaaccgt agaccttctc ggtaaaacag gtcaaactta ttattattaa aatgatctga 53821 actgtttcaa agacccaaca tgcatttttt ttgcattggg ctctttcatt aactgatata 53881 aatatcagtt agtctgccat tttttttctt gacagaaaaa aagataagga aatggctcca 53941 tgtgctctga ttcattattt gggagcatta ccaaagtgtt tcaaaggtgg gattatcttg 54001 acgtaggtct gtctctggcc tagatcaacc taagttaaat gaagtctcta tcgttctgct 54061 gaaaaaatca aatatgaaac ttcatacacc ttaaagttca tatgacgaaa agagattttt 54121 ttgaggtcct tatactcatt atgcctagca ttgaatagac tgggtattca ccttatcaag 54181 atctcaaatc aatgatgggg tctgtttggc acctcctaaa tgggcgtcca aattggaccg 54241 aactctttgt caggctatgg ttccctcaaa gttatggagt aagacatcga tttctcaaca 54301 agatcaattt ttctgattgt atgatgaact cccttgaaaa acattggcgc gcgtgtaaac 54361 gagttgctct accaactgag ctatagccct tagtgcttgt gatacatatt ttatcatgta 54421 gataaattct tgtcaagata aatattccat gatccaacat caacaatctt tgatctcttt 54481 gagcggtatt ccttagatta gtattgctta ttaagtaata tgatatttat aatccatcga 54541 caggatgggt ttcatttggt tctctttggg atgataaatg acctacttaa ctcagtggtt 54601 agagtactgc tttcatacgg cgggagtcat tggttcaaat ccaatagtag gtaaaactta 54661 ttagatacca gagtcaatgg tatctaataa ggtttacgac ccacccttag tgatattgat 54721 tttttgattt tgtatctttt ctatttcatt tttgaatttg aatttttgca tcagaattgg 54781 attctgtttg attgtatttg attgtattca cccgacagaa tctaaatagg attagaaaga 54841 gaacttcttt ttattattcg aacgtaccaa ctagttatga aatcggattg atagcctcca 54901 cccgtgttct agctcgtcgg agagctagat ttgcctcaat tttttgtctc cttccttcag 54961 cctttttcac attagcttcc gctagttcaa gagtttgctg agcttcttgt ggatcaatgt 55021 cactaccctt ctccgcatca tttactaaaa cagtgatctc attattgcct attctagcaa 55081 aaccacccat cagagccatc gttaaccatt ggtcgttaag acgtattctc aaaatcccta 55141 tatctacagc tgtggcaata ggggcgtgat ttggtaatat gccaatttga ccgctattag 55201 tagataaaac aatttcttcc acttctgaat cccaaacaat tcgattaggg gtcagtacac 55261 taagatttaa ggtcatttct tcaaattgct ctccatttct aagttcatag ccttcgcggt 55321 agcttcatcg atattaccta ccaaataaaa ggcctgttca ggaagaccat ctaattctcc 55381 ggaaaggatc aattgaaatc ctcgaattgt ttctgctaga ccaacatatt tacctggaga 55441 accggtaaat acttctgcta cgaaaaaggg ttgtgataag aaacgctcaa tttttcgcgc 55501 tcttgctacg agtaaacgat cctcttcgga taattcgtcc aatccaagga tagctataat 55561 gtcctgaagt tctttgtaac gttgtaaagt ttgcttaact ctttgggcgg tttcgtaatg 55621 ttcctcacca acgatccgag gttgaagcat ggttgacgtt gaatctaaag gatctactgc 55681 tggataaata cctttggcag ccaatcctct tgatagtacg gtagtagcat ctaaatgtgc 55741 aaatgtcgta gcaggagcag ggtcggtcaa atcgtctgcg ggtacataaa ctgcttgaat 55801 agaggttatg gacccttctt tggtagaagt aattctttct tgtaaagaac ccatttcggt 55861 actcagggtg ggttgataac ccacagcgga aggcattcta cccaataagg ccgatacttc 55921 ggatcctgct tggacgaaac ggaagatatt gtcaataaaa agaagtacgt cttgctcatt 55981 aacatctcgg aaatattccg ccatagttag ggcagtcaaa ccaactctca tacgagctcc 56041 cggcggttca ttcatctgac cataaactag ggctactttt gattctgcaa tattttcttc 56101 attaattact ccagattctt tcatttccat gtaaagatca tttccttccc gagtacgttc 56161 acccactccg ccaaatacgg atacgccccc gtgagcttta gcaatattgt taatcaattc 56221 cataataagt actgttttac ccactccagc tcccccgaat agtccgattt ttcctccacg 56281 gcgataaggg gctaaaagat ctactacttc aattcctgtt tcaaaaatag ataattttgt 56341 atccaactgt ataaaggcgg gcgcagatct atgaatagga gacgttgtac tagtatctac 56401 aggccctaaa ttatcaacag gttctccgag cacgttaaaa attcgtccca gagtcgctcc 56461 cccgaccgga acacttatag gagctcctgt gtcaatcact tccattcctc tcgttagacc 56521 ctctgtagca ctcatagcta tagccctaac tcgattattt cctaataatt gctgtacctc 56581 acaagccaca ttaattggtt gaccaacact atctcgacct tgaactacca gagcgttata 56641 aatattcggc atcttgcccg ggggaaaggc tacatctagt accggaccga tgatttggac 56701 gacacgcccc gggttttttt tttcaagcgt ggaaacccca gaaccagaag tagtaggatt 56761 gattctcata ataataaaat aaataaatat gtcgaaatgt ttttgcaaaa attatcgaat 56821 tcaaaataaa tgtccgctag cacgtcgatc ggttaattca ataaaatggg aattagcact 56881 cgatttcgtt ggcaccatgc aattgaaccg attcaattgt ttacttattc actgagactg 56941 agtgaatttg caagcccacc caacctattt taattttaaa atctcaagtg gatgaatcag 57001 aatcttgaga aagtctttca tttgtctatc attatagaca atcccatcca tattatctat 57061 tctatggaat tcgaacctga actttatttt ctatttctat tacgattcat tatttgtatc 57121 taattggctc ctcttcttat ttatttttga tttcaatttc agcatatcga tttatgccta 57181 gcctattctt ttctttgtgt ttttctttct tttttatacc tttcatagat tcatagagga 57241 attccgtata ttttcacatc taggatttac atatacaaca tataccactg tcaaggggga 57301 agttcttatt atttaggtta gtcaggtatt tccatttcaa aaaaaaaaaa agtaaaaaag 57361 aaaaattggg ttgcgctata tatatgaaag agtatacaat aatgatgtat ttggcaaatc 57421 aaataccatg gtctaataat caaacattct gattagttga taatattagt attagttgga 57481 aattttgtga aagattccta tgaaaagttt cattaacacg gaattcgtgt cgagtagacc 57541 ttgttgttgt gagaattctt aattcatgag ttgtagggag ggatttatgt caccacaaac 57601 agagactaaa gcaagtgttg gattcaaagc tggtgttaaa gagtacaaat tgacttatta 57661 tactcctgag taccaaacca aggatactga tatattggca gcattccgag taactcctca 57721 acctggagtt ccacctgaag aagcaggggc cgcggtagct gccgaatctt ctactggtac 57781 atggacaact gtatggaccg atggacttac cagccttgat cgttacaaag ggcgatgcta 57841 ccgcatcgag cgtgttgttg gagaaaaaga tcaatatatt gcttatgtag cttacccttt 57901 agaccttttt gaagaaggtt ctgttaccaa catgtttact tccattgtag gtaacgtatt 57961 tgggttcaaa gccctgcgcg ctctacgtct ggaagatctg cgaatccctc ctgcttatgt 58021 taaaactttc caaggtccgc ctcatgggat ccaagttgaa agagataaat tgaacaagta 58081 tggtcgtccc ctgttgggat gtactattaa acctaaattg gggttatctg ctaaaaacta 58141 cggtagagcc gtttatgaat gtcttcgcgg tggacttgat tttactaaag atgatgagaa 58201 cgtgaactca caaccattta tgcgttggag agatcgtttc ttattttgtg ccgaagcact 58261 ttataaagca caggctgaaa caggtgaaat caaagggcat tacttgaatg ctactgcagg 58321 tacatgcgaa gaaatgatca aaagagctgt atttgctaga gaattgggcg ttccgatcgt 58381 aatgcatgac tacttaacgg ggggattcac cgcaaatact agcttggctc attattgccg 58441 agataatggt ctacttcttc acatccaccg tgcaatgcat gcggttattg atagacagaa 58501 gaatcatggt atccacttcc gggtattagc aaaagcgtta cgtatgtctg gtggagatca 58561 tattcactct ggtaccgtag taggtaaact tgaaggtgaa agagacataa ctttgggctt 58621 tgttgattta ctgcgtgatg attttgttga acaagatcga agtcgcggta tttatttcac 58681 tcaagattgg gtctctttac caggtgttct acccgtggct tcaggaggta ttcacgtttg 58741 gcatatgcct gctctgaccg agatctttgg ggatgattcc gtactacagt tcggtggagg 58801 aactttagga catccttggg gtaatgcgcc aggtgccgta gctaatcgag tagctctaga 58861 agcatgtgta aaagctcgta atgaaggacg tgatcttgct caggaaggta atgaaattat 58921 tcgcgaggct tgcaaatgga gcccggaact agctgctgct tgtgaagtat ggaaagagat 58981 cgtatttaat tttgcagcag tggacgtttt ggataagtaa aaacagtaga cattagcaga 59041 taaattagca ggaaataaag aaggataagg agaaagaact caagtaatta tccttcgttc 59101 tcttaattga attgcaatta aactcggccc aatcttttac taaaaggatt gagccgaata 59161 caacaaagat tctattgcat atattttgac taagtatata cttacctaga tatacaagat 59221 ttgaaataca aaatctagaa aactaaatca aaatctaaga ctcaaatctt tctattgttg 59281 tcttggatcc acaattaatc ctacggatcc ttaggattgg tatattcttt tctatcctgt 59341 agtttgtagt ttccctgaat caagccaagt atcacacctc tttctaccca tcctgtatat 59401 tgtccccttt gttccgtgtt gaaatagaac cttaatttat tacttatttt tttattaaat 59461 tttagatttg ttagtgatta gatattagta ttagacgaga ttttacgaaa caattatttt 59521 tttatttctt tataggagag gacaaatctc ttttttcgat gcgaatttga cacgacatag 59581 gagaagccgc cctttattaa aaattatatt attttaaata atataaaggg ggttccaaca 59641 tattaatata tagtgaagtg ttcccccaga ttcagaactt tttttcaata ctcacaatcc 59701 ttattagtta ataatcctag tgattggatt tctatgctta gtctgatagg aaataagata 59761 ttcaaataaa taattttata gcgaatgact attcatctat tgtattttca tgcaaatagg 59821 gggcaagaaa actctatgga aagatggtgg tttaattcga tgttgtttaa gaaggagttc 59881 gaacgcaggt gtgggctaaa taaatcaatg ggcagtcttg gtcctattga aaataccaat 59941 gaagatccaa atcgaaaagt gaaaaacatt catagttgga ggaatcgtga caattctagt 60001 tgcagtaatg ttgattattt attcggcgtt aaagacattc ggaatttcat ctctgatgac 60061 acttttttag ttagtgatag gaatggagac agttattcca tctattttga tattgaaaat 60121 catatttttg agattgacaa cgatcattct tttctgagtg aactagaaag ttctttttat 60181 agttatcgaa actcgaatta tcggaataat ggatttaggg gcgaagatcc ctactataat 60241 tcttacatgt atgatactca atatagttgg aataatcaca ttaatagttg cattgatagt 60301 tatcttcagt ctcaaatctg tatagatact tccattataa gtggtagtga gaattacggt 60361 gacagttaca tttatagggc cgtttgtggt ggtgaaagtc gaaatagtag tgaaaacgag 60421 ggttccagta gacgaactcg cacgaagggc agtgatttaa ctataagaga aagttctaat 60481 gatctcgagg taactcaaaa atacaggcat ttgtgggttc aatgcgaaaa ttgttatgga 60541 ttaaattata agaaattttt gaaatcaaaa atgaatattt gtgaacaatg tggatatcat 60601 ttgaaaatga gtagttcaga tagaattgaa cttttgatcg atccgggtac ttgggatcct 60661 atggatgaag acatggtctc tctagatccc attgaatttc attcggagga ggagccttat 60721 aaagatcgta ttgattctta tcaaagaaag acaggattaa ccgaggctgt tcaaacaggc 60781 ataggccaac taaacggcat tcccgtagca attggggtta tggattttca gtttatgggg 60841 ggtagtatgg gatccgtagt cggagagaaa atcacccgtt tgattgaata cgctgccaat 60901 caaattttac cccttattat agtgtgtgct tctggggggg cgcgcatgca ggaaggaagt 60961 ttgagcttga tgcaaatggc taaaatatcg tctgctttat atgattatca attaaataaa 61021 aagttatttt atgtatcaat ccttacatct ccgacaactg gtggagtgac agctagtttt 61081 ggtatgttgg gggatatcat tattgccgaa cccaacgcct acattgcatt tgcaggtaaa 61141 agagtaattg aacaaacatt gaataaaaca gtacccgaag gttcacaagc agctgaatac 61201 ttattccaga agggtttatt cgacctaatt gtaccacgta atcttttaaa aagcgttctg 61261 agtgagttat ttaagctcca cgcctttttt cctttgaatc aaaagtcaag caaaatcaag 61321 tagagcacta agttcaatta ttttatttgt gtttgtagca aaaaagtagt tagtttgtcg 61381 gaatcaaagt aaataagata ataatggcgc tttctttggt gatagaagat ctaattgtag 61441 aaagaatcaa aactaaagtt gaggataact ctttttttga cctatattcc tgattacgaa 61501 tcaagaagcc tttatcaaca agagtgagtt cttcctttcg tgaaattagg aaaataaaac 61561 gaatttcttc ttcttgtctt aggtatataa tttgaaattc aaatatagat aatagagttt 61621 tgtatctttc tctatctccc gaaaaaccat tttagctaaa aattcatgtt gggtcggatt 61681 cgaacgaatc tttcgataat ctgtaagaaa ctctttatct atttttagaa aattagaaga 61741 caagaacaaa agacaaagaa atgaagaaaa ataataaagt ttattatgat acatatcttt 61801 ctcatgtagg ggatgaataa gtccatttat ttagttctac agttctacat tctttgcact 61861 tattatacct actcagttag atttagatat atagatactt agatctatac taagaatttc 61921 aaattcttca aattctatta ataataaata ttatctaatt tctaattagt aattagaatt 61981 caaattctta atttaattat aattattaca agatatcttt atttatataa taacataata 62041 acagatacaa atagtaaatc gaggtacccc ttctatgaca aatttgaacc ttccatctat 62101 ttttgtgccg ttagtaggcc tagtctttcc ggcaattgca atggcttctt tatttcttca 62161 tgttcaaaaa aataagattg tttagatccg ctgggaccca atctcatcca tttttttttt 62221 gaaaacgtgg acttgtatca taacacagat atctatttat tggaatatag tataacatgt 62281 gatttccacc gaacataaag gaaaaaactc ttatgcccgc agaaatatga tatatggata 62341 tatcaattct aacaattttc aaatagatca ggatcgctgg atggctgaaa tgtagtcggt 62401 gaatctctat gtatatcgat atgtatagtg ggatcgtatt aaataaagag tatgttatta 62461 ttttagattt aaccaatttg atgaattact cctaaaggtt gacatcaaac tagtgctagt 62521 tcacctcaaa ctagtgctag ttgatgagag ttacttcgga aacaaaaaag taaagtcaaa 62581 tttctctggg gtattatctc aattccaata aaatgcaatc gggtaaagta tgacttggcg 62641 atcagaacat atatggatag aacttataac ggggtctcga aaaataagta atttctgctg 62701 ggcctttatc ctttttttag gttcattagg cttcttatta gttggaactt ccagttatct 62761 tggtagaaat ttgatatctt tttttccgcc tcagcaaatc attttttttc cacaaggact 62821 cgtgatgtct ttctacggaa ttgcgggtct ctttattagc tcttatttgt ggtgcacaat 62881 ttcctggaat gtaggtagtg gttatgatcg attcgataga aaggaaggaa tagtctgtat 62941 ttttcgttgg ggatttccgg gaaaaaatcg tcgcatattc ctccgattcc ttataaaaga 63001 tattcagtcc gttagaatag aagttaaaga gggtatttct gctcgtcgtg ttctttatat 63061 ggacatccga ggccaggggt ccattccctt gactcgtact gatgagaatt tgactccacg 63121 agaaattgaa caaaaggctg ctgaattagc ctatttcttg cgtgtaccaa ttgaagtatt 63181 ttgagaaatt gagatatcag tatcaggaaa caatattctg aatttcttca ttcgaagtga 63241 attcttagct tttttctgga ttctttctag attcaaagac taaccacaaa atcacaaaga 63301 aaatagattc attagtccga taccttgtat aaaactcatg tgtgtaagaa atattcgatc 63361 gcatagagtg tacgaatggg ttgattaaca attcacagat gaaaaaatgg caaaaaagaa 63421 agcattcact cctcttttct atcttgcatc tatagtattt ttgccctggt ggatttcttt 63481 ctcagttaat aaatgtctgg aatcttgggt taccaattgg tggaatactg ggcaatccga 63541 aatttttttg aataatattc aagaaaagag tcttctagaa aaattcatag aattagagga 63601 actcctcttc ttggacgaaa tgatcaagga atactcggaa acacatctcg aagagtttgg 63661 gataggaatc cataaagaaa cgatccaatt aatcaagata caaaatgaga atcgtatcca 63721 tacgattttg cacttctcga caaatatcat ctgttttatt attctaagcg ggtattcaat 63781 tttgggtaat gaaaaacttg ttattcttaa ctcttgggct caggaattcc tatataactt 63841 aagtgacaca gtaaaagctt tttctattct tttattaact gatttatgta tcggattcca 63901 ttcaccccac ggttgggaat taatgattgg ctctatctat aaagattttg gatttgttca 63961 taatgatcaa atcatatctg gtcttgtttc cacctttcca gtcattctcg atacaatttt 64021 taaatattgg attttccgtt atttaaatcg tctgtctccg tcacttgtag ttatttatca 64081 ttcaatgaat gactgataaa ggatccattg atattaatct aatccaatta gaatgcttgg 64141 tactttgtag ttgtacataa gcaaagtatt gaaaatcata tttactcttt ctatttctaa 64201 ccatcgggga gattcatcct atattattcc tagattattc cagcaaatag cagaatcgtg 64261 gctagggaac tatactagcg acctacccaa tttattgtag aaattttcgc gatcaatgat 64321 tggaccatgc aaactagaaa tgctttttct tggctaaaga aacagattac tcgatctatt 64381 tccgtatcgc tcatgatata tatcttaact cggacatcca tttcaagtgc atatcccatt 64441 tttgcacagc agggttatga aaatccacga gaagcgactg ggcgtattgt atgtgccaat 64501 tgccatttag ctaataagcc cgtggagatt gaggttccac aagcggtact tcctgatact 64561 gtatttgaag cagttgttcg aattccttat gatatgcaac tgaaacaggt tcttgctaat 64621 ggtaaaaggg gggggttgaa cgtgggggct gttcttattt taccggaggg gtttgaatta 64681 gctcctcccg atcgtatttc tcccgagatg aaagaaaaga ttggcaattt gtcttttcag 64741 agctatcgcc ccaataaaaa aaatattctt gtgataggcc ctgtccctgg tcaaaaatat 64801 agtgaaataa ccttccctat tctttccccg gaccctgcta ctaagaagga tgttcacttc 64861 ttaaaatatc ctatatacgt aggcgggaac aggggaaggg gtcagattta tcccgacggc 64921 agcaagagta acaatactgt ttataatgct acagcagcag gtatagtaag caaaatcata 64981 cgaaaagaaa agggtgggta tgagataacc ataacggatg cgtcggatgg acgtcaagtg 65041 gttgatatta tccctcccgg accagaactt cttgtttccg agggcgaatc tatcaaattt 65101 gatcaaccat taacgagtaa tcctaatgta ggcggatttg gtcagggaga tgcagaaata 65161 gtacttcaag atccattacg tgtccaagga cttttgttct tcttggcatc tgttattttg 65221 gcacaaatct ttttggttct taaaaagaaa cagttcgaga aggttcaatt ggccgaaatg 65281 aatttctaga ttcgcagatt tgtcgacatc aagttcgtaa aaagaaccaa attcttgttg 65341 gcgattattt atgatcaaaa aaatgaaatt ctgaaaactc ctttgtctta tttatactct 65401 tcttcaaaat ctacatacta tgtggtacaa gggattccca gcatctcgta gaaaaagagt 65461 atgtaatgta gaatttgaag aagagtattt gactttcatt atttttattt cgttttttaa 65521 aattggagta gtgtgactat gttactattg acagatttca atgccataag acgtatcaat 65581 agttttctat tctaaataga aagaaagtca aatttgtcta aatactagac ataaggaagc 65641 aggggataaa tgcggggaac aaaaaattct aggagggatt atttgtcttc ctagtcttcg 65701 acacaagaaa ggggtgtaga aaaatccttt tttcttgtgt cgaaacgaaa gagtaatgat 65761 tcttgatcct gtttgttaaa aattcctagt cttggtttcg atttttccag atgtatcaga 65821 aaccctttac cttaccccca ccccctttac gtataatata ctaagtggtg gacaaacaaa 65881 acaaaaaaag agaggaaatt ttattaatta aataaaactt cttcaatcaa cttatcttat 65941 acaaaatttg atgatgaaat atgaaaacaa taaaaaataa atagagtaat gtaatagaga 66001 gagtaaggtt ctacattaga ttagtataga aaggatttgc acgatatcta atatattata 66061 gcagccaaga aattgagtga ttccttcttt cttccaactt tgaaagtacc gatagatact 66121 atcatagaaa aagaagaggt ggtccgaata gtgaattttt caaaaacatg atcagaaaaa 66181 tgagaaaaat ggagtttttg aaaagaaaaa gaaatccatt ttatcattta gacgaaaaaa 66241 atattatgat tcttaagaac tcaacgggcc cttccccttc gaatcaaaca aacaaagaag 66301 ggaattccgt tgagttctta cgctttcatg ttgacgactc aattcattcg attactagag 66361 ggatgaaccc aatccggaat atgaaccata aaagaaaata cctattaaac cgattacaag 66421 aataccagct acagtaccta ttatccaaag aggaatcctt ccagtagtat cggccattta 66481 ccccacttcc ctccagattt catcaagtgg tcatgctaga gacataaaca gtcatggata 66541 attaaattat gagatccttc cgaatgagct aagagaatct tattgattct ctttcgtttt 66601 cttaattgaa gaaataattg gaaaataaaa cagcaagtac aaaaatgagt aataaccccc 66661 agtagagact ggtacgattc aattcaacat tttgttcgtt cgggtttgat tgtgtcgtag 66721 ctctataatt cggattaagt ttatcgttgg atgaactgca ttgctgatat tgatcccaaa 66781 aaaaagacgg taggtacagc taggccgtga acagccaacc atcgtactgt aaaaattgga 66841 taggttcgat ctatagtcat tagggcctcc taaaacgatc tactaaattc atcgagttgt 66901 tccaaaggat caaaacggcc agttattaat ggaattcctt gtcggctctc tgtaaaatac 66961 tcgtttggcc gagggcttcc aaacacatcg taagctaaac cggtgctgac aaataaccaa 67021 cccgcaatga atagggaagg tatagtaatg ctatgaatga cccagtatcg aatactggta 67081 ataatatcag caaacgaacg ttctcctgtg cttccagaca tgctgagctc cacatattct 67141 tgtacagtca aagaagatcg attccgtaaa agatgagatc agtaaatgac aattcactga 67201 aatttcatct ttgtgagatc gtcaatattg taccgaaggc gtctttagag tataccgaat 67261 cagtatagct atccttcttc tgacacagca acgcaatttg aaatagtatc aaaagtaagt 67321 actaaataat ttcttttttc ctttacttgt tgatgtaaaa tcatcttcca ttcaatagaa 67381 aattctttca attcaacgaa agagattctc atattcacac aatttaagta gatgcgagat 67441 atagaaattt gcttttcgta gttgtggaag cagttttgtt gttggaatcc tttttttaaa 67501 gaagaagtta atggtcgagt aagaaataag agtagtagat catattcgag gaaagaaaaa 67561 atcgaataat tggaatccat agttgtgatg cattgttgtg gatctcgatc caaaggttct 67621 ttcttgatct agctacaagg atggggcagt agggaaagat aaaatgtgga acctaataga 67681 aattactagt tttagaatct agttggacaa aaaaaagatt ttttcaagcg attgtgtgat 67741 aactttttct tcttctccat cattcaagat attatgtgaa ttaatatatt actaaatcta 67801 atgagttaaa cttaaatgaa agtaaaaaga aaaagtttta taaggtaact gttcgcttta 67861 aaatcgaaaa tggagtcgat acaattcaac agaatctaag aaatgatcaa attcgaaaat 67921 catttctatt tttattctat aaaaattcaa gtttcatttt tgaatgcagt tagacgatac 67981 agctcttatt agtttaatag tttactcaag agttactcaa tgaatcggtt gattggaatt 68041 gcgggatgga tagatgttac agatgatgaa tcaatttctt ttatatgtct gtcactttat 68101 ctttgttagt gctgtctgcc tataatgata gataaatcaa aaacttttca ttcaacttat 68161 tctttcaatt gaaattgaga tttttgccta tcctcctatt ttattttgaa aaatttgaaa 68221 cttaggtaag tgctttttaa acatatgtat aaaaagaaca tatttcattt aatttagccc 68281 cttcatgctt actataacta gttatttcgg ttttctatta gcggctttaa ctataacctc 68341 agctctattt attggtctga gcaagatacg acttatttaa actgaatatt taaaatgaac 68401 aattcataaa aagaaatcct tctgtgggat tacgcgtatt ctatatttac ttacgttacc 68461 aattgtcaat tcttgttcat tgtcattgag attcatgtca attcggatta atatttaggt 68521 atcgatatta cctctttttt tctcctttca aacaaataaa aatgattgaa gtttttctat 68581 ttggaatcgt gttaggtcta attcctatta ctttggctgg attattcgta actgcatatt 68641 tacaatatag gcgtggtgat cagttggacc tttgattaat taacatctct ttttgattga 68701 cctcctcctt tctttaattc acaggcacag gaggtcaaat tccgattgtt gtgaaagtta 68761 ctgaatgaat ctattttatt ctaattcgat ctaagaagaa aaaaatcacg ctctgtagga 68821 tttgaaccta cgacatcggg ttttggagac ccacgttcta ccgaactgaa ctaagagcgc 68881 tttcttatca gaatagataa gactgtaaac aaaaggattc ttttcataac cccaatacat 68941 tttgtatgca tatactagaa tagcatgata aaaatcaaag attatgtcca atttgaggcg 69001 atctcaattg atccctcgtt actgctcctt tgagcagtaa taggtaggga tgacaggatt 69061 tgaacctgtg acattttgta cccaaaacaa acgcgctacc aagctgcgcc acatcccttc 69121 aattgttcca cagtgtaatt gtagagaatt cctgtcttgt tttccacatg gttatttcct 69181 ccattgatat atacaaattt tctgctcatt tcgtcttttt ggtctcattt aacatataat 69241 agtaaaataa aaggaaaaga cttctcttat agattatata gaaaatactt atatacaatt 69301 atatacaaaa tatataaata cagaacccgt cgtaaaaatc aattagtatt tttcggaaat 69361 tctcggtaag aaagaagggg atgtattttt tttttctgtt ttaagaaaag gaaaatctta 69421 tttcccgaat cattgtacat tgcaatttga attaggaatt ctgtgtccaa ctctaagcag 69481 cccttaacta catatgcatc tgattatata tgtattatct attccaacaa ataatacaaa 69541 agaaggaggt ttttcaatgc gagatctaaa aacatatctc tctgtggcac cagtactaag 69601 tacgctatgg ttcggggctt tagcaggtct attgatagag attaatcgtt ttttcccgga 69661 tgcgttgaca ttcccctttt tttcattcta gttattgtca tgggaaggaa tgaagaagat 69721 tagagatcca atcaaatatt ggtgatgaat ccctctcccc ctcttttctc ttttttccct 69781 ttttagaata agggaggaaa gagaaagaat aaaaaaagtg gattcaacat tcgggctcaa 69841 gttcgaatta actgaatatt aataatagag gaatgggggt agaatagaag atctagggca 69901 agagtattat acaagatact taaatgatta cttcaatttg aaatatactt tagaaaaatc 69961 gttgtatttt actatgactt tgctttacta ttactttatt ttcttgattt taatctttta 70021 cttttagaat tggatttcaa gttagtaact tctattttat cctttcttcg ttttgaatcg 70081 aaaatagaag agttgagtaa atcaaaaatc caaaggaggt tcatggccaa ggggaaagat 70141 gtccgagtaa cggtgatttt ggaatgtact agttgtgtcc gaaacagtgt tgataaggta 70201 tcaagaggta tttccagata tattactcaa aagaaccggc acaatacgcc taatcgatta 70261 gaattgaaaa aattctgtcc ctattgttac aaacatacga ttcatgggga gataaagaaa 70321 tagagcgaac caagtacctg tgtcttaccc tttcaaggaa ggggaaaaaa tgacattata 70381 tatataacat atttaaatag aaaataaaca aatcttattt tttaaaaatc ctattttggg 70441 tggatttaaa ctgaattaga attaagaaat aggattttag ggataaggaa taaattaaac 70501 aaacaaacca tggataaatc caagcgacct tttcttaaat tcaagcgatc ttttcgtagg 70561 cgtttgcccc cgattcaatc gggggatcga attgattata gaaacatgag tttaattagt 70621 cgatttatta gtgaacaagg aaaaatatta tcaagacgag tgaatagatt gaccttgaaa 70681 caacaacgat taattactct tgctataaaa caagctcgta ttttatcttt gttacccttt 70741 ctcaataatg agaaacaatt tgaaagaacc gagtcgaccg ctagaactac tggttttaaa 70801 gcccgaaata aataggctta ctttttcttc acttgaatca taattacaag aatctagatt 70861 tgagtatcgt gtcgtaagaa aaaaaatgaa tcggaaaaaa agatttcttt ttttattgaa 70921 ttgaacgtgt tcattcattt tgactacttt agcatatttt ctcatagaaa tttctactct 70981 accttcccgg agttcattct ccggggaact ccatttaaat tattctggtg gattctttcc 71041 aatctacttc ctttatgatt tcgttcgaaa tcatataaag acaattccta tttgatatag 71101 ctatttgtgc aagtatttta cggttaagaa gcaactgtct cttgtacaga tcgtgtatta 71161 atctactata actataggat actccccttt cgcgaattac tgcgtttatc cgagtgatcc 71221 acaaacgacg aaaatctctc tttttcctat ccctatcccg atgagccgaa actaaagctc 71281 ttattttctg ttgagtaata gttcgagtaa gccttgaatg agccccccga aagcttgatg 71341 caaataaacg aatttttgtt ctacgtctcc gagctatata tccccgttta attctggtca 71401 ttgaataaat gaaactttga cgaataacta atcgattgcc tttctttcag ttattctttt 71461 cccccttcct agtctattaa taacaaaacg gatttttcca atgtataaaa taaaaattcc 71521 aatggctttg gctactctaa ccttcccgac cacgattttt tctttttttt ttttttaggt 71581 atttcactgc gaaataagaa agaaataaaa aattgtattt tcctaggtat caaaaatcta 71641 gtaaataaaa gaaatcaaaa aataaagtag tgggttcctt cgtttctatg gttacttctt 71701 aaacggtgag gtcttctcta tacaccggag cctttacttt atactttaat ttaatattta 71761 atcaactaat tgatgttatt gggaacttgt atagttcaca ctctttggct ctacccatga 71821 attatccagt aataggtctt tcacaatcag atctacctat acagtaagcg gtatttaatt 71881 atgaaagttt gctgggtagc tgaccctctt agtccgttct tgccagagtg ggagcctgcc 71941 taatctttat gttttatgct ttttaaataa gatttcctcc gcttaatgga taaccatttg 72001 ttaccaatgg agaatttctt atcatctgtg attggattta caccaacgga aaccataaac 72061 ttcatacaca atagagggat atgagagagt tttttttaaa taatgaatgg agttccttct 72121 tccatcctat cccattcacc ggtactgatc attgatactg taaaagtcgt tttcttgctt 72181 ttgtgccagc tcatgatcta aacgagtcgc acatacaccc tagtacatgt tcctcgacgc 72241 tgaggacagc cccgaagagc gggggatttc gtgacatttc tgattggctg tcttgtattt 72301 ctaataagtt gtttaatagt tggcatgttg aatcgtatac ataatatgat gggttggttt 72361 agattgatcc taaccgaatg atgatgaatt acttctattt aatagaatat tcaattcgaa 72421 gataaaatct caaatcacag atttgcgcga aatccatgtt attttcattc aaccgctaca 72481 agatcaacaa ttccataagc ttgggcttct gttgctgaca taaaaacatc tctttccata 72541 tcttcggata caacccataa gggtttcccc gttctttgta cataaaccct tgtgagggtt 72601 tcacgcagtt tcagcagttc ttccgcttcc aggacaaatt cgcctgtttg tgcctcataa 72661 aaagaactag caggttgatg gatcattacc ctgatgatat aacaaaataa aagcttcccc 72721 tatctcgcat gataaagcaa agagaaaaga aagataaaga atagaaaaaa gatagaattg 72781 aaccaaccgt acaggccatc ttttgtgcat acggcctcta caagaaaatt gacctcccct 72841 cctttctatt gaagaaagag aaaaaataga atctatcaga ctcagatggg taaatgatca 72901 aattccgatc cttcctttcg gaggagttaa aaaatactat gatggctccg ttgctttata 72961 tgtttatttt ttcttttttt ttttttgtct gtgattcacg aatcccaaag tttcttttta 73021 atccgatcaa ataaggaaaa aagtcttttt tttttttttt cgtactcttt cataacataa 73081 atattgttaa gaactctccg gcatgaaaac aaaaaagttt gtgacgctga actgaactcc 73141 cgatagataa gagaaaatcg gaaatacccc ttatctcata ctactctctc gatacagaat 73201 ctaatgtttt gaaaaaaaaa caatacaaaa atttctcata tcgaattcga agtgccatgc 73261 tattattact tagtattcat atggcgaagg catagtcttc ttttttctct caaataaaaa 73321 cctcattggc gccaagcgtg agggaatgct agacgtttgg taatttctcc tccgaccagg 73381 ataaaagatc ccattgaagc ggctaatccc atgcatattg tatggacatc tggtcgcaca 73441 aattgcatag tatcataaat agccacccca ggtattaccc agcccccagg agagtttata 73501 aacaaataca gatctttggt ctcatcctcg atactgagat ataccataag accaataagt 73561 tgattcgaaa tctcgctatc aacctcttgg cctaaaaaaa gtaatctttc tcgataaagt 73621 cggttgatta gggtaaaatt gtatccctta ggaaccgtac atgcgccttt tgatgcatac 73681 ggttcaaaaa aaaaatggtg aatcaatgta tagattccag tcctctttct ttttttctag 73741 aaaggttctt tcttacttct aacgaaaggg cttttcttcg attttttaat aaagacgagt 73801 tttgactcct tttttatatt ttcgattttc cattataaaa tttgaagtta taagaaaggg 73861 tcattaaact tatcgaatta acttctcatt gatgtattct ttcatcgaga tttaatccaa 73921 accgcgatgg tattttcttg ttcctgaatg ggtctgtttc atctttttag gtttatgctc 73981 tactccgggt aaagatccgc ccgatttgga tttgtacata taggacaaat gctcccatta 74041 ccatttcttt ttgtatttct tttttttttt caattcattt tatacaagta tttcttagag 74101 ttgagataac tttgcttgac aattaggatc tctttacaaa gaaaaaatat gaatagcaat 74161 catagatatc ttaccaatcc aattgggttt tttctaaacg gagcctggat acttcatttt 74221 tttagtccaa ccaagccaac cataaattat tctaattgaa tttttctaat tgataatagt 74281 aatatgaatc ccctcaaaaa tggatctaat tgcacttcac gctccaaatt tttgatgatt 74341 aaatttatct ttcttgggtg aaacggggga tatctcgatc gggggagaga acggggaaat 74401 accatatgac ccaatatatc tgacaagtcg cactatacgt caacccaaga tgcatcttcc 74461 tctccaggac ttcggaaagg gacttttgga acaccaatag gcattaaatg aaagaaagaa 74521 ctaaatacta tatttcactt tgaggtggaa acgtaacaat tttttttatt gtctttataa 74581 tattcatatt ggtttttatc gtatttattt tatccataga ttataaaaat tcataaagaa 74641 agacagaatg aataaactca aattattacg aataggtctt tctaatgata aataagtatg 74701 gactcattcg ctcatagaaa atgggatcaa ctcccccatt gcgtattggt acttatcgag 74761 tatagaataa atctgcttct ctttgttcct acgaacagaa ttgttccatt attaccaaca 74821 gaatagaaca cccttgttcg gaaataatcg actgaacaag agtggtccat aggatagtca 74881 tattatagtc ttttccaatg caataaagtt acgtagtgtc tatttatctt tgatataagg 74941 ggtatttcca tgggtttgcc ttggtatcgt gttcataccg ttgtattgaa tgatcccggt 75001 cggttgcttt ctgttcatat aatgcataca gctctggttg ctggttgggc cggttcgatg 75061 gctctgtatg aattagcggt ttttgatcct tctgatcctg ttcttgatcc aatgtggaga 75121 cagggtatgt tcgttatacc cttcatgact cgtttaggaa taaccaattc atggggcggt 75181 tggagtatca caggggggac tgtaacgaat ccgggtattt ggagttacga aggtgtagct 75241 ggagcacata ttgtgttttc tggcttatgc tttttggcag ctatctggca ttgggtgtat 75301 tgggatctag aaatattttg tgatgaacgt acaggaaaac cttctttgga tttgccaaag 75361 atctttggaa ttcatttatt tctctcaggg gtggcttgct ttggttttgg tgcatttcat 75421 gtaacaggct tgtatggtcc cggaatatgg gtgtccgacc cttatggact aacgggaaaa 75481 gtacaacctg taaatccagc gtggggcgtg gaaggttttg atccttttgt tccaggagga 75541 atagcctctc atcatattgc agcaggaaca ttgggcatat tagcgggcct attccatctt 75601 agcgtccgtc cgccacaacg tctatacaaa ggattgcgta tgggaaatat tgaaaccgtc 75661 ctttccagta gtatcgctgc tgtctttttt gcagcttttg ttgttgccgg aactatgtgg 75721 tatggttcgg caacaacccc gattgaatta tttgggccca ctcgttacca atgggatcag 75781 gggtacttcc agcaagaaat atatcgaaga gttagtgctg ggctagcaga aaatcaaagt 75841 ttatcagaag cctggtctaa aattcctgaa aaattagctt tttatgatta catcggcaat 75901 aatccggcaa aagggggatt attcagagcg ggctcaatgg ataacgggga tggaatagcg 75961 gttggatggt taggacaccc tatctttaga gataaagaag gccgtgaact ttttgtacgt 76021 cgtatgccta ctttttttga aacatttccg gtcgttttgg tagatggcga tggaattgtt 76081 agagccgatg ttccttttag aagggcagaa tcgaagtata gtgttgaaca agtaggtgta 76141 actgttgagt tctacggcgg tgaactcaac ggcgtcagtt atagtgatcc tgctactgtg 76201 aaaaaatatg ctagacgtgc tcaattgggt gaaatttttg aattagatcg tgctactttg 76261 aaatccgatg gtgtttttcg tagcagtcca aggggttggt ttacttttgg gcatgcttcg 76321 tttgctttgc tcttcttctt cggacacatt tggcatggtg ctagaacctt gttcagagat 76381 gtttttgctg gtattgaccc agatttagat gctcaagtcg aatttggagc attccaaaaa 76441 cttggagatc caactacaaa aagacaggca gcctgataca acattacttt ggtatctttc 76501 tttcgccctt attttctttc ttttactttt attgacatag ggtaccagag aaatctttat 76561 ttgaatcaac ttcgttttta ctcttgttcg ttctttatcc ggaagatgac aaaaaaaaga 76621 aaataaaaag aaacaaacag gtatgaaagc tataattgta aaccacgatc gaatctatgg 76681 aagcattggt ttatacattc ctcttagtct cgactctagg gataattttt ttcgctatct 76741 tttttcgaga accgcctaaa gttccaacta aaaagaacta aaaaggtgaa ataattcttc 76801 attatctcag ttgaagtact gagcctcccg ataccgggag gctcagtact tcaactagtc 76861 tccatgttcc tcgaatggat ctcttagttg ttgagaaggt tgcccaaaag cggtatataa 76921 ggcgtaccca gtaaaactta caagtaaacc agatataaag atggcgacta gggttgctgt 76981 ttccattctt atcatattta taaaatttca agaccccaat ggatctatga taggatcgtt 77041 tatttacaac ggaatggtat acaaagtcaa cagatctcaa tgaatacaat aggatttatg 77101 gctacacaaa ctgttgaaaa cagttctaga tctggtccaa gacgaactgc ggtaggagat 77161 ttattaaaac cattgaattc ggaatatggt aaagtagctc ctgggtgggg aactactcct 77221 ttgatgggtg tcgcaatggc cttatttgcg gtatttctat ctattatttt ggagatttat 77281 aattcttccg ttttattgga tggaatttca atgaattaga tctataagaa ccgcaaagtt 77341 cttgcttttg agtccaaaat gaatcattta gagctccgat ttctagtcca ttctattttc 77401 ttttggtagt tcgatcgtgg aatttctttg tttctgtatt tccggagtat gagtgtgtga 77461 cttgttataa ttgatcctat tgatagtaca gagaatgggt ctgtcatctt gatagagatg 77521 gttctacttc gtcagatatt tattctaata tttggaacac gaaatagatt aagaaatatt 77581 tgaactatga ttcatactta atattcagac ctcgtgtccg ggctccaaaa aattttcaaa 77641 caaagaattc taatttctaa atcgaaagat tcttttcttt caacccctat ttatattttg 77701 accaaaagca aaacctttct ttgaattttt agtcattcta tttattcagg gaataagtga 77761 tgatccgagg attcttactc agggaatcct tgatttgatt taggttaggt ttttttattg 77821 aatcatcgtg gttctagtat gaatctgagg ttttaatcga ttcatagggt cttaacaaga 77881 gaattcctat caataataaa gaaaacaaat aataaaagcc atattccaca aaaacaaatt 77941 ctagaaagaa atagggaaaa agagaattca agaggcccat aagtatcaaa ataaagataa 78001 agacgactgc gccaacttga tattttggta ttatcgccac aaagaagagc tttcggattt 78061 tccagagaag atgggatcag aacttaataa atttaaaact ttctattcca tatccgttgc 78121 aactagtatt tgggtgtttt tgcttgagct gtacgagatg aaagtctcat atacggttct 78181 cagaggggga gttccgccta tctcaataaa gtatatgatt ggttcgaaga acgtctcgag 78241 attcaagcaa ttgcggatga tataactagt aaatacgttc ctccccacgt caatatattt 78301 tattgtttag ggggaattac gcttacttgt tttttagtac aagtagctac tgggtttgct 78361 atgacttttt actatcgtcc gaccgttact gaggcttttg cttctgttca atacataatg 78421 actgaagcca actttggttg gttaatccga tcagttcatc gatggtcggc aagtatgatg 78481 gtcctaatga tgatcctgca tgtatttcgt gtgtatctca ccggcggatt taaaaaacct 78541 cgcgaattga cttgggttac aggtgtggtt ctggctgtat taaccgcatc ttttggcgta 78601 actggttatt ccttaccttg ggaccaagtc ggttattggg cagtgaaaat agtaacaggt 78661 gtccctgacg ctattcctgt aataggatca cccttggtcg aattattgcg cggaagcgct 78721 agtgtgggac aatctacttt gacccgtttt tatagtttac acacttttgt attgccgctt 78781 cttactgccg tatttatgtt aatgcacttt ccaatgatac gtaaacaagg tatttctggg 78841 cctttataga gaaaagaaaa atagatccta aatatttgta atcaatcatt tatcacttgg 78901 tggaggaata tatagtattt cattgctaca agtatggatt attgaaaata ataagacatg 78961 gatttggata tttcccttta actattcatg tcaactaaac ggggggattg aagggaattt 79021 tgtgaagaga aaatggatta tgggagtgtg tgacttgaac tattgattgg tctgtgtaga 79081 tatatgcctg ccacatggga attcacaacc aaatgtgtct ttgttccaat cgccgtgtaa 79141 gccctataca gaggataggc tggttcgctt aaagagaatc ttttctatga tcaggtccga 79201 atcatgttgt acatgagcag gctccgtaag atccagtata agtgaactag ataaaacgga 79261 atcaagattc cgttttatct agttcactta taagattaaa tagtatgtaa atgtattcat 79321 ttcctctgca gtgacacgat caatactact atcggagtga aacaagggat ctaaagaaga 79381 agagaggcta gactatatta gtaacaagca aaccttgtat gtgtatctcc aaatattttg 79441 gagataaata ccaattagaa ggtctgagac gacccagaaa gcacttgatc atatcatgat 79501 ctgatttgta agcctacttg ggtcttgagt atttacttgt aagaacggaa ttctttgttt 79561 tgtaatggat agttgcaact ccgtaaaaaa gaattcagtc aaatttttct tacattgaac 79621 cattcctata tcatatatgt gtatgtgtaa atacaggtac catatatata ttttatatgg 79681 atatatggag tcatttggtt ctttttattc ttgctcgagc tggatgatta aaaattatca 79741 tgtccagttc cctcggggga tggatctata agaattcacc tatcccaata acaaaaaaac 79801 ctgacttgaa tgatcctgta ttaagagcta aattggctaa aggtatgggt cataattatt 79861 atggagagcc cgcatggccc aatgatcttt tatatatttt tccagtagta attctaggta 79921 ctattgcatg taatgtaggc ttagccgttt tagaaccatc aatgattggt gaaccggcag 79981 atccatttgc aacccctttg gaaatattac ctgaatggta tttctttcct gtatttcaaa 80041 tacttcgtac agtgcccaat aaattattgg gggttctttt aatggtttca gtacctgcgg 80101 gattattaac agtacctttt ttagagaatg ttaataaatt ccaaaatcca tttcgccgtc 80161 cagtagcgac gactgtcttt ttgattggta ccgcagtcgc cctttggttg ggcattggtg 80221 caacattacc tattgataaa tccctaactt taggtctttt ttaaattttt aaattgattc 80281 aattgtgaaa taacacgaca tgtgtatcta gggaatagtt tcttcaaagc gaattctccc 80341 tagatacatc tattcaattt aattctgaat ttattttgaa tatatgatat attaatatat 80401 taattgtgct aaagagtttc aatctatttt cactaagtaa gtccaataga tttaaaactt 80461 attttttgct aaatcaatta cgaaatattt ttctaaaatg cccaatatcc gttttacatc 80521 ttcgctacga aaatgttcaa ttttcataag atcttcttgg ctgttattca aaaggtccaa 80581 caatgtatat atattggaca ttttgaggca attatagatc ctggaaggca attctgattg 80641 gtcaataaaa atcgatttca atgctatttt ttttttgttt tttatgagtt tagccaattt 80701 atcatgaaag gtaaaagggg ataaaggaac cgtgtgttga ttgtcctgta aatataagtt 80761 gtcttcctcc atatgtaaaa agggaataaa taaatcaatt aaatttcggg atgcttcatg 80821 aagtgcttct ttcggagtta aacttccgtt tgtccatatt tcgagaaaaa gtatctcttg 80881 tttttcattc ccattcccat aagaatgaat actatgattc gcgtttcgaa caggcatgaa 80941 tacagcatct ataggataac ttccatcttg aaagttatgt ggcgttttta taagatatcc 81001 acgatttctc tctatttgta atccaataca aaaatcaatt ggttccgtta aactggctat 81061 atgttgtgta ttatcaacga tttctacata aggcggcaag atgatatctt gggcagttac 81121 agatccagga cccttgacac aaatagatgc gtcagaagtt ccatatagat tacttcttaa 81181 tataatttct ttcaaattca ttaaaatttc atgtaccgat tcttgaatgc ccgttatggt 81241 agaatattca tgtgggactt tctcagattt tacacgtgtg atacatgttc cttctatttc 81301 tccaagtaaa gctcttcgca tcgcaatgcc tattgtgtcg gcttggcctt tcataagtgg 81361 agacagaata aagcgtccat aataaaggcg tttactgtct gttcttgatt caacacactt 81421 ccactgtagt gtccgagtag atactgttac tttctctcga accatagtac tattatttga 81481 ttagatcatc gaatctttta tttctcttga gatttcttca atgttcagtt ctacacacgt 81541 ctttttttcg gaggtctaca gccattatgt ggcataggag ttacatcccg tacgaaagtt 81601 aatagtatac cacttcgacg aatagctcgt aatgctgcat ctcttccgag accgggacct 81661 tttatcatga cttctgctcg ttgcatacct tgatccacta ctgtacggat agcgtttgct 81721 gctgcggttt gagcagcaaa cggtgttcct cttctcgtac ctttgaatcc agaagtaccg 81781 gcggaggacc aagaaactac tcgaccccgt acatctgtaa cagtgacaat ggtattattg 81841 aaacttgctt gaacatgaat aactcccttt ggtattctac gtgcaccctt acgtgaacca 81901 atacgtccat tcctacgcga actaattttc ggtatagctt ttgccatatt ttatcatctc 81961 gtaaatatga gtcagagata tatggatata tccatttcat gtcaaaacag attctttatt 82021 tgtacatcgg ctcttctggc aagtctgatt atccctgtct ttgtttatgt ctcgggttgg 82081 aacaaattac tataattcgt ccccgcctac ggattagtcg acatttttca caaattttac 82141 gaacggaagc tcttattttc atatttctca ttccttacct taattctgaa tctatttctt 82201 ggaagaaaat aagtttcttg aaatttttca tctcgaattg tattcccacg aaaggaatgg 82261 tgaagttgaa aaacgaatcc ttcaaatctt tgttgtggag tcgataaatt atacgccctt 82321 tggttgaatc ataaggactt acttcaattt tgactctatc tcctggcagt atccgtataa 82381 aactatgccg gatctttcct gaaacataat ttataatcag atctaaacaa acccggaaca 82441 gaccgttggg aaggcgattc agtaattaaa gcttcatgac tcctttttgg ttcttaaagt 82501 ccctttgagg tatcaactaa taagaaagat attagacaac cccccttttt tctttttcac 82561 aaataggaag tttcgaatcc aatttggata ttaaaaggat taccagatat aacacaaaat 82621 ctctccacct attccttcta gtcgagcctc tcggtctgtc attatacctc gagaagtaga 82681 aagaattaca atccccattc cacctaaaat tcgcggaatt cgttgataat tagaatagat 82741 tcgtagacca ggtcgactga ttcgttttaa atttaaaata tttctatagg gtcttttcct 82801 attccttcta tgtcgcaggg ttaaaaccaa aaaatatttg tttttttctc gatgttttct 82861 cacgttttcg ataaaacctt ctcgtaaaag tatttgaaca atattttcgg taatattagt 82921 agatgctatt cgaaccaccc tttttcgatc catatcagca tttcgtatag aagttattat 82981 ctcagcaata gtgtccctac ccatgatgaa ctaaaattat tggggcctcc aaatttgata 83041 taatcaacgt gttttttact tatttttttt ttgaatatga tatgaattat taaagatata 83101 tgcgtgagac acaatctact aattaatcta tttctttcaa ataccccact agaaacagat 83161 cacaatttca ttttataata cctcgggagc taatgaaact attttagtaa aatttaattc 83221 tctcaattcc cgggcgattg caccaaaaat tcgagttcct tttgatttcc ttccttcttg 83281 atcaataaca actgcagcat tgtcatcata tcgtattatc atcccgttgt cacgtttgag 83341 ttctttacag gtccgcacaa ttacagctct gactacttct gatctttcta ggggcatatt 83401 tggtacggct tctttgatca cagcaacaat aacgtcacca atatgagcat atcgacgatt 83461 gctagctcct atgattcgaa tacacatcaa ttctcgagcc ccgctgttat ccgctacatt 83521 taaatgggtc tgaggttgaa tcattttttt aatccgttct ttgaatgcaa agggcgaaga 83581 aaaaaaagaa atatttttgt ccaaaaaaaa agaaacatgc ggtttcgttt catatctaag 83641 agccctttcc gcattttttt ctattacatt acgaaataat gaattgagtt cgtataggca 83701 ttttagatgc tgctagtgaa atagcccttc tggctatatt ttctgttact ccacccattt 83761 cataaagtat tcgacccggt ttaacaacag ctacccaata ttcaggggat ccttttcctg 83821 aacccatacg tgtttctgcg ggtcttagtg taactggttt gtctggaaat atacgtaccc 83881 atatttttcc accacgacgt gcatttcgtg tcattgctcg tcggcctgct tctatttgtc 83941 tagatgtaat ccaagcaggt tcaagtgcct gaagagcata tttaccgaaa gaaatatgat 84001 tacctcgatg agatattccc ttcattcttc ctctatgttg tttacggaat ctggttcttt 84061 tggggttata gttgatggtt gtttctgaat tccatctcta ctacagaacc ggacgtgaga 84121 gtttcttctc atccagctcc tcgcgaataa aaggattcaa aaaatttaat tagaattaag 84181 ctagaatagt caatcttaag ttaagatata tatgtattta ctgagtaata ccttgaacgt 84241 gggattcttt gagatttcat tcaatctatt agtaatttgt atatcttgtt tgaatagata 84301 actaaacttt tgagttttat aaatagaaat ctaaaaaaaa attgtattat tataccaaat 84361 ccttattttg tcctttattg tattgtccta aattttgcaa taaaaaaagt tttcgcgggc 84421 gaatattgac tctttcaatc cctatttcat ttgtagggtt aactcgtgac ttctcagatc 84481 tccgaataca tgaattaatc tctggttcgt tccgccatcc cgaccagtga atcattaaga 84541 ttcctttttc aatagaatct tttgcattca caagttccgt cgttcccatc acttcttact 84601 taatggttag gtccgaattc tacaatggag ctcagaatga aattggttct tgagtcaatc 84661 ttctcagtct ttattggctc gaagctcttg attttttgtt ctatttctat aagaagattc 84721 attttattat ggtatgaatg cgtattgatg ctttattaca ctgcctttta tgagattact 84781 catagacctt acatattgga attttatatc attggtattc tttttctctc tttctctcat 84841 ccttccattt atccacatct tttttgtcta ttttgcttta caacttagaa tcagatttcc 84901 ttttttgttt atgcaaaaga tttcagttgc tacaaagata tgacctatat atcatatctt 84961 gactggttct ttagatccag ataatgcgaa gtgatgggtt ggttattagt tctatagttt 85021 ttagttcata ctatgtgggc tggtcttttt taatcctaac cctaaaaaac caacggagtc 85081 acacactaag catagcaatt atatcaaatg gtcaatcgaa tttttattca accttataga 85141 attaagaatt agaaatgttt cccttgattg attagaaaaa gaatgaattt gtcttttttt 85201 gttcaatcat tggatagaag ggaaagacaa gtagtaaaat tattcctcgt ctagaaatat 85261 ccaaattttg atgcccaata ctccatagat agttcgaact gtataagagc aataatcaat 85321 tttcgctcga atcgtttgta ggggaaccct accttctctg atccattcga cacgtgcaat 85381 ttcttttccg tcgatacgcc ccgcaatttg tatttgaatt ccttttgtat ctgcttgttc 85441 tgttaattca atagcctttt tcattgcttt tcgaaaggaa actctattct ttaattgtcc 85501 agctataaat tctgcaagaa tattagggtt tccataaggt tttgcaattc ttgtgacagc 85561 aatgttcagt tttcggttta cacaatgaaa ttctttttgt aaggtcgttt gtaattcttc 85621 gattccgcgc ggtcgacttt ctattaataa ttttgggaat cccataaaga ttatgacctg 85681 gatcagatcg attctttttt gaatctctat acgtgcaatt ccctcgacgc cagaggacgt 85741 tctcatattc ttttgtacat aattcttgat acaatctctt attttttgat cttcttgtaa 85801 accttcagaa taattttttg gttgtgaaaa ccaaagggaa tgatgacctt gggttgtacc 85861 cagtctgaaa ccaagtggat ttattttttg tcccataatc ccccactatt atacatatca 85921 cgatacggca tagctgtaga tttttttttc catctcgttt tttttaacga atacatctct 85981 acatattcat catctaaaga tatatctttc attacaatag ttatatgaca ggtcgatctt 86041 tttattggaa aactacgtcc tcgagctcga ggtttcaatt tcttcacagt agtacctcca 86101 ttgacttcgg ctttactaat gactaaattg gcttcgctgg aacccatatt gtaactagca 86161 tttgctgctg cagaataaat caatttcaaa atgggataac atgctcgata gggcatgagt 86221 tctagtatca taagcgtttc ctcataggaa cggccgcgaa tttgattaat tactcttcgt 86281 gctttgtcag cagacataga tatatgttca cctaaagcat atacttctgt ttttttcttc 86341 tttagcataa ggtttgcctc ctactactga atcataagca tctagatttt ttttattaat 86401 attaacgacg agatctatta tcgctttttg catgtcctct aaaatttaat gtaggtgcaa 86461 attctcccaa tttgtggcct accatactat ccgttatata aataggcaaa tgctcttttc 86521 cattatggat agcaatcgta tgaccgatca ttgtgggtat aatggtagat gcccgggacc 86581 aagttactat tatttctttt tctgcttttg tgttaagctt atcaattttt tttaataaat 86641 gattggctac aaagggattt ttttttagtg aacgtgtcac aagcttactc ctattttttt 86701 tttttttgta aaaacgaaga atttaattcg attttctctc ctatttacta cggcgacgaa 86761 gaatcaaatt atcactatat ttattccttt ttctacttct tcttccaagt gcaggataac 86821 cccaaggggt tgtgggtttt tttctaccaa ttggggctct cccttcacca cccccatggg 86881 gatggtctac agggttcata actactcctc ttactacagg acgcttacct agccaacgct 86941 tagatccggc tctacccaaa cttttctggt tcaccccaac attccccact tgtccgactg 87001 ttgctgagca gtttttggat atcaaacgga cctccccaga aggtaatttt aatgtggccg 87061 atttcccctc ttttgcaatc agtttcgcta cagcacccgc tgctctagct aattgtccac 87121 cctttccaag tgtgatttct atgttatgta tggccgtgcc taagggcata tcggttgaag 87181 tagattcttc ttttgatcaa tcaaaacccc ttcccaaact gtacaagctt cttccaaagc 87241 atacttcttt ctggatgtag atgatgatat ctatacagat ggatcttata tatatcgtag 87301 aatgaagtac cacatgggtg gatatatata tgaatccaaa tctgccgaat cactcatgtt 87361 atgatcttct acatcctggg tcttcccgtt ccgtcatctg gcttatgttc ttcatgtagc 87421 attcagaccg aatgactcta tgaaattacg tcgatacttc cacatattat gggtaacgta 87481 ggagacatct ctatttttcc cccggggaat ctttagaatt cccactgctt aactttcaat 87541 tcgcctctga ccatcaaatg aaatgtgaat aacccgtcct cctctctttg aaagaagggg 87601 cgcttccggt tctgtcggtg cttgaaacaa ttttgtcttc tccatattac tatatctcta 87661 gagtcaataa ttttatatga ggaactactg aactcaatca cttgctgccg ttactcttca 87721 gttttctgtt gaggtctatc ctgcagaggt actcaaattg gatcagtgat cgatttctag 87781 gtttcgtcgt aaacctaatt ggttatttcc aattacgtaa atcaatagtt caaaccgcac 87841 tcaaaggtag ggcatttccc atttttatag gaacttctgt accagaaaca atggtatctc 87901 caattatagc ccctctggga tgtaaaatat atctcttctc accatcccca tagtgtatga 87961 gacaaatgta tgcatttcga ttagggtcgt attctatggt tacgattcta ccatatatgt 88021 ctttttcatt ccgtcgaaaa tcgattttac ggtatagacg cttatgacct ccccctctat 88081 gccttgcggt aatgattcct ctggcattac gacctttacc acaatgatgc tgtccataga 88141 tcaaattatt tcgtggattg gatttcactt gactgtctac ggttccattg cgtgtgctcg 88201 gggtagaagt tttgtataaa tgtatcgcca tgctattaag tatttttttt taagttcttt 88261 tctttctaag aggtggaata gaataacccg gttgaagcgt aatgatcata cgtctgtaat 88321 gcattgtatg tcccataata ggtcccattc ttctactctt tcccggaagt cgatgactat 88381 tcatagctat taccttgaca ccaaagaaga gttcgaccca atgctttatt tctgtcctag 88441 ttgatcctga ttcgacatta gaagtatatt gatttttccc caataaccga atacttttgt 88501 ctgtaaatac tgcatatttg attccatcca taaatcgatt ttcttcccta tgagttatag 88561 tctcaataag aatgctagtt cttactgttc atatattatg atatgaatat accacaccaa 88621 ttcgttatgt atggatgatg agattccatt gatacagagc caattccaat agacttattg 88681 gagggtccca ttggcgtgca tccagtagga attgaaccta cgaattcgcc aattatgagt 88741 tgggcgcttt aaccattcag ccatggatgc ttagcgggga tcctcgtaca tggtgaataa 88801 ccaaattcca attgaaatga aatctttagg ataaatcaat gcaatttagt taggataaat 88861 caatgcaatt taggaggaat caatgagagg acatcaattc aaatcctgga ttttcgaatt 88921 gagagagata ttgagagaga tcaagaattc tcaccatttc ttagattcat ggacccaatt 88981 caattcagcg ggatccttca ttcacatttt tttccaccaa gaacgttttc taaaactctt 89041 tgacccccga atttggagta tcctactttc acgcaattca cagggttcaa caagcaatcg 89101 atatttcacg atcaagggtg taatactctt tgtagtagcg gtccttatat atcgtattaa 89161 caatcgaaat atggtcgaaa gaaaaaatct ctatttgata gggcttcttc ctatacctat 89221 gaattccatt ggacccagaa atgatacatt ggaagaatcc gttgggtctt ccaatatcaa 89281 taggttgatt gtttcgctcc tgtatcttcc caaaggaaaa aagatctctg agagttgttt 89341 cctgaatccg aaagagagta cttgggttct cccaataact aaaaagtgta gcatgcctga 89401 atctaactgg ggttcgcgtt ggtggaggaa ctggatcgga aaaaagaggg attctagttg 89461 taagatatct aatgaaaccg tcgctggaat tgagatctta ttcaaagaga aagatctcaa 89521 atatctggag tttctttttg tatattatat ggatgatccg atccgcaagg accatgattg 89581 ggaattgttt gatcgtcttt ctctgaggaa gagtcgaaat agaatcaact tgaattcggg 89641 accgctattc gaaatcttag tgaaacactg gatttcttat ctcatgtctg cttttcgtga 89701 aaaaatacca attgaagtgg agggtttctt caaacaacaa ggggctgggt caactattca 89761 atcaaatgat attgagcatg tttcccatct cttctcgaga aacaagtggg ctatttcttt 89821 gcaaaactgt gctcaatttc atatgtggca attccgccaa gatctcttcg ttagttgggg 89881 gaagaatccg cccgaatcgg attttttgag gaacgtatcg agagagaatt ggatttggtt 89941 agacaatgtg tggttggtaa acaaggatcg gttttttagc aaggtacaga atgtatcgtc 90001 aaatattcaa tatgattcca caagatctag tttcgttcaa gtaacggatt ctagccaact 90061 gaaaggatct tctgatcaat ccagagatca tttggattcc attagtaatg aggattcgga 90121 atatcacaca ttgattaatc aaagagagat tcaacaacga aaagaaagat cgattctttg 90181 ggatccttcc tttcttcaaa cggaacgaaa agagatagaa tcaggccgat tcccgaaatg 90241 cctttctgga tattcctcaa tgtcccggct attcacggaa cgtgagaagc agatgattaa 90301 tcatctgttt ccggaagaaa tcgaagaatt tcttgggaat cctacaagat ccgttcgttc 90361 ttttttctct gatagatggt cagaacttca tctgggttcg aatcctactg agaggtccac 90421 tagggatcag aaattgttga agaaacaaca agatctttct tttgtccctt ccaagcgatc 90481 ggaaaataaa gaaatggtta atatattcaa gataattacg tatttacaaa ataccgtctc 90541 aattcatcct atttcatcag atccgggatg tgatatggtt ccgaagatga accggatatg 90601 gacagttcca ataagatttc attcttgaac aaaaatccat tttttgattt atttcatcta 90661 ttccatgacc ggaacagggg aggatacacg ttacactacg attttgaatc agaagagaga 90721 tttcaagaaa tggcagatct attcactcta tcaataaccg agccggatct ggtgtatcat 90781 aagggatttg ccttttctat tgattcctgc ggattggatc aaaaacaatt cttgaatgag 90841 gccagggatg aatcgaaaaa gaaatcttta ttggttctac ctcctatttt ttatgaagag 90901 aatgaatctt tttctcgaag gatcagaaaa aaatgggtcc ggatctcctg cgggaatgat 90961 ttggaagatc caaaaccaaa aatagtggta tttgctagca acaacataat ggaggcagtc 91021 actcaatata gattgatccg aaatctgatt caaatccaat atagtaccta tgggtacata 91081 agaaatgtat tgaatcgatt ctttttaatg aatagatccg atcgcaactt cgaatatgga 91141 attcaaaggg atcaaatagg aaaggatact ctgaatcata gaactataat gaaatatacg 91201 atcaaccaat atttatcgaa tttgaaaaag agtcagaaga aatggttcga gcctcttatt 91261 ttgatttctc gaaccgagag atccatgaat cgggatcctg atgcatatag atacaaatgg 91321 tccaatggga gcaagaattt ccaggaacat ttggaacagt ccgtttcgga gcagaagagc 91381 cgttttcaag tagtgttcga tcgattacgt attaatcaat attcgattga ttggtctgag 91441 gttatcgaca aaaaagattt gtctaagcca cttcgtttct ttttgtccaa gtcacttctt 91501 tttttgtcca agttgctttt ctttttgtct aactcacttc cttttttctg tgtgagtttc 91561 ggaaatatcc ccattcatag gtccgagatc tacatctatg aattgaaagg tccgaatgat 91621 caactctgca atcagttgtt agaatcaata ggtcttcaaa ttgttcattt gaaaaaatgg 91681 aaacccttct tattggacga tcatgatact tcccaaaaat cgaaattctt gatcaatgga 91741 ggaacaatat caccattttt gttcaataag ataccaaagt ggatgattga ctcattccat 91801 actagaaata atcgcaggaa atcctttgat aacccggatt cctatttctc aatgatattc 91861 cacgatcaag acaattggct gaatcccgtg aaaccatttc atagaagttc attgatatct 91921 tctttttata aagcaaatcg acttcgattc ttgaataatc cacatcactt ctgcttctat 91981 tggaacacaa gattcccctt ttctgtggaa aaggcccgta tcaataattc tgattttacg 92041 tatggacaat tcctcaatat cttgttcatt cgcaacaaaa tattttcttt gtgcgtcggt 92101 aaaaaaaaac atgctttttg ggggagagat actatttcac caatcgagtc acaggtatct 92161 aacatattca tacctaacga ttttccacaa agtggtgacg aaacgtataa cttgtacaaa 92221 tctttccatt ttccaagtcg atccgatcca ttcgttcgta gagctattta ctcgatcgca 92281 gacatttctg gaacacctct aacagagggg caaatagtca attttgaaag aacttattgt 92341 caacctcttt cagatatgaa tctatctgat tcagaaggga agaacttgca tcagtatctc 92401 aatttcaatt caaacatggg tttgattcac actccatgtt ctgagaaaga tttatcatcc 92461 gaaaagagga aaaaacggag tctttgtcta aagaaatgcg ttgagaaagg gcagatgtat 92521 agaacctttc aacgagatag tgctttttca actctctcaa aatggaatct attccaaaca 92581 tatatgccat ggttccttac ttcgacaggg tacaaatatc taaatttgat atttttagat 92641 actttttcag acctattgcc aatactaagt agcagtcaaa aatttgtacc catttttcat 92701 gatattatgc atggatcagg tatatcatgg cgaattcttc agaaaaaatt gtgtcttcca 92761 caatggaatc tgataagtga gatctcgagt aagtgtttac ataatcttct tctgtccgaa 92821 gaaatgattc atcgaaataa tgagtcacca ttgatatcga cacatctgag atcgccaaat 92881 gctcgggagt tcctctattc aatccttttc cttcttcttg ttgctggata tctcgttcgt 92941 acacatcttc tctttgtttc ccgggcctct agtgagttac agacagagtt cgaaaaggtc 93001 aaatctttga tgattccatc atctatgatt gagttgcgaa aacttctgga taggtatcct 93061 acatctgaac cgaattcttt ctggttaaag aatctctttc tagttgctct ggaacaatta 93121 ggagattctc tagaagaaat acggggttct gcttctggcg gcaacatgct tggtcccgct 93181 tatggggtca aatcaatacg ttctaagaag aaagattgga atatcaatct catcgagatc 93241 atcgatctca taccaaatcc catcaatcga atcacttttt cgagaaatac gagacatcta 93301 agtcatacaa gtaaagagat ctattcattg ataagaaaaa gaaaaaacgt gaacggggat 93361 tggattgatg ataaaataga atcctgggtc gcgaacagtg attcgattga tgatgaagaa 93421 agagaattct tggttcagtt ctccacctta acgacagaaa ataggattga tcaaattcta 93481 ttgagtctga ctcatagtga tcgtttatca aagaatgact ctggttatca aatgattgaa 93541 caaccgggag caatttactt acgatactta gttgacattc ataaaaagca tctaatgaat 93601 tatgagttca atccatcctg tttagcagaa agacggatat tccttgctca ttatcagaca 93661 atcacttatt cacaaacttc gtgtggggaa aatagttttc atttcccatc tcatggaaaa 93721 cccttttcgc tccgcttagc cttatccccc tctaggggta ttttagtgat aggttctata 93781 ggaactggac gatcctattt ggtcaaatac ctagcgacaa actcctatgt tcctttcatt 93841 acggtatttc tgaacaagtt cctggataac aagcctaaag gttttcttct tgatgagatc 93901 gatattgatg atagtgacga tattgatgat agtgacaatc ttgatgctag tgacgatatc 93961 gatcgtgacc ttgatacgga gctgaaactg ctaactagga tgaatgggct aactatggat 94021 atgatgccgg aaatagaccg attttatatc acccttcaat tcgaattagc aaaagcaatg 94081 tctccttgca taatatggat tccaaacatt catgatctgg atgtgaatga gtcgaatgac 94141 ttagccctcg gtctattagt gaaccatctc tccagggatt gtgaaagatg ttctactaga 94201 aatattcttg ttattgcttc gactcatatt ccccaaaaag tggatcccgc tctaatagct 94261 ccgaataaat taaatacgtg cattaagata cgaaggcttc ttcttccaca acaacgaaag 94321 cactttttca ctctttcata tactagggga tttcacttgg aaaagaaaat gttccatact 94381 aacggattcg ggtccataac catgggttcc aatgcacgag atcttgtagc acttaccaat 94441 gaggtcctat cgattagtat tacacagaag aaatcaatta tagacactaa tacaattaga 94501 tccgctcttc atagacaaac ttgggatttg cgatcccagg taagatcggt tcaggatcat 94561 gggatccttt tctatcagat aggaagggct gtagcacaaa atgtacttct aagtaattgc 94621 cccatagatc ctatatctat ctatatgaag aagaaatcat gtaacgaagg ggattcttat 94681 ttgtacaaat ggtacttcga gcttggaacg agcatgaaga gattaacgat acttctttat 94741 cttttgagtt gttctgccgg atcggtcgct caagatcttt ggtctttatc cggacccgat 94801 gaaaaaaatg ggatcacttc ttatggactc gttgagaatg attctgatct agttcatggc 94861 ctattagaag tagaaggcgc tctggtggga tcttcacgga cagaaaaaga ttgcagtcag 94921 tttgataatg atcgagtgac attgcttctt cggcccgaac cgaggaatcc cttagatatg 94981 atgcaaaacg gctcttgttc tatccttgat cagagatttc tctatgaaaa atatgaatcg 95041 gagtttgaag aaggggaggg agaaggagcc cttgacccgc aggaggattt attcaatcac 95101 atagtttggg ctcctagaat atggcgccct tggggctttc tatttgattg tatcgaaagg 95161 cccaatgaat tgggatttcc ctattggtcc aggtcatttc ggggcaagcg gatcatttat 95221 gatgaagagg atgagcttca agagaatgat tcggagttct tgcagagtgg aaccatgcag 95281 taccagacac gagatagatc ttccaaagaa caaggccttt ttcgaataag ccaattcatt 95341 tgggaccctg cagatccact ctttttccta ttcaaagatc agccccctgg ctctgtgttt 95401 tcacatcgag aattatttgc agatgaagag atgtcaaagg ggcttcttac ttcccaaaca 95461 gaccctccta catctatata taaacgctgg tttatcaaga atacgcaaga aaagcacttc 95521 gaattgttga ttaatcgtca gagatggctt agaaccaaca gttcattatc taatggatct 95581 ttccgttcta atactctatc cgagagttat cagtatttat caaatctgtt cctatctaac 95641 ggaacgctat tggatcaaat gacaaagaca ttgttgagaa aaagatggct tttcccggat 95701 gaaatgaaaa ttggattcat gtaacaggag aaagatttcc cattccttag ccggaaagat 95761 atgtggccat gaaagaggga ttaagtggaa cagaattgac tgggtggtag agtcgtggaa 95821 acgcttgttt cttccatatt ttggacctta gctccatgga agaatatgtt actgctgaaa 95881 cacggaagaa ttgaaatctt agatcaaaac actatgtatg gatggtatga actgcctaaa 95941 caagaattct tgaacagcaa acaaccagtt cagatattca cgaccaagaa gtactggatt 96001 ctctttcgga taggccctga aaggagaagg aaggctggaa tgccaacagg cgtctattat 96061 attgaattta cccgatagtc cccattttgg gaacgtccag tgccaaagtc actgaatggg 96121 taagtcgcca atccctggac tatgtaatgt actttatctg ctgggttacg ggcgggcatt 96181 ttaccagagg tttctaatct acccttgtgt gattcctgtt gaagcatata ctcggggggt 96241 gggtgcaggg cggacgattt taaagcggac tccccattca ttagatagag aagatcacca 96301 agatttcgcg atccgctgcc gaatttattc caattccaag agctcggatc gaatcggtat 96361 atcaataccg attcgatccg agctctctta ttgagaatgc tcattcaatg agcattctca 96421 atattatgcc ttgaagagga ctcgaacctc cacgctattt agcacgagat tttgagtctc 96481 gcgtgtctac catttcacca ccaaggcatc ttgaaagtga atcgtattcc atgaatatga 96541 tatctatcta gtgtgatgta tggaatatat gacaaaggtg gatctattga tcggtcatgt 96601 catataggcc cgagttggac atccaattgc ttcgatttga attatccgga gaatgcaatg 96661 cctgatatat atcaaaaaga tggacaatca aacctatttc tcgattcact caaagaggtg 96721 aatagggtcc caatagagat atgtaaaaag caggtccgat tacgcgtatt cctaatccta 96781 aatggaatgt aatgatgtag gaatccatat gtaaacatag tatctattta gataggcccg 96841 aatgacccct tctcataatg agaatgtata taaccctatt ccggcctggt ccggtatgga 96901 atgaacttat aatcatggaa tcgactcgat catcagatta taagttcata accctagccc 96961 attcccattt tgggcggaac agatctacta attctttgat tccagttagt aagagggatc 97021 ttgaactaag aaatagaccc tagaagctaa aaaaggctat cctgagcaat tgcaataatt 97081 gggttcattg atattcctgg tatagtagat gctatcacac atacaatcat actcaattcg 97141 atggaattgt ttgatcttaa aggggatctt ctataatttc gcacgtgagg ggttatttct 97201 tggtttcgtc cagtcattaa taactttatt atttttagat aatagtagat agaaacaacg 97261 cttgtaagga gtcctattaa aaccaagaaa tataggcctg cctgccatcc acaccagaat 97321 aaatagagtt ttccgaaaaa acctgctagt ggaggaagac ctcctaggga taagagacat 97381 agggctaaag agagagccaa aaaaggatct tttgtgtata atcctgcata atctcgaatg 97441 ttatcagttc cggtacgtag accaaataat acaatgcaag caaaagttcc tagattcatg 97501 gagatataga acagcatata agttatcatg cttgcatatc catcatttga gtctccaaca 97561 attattccaa taattacata tccgatttgg cctatggacg aatatgcaag catacgtttc 97621 atgcttgttt gagtaatagc aatgagattt cccaatatca tgctaagaat agctaggatt 97681 tccagaagaa gatgccattc gtttgatgag aaataaaaag gaatatcgaa aattcgagtg 97741 gctgaagctg aagcagctac tttcgaagta acagaaagaa aagcaacgac tggagtggga 97801 gagtcagagt cgaaaagagg attcctcact tctttctctc attcaaaacc gtgcatgaga 97861 ctttcatctc acacggctcc taagtgataa aagaaagaag aacccatttt ctttcttttt 97921 tgattacctt cctcgcgtat gtataagacc gaatccattc gatttctaaa aaggattact 97981 aatccttaac ttttcgagga atccttcatc agtggttgtg aatgactgat tttttcaatc 98041 ttttcgacct tggtttcgta ggagcaagtc agaaagattg agaaatagaa ccatctgatt 98101 taattcgttc tcaatagcca cgagatgatc atcttagggt gatccttttg tcgacggatg 98161 ctcttattac actcgtagtc tctgaaggat gagaaccaac tatgtagcat ctacatcgag 98221 aattcaagta ttgtatacgt cattagtccg atcctttgta ggaactaccc gtaataacga 98281 acttgcaaaa tggatctgtt tatcataaag agattcgtcg ttcctgaccc tgcttcacct 98341 taattgttat ttgaacaagt aaaagttctg tcttggtccg agtggggata gcatttctct 98401 tctgcatgtc catggagttt tgaaaaatcc aaacatctca gagatagata gagaggtagg 98461 aatttctcga acgaaccgca ctccttcgta tacgtcagga gtccattgat gagaaggggc 98521 tggggaaagc ttgaacccaa ttcctacggt aatgaatatg agcgcaattg aaattcctgg 98581 ggagttatac atttgtgtat tgataagacc gtttactatt tcttgaagct caatctctcc 98641 cccggatgaa ccatatagcc aagagaaacc atgaaccaga atagaagagc ttgccccacc 98701 catgagtaaa tatttcatag tagcctcatt agaccgtaca tctttcttgg tatatccaga 98761 taataggtag gagcataaac tgaaacattc tggggctaca aagatagtta ttaaatcgtt 98821 agcaccgcat aaaaacattc cccctagagt agctgttaat acgaataaga gaaactctgt 98881 tatagccatt tctgtacatt caatgtactc tacggataga ggaatacata gagttgaaca 98941 tagtaaaata agaaattgaa agatttcgtt gaaattgttc gtttggaaat ttcccgaaaa 99001 gctaatcata ggttcttctc tccatcggaa caatagggcc gttatgctca ttactaaact 99061 tgttgaagag atgaaatata accaaggtat atctttttga tcagaggttg aatcgatcat 99121 cagaagaaga attaggccaa aaattaggat acattctggg aaaatcaaac ttccatcgaa 99181 gagaagcaaa tgaaaggctt tcataaaaat tctcgtagaa tcgagaatga agttttcatt 99241 ctgtacatgc cagatcatga attagtaact gcttccaatt tccaaaaaaa atcccaattg 99301 tgtcgaactt tccatttttg gaatagttac ggaatctcca tgaataggat caaaccttat 99361 tccatggtat ttacatgagg ttcctcttta agaaagtccc cgagaggctt agttgatcca 99421 tgatttatgt ttcatctttc cttttcgttt gtttcgagaa atctatcgat caattccgat 99481 tctttctttt tctcttgatt cttttccgat cgagatgtat agatcctgtt catggattaa 99541 cgaaaatgtg caaaagctct atttgcctct gccattctat gagtctcttc ctttttgcgt 99601 atggcatcgc cactcccttt ggcagcatcc actaattcgg aacttaattt gaaagccata 99661 tttcgacccg gacgttttcg ggatgccgct aataaccaac gaatggcaag tgcttttcct 99721 tgtgtggatc ctatttcaat gggaacttga tgagtcgatc cacctacacg tcttgctttt 99781 actgttatat cgggagttac tccacgtatt gcttgacgta aaacggatag tggatttgtt 99841 tctgtctttt gttgaatctt tttcacggct cgatagataa tttgataagc caatgatttt 99901 tttccgtgtt tcagaatacg gttaaccaac atgttaacta atcgattacg ataaattgga 99961 tcggattttg ctgttttttt ttctgcagta cctcgacgtg acatgagcgt gaaaggggtt 100021 caagaatcag ttttcttttt ataagggcta aaatcactta ttttggcttt tttaccccat 100081 attgtagggt ggatctcgaa agatatgaaa gatctccctc caagccgtac atacgacttt 100141 catcgaatac ggctttccgc agaattctat atgtatctat gagatcgagt atggaattct 100201 gtttactcac tttaaattga gtatccgttt ccctcccttt cctgctagga ttggaaatcc 100261 tgtattttac atatccatac gattgagtcc ttgggtttcc gaaatagtgt aaaaagaagt 100321 gcttcgaatc attgctattt gactcggacc tgttctaaaa aagtcgaggt atttcgaatt 100381 gtttgttgac acggacaaag tcagggaaaa cctctgaaat tatttcaata ttgaaccttg 100441 gacatataag agttccgaat cgaatctctt tagaaagaag atcttttgtc tcatggtagc 100501 ctgctccagt ccccttacga aactttcgtt attgggttag ccatacactt cacatgtttc 100561 tagcgattca catggcatca tcaaatgata caagtcttgg ataagaatct acaacgcact 100621 agaacgccct tgttgacgat cctttactcc gacagcatct agggttcctc gaacaatgtg 100681 atatctcaca ccgggtaaat ccttaaccct tccccctctt actaagacta cagaatgttc 100741 ttgtaaatta tggccaatac cgggtatata agcagtgatt tcaaatccag aggttaatcg 100801 tactctggca actttacgta aggcagagtt tggttttttt ggggtgatag tggaaaagtt 100861 gacagataag tcacccttac tgccactcta cagaaccgta catgagattt tcacctcata 100921 cggctcctcg ttcaattctt tcgaattcat tggatccttt ccgcgttcga gaatcccccc 100981 cttcttccac tccgccccga agagtaacta ggaccaattt agtcacgttt tcatgttcca 101041 attgaacact gtccattttt gattattctc aaaggataag attattctct ttaccaaaca 101101 tatgcggatc caatcacgat cttatatata agaagaacaa aagatctttc ttgatcaatc 101161 cctttgcccc tcattcttca agaataagga agatcctttt caagtttgaa tttgttcatt 101221 tggaatctgg gttcttctac ttcatattta tttaatatga atattttccc tctctttttt 101281 ttatatcatt ccttaagtcc cataggtttg atcctgtaga atttgaccca ttttctcatt 101341 gaacgaaagg tacgaaataa atcagattga taaaagtacc atgtgaaatc ttcggttttt 101401 ccccttcctc gatccctatc ccataggtta ggtacagtgt ttgaatcaat agagaacctt 101461 ttcttctgta tgaatcgata ttattccatt ccaaatcctt cccgatacct cccaaggaaa 101521 atctcgaatt tggatcccaa attgacgggt tagtgtgagc ttatccatgc ggttatgcac 101581 tctttgaata ggaatccgtt ttctgaaaga tcctggcttt cgtactttgg tgggtctccg 101641 agatcctttc gatgacctat gttgaaggga tatctatcta atccgatcga ttgcgtaaag 101701 cccgcggtag caacggaacc ggggaaagta tacagaaaag acagttcttt tctattatat 101761 tagtattttc tattatatta gatatattag actattatat tagattagta ttagttagtg 101821 atcccgactt agtgagtctg atgaattgtt ggcaccagtc ctacattttg tctctgtgga 101881 ccgaggagaa aaggggctcg gcgggaagag gagtgtacca tgagagaagc aaggaggtca 101941 acctctttca aatatacaac atggattctg gcaatgtagt tggactctca tgtcgatccg 102001 aatgaatcat cctttccacg gaggtaaatc tttgcctgct aggcaagagg atagcaagtt 102061 ccaaattctg tctcggtagg acatgtattt ctattactat gaaattcata aatgaagtag 102121 ttaatggtag ggttaccatt atcctttttg tagtgacgaa tcttgtatgt gttcctaaga 102181 aaaggaattt gtccattttt cggggtctca aaggggcgtg gaaacgcata agaactcttg 102241 aatggaaaag agatgtaact ccagttcctt cggaatcggt agtcaatcct atttccgata 102301 ggggcagttg acaattgaat ccgattttga ccattatttt catatccgta atagtgcgaa 102361 aagaaggccc ggctccaagt tgttcaagaa tagtggcgtt gagtttctcg accctttgac 102421 ttaggattag tcagttctat ttctcgatgg ggcggggaag ggatataact cagcggtaga 102481 gtgtcacctt gacgtggtgg aagtcatcag ttcgagcctg attatcccta agcccaatgt 102541 gagtttttct agttggattt gctcccccgc cgtcgttcaa tgagaatgga taagaggctc 102601 gtgggattga cgtgaggggg cagggatggc tatatttctg ggagcgaact ccgggcgaat 102661 atgaagcgca tggatacaag ttatgccttg gaatgaaaga caattccgaa tccgctttgt 102721 ctacgaacaa ggaagctata agtaatgcaa ctatgaatct catggagagt tcgatcctgg 102781 ctcaggatga acgctggcgg catgcttaac acatgcaagt cggacgggaa gtggtgtttc 102841 cagtggcgga cgggtgagta acgcgtaaga acctgccctt gggaggggaa caacagctgg 102901 aaacggctgc taataccccg taggctgagg agcaaaagga ggaatccgcc cgaggagggg 102961 ctcgcgtctg attagctagt tggtgaggca atagcttacc aaggcgatga tcagtagctg 103021 gtccgagagg atgatcagcc acactgggac tgagacacgg cccagactcc tacgggaggc 103081 agcagtgggg aattttccgc aatgggcgaa agctgacgga gcaatgccgc gtggaggtag 103141 aaggcccacg ggtcgtgaac ttcttttccc ggagaagaag caatgacggt atctggggaa 103201 taagcatcgg ctaactctgt gccagcagcc gcggtaatac agaggatgca agcgttatcc 103261 ggaatgattg ggcgtaaagc gtctgtaggt ggctttttaa gtccgccgtc aaatcccagg 103321 gctcaaccct ggacaggcgg tggaaactac caagctggag tacggtaggg gcagagggaa 103381 tttccggtgg agcggtgaaa tgcgtagaga tcggaaagaa caccaacggc gaaagcactc 103441 tgctgggccg acactgacac tgagagacga aagctagggg agcgaatggg attagatacc 103501 ccagtagtcc tagccgtaaa cgatggatac taggcgctgt gcgtatcgac ccgtgcagtg 103561 ctgtagctaa cgcgttaagt atcccgcctg gggagtacgt tcgcaagaat gaaactcaaa 103621 ggaattgacg ggggcccgca caagcggtgg agcatgtggt ttaattcgat gcaaagcgaa 103681 gaaccttacc agggcttgac atgccgcgaa tcctcttgaa agagaggggt gccttcggga 103741 acgcggacac aggtggtgca tggctgtcgt cagctcgtgc cgtaaggtgt tgggttaagt 103801 cccgcaacga gcgcaaccct cgtgtttagt tgccatcgtt gagtttggaa ccctgaacag 103861 actgccggtg ataagccgga ggaaggtgag gatgacgtca agtcatcatg ccccttatgc 103921 cctgggcgac acacgtgcta caatggccgg gacaaagggt cgcgatcccg cgaggtgagc 103981 taaccccaaa aacccgtcct cagttcggat tgcaggctgc aactcgcctg catgaagccg 104041 gaatcgctag taatcgccgg tcagccatac ggcggtgaat tcgttcccgg gccttgtaca 104101 caccgcccgt cacactatgg gagctggcca tgcccgaagt cgttacctta accgcaagga 104161 gggggatgcc gaaggcaggg ctagtgactg gagtgaagtc gtaacaaggt agccgtactg 104221 gaaggtgcgg ctggatcacc tccttttcag ggagagctaa tgcttgttgg gtattttggt 104281 ttgacactgc ttcacacccc caaaaaaaag aagggagcta cgtctgagtt aaacttggag 104341 atggaagtct tctttccttt ctcgacggtg aagtaagacc aagctcatga gcttattatc 104401 ctaggtcgga acaagttgat aggaccccct tttttacgtc cccatgttcc ccccgtgtgg 104461 cgacatgggg gcgaaaaaag gaaagagagg gatggggttt ctctcgcttt tggcatagcg 104521 ggcccccagt gggaggctcg cacgacgggc tattagctca gtggtagagc gcgcccctga 104581 taattgcgtc gttgtgcctg ggctgtgagg gctctcagcc acatggatag ttcaatgtgc 104641 tcatcggcgc ctgaccctga gatgtggatc atccaaggca cattagcatg gcgtactcct 104701 cctgttcgaa ccggggtttg aaaccaaact cctcctcagg aggatagatg gggcgattcg 104761 ggtgagatcc aatgtagatc caactttcga ttcactcgtg ggatccgggc ggtccggggg 104821 ggaccaccac ggctcctctc ttctcgagaa tccatacatc ccttatcagt gtatggacag 104881 ctatctctcg agcacaggtt tagcaatggg aaaataaaat ggagcaccta acaacgcatc 104941 ttcacagacc aagaactacg agatcgcccc tttcattctg gggtgacgga gggatcgtac 105001 cattcgagcc gtttttttct tgactcgaaa tgggagcagg tttgaaaaag gatcttagag 105061 tgtctagggt tgggccagga gggtctctta acgccttctt ttttcttctc atcggagtta 105121 tttcacaaag acttgccagg gtaaggaaga aggggggaac aagcacactt ggagagcgca 105181 gtacaacgga gagttgtatg ctgcgttcgg gaaggatgaa tcgctcccga aaaggaatct 105241 attgattctc tcccaattgg ttggaccgta ggtgcgatga tttacttcac gggcgaggtc 105301 tctggttcaa gtccaggatg gcccagctgc gccagggaaa agaatagaag aagcatctga 105361 ctacttcatg catgctccac ttggctcggg gggatatagc tcagttggta gagctccgct 105421 cttgcaattg ggtcgttgcg attacgggtt ggatgtctaa ttgtccaggc ggtaatgata 105481 gtatcttgta cctgaaccgg tggctcactt tttctaagta atggggaaga ggaccgaaac 105541 gtgccactga aagactctac tgagacaaag atgggctgtc aagaacgtag aggaggtagg 105601 atgggcagtt ggtcagatct agtatggatc gtacatggac ggtagttgga gtcggcggct 105661 ctcccagggt tccctcatct gagatctctg gggaagagga tcaagttggc ccttgcgaac 105721 agcttgatgc actatctccc ttcaaccctt tgagcgaaat gcggcaaaag aaaaggaagg 105781 aaaatccatg gaccgacccc atcatctcca ccccgtagga actacgagat caccccaagg 105841 acgccttcgg catccagggg tcacggaccg accatagaac cctgttcaat aagtggaacg 105901 cattagctgt ccgctctcag gttgggcagt cagggtcgga gaagggcaat gactcattct 105961 tagttagaat gggattccaa ctcagcacct tttgagtgag attttgagaa gagttgctct 106021 ttggagagca cagtacgatg aaagttgtaa gctgtgttcg ggggggagtt attgtctatc 106081 gttggcctct atggtagaat cagtcggggg acctgagagg cggtggttta ccctgcggcg 106141 gatgtcagcg gttcgagtcc gcttatctcc aactcgtgaa cttagccgat acaaagcttt 106201 atgatagcac ccaatttttc cgattcggcg gttcgatcta tgatttatca ttcatggacg 106261 ttgataagat ccatccattt agcagcacct taggatggca tagccttaaa agtgaagggc 106321 gaggttcaaa cgaggaaagg cttacggtgg atacctaggc acccagagac gaggaagggc 106381 gtagtaatcg acgaaatgct tcggggagtt gaaaataagc atagatccgg agattcccga 106441 atagggcaac ctttcgaact gctgctgaat ccatgggcag gcaagagaca acctggcgaa 106501 ctgaaacatc ttagtagcca gaggaaaaga aagcaaaagc gattcccgta gtagcggcga 106561 gcgaaatggg agcagcctaa accgtgaaaa cggggttgtg ggagagcaat acaagcgtcg 106621 tgctgctagg cgaagcagcc cgaatgctgc accctagatg gcgaaagtcc agtagccgaa 106681 agcatcacta gcttatgctc tgacccgagt agcatggggc acgtggaatc ccgtgtgaat 106741 cagcaaggac caccttgcaa ggctaaatac tcctgggtga ccgatagcga agtagtaccg 106801 tgagggaagg gtgaaaagaa cccccatcgg ggagtgaaat agaacatgaa accgtaagct 106861 cccaagcagt gggaggagcc agggctctga ccgcgtgcct gttgaagaat gagccggcga 106921 ctcataggca gtggcttggt taagggaacc caccggagcc gtagcgaaag cgagtcttca 106981 tagggcaatt gtcactgctt atggacccga acctgggtga tctatccatg accaggatga 107041 agcttgggtg aaactaagtg gaggtccgaa ccgactgatg ttgaagaatc agcggatgag 107101 ttgtggttag gggtgaaatg ccactcgaac ccagagctag ctggttctcc ccgaaatgcg 107161 ttgaggcgca gcagttgact ggacatctag gggtaaagca ctgtttcggt gcgggccgcg 107221 agagcggtac caaatcgagg caaactctga atactagata tgacctcaaa ataacagggg 107281 tcaaggtcgg ctagtgagac gatgggggat aagcttcatc gtcgagaggg aaacagcccg 107341 gatcaccagc taaggcccct aaatgatcgc tcagtgataa aggaggtagg ggtgcagaga 107401 cagccaggag gtttgcctag aagcagccac ccttgaaaga gtgcgtaata gctcactgat 107461 cgagcgctct tgcgccgaag atgaacgggg ctaagcgatc tgccgaagct gtgggatgta 107521 aaaatacatc ggtaggggag cgttccgcct tagagagaag cctccgcgcg agcggtggtg 107581 gacgaagcgg aagcgagaat gtcggcttga gtaacgcaaa cattggtgag aatccaatgc 107641 cccgaaaacc taagggttcc tccgcaaggt tcgtccacgg agggtgagtc agggcctaag 107701 atcaggccga aaggcgtagt cgatggacaa caggtgaata ttcctgtact gccccttgtt 107761 ggtcccgagg gacggaggag gctaggttag ccgaaagatg gttatcggtt caagaacgta 107821 aggtgtccct gctttgtcag ggtaagaagg ggtagagaaa atgcctcgag ccaatgttcg 107881 aataccaggc gctacggcgc tgaagtaacc catgccatac tcccaggaaa agctcgaacg 107941 actttgagca agagggtacc tgtacccgaa accgacacag gtgggtaggt agagaatacc 108001 taggggcgcg agacaactct ctctaaggaa ctcggcaaaa tagccccgta acttcgggag 108061 aaggggtgcc tcctcacaaa gggggtcgca gtgaccaggc ccgggcgact gtttaccaaa 108121 aacacaggtc tccgcaaagt cgtaagacca tgtatggggg ctgacgcctg cccagtgccg 108181 gaaggtcaag gaagttggtg acctgatgac aggggagccg gcgaccgaag ccccggtgaa 108241 cggcggccgt aactataacg gtcctaaggt agcgaaattc cttgtcgggt aagttccgac 108301 ccgcacgaaa ggcgtaacga tctgggcact gtctcggaga gaggctcggt gaaatagaca 108361 tgtctgtgaa gatgcggact acctgcacct ggacagaaag accctatgaa gcttcactgt 108421 tccctgggat tggctttggg cctttcctgc gcagcttagg tggaaggcga agaaggcctc 108481 cttccggggg ggcccgagcc atcagtgaga taccactctg gaagggctag aattctaacc 108541 ttgtgtcagg acctacgggc caagggacag tctcaggtag acagtttcta tggggcgtag 108601 gcctcccaaa aggtaacgga ggcgtgcaaa ggtttcctcg ggccggacgg agattggccc 108661 tcgagtgcaa aggcagaagg gagcttgact gcaagaccca cccgtcgagc agggacgaaa 108721 gtcggcctta gtgatccgac ggtgccgagt ggaagggccg tcgctcaacg gataaaagtt 108781 actctaggga taacaggctg atcttcccca agagctcaca tcgacgggaa ggtttggcac 108841 ctcgatgtcg gctcttcgcc acctggggct gtagtatgtt ccaagggttg ggctgttcgc 108901 ccattaaagc ggtacgtgag ctgggttcag aacgtcgtga gacagttcgg tccatatccg 108961 gtgtgggcgt tagagcattg agaggacctt tccctagtac gagaggaccg ggaaggacgc 109021 acctctggtg taccagttat cgtgcccacg gtaaacgctg ggtagccaag tgcggagcgg 109081 ataactgctg aaagcatcta agtagtaagc ccaccccaag atgagtgctc tcctattccg 109141 acttccccag agcctccggt agcacagccg agacagcgac gggttctctg cccctgcggg 109201 gatggagcga cagaagtttt tttgagaatt caagagaagg tcacggcgag acgagccgtt 109261 tatcattacg ataggtgtca agtggaagtg cagtgatgta tgcagctgag gcatcctaac 109321 agaccggtag acttgaacct tgttcctaca tgacctgatc aattcgatca ggcactcgcc 109381 atctattttc attgttcaaa tctttgacaa cacgaaaaaa ccattgttca actctttgac 109441 aacatgaaaa aaccaaaagc tctgccctcc ctctctatct atccaaggga tggaagggca 109501 gaggcctttg gtgtcccctc cagtcaagaa ttggggcctc acaatcacta gccaatatgc 109561 ttttctctca tgcctttctt cgttcatggt tcgatattct ggtgtcctag gcgtagagga 109621 accacaccaa tccatcccga acttggtggt taaactctac tgcggtgacg atactgtagg 109681 ggaggtcctg cggaaaaata gctcgacgcc aggatgataa aaagcttaac acctctcatt 109741 cttattactt tttcaatatg aaaacgaaaa aaaaaaaaat gaaaaatcaa aaggtcgttt 109801 tattcaaaac cccaattgtg acatcccttc tctcccactt cacacctcgg aacgcaccct 109861 tcttatagag ataaacgcgc cttcacatct tcttaacccg aaatggctgg ggagaggaaa 109921 ggttcctttt tttgagggta ctcccgggaa cagatccagt ggagacgggg tggggcctgt 109981 agctcagagg attagagcac gtggctacga accacggtgt cgggggttcg aatccctcct 110041 cgcccacaac cggcccaaaa gggaagtacc tttccctctg ggggtaggaa aatcatgatc 110101 gggatagcga accaaaagct atggaacttg ggtgtgggtc ttttgtcgaa atggaatggc 110161 ttttcttttt ctctttttat ttatcgtgaa tgggggaatc attacacata gtatgcccgg 110221 tcagcatatt tttttgtttt acgccccgta actcttcctc agccaggctt gggcagaata 110281 gcagagcaag tattagtagc ataacaaaaa agccttcctc gtcattaata tctttgctcg 110341 cggcaattgt gacctctcgg gagaatcgat gactgcatct ttgatgcagt gctagtatat 110401 ctgagacttc ttaattggct agttgtaaat agccccaggg ctatggaaca aaggattatc 110461 tcggacctag accgaggtat tgatggtgat tttctaatct cgcagaacag aatgtgatac 110521 gatgagatag aatgcaatag aaacaaagac agggaacggg ttacctactc ttaacgggca 110581 aagcgagccc ctttattctg aattctttaa ttcagaatca atcaaatctc cccaagtagg 110641 attcgaacct acgaccaatc ggttaacagc cgaccgctct accactgagc tactgaggaa 110701 caacaggaga ttcgatctca tagagttcaa ttcccgttcc caacccatga ccaatatgag 110761 ctcgaagctt ccttcgtaac tcccggaact tcttcgtagt ggctccctta catgcctcat 110821 ttcagaggga acctcaaagt ggctctattt cattatattc catccatatc ccaattccat 110881 tcatttaata tccctttggt gtcattgaca taacagatgt cgtttctagt ctatctcttt 110941 ctatttcttt tctatatatg gaaagttcaa aaatcatcat ataataatcc agaaattgca 111001 atagaaaaga aataagggag gtttgtgatg atttttcaat cttttctact aggtaatcta 111061 gtatccttat gcatgaagat aatcaattcg gtcgttgtgg tcggactcta ttatggattt 111121 ctgaccacat tctccatagg gccctcttat ctcttccttc tccgagctct ggttatggaa 111181 gaaggaaccg agaagaaggt atcagcaaca actggtttta ttacggggca gctcatgatg 111241 ttcatatcga tctattatgc gcctctgcat ctagcattgg gtagacctca tacaataact 111301 gtcctagctc taccatatct tttgtttcat ttcttctgga acaatcacaa acactttttt 111361 gattatggat ctactaccag aaattcaatg cgtaatctca gcattcaatg tgtattcctg 111421 aataatctca tttttcaatt attcaaccat ttcattttac caagttcaat gttagccaga 111481 ttagtcaaca tttatctctt tcgatgcaac agcaagatct tatttgtaac aagtggtttt 111541 gttggttggt taattggtca cattttattc atgaaatggc ttggattggt attagtctgg 111601 atacggcaaa atcattctat tagatcgaat aagtacattc gatctaataa gtaccttgtg 111661 ttagaattga gaaattctat ggctcggatc tttagtattc tcttatttat tacctgtgtc 111721 tactatttag gcagaatacc ctcacccatt cttactaaga aactaaaaga agcctcaaaa 111781 acagaagaaa gggtggaaag tgaggaagaa agagatgtag aaatagaaac agcttccgaa 111841 atgaagggga ctaaacagga acaagaggga tccactgaag aagatcctta tccttctcct 111901 tccctttttt cggaagaaag gtgggatccg gacaaaatcg atgaaacgga agaaatccga 111961 gtgaatggaa aggacaaaat aaaggataaa ttccactctc accttacaga gacaggctat 112021 aattgtaatt gtgaattaaa aaaaacagaa aataaggaat ttgattcaca aagttgaaaa 112081 gagtaagtaa taaactaata aaaagattga aacataagct aaatacaaga aaagataaga 112141 agagatgcgt ccgcccccta tatatttgat accttctcct acaatgaaac taataacccc 112201 aaccccgtta tcagtcccat caattactcg tcgatcaaaa aaatgagtaa attcagctaa 112261 tcctcttatc ccaccaacta agaatcttgt ataaaaagca tctatgtaag cacgattata 112321 tgaccaatca tatatgccat ttataatttt gtcccacaga attctcttag gacccttttt 112381 aacaaaagaa ttaattaact caaaattttt taaagaagaa taaatgggtt tatataaaaa 112441 ggatgctata aatattccga aataagctaa ccatttcgat aatatatcca aattccctcc 112501 ctcttggttg aaaggaattc ctatagatcc aacaaacaaa gtaaagagtc ctaatacaaa 112561 tattgggaat agcatagtat tgtccgattc ataaggatag gaataaaccg ctttatgctc 112621 aaaatgagca atagtcataa aaggtcgtgt catctttctt ccatttttat caattggata 112681 tttagttttt gcaaaaaaat aagtactttc attattattc atagttaata aacaagagtt 112741 tttcttaact ccgtttttac cccatagaga tattgaatag aagggggttt tttgtttccc 112801 accataattt ggaaaatgag cgtttaaatg cccttcaaaa gtaagtaaat agatccgaaa 112861 catataaaat gcggttaatc ccgccgtggc ccaagctatt attgcgaaaa ttggcgaata 112921 caaccaacta tcattaagaa tttcatcttt ggaccaaaaa caagcaagag gtggaatacc 112981 acaaagagaa agtgtaccta ataaaaatgt gattttgcta attggtacat gttttcttaa 113041 acctcccata agacccatat tctgactttt agctggagaa tatccaacaa tagtttccat 113101 tgaatgaata atggatccgg atcctaaaaa taataatgct ttggaataag catgagtaat 113161 caaatgaaat aaagcgcttc gataagaccc cataccaaga gctaacatca tataacccaa 113221 ttgagacatt gtggaatagg ctaaacctct cttaatgtct ttttgagcaa gagctaaagt 113281 agctcctaat aatactgtta ttattcctat aaccgagatc aaatacatta tgtaaggtat 113341 aactctgaaa agaggaagaa gccgagctac aagaaaaatt cccgccgcta ccatagtagc 113401 agcatgtata agagccgaaa tgggagtagg cccctccatg gcatcaggta accatacatg 113461 aagggggaat tgggcggatt tagcaactgc accggcaaat aagagaacag cacataaagt 113521 aacaaataaa aaatcgactt cattattata aatcaagtta ttgaatattt cgaataaatc 113581 cctaaattcg aaactccctg ttatccaata aaaacctaaa attcctaata ataaaccaaa 113641 atcccctaca cgattagtta caaacgcttt ttgacaagca tttgccgcaa caggtcgtgt 113701 aaaccaaaat cctattaata gataggaaca cagcccaacc aattcccaaa aaatataaat 113761 ttgtatcaaa ttcgaactag taactaatcc caacatggaa gtactgaaaa aactcatata 113821 agcaaaaaat ctcaaatagc cttgatcatg agccatataa ttatcactat aaataagaac 113881 cataattcca accgtagtga ttaatattga cataatagaa gtaagtgggt cgatcaagta 113941 tccgaagtct aaagaaaaat cattattgat gatccaagac catacatatt gataaaaaga 114001 actgctattt atttgctgaa tagacaggta gattgaaaaa accatgacta tgcttaacaa 114061 taaaacactc tgaaaagccc acatacggcg aaaacttttt gttgccgttg gaaaaagaaa 114121 aagtcccgct cctattaaca tagggactgg aagtggaatg aaaggtatga tccacgcata 114181 ttcatatgtc tgttccataa aaaagttttg aattcttaat taattgtttc cgattcaccg 114241 gatcttacct cttttgaaag gagtcaataa aaagtcaaaa tatggactaa ctgaaactaa 114301 tttaaaactt aaatcgaatt ttctattctt acttattctg agtctttgct aaatacttca 114361 actattgaaa tcaagaagtt acaattggtc aaatgatatg aaagggatta attactagtc 114421 tcttttgaaa taggcctatt tttctccaag tttgaccagt gaatcgaacg gggattcaag 114481 tttttcattt catgaagtaa aaatgcggtt cttatcttta aacctttcga ggtattttat 114541 tgcatgtaaa tgaaatgtgg aaccataaat agaaatcgag tattttttgg attctttatt 114601 ttattttttc tttttattaa gttcaactaa tttcctttct acagaacagc cgattagcaa 114661 attctatagg tatagatttt atgaatcaaa aataatgtga aataaagata ccagtcaata 114721 gagaaccttt tttttacaat tatgaatgtt ttatggaata gaaaaacttg aaaaaaacac 114781 atattgacct tcttttttta tttccagtat tatgcaattt tcacacatct tttgcctatc 114841 tcgataatgt tttattttag gacgacacta ttagctcgaa aataaatagt agtaaaaaga 114901 attcgttttg aacaatagat gtctttcaca tccagctata acaatgagta attttttaat 114961 ttctaaatgg cagttccaaa aaaacgcact tcgacatcaa aaaagcgtat tcgtaaaaat 115021 atttggaaaa ggaagggata ttcgatcgca ttaaaggctt tttcattagc gaaatctctt 115081 tctaccggga attcaaaaag tttttttgta cgccaaacaa aaataaataa gtaataaaac 115141 gttcgaataa tttgaatcaa cttgaaaaaa gaattcaatt attcttaaat tattcaatta 115201 gataataatt gaataattta acgatttccc tttcatattt gatattgatt agctcaccaa 115261 tcaatacgta atggaactcg cttcgctttt ctgattgata gataaaataa tagaattagg 115321 aaatcctcta tttactgaat aataactttt ttgttgacaa aagagtaaac atcatttcta 115381 ttccaaggtg gggagtttca ttttccccat cgacctattt gcagaattcc attaaaaaaa 115441 aattctatat ttccattcta tttccatatc tatagaagaa cgtatataaa aatctttagt 115501 gaaattagtg aaagttaaga actcattgaa actaattgat tctattttga aacctttttg 115561 ttttgtctaa ctttctaact ctttattttc tctgaattat tatatagata cccatgtata 115621 tcttgccctt aacccaatag agaaaattgc ttaatgaaat tctgtatgac tggttgtcaa 115681 ttttgagcga tgcaaaatag gttcttttct ttctattttg tcttcaaaat ccattttttg 115741 ttttagattt ctgaaataaa ataaatagga aatagctgat taaacaatga aaacaaaaaa 115801 tttgggaact ctattcctta attgagtata gaacggttta gttacaagag ttcaattcga 115861 ggaaagcata aaatatggga aagtcccagg ttaaataaaa aaaactaaga ctctaaactc 115921 aaatctaaaa taatgaacct tcaacttcaa attcctattt gaacaacttt ttattgttat 115981 tgatccattt gaatcattac taaactaaaa tagcttcctc aatctcgacg attgcttatt 116041 cataggctat tatgagttca agacaggccg ctatggtgaa attggtagac acgctgctct 116101 taggaagcag tgctaatgca tctcggttcg agtccgagtg gcggcatacc gtcttctaaa 116161 aaggataaat agatcttata atgaattcaa ttcccgattt cctttttaga attatgtaat 116221 taagggactc ttctttttta agatttttta tgatattttc aaccttagag catatattaa 116281 ctcacatttc cttttcgatc gtttcaattg taattacaat tcatttgata acctttttag 116341 tcgatgaaat cgtaaaacta tacgattcat cagaaaaggg cataatagtt acttttttct 116401 gtataacagg attattagtt actcgttgga tttcttctgg acatttccca ctaagcgatt 116461 tatatgaatc attaattttc ctttcatgga gtttctccct tattcatata attccgtatt 116521 tcaaaaaaaa tgttttaatt ttaagtaaaa taactggccc tagtgctatt tttacccaag 116581 gctttgctac gtcaggtatt ttaactgaaa tacaccaatc tgtaatatta gtacctgctc 116641 ttcaatccga gtggttaata atgcacgtaa gtatgatgat attgggctat gcagctcttt 116701 tatgtggatc attattatca gtagcacttc tagtgattac atttcgaaaa aacagacagc 116761 ttttttataa gagcaatggt tttttaaacg agtcattttt cttgggtgaa aatgttttac 116821 aaaatacttc ttttttttct gctaaaaatt attacaggtc ccaattgatt caacaattgg 116881 attattggag ttatcgggtt attagtttag gatttacttt tttaaccata ggaatccttt 116941 cgggagcggt atgggctaat gaagcgtggg ggtcgtattg gaattgggac ccaaaagaaa 117001 cttgggcatt tattacttgg atcgtatttg caatttattt acatactcga acaaatagaa 117061 atttgcgggg tgcaaattct gcaattgtag cgtctatagg ctttcttata atttggatat 117121 gctattttgg ggtcaatctt ttaggaatag ggttacatag ttatggttct tttccatcaa 117181 catttaattg aattcaagac aagttattac aaatacaaga gcgggcggcg cattgtatga 117241 accagcgtgc ggaccgtgtg aatcatcaat acaatatttg attcacacgg ttttctacca 117301 tatgtagttc aatttcattg tttttactta acttaagagt taagagaaga aaaaaagtct 117361 tctttttttc attgtccaag aatgtttttc aaaacaaaca taggtttttt ttatttcagt 117421 catccaaatt atctataaaa aaaattagat agaataactt cgaccttgtc aactgctaat 117481 gaaagaacga aatccgggta tataccaata cctattacgg gtaaaaagat ggagatcgaa 117541 agaaataact ctcgcggtcc agaatcaaaa aaagaatcct tcggggcatt aaatagcttg 117601 tatccataga acatctggcg tgacatagat aatgaataaa taggagttaa tatcattcca 117661 attgccatta caaaagtaat tagtattttt ggaattaaaa gatatttttg gccggtaatt 117721 attccaaaaa atactatcaa ttcggcaaca aaaccactca tacctggtaa tgcaagggaa 117781 gccatcgaaa agctactgaa catcgtgaac atttttggca ttggaatagc tattccgccc 117841 atttcgtcaa gataaacaag gcggattcta tcataagtcg ttcccgccaa gaaaaaaagt 117901 gcagcaccaa taaatccatg agatattatt tgtaaaaggg ctccattaag tcccgtgtcg 117961 gttagagaac taattcctat aattatgaaa cccatatgag agacagagga ataggctatt 118021 ctttttttta aattccgttg gccaagagat gttaaagctg catagattat ttgtattgta 118081 cctattatca tcaaccaagg agaaaatata gaatgggcat gaggtaataa ttccatattg 118141 attcgaatta atccatacgc tcccattttt aataaaattc cggctagaag catacaagta 118201 ctgtaatgtg cttctccatg ggtatctggt aaccatgtgt gtagggggat aatgggcgat 118261 ttgacagcaa aagcaataaa aaatccaata tagaatatta tttctaaaac cacaggatat 118321 gactgattaa ctgatgtttc aaaatttaat gttggttcat tagaaccata taaagcaaga 118381 cccaaaactc ccattaagag aaaaacagaa ccccccgccg tgtacaaaat aaattttgta 118441 gctgagtaca gacgtttctt tcctccccac atgcatagaa gtagataaac aggaattaat 118501 tctaactccc acatgatgaa aaaaagtaaa aggtcccgag acgaaaatga tccaatttga 118561 ccactgtaca ttgctaacat gagaaaatgg aataatcgag aatctcgagt aactggccaa 118621 gccgctaaag tagctaaagt agtgataaat cctgttaata aaatgggtcc tatagaaagt 118681 ccatctattc ctaatctcca atggaaatca aaaaaattga tccatttata atcctccact 118741 agttggatta atggatcatc cgattggaaa tgataacaaa atgcataagt cgttagaagg 118801 agttctaaaa tacatataca tatcgtatac cacctaatta ccctatttcc tttatgggga 118861 agaaagaaaa ttaaggaacc cgcaaatatt ggaaaaacta caattattgt taaccaagga 118921 aaataattcg tagtaaagac aagatacact tggaccataa aaacccgtgc tcaaaatatt 118981 gtgattttcg agcacaggtt tgtcggtaaa aaaaattaaa tggattcaag tagagttttc 119041 tcgaacgtat caataagcta gacccatact gcgagttgtt tcatgccata aataaactcg 119101 gacactcaag aaatctgttg gacaggcgga ttcacatctc ttacaaccaa cacagtcctc 119161 tgttcgtgga gcagaagcaa tttgtttagc cttacaaccg tcccaaggta tcatttctaa 119221 tacatcggtg ggcaggctcg gacacattga gtacatccta tacacgtatc ataaatcttt 119281 actgaatgtg acattgggtc tatacgtttt tgaatgttag aaattttcga tctagtaaac 119341 ttagaaacga atcatataat catatattta tataccagat gaatcaatga gttatcataa 119401 ttttctaatc aacccccttc tggattggtt tatgagatat gagagagggc caaaatactt 119461 tgatttctta tgttttgcaa acaagatcac accttacgta gcaaacatgc taattaaaat 119521 cgatttatca atattagaat ctagatgatt aatactaatt attcaacaaa tttgattggt 119581 tgatacgagt tgattttctg ttacggtaaa ttgatgaaac aatagccagt ccaatggctg 119641 cttcagcggc tgcaatagct ataacaaaaa ttgagaaaat gtctcctttt aattgacgat 119701 tatcaaaaaa atcagaaaat gttacaaaat ttatattaac cgcattcaat ataagttcaa 119761 gacacataag ggctctaacc atatttcgac ttgtgatcaa tccatagatc ccgatagaaa 119821 ataaataggc actcaaaaca agtacatgtt cgagaatcat taaacaactc cttatcaatc 119881 tcgactcctt tcaatatgaa caacaattca accgatttaa ttgactagta tataacaagt 119941 atggaacaaa gaaatatatt ggtactagat tgacctaaag tctttctatt tatacaacag 120001 gaattcaaat agaattgaag gaaaatgaat gtgataagac agaacaaaat tttatttgaa 120061 ttccaagttt taatagaaat tttttattga cgagctacag caattgcacc tattaaagca 120121 actaaaagga ttattgaaat cagttcaaat ggaagaaaaa aatctgttga taaatgaatt 120181 ccaatttgtt gactattact tataaaatct tgctctataa tctggtttga tcttgtagtc 120241 caaataatcc cgtaccatga cgtatctgaa atagtagtaa ttagtgaaat aaaaagactt 120301 atacaaacca tcgaagtaat tccatctcct acggtccaaa gatgaaaatc tttgtaatat 120361 tctgaaccat tcatgaacat cacagcaaaa atgattaaaa catttatagc tcctacgtaa 120421 ataagtactc gcagcagcta caaaatagga gttagataga atatagaata acgatgtaca 120481 aacaagaacc aatcccaagg aaaaggcaga ataaattgga ttgggaagta ataccactcc 120541 tagaccccct aatataagac ccgaccctag aaagactaaa agaaaatcat gtattggttc 120601 agataaatcc attttttatc aaaaatcaaa aacgaagaat ttcatgactt tattgacctg 120661 accaggaaaa aagaagtttt tcaatttttt atgatacttc ttaattgtta attgaatgaa 120721 attgtaatgg gtatgaattg acgtagatgc ttttatttta ttggaccact atcaattctt 120781 tattcgtcga acgagtagtt taaacctatc gattttggat atcatttatc tactttgaaa 120841 ccattactat tattataact ataatataga aatccgtttt gttttcaatc taaattaagc 120901 taggagtctc attaaccaac cactagtttg aattgaacaa gcaaaaatat cattctttta 120961 gatccgaact aagccttcgt aattcggaat ttttttcgaa tttagggttt attcattttt 121021 tatttgaggt aaattcgaaa ttgttcgaat tgtgtaatca tcaattactg acattggtaa 121081 gcgacccaaa gcgatttgat tataattcaa ttcgtgacga tcataagtag aaagttcata 121141 ttcttcggtc attgataaac aatttgttgg acaatactca acgcaattac cacaaaatat 121201 acagattcca aaatcaatac tgtaattaag caatcgtttc tttcgaatat cagtttccaa 121261 cttccaatca acaacgggta aatctatagg acatacacgc acacatactt cacaagcaat 121321 gcatttatca aattcaaagt ggattcggcc tcggaaacgt tccgatgtga tcaatttttc 121381 gtaggggtat tgaatagtta caggtaaacg atttgcgtgg gacagggtaa tcatgaaacc 121441 ttggccgatg tatctggcgg ctcgtattgt ttgttgacca taatttatga attcagttat 121501 catagggagc atatttagaa tatctataaa aaagatttta tgcttgtttc tttctcttgt 121561 ttgagacaag tcgtgaatct agaatattgt agtcttttac agtgaaagaa gttgggacga 121621 ggttgtcaat aatagattac ctagagaaat aggtaaaaga aatttccacc caagatttaa 121681 tagttggtcc attctcagcc tcggtaaagt ccatcttgtt gcaataggaa tgaacaaaaa 121741 caaataagtt ttggctaatg tgataaagat accaattagt gttccaaaga ctttacccct 121801 tttatttatg ccaaatagct caggaacaaa tatgtacgga atagaaagat tccaacctcc 121861 caaataaaga actgttacaa ataatgaaga aactagtaga ttcagatatg aagcaactgt 121921 aaaatcaaac caaatttgat acctgaatat tcggtttgat accctgctac taattcttct 121981 tctgcttctt ggtaaatcaa aaggtaatct ttcacactcg gctagagaag aaattagaaa 122041 aacgataaac ccgatgggtt gacgccacaa attccacccc caaaagccat attttgactg 122101 cgcttccact atatcaactg tacttaaact gttagataat catagtcgat gataacatca 122161 ctgtgcccat cgctattaca gaaccgtacg tgagattttc atctcatacg gctcctcaga 122221 ggtcacaaat aaatctaagg accctttcct attctttatc ttgatatgtt tgtcagatag 122281 agtaaaaatc tatcctaagg tcccaaatta gaccaatgga attctgtctg ctatatttaa 122341 aactaataaa tacgggcttc tgaattgatc tcatctttta agaattttca tttttctttg 122401 ttgattaata accttatcat taaataaaat gcgctttata gcaatatcac atatacattt 122461 caacctcgaa ttctcaatta cgaaaaaaat tagagagtcc attagttcat gaatcatgac 122521 aaaaaatttc tctctcgaac tagaaatcaa aatggaatta taggaaagaa agaataaaaa 122581 caaaaaaaga aaaaagtaag aaaaaaaaag acatcccccc tttttgcttt tgcaattaga 122641 ttcttttctt tctatttcta ttttatttca ttcctattct cctttctcag aaaaagggcc 122701 tttaaccaaa gtaaaagatt acttcgttct tgatagttat ttacttactc agtggatagg 122761 aacatactct ggatcagaat catggggagt acttcttgat catttctacg aacgtaaagc 122821 cccaattcga attcctttta tgtacagaaa tatcctcttg gataacttac ataatctcaa 122881 ttactaatcc tttgtgtatc ttggtcttcc taaccatcca ctcatttttg ctttcaacct 122941 cccgttgtgg aaatccatct atggtaatag acagtaaaaa ctccatacag ttgatctttt 123001 gaacccgctt caagctatca tgacaattca ccaatcttgg ggtaaacaat ctctattgct 123061 tatgtttact tttttcacca tttgattctt gtacatagga aatgagactc aaccttttta 123121 ctgcaaattt agaagccgtt ttctttcact catataacta tctggtttag ttcatcaacc 123181 caaatgctga ataaaaatga aaatatatat attcaatcaa atctttttac ctttgtttct 123241 agaaagaaaa gaatttggag aaattttagg tctcaccgaa tcacacgtag agatattgat 123301 aacacacata gagctaatgg tattttcata actaattgat tgagcagctg cccgtagacc 123361 acctaaaaaa gaatatttat tatttgatcc atatcccgac ataagaagtc caacgggagc 123421 aatacttgaa atggcaatcc agaaaaaaac accaatacta agatcggcta gaacaaggtg 123481 atcaccaaaa ggaattactg aataacttag aaagatggat attactgcta tggatggtcc 123541 gatactgaat aaacgagtat ctcctgtaga tggaataagg ttctctttca aaagtagttt 123601 tgtcccatct gctagagctt gaagaattcc taaagggcca gcatattcag gtccgatacg 123661 ttgttgtatt cctgcagata tttctctttc taaccaaaca attactagta cacctattgt 123721 gattcctaat acaagagtca aaatagggaa aagcatccat atgatcccat agacttcttt 123781 taaggattcc aatttggaaa aagaattgat agtttctatt tctgttgtat caattatcat 123841 ttcaacgatc aacttctccc ataatgatat ctatgctacc tagtattgtc ataatatcag 123901 ccaatttcat tcttttaact aactgaggaa gaatttgcaa attgataaaa cctggtgggc 123961 gaattttcca tctccaagga aaaacgctct gatctcctat gagaaaaatt cccaattctc 124021 cttttggggc ttcaactctc acataaagtt cttgtttcga caattcaaaa gttggagaag 124081 gttttttact aataaaccga tattcaaaat cattccattc aggatctttt aatctgtcaa 124141 aacgtcggat ttctaaattt tcgtaaggcc ctcctggaat tccttccaga gcctgttgaa 124201 taatctttat ggattctgtc atttcaccga ttcgtactaa ataacgagct aatgaatccc 124261 cttctcgttg ccattgaacc tgccaatcaa attcgtcgta agactcataa tgatcaactt 124321 tacgaagatc ccattctatt ccggaagctc gtagcattgg tcccgataac ccccaattta 124381 atgcttcgtc tcccccaata atgcctacgc cttcaactcg ttctaaaaaa ataggattcc 124441 gggtaataag tttttgatac tcagcaaccc ctgttaaaaa ataatcgcaa aaatccaaac 124501 atttatctat ccagccatag ggtagatcgg cagccactcc cccgatacga aaataattat 124561 gcatcattcg cataccggtg gcagcttcga agaggtcata tatcaattct ctttctcgaa 124621 aaatatagaa gaaaggggtc tgcgcaccaa tatccgccat aaaagggcct agccataaca 124681 aatgagaagc tatccgactc aactccaaca taatgactct gatatagcta gcccttttag 124741 gtacttgaat attgcctaat tgttcgggtc catttatggt tattgcttct gtgaacatag 124801 tagctaaata atcccaacgt gttacataag gcaaatattg tataattgtt cggttttccg 124861 caattttctc catccctcta tgtaaataac ccaatattgg ttcgcagtcg acaacatctt 124921 caccatctag agtaacgatg agtcgaagaa caccgtgcat tgatgggtgc tgaggcccca 124981 tattgactat catgaggtct tttcttgtag ttggtgcagt cataagtttt ttaccgattc 125041 attcttccat gaattgctga aagtgaaaag aagttcatca aaatttaatc gaaacatata 125101 agtgaaaatg aaatgactct tcaaataaat caaattaacg agtttttgtc tctcgaatgt 125161 ccaactgatt aattaattct ttataacgta ctctattttt ttttgacaaa taagctagga 125221 gtcgttgacg ttttcccaaa attttcttca aacctctctg agataaatag tcttttttgt 125281 gcaattctaa atgtgaagta agtctccgta tcttattggt gaaattgaat acttgaaatt 125341 caacagatcc tctcttttct tcttgagaaa taactgaaat gacagaattt tttaccataa 125401 aagaatttcc cctttcttta ttttacagat atggatttta tcgaatttta tcgatcagta 125461 ataataatgc cagtaatttg aacgtggtat atagacttaa tttctttatg aactcctaat 125521 tttatcaatt ccaataaatt aatcaaattc aaaatttgat tcagatagga atccaaaaag 125581 atggtaggta cttttttttt cattcacaaa agcgactaat ttaaacctaa aatcctaaaa 125641 tgaagaagat tttgttgatt cctttctaga tctaatcgat actttattga tttagtatcg 125701 tctactcgaa ttagattcga atgagatgta agaaaaagca tgtgtacatt tgtttacttt 125761 cagatactct atacgaaaca ggatatatag tactatcaat ttattttcaa ttgtggatac 125821 atatgtatcc ttaagatact gaaacgacta ccattattgg tatcaaacca ataacgattc 125881 atacaagcta aatcttctaa tcgataatta ggccaaagaa agaacttcaa tttaattaat 125941 tcatttttct ctttataaag aggtttcctt tcatccaaaa attgactcca gttttttaca 126001 ttgttttcgt tgcaaaatac tgaatttcta tcgatgccat tccaattcaa agaattaaac 126061 aaacttcgaa ttctcaattc tctacgacgt ctagaccata aaatattttc aggaacaagc 126121 aaatcaaaat gatttttgtc tgtatttatt ctttgagttt gaggttgcag aatgaattca 126181 tcaaaattct ttttatcaac atatctttgt tcggggtatc tttgattagt ttggtgttta 126241 cttttatgaa ccaatgaaat acctatggtt tgatacataa taaattgtcc attatttttt 126301 acagacaacc gaataggttc gataattaat atccccttct tcatcaattc tgtaagagtt 126361 aaattcttct gaatcagcat tatatccaaa ctcatttctc tcctttgaat tgacgatata 126421 gcaattttgc ttggatttat cagtcgaagc aggagacaat ataccttgat attctcgatc 126481 attctttgat tcaaagcatc gttccatctc aattgaaaaa gcaaataacg tttcaagaac 126541 aaatctagtt ctgcttccgt gttgcttttg tattgttttt tctttttacc cttctttgtg 126601 tctgattccg cgtaatcttt tttaagagcg ttttgatgtt ttgagagaac agggcccaga 126661 tttcctttgt tttctatatc tgatccacgc tctttttctc cttgacttgc gggttctttt 126721 gcttcttgaa ttcgattctt tattttttta tttgatcgta gaaaaaagtt ttgtttttgg 126781 tttttattga tgtttttatt tgactaacat tttcatttgt attcaaattt aaaagaagta 126841 atttgcttgg tataatccac ggttttattt tatatacatt ataaagtggt acaaattctg 126901 ggaagaacca aaattccaga ttcaatatgg gacgatttaa tattttttca ttcattccca 126961 tccaatcaaa aaaggctttt ttcgaatttt tttgattgtt ttctggattt tgatgaatcg 127021 taagataaaa aaagcctttt ttatcaattt tatcaattat ttgataatta ttaataccaa 127081 ttttagtatt tggattactg ttggtatcga tcttaaccca ggcctcaata tcttcttttt 127141 gtctaagaga aaaatggata attttccaat caaaatattt tctatcgaga tttctttcta 127201 tatatagaat attgcctttt cttagataat tattgatatg aagattgccg agcatatcaa 127261 aaaggttgtg tttggacgtg ttggaattag aagaaatttc gaggttctta tttacttgaa 127321 agggtaatct agaaataaaa gagtcatttt ttttttcata attaatcgat ttatatgcta 127381 aaagatcata tctataacat ttttgaaaat tatctttttg gtttgctaat gaatagagct 127441 cagaatcatt ttcttttttg taatgaatta attggtcttt ttcatatgaa ttccatttgt 127501 ttaaatttcg attttgagcc atacaacctt gattaaccct atttcgccat ttttgtggca 127561 ttaatctaga ccatctaatc tgagataaat cgtattgata atgccgtctt aaccagtttt 127621 tccattgatt gattctataa ctctgaagtt tcttatgttt taattcagaa tgaaatattc 127681 ctagtgttcg aaaatagtcc tttattttag tcttaaggaa aaaagacgtt ctgttatatt 127741 gaagaacaga tcttaattta gacaaattaa taacttgggg ttgtgataat ttgtaaaata 127801 cgatatgctt gtgataagta ggataaatca aaaaaaatat gtgaattttt cttactaata 127861 ttataaagtg acttttttat agtcgaaata aagtgaattt ttttttgatt attaattttt 127921 tcttgattta tttcattatt ggaaatgtat ttatcaatca atttgtttgt tgattcaaga 127981 aagagttgtg tattaattct gggaatatta atgatagata aaaatagatc gatgtataat 128041 ctttgaatga ataattttag aaaataatgg aatttccata ttaatcgagt atttcttctt 128101 tttaatattt ggaaaatctt ttttggcgat tcgaattttt taatattatt tgttttatta 128161 ggactaatgt ctatttctgg agttactttc tttttctctt ttgtaattct ttctatttga 128221 tttttgattg tacttgttct atcagtcaaa tccttcattt tgctttctat cagtgaagaa 128281 tttggccaat ttccagattc aatttgacta aatgattcgt taattatctg attactcatt 128341 agagaatctt tttctttttt cgtttcattc gattcatcta tttctttgag tctaaataat 128401 acaattggat ttacttttga aagttctttt ttcatttttt ttataaatag actacttttg 128461 ataagccatt ttttggtttc ttttgaaatt cttcgaaata attttatttt tcctttgaaa 128521 acttttagag ttataaaata tttctttttg aattttccaa tttttttttc gagttcctta 128581 aaaatgggct caaaaaaaga agggcgtttt cggggagaac caaagggaag ttcagcttcc 128641 attccccaaa ctgttaaaaa acaaaaatca tctttttgtt ttttcttttt cattagctct 128701 ccacgggagg agtacagttt agatatatgc caaggtttca gacaaaaagg aaataatatt 128761 ttgatctgaa tgccatcttt caaccaattt tttggaaatt ctgtttctga taattgaaca 128821 ccattataag tacatttaat atgcatttct ctattccatt cctgcaaatc ttcagaccat 128881 tcaggaagtt gcaagactaa catacgcccg agatttttgg ctattatcaa tgaaggtaat 128941 acaatatatt ttcgaagaat tgattgagtt attaacatgt aacctcttat tatttgcgca 129001 aaaggaatgg tatcccaggc ttctgctatc tctatccgtg ctttttcctt tcttttgttc 129061 tccccttttt tgtccttttc ctttttctct tctctttttg tttgttcttc tctagactct 129121 agaatcttga attctccttc tttacctgac caatttcgaa aaattggttt aatcagtcca 129181 gagatatcaa aagaaaaaag aaaggggggg gttattctgt caagaaaaag gggggaatgc 129241 acatttgctt gaaagagttt ccaaataact gttttgcgcc tttgagcccg catagagcct 129301 ttgattatac ctcgccgaaa atctggttgt tgcgaatagc gtattaaagc cacttccttt 129361 gtttgatctt gatctgcggt atcagtatct ttggtatcag gatcgttatt ctggttgttg 129421 gcagtaaaaa tcactacacg tttggctttt cttgaacgaa tttgatgatc cagtggtacg 129481 ccctcttgat agtcacccga ttgttgttcc aattcggtga ttaatttatg tgaccagcga 129541 ggtatttttt tactgatttc ttttattcca atcgattttt tttcagatgt tgtcccatta 129601 ggagcaattg cattgaatac aaattttaca aatttagttc ttttttctga attcactctt 129661 ccctgttctt ggtctgaaaa taaagaaagg tctttcaaat ttaaactcga ttttggttcg 129721 ttaccaaatt cattgattaa agttaagaac tcgtcaattt ctgttgataa tggtttttta 129781 gcaaccgtat ccactttttg ttccaattct tggtaatcag tattcggaag aaagatagta 129841 tgaatcctat ttattctaac cctctctttc aaattttcta gcgaagtatt gtttatgatt 129901 gaaggtgaaa actttttttt gattgttcct cgatatggtc catttaacaa aggatcatac 129961 attttaggca cgtattcttt tttagtatca tcattacaca atctagtcct tgtttcaagt 130021 atatcgagag aaaaagattc cttgtctaga acttcaagtc gatttaaaaa ttccttattc 130081 agattattac ttttttcttt gttggtagaa atccactgat tgtccagttc attagggagt 130141 gttttttgga gtgacaatag gggtatcctt ctttttatca ttttccaaaa agttgataaa 130201 cttggcgggt atgtaaaaga tattctttgt tttccatcac ttttacatgt gttaaaaaaa 130261 tattgtgaca tttccgttct tatggcctgt tcaaatcgat tattctttat gtagcgaaat 130321 ggtcgattcc atcgattata atcgaaaaga agactcacaa gaggctgttg aaaccagaag 130381 aggtctttat tttcattttt tttatcaagc agttgcaatt taaaaatttc tgtattcccc 130441 gtgttattat tattcagata agaatcctca taatcataaa ttggactatt actagtatta 130501 atattattat agcctgtctc tgtaaggtga gagtggaatt tatcctttat tttgtccttt 130561 ccattcactc ggatttcttc cgtttcatcg attttgtccg gatcccacct ttcttccgaa 130621 aaaagggaag gagaaggata aggatcttct tcagtggatc cctcttgttc ctgtttagtc 130681 cccttcattt cggaagctgt ttctatttct acatctcttt cttcctcact ttccaccctt 130741 tcttctgttt ttgaggcttc ttttagtttc ttagtaagaa tgggtgaggg tattctgcct 130801 aaatagtaga cacaggtaat aaataagaga atactaaaga tccgagccat agaatttctc 130861 aattctaaca caaggtactt attagatcga atgtacttat tcgatctaat agaatgattt 130921 tgccgtatcc agactaatac caatccaagc catttcatga ataaaatgtg accaattaac 130981 caaccaacaa aaccacttgt tacaaataag atcttgctgt tgcatcgaaa gagataaatg 131041 ttgactaatc tggctaacat tgaacttggt aaaatgaaat ggttgaataa ttgaaaaatg 131101 agattattca ggaatacaca ttgaatgctg agattacgca ttgaatttct ggtagtagat 131161 ccataatcaa aaaagtgttt gtgattgttc cagaagaaat gaaacaaaag atatggtaga 131221 gctaggacag ttattgtatg aggtctaccc aatgctagat gcagaggcgc ataatagatc 131281 gatatgaaca tcatgagctg ccccgtaata aaaccagttg ttgctgatac cttcttctcg 131341 gttccttctt ccataaccag agctcggaga aggaagagat aagagggccc tatggagaat 131401 gtggtcagaa atccataata gagtccgacc acaacgaccg aattgattat cttcatgcat 131461 aaggatacta gattacctag tagaaaagat tgaaaaatca tcacaaacct cccttatttc 131521 ttttctattg caatttctgg attattatat gatgattttt gaactttcca tatatagaaa 131581 agaaatagaa agagatagac tagaaacgac atctgttatg tcaatgacac caaagggata 131641 ttaaatgaat ggaattggga tatggatgga atataatgaa atagagccac tttgaggttc 131701 cctctgaaat gaggcatgta agggagccac tacgaagaag ttccgggagt tacgaaggaa 131761 gcttcgagct catattggtc atgggttggg aacgggaatt gaactctatg agatcgaatc 131821 tcctgttgtt cctcagtagc tcagtggtag agcggtcggc tgttaaccga ttggtcgtag 131881 gttcgaatcc tacttgggga gatttgattg attctgaatt aaagaattca gaataaaggg 131941 gctcgctttg cccgttaaga gtaggtaacc cgttccctgt ctttgtttct attgcattct 132001 atctcatcgt atcacattct gttctgcgag attagaaaat caccatcaat acctcggtct 132061 aggtccgaga taatcctttg ttccatagcc ctggggctat ttacaactag ccaattaaga 132121 agtctcagat atactagcac tgcatcaaag atgcagtcat cgattctccc gagaggtcac 132181 aattgccgcg agcaaagata ttaatgacga ggaaggcttt tttgttatgc tactaatact 132241 tgctctgcta ttctgcccaa gcctggctga ggaagagtta cggggcgtaa aacaaaaaaa 132301 tatgctgacc gggcatacta tgtgtaatga ttcccccatt cacgataaat aaaaagagaa 132361 aaagaaaagc cattccattt cgacaaaaga cccacaccca agttccatag cttttggttc 132421 gctatcccga tcatgatttt cctaccccca gagggaaagg tacttccctt ttgggccggt 132481 tgtgggcgag gagggattcg aacccccgac accgtggttc gtagccacgt gctctaatcc 132541 tctgagctac aggccccacc ccgtctccac tggatctgtt cccgggagta ccctcaaaaa 132601 aaggaacctt tcctctcccc agccatttcg ggttaagaag atgtgaaggc gcgtttatct 132661 ctataagaag ggtgcgttcc gaggtgtgaa gtgggagaga agggatgtca caattggggt 132721 tttgaataaa acgacctttt gatttttcat tttttttttt ttcgttttca tattgaaaaa 132781 gtaataagaa tgagaggtgt taagcttttt atcatcctgg cgtcgagcta tttttccgca 132841 ggacctcccc tacagtatcg tcaccgcagt agagtttaac caccaagttc gggatggatt 132901 ggtgtggttc ctctacgcct aggacaccag aatatcgaac catgaacgaa gaaaggcatg 132961 agagaaaagc atattggcta gtgattgtga ggccccaatt cttgactgga ggggacacca 133021 aaggcctctg cccttccatc ccttggatag atagagaggg agggcagagc ttttggtttt 133081 ttcatgttgt caaagagttg aacaatggtt ttttcgtgtt gtcaaagatt tgaacaatga 133141 aaatagatgg cgagtgcctg atcgaattga tcaggtcatg taggaacaag gttcaagtct 133201 accggtctgt taggatgcct cagctgcata catcactgca cttccacttg acacctatcg 133261 taatgataaa cggctcgtct cgccgtgacc ttctcttgaa ttctcaaaaa aacttctgtc 133321 gctccatccc cgcaggggca gagaacccgt cgctgtctcg gctgtgctac cggaggctct 133381 ggggaagtcg gaataggaga gcactcatct tggggtgggc ttactactta gatgctttca 133441 gcagttatcc gctccgcact tggctaccca gcgtttaccg tgggcacgat aactggtaca 133501 ccagaggtgc gtccttcccg gtcctctcgt actagggaaa ggtcctctca atgctctaac 133561 gcccacaccg gatatggacc gaactgtctc acgacgttct gaacccagct cacgtaccgc 133621 tttaatgggc gaacagccca acccttggaa catactacag ccccaggtgg cgaagagccg 133681 acatcgaggt gccaaacctt cccgtcgatg tgagctcttg gggaagatca gcctgttatc 133741 cctagagtaa cttttatccg ttgagcgacg gcccttccac tcggcaccgt cggatcacta 133801 aggccgactt tcgtccctgc tcgacgggtg ggtcttgcag tcaagctccc ttctgccttt 133861 gcactcgagg gccaatctcc gtccggcccg aggaaacctt tgcacgcctc cgttaccttt 133921 tgggaggcct acgccccata gaaactgtct acctgagact gtcccttggc ccgtaggtcc 133981 tgacacaagg ttagaattct agcccttcca gagtggtatc tcactgatgg ctcgggcccc 134041 cccggaagga ggccttcttc gccttccacc taagctgcgc aggaaaggcc caaagccaat 134101 cccagggaac agtgaagctt catagggtct ttctgtccag gtgcaggtag tccgcatctt 134161 cacagacatg tctatttcac cgagcctctc tccgagacag tgcccagatc gttacgcctt 134221 tcgtgcgggt cggaacttac ccgacaagga atttcgctac cttaggaccg ttatagttac 134281 ggccgccgtt caccggggct tcggtcgccg gctcccctgt catcaggtca ccaacttcct 134341 tgaccttccg gcactgggca ggcgtcagcc cccatacatg gtcttacgac tttgcggaga 134401 cctgtgtttt tggtaaacag tcgcccgggc ctggtcactg cgaccccctt tgtgaggagg 134461 caccccttct cccgaagtta cggggctatt ttgccgagtt ccttagagag agttgtctcg 134521 cgcccctagg tattctctac ctacccacct gtgtcggttt cgggtacagg taccctcttg 134581 ctcaaagtcg ttcgagcttt tcctgggagt atggcatggg ttacttcagc gccgtagcgc 134641 ctggtattcg aacattggct cgaggcattt tctctacccc ttcttaccct gacaaagcag 134701 ggacacctta cgttcttgaa ccgataacca tctttcggct aacctagcct cctccgtccc 134761 tcgggaccaa caaggggcag tacaggaata ttcacctgtt gtccatcgac tacgcctttc 134821 ggcctgatct taggccctga ctcaccctcc gtggacgaac cttgcggagg aacccttagg 134881 ttttcggggc attggattct caccaatgtt tgcgttactc aagccgacat tctcgcttcc 134941 gcttcgtcca ccaccgctcg cgcggaggct tctctctaag gcggaacgct cccctaccga 135001 tgtattttta catcccacag cttcggcaga tcgcttagcc ccgttcatct tcggcgcaag 135061 agcgctcgat cagtgagcta ttacgcactc tttcaagggt ggctgcttct aggcaaacct 135121 cctggctgtc tctgcacccc tacctccttt atcactgagc gatcatttag gggccttagc 135181 tggtgatccg ggctgtttcc ctctcgacga tgaagcttat cccccatcgt ctcactagcc 135241 gaccttgacc cctgttattt tgaggtcata tctagtattc agagtttgcc tcgatttggt 135301 accgctctcg cggcccgcac cgaaacagtg ctttacccct agatgtccag tcaactgctg 135361 cgcctcaacg catttcgggg agaaccagct agctctgggt tcgagtggca tttcacccct 135421 aaccacaact catccgctga ttcttcaaca tcagtcggtt cggacctcca cttagtttca 135481 cccaagcttc atcctggtca tggatagatc acccaggttc gggtccataa gcagtgacaa 135541 ttgccctatg aagactcgct ttcgctacgg ctccggtggg ttcccttaac caagccactg 135601 cctatgagtc gccggctcat tcttcaacag gcacgcggtc agagccctgg ctcctcccac 135661 tgcttgggag cttacggttt catgttctat ttcactcccc gatgggggtt cttttcaccc 135721 ttccctcacg gtactacttc gctatcggtc acccaggagt atttagcctt gcaaggtggt 135781 ccttgctgat tcacacggga ttccacgtgc cccatgctac tcgggtcaga gcataagcta 135841 gtgatgcttt cggctactgg actttcgcca tctagggtgc agcattcggg ctgcttcgcc 135901 tagcagcacg acgcttgtat tgctctccca caaccccgtt ttcacggttt aggctgctcc 135961 catttcgctc gccgctacta cgggaatcgc ttttgctttc ttttcctctg gctactaaga 136021 tgtttcagtt cgccaggttg tctcttgcct gcccatggat tcagcagcag ttcgaaaggt 136081 tgccctattc gggaatctcc ggatctatgc ttattttcaa ctccccgaag catttcgtcg 136141 attactacgc ccttcctcgt ctctgggtgc ctaggtatcc accgtaagcc tttcctcgtt 136201 tgaacctcgc ccttcacttt taaggctatg ccatcctaag gtgctgctaa atggatggat 136261 cttatcaacg tccatgaatg ataaatcata gatcgaaccg ccgaatcgga aaaattgggt 136321 gctatcataa agctttgtat cggctaagtt cacgagttgg agataagcgg actcgaaccg 136381 ctgacatccg ccgcagggta aaccaccgcc tctcaggtcc cccgactgat tctaccatag 136441 aggccaacga tagacaataa ctcccccccg aacacagctt acaactttca tcgtactgtg 136501 ctctccaaag agcaactctt ctcaaaatct cactcaaaag gtgctgagtt ggaatcccat 136561 tctaactaag aatgagtcat tgcccttctc cgaccctgac tgcccaacct gagagcggac 136621 agctaatgcg ttccacttat tgaacagggt tctatggtcg gtccgtgacc cctggatgcc 136681 gaaggcgtcc ttggggtgat ctcgtagttc ctacggggtg gagatgatgg ggtcggtcca 136741 tggattttcc ttccttttct tttgccgcat ttcgctcaaa gggttgaagg gagatagtgc 136801 atcaagctgt tcgcaagggc caacttgatc ctcttcccca gagatctcag atgagggaac 136861 cctgggagag ccgccgactc caactaccgt ccatgtacga tccatactag atctgaccaa 136921 ctgcccatcc tacctcctct acgttcttga cagcccatct ttgtctcagt agagtctttc 136981 agtggcacgt ttcggtcctc ttccccatta cttagaaaaa gtgagccacc ggttcaggta 137041 caagatacta tcattaccgc ctggacaatt agacatccaa cccgtaatcg caacgaccca 137101 attgcaagag cggagctcta ccaactgagc tatatccccc cgagccaagt ggagcatgca 137161 tgaagtagtc agatgcttct tctattcttt tccctggcgc agctgggcca tcctggactt 137221 gaaccagaga cctcgcccgt gaagtaaatc atcgcaccta cggtccaacc aattgggaga 137281 gaatcaatag attccttttc gggagcgatt catccttccc gaacgcagca tacaactctc 137341 cgttgtactg cgctctccaa gtgtgcttgt tccccccttc ttccttaccc tggcaagtct 137401 ttgtgaaata actccgatga gaagaaaaaa gaaggcgtta agagaccctc ctggcccaac 137461 cctagacact ctaagatcct ttttcaaacc tgctcccatt tcgagtcaag aaaaaaacgg 137521 ctcgaatggt acgatccctc cgtcacccca gaatgaaagg ggcgatctcg tagttcttgg 137581 tctgtgaaga tgcgttgtta ggtgctccat tttattttcc cattgctaaa cctgtgctcg 137641 agagatagct gtccatacac tgataaggga tgtatggatt ctcgagaaga gaggagccgt 137701 ggtggtcccc cccggaccgc ccggatccca cgagtgaatc gaaagttgga tctacattgg 137761 atctcacccg aatcgcccca tctatcctcc tgaggaggag tttggtttca aaccccggtt 137821 cgaacaggag gagtacgcca tgctaatgtg ccttggatga tccacatctc agggtcaggc 137881 gccgatgagc acattgaact atccatgtgg ctgagagccc tcacagccca ggcacaacga 137941 cgcaattatc aggggcgcgc tctaccactg agctaatagc ccgtcgtgcg agcctcccac 138001 tgggggcccg ctatgccaaa agcgagagaa accccatccc tctctttcct tttttcgccc 138061 ccatgtcgcc acacgggggg aacatgggga cgtaaaaaag ggggtcctat caacttgttc 138121 cgacctagga taataagctc atgagcttgg tcttacttca ccgtcgagaa aggaaagaag 138181 acttccatct ccaagtttaa ctcagacgta gctcccttct tttttttggg ggtgtgaagc 138241 agtgtcaaac caaaataccc aacaagcatt agctctccct gaaaaggagg tgatccagcc 138301 gcaccttcca gtacggctac cttgttacga cttcactcca gtcactagcc ctgccttcgg 138361 catccccctc cttgcggtta aggtaacgac ttcgggcatg gccagctccc atagtgtgac 138421 gggcggtgtg tacaaggccc gggaacgaat tcaccgccgt atggctgacc ggcgattact 138481 agcgattccg gcttcatgca ggcgagttgc agcctgcaat ccgaactgag gacgggtttt 138541 tggggttagc tcacctcgcg ggatcgcgac cctttgtccc ggccattgta gcacgtgtgt 138601 cgcccagggc ataaggggca tgatgacttg acgtcatcct caccttcctc cggcttatca 138661 ccggcagtct gttcagggtt ccaaactcaa cgatggcaac taaacacgag ggttgcgctc 138721 gttgcgggac ttaacccaac accttacggc acgagctgac gacagccatg caccacctgt 138781 gtccgcgttc ccgaaggcac ccctctcttt caagaggatt cgcggcatgt caagccctgg 138841 taaggttctt cgctttgcat cgaattaaac cacatgctcc accgcttgtg cgggcccccg 138901 tcaattcctt tgagtttcat tcttgcgaac gtactcccca ggcgggatac ttaacgcgtt 138961 agctacagca ctgcacgggt cgatacgcac agcgcctagt atccatcgtt tacggctagg 139021 actactgggg tatctaatcc cattcgctcc cctagctttc gtctctcagt gtcagtgtcg 139081 gcccagcaga gtgctttcgc cgttggtgtt ctttccgatc tctacgcatt tcaccgctcc 139141 accggaaatt ccctctgccc ctaccgtact ccagcttggt agtttccacc gcctgtccag 139201 ggttgagccc tgggatttga cggcggactt aaaaagccac ctacagacgc tttacgccca 139261 atcattccgg ataacgcttg catcctctgt attaccgcgg ctgctggcac agagttagcc 139321 gatgcttatt ccccagatac cgtcattgct tcttctccgg gaaaagaagt tcacgacccg 139381 tgggccttct acctccacgc ggcattgctc cgtcagcttt cgcccattgc ggaaaattcc 139441 ccactgctgc ctcccgtagg agtctgggcc gtgtctcagt cccagtgtgg ctgatcatcc 139501 tctcggacca gctactgatc atcgccttgg taagctattg cctcaccaac tagctaatca 139561 gacgcgagcc cctcctcggg cggattcctc cttttgctcc tcagcctacg gggtattagc 139621 agccgtttcc agctgttgtt cccctcccaa gggcaggttc ttacgcgtta ctcacccgtc 139681 cgccactgga aacaccactt cccgtccgac ttgcatgtgt taagcatgcc gccagcgttc 139741 atcctgagcc aggatcgaac tctccatgag attcatagtt gcattactta tagcttcctt 139801 gttcgtagac aaagcggatt cggaattgtc tttcattcca aggcataact tgtatccatg 139861 cgcttcatat tcgcccggag ttcgctccca gaaatatagc catccctgcc ccctcacgtc 139921 aatcccacga gcctcttatc cattctcatt gaacgacggc gggggagcaa atccaactag 139981 aaaaactcac attgggctta gggataatca ggctcgaact gatgacttcc accacgtcaa 140041 ggtgacactc taccgctgag ttatatccct tccccgcccc atcgagaaat agaactgact 140101 aatcctaagt caaagggtcg agaaactcaa cgccactatt cttgaacaac ttggagccgg 140161 gccttctttt cgcactatta cggatatgaa aataatggtc aaaatcggat tcaattgtca 140221 actgccccta tcggaaatag gattgactac cgattccgaa ggaactggag ttacatctct 140281 tttccattca agagttctta tgcgtttcca cgcccctttg agaccccgaa aaatggacaa 140341 attccttttc ttaggaacac atacaagatt cgtcactaca aaaaggataa tggtaaccct 140401 accattaact acttcattta tgaatttcat agtaatagaa atacatgtcc taccgagaca 140461 gaatttggaa cttgctatcc tcttgcctag caggcaaaga tttacctccg tggaaaggat 140521 gattcattcg gatcgacatg agagtccaac tacattgcca gaatccatgt tgtatatttg 140581 aaagaggttg acctccttgc ttctctcatg gtacactcct cttcccgccg agcccctttt 140641 ctcctcggtc cacagagaca aaatgtagga ctggtgccaa caattcatca gactcactaa 140701 gtcgggatca ctaactaata ctaatctaat ataatagtct aatatatcta atataataga 140761 aaatactaat ataatagaaa agaactgtct tttctgtata ctttccccgg ttccgttgct 140821 accgcgggct ttacgcaatc gatcggatta gatagatatc ccttcaacat aggtcatcga 140881 aaggatctcg gagacccacc aaagtacgaa agccaggatc tttcagaaaa cggattccta 140941 ttcaaagagt gcataaccgc atggataagc tcacactaac ccgtcaattt gggatccaaa 141001 ttcgagattt tccttgggag gtatcgggaa ggatttggaa tggaataata tcgattcata 141061 cagaagaaaa ggttctctat tgattcaaac actgtaccta acctatggga tagggatcga 141121 ggaaggggaa aaaccgaaga tttcacatgg tacttttatc aatctgattt atttcgtacc 141181 tttcgttcaa tgagaaaatg ggtcaaattc tacaggatca aacctatggg acttaaggaa 141241 tgatataaaa aaaagagagg gaaaatattc atattaaata aatatgaagt agaagaaccc 141301 agattccaaa tgaacaaatt caaacttgaa aaggatcttc cttattcttg aagaatgagg 141361 ggcaaaggga ttgatcaaga aagatctttt gttcttctta tatataagat cgtgattgga 141421 tccgcatatg tttggtaaag agaataatct tatcctttga gaataatcaa aaatggacag 141481 tgttcaattg gaacatgaaa acgtgactaa attggtccta gttactcttc ggggcggagt 141541 ggaagaaggg ggggattctc gaacgcggaa aggatccaat gaattcgaaa gaattgaacg 141601 aggagccgta tgaggtgaaa atctcatgta cggttctgta gagtggcagt aagggtgact 141661 tatctgtcaa cttttccact atcaccccaa aaaaaccaaa ctctgcctta cgtaaagttg 141721 ccagagtacg attaacctct ggatttgaaa tcactgctta tatacccggt attggccata 141781 atttacaaga acattctgta gtcttagtaa gagggggaag ggttaaggat ttacccggtg 141841 tgagatatca cattgttcga ggaaccctag atgctgtcgg agtaaaggat cgtcaacaag 141901 ggcgttctag tgcgttgtag attcttatcc aagacttgta tcatttgatg atgccatgtg 141961 aatcgctaga aacatgtgaa gtgtatggct aacccaataa cgaaagtttc gtaaggggac 142021 tggagcaggc taccatgaga caaaagatct tctttctaaa gagattcgat tcggaactct 142081 tatatgtcca aggttcaata ttgaaataat ttcagaggtt ttccctgact ttgtccgtgt 142141 caacaaacaa ttcgaaatac ctcgactttt ttagaacagg tccgagtcaa atagcaatga 142201 ttcgaagcac ttctttttac actatttcgg aaacccaagg actcaatcgt atggatatgt 142261 aaaatacagg atttccaatc ctagcaggaa agggagggaa acggatactc aatttaaagt 142321 gagtaaacag aattccatac tcgatctcat agatacatat agaattctgc ggaaagccgt 142381 attcgatgaa agtcgtatgt acggcttgga gggagatctt tcatatcttt cgagatccac 142441 cctacaatat ggggtaaaaa agccaaaata agtgatttta gcccttataa aaagaaaact 142501 gattcttgaa cccctttcac gctcatgtca cgtcgaggta ctgcagaaaa aaaaacagca 142561 aaatccgatc caatttatcg taatcgatta gttaacatgt tggttaaccg tattctgaaa 142621 cacggaaaaa aatcattggc ttatcaaatt atctatcgag ccgtgaaaaa gattcaacaa 142681 aagacagaaa caaatccact atccgtttta cgtcaagcaa tacgtggagt aactcccgat 142741 ataacagtaa aagcaagacg tgtaggtgga tcgactcatc aagttcccat tgaaatagga 142801 tccacacaag gaaaagcact tgccattcgt tggttattag cggcatcccg aaaacgtccg 142861 ggtcgaaata tggctttcaa attaagttcc gaattagtgg atgctgccaa agggagtggc 142921 gatgccatac gcaaaaagga agagactcat agaatggcag aggcaaatag agcttttgca 142981 cattttcgtt aatccatgaa caggatctat acatctcgat cggaaaagaa tcaagagaaa 143041 aagaaagaat cggaattgat cgatagattt ctcgaaacaa acgaaaagga aagatgaaac 143101 ataaatcatg gatcaactaa gcctctcggg gactttctta aagaggaacc tcatgtaaat 143161 accatggaat aaggtttgat cctattcatg gagattccgt aactattcca aaaatggaaa 143221 gttcgacaca attgggattt tttttggaaa ttggaagcag ttactaattc atgatctggc 143281 atgtacagaa tgaaaacttc attctcgatt ctacgagaat ttttatgaaa gcctttcatt 143341 tgcttctctt cgatggaagt ttgattttcc cagaatgtat cctaattttt ggcctaattc 143401 ttcttctgat gatcgattca acctctgatc aaaaagatat accttggtta tatttcatct 143461 cttcaacaag tttagtaatg agcataacgg ccctattgtt ccgatggaga gaagaaccta 143521 tgattagctt ttcgggaaat ttccaaacga acaatttcaa cgaaatcttt caatttctta 143581 ttttactatg ttcaactcta tgtattcctc tatccgtaga gtacattgaa tgtacagaaa 143641 tggctataac agagtttctc ttattcgtat taacagctac tctaggggga atgtttttat 143701 gcggtgctaa cgatttaata actatctttg tagccccaga atgtttcagt ttatgctcct 143761 acctattatc tggatatacc aagaaagatg tacggtctaa tgaggctact atgaaatatt 143821 tactcatggg tggggcaagc tcttctattc tggttcatgg tttctcttgg ctatatggtt 143881 catccggggg agagattgag cttcaagaaa tagtaaacgg tcttatcaat acacaaatgt 143941 ataactcccc aggaatttca attgcgctca tattcattac cgtaggaatt gggttcaagc 144001 tttccccagc cccttctcat caatggactc ctgacgtata cgaaggagtg cggttcgttc 144061 gagaaattcc tacctctcta tctatctctg agatgtttgg atttttcaaa actccatgga 144121 catgcagaag agaaatgcta tccccactcg gaccaagaca gaacttttac ttgttcaaat 144181 aacaattaag gtgaagcagg gtcaggaacg acgaatctct ttatgataaa cagatccatt 144241 ttgcaagttc gttattacgg gtagttccta caaaggatcg gactaatgac gtatacaata 144301 cttgaattct cgatgtagat gctacatagt tggttctcat ccttcagaga ctacgagtgt 144361 aataagagca tccgtcgaca aaaggatcac cctaagatga tcatctcgtg gctattgaga 144421 acgaattaaa tcagatggtt ctatttctca atctttctga cttgctccta cgaaaccaag 144481 gtcgaaaaga ttgaaaaaat cagtcattca caaccactga tgaaggattc ctcgaaaagt 144541 taaggattag taatcctttt tagaaatcga atggattcgg tcttatacat acgcgaggaa 144601 ggtaatcaaa aaagaaagaa aatgggttct tctttctttt atcacttagg agccgtgtga 144661 gatgaaagtc tcatgcacgg ttttgaatga gagaaagaag tgaggaatcc tcttttcgac 144721 tctgactctc ccactccagt cgttgctttt ctttctgtta cttcgaaagt agctgcttca 144781 gcttcagcca ctcgaatttt cgatattcct ttttatttct catcaaacga atggcatctt 144841 cttctggaaa tcctagctat tcttagcatg atattgggaa atctcattgc tattactcaa 144901 acaagcatga aacgtatgct tgcatattcg tccataggcc aaatcggata tgtaattatt 144961 ggaataattg ttggagactc aaatgatgga tatgcaagca tgataactta tatgctgttc 145021 tatatctcca tgaatctagg aacttttgct tgcattgtat tatttggtct acgtaccgga 145081 actgataaca ttcgagatta tgcaggatta tacacaaaag atcctttttt ggctctctct 145141 ttagccctat gtctcttatc cctaggaggt cttcctccac tagcaggttt tttcggaaaa 145201 ctctatttat tctggtgtgg atggcaggca ggcctatatt tcttggtttt aataggactc 145261 cttacaagcg ttgtttctat ctactattat ctaaaaataa taaagttatt aatgactgga 145321 cgaaaccaag aaataacccc tcacgtgcga aattatagaa gatccccttt aagatcaaac 145381 aattccatcg aattgagtat gattgtatgt gtgatagcat ctactatacc aggaatatca 145441 atgaacccaa ttattgcaat tgctcaggat agcctttttt agcttctagg gtctatttct 145501 tagttcaaga tccctcttac taactggaat caaagaatta gtagatctgt tccgcccaaa 145561 atgggaatgg gctagggtta tgaacttata atctgatgat cgagtcgatt ccatgattat 145621 aagttcattc cataccggac caggccggaa tagggttata tacattctca ttatgagaag 145681 gggtcattcg ggcctatcta aatagatact atgtttacat atggattcct acatcattac 145741 attccattta ggattaggaa tacgcgtaat cggacctgct ttttacatat ctctattggg 145801 accctattca cctctttgag tgaatcgaga aataggtttg attgtccatc tttttgatat 145861 atatcaggca ttgcattctc cggataattc aaatcgaagc aattggatgt ccaactcggg 145921 cctatatgac atgaccgatc aatagatcca cctttgtcat atattccata catcacacta 145981 gatagatatc atattcatgg aatacgattc actttcaaga tgccttggtg gtgaaatggt 146041 agacacgcga gactcaaaat ctcgtgctaa atagcgtgga ggttcgagtc ctcttcaagg 146101 cataatattg agaatgctca ttgaatgagc attctcaata agagagctcg gatcgaatcg 146161 gtattgatat accgattcga tccgagctct tggaattgga ataaattcgg cagcggatcg 146221 cgaaatcttg gtgatcttct ctatctaatg aatggggagt ccgctttaaa atcgtccgcc 146281 ctgcacccac cccccgagta tatgcttcaa caggaatcac acaagggtag attagaaacc 146341 tctggtaaaa tgcccgcccg taacccagca gataaagtac attacatagt ccagggattg 146401 gcgacttacc cattcagtga ctttggcact ggacgttccc aaaatgggga ctatcgggta 146461 aattcaatat aatagacgcc tgttggcatt ccagccttcc ttctcctttc agggcctatc 146521 cgaaagagaa tccagtactt cttggtcgtg aatatctgaa ctggttgttt gctgttcaag 146581 aattcttgtt taggcagttc ataccatcca tacatagtgt tttgatctaa gatttcaatt 146641 cttccgtgtt tcagcagtaa catattcttc catggagcta aggtccaaaa tatggaagaa 146701 acaagcgttt ccacgactct accacccagt caattctgtt ccacttaatc cctctttcat 146761 ggccacatat ctttccggct aaggaatggg aaatctttct cctgttacat gaatccaatt 146821 ttcatttcat ccgggaaaag ccatcttttt ctcaacaatg tctttgtcat ttgatccaat 146881 agcgttccgt tagataggaa cagatttgat aaatactgat aactctcgga tagagtatta 146941 gaacggaaag atccattaga taatgaactg ttggttctaa gccatctctg acgattaatc 147001 aacaattcga agtgcttttc ttgcgtattc ttgataaacc agcgtttata tatagatgta 147061 ggagggtctg tttgggaagt aagaagcccc tttgacatct cttcatctgc aaataattct 147121 cgatgtgaaa acacagagcc agggggctga tctttgaata ggaaaaagag tggatctgca 147181 gggtcccaaa tgaattggct tattcgaaaa aggccttgtt ctttggaaga tctatctcgt 147241 gtctggtact gcatggttcc actctgcaag aactccgaat cattctcttg aagctcatcc 147301 tcttcatcat aaatgatccg cttgccccga aatgacctgg accaataggg aaatcccaat 147361 tcattgggcc tttcgataca atcaaataga aagccccaag ggcgccatat tctaggagcc 147421 caaactatgt gattgaataa atcctcctgc gggtcaaggg ctccttctcc ctccccttct 147481 tcaaactccg attcatattt ttcatagaga aatctctgat caaggataga acaagagccg 147541 ttttgcatca tatctaaggg attcctcggt tcgggccgaa gaagcaatgt cactcgatca 147601 ttatcaaact gactgcaatc tttttctgtc cgtgaagatc ccaccagagc gccttctact 147661 tctaataggc catgaactag atcagaatca ttctcaacga gtccataaga agtgatccca 147721 tttttttcat cgggtccgga taaagaccaa agatcttgag cgaccgatcc ggcagaacaa 147781 ctcaaaagat aaagaagtat cgttaatctc ttcatgctcg ttccaagctc gaagtaccat 147841 ttgtacaaat aagaatcccc ttcgttacat gatttcttct tcatatagat agatatagga 147901 tctatggggc aattacttag aagtacattt tgtgctacag cccttcctat ctgatagaaa 147961 aggatcccat gatcctgaac cgatcttacc tgggatcgca aatcccaagt ttgtctatga 148021 agagcggatc taattgtatt agtgtctata attgatttct tctgtgtaat actaatcgat 148081 aggacctcat tggtaagtgc tacaagatct cgtgcattgg aacccatggt tatggacccg 148141 aatccgttag tatggaacat tttcttttcc aagtgaaatc ccctagtata tgaaagagtg 148201 aaaaagtgct ttcgttgttg tggaagaaga agccttcgta tcttaatgca cgtatttaat 148261 ttattcggag ctattagagc gggatccact ttttggggaa tatgagtcga agcaataaca 148321 agaatatttc tagtagaaca tctttcacaa tccctggaga gatggttcac taatagaccg 148381 agggctaagt cattcgactc attcacatcc agatcatgaa tgtttggaat ccatattatg 148441 caaggagaca ttgcttttgc taattcgaat tgaagggtga tataaaatcg gtctatttcc 148501 ggcatcatat ccatagttag cccattcatc ctagttagca gtttcagctc cgtatcaagg 148561 tcacgatcga tatcgtcact agcatcaaga ttgtcactat catcaatatc gtcactatca 148621 tcaatatcga tctcatcaag aagaaaacct ttaggcttgt tatccaggaa cttgttcaga 148681 aataccgtaa tgaaaggaac ataggagttt gtcgctaggt atttgaccaa ataggatcgt 148741 ccagttccta tagaacctat cactaaaata cccctagagg gggataaggc taagcggagc 148801 gaaaagggtt ttccatgaga tgggaaatga aaactatttt ccccacacga agtttgtgaa 148861 taagtgattg tctgataatg agcaaggaat atccgtcttt ctgctaaaca ggatggattg 148921 aactcataat tcattagatg ctttttatga atgtcaacta agtatcgtaa gtaaattgct 148981 cccggttgtt caatcatttg ataaccagag tcattctttg ataaacgatc actatgagtc 149041 agactcaata gaatttgatc aatcctattt tctgtcgtta aggtggagaa ctgaaccaag 149101 aattctcttt cttcatcatc aatcgaatca ctgttcgcga cccaggattc tattttatca 149161 tcaatccaat ccccgttcac gttttttctt tttcttatca atgaatagat ctctttactt 149221 gtatgactta gatgtctcgt atttctcgaa aaagtgattc gattgatggg atttggtatg 149281 agatcgatga tctcgatgag attgatattc caatctttct tcttagaacg tattgatttg 149341 accccataag cgggaccaag catgttgccg ccagaagcag aaccccgtat ttcttctaga 149401 gaatctccta attgttccag agcaactaga aagagattct ttaaccagaa agaattcggt 149461 tcagatgtag gatacctatc cagaagtttt cgcaactcaa tcatagatga tggaatcatc 149521 aaagatttga ccttttcgaa ctctgtctgt aactcactag aggcccggga aacaaagaga 149581 agatgtgtac gaacgagata tccagcaaca agaagaagga aaaggattga atagaggaac 149641 tcccgagcat ttggcgatct cagatgtgtc gatatcaatg gtgactcatt atttcgatga 149701 atcatttctt cggacagaag aagattatgt aaacacttac tcgagatctc acttatcaga 149761 ttccattgtg gaagacacaa ttttttctga agaattcgcc atgatatacc tgatccatgc 149821 ataatatcat gaaaaatggg tacaaatttt tgactgctac ttagtattgg caataggtct 149881 gaaaaagtat ctaaaaatat caaatttaga tatttgtacc ctgtcgaagt aaggaaccat 149941 ggcatatatg tttggaatag attccatttt gagagagttg aaaaagcact atctcgttga 150001 aaggttctat acatctgccc tttctcaacg catttcttta gacaaagact ccgttttttc 150061 ctcttttcgg atgataaatc tttctcagaa catggagtgt gaatcaaacc catgtttgaa 150121 ttgaaattga gatactgatg caagttcttc ccttctgaat cagatagatt catatctgaa 150181 agaggttgac aataagttct ttcaaaattg actatttgcc cctctgttag aggtgttcca 150241 gaaatgtctg cgatcgagta aatagctcta cgaacgaatg gatcggatcg acttggaaaa 150301 tggaaagatt tgtacaagtt atacgtttcg tcaccacttt gtggaaaatc gttaggtatg 150361 aatatgttag atacctgtga ctcgattggt gaaatagtat ctctccccca aaaagcatgt 150421 ttttttttac cgacgcacaa agaaaatatt ttgttgcgaa tgaacaagat attgaggaat 150481 tgtccatacg taaaatcaga attattgata cgggcctttt ccacagaaaa ggggaatctt 150541 gtgttccaat agaagcagaa gtgatgtgga ttattcaaga atcgaagtcg atttgcttta 150601 taaaaagaag atatcaatga acttctatga aatggtttca cgggattcag ccaattgtct 150661 tgatcgtgga atatcattga gaaataggaa tccgggttat caaaggattt cctgcgatta 150721 tttctagtat ggaatgagtc aatcatccac tttggtatct tattgaacaa aaatggtgat 150781 attgttcctc cattgatcaa gaatttcgat ttttgggaag tatcatgatc gtccaataag 150841 aagggtttcc attttttcaa atgaacaatt tgaagaccta ttgattctaa caactgattg 150901 cagagttgat cattcggacc tttcaattca tagatgtaga tctcggacct atgaatgggg 150961 atatttccga aactcacaca gaaaaaagga agtgagttag acaaaaagaa aagcaacttg 151021 gacaaaaaaa gaagtgactt ggacaaaaag aaacgaagtg gcttagacaa atcttttttg 151081 tcgataacct cagaccaatc aatcgaatat tgattaatac gtaatcgatc gaacactact 151141 tgaaaacggc tcttctgctc cgaaacggac tgttccaaat gttcctggaa attcttgctc 151201 ccattggacc atttgtatct atatgcatca ggatcccgat tcatggatct ctcggttcga 151261 gaaatcaaaa taagaggctc gaaccatttc ttctgactct ttttcaaatt cgataaatat 151321 tggttgatcg tatatttcat tatagttcta tgattcagag tatcctttcc tatttgatcc 151381 ctttgaattc catattcgaa gttgcgatcg gatctattca ttaaaaagaa tcgattcaat 151441 acatttctta tgtacccata ggtactatat tggatttgaa tcagatttcg gatcaatcta 151501 tattgagtga ctgcctccat tatgttgttg ctagcaaata ccactatttt tggttttgga 151561 tcttccaaat cattcccgca ggagatccgg acccattttt ttctgatcct tcgagaaaaa 151621 gattcattct cttcataaaa aataggaggt agaaccaata aagatttctt tttcgattca 151681 tccctggcct cattcaagaa ttgtttttga tccaatccgc aggaatcaat agaaaaggca 151741 aatcccttat gatacaccag atccggctcg gttattgata gagtgaatag atctgccatt 151801 tcttgaaatc tctcttctga ttcaaaatcg tagtgtaacg tgtatcctcc cctgttccgg 151861 tcatggaata gatgaaataa atcaaaaaat ggatttttgt tcaagaatga aatcttattg 151921 gaactgtcca tatccggttc atcttcggaa ccatatcaca tcccggatct gatgaaatag 151981 gatgaattga gacggtattt tgtaaatacg taattatctt gaatatatta accatttctt 152041 tattttccga tcgcttggaa gggacaaaag aaagatcttg ttgtttcttc aacaatttct 152101 gatccctagt ggacctctca gtaggattcg aacccagatg aagttctgac catctatcag 152161 agaaaaaaga acgaacggat cttgtaggat tcccaagaaa ttcttcgatt tcttccggaa 152221 acagatgatt aatcatctgc ttctcacgtt ccgtgaatag ccgggacatt gaggaatatc 152281 cagaaaggca tttcgggaat cggcctgatt ctatctcttt tcgttccgtt tgaagaaagg 152341 aaggatccca aagaatcgat ctttcttttc gttgttgaat ctctctttga ttaatcaatg 152401 tgtgatattc cgaatcctca ttactaatgg aatccaaatg atctctggat tgatcagaag 152461 atcctttcag ttggctagaa tccgttactt gaacgaaact agatcttgtg gaatcatatt 152521 gaatatttga cgatacattc tgtaccttgc taaaaaaccg atccttgttt accaaccaca 152581 cattgtctaa ccaaatccaa ttctctctcg atacgttcct caaaaaatcc gattcgggcg 152641 gattcttccc ccaactaacg aagagatctt ggcggaattg ccacatatga aattgagcac 152701 agttttgcaa agaaatagcc cacttgtttc tcgagaagag atgggaaaca tgctcaatat 152761 catttgattg aatagttgac ccagcccctt gttgtttgaa gaaaccctcc acttcaattg 152821 gtattttttc acgaaaagca gacatgagat aagaaatcca gtgtttcact aagatttcga 152881 atagcggtcc cgaattcaag ttgattctat ttcgactctt cctcagagaa agacgatcaa 152941 acaattccca atcatggtcc ttgcggatcg gatcatccat ataatataca aaaagaaact 153001 ccagatattt gagatctttc tctttgaata agatctcaat tccagcgacg gtttcattag 153061 atatcttaca actagaatcc ctcttttttc cgatccagtt cctccaccaa cgcgaacccc 153121 agttagattc aggcatgcta cactttttag ttattgggag aacccaagta ctctctttcg 153181 gattcaggaa acaactctca gagatctttt ttcctttggg aagatacagg agcgaaacaa 153241 tcaacctatt gatattggaa gacccaacgg attcttccaa tgtatcattt ctgggtccaa 153301 tggaattcat aggtatagga agaagcccta tcaaatagag attttttctt tcgaccatat 153361 ttcgattgtt aatacgatat ataaggaccg ctactacaaa gagtattaca cccttgatcg 153421 tgaaatatcg attgcttgtt gaaccctgtg aattgcgtga aagtaggata ctccaaattc 153481 gggggtcaaa gagttttaga aaacgttctt ggtggaaaaa aatgtgaatg aaggatcccg 153541 ctgaattgaa ttgggtccat gaatctaaga aatggtgaga attcttgatc tctctcaata 153601 tctctctcaa ttcgaaaatc caggatttga attgatgtcc tctcattgat tcctcctaaa 153661 ttgcattgat ttatcctaac taaattgcat tgatttatcc taaagatttc atttcaattg 153721 gaatttggtt attcaccatg tacgaggatc cccgctaagc atccatggct gaatggttaa 153781 agcgcccaac tcataattgg cgaattcgta ggttcaattc ctactggatg cacgccaatg 153841 ggaccctcca ataagtctat tggaattggc tctgtatcaa tggaatctca tcatccatac 153901 ataacgaatt ggtgtggtat attcatatca taatatatga acagtaagaa ctagcattct 153961 tattgagact ataactcata gggaagaaaa tcgatttatg gatggaatca aatatgcagt 154021 atttacagac aaaagtattc ggttattggg gaaaaatcaa tatacttcta atgtcgaatc 154081 aggatcaact aggacagaaa taaagcattg ggtcgaactc ttctttggtg tcaaggtaat 154141 agctatgaat agtcatcgac ttccgggaaa gagtagaaga atgggaccta ttatgggaca 154201 tacaatgcat tacagacgta tgatcattac gcttcaaccg ggttattcta ttccacctct 154261 tagaaagaaa agaacttaaa aaaaaatact taatagcatg gcgatacatt tatacaaaac 154321 ttctaccccg agcacacgca atggaaccgt agacagtcaa gtgaaatcca atccacgaaa 154381 taatttgatc tatggacagc atcattgtgg taaaggtcgt aatgccagag gaatcattac 154441 cgcaaggcat agagggggag gtcataagcg tctataccgt aaaatcgatt ttcgacggaa 154501 tgaaaaagac atatatggta gaatcgtaac catagaatac gaccctaatc gaaatgcata 154561 catttgtctc atacactatg gggatggtga gaagagatat attttacatc ccagaggggc 154621 tataattgga gataccattg tttctggtac agaagttcct ataaaaatgg gaaatgccct 154681 acctttgagt gcggtttgaa ctattgattt acgtaattgg aaataaccaa ttaggtttac 154741 gacgaaacct agaaatcgat cactgatcca atttgagtac ctctgcagga tagacctcaa 154801 cagaaaactg aagagtaacg gcagcaagtg attgagttca gtagttcctc atataaaatt 154861 attgactcta gagatatagt aatatggaga agacaaaatt gtttcaagca ccgacagaac 154921 cggaagcgcc ccttctttca aagagaggag gacgggttat tcacatttca tttgatggtc 154981 agaggcgaat tgaaagttaa gcagtgggaa ttctaaagat tccccggggg aaaaatagag 155041 atgtctccta cgttacccat aatatgtgga agtatcgacg taatttcata gagtcattcg 155101 gtctgaatgc tacatgaaga acataagcca gatgacggaa cgggaagacc caggatgtag 155161 aagatcataa catgagtgat tcggcagatt tggattcata tatatatcca cccatgtggt 155221 acttcattct acgatatata taagatccat ctgtatagat atcatcatct acatccagaa 155281 agaagtatgc tttggaagaa gcttgtacag tttgggaagg ggttttgatt gatcaaaaga 155341 agaatctact tcaaccgata tgcccttagg cacggccata cataacatag aaatcacact 155401 tggaaagggt ggacaattag ctagagcagc gggtgctgta gcgaaactga ttgcaaaaga 155461 ggggaaatcg gccacattaa aattaccttc tggggaggtc cgtttgatat ccaaaaactg 155521 ctcagcaaca gtcggacaag tggggaatgt tggggtgaac cagaaaagtt tgggtagagc 155581 cggatctaag cgttggctag gtaagcgtcc tgtagtaaga ggagtagtta tgaaccctgt 155641 agaccatccc catgggggtg gtgaagggag agccccaatt ggtagaaaaa aacccacaac 155701 cccttggggt tatcctgcac ttggaagaag aagtagaaaa aggaataaat atagtgataa 155761 tttgattctt cgtcgccgta gtaaatagga gagaaaatcg aattaaattc ttcgttttta 155821 caaaaaaaaa aaaaatagga gtaa // LOCUS TIPNPSS 6425 bp ds-DNA SYN 09-AUG-1990 DEFINITION A.tumefaciens T-DNA vector containing octopine T-DNA borders and markers: neomycin-phosphotransferase - octopine synthase (3' end) and Sp/Sm adenyltransferase. complete cds. ACCESSION M35007 KEYWORDS neomycin phosphotransferase; streptomycin/spectinomycin adenyltransferase. SOURCE N.tabacum T-DNA inserts in A.tumefaciens DNA. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 6425) AUTHORS Gheysen,G.D.R., Herman,L., Breyne,P., Gielen,J., Van Montagu,M. and Depicker,A. TITLE Cloning and sequence analysis of truncated T-DNA inserts from Nicotiana tabacum JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.D.R.Gheysen, 01-JUN-1990. FEATURES from to/span description pept 2782 1985 (c) neomycin phosphotransferase (gtg start codon) pept.ps 5009 4042 (c) streptomycin/spectinomycin adenyltransferase (gtg start codon) mRNA / 1811 1105 (c) octopine synthase (3' end) recomb 24 25 T-DNA end/plant DNA start recomb 40 41 plant DNA end/T-DNA start recomb 1094 1095 plant DNA end/T-DNA start recomb 2786 2787 T-DNA end/plant DNA start recomb 3044 3045 T-DNA end/plant DNA start recomb 3354 3355 plant DNA end/T-DNA start recomb 5585 5586 T-DNA end/plant DNA start recomb 6389 6390 T-DNA end/plant DNA start signal 1650 1645 (c) poly-A signal signal 1686 1681 (c) poly-A signal site 1634 1633 (c) major poly-A site site 1 24 left T-DNA border site 372 395 24 bp border-like sequence site 1569 1592 24 bp border-like sequence site 1669 1692 24 bp border-like sequence site 1779 1756 (c) 24 bp border-like sequence site 2128 2105 (c) 24 bp border-like sequence site 2449 2472 24 bp border-like sequence site 2485 2462 (c) 24 bp border-like sequence site 3660 3683 24 bp border-like sequence site 3875 3898 24 bp border-like sequence site 4359 4336 (c) 24 bp border-like sequence site 5868 5891 24 bp border-like sequence BASE COUNT 1509 a 1754 c 1710 g 1452 t ORIGIN 1 cggcaggata tattcaattg taaatggctt catgtccggg aaatctacat ggatcagcaa 61 tgagtatgat ggtcaatatg gagaaaaaga aagagtaatt accaattttt tttcaattca 121 aaaatgtaga tgtccgcagc gttattataa aatgaaagta cattttgata aaacgacaaa 181 ttacgatccg tcgtatttat aggcgaaagc aataaacaaa ttattctaat tcggaaatct 241 ttatttcgac gtgtctacat tcacgtccaa atgggggctt agatgagaaa cttcacgatc 301 gatgccttga tttcgccatt cccagatacc catttcatct tcagattggt ctgagattat 361 gcgaaaatat acactcatat acataaatac tgacagtttg agctaccaat tcagtgtagc 421 ccattacctc acataattca ctcaaatgct aggcagtctg tcaactcggc gtcaatttgt 481 cggccactat acgatagttg cgcaaatttt caaagtcctg gcctaacatc acacctctgt 541 cggcggcggg tcccatttgt gataaatcca ccatcacaat agatagtcta atggacgaaa 601 aaggcgaata tttcgatgct gagattcgac gcaattaatt cgagaaaaat cccgtgattg 661 atgctgttga gttaccaata atatgggcag cgaaggccat ttaattataa gatcctgcaa 721 gcctcgtcgt cctggccgga ccacgctatc tgtgcaaggt ccccggcccc ggacgcgcgc 781 tccatgagca gagcgcccgc cgccgaggcg aagagtcggg cggcgccctg cccgtcccac 841 caggtcaaca ggcggtaacc ggcctcttca tcgggaatgc gcgcgacctt cagcatcgcc 901 ggcatgtccc cctggcggac gggaagtatc cagctcgacc aaagcggcca tcgtgcctcc 961 ccactcctgc agttcggggg catggatgcg cggatagccg ctgctggttt cctggatgcc 1021 gacggatttg cactgccggt agaactccgc gaggtcgtcc agcctcaggc agcagctgaa 1081 ccaactcgcg aggggatcga gcccctgctg agcctcgaca tgttgtcgca aaattcgccc 1141 tggacccgcc caacgatttg tcgtcactgt caaggtttga cctgcacttc atttggggcc 1201 cacatacacc aaaaaaatgc tgcataattc tcggggcagc aagtcggtta cccggccgcc 1261 gtgctggacc gggttgaatg gtgcccgtaa ctttcggtag agcggacggc caatactcaa 1321 cttcaaggaa tctcacccat gcgcgccggc ggggaaccgg agttcccttc agtgaacgtt 1381 attagttcgc cgctcggtgt gtcgtagata ctagcccctg gggccttttg aaatttgaat 1441 aagatttatg taatcagtct tttaggtttg accggttctg ccgctttttt taaaattgga 1501 tttgtaataa taaaacgcaa ttgtttgtta ttgtggcgct ctatcataga tgtcgctata 1561 aacctattca gcacaatata ttgttttcat tttaatattg tacatataag tagtagggta 1621 caatcagtaa attgaacgga gaatattatt cataaaaata cgatagtaac gggtgatata 1681 ttcattagaa tgaaccgaaa ccggcggtaa ggatctgagc tacacatgct caggtttttt 1741 acaacgtgca caacagaatt gaaagcaaat atcatgcgat cataggcgtc tcgcatatct 1801 cattaaagca gggggtgggc gaagaactcc agcatgagat ccccgcgctg gaggatcatc 1861 cagccggcgt cccggaaaac gattccgaag cccaaccttt catagaaggc ggcggtggaa 1921 tcgaaatctc gtgatggcag gttgggcgtc gcttggtcgg tcatttcgaa ccccagagtc 1981 ccgctcagaa gaactcgtca agaaggcgat agaaggcgat gcgctgcgaa tcgggagcgg 2041 cgataccgta aagcacgagg aagcggtcag cccattcgcc gccaagctct tcagcaatat 2101 cacgggtagc caacgctatg tcctgatagc ggtccgccac acccagccgg ccacagtcga 2161 tgaatccaga aaagcggcca ttttccacca tgatattcgg caagcaggca tcgccatggg 2221 tcacgacgag atcctcgccg tcgggcatgc gcgccttgag cctggcgaac agttcggctg 2281 gcgcgagccc ctgatgctct tcgtccagat catcctgatc gacaagaccg gcttccatcc 2341 gagtacgtgc tcgctcgatg cgatgtttcg cttggtggtc gaatgggcag gtagccggat 2401 caagcgtatg cagccgccgc attgcatcag ccatgatgga tactttctcg gcaggagcaa 2461 ggtgagatga caggagatcc tgccccggca cttcgcccaa tagcagccag tcccttcccg 2521 cttcagtgac aacgtcgagc acagctgcgc aaggaacgcc cgtcgtggcc agccacgata 2581 gccgcgctgc ctcgtcctgc agttcattca gggcaccgga caggtcggtc ttgacaaaaa 2641 gaaccgggcg cccctgcgct gacagccgga acacggcggc atcagagcag ccgattgtct 2701 gttgtgccca gtcatagccg aatagcctct ccacccaagc ggccggagaa cctgcgtgca 2761 atccatcttg ttcaatccac atgatcagat ctctaggcgc gtgggtgcgg acgtagtcag 2821 cgccattgcc gatcgcgtga agttccgccg caaggccgct ggacccagat cctttacagg 2881 aaggccaacg gtggcgccca agaaggattt ccgcgacacc gagaccaata gcggaagccc 2941 caacgccgac ttcagctttt gaaggttcga cagcacgtgc agcgatgttt ccggtgcggg 3001 gctcaagaaa aatcccatcc ccggatcgag gatgagccgg tcggcagcga ccccgctccg 3061 tcgcaaggcg gaaacccgcg cctcgaagaa ccgcacaatc tcgtcgagcg cgtcttcggg 3121 tcgaaggtga ccggtgcggg tggcgatgcc atcccctgcg ctgagtgcat aaccaccagc 3181 ctgcagtccg cctcagcaat atcgggatag agcgcagggt caggaaatcc ttggatatcg 3241 ttcaggtagc ccacgccgcg cttgagcgct agcgcgggtt tccggttgga agctgtcgat 3301 tgaaacacgg tgcatctgat cggacagggc gtctaagagc ggcgcaatac gtctgatctc 3361 atcggccggc gatacaggcc tcgcgtccgg atggctggcg gccggtccga catccacgac 3421 gtctgatccg actcgcagca tttcgaccgc cgcggtgaca gcgttggtgg ggtctagcag 3481 tacgtcaatc gaagaaggag tcctcggtga gattcagaat gccgaacacc gtcaccatgg 3541 cgtcggcctc cgcagcgact tccacgatgg ggatcgggcg agcaaaaagg cagcaattat 3601 gagccccata cctacaaagc cccacgcatc aagcttttga ccctgaagca actaggcaat 3661 ggctgtaatt atgacgacgc cgagtcccga accagactgc ataagcaaca accgacagaa 3721 tggatttcga aaccagagaa agaaaataaa tgcgatgcca taaccgatta tgaacaacgg 3781 cggaaggggc aagcttagta aatgcctcgc tagattttaa tgcggatgtt gcgattactt 3841 cgccaactat tgcgataaca agaaaaagcc agcctttcat gatatatctc ccaatttgtg 3901 tagggcttat tatgcacgct taaaaataat aaaagcagac ttgacctgat agtttggctg 3961 tgagcaatta tgtgcttagt gcatctaatc gcttgagtta acgccggcga agcggcgtcg 4021 gcttgaacga attgttagac attatttgcc gactaccttg gtgatctcgc ctttcacgta 4081 gtggacaaat tcttccaact gatctgcgcg cgaggccaag cgatcttctt cttgtccaag 4141 ataagcctgt ctagcttcaa gtatgacggg ctgatactgg gccggcaggc gctccattgc 4201 ccagtcggca gcgacatcct tcggcgcgat tttgccggtt actgcgctgt accaaatgcg 4261 ggacaacgta agcactacat ttcgctcatc gccagcccag tcgggcggcg agttccatag 4321 ocgttaaggt ttcatttagc gcctcaaata gatcctgttc aggaaccgga tcaaagagtt 4381 cctccgccgc tggacctacc aaggcaacgc tatgttctct tgcttttgtc agcaagatag 4441 ccagatcaat gtcgatcgtg gctggctcga agatacctgc aagaatgtca ttgcgctgcc 4501 attctccaaa ttgcagttcg cgcttagctg gataacgcca cggaatgatg tcgtcgtgca 4561 caacaatggt gacttctaca gcgcggagaa tctcgctctc tccaggggaa gccgaagttt 4621 ccaaaaggtc gttgatcaaa gctcgccgcg ttgtttcatc aagccttacg gtcaccgtaa 4681 ccagcaaatc aatatcactg tgtggcttca ggccgccatc cactgcggag ccgtacaaat 4741 gtacggccag caacgtcggt tcgagatggc gctcgatgac gccaactacc tctgatagtt 4801 gagtcgatac ttcggcgatc accgcttccc tcatgatgtt taactttgtt ttagggcgac 4861 tgccctgctg cgtaacatcg ttgctgctcc ataacatcaa acatcgaccc acggcgtaac 4921 gcgcttgctg cttggatgcc cgaggcatag actgtacccc aaaaaaacag tcataacaag 4981 ccatgaaaac cgccactgcg ccgttaccac cgctgcgttc ggtcaaggtt ctggaccagt 5041 tgcgtgaggc catacgctac ttgcattaca gcttacgaac cgaacaggct tatgtccact 5101 gggttcgtgc cttcatccgt ttccacggtg tgcgtcaccc ggcaaccttg ggcagcagcg 5161 aagtcgaggc atttctgtcc tggctggcga acgagcgcaa ggtttcggtc tccacgcatc 5221 gtcaggcatt ggcggccttg ctgttcttct acggcaagtg ctgtgcacgg atctgccctg 5281 gcttcaggag atcggaagac ctcggccgtc cgggcgcttg ccggtggtgc tgaccccgga 5341 tgaagtggtt cgcatcctcg gttttctgga aggcgagcat cgtttgttcg cccagcttct 5401 gtatggaacg ggcatgcgga tcagtgaggg tttgcaactg cgggtcaagg actggatttc 5461 gatcacggca cgatcatcgt gcgggagggc aagggctcca aggatcgggc cttgatgtta 5521 cccgagagct tggcacccag cctgcgcgag cagctgtctc gtgcacgggc atggtggctg 5581 aaggactagg ccgagggccg cagcggcgtt gcgcttcccg acgcccttga gcggaagtat 5641 ccgcgcgccg ggcattcctg gccgtggttc tgggtttttg cgcagcacac gcattcgacc 5701 gatccacgga gcggtgtcgt gcgtcgccat cacatgtatg accagacctt tcagcgcgcc 5761 ttcaaacgtg ccgtagaaca agcaggcatc acgaagcccg ccacaccgca caccctccgc 5821 cactcgttcg cgacggcctt gctccgcagc ggttacgaca ttcgaaccgt gcaggatctg 5881 ctcggccatt ccgacgtctc tacgacgatg atttacacgc atgtgctgaa agttggcggt 5941 gccggagtgc gctcaccgct tgatgcctgc cgcccctcac tgtgagaggt agggcagcgc 6001 aagtcaatcc tagcggattc actacccctg cgcgaaggcc atcggtgccg catcgaacgg 6061 ccggttgcgg aaagtcctcc ctgcgtccgc tgatggccgg cagcagcccg tcgttgaagg 6121 atccctgaaa gcgacgttgg atgttaacat ctacaaattg ccttttctta cgaccatgta 6181 cgtaagcgct tacgtttttg gtggaccctt gaggaaactg gtagctgttg tgggcctgtg 6241 gtctcaagat ggatcattaa tttccacctt cacctacgat ggggggcatc gcaccggtga 6301 gtaatattgt acggctaaga gcgaatttgg cctgtagacc tcaattgcga gctttctaat 6361 ttcaaactat tcgggcctaa cttttggtgt gatgatgctg actggcagga tatataccgt 6421 tgtaat // LOCUS TOBPRMMG 200 bp ds-DNA PLN 09-AUG-1990 DEFINITION N.tabacum promoter activating a promoterless nptII marker gene. ACCESSION M34757 KEYWORDS . SOURCE N.tabacum (strain SR1) DNA. ORGANISM Nicotiana tabacum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae. REFERENCE 1 (bases 1 to 200) AUTHORS Gheysen,G.D.R., Herman,L., Breyne,P., Gielen,J., Van Montagu,M. and Depicker,A. TITLE Cloning and sequence analysis of truncated T-DNA inserts from Nicotiana tabacum JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.D.R.Gheysen, 01-JUN-1990. FEATURES from to/span description pept 198 > 200 ORF mRNA 130 > 200 mRNA (5' end +/- 2 bp) recomb 193 194 T-DNA end/plant DNA start signal 25 30 CAAT box signal 47 55 CAAT box signal 101 107 TATA box BASE COUNT 64 a 53 c 32 g 51 t ORIGIN 1 caagcctcgc tagtcaaaag tgtaccaaac aacgctttac agcaagaacg gaaatgcgcg 61 tgacgctcgc ggtgacgcca tttcgccttt tcagaaatgg ataaatagcc ttgcttccta 121 ttatatcttc ccaaattacc aatacattac actagcatct gaatttcata accaatctcg 181 atacaccaaa tcggatcatg //