Path: utzoo!utgpu!news-server.csri.toronto.edu!bonnie.concordia.ca!uunet!bionet!will From: GenBank-Updates@genbank.bio.net Newsgroups: bionet.molbio.genbank.updates Subject: Herpes simplex virus type 1 (HSV1) short unique region DNA Message-ID: Date: 28 May 91 18:42:14 GMT Sender: will@genbank.bio.net Distribution: bionet Lines: 443 Approved: lear@genbank.bio.net Checksum: 14577 26 LOCUS HS1HSV1SU 12979 bp ds-DNA VRL 28-MAY-1991 DEFINITION Herpes simplex virus type 1 (HSV1) short unique region DNA ACCESSION X02138 KEYWORDS glycoprotein; unidentified reading frame. SOURCE Herpes simplex virus type 1 DNA. ORGANISM Herpes simplex virus type 1 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 12979) AUTHORS Mcgeoch,D.J., Dolan,A., Donald,S. and Rixon,F.J. TITLE Sequence determination and genetic content of the short unique region in the genome of herpes simplex virus type 1 JOURNAL J. Mol. Biol. 181, 1-13 (1985) STANDARD full automatic COMMENT SWISS-PROT; P03170; IE12$HSV11. SWISS-PROT; P03171; VGLD$HSV11. SWISS-PROT; P04413; KR1$HSV11. SWISS-PROT; P04485; IE68$HSV11. SWISS-PROT; P04487; DNB$HSV11. SWISS-PROT; P04488; VGLE$HSV11. SWISS-PROT; P06480; US05$HSV11. SWISS-PROT; P06481; US09$HSV11. SWISS-PROT; P06484; VGLG$HSV11. SWISS-PROT; P06485; US02$HSV11. SWISS-PROT; P06486; US10$HSV11. SWISS-PROT; P06487; VGLI$HSV11. From EMBL entry HEHSV1SU; dated 19-DEC-1990. FEATURES Location/Qualifiers precursor_RNA <1..1356 /note="primary transcript Us1" misc_feature 1..12979 /note="HSV1 unique sequence Us" CDS 40..1299 /product="Umw 68 (Us1)" /codon_start=40 repeat_region 1182..11196 /note="direct repeat 6" misc_feature 1337..1342 /note="polyadenylation signal (Us1)" precursor_RNA complement(1419..2700) /note="primary transcript Us2" misc_feature complement(1432..1437) /note="polyadenylation signal (Us2)" CDS complement(1452..2324) /note="32K (Us2)" /codon_start=2324 CDS complement(2286..2321) /note="pot. signal peptide for membrane-bound translation (Us2) (aa 2-13)" /codon_start=2321 promoter 2330..2336 /note="TATA-box like sequence (Us3a)" precursor_RNA 2360..4927 /note="primary transcript Us3a" promoter 2559..2564 /note="TATA-box like sequence (Us3b)" precursor_RNA 2585..4927 /note="primary transcription Us3b" CDS 2618..4060 /note="53K (Us3) (aa 1-481)" /codon_start=2618 promoter complement(2723..2729) /note="TATA-box like sequence (Us2)" promoter 4098..4105 /note="TATA-box like sequence (Us4)" precursor_RNA 4125..4927 /note="primary transcript (Us4)" CDS 4140..4853 /note="25K (Us4) (aa 1-238)" /codon_start=4140 misc_feature 4155..4157 /note="pot. alternative start codon for 25K" CDS 4161..4214 /note="pot. signal peptide for membrane-bound translation (Us4) (aa 8-22)" /codon_start=4161 CDS 4707..4772 /note="put. transmembrane region (Us4) (aa 190-211)" /codon_start=4707 misc_feature 4904..4909 /note="polyadenylation signal (Us3)" misc_feature 4904..4909 /note="polyadenylation signal (Us4)" promoter 4991..4997 /note="TATA-box like sequence (Us5)" precursor_RNA 5026..8429 /note="pot. primary transcript (Us5)" CDS 5127..5402 /note="9K (Us5) (aa 1-92)" /codon_start=5127 CDS 5151..5195 /note="pot. signal peptide for membrane-bound translation (Us5) (aa 9-23)" /codon_start=5151 CDS 5256..5336 /note="pot. transmembrane sequence (Us5) (aa 44-92)" /codon_start=5256 misc_feature 5409..5423 /note="variable C tract" promoter 5705..5712 /note="TATA-box like sequence (Us6)" misc_feature 5729..5738 /note="multiple mRNA 5' site (Us6)" CDS 5815..6996 /product="glycoprotein gD (Us6)" /codon_start=5815 CDS 5836..5874 /note="pot. signal peptide for membrane-bound translation (Us6) (aa 8-20)" /codon_start=5836 CDS 6811..6906 /note="pot. transmembrane sequence (Us6) (aa 333-364)" /codon_start=6811 promoter 7062..7069 /note="TATA-box like sequence (Us7)" precursor_RNA 7090..8429 /note="pot. primary transcription Us7" CDS 7181..8350 /note="41K (Us7) (aa 1-390)" /codon_start=7181 CDS 7193..7249 /note="pot. signal peptide for membrane-bound translation (Us7) (aa 5-23)" /codon_start=7193 repeat_region 7800..7810 /note="direct repeat x1" repeat_region 7811..7820 /note="direct repeat y1" repeat_region 7821..7831 /note="direct repeat x2" repeat_region 7832..7841 /note="direct repeat y2" repeat_region 7842..7852 /note="direct repeat z1" repeat_region 7853..7862 /note="direct repeat y3" repeat_region 7863..7873 /note="direct repeat z2" CDS 7978..8068 /note="pot. transmembrane sequence (Us7) (aa 267-296)" /codon_start=7978 CDS 8069..8069 /note="pot. anchor sequence (Us7) (aa 297-308)" /codon_start=8069 misc_feature 8409..8414 /note="polyadenylation signal (Us5)" misc_feature 8409..8414 /note="polyadenylation signal (Us6)" misc_feature 8409..8414 /note="polyadenylation signal (Us7)" misc_feature 8429..8429 /note="mRNA 3' site (Us6)" promoter 8535..8541 /note="TATA-box like sequence (Us8)" precursor_RNA 8567..11088 /note="primary transcript (Us8)" CDS 8639..10288 /product="glycoprotein gE (Us8)" /codon_start=8639 CDS 8648..8707 /note="pot. signal peptide for membrane-bound translation (Us8) (aa 4-23)" /codon_start=8648 CDS 9896..9970 /note="pot. transmembrane sequence (Us8) (aa 420-444)" /codon_start=9896 promoter 10614..10619 /note="TATA-box like sequence (Us9)" precursor_RNA 10641..11088 /note="primary transcript (Us9)" CDS 10708..10977 /note="10K (Us9) (aa 1-90)" /codon_start=10708 misc_feature 11063..11068 /note="polyadenylation signal (Us8 Us9)" repeat_region 11107..11121 /note="direct repeat 1" repeat_region 11122..11136 /note="direct repeat 2" repeat_region 11137..11151 /note="direct repeat 3" repeat_region 11152..11166 /note="direct repeat 4" repeat_region 11167..11181 /note="direct repeat 5" repeat_region 11197..11211 /note="direct repeat 7" repeat_region 11212..11226 /note="direct repeat 8" repeat_region 11227..11241 /note="direct repeat 9" repeat_region 11242..11256 /note="direct repeat 10" repeat_region 11257..11259 /note="imperfect direct repeat" precursor_RNA complement(11514..12561) /note="primary transcript (Us10)" precursor_RNA complement(11514..12855) /note="18K (Us11) (aa 1-134)" precursor_RNA complement(11514..>12979) /note="primary transcript (Us12)" misc_feature 11530..11535 /note="polyadenylation signal (Us10)" misc_feature 11530..11535 /note="polyadenylation signal (Us11)" misc_feature 11530..11535 /note="polyadenylation signal (Us12)" CDS complement(11556..12490) /note="34K (Us10) (aa 1-284)" /codon_start=12490 CDS complement(12159..12641) /note="TATA-box like sequence (Us11)" /codon_start=12641 repeat_region 12204..12220 /note="direct repeat 1" repeat_region 12223..12238 /note="direct repeat 2" repeat_region 12241..12256 /note="direct repeat 3" repeat_region 12259..12263 /note="imperfect direct repeat" promoter complement(12582..12588) /note="TATA-box like sequence (Us10)" CDS complement(12709..12972) /note="Umw12 (Us12) (aa 1-85)" /codon_start=12972 promoter complement(12879..12886) /note="primary transcript (Us11)" BASE COUNT 2286 a 4271 c 4078 g 2344 t ORIGIN 1 cggggggaag ccactgtggt cctccgggac gttttctgga tggccgacat ttccccaggc 61 gcttttgcgc cttgtgtaaa agcgcggcgt cccgctctcc gatccccgcc cctgggcacg 121 cgcaagcgca agcgcccttc ccgccccctc tcatcggagt ctgaggtaga atccgataca 181 gccttggagt ctgaggtcga atccgagaca gcatcggatt cgaccgagtc tggggaccag 241 gatgaagccc cccgcatcgg tggccgtagg gccccccgga ggcttggggg gcggtttttt 301 ctggacatgt cggcggaatc caccacgggg acggaaacgg atgcgtcggt gtcggacgac 361 cccgacgaca cgtccgactg gtcttatgac gacattcccc cacgacccaa gcgggcccgg 421 gtaaacctgc ggctcacgag ctctcccgat cggcgggatg gggttatttt tcctaagatg 481 gggcgggtcc ggtctacccg ggaaacgcag ccccgggccc ccaccccgtc ggccccaagc 541 ccaaatgcaa tgctacggcg ctcggtgcgc caggcccaga ggcggagcag cgcacgatgg 601 acccccgacc tgggctacat gcgccagtgt atcaatcagc tgtttcgggt cctgcgggtc 661 gcccgggacc cccacggcag tgccaaccgc ctgcgccacc tgatacgcga ctgttacctg 721 atgggatact gccgagcccg tctggccccg cgcacgtggt gccgtttgct gcaggtgtcc 781 ggcggaacct ggggcatgca cctgcgcaac accatacggg aggtggaggc tcgattcgac 841 gccaccgcgg aacccgtgtg caagcttcct tgtttggaga ccagacggta cggcccggag 901 tgtgatctta gtaatctcga gattcatctc agcgcgacaa gcgatgatga aatctccgat 961 gccaccgatc tggaggccgc cggttcggac cacacgctcg cgtcccagtc cgacacggag 1021 gatgccccct cccccgttac gctggaaacc ccagaacccc gcgggtccct cgctgtgcgt 1081 ctggaggatg agtttgggga gtttgactgg accccccagg agggctccca gccctggctg 1141 tctgcggtcg tggccgatac cagctccgtg gaacgcccgg gcccatccga ttctggggcg 1201 ggtcgcgccg cagaagaccg caagtgtctg gacggctgcc ggaaaatgcg cttctccacc 1261 gcctgcccct atccgtgcag cgacacgttt ctccggccgt gagtccggtc gccccgaccc 1321 ccttgtatgt ccccaaaata aaagaccaaa atcaaagcgt ttgtcccagc gtcttaatgg 1381 cgggaagggc ggagagaaac agaccacgcg gacatggggg gtgtttgggg gtttattggc 1441 accgggggct aaagggtggt aaccggatag cagatgtgag gaagtcgggg ccgttcgccg 1501 cgaacggcga tcagagggtc agtttcttgc ggaccacggc ccggcgatgt gggttgctcg 1561 tctgggacct cgggcatgcc catacacgca caacacggac gccgcaccgg atgggacgtc 1621 gtaagggggc ctggggtagc tgggtggggt ttgtgcagag caatcaggga ccgcagccag 1681 cgcatacaat cgcgctcccg tccgtttgtc ccgggcagta ccacgccgta ctggtattcg 1741 taccggctga gcagggtctc cagggggtgg ttgggggccg cggggaacgg ggtccacgcc 1801 acggtccact cgggcaaaaa ccgagtcggc acggcccacg gttctcccac ccacgcgtct 1861 ggggtcttga tggcgataaa tcttaccccg agccggattt tttgggcgta ttcgagaaac 1921 ggcacacaca gatccgccgc gcctaccacc cacaagtggt agaggcgagg ggggctgggt 1981 tggtctcggt gcagcagtcg gaagcacgcc acggcgtcca cgacctcggt gctctccaag 2041 gggctgtcct ccgcaaacag gcccgtggtg gtgtttgggg ggcagcgaca ggacctagtg 2101 cgcacgatcg ggcgggtggg tttgggtaag tccatcagcg gctcggccaa ccgtcgaagg 2161 ttggccggac gaacgacgac cggggtaccc aggggttctg atgccaaaat gcggcactgc 2221 ctaagcagga agctccacag ggccgggctt gcgtcgacgg aagtccgggg cagggcgttg 2281 ttctggtcaa ggagggtcat tacgttgacg acaacaacgc ccatgttggt atattacagg 2341 cccgtgtccg atttggggca cttgcagatt tgtaaggcca cgcacggcgg ggagacaggc 2401 cgacgcgggg gctgctctaa aaatttaagg gccctacggt ccacagaccc gccttcccgg 2461 gggggccctt ggagcgaccg gcagcggagg cgtccggggg aggggagggt gatttacggg 2521 ggggtaggtc agggggtggg tcgtcaaact gccgctcctt aaaaccccgg ggcccgtcgt 2581 tcggggtgct cgttggttgg cactcacggt gcggcgaatg gcctgtcgta agttttgtcg 2641 cgtttacggg ggacagggca ggaggaagga ggaggccgtc ccgccggaga caaagccgtc 2701 ccgggtgttt cctcatggcc ccttttatac cccagccgag gacgcgtgcc tggactcccc 2761 gcccccggag acccccaaac cttcccacac cacaccaccc agcgaggccg agcgcctgtg 2821 tcatctgcag gagatccttg cccagatgta cggaaaccag gactacccca tagaggacga 2881 ccccagcgcg gatgccgcgg acgatgtcga cgaggacgcc ccggacgacg tggcctatcc 2941 ggaggaatac gcagaggagc tttttctgcc cggggacgcg accggtcccc ttatcggggc 3001 caacgaccac atccctcccc cgtgtggcgc atctcccccc ggtatacgac gacgcagccg 3061 ggatgagatt ggggccacgg gatttaccgc ggaagagctg gacgccatgg acagggaggc 3121 ggctcgagcc atcagccgcg gcggcaagcc cccctcgacc atggccaagc tggtgactgg 3181 catgggcttt acgatccacg gagcgctcac cccaggatcg gaggggtgtg tctttgacag 3241 cagccatcca gattaccccc aacgggtaat cgtgaaggcg gggtggtaca cgagcacgag 3301 ccacgaggcg cgactgctga ggcgactgga ccacccggcg atcctgcccc tcctggacct 3361 gcatgtcgtc tccggggtca cgtgtctggt cctccccaag taccaggccg acctgtatac 3421 ctatctgagt aggcgcctga acccactggg acgcccgcag atcgcagcgg tctcccggca 3481 gctcctaagc gccgttgact acattcaccg ccagggcatt atccaccgcg acattaagac 3541 cgaaaatatt tttattaaca cccccgagga catttgcctg ggggactttg gcgccgcgtg 3601 cttcgtgcag ggttcccgat caagcccctt cccctacgga atcgccggaa ccatcgacac 3661 caacgccccc gaggtcctgg ccggggatcc gtataccacg accgtcgaca tttggagcgc 3721 cggtctggtg atcttcgaga ctgccgtcca caacgcgtcc ttgttctcgg ccccccgcgg 3781 ccccaaaagg ggcccgtgcg acagtcagat cacccgcatc atccgacagg cccaggtcca 3841 cgttgacgag ttttccccgc atccagaatc gcgcctcacc tcgcgctacc gctcccgcgc 3901 ggccgggaac aatcgcccgc cgtacacccg accggcctgg acccgctact acaagatgga 3961 catagacgtc gaatatctgg tttgcaaagc cctcaccttc gacggcgcgc ttcgccccag 4021 cgccgcagag ctgctttgtt tgccgctgtt tcaacagaaa tgaccgcccc ctgggggcgg 4081 tgctgtttgc gggttggcac aaaaagaccc cgatccgcgt ctgtggtgtt tttggcatca 4141 tgtcgcaggg cgccatgcgt gccgttgttc ccattatccc attccttttg gttcttgtcg 4201 gtgtatcggg ggttcccacc aacgtctcct ccaccaccca accccaactc cagaccaccg 4261 gtcgtccctc gcatgaagcc cccaacatga cccagaccgg caccaccgac tctcccaccg 4321 ccatcagcct taccacgccc gaccacacac cccccatgcc aagtattgga ctggaggagg 4381 aggaagagga ggagggggcc ggggacggcg aacatcttga ggggggagat gggacccgtg 4441 acaccctacc ccagtccccg ggcccagcct tcccgttggc tgaggacgtc gagaaggaca 4501 aacccaaccg tcccgtagtc ccatcccccg atcccaacaa ctcccccgcg cgccccgaga 4561 ccagtcgccc gaagacaccc cccaccatta tcgggccgct ggcaactcgc cccacgaccc 4621 gactcacctc aaagggacga cccttggttc cgacgcctca acataccccg ctgttctcgt 4681 tcctcactgc ctcccccgcc ctggacaccc tcttcgtcgt cagcaccgtc atccacacct 4741 tatcgttttt gtgtattggt gcgatggcga cacacctgtg tggcggttgg tccagacgcg 4801 ggcgacgcac acaccctagc gtgcgttacg tgtgcctgcc gtccgaacgc gggtagggta 4861 tggggcgggg gatggggaga gcccacatgc ggaaagcaag aacaataaag gcggtggtat 4921 ctagttgata tgcatctctg ggtgtttttg gggtgtggcg gacgcggggc ggtcattgga 4981 cggggtgcag ttaaatacat gcccgggacc catgaagcat gcgcgacttc cgggcctcag 5041 aacccacccg aaacggccaa cggacgtctg agccaggcct ggctatccgg agaaacagca 5101 cacgacttgg cgttctgtgt gtcgcgatgt ctctgcgcgc agtctggcat ctggggcttt 5161 tgggaagcct cgtgggggct gttcttgccg ccacccatcg gggacctgcg gccaacacaa 5221 cggacccctt aacgcacgcc ccagtgtccc ctcaccccag ccccctgggg ggctttgccg 5281 tccccctcgt agtcggtggg ctgtgcgccg tagtcctggg ggcggcatgt ctgcttgagc 5341 tcctgcgtcg tacgtgccgc gggtgggggc gttaccatcc ctacatggac ccagttgtcg 5401 tataatttcc cccccccccc cccttctccg cgtgggtgat gtcgggtcca aactcccgac 5461 accaccagct ggcatggtat aaatcaccgg tgcgcccccc aaaccatgtc cggcaggggg 5521 atgggggggc aatgcggagg gcacccaaca acaccgggct aaccaggaaa tccgtggccc 5581 cggcccccaa taaagatcgc ggtagcccgg ccgtgtgaca ctatcgtcca taccgaccac 5641 accgacgaat cccccaaggg ggaggggcca ttttacgagg aggaggggta taacaaagtc 5701 tgtctttaaa aagcaggggt tagggagttg ttcggtcata agcttcagcg cgaacgacca 5761 actaccccga tcatcagtta tccttaaggt ctcttttgtg tggtgcgttc cggtatgggg 5821 ggggctgccg ccaggttggg ggccgtgatt ttgtttgtcg tcatagtggg cctccatggg 5881 gtccgcagca aatatgcctt ggtggatgcc tctctcaaga tggccgaccc caatcgcttt 5941 cgcggcaaag accttccggt cctggaccag ctgaccgacc ctccgggggt ccggcgcgtg 6001 taccacatcc aggcgggcct accggacccg ttccagcccc ccagcctccc gatcacggtt 6061 tactacgccg tgttggagcg cgcctgccgc agcgtgctcc taaacgcacc gtcggaggcc 6121 ccccagattg tccgcggggc ctccgaagac gtccggaaac aaccctacaa cctgaccatc 6181 gcttggtttc ggatgggagg caactgtgct atccccatca cggtcatgga gtacaccgaa 6241 tgctcctaca acaagtctct gggggcctgt cccatccgaa cgcagccccg ctggaactac 6301 tatgacagct tcagcgccgt cagcgaggat aacctggggt tcctgatgca cgcccccgcg 6361 tttgagaccg ccggcacgta cctgcggctc gtgaagataa acgactggac ggagattaca 6421 cagtttatcc tggagcaccg agccaagggc tcctgtaagt acgccctccc gctgcgcatc 6481 cccccgtcag cctgcctctc cccccaggcc taccagcagg gggtgacggt ggacagcatc 6541 gggatgctgc cccgcttcat ccccgagaac cagcgcaccg tcgccgtata cagcttgaag 6601 atcgccgggt ggcacgggcc caaggcccca tacacgagca ccctgctgcc cccggagctg 6661 tccgagaccc ccaacgccac gcagccagaa ctcgccccgg aagaccccga ggattcggcc 6721 ctcttggagg accccgtggg gacggtggcg ccgcaaatcc caccaaactg gcacataccg 6781 tcgatccagg acgccgcgac gccttaccat cccccggcca ccccgaacaa catgggcctg 6841 atcgccggcg cggtgggcgg cagtctcctg gcagccctgg tcatttgcgg aattgtgtac 6901 tggatgcgcc gccacactca aaaagcccca aagcgcatac gcctccccca catccgggaa 6961 gacgaccagc cgtcctcgca ccagcccttg ttttactaga taccccccct taatgggtgc 7021 gggggggtca ggtctgcggg gttgggatgg gaccttaact ccatataaag cgagtctgga 7081 aggggggaaa ggtggacagt cgataagtcg gtagcggggg acgcgcacct gttccgcctg 7141 tcgcacccac agcttttttt gcgaaccgtc ccgttccggg atgccgtgcc gcccgttgca 7201 gggcctggtg ctcgtgggcc tctgggtctg tgccaccagc ctggttgtcc gtggccccac 7261 ggtcagtctg gtatcaaact catttgtgga cgccggggcc ttggggcccg acggcgtagt 7321 ggaggaagac ctgcttattc tcggggagct tcgctttgtg ggggaccagg tcccccacac 7381 cacctactac gatgggggcg tagagctgtg gcactacccc atgggacaca aatgcccacg 7441 ggtcgtgcat gtcgtcacgg tgaccgcgtg cccacgtcgc cccgccgtgg cattcgccct 7501 gtgtcgcgcg accgacagca ctcacagccc cgcatatccc accctggagc tcaatctggc 7561 ccaacagccg cttttgcggg tccagagggc aacgcgggac tatgccgggg tgtacgtgtt 7621 acgcgtatgg gtcggtgacg cgccaaacgc cagcctgttt gtcctgggga tggccatagc 7681 cgccgaaggg actctggcgt acaacggctc ggcctatggc tcctgcgacc cgaaactgct 7741 tccgtcttcg gccccgcgtc tggccccggc gagcgtatac caacccgccc ctaaccaggc 7801 ctccaccccc tcgaccacca cctccacccc ctcgaccacc atccccgctc cctcgaccac 7861 catccccgct ccccaagcat cgaccacgcc cttccccacg ggagatccaa aaccacaacc 7921 tcccggggtc aaccacgaac ccccatctaa tgccacgcga gcgacccgcg actcgcgata 7981 cgcgctaacg gtgacccaga taatccagat agccatcccc gcgtccatca tagccctggt 8041 gtttctgggg agctgtattt gctttataca cagatgtcaa cgccgctacc gacgctcccg 8101 tcgcccgatt tacagccccc agatgcccac gggcatctca tgcgcggtga acgaagcggc 8161 catggcccgc ctcggagccg agctcaaatc gcatccgagc acccccccca aatcccggcg 8221 ccggtcgtca cgcacgccaa tgccctccct gacggccatc gccgaagagt cggagcccgc 8281 tggggcggct gggcttccga cgccccccgt ggaccccacg acacccaccc caacgcctcc 8341 cctgttggta taggtccacg gccactggcc gggagcacca cataaccgac cgcagtccct 8401 gagttgggaa taaaccggta ttatttacct atatccgtgt atgtcgattt ctttcccccc 8461 ctccccggaa accaaagaag gaagcaaaga atggatggga ggagttcagg aagccgggga 8521 gagggcccgc ggcgcattta aggcgttgtt gtgttgactt tgcctcttct ggcgggttgg 8581 tgcggtgctg tttgttgggc tcccatttta cccgaagatc ggctgctatc cccgggacat 8641 ggatcgcggg gcggtggtgg ggtttcttct cggtgtttgt gttgtatcgt gcttggcggg 8701 aacgcccaaa acgtcctgga gacgggtgag tgtcggcgag gacgtttcgt tgcttccagc 8761 tccggggcct acggggcgcg gcccgaccca gaaactacta tgggccgtgg aacccctgga 8821 tgggtgcggc cccttacacc cgtcgtgggt ctcgctgatg ccccccaagc aggtgcccga 8881 gacggtcgtg gatgcggcgt gcatgcgcgc tccggtcccg ctggcgatgg cgtacgcccc 8941 cccggcccca tctgcgaccg ggggtctacg aacggacttc gtgtggcagg agcgcgcggc 9001 cgtggttaac cggagtctgg ttattcacgg ggtccgagag acggacagcg gcctgtatac 9061 cctgtccgtg ggcgacataa aggacccggc tcgccaagtg gcctcggtgg tcctggtggt 9121 gcaaccggcc ccagttccga ccccaccccc gaccccagcc gattacgacg aggatgacaa 9181 tgacgagggc gaggacgaaa gtctcgccgg cactcccgcc agcgggaccc cccggctccc 9241 gcctcccccc gcccccccga ggtcttggcc cagcgccccc gaagtctcac atgtgcgtgg 9301 ggtgaccgtg cgtatggaga ctccggaagc tatcctgttt tcccccgggg agacgttcag 9361 cacgaacgtc tccatccatg ccatcgccca cgacgaccag acctactcca tggacgtcgt 9421 ctggttgagg ttcgacgtgc cgacctcgtg tgccgagatg cgaatatacg aatcgtgtct 9481 gtatcacccg cagctcccag aatgtctgtc cccggccgac gcgccgtgcg ccgcgagtac 9541 gtggacgtct cgcctggccg tccgcagcta cgcggggtgt tccagaacaa accccccacc 9601 gcgctgttcg gccgaggctc acatggagcc cgtcccgggg ctggcgtggc aggcggcctc 9661 cgtcaatctg gagttccggg acgcgtcccc acaacactcc ggcctgtatc tgtgtgtggt 9721 gtacgtcaac gaccatattc acgcctgggg ccacattacc atcagcaccg cggcgcagta 9781 ccggaacgcg gtggtggaac agcccctccc acagcgcggc gcggatttgg ccgagcccac 9841 ccacccgcac gtcggggccc ctccccacgc gcccccaacc cacggcgccc tgcggttagg 9901 ggcggtgatg ggggccgccc tgctgctgtc tgcactgggg ttgtcggtgt gggcgtgtat 9961 gacctgttgg cgcaggcgtg cctggcgggc ggttaaaagc agggcctcgg gtaaggggcc 10021 cacgtacatt cgcgtggccg acagcgagct gtacgcggac tggagctcgg acagcgaggg 10081 agaacgcgac caggtcccgt ggctggcccc cccggagaga cccgactctc cctccaccaa 10141 tggatccggc tttgagatct tatcaccaac ggctccgtct gtataccccc gtagcgatgg 10201 gcatcaatct cgccgccagc tcacaacctt tggatccgga aggcccgatc gccgttactc 10261 ccaggcctcc gattcgtccg tcttctggta aggcgcccca tcccgaggcc ccacgtcggt 10321 cgccgaactg ggcgaccgcc ggcgaggtgg acgtcggaga cgagctaatc gcgatttccg 10381 acgaacgcgg acccccccga catgaccgcc cgcccctcgc cacgtcgacc gcgccctcgc 10441 cacacccgcg acccccgggc tacacggccg ttgtctcccc gatggccctc caggctgtcg 10501 acgccccctc cctgtttgtc gcctggctgg ccgctcggtg gctccggggg gcttccggcc 10561 tgggggcctc ctgtgtggga ttgcgtggta tgtgacgtca attgcccgag gcgcataaag 10621 ggccggtggt ccgcctagcc gcagcaaatt aaaaatcgtg agtcacagcg accgcaactt 10681 cccacccgga gctttcttcc ggcctcgatg acgtcccggc tctccgatcc caactcctca 10741 gcgcgatccg acatgtccgt gccgctttat cccacggcct cgccagtttc ggtcgaagcc 10801 tactactcgg aaagcgaaga cgaggcggcc aacgacttcc tcgtacgcat gggccgccaa 10861 cagtcggtat taaggcgtcg acgcagacgc acccgctgcg tcggcatggt gatcgcctgt 10921 ctcctcgtgg ccgttctgtc gggcggattt ggggcgctcc tgatgtggct gctccgctaa 10981 aagaccgcat cgacacgcgc gtccttcttg tcgtctctct tcccccccat caccccgcaa 11041 tttgcaccca gcctttaact acattaaatt gggttcgatt ggcaatgttg tctcccggtt 11101 gatttttggg tgggtgggga gtgggtgggt ggggagtggg tgggtgggga gtgggtgggt 11161 ggggagtggg tgggtgggga gtgggtgggt ggggagtggg tgggtgggga gtgggtgggt 11221 ggggagtggg tgggtgggga gtgggtgggt ggggagtggc aaggaagaaa caagcccgac 11281 caccagacag aaaatgtaac catacccaaa ccgactctgg gggctgtttg tggggtcgga 11341 accataggat gaacaaacca ccccgtacca cccgcaccca agggtgcggt ggctcatcgg 11401 catctgtccg gtatgggttg ttccccaccc actcgcgttc ggacgtctta gaatcatggc 11461 ggttttctat gccgacatcg gttttctccc ccgcaataag acacgatgcg ataaaatctg 11521 tttgtaaaat ttattaaggg tacaaattgc cctagcacag gggtggggtt agggccgggt 11581 ccccacaccc aaacgcacca aacagatgca ggcagtgggt cgagtacagc cccgcgtacg 11641 aacacgtcga tgcgtgtgtc agacagcacc agaaagcaca ggccatcaac aggtcgtgca 11701 tgtgtcggtg ggtttggacg cggggggcca tggtggtgat aaagttaatg gccgccgtcc 11761 gccagggcca caggggcgac gtctcttggt tggcccggag ccactgggtg tggaccagcc 11821 gcgcgtggcg gcccaacatg gcccctgtag ccgggggcgg gggatcgcgc acgtttgcag 11881 cgcacatgcg agacacctcg accacggttc gaaagaaggc ccggtggtcc gcgggcaaca 11941 tcaccaggtg cgcaagcgcc cgggcgtcca gagggtagag ccctgagtca tccgaggttg 12001 gctcatcgcc cgggtcttgc cgcaagtgcg tgtgggttgg gcttccggtg ggcgggacgc 12061 gaaccgcggt gtggatcccg acgcgggccc gagcgtatgc tccatgttgt ggggagaagg 12121 ggtctgggct cgccaggggg gcatacttgc ccgggctata cagacccgcg agccgtacgt 12181 ggttcgcggg gggtgcgtgg ggtccggggc tcccggggag accggggctc ccggggagac 12241 cggggctccc tgggagaccg gggttgtcgt ggatccctgg ggtcacgcgg taccctgggg 12301 tctctgggag ctcgcggtac tctgggttcc ctaggttctc ggggtggtcg cggaacccgg 12361 ggctcccggg gaacacgcgg tgtcctgggg attgttggcg gtcggacggc ttcagatggc 12421 ttcgagatcg tagtgtccgc accgactcgt agtagacccg aatctccaca ttgccccgcc 12481 gcttgatcat tatcaccccg ttgcgggggt ccggagatca tgcgcgggtg tcctcgaggt 12541 gcgtgaacac ctctggggtg catgccggcg gacggcacgc cttttaagta aacatctggg 12601 tcgcccggcc caactggggc cgggggttgg gtctggctca tctcgagagc cacggggggg 12661 aaccaccctc cgcccagaga ctcgggtgat ggtcgtaccc gggactcaac gggttaccgg 12721 attacgggga ctgtcggtca cggtcccgcc ggttcttcga tgtgccacac ccaaggatgc 12781 gttgggggcg atttcgggca gcagcccggg agagcgcagc aggggacgct ccgggtcgtg 12841 cacggcggtt ctggccgcct cccggtcctc acgccccctt ttattgatct catcgcgtac 12901 gtcggcgtac gtcctgggcc caacccgcat ggtgtccagg aaggtgtccg ccatttccag 12961 ggcccacgac atgctcccc //