Path: utzoo!attcan!uunet!ns-mx!iowasp!maverick.ksu.ksu.edu!zaphod.mps.ohio-state.edu!usc!apple!bionet!root
From: GenBank-Updates@genbank.bio.net
Newsgroups: bionet.molbio.genbank.updates
Subject: Database Update
Message-ID:
Date: 29 Jun 90 12:00:13 GMT
Sender: root@genbank.BIO.NET
Distribution: bionet
Lines: 4062
Approved: lear@genbank.bio.net
Checksum: 29907 246

LOCUS       MUSTKBLK     2094 bp ss-mRNA             ROD       29-JUN-1990
DEFINITION  Mouse B lymphocyte kinase (blk) mRNA, complete cds.
ACCESSION   M30903
KEYWORDS    B lymphocyte kinase; tyrosine kinase.
SOURCE      Mouse (strain C57B410, haplotype b) B-lymphoid hybridoma cell
            line LB27.4 + 71.67, cDNA to mRNA, clones 102, 103, 201, 205,
            215.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (sites)
  AUTHORS   Dymecki,S.M., Niederhuber,J.E. and Desiderio,S.V.
  TITLE     Specific expression of a tyrosine kinase gene, blk, in B
            lymphoid cells
  JOURNAL   Science 247, 332-336 (1990)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 2094)
  AUTHORS   Dymecki,S.M., Niederhuber,J.E. and Desiderio,S.V.
  JOURNAL   Unpublished (1990) Howard Hughes Med. Inst., Baltimore MD 21205
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [2] kindly
            submitted by S.V.Desiderio, 21-DEC-1989.
FEATURES from to/span description pept 350 1849 B lymphocyte kinase BASE COUNT 485 a 542 c 607 g 460 t ORIGIN 1 cagcctggct gaaggtcagg tgcccatgaa acttctcaga aacacctccg gggctgagga 61 gccagaagtc tccagaagac ttgactggag ggcagagctc ccactgagct gcttcttagg 121 gctgttcatg gcggtctgat cacaacaggg ggcacacaca cctctgtgtg ctgttggcag 181 gaagccacaa gccactgaag ctgactgaga tgagaagaat ccatctgggg atgggattgg 241 cttttcttaa ctgggtccag agcttctggt tgccgctaga aagttcttca accacacaga 301 ccctgggatc tgcacagcta tgttgagaag ctactgaagc cctgcaagga tggggctgct 361 gagcagcaag aggcaggtca gtgagaaggg caagggctgg agccccgtga agatccgcac 421 ccaggacaag gctcccccac ccctgccacc cctggttgtc ttcaaccacc ttgccccacc 481 atctcctaac caggacccag atgaagagga gcgttttgtg gtggctctgt ttgactatgc 541 cgctgtgaat gacagggacc ttcaggtgct gaagggtgag aagctccagg tcttgaggag 601 cactggagac tggtggttgg ccaggtcact cgtcacagga agagaaggtt atgtgcccag 661 caactttgtg gccccagtag agactctgga agtagaaaaa tggttcttca ggaccatcag 721 ccggaaggat gctgagaggc agttgctggc tccgatgaac aaggccggct cctttctcat 781 cagagagagt gagagcaata aaggtgcctt ttccctgtcc gtgaaagata tcaccaccca 841 gggggaggtg gtcaagcact ataagatccg atcactggac aatggaggct attacatctc 901 cccccggatc acctttccca ccctccaggc cctggtgcag cactattcaa agaaagggga 961 tggtttgtgt cagaagttga ctctgccctg tgtgaacctg gccccgaaga acctttgggc 1021 ccaagatgaa tgggaaatcc ccaggcagtc tctcaagttg gtccggaaac ttgggtctgg 1081 gcagtttggc gaagtctgga tgggttatta caaaaataac atgaaggtgg ccatcaagac 1141 cctgaaggag ggaaccatgt caccggaagc tttcctgggc gaggccaacg tgatgaaaac 1201 cctgcagcat gagaggctgg ttcgtctcta cgctgtggtc accagagagc ccatttacat 1261 cgtcactgaa tacatggcca gaggatgctt gctggatttt ctgaagaccg atgaaggtag 1321 caggttgtcc cttccaaggc tgattgacat gtcagcccag gttgcagagg ggatggctta 1381 catagagcgc atgaattcca tccaccgtga cctgcgggca gccaacatcc tggtgtctga 1441 gacgttgtgc tgcaaaatcg ctgacttcgg cttggccagg atcattgaca gtgaatacac 1501 tgcccaagag ggggccaagt tccccatcaa gtggaccgcc ccggaggcca tccacttcgg 1561 ggtgtttacc atcaaggctg atgtgtggtc cttcggagtc ttgctgatgg 
tgattgtcac 1621 ctatgggcgc gttccctacc caggaatgag caaccctgag gtcatccgta gcctggagca 1681 cggctaccga atgccatgcc cggagacatg tccaccggag ttgtacaatg atatcatcac 1741 tgagtgctgg cggggccggc cagaggagcg gcctaccttt gagttcctgc agtcggtgtt 1801 ggaggacttc tacacagcca cggagggcca atatgagctg cagccctagc aaccaatggt 1861 ctcttcaggg ctcccagcct gagcctcctt ctggactctg actttgggtt aatatgttca 1921 cacttagctc ttcttccagg taggttgcag aatcagggaa gaatggtgtg cctgcttctc 1981 aagtgaggga gccaggagac aaagatggct tcatggtgag gcatgagctc tcaaggcaca 2041 acctcggtcc acctttcact ggctgtgtgc ctttggccaa gttactttgc tgcg // LOCUS CHKSUH2AC 75 bp ds-DNA SYN 29-JUN-1990 DEFINITION Chicken and sea urchin chimeric TATA-H2A-c gene. ACCESSION J02525 KEYWORDS conalbumin; histone. SOURCE Chicken and sea urchin DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 75) AUTHORS Grosschedl,R., Wasylyk,B., Chambon,P. and Birnstiel,M.L. TITLE Point mutation in the TATA box curtails expression of sea urchin H2A histone gene in vivo JOURNAL Nature 294, 178-180 (1981) STANDARD full staff_review COMMENT The TATA-H2A-c gene was formed by replacing the "TATA box" of sea urchin H2A with that of chicken conalbumin. A single base change from 't' to 'g' in the third position of the "TATA box" reduces the rate of transcription by 80%. FEATURES from to/span description mRNA 68 > 75 H2A-c mRNA recomb 23 24 sea urchin DNA/BamHI linker recomb 25 26 BamHI linker/chicken DNA mut 41 41 t in wild type; g in "TAGA" mutation recomb 60 61 chicken DNA/sea urchin DNA BASE COUNT 20 a 21 c 21 g 13 t ORIGIN 68 bp upstream of H2A-c mRNA start 1 tcgctgaccg gtctctccga tccggagggc tgctcctcta taaaagggga agaaagagga 61 accattcaag ccagc // LOCUS CPVW8 70 bp ds-DNA VRL 29-JUN-1990 DEFINITION CPV-BR white pock mutant recombination junction, variant W8. ACCESSION M11471 KEYWORDS recombination joint. SOURCE Cow pox virus (strain Brighton red; CPV-BR) DNA, passed in chorioallantoic membrane (CAM) of 11-day old chicken embryo. 
  ORGANISM  Cowpox virus
            Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus.
REFERENCE   1  (bases 1 to 70)
  AUTHORS   Pickup,D.J., Ink,B.S., Parsons,B.L., Hu,W. and Joklik,W.K.
  TITLE     Spontaneous deletions and duplications of sequences in the
            genome of cowpox virus
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 81, 6817-6821 (1984)
  STANDARD  full staff_review
COMMENT     In the white pock variants of cowpox virus, 32 to 38 kb of the
            3' end (region 2) were replaced by inverted copies, 21 to 50 kb
            long, of the 5' end (region 1) of the genome. These variants
            possess inverted terminal repeats. The limits of the inner ends
            of the deletions appear to be defined by the location of the
            nearest essential gene on one end and the location of the gene
            that encodes pock-redness on the other. The sequences adjacent
            to the recombination/transfer sites show no homology to each
            other or any unusual structural feature. At the recombination
            site of variant W8 one base pair "t" is shared by the two
            combining sequences.
FEATURES    from  to/span    description
    recomb        36     37    region 1 end/region 2 start (crossover event)
BASE COUNT       19 a     11 c     14 g     26 t
ORIGIN      Undetermined number of bp downstream of EcoRI site.
        1 ctttttattg agtggtggta gttacggata tctaatttat ccatccagta tgggtataca
       61 acacgaattc
//
LOCUS       DRORGM108    1059 bp ds-DNA             INV       29-JUN-1990
DEFINITION  D.melanogaster 28S rRNA gene with 1.006 kb type I insertion.
ACCESSION   K03141
KEYWORDS    insertion sequence.
SOURCE      D.melanogaster ribosomal DNA, clone MB27.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 1059)
  AUTHORS   Roiha,H. and Glover,D.M.
  TITLE     Duplicated rDNA sequences of variable lengths flanking the
            short type I insertions in the rDNA of Drosophila melanogaster
  JOURNAL   Nucleic Acids Res.
9, 5521-5532 (1981) STANDARD simple staff_review COMMENT The rRNA genes of D.melanogaster can contain two types of non-homologous insertion sequences, which are unlike intervening sequences found in other genes. These insertion sequences appear to have a repressive effect on the transcription of rRNA genes. FEATURES from to/span description rpt 24 30 rRNA duplication rpt 1037 1043 28S rRNA duplication recomb 30 31 28S rRNA end/insertion sequence start recomb 1036 1037 insertion sequence end/28S rRNA start BASE COUNT 237 a 244 c 311 g 265 t 2 others ORIGIN 164 bp upstream of BstI site. 1 gaatggatta acgagattcc tactgtccct gtcttagctg ggagcagagg aagactcgcc 61 tagaggagtg tttaatccag agttggcaga acagatggga cgatgacagc gaaccaggac 121 gggtgacgca taggtttatc ccatacgtta gtcttgccta tcgggatcca agttttggat 181 tctcgatgag gacgtctttc ctgcttacag ggcacgggtc gttcaatgca tttttgcacg 241 ggagagccct cagcgatacc actgcttgcg catgtggcga tccatatgag gactggatgc 301 atatcttgtg cgcttgcccc ctatatgcag atctgcggga cctagatgga cttggagtgc 361 agcgccttgg cgaaactgga tcttcgaggg aatcctcgat gatcaagaga agactcaacg 421 gctggcaatg tttgcggaag aagtgttcct gaggaggagg ggcgtttagc tcagtcatct 481 ctcgcgtgtg gtttagcggg cgagaatact accacagttc cgctgttgct tgtcgtaaga 541 gacgactaat acagcgatag gattcctcta accctgcttg tcggagcaaa agggggaggc 601 ccaccgagcc tcttttcggt accacgggtt gagcagctat ccaagactgc tcattgaggt 661 aggccccctg gtgggagtat cgtggtggct gtggttggta cccatatcgc gggtagagcc 721 ttcatgctcg acgtttgagt tacggtgcta gttgcgcaaa actcgggtgc tgtgacccag 781 agatcagtag agattttagg tagatctcgc tcctcagcaa gggggagtgc ttgcccggca 841 agcangntac tcgaattgct accggggtgg tcgctatgta catagctata gcttctagtc 901 cgggacgctt gtctggcgta tccagacaca tgcaccatat gctcacttgt gggtgtatag 961 ggtgccgtgg ttgtaatccc ttcagtgtgg aacacgccac gtaaaataag ttcggaggga 1021 tccgaaaagc atacattgtc cctatctact atctagcaa // LOCUS ECOLACIS 113 bp ds-DNA BCT 29-JUN-1990 DEFINITION E.coli lacZ and IS1T insertion junction. 
ACCESSION   K03504
KEYWORDS    insertion element IS1T; lacZ/IS1 fusion protein.
SOURCE      E.coli K12 DNA, clone pCSH11.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 113)
  AUTHORS   Malamy,M.H., Rahaim,P.T., Hoffman,C.S., Baghdoyan,D.,
            O'Connor,M.B. and Miller,J.
  TITLE     A frameshift mutation at the junction of an IS1 insertion
            within lacZ restores beta-galactosidase activity via formation
            of an active lacZ-IS1 fusion protein
  JOURNAL   J. Mol. Biol. 181, 551-555 (1985)
  STANDARD  full staff_review
COMMENT     The site of integration of IS1 into lacZ is found 18
            nucleotides from the end of the sequence determining the
            C-terminus of beta-galactosidase. Reversion to lac+, promoted
            by ICR191, results from the loss of a "g" from the "ggg"
            sequence located at the junction of lacZ and IS1. As a result
            an active but temperature-sensitive lacZ/IS1 fusion protein is
            formed containing six amino acids derived from IS1 which
            replace six amino acids encoded by lacZ
FEATURES    from  to/span    description
    pept     <     1     57    lacZ/IS1 fusion protein (AA at 1)
    recomb        37     38    E.coli DNA end/IS1 DNA start
BASE COUNT       24 a     25 c     28 g     36 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattccagc tgagcgccgg tcgctaccat taccaggtaa tgctgccaac ttactgattt
       61 agtgtatgat ggtgatttta aggtgcttgc gtggcttcca tttccatcag atg
//
LOCUS       ECOOMPASA      70 bp ds-DNA             SYN       29-JUN-1990
DEFINITION  E.coli outer membrane protein ompA gene-plasmid pBR322 junction
            (synthetic), clone pOMPA21.
ACCESSION   J02534
KEYWORDS    artificial gene; membrane protein; outer membrane protein.
SOURCE      E.coli and plasmid pBR322 DNA, clone pOMPA21.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 70)
  AUTHORS   Bremer,E., Cole,S.T., Hindennach,I., Henning,U., Beck,E.,
            Kurz,C. and Schaller,H.
  TITLE     Export of a protein into the outer membrane of Escherichia coli
            K12: Stable incorporation of the OmpA protein requires less
            than 193 amino-terminal amino-acid residues
  JOURNAL   Eur. J. Biochem. 122, 223-231 (1982)
  STANDARD  full staff_review
COMMENT     [1] explored to what extent ompA gene can be shortened without
            impairing outer membrane incorporation of resulting protein
            fragments, and found that more than 133 NH2-terminal residues
            of the mature OmpA protein suffice but are unstable, while 193
            such residues are stable.
FEATURES    from  to/span    description
    pept     <     1     70    OmpA' protein (modified OmpA; aa at 2)
    recomb        13     14    E.coli end/pBR322 start
BASE COUNT       15 a     17 c     24 g     14 t
ORIGIN      13 bp upstream of BamHI site
        1 gctgagcaac ctggatccac aggacgggtg tggtcgccat gatcgcgtag tcgatagtgg
       61 ctccaagtag
//
LOCUS       ECOOMPASB      66 bp ds-DNA             SYN       29-JUN-1990
DEFINITION  E.coli outer membrane protein ompA gene / AD3 insert / plasmid
            pBR322 synthetic construct, clone pOMPA21-11.
ACCESSION   J02532
KEYWORDS    artificial gene; membrane protein; outer membrane protein.
SOURCE      E.coli, AD3 and plasmid pBR322 DNA, clone pOMPA21-11.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 66)
  AUTHORS   Bremer,E., Cole,S.T., Hindennach,I., Henning,U., Beck,E.,
            Kurz,C. and Schaller,H.
  TITLE     Export of a protein into the outer membrane of Escherichia coli
            K12: Stable incorporation of the OmpA protein requires less
            than 193 amino-terminal amino-acid residues
  JOURNAL   Eur. J. Biochem. 122, 223-231 (1982)
  STANDARD  full staff_review
COMMENT     See comments for first entry in this series. AD3 is a
            derivative of a synthetic lac operator.
FEATURES    from  to/span    description
    pept     <     1     42    OmpA' (modified OmpA protein)
    recomb        13     14    E.coli end/AD3 insert start
    recomb        60     61    AD3 insert end/pBR322 start
BASE COUNT       19 a     14 c     17 g     16 t
ORIGIN      13 bp upstream of BamHI site
        1 gcgatcactc ctggatccgg aattcaaatt gtgagcggat aacaatttga attccggatc
       61 gcgaag
//
LOCUS       ECOOMPASC      66 bp ds-DNA             SYN       29-JUN-1990
DEFINITION  E.coli outer membrane protein ompA gene / AD3 insert / plasmid
            pBR322 synthetic construct, clone pOMPA21-16.
ACCESSION   J02533
KEYWORDS    artificial gene; membrane protein; outer membrane protein.
SOURCE      E. coli, AD3 and plasmid pBR322 DNA, clone pOMPA21-16.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 66)
  AUTHORS   Bremer,E., Cole,S.T., Hindennach,I., Henning,U., Beck,E.,
            Kurz,C. and Schaller,H.
  TITLE     Export of a protein into the outer membrane of Escherichia coli
            K12: Stable incorporation of the OmpA protein requires less
            than 193 amino-terminal amino-acid residues
  JOURNAL   Eur. J. Biochem. 122, 223-231 (1982)
  STANDARD  full staff_review
COMMENT     See comments for first entry in this series. AD3 is a
            derivative of a synthetic lac operator.
FEATURES    from  to/span    description
    pept     <     1     34    OmpA' (modified OmpA; aa at 2)
    recomb        13     14    E.coli end/AD3 insert start
    recomb        60     61    AD3 insert end/pBR322 start
BASE COUNT       22 a     14 c     15 g     15 t
ORIGIN      20 bp upstream of EcoRI site
        1 acagaccaag cacgatccgg aattcaaatt gtgagcggat aacaatttga attccggatc
       61 tgatcg
//
LOCUS       HUMIGLAM2     872 bp ss-mRNA            PRI       29-JUN-1990
DEFINITION  Human Ig rearranged lambda-chain mRNA VJC-region subgroup
            lambda-IV from heterohybridoma H6-3C4.
ACCESSION   M18645
KEYWORDS    immunoglobulin; immunoglobulin light chain;
            lambda-immunoglobulin; variable region; variable region
            subgroup lambda-IV.
SOURCE      Human hybridoma H6-3C4 (cell fusion of mouse myeloma NS1 cells
            and human peripheral blood lymphocytes), cDNA to mRNA, clone
            p3C4-lambda-4.
ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 872) AUTHORS Yamasaki,N., Komori,S. and Watanabe,T. TITLE Complementary DNA for a human subgroup IV immunoglobulin lambda-chain JOURNAL Mol. Immunol. 24, 981-985 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 31 732 Ig VJC-region recomb 90 91 V-region start recomb 375 376 V-region end/J-region start recomb 412 413 J-region end/C-region start recomb 729 730 C-region end BASE COUNT 211 a 269 c 220 g 172 t ORIGIN 254 bp upstream of BamHI site. 1 tcagaagcag agttctgggg tgtctccacc atggcctgga cccctctctg gctcactctc 61 ttcactcttt gcataggttc tgtggtttct tctgaactga ctcaggaccc tgttgtgtct 121 gtggccttgg gacagacagt caggatcaca tgccaaggag acagcctcag aacctatcat 181 gcaagctggt accagcagaa gccaagacag gcccctgtac ttgtcatcta tgatgaaaac 241 aaccggccct cagggatccc agaccgattc tctggctcca cctcaggaaa cacagcttcc 301 ttgaccatca ctggggctca ggcggaagat gaggctgact attactgtaa ctcccgggac 361 agcagtggta accgtctggt attcggcgga gggaccaagc tgaccgtcct aggtcagccc 421 aaggctgccc cctcggtcac tctgttcccg ccctcctctg aggagcttca agccaacaag 481 gccacactgg tgtgtctcat aagtgacttc tacccgggag ccgtgacagt ggcctggaag 541 ggagatagca gccccgtcaa ggcgggagtg gagaccacca caccctccaa acaaagcaac 601 aacaagtacg cggccagcag ctatctgagc ctgacgcctg agcagtggaa gtcccacaga 661 agctacagct gccaggtcac gcatgaaggg agcaccgtgg agaagacagt ggcccctaca 721 gaatgttcat aggttctcaa ccctcacccc caccacggga gactagagct gcaggatccc 781 aggggagggg tctctcctcc caccccaagg catcaagccc ttctccctgc actcaataaa 841 ccctcaataa atattctcat tgtcaaggaa tt // LOCUS HUMTCAXU 401 bp ss-mRNA PRI 29-JUN-1990 DEFINITION Human T-cell receptor active alpha-chain mRNA, clone HAP36 (V-alpha-9.1 family; J-alpha-H family). ACCESSION M13737 KEYWORDS T-cell receptor; T-cell receptor alpha-chain. SOURCE Human adult PHA-stimulated peripheral blood T lymphocyte, cDNA to mRNA, clone HAP36. 
ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 401) AUTHORS Yoshikai,Y., Kimura,N., Toyonaga,B. and Mak,T.W. TITLE Sequences and repertoire of human T-cell receptor alpha chain variable region genes in mature T lymphocytes JOURNAL J. Exp. Med. 164, 90-103 (1986) STANDARD simple staff_review FEATURES from to/span description ORF < 1 > 401 T-cell receptor active alpha-chain precursor recomb 321 322 V region end/D-J region start recomb 387 388 D-J region end/C region start BASE COUNT 115 a 102 c 87 g 97 t ORIGIN 1 atgaagccca ccctcatctc agtgcttgtg ataatattta tactcagagg aacaagagcc 61 cagagagtga ctcagcccga gaagctcctc tctgtcttta aaggggcccc agtggagctg 121 aagtgcaact attcctattc tgggagtcct gaactcttct ggtatgtcca gtactccaga 181 caacgcctcc agttactctt gagacacatc tctagagaga gcatcaaagg cttcactgct 241 gaccttaaca aaggcgagac atctttccac ctgaagaaac catttgctca agaggaagac 301 tcagccatgt attactgtgc tctaagtgtt tataaccagg gaggaaagct tatcttcgga 361 cagggaacgg agttatctgt gaaacccaat atccagaacc c // LOCUS HUMTCAYR 406 bp ss-mRNA PRI 29-JUN-1990 DEFINITION Human T-cell receptor active alpha-chain V-region (V-J-C) mRNA, partial cds, clone AA25. ACCESSION M17657 J02992 KEYWORDS T-cell receptor alpha-chain; antigen receptor; constant region; joining region; processed gene; variable region. SOURCE Human peripheral blood lymphocyte, cDNA to mRNA, clone AA25. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 406) AUTHORS Klein,M.H., Concannon,P., Everett,M., Kim,L.D.H., Hunkapiller,T. and Hood,L. TITLE Diversity and structure of human T-cell receptor alpha-chain variable region genes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 
84, 6884-6888 (1987) STANDARD simple staff_review FEATURES from to/span description pept 1 > 406 T-cell receptor alpha-chain V-region (V-J-C) precursor /nomgen="TCRA" /map="14q11.2" /hgml_locus_uid="LX0123X" sigp 1 69 T-cell receptor alpha-chain signal peptide matp 70 > 403 T-cell receptor alpha-chain recomb 340 341 TCR V-region DNA end/TCR J-region DNA start recomb 400 401 TCR J-region DNA end/TCR C-region DNA start BASE COUNT 104 a 95 c 98 g 109 t ORIGIN Chromosome 14q11.2. 1 atgatgatat ccttgagagt tttactggtg atcctgtggc ttcagttaag ctgggtttgg 61 agccaacgga aggaggtgga gcaggatcct ggacccttca atgttccaga gggagccact 121 gtcgctttca actgtactta cagcaacagt gcttctcagt ctttcttctg gtacagacag 181 gattgcagga aagaacctaa gttgctgatg tccgtatact ccagtggtaa tgaagatgga 241 aggtttacag cacacgtcaa tagagccagc cagtatattt ccctgctcat cagagactcc 301 aagctcagtg attcagccac ctacctctgt gtggtgaaca ttcgcccagg aaacacacct 361 cttgtctttg gaaagggcac aagactttct gtgattccaa atatcc // LOCUS LAMCG 48502 bp ds-DNA circular PHG 29-JUN-1990 DEFINITION Bacteriophage lambda, complete genome. ACCESSION J02459 M17233 KEYWORDS DNA binding protein; circular; coat protein; complete genome; origin of replication; repressor; unidentified reading frame. SOURCE Lambda wild-type and lambda strain cI857s7. ORGANISM Bacteriophage lambda Viridae; ds-DNA nonenveloped viruses; Siphoviridae. REFERENCE 1 (bases 37224 to 37940) AUTHORS Sauer,R.T. TITLE DNA sequence of the bacteriophage lambda cI gene JOURNAL Nature 276, 301-302 (1978) STANDARD full staff_review REFERENCE 2 (bases 37990 to 38982) AUTHORS Schwarz,E., Scherer,G., Hobom,G. and Kossel,H. TITLE Nucleotide sequence of cro, cII and part of the O gene in phage lambda DNA JOURNAL Nature 272, 410-414 (1978) STANDARD full staff_review REFERENCE 3 (bases 34957 to 35615; r-strand) AUTHORS Franklin,N.C. and Bennett,G.N. 
TITLE The N protein of bacteriophage lambda, defined by its DNA sequence, is highly basic JOURNAL Gene 8, 107-119 (1979) STANDARD full staff_review REFERENCE 4 (bases 37768 to 40293) AUTHORS Schwarz,E., Scherer,G., Hobom,G. and Kossel,H. TITLE The primary structure of the phage lambda P gene completes the nucleotide sequence of the plasmid lambda-dvh93 JOURNAL Biochem. Int. 1, 386-394 (1980) STANDARD full staff_review REFERENCE 5 (bases 27724 to 29275) AUTHORS Hoess,R.H., Foeller,C., Bidwell,K. and Landy,A. TITLE Site-specific recombination functions of bacteriophage lambda: DNA sequence of regulatory regions and overlapping structural genes for int and xis JOURNAL Proc. Natl. Acad. Sci. U.S.A. 77, 2482-2486 (1980) STANDARD full staff_review REFERENCE 6 (bases 43682 to 45218) AUTHORS Daniels,D.L. and Blattner,F.R. TITLE Nucleotide sequence of the Q gene and the Q to S intergenic region of bacteriophage lambda JOURNAL Virology 117, 81-92 (1982) STANDARD full staff_review REFERENCE 7 (bases 1 to 48502; fragments) AUTHORS Sanger,F., Coulson,A.R., Hong,G.F., Hill,D.F. and Petersen,G.B. TITLE Nucleotide sequence of bacteriophage lambda DNA JOURNAL J. Mol. Biol. 162, 729-773 (1982) STANDARD full staff_review REFERENCE 8 (review; complete genome) AUTHORS Daniels,D.L., Schroeder,J.L., Szybalski,W., Sanger,F. and Blattner,F.R. TITLE Appendix I: A molecular map of coliphage lambda JOURNAL (in) Hendrix,R.W., Roberts,J.W., Stahl,F.W. and Weisberg,R.A. (Eds.); Lambda II: 469-517; Cold Spring Harbor Laboratory, New York (1983) STANDARD full staff_review REFERENCE 9 (review; complete genome with annotation) AUTHORS Daniels,D.L., Schroeder,J.L., Szybalski,W., Sanger,F., Coulson,A.R., Hong,G.F., Hill,D.F., Petersen,G.B. and Blattner,F.R. TITLE Appendix II: Complete annotated lambda sequence JOURNAL (in) Hendrix,R.W., Roberts,J.W., Stahl,F.W. and Weisberg,R.A. 
(Eds.); Lambda II: 519-674; Cold Spring Harbor Laboratory, New York (1983) STANDARD full staff_review REFERENCE 10 (bases 45493 to 45963) AUTHORS Imada,M. and Tsugita,A. TITLE Amino acid sequence of lambda phage endolysin JOURNAL Nature New Biol. 233, 230-231 (1971) STANDARD full staff_review REFERENCE 11 (bases 44588 to 44780) AUTHORS Lebowitz,P., Weissman,S.M. and Radding,C.M. TITLE Nucleotide sequence of a ribonucleic acid transcribed in vitro from lambda phage deoxyribonucleic acid JOURNAL J. Biol. Chem. 246, 5120-5139 (1971) STANDARD full staff_review REFERENCE 12 (bases 1 to 12; both strands) AUTHORS Wu,R. and Taylor,E. TITLE Nucleotide sequence analysis of DNA. II. Complete nucleotide sequence of the cohesive ends of bacteriophage lambda DNA JOURNAL J. Mol. Biol. 57, 491-511 (1971) STANDARD full staff_review REFERENCE 13 (bases 38597 to 38672) AUTHORS Dahlberg,J.E. and Blattner,F.R. TITLE In vitro transcription products of lambda DNA: Nucleotide sequences and regulatory sites JOURNAL (in) Fox,C.F. and Robinson,W.S. (Eds.); Virus Research. Proceedings of 1973 ICN-UCLA Symposium: 533-544; Academic Press, New York (1973) STANDARD full staff_review REFERENCE 14 (sites; fragments at the 3'-terminus) AUTHORS Weigel,P.H., Englund,P.T., Murray,K. and Old,R.W. TITLE The 3'-terminal nucleotide sequences of bacteriophage lambda DNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 70, 1151-1155 (1973) STANDARD full staff_review REFERENCE 15 (bases 35434 to 35618) AUTHORS Dahlberg,J.E. and Blattner,F.R. TITLE Sequence of the promoter-operator proximal region of the major leftward of bacteriophage lambda JOURNAL Nucleic Acids Res. 2, 1441-1458 (1975) STANDARD full staff_review REFERENCE 16 (bases 35583 to 35600) AUTHORS Kleid,D.G., Agarwal,K.L. and Khorana,H.G. TITLE The nucleotide sequence in the promoter region of the gene N in bacteriophage lambda JOURNAL J. Biol. Chem. 
250, 5574-5582 (1975) STANDARD full staff_review REFERENCE 17 (bases 37945 to 38018) AUTHORS Maniatis,T., Jeffrey,A. and Kleid,D.G. TITLE Nucleotide sequence of the rightward operator of phage lambda JOURNAL Proc. Natl. Acad. Sci. U.S.A. 72, 1184-1188 (1975) STANDARD full staff_review REFERENCE 18 (bases 44588 to 44773) AUTHORS Sklar,J., Yot,P. and Weissman,S.M. TITLE Determination of genes, restriction sites, and DNA sequences surrounding the 6s template of bacteriophage lambda JOURNAL Proc. Natl. Acad. Sci. U.S.A. 72, 1817-1821 (1975) STANDARD full staff_review REFERENCE 19 (bases 37945 to 38027) AUTHORS Maniatis,T., Ptashne,M., Backman,K., Kleid,D., Flashman,S., Jeffrey,A. and Maurer,R. TITLE Recognition sequences of repressor and polymerase in the operators of bacteriophage lambda JOURNAL Cell 5, 109-113 (1975) STANDARD full staff_review REFERENCE 20 (bases 37946 to 38039) AUTHORS Smith,G.R., Eisen,H., Reichardt,L. and Hedgpeth,J. TITLE Deletions of lambda phage locating a p-rm mutation within the rightward operator JOURNAL Proc. Natl. Acad. Sci. U.S.A. 73, 712-716 (1976) STANDARD full staff_review REFERENCE 21 (bases 35578 to 35667; 37903 to 38027) AUTHORS Ptashne,M., Bachman,K., Humayun,M.Z., Jeffrey,A., Maurer,R., Meyer,B. and Sauer,R.T. TITLE Autoregulation and function of a repressor in bacteriophage lambda JOURNAL Science 194, 156-161 (1976) STANDARD full staff_review REFERENCE 22 (bases 37905 to 37989) AUTHORS Walz,A., Pirrotta,V. and Ineichen,K. TITLE Lambda repressor regulates the switch between p-r and p-rm promoters JOURNAL Nature 262, 665-669 (1976) STANDARD full staff_review REFERENCE 23 (bases 27617 to 27934) AUTHORS Landy,A. and Ross,W. TITLE Viral integration and excision: structure of the lambda att sites JOURNAL Science 197, 1147-1160 (1977) STANDARD full staff_review REFERENCE 24 (bases 39062 to 39170) AUTHORS Denniston-Thompson,K., Moore,D.D., Kruger,K.E., Furth,M.E. and Blattner,F.R. 
TITLE Physical structure of the replication origin of bacteriophage lambda JOURNAL Science 198, 1051-1056 (1977) STANDARD full staff_review REFERENCE 25 (bases 38610 to 38732) AUTHORS Scherer,G., Hobom,G. and Koessel,H. TITLE DNA base sequence of the p-o promoter region of phage lambda JOURNAL Nature 265, 117-121 (1977) STANDARD full staff_review REFERENCE 26 (bases 38041 to 38241) AUTHORS Roberts,T.M., Shimatake,H., Brady,C. and Rosenberg,M. TITLE Sequence of cro gene of bacteriophage lambda JOURNAL Nature 270, 274-275 (1977) STANDARD full staff_review REFERENCE 27 (bases 27616 to 28935) AUTHORS Davies,R.W., Schreier,P.H. and Buechel,D.E. TITLE Nucleotide sequence of the attachment site of coliphage lambda JOURNAL Nature 270, 757-760 (1977) STANDARD full staff_review REFERENCE 28 (bases 35578 to 35667) AUTHORS Humayun,Z., Jeffrey,A. and Ptashne,M. TITLE Completed DNA sequences and organization of repressor-binding sites in the operators of phage lambda JOURNAL J. Mol. Biol. 112, 265-277 (1977) STANDARD full staff_review REFERENCE 29 (bases 37206 to 37263; 37914 to 37970) AUTHORS Humayun,Z. TITLE DNA sequence at the end of the cI gene in bacteriophage lambda JOURNAL Nucleic Acids Res. 4, 2137-2143 (1977) STANDARD full staff_review REFERENCE 30 (bases 44467 to 44807) AUTHORS Sklar,J.L. TITLE Structure and function of two regions of DNA controlling the synthesis of prokaryotic RNAs JOURNAL Thesis (1977) Yale University, New Haven, CT, USA STANDARD full staff_review REFERENCE 31 (bases 38470 to 39189) AUTHORS Hobom,G., Grosschedl,R., Lusky,M., Scherer,G., Schwarz,E. and Koessel,H. TITLE Functional analysis of the replicator structure of lambdoid bacteriophage DNAs JOURNAL Cold Spring Harb. Symp. Quant. Biol. 43, 165-178 (1979) STANDARD full staff_review REFERENCE 32 (bases 37938 to 38016; 35589 to 35666; comp strand) AUTHORS Flashman,S.M. TITLE Mutational analysis of the operators of bacteriophage lambda JOURNAL Mol. Gen. Genet. 
166, 61-73 (1978) STANDARD full staff_review REFERENCE 33 (bases 38212 to 38362) AUTHORS Rosenberg,M., Court,D., Shimatake,H., Brady,C. and Wulff,D.L. TITLE The relationship between function and DNA sequence in an intercistronic regulatory region in phage lambda JOURNAL Nature 272, 414-423 (1978) STANDARD full staff_review REFERENCE 34 (bases 29711 to 29811; 31043 to 31058) AUTHORS Davies,R.W., Schreier,P.H. and Buechel,D.E. TITLE Determination of the endpoints of partial deletion mutants of the attachment site of bacteriophage lambda by DNA sequencing JOURNAL Nucleic Acids Res. 5, 3209-3218 (1978) STANDARD full staff_review REFERENCE 35 (bases 21661 to 31129; fragments) AUTHORS Hoess,R.H. and Landy,A. TITLE Structure of the lambda att sites generated by int-dependent deletions JOURNAL Proc. Natl. Acad. Sci. U.S.A. 75, 5437-5441 (1978) STANDARD full staff_review REFERENCE 36 (bases 38453 to 38500) AUTHORS Sprague,K.U., Faulds,D.H. and Smith,G.R. TITLE A single base-pair change creates a chi recombinational hotspot in bacteriophage lambda JOURNAL Proc. Natl. Acad. Sci. U.S.A. 75, 6182-6186 (1978) STANDARD full staff_review REFERENCE 37 (bases 13 to 72; 48391 to 48502) AUTHORS Nichols,B.P. and Donelson,J.E. TITLE 178-Nucleotide sequence surrounding the cos site of bacteriophage lambda DNA JOURNAL J. Virol. 26, 429-434 (1978) STANDARD full staff_review REFERENCE 38 (bases 38597 to 39688) AUTHORS Scherer,G. TITLE Nucleotide sequence of the O gene and of the origin of replication in bacteriophage lambda DNA JOURNAL Nucleic Acids Res. 5, 3141-3156 (1978) STANDARD full staff_review REFERENCE 39 (bases 38453 to 38500) AUTHORS Smith,G.R., Faulds,D.H. and Sprague,K.U. TITLE Nucleotide-sequence analysis of a chi site JOURNAL Cold Spring Harb. Symp. Quant. Biol. 43, 1067-1068 (1979) STANDARD full staff_review REFERENCE 40 (sites; transcription termination sites) AUTHORS Adhya,S. and Gottesman,M. TITLE Control of transcription termination JOURNAL Annu. Rev. Biochem. 
47, 967-996 (1978) STANDARD full staff_review REFERENCE 41 (bases 38008 to 39328) AUTHORS Moore,D.D., Denniston-Thompson,K., Kruger,K.E., Furth,M.E., Williams,B.G., Daniels,D.L. and Blattner,F.R. TITLE Dissection and comparative anatomy of the origins of replication of lambdoid phages JOURNAL Cold Spring Harb. Symp. Quant. Biol. 43, 155-163 (1979) STANDARD full staff_review REFERENCE 42 (bases 27711 to 27826) AUTHORS Ross,W., Landy,A., Kikuchi,Y. and Nash,H. TITLE Interaction of int protein with specific sites on lambda att DNA JOURNAL Cell 18, 297-307 (1979) STANDARD full staff_review REFERENCE 43 (bases 37305 to 37352) AUTHORS Lieb,M. TITLE Is5 increases recombination in adjacent regions as shown for the repressor gene of coliphage lambda JOURNAL Gene 12, 277-280 (1980) STANDARD full staff_review REFERENCE 44 (bases 29055 to 29131) AUTHORS Miller,H.I., Abraham,J., Benedik,M., Campbell,A., Court,D., Echols,H., Fischer,R., Galindo,J.M., Guarneros,G., Hernandez,T., Mascarenhas,D., Montanez,C., Schindler,D., Schmeissner,U. and Sosa,L. TITLE Regulation of the integration-excision reaction by bacteriophage lambda JOURNAL Cold Spring Harb. Symp. Quant. Biol. 45, 439-445 (1981) STANDARD full staff_review REFERENCE 45 (bases 36245 to 36343) AUTHORS Pirrotta,V., Ineichen,K. and Walz,A. TITLE An unusual polymerase binding site in the immunity region of phage lambda JOURNAL Mol. Gen. Genet. 180, 369-376 (1980) STANDARD full staff_review REFERENCE 46 (bases 37940 to 38016) AUTHORS Rosen,E.D., Hartley,J.L., Matz,K., Nichols,B.P., Young,K.M., Donelson,J.E. and Gussin,G.N. TITLE DNA sequence analysis of prm- mutations of coliphage lambda JOURNAL Gene 11, 197-205 (1980) STANDARD full staff_review REFERENCE 47 (bases 38212 to 38467) AUTHORS Wulff,D.L., Beher,M., Izumi,S., Beck,J., Mahoney,M., Shimatake,H., Brady,C., Court,D. and Rosenberg,M. TITLE Structure and function of the cy control region of bacteriophage lambda JOURNAL J. Mol. Biol. 
138, 209-230 (1980) STANDARD full staff_review REFERENCE 48 (bases 38237 to 38334) AUTHORS Court,D., Brady,C., Rosenberg,M., Wulff,D.L., Behr,M., Mahoney,M. and Izumi,S. TITLE Control of transcription termination: A rho-dependent termination site in bacteriophage lambda JOURNAL J. Mol. Biol. 138, 231-254 (1980) STANDARD full staff_review REFERENCE 49 (bases 38102 to 38166) AUTHORS Calva,E. and Burgess,R.R. TITLE Characterization of a rho-dependent termination site within the cro gene of bacteriophage lambda JOURNAL J. Biol. Chem. 255, 11017-11022 (1980) STANDARD full staff_review REFERENCE 50 (bases 28929 to 29198) AUTHORS Abraham,J., Mascarenhas,D., Fischer,R., Benedik,M., Campbell,A. and Echols,H. TITLE DNA sequence of regulatory region for integration gene of bacteriophage lambda JOURNAL Proc. Natl. Acad. Sci. U.S.A. 77, 2477-2481 (1980) STANDARD full staff_review REFERENCE 51 (bases 27724 to 29525) AUTHORS Davies,R.W. TITLE DNA sequence of the int-xis p-i region of the bacteriophage lambda; overlap of the int and xis genes JOURNAL Nucleic Acids Res. 8, 1765-1782 (1980) STANDARD full staff_review REFERENCE 52 (bases 27501 to 27615) AUTHORS Mizuuchi,M. and Mizuuchi,K. TITLE Integrative recombination of bacteriophage lambda: extent of the DNA sequence involved in attachment site function JOURNAL Proc. Natl. Acad. Sci. U.S.A. 77, 3220-3224 (1980) STANDARD full staff_review REFERENCE 53 (bases 27479 to 27633) AUTHORS Hsu,P.-L., Ross,W. and Landy,A. TITLE The lambda phage att site: functional limits and interaction with int protein JOURNAL Nature 285, 85-91 (1980) STANDARD full staff_review REFERENCE 54 (bases 23131 to 23248) AUTHORS Rosenvold,E.C., Calva,E., Burgess,R.R. and Szybalski,W. TITLE In vitro transcription from the b2 region of bacteriophage lambda JOURNAL Virology 107, 476-487 (1980) STANDARD full staff_review REFERENCE 55 (bases 30493 to 30569) AUTHORS Smith,G.R., Schultz,D.W. and Crasemann,J.M. 
TITLE Generalized recombination: Nucleotide sequence homology between chi recombinational hotspots JOURNAL Cell 19, 785-793 (1980) STANDARD full staff_review REFERENCE 56 (bases 37940 to 38023) AUTHORS Meyer,B.J., Maurer,R. and Ptashne,M. TITLE Gene regulation at the right operator (o-r) of bacteriophage lambda. II. o-r-1, o-r-2, and o-r-3: their roles in mediating the effects of repressor and cro JOURNAL J. Mol. Biol. 139, 163-194 (1980) STANDARD full staff_review REFERENCE 57 (bases 29055 to 29124) AUTHORS Abraham,J. and Echols,H. TITLE Regulation of int gene transcription by bacteriophage lambda: location of the start generated by an int constitutive mutation JOURNAL J. Mol. Biol. 146, 157-165 (1981) STANDARD full staff_review REFERENCE 58 (bases 43681 to 45634) AUTHORS Daniels,D.L. TITLE Control of late transcription in bacteriophage lambda JOURNAL Thesis (1981) University of Wisconsin, Madison, WI, USA STANDARD full staff_review REFERENCE 59 (bases 35468 to 35819) AUTHORS Horn,G.T. and Wells,R.D. TITLE The leftward promoter of bacteriophage lambda JOURNAL J. Biol. Chem. 256, 1998-2002 (1981) STANDARD full staff_review REFERENCE 60 (bases 32503 to 35905) AUTHORS Ineichen,K., Shepherd,J.C.W. and Bickle,T.A. TITLE The DNA sequence of the phage lambda genome between p-l and the gene bet JOURNAL Nucleic Acids Res. 9, 4639-4653 (1981) STANDARD full staff_review REFERENCE 61 (bases 43860 to 45001) AUTHORS Petrov,N.A., Karginov,V.A., Mikryukov,N.N., Serpinski,O.I. and Kravchenko,V.V. TITLE Complete nucleotide sequence of the bacteriophage lambda DNA region containing gene Q and promoter p-r JOURNAL FEBS Lett. 133, 316-320 (1981) STANDARD full staff_review REFERENCE 62 (bases 44972 to 45057) AUTHORS Smith,G.R., Comb,M., Schultz,D.W., Daniels,D.L. and Blattner,F.R. TITLE Nucleotide sequence of the chi recombinational hotspot chi+d in bacteriophage lambda JOURNAL J. Virol. 
37, 336-342 (1981) STANDARD full staff_review REFERENCE 63 (bases 38686 to 39224) AUTHORS Moore,D.D., Denniston,K.J. and Blattner,F.R. TITLE Sequence organization of the origins of DNA replication in lambdoid coliphages JOURNAL Gene 14, 91-101 (1981) STANDARD full staff_review REFERENCE 64 (bases 35468 to 35711) AUTHORS Remaut,E., Stanssens,P. and Fiers,W. TITLE Plasmid vectors for high-efficiency expression controlled by the pl promoter of coliphage lambda JOURNAL Gene 15, 81-93 (1981) STANDARD full staff_review REFERENCE 65 (bases 35468 to 35541) AUTHORS Drahos,D. and Szybalski,W. TITLE Antitermination and termination functions of the cloned Nutl, N and tl1 modules of coliphage lambda JOURNAL Gene 16, 261-274 (1981) STANDARD full staff_review REFERENCE 66 (bases 27650 to 27741) AUTHORS Kravchenko,V.V. and Mikryukov,N.N. TITLE Localization of the promoter p-att of the binding site of Escherichia coli polymerase on phage lambda DNA near the integration site JOURNAL Dokl. Biochem. 264, 148-151 (1982) STANDARD full staff_review REFERENCE 67 (bases 39219 to 39338) AUTHORS Moore,D.D. and Blattner,F.R. TITLE Appendix: Sequence of lambda ri c 5b JOURNAL J. Mol. Biol. 154, 81-83 (1982) STANDARD full staff_review REFERENCE 68 (bases 37938 to 38018) AUTHORS Hawley,D.K. and McClure,W.R. TITLE Mechanism of activation of transcription initiation from the lambda p-rm promoter JOURNAL J. Mol. Biol. 157, 493-525 (1982) STANDARD full staff_review REFERENCE 69 (bases 25157 to 27484) AUTHORS Hong,G.F. TITLE A systematic DNA sequencing strategy JOURNAL J. Mol. Biol. 158, 539-549 (1982) STANDARD full staff_review REFERENCE 70 (bases 35577 to 35647) AUTHORS Hyman,H.C. and Honigman,A. TITLE The use of the plasmid pha10 in the isolation of lambda pl promoter mutations JOURNAL Mol. Gen. Genet. 185, 515-517 (1982) STANDARD full staff_review REFERENCE 71 (bases 38262 to 38386) AUTHORS Lau,L.F., Roberts,J.W. and Wu,R. 
TITLE Transcription terminates at lambda tr1 in three clusters JOURNAL Proc. Natl. Acad. Sci. U.S.A. 79, 6171-6175 (1982) STANDARD full staff_review REFERENCE 72 (bases 31299 to 31408) AUTHORS Luk,K.-C. and Szybalski,W. TITLE Transcription termination: Sequence and function of the rho-independent t-l3 terminator in the major leftward operon of bacteriophage lambda JOURNAL Gene 17, 247-258 (1982) STANDARD full staff_review REFERENCE 73 (bases 35437 to 37348) AUTHORS Landsmann,J., Kroeger,M. and Hobom,G. TITLE The rex region of bacteriophage lambda: Two genes under three-way control JOURNAL Gene 20, 11-24 (1982) STANDARD full staff_review REFERENCE 74 (bases 40218 to 43972) AUTHORS Kroeger,M. and Hobom,G. TITLE A chain of interlinked genes in the NinR region of bacteriophage lambda JOURNAL Gene 20, 25-38 (1982) STANDARD full staff_review REFERENCE 75 (bases 31299 to 31408) AUTHORS Luk,K.-C. and Szybalski,W. TITLE Characterization of the cloned terminators t-r1, t-l3 and t-i, and the Nutr antitermination site of coliphage lambda JOURNAL Gene 20, 127-134 (1982) STANDARD full staff_review REFERENCE 76 (bases 48424 to 48500) AUTHORS Miwa,T. and Matsubara,K. TITLE Identification of sequences necessary for packaging DNA into lambda phage heads JOURNAL Gene 20, 267-279 (1982) STANDARD full staff_review REFERENCE 77 (bases 18414 to 18746) AUTHORS Luk,K.-C. and Szybalski,W. TITLE A cluster of leftward, rho-dependent t'j terminators in the J gene of coliphage lambda JOURNAL Gene 21, 175-191 (1983) STANDARD full staff_review REFERENCE 78 (bases 48469 to 48498) AUTHORS Miwa,T. and Matsubara,K. TITLE Lambda phage DNA sequences affecting the packaging process JOURNAL Gene 24, 199-206 (1983) STANDARD full staff_review REFERENCE 79 (bases 33287 to 33486) AUTHORS Knight,D.M. and Echols,H. TITLE The cIII gene and protein of bacteriophage lambda JOURNAL J. Mol. Biol. 
163, 505-510 (1983) STANDARD full staff_review REFERENCE 80 (bases 33000 to 33244; 33420 to 33543; 33629 to 34080) AUTHORS Luk,K.-C. and Szybalski,W. TITLE The tl2 cluster of transcription termination sites between genes bet and ral of coliphage lambda JOURNAL Virology 125, 403-418 (1983) STANDARD full staff_review REFERENCE 81 (bases 37938 to 38019) AUTHORS Shih,M.-C. and Gussin,G.N. TITLE Mutations affecting two different steps in transcription initiation at the phage lambda p-rm promoter JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 496-500 (1983) STANDARD full staff_review REFERENCE 82 (bases 1 to 56; 48474 to 48502) AUTHORS Feiss,M., Kobayashi,I. and Widner,W. TITLE Separate sites for binding and nicking of bacteriophage lambda DNA by terminase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 955-959 (1983) STANDARD full staff_review REFERENCE 83 (bases 29063 to 29140) AUTHORS Benedik,M., Mascarenhas,D. and Campbell,A. TITLE The integrase promoter and t1' terminator in bacteriophages lambda and 434 JOURNAL Virology 126, 658-668 (1983) STANDARD full staff_review REFERENCE 84 (bases 45901 to 46443) AUTHORS Taylor,A., Benedik,M. and Campbell,A. TITLE Location of the R-z gene in bacteriophage lambda JOURNAL Gene 26, 159-163 (1983) STANDARD full staff_review REFERENCE 85 (sites; cohesive ends) AUTHORS Hohn,B. TITLE DNA sequences necessary for packaging of bacteriophage lambda DNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 7456-7460 (1983) STANDARD full staff_review REFERENCE 86 (sites; Pre-promoter mutations) AUTHORS Wulff,D.L., Mahoney,M., Shatzman,A. and Rosenberg,M. TITLE Mutational analysis of a regulatory region in bacteriophage lambda that has overlapping signals for the initiation of transcription and translation JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81, 555-559 (1984) STANDARD full staff_review REFERENCE 87 (sites; attP recombination site) AUTHORS Craig,N.L. and Nash,H.A. TITLE E. 
coli integration host factor binds to specific sites in DNA JOURNAL Cell 39, 707-716 (1984) STANDARD full staff_review REFERENCE 88 (sites; major leftward transcription unit) AUTHORS Edlind,T.D., Cooley,T.E., Richards,S.H. and Ihler,G.M. TITLE Long range base-pairing in the leftward transcription unit of bacteriophage lambda: Characterization by electron microscopy and computer-aided sequence analysis JOURNAL J. Mol. Biol. 179, 351-365 (1984) STANDARD full staff_review REFERENCE 89 (sites; prohead binding) AUTHORS Frackman,S., Siegele,D.A. and Feiss,M. TITLE A functional domain of bacteriophage lambda terminase for prohead binding JOURNAL J. Mol. Biol. 180, 283-300 (1984) STANDARD full staff_review REFERENCE 90 (sites; cII binding site mutations) AUTHORS Place,N., Fien,K., Mahoney,M.E., Wulff,D.L., Ho,Y.-S., Debouck,C., Rosenberg,M., Shih,M.-C. and Gussin,G.N. TITLE Mutations that alter the DNA binding site for the bacteriophage lambda cII protein and affect the translation efficiency of the cII gene JOURNAL J. Mol. Biol. 180, 865-880 (1984) STANDARD full staff_review REFERENCE 91 (sites; nutR mutations) AUTHORS Warren,F. and Das,A. TITLE Formation of termination-resistant transcription complex at phage lambda nut locus: Effects of altered translation and a ribosomal mutation JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81, 3612-3616 (1984) STANDARD full staff_review REFERENCE 92 (sites; light chain oligonucleotides) AUTHORS Coleclough,C. and Erlitz,F.L. TITLE Use of primer-restriction-end adapters in a novel cDNA cloning strategy JOURNAL Gene 34, 305-314 (1985) STANDARD full staff_review REFERENCE 93 (sites; nutL antiterminator) AUTHORS Peltz,S.W., Brown,A.L., Hasan,N., Podhajska,A.J. and Szybalski,W. TITLE Thermosensitivity of a DNA recognition Site: Activity of a truncated nutL Antiterminator of coliphage lambda JOURNAL Science 228, 91-93 (1985) STANDARD full staff_review REFERENCE 94 (sites; rho utilization sites A and B) AUTHORS Chen,C.-Y.A. and Richardson,J.P. 
TITLE Sequence elements essential for rho-dependent transcription termination at lambda-tR1 JOURNAL J. Biol. Chem. 262, 11292-11299 (1987) STANDARD simple staff_review REFERENCE 95 (bases 2521 to 3300) AUTHORS Hong,G.F. TITLE Sequencing of large double-stranded DNA using the dideoxy sequencing technique JOURNAL Biosci. Rep. 2, 907-912 (1982) STANDARD simple staff_entry COMMENT Contributed on tape by F.Sanger via D.L.Daniels. Most of references [10] through [80] are either annotated by [8] and [9], which are the immediate sources for the annotation below, or they are cited in Table 3 of [8]. Only references [1] through [9] are represented in the features table herein. This is the best representation to date of the wild-type lambda l-strand, though much of the sequence was determined for the cI857s7 strain and changed to wild-type [8]. All reported variations leading to the strains cI857s7, imm21, imm434, lac5, Nin5 and b2 are included in the annotation. The first twelve bases are the sticky ends. A significant fraction of the known mutations affecting replication and transcription have been annotated below; a large number of point mutations, deletions and substitutions have not. For a complete account of lambda mutations in relation to the sequence, see [8]. Each coding sequence belongs to a reading frame (orf) whose number, given in parentheses, should indicate the number of amino acids coded. The starting points for translation are known with varying degrees of certainty; for example, the start site for the N protein, given here as 35438, may turn out to be downstream (on the complementary strand) at 35360. When direct empirical evidence such as mutation or amino acid sequence is lacking, the start point is said to be putative. For a summary of the evidence bearing upon the coding sequences, see [7],[8]. Intergenic spaces in lambda are typically short and overlapping: the multiple reading frames (mult) range between a span of 1 and a span of 103. 
In most cases, a start codon precedes a termination codon, the exceptions being the m-l boundary (13429) and the orf-314/orf-194 boundary (21973), which show the E. coli trp operon pattern of "translational coupling" (see ). Transcription in the central region, bases 22686 to 37940, is leftward off the l-strand. In our annotation, this is indicated by the letter "c" and the descriptive term "comp strand". Signals and recognition sites in this region are treated accordingly, with no judgement made about their polarity; hence their span should be read toward the left rather than toward the right. Furthermore, some leftward transcription is located outside the central region, and that is also indicated by "c" and "comp strand". In general, the estimates for the extent or span of signals (e.g. operators), binding sites (e.g. Nutr, int-binding sites, etc.) and the attachment site (att) vary in the literature. This annotation follows [8]. No attempt is made to annotate promoters as signals because their span is indefinite; however, known promoter mutants are given. The cII protein is known to bind in the -35 regions of p-i (29091) and pre (38369). Transcript termination sites must be understood to be conditional on the N and Q proteins and less than 100% efficient. There remain terminators to be found, and some of those annotated may have significance only in vitro. 
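The two conventions described in the comment above (the number in parentheses as the count of amino acids coded, and "c" marking features read leftward) can be cross-checked against the from/to columns with a short sketch. It is illustrative only and not part of the database entry: the function name `peptide_length` is hypothetical, and it assumes that each pept span includes the termination codon, which is consistent with the nu1, A, int and cro rows of the features table.

```python
from typing import Tuple


def peptide_length(start: int, end: int) -> Tuple[int, str]:
    """Return (amino acids coded, strand) for a pept feature span.

    Assumption: the span covers the coding sequence plus its stop codon,
    so the residue count is (span / 3) - 1. A feature whose "from" value
    exceeds its "to" value is read leftward, i.e. on the comp strand.
    """
    span = abs(end - start) + 1
    if span % 3 != 0:
        raise ValueError(f"span of {span} bases is not a whole number of codons")
    strand = "comp" if start > end else "direct"
    return span // 3 - 1, strand


# Rows taken from the features table: name -> (from, to)
examples = {
    "nu1": (191, 736),      # listed as 181
    "A": (711, 2636),       # listed as 641
    "int": (28882, 27812),  # listed as 356, marked (c)
    "cro": (38041, 38241),  # listed as 66
}

for name, (start, end) in examples.items():
    residues, strand = peptide_length(start, end)
    print(f"{name}: {residues} amino acids, {strand} strand")
```

Under the same assumption, a row whose listed number differs from the computed one (for instance cII, where the span 38360 to 38653 implies 97 rather than the listed 119) may be worth checking against [8].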
FEATURES from to/span description pept 191 736 nu1 (DNA packaging;181) pept 711 2636 A (DNA packaging;641) pept 2633 2839 W (head-tail joining;68) pept 2836 4437 B (capsid component;533) sigp 2839 2901 leader peptide matp 2902 4434 processed B pept 4418 5737 C (capsid component;439) pept 5132 5737 nu3 (capsid assembly;201) pept 5747 6079 D (head-DNA stabilization;110) pept 6135 7160 E (capsid component;341) pept 7202 7600 Fi (DNA packaging;117) pept 7612 7965 Fii (head-tail joining;117) pept 7977 8555 Z (tail component;192) pept 8552 8947 U (tail component;131) pept 8955 9695 V (tail component;256) pept 9711 10133 G (tail component;140) pept 10115 10549 T (tail component;144) pept 10542 13103 H (tail component;853) pept 13100 13429 M (tail component;109) pept 13429 14127 L (tail component;232) pept 14276 14875 K (tail component;199) pept 14773 15444 I (tail component;223) pept 15505 18903 J (tail:host specificity;1132) pept 18965 19585 lom (outer host membrane;206a) pept 19650 20855 orf-401 pept 21029 21973 orf-314 pept 21973 22557 orf-194 pept 23918 22686 (c) ea47 pept 25399 24509 (c) ea31 (296) pept 26973 25396 (c) ea59 (525) pept 28882 27812 (c) int (integration;356) pept 29078 28860 (c) xis (excision;72) pept 29655 29374 (c) ea8.5 (93) pept 30395 29847 (c) ea22 (182) pept 32028 31348 (c) exo (exonuclease;226) pept 32810 32025 (c) bet (recombination;261) pept 33232 32816 (c) gam (recombination;138) pept 33330 33187 (c) kil(host-killing;54) pept 33463 33299 (c) cIII (antitermination;89) pept 33904 33536 (c) ea10 (ssb;122) pept 34287 34087 (c) ral(restriction alleviation;66) pept 35438 35037 (c) N (early gene regulator;133) pept 36259 35825 (c) rexb (exclusion;144) pept 37114 36275 (c) rexa (exclusion;279) pept 37940 37227 (c) cI (repressor;237) pept 38041 38241 cro (antirepressor; also tof;66) pept 38360 38653 cII (antitermination;119) pept 38686 39585 O (DNA replication;299) pept 39582 40283 P (DNA replication;233) pept 40280 40570 ren(exclusion;96) pept 40644 
41084 Nin 146 (pept unknown;146) pept 41081 41953 Nin 290 (pept unknown;290) pept 41950 42123 Nin 57 (pept unknown;57) pept 42090 42272 Nin 60 (pept unknown;60) pept 42269 42439 Nin 56 (pept unknown;56) pept 42429 43043 Nin 204 (pept unknown;204) pept 43040 43246 Nin 68 (pept unknown;68) pept 43224 43889 Nin 221 (pept unknown;221) pept 43886 44509 Q (late gene regulator;207) pept 44621 44815 orf-64 pept 45186 45509 S (cell lysis;107) pept 45493 45969 R (cell lysis;158) pept 45966 46427 Rz (cell lysis;153) mRNA 35582 18482 (c) mRNA-pl (alt.; via t'j4 terminator) mRNA 35582 18597 (c) mRNA-pl (alt.; via t'j3 terminator) mRNA 35582 18637 (c) mRNA-pl (alt.; via t'j2 terminator) mRNA 35582 18671 (c) mRNA-pl (alt.; via t'j1 terminator) mRNA 35582 27538 (c) mRNA-pl (alt.; via ti terminator) mRNA 35582 31262 (c) mRNA-pl (alt.; via tl3 terminator) mRNA 35582 33100 (c) mRNA-pl (alt.; via tl2d terminator) mRNA 35582 33141 (c) mRNA-pl (alt.; via tl2c terminator) mRNA 35582 33494 (c) mRNA-pl (alt.; via tl2b terminator) mRNA 35582 33930 (c) mRNA-pl (alt.; via tl2a terminator) mRNA 35582 34560 (c) mRNA-pl (alt.; via tl1 terminator) mRNA 29065 27538 (c) mRNA int (integration; 356; via ti terminator) mRNA 23231 < 23231 (c) mRNA-pbl mRNA 37940 35798 (c) mRNA-prm (via timm terminator) mRNA 38343 35798 (c) mRNA-pre (via timm terminator) mRNA 36256 35798 (c) mRNA-plit (via timm terminator) mRNA 38023 38135 mRNA-pr (alt.; via tr0 terminator) mRNA 38023 38315 mRNA-pr (alt.; via tr1a terminator) mRNA 38023 38337 mRNA-pr (alt.; via tr1b terminator) mRNA 38023 38370 mRNA-pr (alt.; via tr1c terminator) mRNA 38023 40624 mRNA-pr (alt.; via tr2 terminator) mRNA 38675 38599 (c) mRNA-oop transcription mRNA mRNA 44587 44780 mRNA-pr' transcription (late genes) mRNA binding 27583 27602 int binding site 1 binding 27615 27634 int binding site 2 binding 27714 27747 int binding site 3 binding 27814 27854 int binding site 4 binding 35534 35518 (c) Nutl(N-utilization leftward)putative binding site binding 
38265 38281 Nutr(N-utilization rightward)putative binding site binding 39034 39051 ori iteron 1(O binding site) binding 39054 39071 ori iteron 2(O binding site) binding 39078 39095 ori iteron 3(O binding site) binding 39101 39118 ori iteron 4(O binding site) mut 19368 23278 lac5 substitution mut 21737 > 21737 b2 substitution terminating at the att site recomb 24389 24390 lambda::lambdoid hybridization site mut 27537 27537 t in sib3 , c in wild-type mut 27547 27547 a in hef13 , g in wild-type mut 27568 27568 a in sib2 , c in wild-type mut 27573 27573 t in sib1 , g in wild-type recomb 27723 27738 attachment core(att)for host chromosome insertion mut 29063 29063 a in xis am6 , g in wild-type recomb 31266 31267 lambda::lambdoid hybridization site mut 34378 38617 imm21 region mut 35528 35528 a in Nutl63,g in Nutl96,t in Nutl18,c in wild-type mut 35530 35530 g in wild-type deleted in Nutl3 mut 35583 38245 imm434 region mut 35596 35596 a in vir2, t in v003, c in wild-type mut 35606 35606 c in vir101 , t in wild-type signal 35607 35591 (c) operator-l1 (first base on comp strand) mut 35621 35621 t in v305 , c in wild-type mut 35622 35622 t in v305 , g in wild-type signal 35631 35615 (c) operator-l2 (first base on comp strand) signal 35651 35635 (c) operator-l3 (first base on comp strand) mut 35940 35940 a in rex209 , g in wild-type mut 35947 35947 a in rex111 , g in wild-type mut 37287 37287 a in cIam14, c in wild-type mut 37308 37308 c in cIam504, g in wild-type mut 37313 37313 a in cIam505, g in wild-type variant 37589 37589 t in strain cI857s7([2]); c in wild type mut 37589 37589 t in ind1 , c in wild-type mut 37629 37629 c in cIam499, g in wild-type mut 37635 37635 c in cIam212, a in wild-type mut 37680 37680 a in cIam34, c in wild-type variant 37742 37742 t in strain ci857s7([2]); c in wild-type mut 37742 37742 t in ci857 , c in wild-type mut 37808 37808 a in cIam282, g in wild-type mut 37872 37872 c in cIam302, a in wild-type signal 37951 37967 operator-r3 mut 37954 
37954 t in prm-e37 , c in wild-type mut 37955 37955 g in vc3 , a in wild-type mut 37957 37957 t in or3-r1 , c in wild-type mut 37958 37958 t in or3-r2, a in or3-r3 mutants, g in wild-type mut 37965 37965 g in or3-c12 , a in wild-type mut 37966 37966 c in or3-c10 , t in wild-type mut 37971 37971 g in p-rmup-1 , a in wild-type mut 37973 37973 t in prm-m104, 116, u31 mutants, c in wild-type signal 37974 37990 operator-r2 mut 37978 37978 t in prm-e104, g in vc3, a in wild-type mut 37979 37979 a in virl, t in prm-e93, c in wild-type mut 37985 37985 t in vn , g in wild-type mut 37989 37989 t deleted in mah4 mutant mut 37990 37990 g deleted in mch9 mutant mut 37991 37991 g in pr-x3 , a in wild-type signal 37998 38014 operator-r1 mut 38003 38003 a in vs326 , c in wild-type mut 38007 38007 t in prm-uv8, a in vir3, c in wild-type mut 38008 38008 a in prm-uv93, m36 mutants, g in wild-type mut 38009 38009 c in vs387, t in vc1, g in wild-type site 38249 38266 rho utilization site A (rutA) site 38282 38301 rho utilization site B (rutB) mut 38302 38302 a in cin-1 , g in wild-type mut 38306 38306 c in cnc1 , t in wild-type mut 38307 38307 g in cnc8 , a in wild-type mut 38350 38350 g in cy3048, a in wild-type mut 38354 38354 c in cy2001, t in wild-type mut 38357 38357 t in cy3019, c in wild-type mut 38364 38364 g in can1 , t in wild-type mut 38370 38370 t in cy3003 , c in wild-type mut 38371 38371 t in cy42 , a in wild-type mut 38376 38376 g in cy844 , a in wild-type mut 38379 38379 a in cy3008 , g in wild-type mut 38380 38380 t in cy3001 , c in wild-type mut 38430 38430 c in cII2002 , t in wild-type signal 38543 38557 ice(inceptor signal for DNA replication) mut 39122 39122 a in ti-12 , c in wild-type recomb 39157 39158 lambda::lambdoid hybridization site recomb 39165 39166 lambda::lambdoid hybridization site mut 39268 39268 t in ric5b , c in wild-type mut 39292 39292 a in ric5b , g in wild-type mut 40501 43307 Nin5 substitution variant 43082 43082 a in strain cI857s7 ([2]); g in 
wild-type unsure 43082 43082 g or a, cited in [8] recomb 43884 43885 lambda::lambdoid hybridization site variant 45352 45352 a in strain cI857s7 ([2]); g in wild-type mut 45352 45352 a in sam7 , g in wild-type BASE COUNT 12334 a 11362 c 12820 g 11986 t ORIGIN 5' end of the l-strand of the lambda chromosome (sticky end). 1 gggcggcgac ctcgcgggtt ttcgctattt atgaaaattt tccggtttaa ggcgtttccg 61 ttcttcttcg tcataactta atgtttttat ttaaaatacc ctctgaaaag aaaggaaacg 121 acaggtgctg aaagcgaggc tttttggcct ctgtcgtttc ctttctctgt ttttgtccgt 181 ggaatgaaca atggaagtca acaaaaagca gctggctgac attttcggtg cgagtatccg 241 taccattcag aactggcagg aacagggaat gcccgttctg cgaggcggtg gcaagggtaa 301 tgaggtgctt tatgactctg ccgccgtcat aaaatggtat gccgaaaggg atgctgaaat 361 tgagaacgaa aagctgcgcc gggaggttga agaactgcgg caggccagcg aggcagatct 421 ccagccagga actattgagt acgaacgcca tcgacttacg cgtgcgcagg ccgacgcaca 481 ggaactgaag aatgccagag actccgctga agtggtggaa accgcattct gtactttcgt 541 gctgtcgcgg atcgcaggtg aaattgccag tattctcgac gggctccccc tgtcggtgca 601 gcggcgtttt ccggaactgg aaaaccgaca tgttgatttc ctgaaacggg atatcatcaa 661 agccatgaac aaagcagccg cgctggatga actgataccg gggttgctga gtgaatatat 721 cgaacagtca ggttaacagg ctgcggcatt ttgtccgcgc cgggcttcgc tcactgttca 781 ggccggagcc acagaccgcc gttgaatggg cggatgctaa ttactatctc ccgaaagaat 841 ccgcatacca ggaagggcgc tgggaaacac tgccctttca gcgggccatc atgaatgcga 901 tgggcagcga ctacatccgt gaggtgaatg tggtgaagtc tgcccgtgtc ggttattcca 961 aaatgctgct gggtgtttat gcctacttta tagagcataa gcagcgcaac acccttatct 1021 ggttgccgac ggatggtgat gccgagaact ttatgaaaac ccacgttgag ccgactattc 1081 gtgatattcc gtcgctgctg gcgctggccc cgtggtatgg caaaaagcac cgggataaca 1141 cgctcaccat gaagcgtttc actaatgggc gtggcttctg gtgcctgggc ggtaaagcgg 1201 caaaaaacta ccgtgaaaag tcggtggatg tggcgggtta tgatgaactt gctgcttttg 1261 atgatgatat tgaacaggaa ggctctccga cgttcctggg tgacaagcgt attgaaggct 1321 cggtctggcc aaagtccatc cgtggctcca cgccaaaagt gagaggcacc tgtcagattg 1381 agcgtgcagc cagtgaatcc ccgcatttta tgcgttttca tgttgcctgc 
ccgcattgcg 1441 gggaggagca gtatcttaaa tttggcgaca aagagacgcc gtttggcctc aaatggacgc 1501 cggatgaccc ctccagcgtg ttttatctct gcgagcataa tgcctgcgtc atccgccagc 1561 aggagctgga ctttactgat gcccgttata tctgcgaaaa gaccgggatc tggacccgtg 1621 atggcattct ctggttttcg tcatccggtg aagagattga gccacctgac agtgtgacct 1681 ttcacatctg gacagcgtac agcccgttca ccacctgggt gcagattgtc aaagactgga 1741 tgaaaacgaa aggggatacg ggaaaacgta aaaccttcgt aaacaccacg ctcggtgaga 1801 cgtgggaggc gaaaattggc gaacgtccgg atgctgaagt gatggcagag cggaaagagc 1861 attattcagc gcccgttcct gaccgtgtgg cttacctgac cgccggtatc gactcccagc 1921 tggaccgcta cgaaatgcgc gtatggggat gggggccggg tgaggaaagc tggctgattg 1981 accggcagat tattatgggc cgccacgacg atgaacagac gctgctgcgt gtggatgagg 2041 ccatcaataa aacctatacc cgccggaatg gtgcagaaat gtcgatatcc cgtatctgct 2101 gggatactgg cgggattgac ccgaccattg tgtatgaacg ctcgaaaaaa catgggctgt 2161 tccgggtgat ccccattaaa ggggcatccg tctacggaaa gccggtggcc agcatgccac 2221 gtaagcgaaa caaaaacggg gtttacctta ccgaaatcgg tacggatacc gcgaaagagc 2281 agatttataa ccgcttcaca ctgacgccgg aaggggatga accgcttccc ggtgccgttc 2341 acttcccgaa taacccggat atttttgatc tgaccgaagc gcagcagctg actgctgaag 2401 agcaggtcga aaaatgggtg gatggcagga aaaaaatact gtgggacagc aaaaagcgac 2461 gcaatgaggc actcgactgc ttcgtttatg cgctggcggc gctgcgcatc agtatttccc 2521 gctggcagct ggatctcagt gcgctgctgg cgagcctgca ggaagaggat ggtgcagcaa 2581 ccaacaagaa aacactggca gattacgccc gtgccttatc cggagaggat gaatgacgcg 2641 acaggaagaa cttgccgctg cccgtgcggc actgcatgac ctgatgacag gtaaacgggt 2701 ggcaacagta cagaaagacg gacgaagggt ggagtttacg gccacttccg tgtctgacct 2761 gaaaaaatat attgcagagc tggaagtgca gaccggcatg acacagcgac gcaggggacc 2821 tgcaggattt tatgtatgaa aacgcccacc attcccaccc ttctggggcc ggacggcatg 2881 acatcgctgc gcgaatatgc cggttatcac ggcggtggca gcggatttgg agggcagttg 2941 cggtcgtgga acccaccgag tgaaagtgtg gatgcagccc tgttgcccaa ctttacccgt 3001 ggcaatgccc gcgcagacga tctggtacgc aataacggct atgccgccaa cgccatccag 3061 ctgcatcagg atcatatcgt cgggtctttt ttccggctca gtcatcgccc aagctggcgc 
3121 tatctgggca tcggggagga agaagcccgt gccttttccc gcgaggttga agcggcatgg 3181 aaagagtttg ccgaggatga ctgctgctgc attgacgttg agcgaaaacg cacgtttacc 3241 atgatgattc gggaaggtgt ggccatgcac gcctttaacg gtgaactgtt cgttcaggcc 3301 acctgggata ccagttcgtc gcggcttttc cggacacagt tccggatggt cagcccgaag 3361 cgcatcagca acccgaacaa taccggcgac agccggaact gccgtgccgg tgtgcagatt 3421 aatgacagcg gtgcggcgct gggatattac gtcagcgagg acgggtatcc tggctggatg 3481 ccgcagaaat ggacatggat accccgtgag ttacccggcg ggcgcgcctc gttcattcac 3541 gtttttgaac ccgtggagga cgggcagact cgcggtgcaa atgtgtttta cagcgtgatg 3601 gagcagatga agatgctcga cacgctgcag aacacgcagc tgcagagcgc cattgtgaag 3661 gcgatgtatg ccgccaccat tgagagtgag ctggatacgc agtcagcgat ggattttatt 3721 ctgggcgcga acagtcagga gcagcgggaa aggctgaccg gctggattgg tgaaattgcc 3781 gcgtattacg ccgcagcgcc ggtccggctg ggaggcgcaa aagtaccgca cctgatgccg 3841 ggtgactcac tgaacctgca gacggctcag gatacggata acggctactc cgtgtttgag 3901 cagtcactgc tgcggtatat cgctgccggg ctgggtgtct cgtatgagca gctttcccgg 3961 aattacgccc agatgagcta ctccacggca cgggccagtg cgaacgagtc gtgggcgtac 4021 tttatggggc ggcgaaaatt cgtcgcatcc cgtcaggcga gccagatgtt tctgtgctgg 4081 ctggaagagg ccatcgttcg ccgcgtggtg acgttacctt caaaagcgcg cttcagtttt 4141 caggaagccc gcagtgcctg ggggaactgc gactggatag gctccggtcg tatggccatc 4201 gatggtctga aagaagttca ggaagcggtg atgctgatag aagccggact gagtacctac 4261 gagaaagagt gcgcaaaacg cggtgacgac tatcaggaaa tttttgccca gcaggtccgt 4321 gaaacgatgg agcgccgtgc agccggtctt aaaccgcccg cctgggcggc tgcagcattt 4381 gaatccgggc tgcgacaatc aacagaggag gagaagagtg acagcagagc tgcgtaatct 4441 cccgcatatt gccagcatgg cctttaatga gccgctgatg cttgaacccg cctatgcgcg 4501 ggttttcttt tgtgcgcttg caggccagct tgggatcagc agcctgacgg atgcggtgtc 4561 cggcgacagc ctgactgccc aggaggcact cgcgacgctg gcattatccg gtgatgatga 4621 cggaccacga caggcccgca gttatcaggt catgaacggc atcgccgtgc tgccggtgtc 4681 cggcacgctg gtcagccgga cgcgggcgct gcagccgtac tcggggatga ccggttacaa 4741 cggcattatc gcccgtctgc aacaggctgc cagcgatccg atggtggacg gcattctgct 4801 
cgatatggac acgcccggcg ggatggtggc gggggcattt gactgcgctg acatcatcgc 4861 ccgtgtgcgt gacataaaac cggtatgggc gcttgccaac gacatgaact gcagtgcagg 4921 tcagttgctt gccagtgccg cctcccggcg tctggtcacg cagaccgccc ggacaggctc 4981 catcggcgtc atgatggctc acagtaatta cggtgctgcg ctggagaaac agggtgtgga 5041 aatcacgctg atttacagcg gcagccataa ggtggatggc aacccctaca gccatcttcc 5101 ggatgacgtc cgggagacac tgcagtcccg gatggacgca acccgccaga tgtttgcgca 5161 gaaggtgtcg gcatataccg gcctgtccgt gcaggttgtg ctggataccg aggctgcagt 5221 gtacagcggt caggaggcca ttgatgccgg actggctgat gaacttgtta acagcaccga 5281 tgcgatcacc gtcatgcgtg atgcactgga tgcacgtaaa tcccgtctct caggagggcg 5341 aatgaccaaa gagactcaat caacaactgt ttcagccact gcttcgcagg ctgacgttac 5401 tgacgtggtg ccagcgacgg agggcgagaa cgccagcgcg gcgcagccgg acgtgaacgc 5461 gcagatcacc gcagcggttg cggcagaaaa cagccgcatt atggggatcc tcaactgtga 5521 ggaggctcac ggacgcgaag aacaggcacg cgtgctggca gaaacccccg gtatgaccgt 5581 gaaaacggcc cgccgcattc tggccgcagc accacagagt gcacaggcgc gcagtgacac 5641 tgcgctggat cgtctgatgc agggggcacc ggcaccgctg gctgcaggta acccggcatc 5701 tgatgccgtt aacgatttgc tgaacacacc agtgtaaggg atgtttatga cgagcaaaga 5761 aacctttacc cattaccagc cgcagggcaa cagtgacccg gctcataccg caaccgcgcc 5821 cggcggattg agtgcgaaag cgcctgcaat gaccccgctg atgctggaca cctccagccg 5881 taagctggtt gcgtgggatg gcaccaccga cggtgctgcc gttggcattc ttgcggttgc 5941 tgctgaccag accagcacca cgctgacgtt ctacaagtcc ggcacgttcc gttatgagga 6001 tgtgctctgg ccggaggctg ccagcgacga gacgaaaaaa cggaccgcgt ttgccggaac 6061 ggcaatcagc atcgtttaac tttacccttc atcactaaag gccgcctgtg cggctttttt 6121 tacgggattt ttttatgtcg atgtacacaa ccgcccaact gctggcggca aatgagcaga 6181 aatttaagtt tgatccgctg tttctgcgtc tctttttccg tgagagctat cccttcacca 6241 cggagaaagt ctatctctca caaattccgg gactggtaaa catggcgctg tacgtttcgc 6301 cgattgtttc cggtgaggtt atccgttccc gtggcggctc cacctctgaa tttacgccgg 6361 gatatgtcaa gccgaagcat gaagtgaatc cgcagatgac cctgcgtcgc ctgccggatg 6421 aagatccgca gaatctggcg gacccggctt accgccgccg tcgcatcatc atgcagaaca 6481 tgcgtgacga 
agagctggcc attgctcagg tcgaagagat gcaggcagtt tctgccgtgc 6541 ttaagggcaa atacaccatg accggtgaag ccttcgatcc ggttgaggtg gatatgggcc 6601 gcagtgagga gaataacatc acgcagtccg gcggcacgga gtggagcaag cgtgacaagt 6661 ccacgtatga cccgaccgac gatatcgaag cctacgcgct gaacgccagc ggtgtggtga 6721 atatcatcgt gttcgatccg aaaggctggg cgctgttccg ttccttcaaa gccgtcaagg 6781 agaagctgga tacccgtcgt ggctctaatt ccgagctgga gacagcggtg aaagacctgg 6841 gcaaagcggt gtcctataag gggatgtatg gcgatgtggc catcgtcgtg tattccggac 6901 agtacgtgga aaacggcgtc aaaaagaact tcctgccgga caacacgatg gtgctgggga 6961 acactcaggc acgcggtctg cgcacctatg gctgcattca ggatgcggac gcacagcgcg 7021 aaggcattaa cgcctctgcc cgttacccga aaaactgggt gaccaccggc gatccggcgc 7081 gtgagttcac catgattcag tcagcaccgc tgatgctgct ggctgaccct gatgagttcg 7141 tgtccgtaca actggcgtaa tcatggccct tcggggccat tgtttctctg tggaggagtc 7201 catgacgaaa gatgaactga ttgcccgtct ccgctcgctg ggtgaacaac tgaaccgtga 7261 tgtcagcctg acggggacga aagaagaact ggcgctccgt gtggcagagc tgaaagagga 7321 gcttgatgac acggatgaaa ctgccggtca ggacacccct ctcagccggg aaaatgtgct 7381 gaccggacat gaaaatgagg tgggatcagc gcagccggat accgtgattc tggatacgtc 7441 tgaactggtc acggtcgtgg cactggtgaa gctgcatact gatgcacttc acgccacgcg 7501 ggatgaacct gtggcatttg tgctgccggg aacggcgttt cgtgtctctg ccggtgtggc 7561 agccgaaatg acagagcgcg gcctggccag aatgcaataa cgggaggcgc tgtggctgat 7621 ttcgataacc tgttcgatgc tgccattgcc cgcgccgatg aaacgatacg cgggtacatg 7681 ggaacgtcag ccaccattac atccggtgag cagtcaggtg cggtgatacg tggtgttttt 7741 gatgaccctg aaaatatcag ctatgccgga cagggcgtgc gcgttgaagg ctccagcccg 7801 tccctgtttg tccggactga tgaggtgcgg cagctgcggc gtggagacac gctgaccatc 7861 ggtgaggaaa atttctgggt agatcgggtt tcgccggatg atggcggaag ttgtcatctc 7921 tggcttggac ggggcgtacc gcctgccgtt aaccgtcgcc gctgaaaggg ggatgtatgg 7981 ccataaaagg tcttgagcag gccgttgaaa acctcagccg tatcagcaaa acggcggtgc 8041 ctggtgccgc cgcaatggcc attaaccgcg ttgcttcatc cgcgatatcg cagtcggcgt 8101 cacaggttgc ccgtgagaca aaggtacgcc ggaaactggt aaaggaaagg gccaggctga 8161 aaagggccac ggtcaaaaat 
ccgcaggcca gaatcaaagt taaccggggg gatttgcccg 8221 taatcaagct gggtaatgcg cgggttgtcc tttcgcgccg caggcgtcgt aaaaaggggc 8281 agcgttcatc cctgaaaggt ggcggcagcg tgcttgtggt gggtaaccgt cgtattcccg 8341 gcgcgtttat tcagcaactg aaaaatggcc ggtggcatgt catgcagcgt gtggctggga 8401 aaaaccgtta ccccattgat gtggtgaaaa tcccgatggc ggtgccgctg accacggcgt 8461 ttaaacaaaa tattgagcgg atacggcgtg aacgtcttcc gaaagagctg ggctatgcgc 8521 tgcagcatca actgaggatg gtaataaagc gatgaaacat actgaactcc gtgcagccgt 8581 actggatgca ctggagaagc atgacaccgg ggcgacgttt tttgatggtc gccccgctgt 8641 ttttgatgag gcggattttc cggcagttgc cgtttatctc accggcgctg aatacacggg 8701 cgaagagctg gacagcgata cctggcaggc ggagctgcat atcgaagttt tcctgcctgc 8761 tcaggtgccg gattcagagc tggatgcgtg gatggagtcc cggatttatc cggtgatgag 8821 cgatatcccg gcactgtcag atttgatcac cagtatggtg gccagcggct atgactaccg 8881 gcgcgacgat gatgcgggct tgtggagttc agccgatctg acttatgtca ttacctatga 8941 aatgtgagga cgctatgcct gtaccaaatc ctacaatgcc ggtgaaaggt gccgggacca 9001 ccctgtgggt ttataagggg agcggtgacc cttacgcgaa tccgctttca gacgttgact 9061 ggtcgcgtct ggcaaaagtt aaagacctga cgcccggcga actgaccgct gagtcctatg 9121 acgacagcta tctcgatgat gaagatgcag actggactgc gaccgggcag gggcagaaat 9181 ctgccggaga taccagcttc acgctggcgt ggatgcccgg agagcagggg cagcaggcgc 9241 tgctggcgtg gtttaatgaa ggcgataccc gtgcctataa aatccgcttc ccgaacggca 9301 cggtcgatgt gttccgtggc tgggtcagca gtatcggtaa ggcggtgacg gcgaaggaag 9361 tgatcacccg cacggtgaaa gtcaccaatg tgggacgtcc gtcgatggca gaagatcgca 9421 gcacggtaac agcggcaacc ggcatgaccg tgacgcctgc cagcacctcg gtggtgaaag 9481 ggcagagcac cacgctgacc gtggccttcc agccggaggg cgtaaccgac aagagctttc 9541 gtgcggtgtc tgcggataaa acaaaagcca ccgtgtcggt cagtggtatg accatcaccg 9601 tgaacggcgt tgctgcaggc aaggtcaaca ttccggttgt atccggtaat ggtgagtttg 9661 ctgcggttgc agaaattacc gtcaccgcca gttaatccgg agagtcagcg atgttcctga 9721 aaaccgaatc atttgaacat aacggtgtga ccgtcacgct ttctgaactg tcagccctgc 9781 agcgcattga gcatctcgcc ctgatgaaac ggcaggcaga acaggcggag tcagacagca 9841 accggaagtt tactgtggaa gacgccatca 
gaaccggcgc gtttctggtg gcgatgtccc 9901 tgtggcataa ccatccgcag aagacgcaga tgccgtccat gaatgaagcc gttaaacaga 9961 ttgagcagga agtgcttacc acctggccca cggaggcaat ttctcatgct gaaaacgtgg 10021 tgtaccggct gtctggtatg tatgagtttg tggtgaataa tgcccctgaa cagacagagg 10081 acgccgggcc cgcagagcct gtttctgcgg gaaagtgttc gacggtgagc tgagttttgc 10141 cctgaaactg gcgcgtgaga tggggcgacc cgactggcgt gccatgcttg ccgggatgtc 10201 atccacggag tatgccgact ggcaccgctt ttacagtacc cattattttc atgatgttct 10261 gctggatatg cacttttccg ggctgacgta caccgtgctc agcctgtttt tcagcgatcc 10321 ggatatgcat ccgctggatt tcagtctgct gaaccggcgc gaggctgacg aagagcctga 10381 agatgatgtg ctgatgcaga aagcggcagg gcttgccgga ggtgtccgct ttggcccgga 10441 cgggaatgaa gttatccccg cttccccgga tgtggcggac atgacggagg atgacgtaat 10501 gctgatgaca gtatcagaag ggatcgcagg aggagtccgg tatggctgaa ccggtaggcg 10561 atctggtcgt tgatttgagt ctggatgcgg ccagatttga cgagcagatg gccagagtca 10621 ggcgtcattt ttctggtacg gaaagtgatg cgaaaaaaac agcggcagtc gttgaacagt 10681 cgctgagccg acaggcgctg gctgcacaga aagcggggat ttccgtcggg cagtataaag 10741 ccgccatgcg tatgctgcct gcacagttca ccgacgtggc cacgcagctt gcaggcgggc 10801 aaagtccgtg gctgatcctg ctgcaacagg gggggcaggt gaaggactcc ttcggcggga 10861 tgatccccat gttcaggggg cttgccggtg cgatcaccct gccgatggtg ggggccacct 10921 cgctggcggt ggcgaccggt gcgctggcgt atgcctggta tcagggcaac tcaaccctgt 10981 ccgatttcaa caaaacgctg gtcctttccg gcaatcaggc gggactgacg gcagatcgta 11041 tgctggtcct gtccagagcc gggcaggcgg cagggctgac gtttaaccag accagcgagt 11101 cactcagcgc actggttaag gcgggggtaa gcggtgaggc tcagattgcg tccatcagcc 11161 agagtgtggc gcgtttctcc tctgcatccg gcgtggaggt ggacaaggtc gctgaagcct 11221 tcgggaagct gaccacagac ccgacgtcgg ggctgacggc gatggctcgc cagttccata 11281 acgtgtcggc ggagcagatt gcgtatgttg ctcagttgca gcgttccggc gatgaagccg 11341 gggcattgca ggcggcgaac gaggccgcaa cgaaagggtt tgatgaccag acccgccgcc 11401 tgaaagagaa catgggcacg ctggagacct gggcagacag gactgcgcgg gcattcaaat 11461 ccatgtggga tgcggtgctg gatattggtc gtcctgatac cgcgcaggag atgctgatta 11521 aggcagaggc 
tgcgtataag aaagcagacg acatctggaa tctgcgcaag gatgattatt 11581 ttgttaacga tgaagcgcgg gcgcgttact gggatgatcg tgaaaaggcc cgtcttgcgc 11641 ttgaagccgc ccgaaagaag gctgagcagc agactcaaca ggacaaaaat gcgcagcagc 11701 agagcgatac cgaagcgtca cggctgaaat ataccgaaga ggcgcagaag gcttacgaac 11761 ggctgcagac gccgctggag aaatataccg cccgtcagga agaactgaac aaggcactga 11821 aagacgggaa aatcctgcag gcggattaca acacgctgat ggcggcggcg aaaaaggatt 11881 atgaagcgac gctgaaaaag ccgaaacagt ccagcgtgaa ggtgtctgcg ggcgatcgtc 11941 aggaagacag tgctcatgct gccctgctga cgcttcaggc agaactccgg acgctggaga 12001 agcatgccgg agcaaatgag aaaatcagcc agcagcgccg ggatttgtgg aaggcggaga 12061 gtcagttcgc ggtactggag gaggcggcgc aacgtcgcca gctgtctgca caggagaaat 12121 ccctgctggc gcataaagat gagacgctgg agtacaaacg ccagctggct gcacttggcg 12181 acaaggttac gtatcaggag cgcctgaacg cgctggcgca gcaggcggat aaattcgcac 12241 agcagcaacg ggcaaaacgg gccgccattg atgcgaaaag ccgggggctg actgaccggc 12301 aggcagaacg ggaagccacg gaacagcgcc tgaaggaaca gtatggcgat aatccgctgg 12361 cgctgaataa cgtcatgtca gagcagaaaa agacctgggc ggctgaagac cagcttcgcg 12421 ggaactggat ggcaggcctg aagtccggct ggagtgagtg ggaagagagc gccacggaca 12481 gtatgtcgca ggtaaaaagt gcagccacgc agacctttga tggtattgca cagaatatgg 12541 cggcgatgct gaccggcagt gagcagaact ggcgcagctt cacccgttcc gtgctgtcca 12601 tgatgacaga aattctgctt aagcaggcaa tggtggggat tgtcgggagt atcggcagcg 12661 ccattggcgg ggctgttggt ggcggcgcat ccgcgtcagg cggtacagcc attcaggccg 12721 ctgcggcgaa attccatttt gcaaccggag gatttacggg aaccggcggc aaatatgagc 12781 cagcggggat tgttcaccgt ggtgagtttg tcttcacgaa ggaggcaacc agccggattg 12841 gcgtggggaa tctttaccgg ctgatgcgcg gctatgccac cggcggttat gtcggtacac 12901 cgggcagcat ggcagacagc cggtcgcagg cgtccgggac gtttgagcag aataaccatg 12961 tggtgattaa caacgacggc acgaacgggc agataggtcc ggctgctctg aaggcggtgt 13021 atgacatggc ccgcaagggt gcccgtgatg aaattcagac acagatgcgt gatggtggcc 13081 tgttctccgg aggtggacga tgaagacctt ccgctggaaa gtgaaacccg gtatggatgt 13141 ggcttcggtc ccttctgtaa gaaaggtgcg ctttggtgat ggctattctc agcgagcgcc 
13201 tgccgggctg aatgccaacc tgaaaacgta cagcgtgacg ctttctgtcc cccgtgagga 13261 ggccacggta ctggagtcgt ttctggaaga gcacgggggc tggaaatcct ttctgtggac 13321 gccgccttat gagtggcggc agataaaggt gacctgcgca aaatggtcgt cgcgggtcag 13381 tatgctgcgt gttgagttca gcgcagagtt tgaacaggtg gtgaactgat gcaggatatc 13441 cggcaggaaa cactgaatga atgcacccgt gcggagcagt cggccagcgt ggtgctctgg 13501 gaaatcgacc tgacagaggt cggtggagaa cgttattttt tctgtaatga gcagaacgaa 13561 aaaggtgagc cggtcacctg gcaggggcga cagtatcagc cgtatcccat tcaggggagc 13621 ggttttgaac tgaatggcaa aggcaccagt acgcgcccca cgctgacggt ttctaacctg 13681 tacggtatgg tcaccgggat ggcggaagat atgcagagtc tggtcggcgg aacggtggtc 13741 cggcgtaagg tttacgcccg ttttctggat gcggtgaact tcgtcaacgg aaacagttac 13801 gccgatccgg agcaggaggt gatcagccgc tggcgcattg agcagtgcag cgaactgagc 13861 gcggtgagtg cctcctttgt actgtccacg ccgacggaaa cggatggcgc tgtttttccg 13921 ggacgtatca tgctggccaa cacctgcacc tggacctatc gcggtgacga gtgcggttat 13981 agcggtccgg ctgtcgcgga tgaatatgac cagccaacgt ccgatatcac gaaggataaa 14041 tgcagcaaat gcctgagcgg ttgtaagttc cgcaataacg tcggcaactt tggcggcttc 14101 ctttccatta acaaactttc gcagtaaatc ccatgacaca gacagaatca gcgattctgg 14161 cgcacgcccg gcgatgtgcg ccagcggagt cgtgcggctt cgtggtaagc acgccggagg 14221 gggaaagata tttcccctgc gtgaatatct ccggtgagcc ggaggctatt tccgtatgtc 14281 gccggaagac tggctgcagg cagaaatgca gggtgagatt gtggcgctgg tccacagcca 14341 ccccggtggt ctgccctggc tgagtgaggc cgaccggcgg ctgcaggtgc agagtgattt 14401 gccgtggtgg ctggtctgcc gggggacgat tcataagttc cgctgtgtgc cgcatctcac 14461 cgggcggcgc tttgagcacg gtgtgacgga ctgttacaca ctgttccggg atgcttatca 14521 tctggcgggg attgagatgc cggactttca tcgtgaggat gactggtggc gtaacggcca 14581 gaatctctat ctggataatc tggaggcgac ggggctgtat caggtgccgt tgtcagcggc 14641 acagccgggc gatgtgctgc tgtgctgttt tggttcatca gtgccgaatc acgccgcaat 14701 ttactgcggc gacggcgagc tgctgcacca tattcctgaa caactgagca aacgagagag 14761 gtacaccgac aaatggcagc gacgcacaca ctccctctgg cgtcaccggg catggcgcgc 14821 atctgccttt acggggattt acaacgattt ggtcgccgca 
tcgaccttcg tgtgaaaacg 14881 ggggctgaag ccatccgggc actggccaca cagctcccgg cgtttcgtca gaaactgagc 14941 gacggctggt atcaggtacg gattgccggg cgggacgtca gcacgtccgg gttaacggcg 15001 cagttacatg agactctgcc tgatggcgct gtaattcata ttgttcccag agtcgccggg 15061 gccaagtcag gtggcgtatt ccagattgtc ctgggggctg ccgccattgc cggatcattc 15121 tttaccgccg gagccaccct tgcagcatgg ggggcagcca ttggggccgg tggtatgacc 15181 ggcatcctgt tttctctcgg tgccagtatg gtgctcggtg gtgtggcgca gatgctggca 15241 ccgaaagcca gaactccccg tatacagaca acggataacg gtaagcagaa cacctatttc 15301 tcctcactgg ataacatggt tgcccagggc aatgttctgc ctgttctgta cggggaaatg 15361 cgcgtggggt cacgcgtggt ttctcaggag atcagcacgg cagacgaagg ggacggtggt 15421 caggttgtgg tgattggtcg ctgatgcaaa atgttttatg tgaaaccgcc tgcgggcggt 15481 tttgtcattt atggagcgtg aggaatgggt aaaggaagca gtaaggggca taccccgcgc 15541 gaagcgaagg acaacctgaa gtccacgcag ttgctgagtg tgatcgatgc catcagcgaa 15601 gggccgattg aaggtccggt ggatggctta aaaagcgtgc tgctgaacag tacgccggtg 15661 ctggacactg aggggaatac caacatatcc ggtgtcacgg tggtgttccg ggctggtgag 15721 caggagcaga ctccgccgga gggatttgaa tcctccggct ccgagacggt gctgggtacg 15781 gaagtgaaat atgacacgcc gatcacccgc accattacgt ctgcaaacat cgaccgtctg 15841 cgctttacct tcggtgtaca ggcactggtg gaaaccacct caaagggtga caggaatccg 15901 tcggaagtcc gcctgctggt tcagatacaa cgtaacggtg gctgggtgac ggaaaaagac 15961 atcaccatta agggcaaaac cacctcgcag tatctggcct cggtggtgat gggtaacctg 16021 ccgccgcgcc cgtttaatat ccggatgcgc aggatgacgc cggacagcac cacagaccag 16081 ctgcagaaca aaacgctctg gtcgtcatac actgaaatca tcgatgtgaa acagtgctac 16141 ccgaacacgg cactggtcgg cgtgcaggtg gactcggagc agttcggcag ccagcaggtg 16201 agccgtaatt atcatctgcg cgggcgtatt ctgcaggtgc cgtcgaacta taacccgcag 16261 acgcggcaat acagcggtat ctgggacgga acgtttaaac cggcatacag caacaacatg 16321 gcctggtgtc tgtgggatat gctgacccat ccgcgctacg gcatggggaa acgtcttggt 16381 gcggcggatg tggataaatg ggcgctgtat gtcatcggcc agtactgcga ccagtcagtg 16441 ccggacggct ttggcggcac ggagccgcgc atcacctgta atgcgtacct gaccacacag 16501 cgtaaggcgt gggatgtgct 
cagcgatttc tgctcggcga tgcgctgtat gccggtatgg 16561 aacgggcaga cgctgacgtt cgtgcaggac cgaccgtcgg ataagacgtg gacctataac 16621 cgcagtaatg tggtgatgcc ggatgatggc gcgccgttcc gctacagctt cagcgccctg 16681 aaggaccgcc ataatgccgt tgaggtgaac tggattgacc cgaacaacgg ctgggagacg 16741 gcgacagagc ttgttgaaga tacgcaggcc attgcccgtt acggtcgtaa tgttacgaag 16801 atggatgcct ttggctgtac cagccggggg caggcacacc gcgccgggct gtggctgatt 16861 aaaacagaac tgctggaaac gcagaccgtg gatttcagcg tcggcgcaga agggcttcgc 16921 catgtaccgg gcgatgttat tgaaatctgc gatgatgact atgccggtat cagcaccggt 16981 ggtcgtgtgc tggcggtgaa cagccagacc cggacgctga cgctcgaccg tgaaatcacg 17041 ctgccatcct ccggtaccgc gctgataagc ctggttgacg gaagtggcaa tccggtcagc 17101 gtggaggttc agtccgtcac cgacggcgtg aaggtaaaag tgagccgtgt tcctgacggt 17161 gttgctgaat acagcgtatg ggagctgaag ctgccgacgc tgcgccagcg actgttccgc 17221 tgcgtgagta tccgtgagaa cgacgacggc acgtatgcca tcaccgccgt gcagcatgtg 17281 ccggaaaaag aggccatcgt ggataacggg gcgcactttg acggcgaaca gagtggcacg 17341 gtgaatggtg tcacgccgcc agcggtgcag cacctgaccg cagaagtcac tgcagacagc 17401 ggggaatatc aggtgctggc gcgatgggac acaccgaagg tggtgaaggg cgtgagtttc 17461 ctgctccgtc tgaccgtaac agcggacgac ggcagtgagc ggctggtcag cacggcccgg 17521 acgacggaaa ccacataccg cttcacgcaa ctggcgctgg ggaactacag gctgacagtc 17581 cgggcggtaa atgcgtgggg gcagcagggc gatccggcgt cggtatcgtt ccggattgcc 17641 gcaccggcag caccgtcgag gattgagctg acgccgggct attttcagat aaccgccacg 17701 ccgcatcttg ccgtttatga cccgacggta cagtttgagt tctggttctc ggaaaagcag 17761 attgcggata tcagacaggt tgaaaccagc acgcgttatc ttggtacggc gctgtactgg 17821 atagccgcca gtatcaatat caaaccgggc catgattatt acttttatat ccgcagtgtg 17881 aacaccgttg gcaaatcggc attcgtggag gccgtcggtc gggcgagcga tgatgcggaa 17941 ggttacctgg attttttcaa aggcaagata accgaatccc atctcggcaa ggagctgctg 18001 gaaaaagtcg agctgacgga ggataacgcc agcagactgg aggagttttc gaaagagtgg 18061 aaggatgcca gtgataagtg gaatgccatg tgggctgtca aaattgagca gaccaaagac 18121 ggcaaacatt atgtcgcggg tattggcctc agcatggagg acacggagga aggcaaactg 18181 
agccagtttc tggttgccgc caatcgtatc gcatttattg acccggcaaa cgggaatgaa 18241 acgccgatgt ttgtggcgca gggcaaccag atattcatga acgacgtgtt cctgaagcgc 18301 ctgacggccc ccaccattac cagcggcggc aatcctccgg ccttttccct gacaccggac 18361 ggaaagctga ccgctaaaaa tgcggatatc agtggcagtg tgaatgcgaa ctccgggacg 18421 ctcagtaatg tgacgatagc tgaaaactgt acgataaacg gtacgctgag ggcggaaaaa 18481 atcgtcgggg acattgtaaa ggcggcgagc gcggcttttc cgcgccagcg tgaaagcagt 18541 gtggactggc cgtcaggtac ccgtactgtc accgtgaccg atgaccatcc ttttgatcgc 18601 cagatagtgg tgcttccgct gacgtttcgc ggaagtaagc gtactgtcag cggcaggaca 18661 acgtattcga tgtgttatct gaaagtactg atgaacggtg cggtgattta tgatggcgcg 18721 gcgaacgagg cggtacaggt gttctcccgt attgttgaca tgccagcggg tcggggaaac 18781 gtgatcctga cgttcacgct tacgtccaca cggcattcgg cagatattcc gccgtatacg 18841 tttgccagcg atgtgcaggt tatggtgatt aagaaacagg cgctgggcat cagcgtggtc 18901 tgagtgtgtt acagaggttc gtccgggaac gggcgtttta ttataaaaca gtgagaggtg 18961 aacgatgcgt aatgtgtgta ttgccgttgc tgtctttgcc gcacttgcgg tgacagtcac 19021 tccggcccgt gcggaaggtg gacatggtac gtttacggtg ggctattttc aagtgaaacc 19081 gggtacattg ccgtcgttgt cgggcgggga taccggtgtg agtcatctga aagggattaa 19141 cgtgaagtac cgttatgagc tgacggacag tgtgggggtg atggcttccc tggggttcgc 19201 cgcgtcgaaa aagagcagca cagtgatgac cggggaggat acgtttcact atgagagcct 19261 gcgtggacgt tatgtgagcg tgatggccgg accggtttta caaatcagta agcaggtcag 19321 tgcgtacgcc atggccggag tggctcacag tcggtggtcc ggcagtacaa tggattaccg 19381 taagacggaa atcactcccg ggtatatgaa agagacgacc actgccaggg acgaaagtgc 19441 aatgcggcat acctcagtgg cgtggagtgc aggtatacag attaatccgg cagcgtccgt 19501 cgttgttgat attgcttatg aaggctccgg cagtggcgac tggcgtactg acggattcat 19561 cgttggggtc ggttataaat tctgattagc caggtaacac agtgttatga cagcccgccg 19621 gaaccggtgg gcttttttgt ggggtgaata tggcagtaaa gatttcagga gtcctgaaag 19681 acggcacagg aaaaccggta cagaactgca ccattcagct gaaagccaga cgtaacagca 19741 ccacggtggt ggtgaacacg gtgggctcag agaatccgga tgaagccggg cgttacagca 19801 tggatgtgga gtacggtcag tacagtgtca tcctgcaggt tgacggtttt 
ccaccatcgc 19861 acgccgggac catcaccgtg tatgaagatt cacaaccggg gacgctgaat gattttctct 19921 gtgccatgac ggaggatgat gcccggccgg aggtgctgcg tcgtcttgaa ctgatggtgg 19981 aagaggtggc gcgtaacgcg tccgtggtgg cacagagtac ggcagacgcg aagaaatcag 20041 ccggcgatgc cagtgcatca gctgctcagg tcgcggccct tgtgactgat gcaactgact 20101 cagcacgcgc cgccagcacg tccgccggac aggctgcatc gtcagctcag gaagcgtcct 20161 ccggcgcaga agcggcatca gcaaaggcca ctgaagcgga aaaaagtgcc gcagccgcag 20221 agtcctcaaa aaacgcggcg gccaccagtg ccggtgcggc gaaaacgtca gaaacgaatg 20281 ctgcagcgtc acaacaatca gccgccacgt ctgcctccac cgcggccacg aaagcgtcag 20341 aggccgccac ttcagcacga gatgcggtgg cctcaaaaga ggcagcaaaa tcatcagaaa 20401 cgaacgcatc atcaagtgcc ggtcgtgcag cttcctcggc aacggcggca gaaaattctg 20461 ccagggcggc aaaaacgtcc gagacgaatg ccaggtcatc tgaaacagca gcggaacgga 20521 gcgcctctgc cgcggcagac gcaaaaacag cggcggcggg gagtgcgtca acggcatcca 20581 cgaaggcgac agaggctgcg ggaagtgcgg tatcagcatc gcagagcaaa agtgcggcag 20641 aagcggcggc aatacgtgca aaaaattcgg caaaacgtgc agaagatata gcttcagctg 20701 tcgcgcttga ggatgcggac acaacgagaa aggggatagt gcagctcagc agtgcaacca 20761 acagcacgtc tgaaacgctt gctgcaacgc caaaggcggt taaggtggta atggatgaaa 20821 cgaacagaaa agcccactgg acagtccggc actgaccgga acgccaacag caccaaccgc 20881 gctcagggga acaaacaata cccagattgc gaacaccgct tttgtactgg ccgcgattgc 20941 agatgttatc gacgcgtcac ctgacgcact gaatacgctg aatgaactgg ccgcagcgct 21001 cgggaatgat ccagattttg ctaccaccat gactaacgcg cttgcgggta aacaaccgaa 21061 gaatgcgaca ctgacggcgc tggcagggct ttccacggcg aaaaataaat taccgtattt 21121 tgcggaaaat gatgccgcca gcctgactga actgactcag gttggcaggg atattctggc 21181 aaaaaattcc gttgcagatg ttcttgaata ccttggggcc ggtgagaatt cggcctttcc 21241 ggcaggtgcg ccgatcccgt ggccatcaga tatcgttccg tctggctacg tcctgatgca 21301 ggggcaggcg tttgacaaat cagcctaccc aaaacttgct gtcgcgtatc catcgggtgt 21361 gcttcctgat atgcgaggct ggacaatcaa ggggaaaccc gccagcggtc gtgctgtatt 21421 gtctcaggaa caggatggaa ttaagtcgca cacccacagt gccagtgcat ccggtacgga 21481 tttggggacg aaaaccacat cgtcgtttga 
ttacgggacg aaaacaacag gcagtttcga 21541 ttacggcacc aaatcgacga ataacacggg ggctcatgct cacagtctga gcggttcaac 21601 aggggccgcg ggtgctcatg cccacacaag tggtttaagg atgaacagtt ctggctggag 21661 tcagtatgga acagcaacca ttacaggaag tttatccaca gttaaaggaa ccagcacaca 21721 gggtattgct tatttatcga aaacggacag tcagggcagc cacagtcact cattgtccgg 21781 tacagccgtg agtgccggtg cacatgcgca tacagttggt attggtgcgc accagcatcc 21841 ggttgttatc ggtgctcatg cccattcttt cagtattggt tcacacggac acaccatcac 21901 cgttaacgct gcgggtaacg cggaaaacac cgtcaaaaac attgcattta actatattgt 21961 gaggcttgca taatggcatt cagaatgagt gaacaaccac ggaccataaa aatttataat 22021 ctgctggccg gaactaatga atttattggt gaaggtgacg catatattcc gcctcatacc 22081 ggtctgcctg caaacagtac cgatattgca ccgccagata ttccggctgg ctttgtggct 22141 gttttcaaca gtgatgaggc atcgtggcat ctcgttgaag accatcgggg taaaaccgtc 22201 tatgacgtgg cttccggcga cgcgttattt atttctgaac tcggtccgtt accggaaaat 22261 tttacctggt tatcgccggg aggggaatat cagaagtgga acggcacagc ctgggtgaag 22321 gatacggaag cagaaaaact gttccggatc cgggaggcgg aagaaacaaa aaaaagcctg 22381 atgcaggtag ccagtgagca tattgcgccg cttcaggatg ctgcagatct ggaaattgca 22441 acgaaggaag aaacctcgtt gctggaagcc tggaagaagt atcgggtgtt gctgaaccgt 22501 gttgatacat caactgcacc tgatattgag tggcctgctg tccctgttat ggagtaatcg 22561 ttttgtgata tgccgcagaa acgttgtatg aaataacgtt ctgcggttag ttagtatatt 22621 gtaaagctga gtattggttt atttggcgat tattatcttc aggagaataa tggaagttct 22681 atgactcaat tgttcatagt gtttacatca ccgccaattg cttttaagac tgaacgcatg 22741 aaatatggtt tttcgtcatg ttttgagtct gctgttgata tttctaaagt cggttttttt 22801 tcttcgtttt ctctaactat tttccatgaa atacattttt gattattatt tgaatcaatt 22861 ccaattacct gaagtctttc atctataatt ggcattgtat gtattggttt attggagtag 22921 atgcttgctt ttctgagcca tagctctgat atccaaatga agccataggc atttgttatt 22981 ttggctctgt cagctgcata acgccaaaaa atatatttat ctgcttgatc ttcaaatgtt 23041 gtattgatta aatcaattgg atggaattgt ttatcataaa aaattaatgt ttgaatgtga 23101 taaccgtcct ttaaaaaagt cgtttctgca agcttggctg tatagtcaac taactcttct 23161 gtcgaagtga 
tatttttagg cttatctacc agttttagac gctctttaat atcttcagga 23221 attattttat tgtcatattg tatcatgcta aatgacaatt tgcttatgga gtaatctttt 23281 aattttaaat aagttattct cctggcttca tcaaataaag agtcgaatga tgttggcgaa 23341 atcacatcgt cacccattgg attgtttatt tgtatgccaa gagagttaca gcagttatac 23401 attctgccat agattatagc taaggcatgt aataattcgt aatcttttag cgtattagcg 23461 acccatcgtc tttctgattt aataatagat gattcagtta aatatgaagg taatttcttt 23521 tgtgcaagtc tgactaactt ttttatacca atgtttaaca tactttcatt tgtaataaac 23581 tcaatgtcat tttcttcaat gtaagatgaa ataagagtag cctttgcctc gctatacatt 23641 tctaaatcgc cttgtttttc tatcgtattg cgagaatttt tagcccaagc cattaatgga 23701 tcatttttcc atttttcaat aacattattg ttataccaaa tgtcatatcc tataatctgg 23761 tttttgtttt tttgaataat aaatgttact gttcttgcgg tttggaggaa ttgattcaaa 23821 ttcaagcgaa ataattcagg gtcaaaatat gtatcaatgc agcatttgag caagtgcgat 23881 aaatctttaa gtcttctttc ccatggtttt ttagtcataa aactctccat tttgataggt 23941 tgcatgctag atgctgatat attttagagg tgataaaatt aactgcttaa ctgtcaatgt 24001 aatacaagtt gtttgatctt tgcaatgatt cttatcagaa accatatagt aaattagtta 24061 cacaggaaat ttttaatatt attattatca ttcattatgt attaaaatta gagttgtggc 24121 ttggctctgc taacacgttg ctcataggag atatggtaga gccgcagaca cgtcgtatgc 24181 aggaacgtgc tgcggctggc tggtgaactt ccgatagtgc gggtgttgaa tgatttccag 24241 ttgctaccga ttttacatat tttttgcatg agagaatttg taccacctcc caccgaccat 24301 ctatgactgt acgccactgt ccctaggact gctatgtgcc ggagcggaca ttacaaacgt 24361 ccttctcggt gcatgccact gttgccaatg acctgcctag gaattggtta gcaagttact 24421 accggatttt gtaaaaacag ccctcctcat ataaaaagta ttcgttcact tccgataagc 24481 gtcgtaattt tctatctttc atcatattct agatccctct gaaaaaatct tccgagtttg 24541 ctaggcactg atacataact cttttccaat aattggggaa gtcattcaaa tctataatag 24601 gtttcagatt tgcttcaata aattctgact gtagctgctg aaacgttgcg gttgaactat 24661 atttccttat aacttttacg aaagagtttc tttgagtaat cacttcactc aagtgcttcc 24721 ctgcctccaa acgatacctg ttagcaatat ttaatagctt gaaatgatga agagctctgt 24781 gtttgtcttc ctgcctccag ttcgccgggc attcaacata aaaactgata gcacccggag 
24841 ttccggaaac gaaatttgca tatacccatt gctcacgaaa aaaaatgtcc ttgtcgatat 24901 agggatgaat cgcttggtgt acctcatcta ctgcgaaaac ttgacctttc tctcccatat 24961 tgcagtcgcg gcacgatgga actaaattaa taggcatcac cgaaaattca ggataatgtg 25021 caataggaag aaaatgatct atattttttg tctgtcctat atcaccacaa aatggacatt 25081 tttcacctga tgaaacaagc atgtcatcgt aatatgttct agcgggtttg tttttatctc 25141 ggagattatt ttcataaagc ttttctaatt taacctttgt caggttacca actactaagg 25201 ttgtaggctc aagagggtgt gtcctgtcgt aggtaaataa ctgacctgtc gagcttaata 25261 ttctatattg ttgttctttc tgcaaaaaag tggggaagtg agtaatgaaa ttatttctaa 25321 catttatctg catcatacct tccgagcatt tattaagcat ttcgctataa gttctcgctg 25381 gaagaggtag ttttttcatt gtactttacc ttcatctctg ttcattatca tcgcttttaa 25441 aacggttcga ccttctaatc ctatctgacc attataattt tttagaatgg tttcataaga 25501 aagctctgaa tcaacggact gcgataataa gtggtggtat ccagaatttg tcacttcaag 25561 taaaaacacc tcacgagtta aaacacctaa gttctcaccg aatgtctcaa tatccggacg 25621 gataatattt attgcttctc ttgaccgtag gactttccac atgcaggatt ttggaacctc 25681 ttgcagtact actggggaat gagttgcaat tattgctaca ccattgcgtg catcgagtaa 25741 gtcgcttaat gttcgtaaaa aagcagagag caaaggtgga tgcagatgaa cctctggttc 25801 atcgaataaa actaatgact tttcgccaac gacatctact aatcttgtga tagtaaataa 25861 aacaattgca tgtccagagc tcattcgaag cagatatttc tggatattgt cataaaacaa 25921 tttagtgaat ttatcatcgt ccacttgaat ctgtggttca ttacgtctta actcttcata 25981 tttagaaatg aggctgatga gttccatatt tgaaaagttt tcatcactac ttagtttttt 26041 gatagcttca agccagagtt gtctttttct atctactctc atacaaccaa taaatgctga 26101 aatgaattct aagcggagat cgcctagtga ttttaaacta ttgctggcag cattcttgag 26161 tccaatataa aagtattgtg taccttttgc tgggtcaggt tgttctttag gaggagtaaa 26221 aggatcaaat gcactaaacg aaactgaaac aagcgatcga aaatatccct ttgggattct 26281 tgactcgata agtctattat tttcagagaa aaaatattca ttgttttctg ggttggtgat 26341 tgcaccaatc attccattca aaattgttgt tttaccacac ccattccgcc cgataaaagc 26401 atgaatgttc gtgctgggca tagaattaac cgtcacctca aaaggtatag ttaaatcact 26461 gaatccggga gcactttttc tattaaatga aaagtggaaa 
tctgacaatt ctggcaaacc 26521 atttaacaca cgtgcgaact gtccatgaat ttctgaaaga gttacccctc taagtaatga 26581 ggtgttaagg acgctttcat tttcaatgtc ggctaatcga tttggccata ctactaaatc 26641 ctgaatagct ttaagaaggt tatgtttaaa accatcgctt aatttgctga gattaacata 26701 gtagtcaatg ctttcaccta aggaaaaaaa catttcaggg agttgactga attttttatc 26761 tattaatgaa taagtgctta cttcttcttt ttgacctaca aaaccaattt taacatttcc 26821 gatatcgcat ttttcaccat gctcatcaaa gacagtaaga taaaacattg taacaaagga 26881 atagtcattc caaccatctg ctcgtaggaa tgccttattt ttttctactg caggaatata 26941 cccgcctctt tcaataacac taaactccaa catatagtaa cccttaattt tattaaaata 27001 accgcaattt atttggcggc aacacaggat ctctctttta agttactctc tattacatac 27061 gttttccatc taaaaattag tagtattgaa cttaacgggg catcgtattg tagttttcca 27121 tatttagctt tctgcttcct tttggataac ccactgttat tcatgttgca tggtgcactg 27181 tttataccaa cgatatagtc tattaatgca tatatagtat cgccgaacga ttagctcttc 27241 aggcttctga agaagcgttt caagtactaa taagccgata gatagccacg gacttcgtag 27301 ccatttttca taagtgttaa cttccgctcc tcgctcataa cagacattca ctacagttat 27361 ggcggaaagg tatgcatgct gggtgtgggg aagtcgtgaa agaaaagaag tcagctgcgt 27421 cgtttgacat cactgctatc ttcttactgg ttatgcaggt cgtagtgggt ggcacacaaa 27481 gctttgcact ggattgcgag gctttgtgct tctctggagt gcgacaggtt tgatgacaaa 27541 aaattagcgc aagaagacaa aaatcacctt gcgctaatgc tctgttacag gtcactaata 27601 ccatctaagt agttgattca tagtgactgc atatgttgtg ttttacagta ttatgtagtc 27661 tgttttttat gcaaaatcta atttaatata ttgatattta tatcatttta cgtttctcgt 27721 tcagcttttt tatactaagt tggcattata aaaaagcatt gcttatcaat ttgttgcaac 27781 gaacaggtca ctatcagtca aaataaaatc attatttgat ttcaattttg tcccactccc 27841 tgcctctgtc atcacgatac tgtgatgcca tggtgtccga cttatgcccg agaagatgtt 27901 gagcaaactt atcgcttatc tgcttctcat agagtcttgc agacaaactg cgcaactcgt 27961 gaaaggtagg cggatcccct tcgaaggaaa gacctgatgc ttttcgtgcg cgcataaaat 28021 accttgatac tgtgccggat gaaagcggtt cgcgacgagt agatgcaatt atggtttctc 28081 cgccaagaat ctctttgcat ttatcaagtg tttccttcat tgatattccg agagcatcaa 28141 tatgcaatgc tgttgggatg 
gcaattttta cgcctgtttt gctttgctcg acataaagat 28201 atccatctac gatatcagac cacttcattt cgcataaatc accaactcgt tgcccggtaa 28261 caacagccag ttccattgca agtctgagcc aacatggtga tgattctgct gcttgataaa 28321 ttttcaggta ttcgtcagcc gtaagtcttg atctccttac ctctgatttt gctgcgcgag 28381 tggcagcgac atggtttgtt gttatatggc cttcagctat tgcctctcgg aatgcatcgc 28441 tcagtgttga tctgattaac ttggctgacg ccgccttgcc ctcgtctatg tatccattga 28501 gcattgccgc aatttctttt gtggtgatgt cttcaagtgg agcatcaggc agacccctcc 28561 ttattgcttt aattttgctc atgtaattta tgagtgtctt ctgcttgatt cctctgctgg 28621 ccaggatttt ttcgtagcga tcaagccatg aatgtaacgt aacggaatta tcactgttga 28681 ttctcgctgt cagaggcttg tgtttgtgtc ctgaaaataa ctcaatgttg gcctgtatag 28741 cttcagtgat tgcgattcgc ctgtctctgc ctaatccaaa ctctttaccc gtccttgggt 28801 ccctgtagca gtaatatcca ttgtttctta tataaaggtt agggggtaaa tcccggcgct 28861 catgacttcg ccttcttccc atttctgatc ctcttcaaaa ggccacctgt tactggtcga 28921 tttaagtcaa cctttaccgc tgattcgtgg aacagatact ctcttccatc cttaaccgga 28981 ggtgggaata tcctgcattc ccgaacccat cgacgaactg tttcaaggct tcttggacgt 29041 cgctggcgtg cgttccactc ctgaagtgtc aagtacatcg caaagtctcc gcaattacac 29101 gcaagaaaaa accgccatca ggcggcttgg tgttctttca gttcttcaat tcgaatattg 29161 gttacgtctg catgtgctat ctgcgcccat atcatccagt ggtcgtagca gtcgttgatg 29221 ttctccgctt cgataactct gttgaatggc tctccattcc attctcctgt gactcggaag 29281 tgcatttatc atctccataa aacaaaaccc gccgtagcga gttcagataa aataaatccc 29341 cgcgagtgcg aggattgtta tgtaatattg ggtttaatca tctatatgtt ttgtacagag 29401 agggcaagta tcgtttccac cgtactcgtg ataataattt tgcacggtat cagtcatttc 29461 tcgcacattg cagaatgggg atttgtcttc attagactta taaaccttca tggaatattt 29521 gtatgccgac tctatatcta taccttcatc tacataaaca ccttcgtgat gtctgcatgg 29581 agacaagaca ccggatctgc acaacattga taacgcccaa tctttttgct cagactctaa 29641 ctcattgata ctcatttata aactccttgc aatgtatgtc gtttcagcta aacggtatca 29701 gcaatgttta tgtaaagaaa cagtaagata atactcaacc cgatgtttga gtacggtcat 29761 catctgacac tacagactct ggcatcgctg tgaagacgac gcgaaattca gcattttcac 29821 
aagcgttatc ttttacaaaa ccgatctcac tctcctttga tgcgaatgcc agcgtcagac 29881 atcatatgca gatactcacc tgcatcctga acccattgac ctccaacccc gtaatagcga 29941 tgcgtaatga tgtcgatagt tactaacggg tcttgttcga ttaactgccg cagaaactct 30001 tccaggtcac cagtgcagtg cttgataaca ggagtcttcc caggatggcg aacaacaaga 30061 aactggtttc cgtcttcacg gacttcgttg ctttccagtt tagcaatacg cttactccca 30121 tccgagataa caccttcgta atactcacgc tgctcgttga gttttgattt tgctgtttca 30181 agctcaacac gcagtttccc tactgttagc gcaatatcct cgttctcctg gtcgcggcgt 30241 ttgatgtatt gctggtttct ttcccgttca tccagcagtt ccagcacaat cgatggtgtt 30301 accaattcat ggaaaaggtc tgcgtcaaat ccccagtcgt catgcattgc ctgctctgcc 30361 gcttcacgca gtgcctgaga gttaatttcg ctcacttcga acctctctgt ttactgataa 30421 gttccagatc ctcctggcaa cttgcacaag tccgacaacc ctgaacgacc aggcgtcttc 30481 gttcatctat cggatcgcca cactcacaac aatgagtggc agatatagcc tggtggttca 30541 ggcggcgcat ttttattgct gtgttgcgct gtaattcttc tatttctgat gctgaatcaa 30601 tgatgtctgc catctttcat taatccctga actgttggtt aatacgcttg agggtgaatg 30661 cgaataataa aaaaggagcc tgtagctccc tgatgatttt gcttttcatg ttcatcgttc 30721 cttaaagacg ccgtttaaca tgccgattgc caggcttaaa tgagtcggtg tgaatcccat 30781 cagcgttacc gtttcgcggt gcttcttcag tacgctacgg caaatgtcat cgacgttttt 30841 atccggaaac tgctgtctgg ctttttttga tttcagaatt agcctgacgg gcaatgctgc 30901 gaagggcgtt ttcctgctga ggtgtcattg aacaagtccc atgtcggcaa gcataagcac 30961 acagaatatg aagcccgctg ccagaaaaat gcattccgtg gttgtcatac ctggtttctc 31021 tcatctgctt ctgctttcgc caccatcatt tccagctttt gtgaaaggga tgcggctaac 31081 gtatgaaatt cttcgtctgt ttctactggt attggcacaa acctgattcc aatttgagca 31141 aggctatgtg ccatctcgat actcgttctt aactcaacag aagatgcttt gtgcatacag 31201 cccctcgttt attatttatc tcctcagcca gccgctgtgc tttcagtgga tttcggataa 31261 cagaaaggcc gggaaatacc cagcctcgct ttgtaacgga gtagacgaaa gtgattgcgc 31321 ctacccggat attatcgtga ggatgcgtca tcgccattgc tccccaaata caaaaccaat 31381 ttcagccagt gcctcgtcca ttttttcgat gaactccggc acgatctcgt caaaactcgc 31441 catgtacttt tcatcccgct caatcacgac ataatgcagg ccttcacgct 
tcatacgcgg 31501 gtcatagttg gcaaagtacc aggcattttt tcgcgtcacc cacatgctgt actgcacctg 31561 ggccatgtaa gctgacttta tggcctcgaa accaccgagc cggaacttca tgaaatcccg 31621 ggaggtaaac gggcatttca gttcaaggcc gttgccgtca ctgcataaac catcgggaga 31681 gcaggcggta cgcatacttt cgtcgcgata gatgatcggg gattcagtaa cattcacgcc 31741 ggaagtgaat tcaaacaggg ttctggcgtc gttctcgtac tgttttcccc aggccagtgc 31801 tttagcgtta acttccggag ccacaccggt gcaaacctca gcaagcaggg tgtggaagta 31861 ggacattttc atgtcaggcc acttctttcc ggagcggggt tttgctatca cgttgtgaac 31921 ttctgaagcg gtgatgacgc cgagccgtaa tttgtgccac gcatcatccc cctgttcgac 31981 agctctcaca tcgatcccgg tacgctgcag gataatgtcc ggtgtcatgc tgccaccttc 32041 tgctctgcgg ctttctgttt caggaatcca agagctttta ctgcttcggc ctgtgtcagt 32101 tctgacgatg cacgaatgtc gcggcgaaat atctgggaac agagcggcaa taagtcgtca 32161 tcccatgttt tatccagggc gatcagcaga gtgttaatct cctgcatggt ttcatcgtta 32221 accggagtga tgtcgcgttc cggctgacgt tctgcagtgt atgcagtatt ttcgacaatg 32281 cgctcggctt catccttgtc atagatacca gcaaatccga aggccagacg ggcacactga 32341 atcatggctt tatgacgtaa catccgtttg ggatgcgact gccacggccc cgtgatttct 32401 ctgccttcgc gagttttgaa tggttcgcgg cggcattcat ccatccattc ggtaacgcag 32461 atcggatgat tacggtcctt gcggtaaatc cggcatgtac aggattcatt gtcctgctca 32521 aagtccatgc catcaaactg ctggttttca ttgatgatgc gggaccagcc atcaacgccc 32581 accaccggaa cgatgccatt ctgcttatca ggaaaggcgt aaatttcttt cgtccacgga 32641 ttaaggccgt actggttggc aacgatcagt aatgcgatga actgcgcatc gctggcatca 32701 cctttaaatg ccgtctggcg aagagtggtg atcagttcct gtgggtcgac agaatccatg 32761 ccgacacgtt cagccagctt cccagccagc gttgcgagtg cagtactcat tcgttttata 32821 cctctgaatc aatatcaacc tggtggtgag caatggtttc aaccatgtac cggatgtgtt 32881 ctgccatgcg ctcctgaaac tcaacatcgt catcaaacgc acgggtaatg gattttttgc 32941 tggccccgtg gcgttgcaaa tgatcgatgc atagcgattc aaacaggtgc tggggcaggc 33001 ctttttccat gtcgtctgcc agttctgcct ctttctcttc acgggcgagc tgctggtagt 33061 gacgcgccca gctctgagcc tcaagacgat cctgaatgta ataagcgttc atggctgaac 33121 tcctgaaata gctgtgaaaa tatcgcccgc 
gaaatgccgg gctgattagg aaaacaggaa 33181 agggggttag tgaatgcttt tgcttgatct cagtttcagt attaatatcc attttttata 33241 agcgtcgacg gcttcacgaa acatcttttc atcgccaata aaagtggcga tagtgaattt 33301 agtctggata gccataagtg tttgatccat tctttgggac tcctggctga ttaagtatgt 33361 cgataaggcg tttccatccg tcacgtaatt tacgggtgat tcgttcaagt aaagattcgg 33421 aagggcagcc agcaacaggc caccctgcaa tggcatattg catggtgtgc tccttattta 33481 tacataacga aaaacgcctc gagtgaagcg ttattggtat gcggtaaaac cgcactcagg 33541 cggccttgat agtcatatca tctgaatcaa atattcctga tgtatcgata tcggtaattc 33601 ttattccttc gctaccatcc attggaggcc atccttcctg accatttcca tcattccagt 33661 cgaactcaca cacaacacca tatgcattta agtcgcttga aattgctata agcagagcat 33721 gttgcgccag catgattaat acagcattta atacagagcc gtgtttattg agtcggtatt 33781 cagagtctga ccagaaatta ttaatctggt gaagtttttc ctctgtcatt acgtcatggt 33841 cgatttcaat ttctattgat gctttccagt cgtaatcaat gatgtatttt ttgatgtttg 33901 acatctgttc atatcctcac agataaaaaa tcgccctcac actggagggc aaagaagatt 33961 tccaataatc agaacaagtc ggctcctgtt tagttacgag cgacattgct ccgtgtattc 34021 actcgttgga atgaatacac agtgcagtgt ttattctgtt atttatgcca aaaataaagg 34081 ccactatcag gcagctttgt tgttctgttt accaagttct ctggcaatca ttgccgtcgt 34141 tcgtattgcc catttatcga catatttccc atcttccatt acaggaaaca tttcttcagg 34201 cttaaccatg cattccgatt gcagcttgca tccattgcat cgcttgaatt gtccacacca 34261 ttgattttta tcaatagtcg tagtcatacg gatagtcctg gtattgttcc atcacatcct 34321 gaggatgctc ttcgaactct tcaaattctt cttccatata tcaccttaaa tagtggattg 34381 cggtagtaaa gattgtgcct gtcttttaac cacatcaggc tcggtggttc tcgtgtaccc 34441 ctacagcgag aaatcggata aactattaca acccctacag tttgatgagt atagaaatgg 34501 atccactcgt tattctcgga cgagtgttca gtaatgaacc tctggagaga accatgtata 34561 tgatcgttat ctgggttgga cttctgcttt taagcccaga taactggcct gaatatgtta 34621 atgagagaat cggtattcct catgtgtggc atgttttcgt ctttgctctt gcattttcgc 34681 tagcaattaa tgtgcatcga ttatcagcta ttgccagcgc cagatataag cgatttaagc 34741 taagaaaacg cattaagatg caaaacgata aagtgcgatc agtaattcaa aaccttacag 34801 aagagcaatc 
tatggttttg tgcgcagccc ttaatgaagg caggaagtat gtggttacat 34861 caaaacaatt cccatacatt agtgagttga ttgagcttgg tgtgttgaac aaaacttttt 34921 cccgatggaa tggaaagcat atattattcc ctattgagga tatttactgg actgaattag 34981 ttgccagcta tgatccatat aatattgaga taaagccaag gccaatatct aagtaactag 35041 ataagaggaa tcgattttcc cttaattttc tggcgtccac tgcatgttat gccgcgttcg 35101 ccaggcttgc tgtaccatgt gcgctgattc ttgcgctcaa tacgttgcag gttgctttca 35161 atctgtttgt ggtattcagc cagcactgta aggtctatcg gatttagtgc gctttctact 35221 cgtgatttcg gtttgcgatt cagcgagaga atagggcggt taactggttt tgcgcttacc 35281 ccaaccaaca ggggatttgc tgctttccat tgagcctgtt tctctgcgcg acgttcgcgg 35341 cggcgtgttt gtgcatccat ctggattctc ctgtcagtta gctttggtgg tgtgtggcag 35401 ttgtagtcct gaacgaaaac cccccgcgat tggcacattg gcagctaatc cggaatcgca 35461 cttacggcca atgcttcgtt tcgtatcaca caccccaaag ccttctgctt tgaatgctgc 35521 ccttcttcag ggcttaattt ttaagagcgt caccttcatg gtggtcagtg cgtcctgctg 35581 atgtgctcag tatcaccgcc agtggtattt atgtcaacac cgccagagat aatttatcac 35641 cgcagatggt tatctgtatg ttttttatat gaatttattt tttgcagggg ggcattgttt 35701 ggtaggtgag agatctgaat tgctatgttt agtgagttgt atctatttat ttttcaataa 35761 atacaattgg ttatgtgttt tgggggcgat cgtgaggcaa agaaaacccg gcgctgaggc 35821 cgggttattc ttgttctctg gtcaaattat atagttggaa aacaaggatg catatatgaa 35881 tgaacgatgc agaggcaatg ccgatggcga tagtgggtat catgtagccg cttatgctgg 35941 aaagaagcaa taacccgcag aaaaacaaag ctccaagctc aacaaaacta agggcataga 36001 caataactac cgatgtcata tacccatact ctctaatctt ggccagtcgg cgcgttctgc 36061 ttccgattag aaacgtcaag gcagcaatca ggattgcaat catggttcct gcatatgatg 36121 acaatgtcgc cccaagacca tctctatgag ctgaaaaaga aacaccagga atgtagtggc 36181 ggaaaaggag atagcaaatg cttacgataa cgtaaggaat tattactatg taaacaccag 36241 gcatgattct gttccgcata attactcctg ataattaatc cttaactttg cccacctgcc 36301 ttttaaaaca ttccagtata tcacttttca ttcttgcgta gcaatatgcc atctcttcag 36361 ctatctcagc attggtgacc ttgttcagag gcgctgagag atggcctttt tctgatagat 36421 aatgttctgt taaaatatct ccggcctcat cttttgcccg caggctaatg tctgaaaatt 
36481 gaggtgacgg gttaaaaata atatccttgg caaccttttt tatatccctt ttaaattttg 36541 gcttaatgac tatatccaat gagtcaaaaa gctccccttc aatatctgtt gcccctaaga 36601 cctttaatat atcgccaaat acaggtagct tggcttctac cttcaccgtt gttcggccga 36661 tgaaatgcat atgcataaca tcgtctttgg tggttcccct catcagtggc tctatctgaa 36721 cgcgctctcc actgcttaat gacattcctt tcccgattaa aaaatctgtc agatcggatg 36781 tggtcggccc gaaaacagtt ctggcaaaac caatggtgtc gccttcaaca aacaaaaaag 36841 atgggaatcc caatgattcg tcatctgcga ggctgttctt aatatcttca actgaagctt 36901 tagagcgatt tatcttctga accagactct tgtcatttgt tttggtaaag agaaaagttt 36961 ttccatcgat tttatgaata tacaaataat tggagccaac ctgcaggtga tgattatcag 37021 ccagcagaga attaaggaaa acagacaggt ttattgagcg cttatctttc cctttatttt 37081 tgctgcggta agtcgcataa aaaccattct tcataattca atccatttac tatgttatgt 37141 tctgagggga gtgaaaattc ccctaattcg atgaagattc ttgctcaatt gttatcagct 37201 atgcgccgac cagaacacct tgccgatcag ccaaacgtct cttcaggcca ctgactagcg 37261 ataactttcc ccacaacgga acaactctca ttgcatggga tcattgggta ctgtgggttt 37321 agtggttgta aaaacacctg accgctatcc ctgatcagtt tcttgaaggt aaactcatca 37381 cccccaagtc tggctatgca gaaatcacct ggctcaacag cctgctcagg gtcaacgaga 37441 attaacattc cgtcaggaaa gcttggcttg gagcctgttg gtgcggtcat ggaattacct 37501 tcaacctcaa gccagaatgc agaatcactg gcttttttgg ttgtgcttac ccatctctcc 37561 gcatcacctt tggtaaaggt tctaagctca ggtgagaaca tccctgcctg aacatgagaa 37621 aaaacagggt actcatactc acttctaagt gacggctgca tactaaccgc ttcatacatc 37681 tcgtagattt ctctggcgat tgaagggcta aattcttcaa cgctaacttt gagaattttt 37741 gcaagcaatg cggcgttata agcatttaat gcattgatgc cattaaataa agcaccaacg 37801 cctgactgcc ccatccccat cttgtctgcg acagattcct gggataagcc aagttcattt 37861 ttcttttttt cataaattgc tttaaggcga cgtgcgtcct caagctgctc ttgtgttaat 37921 ggtttctttt ttgtgctcat acgttaaatc tatcaccgca agggataaat atctaacacc 37981 gtgcgtgttg actattttac ctctggcggt gataatggtt gcatgtacta aggaggttgt 38041 atggaacaac gcataaccct gaaagattat gcaatgcgct ttgggcaaac caagacagct 38101 aaagatctcg gcgtatatca aagcgcgatc aacaaggcca 
ttcatgcagg ccgaaagatt 38161 tttttaacta taaacgctga tggaagcgtt tatgcggaag aggtaaagcc cttcccgagt 38221 aacaaaaaaa caacagcata aataaccccg ctcttacaca ttccagccct gaaaaagggc 38281 atcaaattaa accacaccta tggtgtatgc atttatttgc atacattcaa tcaattgtta 38341 tctaaggaaa tacttacata tggttcgtgc aaacaaacgc aacgaggctc tacgaatcga 38401 gagtgcgttg cttaacaaaa tcgcaatgct tggaactgag aagacagcgg aagctgtggg 38461 cgttgataag tcgcagatca gcaggtggaa gagggactgg attccaaagt tctcaatgct 38521 gcttgctgtt cttgaatggg gggtcgttga cgacgacatg gctcgattgg cgcgacaagt 38581 tgctgcgatt ctcaccaata aaaaacgccc ggcggcaacc gagcgttctg aacaaatcca 38641 gatggagttc tgaggtcatt actggatcta tcaacaggag tcattatgac aaatacagca 38701 aaaatactca acttcggcag aggtaacttt gccggacagg agcgtaatgt ggcagatctc 38761 gatgatggtt acgccagact atcaaatatg ctgcttgagg cttattcggg cgcagatctg 38821 accaagcgac agtttaaagt gctgcttgcc attctgcgta aaacctatgg gtggaataaa 38881 ccaatggaca gaatcaccga ttctcaactt agcgagatta caaagttacc tgtcaaacgg 38941 tgcaatgaag ccaagttaga actcgtcaga atgaatatta tcaagcagca aggcggcatg 39001 tttggaccaa ataaaaacat ctcagaatgg tgcatccctc aaaacgaggg aaaatcccct 39061 aaaacgaggg ataaaacatc cctcaaattg ggggattgct atccctcaaa acagggggac 39121 acaaaagaca ctattacaaa agaaaaaaga aaagattatt cgtcagagaa ttctggcgaa 39181 tcctctgacc agccagaaaa cgacctttct gtggtgaaac cggatgctgc aattcagagc 39241 ggcagcaagt gggggacagc agaagacctg accgccgcag agtggatgtt tgacatggtg 39301 aagactatcg caccatcagc cagaaaaccg aattttgctg ggtgggctaa cgatatccgc 39361 ctgatgcgtg aacgtgacgg acgtaaccac cgcgacatgt gtgtgctgtt ccgctgggca 39421 tgccaggaca acttctggtc cggtaacgtg ctgagcccgg ccaaactccg cgataagtgg 39481 acccaactcg aaatcaaccg taacaagcaa caggcaggcg tgacagccag caaaccaaaa 39541 ctcgacctga caaacacaga ctggatttac ggggtggatc tatgaaaaac atcgccgcac 39601 agatggttaa ctttgaccgt gagcagatgc gtcggatcgc caacaacatg ccggaacagt 39661 acgacgaaaa gccgcaggta cagcaggtag cgcagatcat caacggtgtg ttcagccagt 39721 tactggcaac tttcccggcg agcctggcta accgtgacca gaacgaagtg aacgaaatcc 39781 gtcgccagtg ggttctggct 
tttcgggaaa acgggatcac cacgatggaa caggttaacg 39841 caggaatgcg cgtagcccgt cggcagaatc gaccatttct gccatcaccc gggcagtttg 39901 ttgcatggtg ccgggaagaa gcatccgtta ccgccggact gccaaacgtc agcgagctgg 39961 ttgatatggt ttacgagtat tgccggaagc gaggcctgta tccggatgcg gagtcttatc 40021 cgtggaaatc aaacgcgcac tactggctgg ttaccaacct gtatcagaac atgcgggcca 40081 atgcgcttac tgatgcggaa ttacgccgta aggccgcaga tgagcttgtc catatgactg 40141 cgagaattaa ccgtggtgag gcgatccctg aaccagtaaa acaacttcct gtcatgggcg 40201 gtagacctct aaatcgtgca caggctctgg cgaagatcgc agaaatcaaa gctaagttcg 40261 gactgaaagg agcaagtgta tgacgggcaa agaggcaatt attcattacc tggggacgca 40321 taatagcttc tgtgcgccgg acgttgccgc gctaacaggc gcaacagtaa ccagcataaa 40381 tcaggccgcg gctaaaatgg cacgggcagg tcttctggtt atcgaaggta aggtctggcg 40441 aacggtgtat taccggtttg ctaccaggga agaacgggaa ggaaagatga gcacgaacct 40501 ggtttttaag gagtgtcgcc agagtgccgc gatgaaacgg gtattggcgg tatatggagt 40561 taaaagatga ccatctacat tactgagcta ataacaggcc tgctggtaat cgcaggcctt 40621 tttatttggg ggagagggaa gtcatgaaaa aactaacctt tgaaattcga tctccagcac 40681 atcagcaaaa cgctattcac gcagtacagc aaatccttcc agacccaacc aaaccaatcg 40741 tagtaaccat tcaggaacgc aaccgcagct tagaccaaaa caggaagcta tgggcctgct 40801 taggtgacgt ctctcgtcag gttgaatggc atggtcgctg gctggatgca gaaagctgga 40861 agtgtgtgtt taccgcagca ttaaagcagc aggatgttgt tcctaacctt gccgggaatg 40921 gctttgtggt aataggccag tcaaccagca ggatgcgtgt aggcgaattt gcggagctat 40981 tagagcttat acaggcattc ggtacagagc gtggcgttaa gtggtcagac gaagcgagac 41041 tggctctgga gtggaaagcg agatggggag acagggctgc atgataaatg tcgttagttt 41101 ctccggtggc aggacgtcag catatttgct ctggctaatg gagcaaaagc gacgggcagg 41161 taaagacgtg cattacgttt tcatggatac aggttgtgaa catccaatga catatcggtt 41221 tgtcagggaa gttgtgaagt tctgggatat accgctcacc gtattgcagg ttgatatcaa 41281 cccggagctt ggacagccaa atggttatac ggtatgggaa ccaaaggata ttcagacgcg 41341 aatgcctgtt ctgaagccat ttatcgatat ggtaaagaaa tatggcactc catacgtcgg 41401 cggcgcgttc tgcactgaca gattaaaact cgttcccttc accaaatact gtgatgacca 41461 
tttcgggcga gggaattaca ccacgtggat tggcatcaga gctgatgaac cgaagcggct 41521 aaagccaaag cctggaatca gatatcttgc tgaactgtca gactttgaga aggaagatat 41581 cctcgcatgg tggaagcaac aaccattcga tttgcaaata ccggaacatc tcggtaactg 41641 catattctgc attaaaaaat caacgcaaaa aatcggactt gcctgcaaag atgaggaggg 41701 attgcagcgt gtttttaatg aggtcatcac gggatcccat gtgcgtgacg gacatcggga 41761 aacgccaaag gagattatgt accgaggaag aatgtcgctg gacggtatcg cgaaaatgta 41821 ttcagaaaat gattatcaag ccctgtatca ggacatggta cgagctaaaa gattcgatac 41881 cggctcttgt tctgagtcat gcgaaatatt tggagggcag cttgatttcg acttcgggag 41941 ggaagctgca tgatgcgatg ttatcggtgc ggtgaatgca aagaagataa ccgcttccga 42001 ccaaatcaac cttactggaa tcgatggtgt ctccggtgtg aaagaacacc aacaggggtg 42061 ttaccactac cgcaggaaaa ggaggacgtg tggcgagaca gcgacgaagt atcaccgaca 42121 taatctgcga aaactgcaaa taccttccaa cgaaacgcac cagaaataaa cccaagccaa 42181 tcccaaaaga atctgacgta aaaaccttca actacacggc tcacctgtgg gatatccggt 42241 ggctaagacg tcgtgcgagg aaaacaaggt gattgaccaa aatcgaagtt acgaacaaga 42301 aagcgtcgag cgagctttaa cgtgcgctaa ctgcggtcag aagctgcatg tgctggaagt 42361 tcacgtgtgt gagcactgct gcgcagaact gatgagcgat ccgaatagct cgatgcacga 42421 ggaagaagat gatggctaaa ccagcgcgaa gacgatgtaa aaacgatgaa tgccgggaat 42481 ggtttcaccc tgcattcgct aatcagtggt ggtgctctcc agagtgtgga accaagatag 42541 cactcgaacg acgaagtaaa gaacgcgaaa aagcggaaaa agcagcagag aagaaacgac 42601 gacgagagga gcagaaacag aaagataaac ttaagattcg aaaactcgcc ttaaagcccc 42661 gcagttactg gattaaacaa gcccaacaag ccgtaaacgc cttcatcaga gaaagagacc 42721 gcgacttacc atgtatctcg tgcggaacgc tcacgtctgc tcagtgggat gccggacatt 42781 accggacaac tgctgcggca cctcaactcc gatttaatga acgcaatatt cacaagcaat 42841 gcgtggtgtg caaccagcac aaaagcggaa atctcgttcc gtatcgcgtc gaactgatta 42901 gccgcatcgg gcaggaagca gtagacgaaa tcgaatcaaa ccataaccgc catcgctgga 42961 ctatcgaaga gtgcaaggcg atcaaggcag agtaccaaca gaaactcaaa gacctgcgaa 43021 atagcagaag tgaggccgca tgacgttctc agtaaaaacc attccagaca tgctcgttga 43081 agcatacgga aatcagacag aagtagcacg cagactgaaa tgtagtcgcg 
gtacggtcag 43141 aaaatacgtt gatgataaag acgggaaaat gcacgccatc gtcaacgacg ttctcatggt 43201 tcatcgcgga tggagtgaaa gagatgcgct attacgaaaa aattgatggc agcaaatacc 43261 gaaatatttg ggtagttggc gatctgcacg gatgctacac gaacctgatg aacaaactgg 43321 atacgattgg attcgacaac aaaaaagacc tgcttatctc ggtgggcgat ttggttgatc 43381 gtggtgcaga gaacgttgaa tgcctggaat taatcacatt cccctggttc agagctgtac 43441 gtggaaacca tgagcaaatg atgattgatg gcttatcaga gcgtggaaac gttaatcact 43501 ggctgcttaa tggcggtggc tggttcttta atctcgatta cgacaaagaa attctggcta 43561 aagctcttgc ccataaagca gatgaacttc cgttaatcat cgaactggtg agcaaagata 43621 aaaaatatgt tatctgccac gccgattatc cctttgacga atacgagttt ggaaagccag 43681 ttgatcatca gcaggtaatc tggaaccgcg aacgaatcag caactcacaa aacgggatcg 43741 tgaaagaaat caaaggcgcg gacacgttca tctttggtca tacgccagca gtgaaaccac 43801 tcaagtttgc caaccaaatg tatatcgata ccggcgcagt gttctgcgga aacctaacat 43861 tgattcaggt acagggagaa ggcgcatgag actcgaaagc gtagctaaat ttcattcgcc 43921 aaaaagcccg atgatgagcg actcaccacg ggccacggct tctgactctc tttccggtac 43981 tgatgtgatg gctgctatgg ggatggcgca atcacaagcc ggattcggta tggctgcatt 44041 ctgcggtaag cacgaactca gccagaacga caaacaaaag gctatcaact atctgatgca 44101 atttgcacac aaggtatcgg ggaaataccg tggtgtggca aagcttgaag gaaatactaa 44161 ggcaaaggta ctgcaagtgc tcgcaacatt cgcttatgcg gattattgcc gtagtgccgc 44221 gacgccgggg gcaagatgca gagattgcca tggtacaggc cgtgcggttg atattgccaa 44281 aacagagctg tgggggagag ttgtcgagaa agagtgcgga agatgcaaag gcgtcggcta 44341 ttcaaggatg ccagcaagcg cagcatatcg cgctgtgacg atgctaatcc caaaccttac 44401 ccaacccacc tggtcacgca ctgttaagcc gctgtatgac gctctggtgg tgcaatgcca 44461 caaagaagag tcaatcgcag acaacatttt gaatgcggtc acacgttagc agcatgattg 44521 ccacggatgg caacatatta acggcatgat attgacttat tgaataaaat tgggtaaatt 44581 tgactcaacg atgggttaat tcgctcgttg tggtagtgag atgaaaagag gcggcgctta 44641 ctaccgattc cgcctagttg gtcacttcga cgtatcgtct ggaactccaa ccatcgcagg 44701 cagagaggtc tgcaaaatgc aatcccgaaa cagttcgcag gtaatagtta gagcctgcat 44761 aacggtttcg ggatttttta tatctgcaca 
acaggtaaga gcattgagtc gataatcgtg 44821 aagagtcggc gagcctggtt agccagtgct ctttccgttg tgctgaatta agcgaatacc 44881 ggaagcagaa ccggatcacc aaatgcgtac aggcgtcatc gccgcccagc aacagcacaa 44941 cccaaactga gccgtagcca ctgtctgtcc tgaattcatt agtaatagtt acgctgcggc 45001 cttttacaca tgaccttcgt gaaagcgggt ggcaggaggt cgcgctaaca acctcctgcc 45061 gttttgcccg tgcatatcgg tcacgaacaa atctgattac taaacacagt agcctggatt 45121 tgttctatca gtaatcgacc ttattcctaa ttaaatagag caaatcccct tattgggggt 45181 aagacatgaa gatgccagaa aaacatgacc tgttggccgc cattctcgcg gcaaaggaac 45241 aaggcatcgg ggcaatcctt gcgtttgcaa tggcgtacct tcgcggcaga tataatggcg 45301 gtgcgtttac aaaaacagta atcgacgcaa cgatgtgcgc cattatcgcc tggttcattc 45361 gtgaccttct cgacttcgcc ggactaagta gcaatctcgc ttatataacg agcgtgttta 45421 tcggctacat cggtactgac tcgattggtt cgcttatcaa acgcttcgct gctaaaaaag 45481 ccggagtaga agatggtaga aatcaataat caacgtaagg cgttcctcga tatgctggcg 45541 tggtcggagg gaactgataa cggacgtcag aaaaccagaa atcatggtta tgacgtcatt 45601 gtaggcggag agctatttac tgattactcc gatcaccctc gcaaacttgt cacgctaaac 45661 ccaaaactca aatcaacagg cgccggacgc taccagcttc tttcccgttg gtgggatgcc 45721 taccgcaagc agcttggcct gaaagacttc tctccgaaaa gtcaggacgc tgtggcattg 45781 cagcagatta aggagcgtgg cgctttacct atgattgatc gtggtgatat ccgtcaggca 45841 atcgaccgtt gcagcaatat ctgggcttca ctgccgggcg ctggttatgg tcagttcgag 45901 cataaggctg acagcctgat tgcaaaattc aaagaagcgg gcggaacggt cagagagatt 45961 gatgtatgag cagagtcacc gcgattatct ccgctctggt tatctgcatc atcgtctgcc 46021 tgtcatgggc tgttaatcat taccgtgata acgccattac ctacaaagcc cagcgcgaca 46081 aaaatgccag agaactgaag ctggcgaacg cggcaattac tgacatgcag atgcgtcagc 46141 gtgatgttgc tgcgctcgat gcaaaataca cgaaggagtt agctgatgct aaagctgaaa 46201 atgatgctct gcgtgatgat gttgccgctg gtcgtcgtcg gttgcacatc aaagcagtct 46261 gtcagtcagt gcgtgaagcc accaccgcct ccggcgtgga taatgcagcc tccccccgac 46321 tggcagacac cgctgaacgg gattatttca ccctcagaga gaggctgatc actatgcaaa 46381 aacaactgga aggaacccag aagtatatta atgagcagtg cagatagagt tgcccatatc 46441 gatgggcaac 
tcatgcaatt attgtgagca atacacacgc gcttccagcg gagtataaat 46501 gcctaaagta ataaaaccga gcaatccatt tacgaatgtt tgctgggttt ctgttttaac 46561 aacattttct gcgccgccac aaattttggc tgcatcgaca gttttcttct gcccaattcc 46621 agaaacgaag aaatgatggg tgatggtttc ctttggtgct actgctgccg gtttgttttg 46681 aacagtaaac gtctgttgag cacatcctgt aataagcagg gccagcgcag tagcgagtag 46741 catttttttc atggtgttat tcccgatgct ttttgaagtt cgcagaatcg tatgtgtaga 46801 aaattaaaca aaccctaaac aatgagttga aatttcatat tgttaatatt tattaatgta 46861 tgtcaggtgc gatgaatcgt cattgtattc ccggattaac tatgtccaca gccctgacgg 46921 ggaacttctc tgcgggagtg tccgggaata attaaaacga tgcacacagg gtttagcgcg 46981 tacacgtatt gcattatgcc aacgccccgg tgctgacacg gaagaaaccg gacgttatga 47041 tttagcgtgg aaagatttgt gtagtgttct gaatgctctc agtaaatagt aatgaattat 47101 caaaggtata gtaatatctt ttatgttcat ggatatttgt aacccatcgg aaaactcctg 47161 ctttagcaag attttccctg tattgctgaa atgtgatttc tcttgatttc aacctatcat 47221 aggacgtttc tataagatgc gtgtttcttg agaatttaac atttacaacc tttttaagtc 47281 cttttattaa cacggtgtta tcgttttcta acacgatgtg aatattatct gtggctagat 47341 agtaaatata atgtgagacg ttgtgacgtt ttagttcaga ataaaacaat tcacagtcta 47401 aatcttttcg cacttgatcg aatatttctt taaaaatggc aacctgagcc attggtaaaa 47461 ccttccatgt gatacgaggg cgcgtagttt gcattatcgt ttttatcgtt tcaatctggt 47521 ctgacctcct tgtgttttgt tgatgattta tgtcaaatat taggaatgtt ttcacttaat 47581 agtattggtt gcgtaacaaa gtgcggtcct gctggcattc tggagggaaa tacaaccgac 47641 agatgtatgt aaggccaacg tgctcaaatc ttcatacaga aagatttgaa gtaatatttt 47701 aaccgctaga tgaagagcaa gcgcatggag cgacaaaatg aataaagaac aatctgctga 47761 tgatccctcc gtggatctga ttcgtgtaaa aaatatgctt aatagcacca tttctatgag 47821 ttaccctgat gttgtaattg catgtataga acataaggtg tctctggaag cattcagagc 47881 aattgaggca gcgttggtga agcacgataa taatatgaag gattattccc tggtggttga 47941 ctgatcacca taactgctaa tcattcaaac tatttagtct gtgacagagc caacacgcag 48001 tctgtcactg tcaggaaagt ggtaaaactg caactcaatt actgcaatgc cctcgtaatt 48061 aagtgaattt acaatatcgt cctgttcgga gggaagaacg cgggatgttc attcttcatc 
48121 acttttaatt gatgtatatg ctctcttttc tgacgttagt ctccgacggc aggcttcaat 48181 gacccaggct gagaaattcc cggacccttt ttgctcaaga gcgatgttaa tttgttcaat 48241 catttggtta ggaaagcgga tgttgcgggt tgttgttctg cgggttctgt tcttcgttga 48301 catgaggttg ccccgtattc agtgtcgctg atttgtattg tctgaagttg tttttacgtt 48361 aagttgatgc agatcaatta atacgatacc tgcgtcataa ttgattattt gacgtggttt 48421 gatggcctcc acgcacgttg tgatatgtag atgataatca ttatcacttt acgggtcctt 48481 tccggtgatc cgacaggtta cg // LOCUS MUSIGKCLO 422 bp ss-mRNA ROD 29-JUN-1990 DEFINITION Mouse IgMk rearranged kappa light-chain mRNA variable region (V-J-kappa) anti-DNA autoantibody. ACCESSION M20834 KEYWORDS autoantibody; immunoglobulin light chain; kappa-immunoglobulin; kappa-immunoglobulin light chain; variable region. SOURCE Mouse (strain MRL/Mp -lpr/lpr) spleen hybridoma cell line MRL-DNA22, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 422) AUTHORS Kofler,R., Strohal,R., Balderas,R.S., Johnson,M.E., Noonan,D.J., Duchosal,M.A., Dixon,F.J. and Theofilopoulos,A.N. TITLE Immunoglobulin kappa light chain variable region gene complex organization and immunoglobulin genes encoding anti-DNA autoantibodies in lupus mice JOURNAL J. Clin. Invest. 82, 852-860 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by R. Kofler 28-JUL-1988 FEATURES from to/span description pept 33 > 422 IgMk light chain precursor sigp 33 98 Ig light chain signal peptide matp 99 > 422 Ig light chain mRNA 1 > 422 IgMk mRNA recomb 386 387 V-region end/J-region start BASE COUNT 112 a 109 c 95 g 106 t ORIGIN Chromosome 6. 
1 taattagcta gggaccaaaa ttcaaagaga aaatggattt tcaggtgcag attttcagct 61 tcctgctaat cagtgcctca gtcataatgt ccagaggaca aattgttctc acccagtctc 121 cagcaatcat gtctgcatct cctggggaac gggtcaccat gacctgcagt gccagctcaa 181 gtgtaagttc cagctacttg tactggtacc agcagaagcc aggatcctcc cccaaactat 241 ggatttatag cacatccaac ctggcttctg gagtccctgc tcgcttcagt ggcagtgggt 301 ctgggacctc ttattctctc acaatcagca gcatggaggc tgaagatgct gccacttatt 361 actgccagca gtacagtggt tacccattca cgttcggcac ggggacaaaa ttggaaataa 421 aa // LOCUS MUSIGKCLY 312 bp ss-mRNA ROD 29-JUN-1990 DEFINITION Mouse Ig unproductively rearranged kappa-chain VJ2C mRNA from plasmacytoma BFPC 61B, partial cds. ACCESSION M12189 KEYWORDS immunoglobulin; immunoglobulin light chain; kappa-immunoglobulin; variable region. SOURCE Mouse plasmacytomas, cDNA to mRNA, clone BFPC 61B. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 312) AUTHORS Bernard,O., Gough,N.M. and Adams,J.M. TITLE Plasmacytomas with more than one immunoglobulin kappa mRNA: Implications for allelic exclusion JOURNAL Proc. Natl. Acad. Sci. U.S.A. 78, 5812-5816 (1981) STANDARD simple staff_entry FEATURES from to/span description pept.ps < 1 > 312 Ig kappa-chain VJC-regions (AA at 3) recomb 245 246 V-region end/J2-region start recomb 285 286 J2-region end/C-region start BASE COUNT 75 a 88 c 82 g 67 t ORIGIN 1 ctgcaccagg ggagaaggtc acaatgactt gcagggccag ctcaagtgta agttacatgg 61 actggtacca gcagaagcca ggatcctccc ccaaaccctg gatttatgcc acatccaccc 121 tagtttctgg agttcctgct cgcttcagtg gcagtgggtc tgggacctct tactctctca 181 caatcagcag agtggaggct gaagatgctg ccacttatta gtgcccgcag tggagtagta 241 acccacccga cgttcggtgg aggcaccaag ctggaaatca aacgggctga tgctgcacca 301 actgtatcca tc // LOCUS MUSTCBXZA 396 bp ss-mRNA ROD 29-JUN-1990 DEFINITION Mouse T-cell receptor active beta-chain V-region V11DJ mRNA. 
ACCESSION M13676 KEYWORDS T-cell antigen receptor; T-cell receptor; T-cell receptor beta-chain; antigen receptor; diversity exon; joining exon; processed gene; variable region. SOURCE Mouse (C57BL/6) T-helper hybridoma, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 390) AUTHORS Behlke,M.A., Spinella,D.G., Chou,H.S., Sha,W., Hartl,D.L. and Loh,D.Y. TITLE T-cell receptor beta-chain expression: Dependence on relatively few variable region genes JOURNAL Science 229, 566-570 (1985) STANDARD full staff_review REFERENCE 2 (bases 391 to 396) AUTHORS Behlke,M.A. JOURNAL Unpublished (1985) Howard Hughes Med Inst, St.Louis, MO 63110. STANDARD full staff_review COMMENT Draft entry and computer-readable copy of the sequence in [2] were kindly provided by M.A.Behlke, 25-OCT-1985. FEATURES from to/span description pept 1 > 396 T-cell receptor beta-chain precursor (V11DJ) sigp 1 57 T-cell receptor beta-chain signal peptide matp 58 > 396 T-cell receptor beta-chain recomb 342 343 V11-region end/D-region start recomb 345 346 D-region end/N-region start recomb 348 349 N-region end/J-region start recomb 390 391 J-region end/C-region start BASE COUNT 113 a 96 c 81 g 106 t ORIGIN Chromosome 6. 1 atgagctgca ggcttctcct ctatgtttcc ctatgtcttg tggaaacagc actcatgaac 61 actaaaatta ctcagtcacc aagatatcta atcctgggaa gaacaaataa gtctttggaa 121 tgtgagcaac atctgggaca taatgctatg tactggtata aacagagcgc tgagaagccg 181 ccagagctca tgtttctcta caatcttaaa cagttgattc gaaatgagac ggtgcccagt 241 cgttttatac ctgaatgccc agacagctcc aagctacttt tacatatatc tgccgtggat 301 ccagaagact cagctgtcta tttttgtgcc agcagccaga taactagtaa ccaagacacc 361 cagtactttg ggccaggcac tcggctcctc gtgtta // LOCUS P18XIS911 115 bp ds-DNA SYN 29-JUN-1990 DEFINITION Plasmid pACYC184/insertion element 91 (IS91), left junction. ACCESSION K03543 KEYWORDS insertion sequence. 
SEGMENT 1 of 2 SOURCE Plasmid pACYC184 and insertion element 91 DNA, clone pSU234. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 115) AUTHORS Diaz-Aroca,E., Mendiola,M.V., Zabala,J.C. and de la Cruz,F. TITLE Transposition of IS91 does not generate a target duplication JOURNAL J. Bacteriol. 169, 442-443 (1987) STANDARD full staff_review COMMENT No duplication of an insertion target sequence seems to take place upon the insertion of IS91. FEATURES from to/span description recomb 16 17 pBR322 DNA end/IS91 DNA start rpt 16 24 IS91 inverted terminal repeat BASE COUNT 26 a 26 c 29 g 34 t ORIGIN 16 bp upstream of TaqI site. 1 ttaacgaccc tgccctcgag taggcagcct ggcggctgcg gcttgtcatg gtctggaatt 61 accgttataa aaaaagataa tgtcattgtc tttcaggtag ttatatggcc cgttc // LOCUS P18XIS912 115 bp ds-DNA SYN 29-JUN-1990 DEFINITION Plasmid pACYC184/insertion element 91 (IS91), right junction. ACCESSION M15256 KEYWORDS insertion sequence. SEGMENT 2 of 2 SOURCE Plasmid pACYC184 and insertion element 91 DNA, clone pSU234. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 115) AUTHORS Diaz-Aroca,E., Mendiola,M.V., Zabala,J.C. and de la Cruz,F. TITLE Transposition of IS91 does not generate a target duplication JOURNAL J. Bacteriol. 169, 442-443 (1987) STANDARD full staff_review COMMENT No duplication of an insertion target sequence seems to take place upon the insertion of IS91. FEATURES from to/span description rpt 93 100 IS91 inverted terminal repeat recomb 101 102 IS91 DNA end/pACYC184 DNA start BASE COUNT 25 a 20 c 29 g 41 t ORIGIN About 1.85 kb after segment 1; 97 bp upstream of TaqI site. 1 tttaatagtc gtttccgctc atatggtgca caaggggtgt tgaagaaaca tccgttttgt 61 ggtgcttttt tagtcttttg gggatttaaa ttcctatcga tgaaccgacg accgg // LOCUS PBR322IS2 115 bp ds-DNA SYN 29-JUN-1990 DEFINITION Plasmid pBR322/insertion element 91 (IS91), right junction. ACCESSION M15257 KEYWORDS insertion sequence. 
SEGMENT 2 of 2 SOURCE Plasmid pBR322 and insertion element 91 DNA, clone pSU240. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 115) AUTHORS Diaz-Aroca,E., Mendiola,M.V., Zabala,J.C. and de la Cruz,F. TITLE Transposition of IS91 does not generate a target duplication JOURNAL J. Bacteriol. 169, 442-443 (1987) STANDARD full staff_review COMMENT No duplication of an insertion target sequence seems to take place upon the insertion of IS91. FEATURES from to/span description rpt 93 100 IS91 inverted terminal repeat recomb 101 102 IS91 DNA end/pBR322 DNA start BASE COUNT 25 a 19 c 27 g 44 t ORIGIN About 1.85 kb after segment 1; 97 bp upstream of TaqI site. 1 tttaatagtc gtttccgctc atatggtgca caaggggtgt tgaagaaaca tccgttttgt 61 ggtgcttttt tagtcttttg gggatttaaa ttcctatcga tcaaggatct taccg // LOCUS RATMYCSE 903 bp ds-DNA ROD 29-JUN-1990 DEFINITION Rat c-myc/Ig epsilon translocation DNA. ACCESSION M23417 KEYWORDS epsilon immunoglobulin; switch region; translocation. SOURCE Rat (strain LOU) immunocytoma cell line IR162 DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 903) AUTHORS Tian,S.-S. and Faust,C. TITLE A simple and precise aberrant translocation of the rat c-myc gene into the epsilon-heavy chain switch region of the IgE-producing immunocytoma, IR162 JOURNAL J. Biol. Chem. 
264, 1846-1853 (1989) STANDARD simple staff_entry FEATURES from to/span description recomb 298 299 Ig epsilon switch region DNA end/c-myc DNA start signal 617 621 TATA box signal 777 781 TATA box BASE COUNT 212 a 250 c 238 g 203 t ORIGIN 1 gggcccagtt cagctctgtg ccaggccccc agaaagcata ggatatttct gataaacctg 61 attgtctcta cctaccctga tatagtctgg ctactatatc aggacaatct tcttggagaa 121 cagttaaaaa catccataaa aaatttcgag caatctattt ttttttaaat taatttattt 181 tttggatttt ttttaaactt acatttcaaa tgttatcact tttcccccac ccaaccaccc 241 acccctttcc acctctctgt cctgacattc ccctacactg gtgtgtgtgt ggggtccaga 301 ctaatacacg caggacaaga acacagttca gccgagcgct gcgcccgaac aactgtacag 361 aaagggaaag gaccgacgcg cgagcaggag aaaatggtcg ggcgcgcagt taattcatgc 421 tgcgctatta ctgtttacac cccgagtcgg agtactgggc tgcgggactg aggctcctcc 481 tctttccccg gctccccact agtgcccctc ccgagttccc aaagcagagg gcgggggagc 541 gagaggagga aaaaaataga ggtcgggagg ggagggagaa agagattctc tggctaatcc 601 ccgcccaccc gccctttata atccgggggt ctgcgcggct aaggaccccc gggctgcgct 661 gctctccgct gccgcctccg ccgcgcccac tccgctcgcc tcctgcctcc aaaagggcag 721 ggcttcgccg aggcttggcg ggaaaaagaa gcgaggggag ggatccggag tcgcagtata 781 aaagaagctt ttcgggcgtt ttttttctga ctcgctgtag taattccagc gagagacaga 841 gggagtgagc gggcgggttg gaagagccca gtgtgcagag ccccactccg ggcttcctag 901 gaa // LOCUS RATRAFB 2719 bp ss-mRNA ROD 29-JUN-1990 DEFINITION Rat activated c-raf oncogene mRNA encoding a fused protein, complete cds. ACCESSION M15428 KEYWORDS fusion protein; oncogene; protein kinase; raf oncogene. SOURCE Rat primary NIH 3T3 cell transformant IQ7-2, cDNA to mRNA, clone pA5. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2719) AUTHORS Ishikawa,F., Takaku,F., Nagao,M. and Sugimura,T. TITLE Rat c-raf oncogene activation by a rearrangement that produces a fused protein JOURNAL Mol. Cell. Biol. 
7, 1226-1232 (1987) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by F.Ishikawa, 29-MAY-1987. The activated rat c-raf gene (see below) is transcribed to produce a fused mRNA, in which the 5' half of the sequence is replaced by an unknown rat sequence. The normal and activated c-raf were each connected to the long terminal repeat of Rous sarcoma virus and transfected into NIH 3T3 cells. Only the activated form had transforming activity. FEATURES from to/span description pept 206 2014 raf fusion protein mRNA < 1 > 2525 raf mRNA recomb 898 899 rat DNA of unknown origin end/raf DNA start site 1032 1032 homology to v-raf start BASE COUNT 762 a 644 c 698 g 615 t ORIGIN 123 bp upstream of BglI site. 1 gccgtgcgca tagaggccgg tgcgcggccc ttgctcgttt aacgcgggac tatatttccc 61 agggtccgtc gcgggagtct ccggcgggca ggcgcgcggg agactgcgag cgaggcgccg 121 acggggcggc tcaggcgtct gggtccgcgc atctccttgc tccttcgctt ctccttcagc 181 cgctgctgcc acgaccccgg ccgacatggc ggcggtgttg cagcaagtgc tggagcgccc 241 ggagctgaac aagctgccta agtcgaccca gaacaaactt gagaagttcc tggctgaaca 301 gcagtccgaa atcgactgcc tgaaggggcg gcacgagaaa tttaaggtgg agagtgagca 361 acaatacttt gagatagaga agagactatc ccagagtcag gagaggcttg ttaatgaaac 421 ccgggagtgt cagaacttga ggctggagct tgagaagcta aataaccaag taaaagtatt 481 aactgagaaa aacaaagaac ttgaaactgc tcaagaccgc aatctaggca ttcagagcca 541 gtttacaaga gcaaaggaag agttagaagc tgaaaaaaga gatttaatca gaaccaatga 601 gaggttatct caggaagttg aatatttaac agaggatgtt aaacgtctaa acgaaaaact 661 taaagaaagc aatacaacga agggtgaact tcagttaaag ctggatgaac ttcaagcttc 721 tgatgtcact gtgaagtacc gagaaaaacg cttagaacaa gaaaaggaat tgctacacaa 781 tcaaaattca tggctaaaca cagagttgaa aaccaaaact gatgagctat tggctctagg 841 aagagaaaag ggaaatgaaa ttctggaact taagtgtact cttgaaaaca aaaaggaaga 901 ggatgcaatt cgaagtcaca gtgaatcagc ctcaccttca gccctgtcca gcagccccaa 961 caacctgagc ccaacaggct ggtcacagcc caaaacccct gtgccagcac aaagagagag 1021 ggcgccagga tctgggaccc aggaaaaaaa caaaattagg 
cctcgtgggc agagagattc 1081 aagttattac tgggaaatag aagccagtga ggtgatgctg tctactcgga ttggctcggg 1141 ctcctttggc actgtgtaca agggcaagtg gcatggagat gttgcagtaa agatcctaaa 1201 ggtggttgac ccaactccag agcaacttca ggccttcagg aacgaggtgg ctgttttgcg 1261 caaaacacgg catgttaata tcctgctgtt catggggtac atgacaaagg acaacctggc 1321 gattgtgacc cagtggtgtg aaggcagcag tctctacaaa cacctgcatg tccaggagac 1381 caaattccag atgttccagc taattgacat tgcccggcag acagctcagg gaatggacta 1441 tttacatgca aagaacatca tccacagaga catgaaatcc aacaatatat ttctccatga 1501 aggcctcacg gtgaaaatcg gagattttgg tttggcaaca gtgaagtcgc gctggagtgg 1561 ttctcagcag gttgaacagc ccactggctc tgtgctgtgg atggccccag aagtaatccg 1621 aatgcaggat aacaacccgt tcagcttcca gtccgatgtc tactcctatg gcattgtgct 1681 gtatgagctg atgactgggg agcttcccta ctcccacatc aacaaccgag accagatcat 1741 cttcatggtg ggccgtgggt acgcctcccc agatcttagc aggctctaca agaactgccc 1801 caaggcaatg aagaggttgg tggctgactg tgtgaagaaa gtcaaagaag aaaggccttt 1861 gtttcctcag atcctgtctt ccattgagct gcttcagcac tctctgccga aaatcaacag 1921 gagcgcctct gagccttccc tgcatcgggc agctcacact gaggacatca atgcttgtac 1981 gctgaccaca tccccaaggc taccagtctt ctagctgacg ttatagctgt tcttaggcca 2041 ccaggggacg aagaagagtc agcaggcacc actttctgtt tccttggggg cagaatgcat 2101 gtttccggaa aagctgctgc taaggaccta gactactcac agggccttaa cttcatattg 2161 ccttcttttc tacccttcct gccctggaaa tggaagctgt ccgccaagcc agcctgctcc 2221 agaggtatac aagtcagcga gtatttttag ggcaaatggc cttggagaga gaaggcaggg 2281 cactccggct actgcaggga catgcagttg ggaacttggc tcattgagct gtacagacag 2341 tggtgcagtg ccagttttgc acatggagtc ctggccacct gggggagcct gctttggtac 2401 tacagaactt cactttgtgg acacaccttc ctcttactga gtctaagatg tcctgtgcag 2461 aggatgcttt ccaagcacgg tgctccacct tctggcagcc tcccacacgc tgaatctgtc 2521 ttccaggagc tgccctatgg ggtgctgcag cccagcccta tctctatagt cacatccttg 2581 tctgtaagaa agccaggaat acaggttttc ttaatgattt tgggttttaa ttttgttttt 2641 attgagcctg ataaaataca gttatctgat ggttcctcaa ttatgttatt ttaataaaat 2701 aaattaaatt taaaaaaaa // LOCUS YSCINV 68 bp ss-mRNA SYN 
29-JUN-1990 DEFINITION Yeast (S.cerevisiae) synthetic invertase signal peptide region fused to human interferon gene. ACCESSION M13346 KEYWORDS gene fusion; invertase signal peptide; signal peptide. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 68) AUTHORS Chang,C.N., Matteucci,M., Perry,L.J., Wulf,J.J., Chen,C.Y. and Hitzeman,R.A. TITLE Saccharomyces cerevisiae secretes and correctly processes human interferon hybrid proteins containing yeast invertase signal peptides JOURNAL Mol. Cell. Biol. 6, 1812-1819 (1986) STANDARD full staff_review FEATURES from to/span description pept 12 > 68 yeast invertase signal/human interferon fusion peptide sigp 12 68 invertase signal peptide recomb 8 9 EcoRI sequence end/yeast invertase signal start BASE COUNT 13 a 16 c 12 g 27 t ORIGIN 3 bp upstream of EcoRI site. 1 ccgaattcat gatgcttttg caagctttcc ttttcctttt ggctggtttt gcagccaaaa 61 tatctgca // LOCUS ADGLS 1477 bp ds-DNA VRL 29-JUN-1990 DEFINITION Adenovirus type 7 (simian) left end of genome: 0.0-4.2 map units. ACCESSION K03325 KEYWORDS alternate splicing; terminal repeat. SOURCE Adenovirus type 7 (simian) DNA. ORGANISM Mastadenovirus s7 Viridae; ds-DNA nonenveloped viruses; Adenoviridae. REFERENCE 1 (bases 1 to 1477) AUTHORS Kimelman,D., Miller,J.S., Porter,D. and Roberts,B.E. TITLE E1a regions of the human adenoviruses and of the highly oncogenic simian adenovirus 7 are closely related JOURNAL J. Virol. 53, 399-409 (1985) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by D.Kimelman (15-AUG-1985). The mRNA transcripts (mRNAs I and II) were determined by mRNA-DNA hybridization and S1 nuclease digestion. The peptide coding regions were deduced by analogy to other adenoviruses and following the most likely mode of translation. The intron of the mRNA III transcribed from the E1a of other adenoviruses has not yet been determined by S1 mapping. 
Enhancer sequences are located at positions 165-173 and 265-270. A TATA box is located at position 408-413. A polyadenylation signal is located at position 1447-1452. FEATURES from to/span description pept 486 837 E1a 28 K protein from mRNA I, exon 1 1178 1407 E1a 28 K protein from mRNA I, exon 2 pept 486 1056 E1a 24 K protein from mRNA II, exon 1 1178 1407 E1a 24 K protein from mRNA II, exon 2 pre-msg 439 1477 E1a mRNA IVS 838 1177 E1a 24K pept intron A IVS 1057 1177 E1a 28K pept intron A BASE COUNT 331 a 334 c 442 g 370 t ORIGIN 5' end of the l strand of the genome. 1 tctatataat ataccttatt tgggaacggt gccaatatgc taatgaggtg ggcggagttt 61 ggtgacgtat gcggaaatgg gcggagttag gggcggggtt tggcggtagg cgtggctggg 121 ggagtgtccg ggcgtgggaa cggaagtgac gtagggggcg cgccggaggt gacgtcgtgt 181 ggggagtttt aaaccggaag caaggtattt taaacgcttg caagcgcaat tttgtcggtt 241 ttggcgcgaa aactgataaa aagcggaagt tcggttaatc attaattttt acgataggga 301 ggaatattta ccgagggccg gtgaactttg agcggtgacg cggtggtttc gttacgtggc 361 accaccacgc gactgctcaa agtccccgtt tattgtctag gtgagggtat ttaaaccggc 421 tcagaacgtc aagaggccac tcttgagtgc ccgcgagaag agctttctcc tctttcgctg 481 cgaaaatgag acacttggcg ttggaaatga tttctgaact gctggattta ggactggata 541 ccattgatgg ctggctgcac accgaatttc ggccggtacc ggcgggggtg agtcataaca 601 tgtcgctgca cgaaatgtac gacctggacg ttaccggcca ggaggatgag aacgaagagg 661 cggtagatgg tgttttttcc gatgcgatgc tcctggccgc ggaggaggga atagaaatgc 721 ctaatcttta ttctccggga cctctggttg ggggaggtga aatgcctgaa cttcagcctg 781 aggaggaaga tcttttctgc tacgaagatg gcttccctcc cagtgactct gaggaaggtg 841 agcattcgca ggtggagaca gaacgtaaaa tggcggaggc ggcggcagca ggtgcggcgg 901 cggccgcgcg gcgggagcaa gatgactttc gcttagactg tcctagcgta cctggccatg 961 gctgtagctc ctgtgactac catcgcaaaa ctagcggctg tcctgaaatt ctgtgctcgc 1021 tgtgctatct gagggctaac agcatgttta tttatagtaa gtaaattttt tctactaact 1081 ttctcgttgt gtgtttgctc gctcgctcgc tcgcaactgc tggggtgctt gctgttggga 1141 ctgagcttac aggtattttc tctgtaattt tccataggtc cagtttctga ctctgagcca 1201 gacgagcccg 
actccacaac agctgattca aatcatggca gcccgccaac ccttcgctgc 1261 accccaccca gggacttgcc gcgacctgtg ccagtgaagg cctctcctgg caagcgccca 1321 gcggtgaaca gcttgcatga cctcatagag gaggttgaac aaacagtacc tttggacctg 1381 tccctaaagc gctctaggag caattagggt tataaaaccc ctccccttcc ccttaagtta 1441 tcaggaaata aaaagattaa ctggattctt tgtgcct // LOCUS ADGRS 116 bp ds-DNA VRL 29-JUN-1990 DEFINITION adenovirus type 7 (simian) right end of genome. ACCESSION V00035 J01991 KEYWORDS terminal repeat. SOURCE adenovirus type 7 (simian) dna. ORGANISM Mastadenovirus s7 Viridae; ds-DNA nonenveloped viruses; Adenoviridae. REFERENCE 1 (bases 1 to 116) AUTHORS Broker,T.R. TITLE terminal nucleotide sequences and restriction endonuclease cleavage maps of simian and murine adenoviruses JOURNAL (in) Tooze,J. (Ed.); DNA Tumor Viruses, Second Edition Revised: 1037-1046; Cold Spring Harbor Laboratory, Cold Spring Harbor (1981). STANDARD simple staff_review REFERENCE 2 (bases 1 to 116) AUTHORS Tolun,A., Alestroem,P. and Pettersson,U. TITLE sequence of inverted terminal repetitions from different adenoviruses: demonstration of conserved sequences and homology between sa7 termini and sv40 dna JOURNAL Cell 17, 705-713 (1979) STANDARD simple staff_review COMMENT L-strand is given, from 5' to 3'. BASE COUNT 25 a 15 c 46 g 30 t ORIGIN 1 atcaataata taccttattt gggaacggtg ccaatatgct aatgaggtgg gcggagtttg 61 gtgacgtatg cggaaatggg cggagttagg ggcggggttt ggcggtaggc gtggct // LOCUS ADGSA7PE1 2338 bp ds-DNA VRL 29-JUN-1990 DEFINITION Simian adenovirus SA7P, leftmost 2338 bp. ACCESSION X01027 KEYWORDS E1A protein; E1b protein; inverted terminal repeat; overlapping genes; transforming gene. SOURCE Simian adenovirus SA 7P. ORGANISM Mastadenovirus s7 Viridae; ds-DNA nonenveloped viruses; Adenoviridae. REFERENCE 1 (bases 1 to 2338; enum. 1 to 2338) AUTHORS Dekker,B.M.M., Konings,D.A.M., Denisova,T.S., Gibadulin,R.A. and Van Ormondt,H. TITLE The nucleotide sequence of the leftmost XhoI fragment (6%) of simian adenovirus SA7P JOURNAL J. 
Gen. Virol. 65, 1699-1708 (1984) STANDARD simple automatic COMMENT **map: leftmost XhoI fragment (6%); EMBL features not translated to GenBank features: key from to description SITE 1 183 inverted terminal repeat (ITR) of SA7P PRM 409 416 TATA E1A region CAP 440 440 cap site E1A RNAs MSG 440 838 "12S" E1A RNA MSG 440 838 1st exon "12S" E1A RNA IVS 839 1175 intron "12S" E1A RNA MSG 1176 >1445 2nd exon "12S" E1A RNA MSG 440 1057 1st exon "13S" E1A RNA IVS 1058 1175 intron "13S" E1A RNA MSG 1176 >1445 2nd exon "13S" E1A RNA SITE 1445 1450 AATAAA E1A PRM 1508 1514 TATA E1B CAP 1538 1538 cap E1B RNAs This virus, an independent isolate from Moscow, USSR, is highly related to simian adenovirus 7 (SA7). The E1A sequences are 95% homologous to those of SA7; see $$ J. Virol. 5:399-409(1985) FEATURES from to/span description pept 487 838 E1A 21K protein (part 1) 1176 / 1401 E1A 21K protein (part 2) pept 487 1057 E1A 29K protein (part 1) 1176 / 1401 E1A 29K protein (part 2) pept 1569 2111 pot. 21K E1B protein pept 1874 > 2338 pot.
55K E1B protein BASE COUNT 514 a 535 c 725 g 564 t ORIGIN 1 catcatcaat aatatacctt atttgggaac ggtgccaata tgctaatgag gtgggcggag 61 tttggtgacg tatgcggaag tgggcggagc aaggggcggg gcgagaggcg gggcttttgg 121 taggcgtggc cgggcgtggg aacggaagtg acgtcggggg cgcgccggac gtgacgtgtt 181 ttgaggcgtt ttaaaccgga agcaaggtat tttaaacgct tgcagcgcaa ttttgccggt 241 tttggcgcga aaactgataa aaagcggaag ttcggttaat cattaatttt tacgataggg 301 aggaatattt accgagggcc ggtgaacttt gagcgatgac gcggtggttt cgttacgtgg 361 caccaccacg cgactgctca aagtccccgt ttattgtcta ggtgagggta tttaaaccgg 421 ctcagaccgt caagaggcca ctcttgagtg cccgcgagaa gagctttctc ctctttcgct 481 gcgaaaatga gacacttggc gttggagatg atgtctgaac tgctggattt aggactggat 541 accatcgata gctggctgca caccgaattc gcgccggtac cgacgggggt gagtcataac 601 atgtcgctgc acgaaatgta cgacctggac gttaccggcc aggaggatga gaacgaagag 661 gcggtagacg gtgttttttc cgatgcgatg ctcctggccg ctgaagaggg agtagaaatg 721 cctagtcttt attctccggg acctctggtt gggggaggtg agatgcctga gctacagcct 781 gaggaggtag atcttttctg ctacgaagat ggcttccctc ccagtgactc tgaggaaggt 841 gagcattcgc aggtggagac ggaacgtaaa atggcggagg tggcggcggc aggtgcggcg 901 gcggccgtcc gcggggagca agatgacttt cgcttagact gtcctagcgt acctggccat 961 ggctgtagct cctgtgacta ccatcgcaaa aatagcggct gtcctgaaat tctgtgctcg 1021 ctgtgctatc tgagggctaa cagcatgttt atttatagta agtgaatttt tctactaact 1081 ttctcgctgt gtgtttgctc gctcgttcgc taactgctgg ggtgcttggt gttgggactg 1141 agcttacagg tatttttctc tgtaattttc cacaggtcca gtttctgact ctgagccaga 1201 cgagcctgac tccacaacag ctgattcaaa tcatggcagc ccgccaaccc ttcgctgcac 1261 cccacccagg gacttgccgc gacctgtgcc agtgaaggcc tctcatggca agcgcccagc 1321 ggtgaacagc ttgcatgacc tcatagagga ggttgaacaa acagtacctt tggacctgtc 1381 cctaaagcgc tctaggagca attagggtca taaaacccct ccccttcccc ttaagttata 1441 aggaaataaa aagattaact ggattctttg tgcctgcttt gtttgttcgt gggcggtcct 1501 tggggattat aaaaggggtg agtcaggtta taagggtact taggcaacgc tcaggtaaca 1561 tcgcctccat ggatctccga acggcgcttc agacttttga gagcacccgc cgcttgctgg 1621 agctctgttc caatagaacc tcttttttgt 
ggaggtggtt atttggaact ccgctcagcc 1681 ggctggttag gcaggtgaaa ttagaatacg agaaggattt tgaaagaatt ttagatcagt 1741 gtcccggggt gtttgagtcc ctggagctgg gctatcataa ggtttttgag gagaagattg 1801 taaaggagtt ggatttttct tctcccggtc gggcggtcgc ggctgtggcc tttgcttcct 1861 acctgctgga tagatggaac acccggaccc acctgtcccc ggggtaccag atggattaca 1921 tcagcctgaa cctgtggaag ttttggttgc gccggcgggt ttacaattac tcgcgggggc 1981 tgcctcagct agggccggcg gcgccgctgg cgaggcaggg gtcgcagcag gaggagcagc 2041 agcagcggca ggaggaggag caggtgcagg aggagatgag gtccggcctg gaccctccaa 2101 cggagaactg atggcgcagg tggccggcgc ggcggatgcc agcgggccgc cttccaagcg 2161 ccctaagaat gaggcagagc agatgcagga gggtttaact cagttgaccc tgagtctcat 2221 taacaagtat cggcctgaga ccgtgtactg gaatgaactg gagcaggaat ttcagcatgg 2281 ggagatgcac ttgcagttcc actttagctt tgagcaatta aaaactcact ggctcgag // LOCUS ADGVARNA 850 bp ds-DNA VRL 29-JUN-1990 DEFINITION Simian adenovirus type 7 (SA7) virus-associated (VA) RNA gene. ACCESSION M14082 KEYWORDS virus-associated RNA. SOURCE Simian adenovirus type 7 DNA. ORGANISM Mastadenovirus s7 Viridae; ds-DNA nonenveloped viruses; Adenoviridae. REFERENCE 1 (bases 1 to 850) AUTHORS Larsson,S., Svensson,C. and Akusjaervi,G. TITLE Characterization of a low-molecular-weight virus-associated (VA) RNA encoded by simian adenovirus type 7 which functionally can substitute for adenovirus type 5 VA RNA-1 JOURNAL J. Virol. 60, 635-644 (1986) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by S.Larsson, 15-JAN-1987. Simian adenovirus type 7 VA RNA stimulates protein synthesis in a transient expression assay and can functionally substitute for VA RNA-I during lytic growth of human Ad5. The SA7 genome encodes only one VA RNA species. FEATURES from to/span description RNA 181 352 virus-associated RNA (5' and 3' ends +/-2bp) ORF 105 < 1 (c) ORF A cds ORF 366 > 850 ORF B cds BASE COUNT 140 a 260 c 338 g 112 t ORIGIN 150 bp upstream of XbaI site; 30 mu on viral map. 
1 gtagatgtag cgcgacatcc aggtgaggcc ggcggcggtg gtggcggcgc gggtgaactc 61 gcggacgcgg ttccagaggt tgcgcagggg ggcgaagcgt tgcatggtgg tcacgctctg 121 gccggtgaga cgggcgcagt cctgcacgct ctagacggaa acagagagag gcgtcaccga 181 ctcctctccg tagcttgggg gttaggtcgc aagggtgcgg tggcggggaa ccccggttcg 241 aaaccggccg gatccgccac tcccgacgcg gcggccccgc gtccacgacc ccaccagagg 301 ccgagaccca gccgcgacgc gcgcacccca aatacggagg ggagtctttt ggtgcttttt 361 tgtagatgca tcccgtgctg cggcagatgc gccccacgcg gggggcctcc tcgcgggaag 421 aagagctgga ggcgcaggag gaggcgcgcg gtggcgccgc gggccccgcc gagctggagc 481 tggaggaggg cgaaggcttg gcgcggctgg gcgccggagt gccggagcgc cacccgcggg 541 tgcaactggc tcgcgaccag cgccaggcct acgtgccgcc gcagaacctg tttagggacc 601 ggagcgggga ggaggcggag gaaatgcgtg actgccgttt tcgggccggg cgggagctgc 661 gggcgggctt ggagcgcgag cggctgctgc ggcccgagga ctttgagccc gaggagcgca 721 acggtatcag tccggcgcgc gcgcacgtct cggccgccaa cctggtgacg gcctacgagc 781 agacggtgaa cgaggagcgc aactttcaaa agagcttcaa caaccacgtg cgcaccctcg 841 tggccccggg // LOCUS ADZITRL 150 bp ds-DNA VRL 29-JUN-1990 DEFINITION adenovirus 31 inverted terminal repeat & repl ori left end. ACCESSION J01958 KEYWORDS origin of replication; terminal repeat. SOURCE ad31 (adenovirus 31). ORGANISM Mastadenovirus 31 Viridae; ds-DNA nonenveloped viruses; Adenoviridae. REFERENCE 1 (bases 1 to 150) AUTHORS Stillman,B.W., Topp,W.C. and Engler,J.A. TITLE conserved sequences at the origin of adenovirus dna replication JOURNAL J. Virol. 44, 530-537 (1982) STANDARD simple staff_review BASE COUNT 38 a 29 c 48 g 35 t ORIGIN 1 catcatcaat aatatacctt acactggact tgagccaata ttaaaatgaa gtgggcggag 61 tgaatagtta attgaccgta ggcgtggttt gcaagtttgc cgaagccgga tgtgacgcgt 121 gtgggagccg ggcgcgccgg atgtgacgcg // LOCUS ADZITRR 150 bp ds-DNA VRL 29-JUN-1990 DEFINITION adenovirus 31 inverted terminal repeat & repl ori right end. ACCESSION J01959 KEYWORDS origin of replication; terminal repeat. SOURCE ad31 (adenovirus 31). 
ORGANISM Mastadenovirus 31 Viridae; ds-DNA nonenveloped viruses; Adenoviridae. REFERENCE 1 (bases 1 to 150) AUTHORS Stillman,B.W., Topp,W.C. and Engler,J.A. TITLE conserved sequences at the origin of adenovirus dna replication JOURNAL J. Virol. 44, 530-537 (1982) STANDARD simple staff_review BASE COUNT 40 a 28 c 47 g 35 t ORIGIN 1 catcatcaat aatatacctt acactggact tgagccaata ttaaaatgaa gtgggcggag 61 tgaatagtta attgaccgta ggcgtggttt gcaagtttgc cgaagccgga tgtgacgcgt 121 gtgggagccg ggcgcgccgg atgtgacgaa // LOCUS HSKGAORA 3726 bp ds-DNA VRL 29-JUN-1990 DEFINITION Herpesvirus of turkey glycoprotein A (gA) gene and ORF of unknown function. ACCESSION M33303 KEYWORDS glycoprotein A. SOURCE Herpesvirus of turkey DNA. ORGANISM Gallid herpesvirus type 2 Viridae; ds-DNA enveloped viruses; Herpesviridae; Gammaherpesviridae. REFERENCE 1 (bases 1 to 3726) AUTHORS Kato,A., Sato,I., Ihara,T., Ueda,S., Ishihama,A. and Hirai,K. TITLE Homologies between herpesvirus of turkey and Marek's disease virus type-1 DNAs within two co-linearly arranged open reading frames, one encoding glycoprotein A JOURNAL Gene 84, 399-405 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 207 1676 glycoprotein A pept 2118 2756 ORF2 BASE COUNT 1016 a 884 c 815 g 1011 t ORIGIN 1 tcgtgcgcat cgcccatggc tggcggaacg ctctttcgaa ccgtgaataa aactttgtat 61 ctactaaaca ataactttgt gttttattga gcggtcgaaa acaatgagga gctgcaattt 121 aaagctaacc gcatacgccg ggcgggtaaa gaccatttta taccatatta cgcatctatc 181 gaaacttgtt cgagaaccgc aagtatatgg tttccaacat gcgcgttcta cgcgtactgc 241 gcctgacggg atgggtgggc atatttctag ttctgtcttt acagcaaacc tcttgtgccg 301 gattgcccca taacgtcgat acccatcata tcctaacttt caacccttct cccatttcgg 361 ccgatggcgt tcctttgtca gaggtgccca attcgcctac gaccgaatta tctacaactg 421 tcgccaccaa gacagctgta ccgacgactg aaagcactag ttcctccgaa gcgcaccgca 481 actcttctca caaaatacct gatataatct gcgaccgaga agaagtattc gtattcctta 541 acaatacagg aagaattttg tgtgacctta tagtcgaccc cccttcagac gatgaatggt 601 ccaacttcgc tcttgacgtc 
acgttcaatc caatcgaata ccacgccaac gaaaagaatg 661 tagaggttgc ccgagtggcc ggtctatacg gagtaccggg gtcggattat gcatacccta 721 ggaaatcgga attaatatcc tccattcgac gggatcccca gggttctttc tggactagtc 781 ctacaccccg tggaaataaa tatttcatat ggattaataa aacaatgcac accatgggcg 841 tggaagttag aaatgtcgac tacaaagaca acggctactt tcaagtgata ctgcgtgata 901 gatttaatcg cccattggta gaaaaacata tttacatgcg tgtgtgccaa cgacccgcat 961 ccgtggatgt attggcccct ccagttctca gcggagaaaa ctacaaagca tcttgcatcg 1021 ttagacattt ttatcccccg ggatctgtct acgtatcttg gagacgtaac ggaaacattg 1081 ccacaccccg caaggaccgt gacgggagtt tttggtggtt cgaatctggc cgcggggcca 1141 cactagtatc cacaataacc ctcggaaact ctggactcga atctcctcca aaggtttcct 1201 gcttggtagc gtggaggcaa ggcgatatga taagcacatc gaatgctaca gctgtaccga 1261 cggtatatta tcacccccgt atctctctgg catttaaaga tgggtatgca atatgtacta 1321 tagaatgtgt tccctctggg attactgtga ggtggttagt tcatgatgaa ccccagccta 1381 acacaactta tgatactgtg gttacaggtc tctgcaggac catcgatcgt tatagaaatc 1441 tcgccagtcg gattccagtc caggacaact gggcgaaaac gaagtatacg tgcagactaa 1501 ttggatatcc gttcgacgtg gatagatttc aaaattccga atattatgat gcaacgccgt 1561 cggcaagagg aatgccgatg attgtaacaa ttacggccgt tctaggactg gccttgtttt 1621 taggtattgg tatcattatc acagccctat gcttttacct accggggcgg aattaagatt 1681 aaccatcgta tgtgatataa aaattattaa gtgttataac cgatcgcatt cttctgtttc 1741 gattcacaat aaataaaatg gtattgtaat cagcaccatc gcattgtttc gtagatgact 1801 catgttcagt ccgcgtgatg tcaaaaatac gtatttttgg tatcacgcag cggccaaaat 1861 gcccattatg ttatttttac tccaaacgcg gtatttaaaa catcgggacg tacatcatgt 1921 ggcgcacgtt aatcgtatac ggtgccgcta cattaaaaat cgcaagtctc cgaatatcaa 1981 gctcacggcc aaaacgtcgg taataatctt acgcatcgaa tgtgatacgg ataccgtaca 2041 atcgctgagt agatttccta tatagttact cagtagtgat acacaatcac aaaatcgctg 2101 gggtatatca tataagaatg atgtcgccca cccctgaaga tgatcgcgat ctcgttgtgg 2161 ttcgtggacg tctccgaatg atggatagcg gcacggaaac agatagagag caacgacatc 2221 cacgtacgac ttggcgatcg atctgttgtg ggtgtacgat aggaatggta tttaccatat 2281 tcgttctcgt agcggcagta ttgttgggat 
cactattcac tgtttcatac atggccatgg 2341 aatcgggaac atgtcccgat gaatggattg gtttgggtta tagttgcatg cgcgtggccg 2401 ggaaaaatgc aactgatctt gaggcgttgg atacatgtgc tcggcataac agcaaactta 2461 ttgacttcgc aaacgccaaa gttctggttg aagctatcgc cccattcggt gtgccaaatg 2521 cagcatatgg ggaagtcttc cggttaaggg acagcaaaac cacgtgtata cgacctacca 2581 tgggaggacc cgtgtcggca gactgtcctg taacatgtac cgttatatgt cagcgaccca 2641 ggcctctaag taccatgtct tccatcatta gagatgcccg cgtgtatctt catttagaac 2701 gacgcgatta ttatgaagtc tacgcctctg tcctctctaa tgcgatgagt aaataaaaac 2761 gcacctctaa cggttactgt gttttattta tccaatcaca ccatagacat tataaacata 2821 atatggatct ttatttcata taatgtatta tacagaccgc gcatcgcgtg taaaatatta 2881 ttcgctggac actttcagcc tacgtaaaat atgttctacc ttggacactg gtatatctcc 2941 acgcattgtt ttttcatgaa agtatttggt tgtcggagta gacaccctgc cagattccga 3001 aactggaggg tcattgctcg ccaaagaagg ccccccagac ccatcggtct tacgaattgc 3061 accagtcaga ccgtttccga tgttcgcaca ttgagaactc gaccccgccc ccgcaaattc 3121 agtcaaacca ggaaccgtat cgccacagcc acattcgaca cgatcgccta attgatgtgt 3181 gggcgtactt cccgacttgt tcgagttttt atcttgcttc cgaatagatt tcataagccg 3241 tgcattaaca tcgcttattc gctttggcag taattttatc cccagtagag ccagggcgtt 3301 cttagacagt ttaacatcat cggttgcagc gcggccgaac caagtattta tgctgcactc 3361 gagagccccc catgacacgg gcaacatgct cggaaacaaa ttcccaagct gatgttcaaa 3421 ttggtcttgc tgcaacaggg cctcttttaa tatagtattt tccgtattac cgtccaacca 3481 tatcacatat cccctctgag ttaaattaat taggtactga caatgatggt tgagaagtga 3541 aaccaacgaa accactgcta gtatggcacg tgagttccga tcaccatcaa agtccaccag 3601 gcgggtttgc gaccataata cgctggtttg tagtaagggg gcgatgacat tgcctataat 3661 tcgacaaagc aatgatgtcg aactccattt gtctgatggt aaggaggaaa tttcttccat 3721 gaattc // LOCUS HSKTK 2039 bp ds-DNA VRL 29-JUN-1990 DEFINITION Herpesvirus of turkey thymidine kinase gene, complete cds. ACCESSION M26659 KEYWORDS thymidine kinase. SOURCE Herpesvirus of turkey (strain FC126) DNA, passed in chicken embryo fibroblasts. 
ORGANISM Gallid herpesvirus type 2 Viridae; ds-DNA enveloped viruses; Herpesviridae; Gammaherpesviridae. REFERENCE 1 (bases 1 to 2039) AUTHORS Martin,S.L., Aparisio,D.I. and Bandyopadhyay,P.K. TITLE Genetic and biochemical characterization of the thymidine kinase gene from herpesvirus of turkeys JOURNAL J. Virol. 63, 2847-2852 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.L.Martin, 03-AUG-1989. FEATURES from to/span description pept 608 1540 thymidine kinase (EC 2.7.1.21) BASE COUNT 570 a 458 c 437 g 574 t ORIGIN 649 bp upstream of XbaI site. 1 atcgatgaaa gtctgtacat atagccttct tttttaatgg tgtcgaacgt ataactacaa 61 gcccgcgtct cgcgacaaac actaaaacgg ggcaaatgat agtatgatca gatcccctgg 121 gggccatgcg ctcgatcaac gttttagact ccagcaactg cttagtccct gttgcgcgtt 181 gctcattctt gctggccgtc ttgaggttgc ggtaaaatct gcaagttttc agttcgatta 241 tgacacagac attgttacag ccttccacca ataacccgcc agataaggtg aacatgcaaa 301 tgcaatctgg ccttctgcgc cccaggtcta cttcaaaggc caatcgaaac gccgacgccc 361 ctttaagtag ttcagaaggc aggatcttgc ttaatatctt ctgtaatgtt ctaccaactg 421 tattccccgc cgacgactgg tcaaagtttt cagcagcaag ctttttgtaa aaacgattat 481 gaccacggac acccgctttt agcaatcctg ccataaggtg gtttcgccgc gtgcttgcct 541 cgaagacaat tgccagctaa tccagcatta ccatatttcc ttggcttgca tttggatctg 601 cgcgtcgatg gcattgccga gaagaccgcc cacgttaacg cgagtttatc tagacggacc 661 gtttggtata ggcaaaacgt ctatactaaa cgctatgccc gaccacacgc ccgatggggc 721 tcctatattg aaagtgtacg aaccaatgaa atattggaga tgccagtcta ccgatttggt 781 ggtagctgcc aacgaaacgc cagaacgtag gcgtggtgga gctttatcac gattccaatc 841 tgacatgatc atggcatcta tacaagccag atttgccgat ccatatttgc tttttcacga 901 acggttatca tctaaatgta gaggaaaaat agaaatatgc gatactccag caattatatt 961 aatgctggat aggcaccctg tggcggcgat attatgtttc ccaatcactc gctatttact 1021 tggagaatat tctttggaaa tgttgattag ctctataata agacttccgt tggaatcccc 1081 cggatgcaac ctgacagtca caatccttcc cgacgaaaag gaacacgtta ataggatttg 1141 ttcaagagat agaccgggtg aaacggcaga tagaaatatg 
ctcagaacac tcaatgccgt 1201 atacgcatct ttggtggaca cggttaaata cgcaaatcta acatgccctt acgagaaaga 1261 aagctgggaa atggaatggt tgggacttcc ctggtttgaa gagtcattac ttgaagaatt 1321 catctcgcgt ccccgccctg ttatttgttc gagaactcga atgccgctgg accgaactct 1381 cctggccatt tttaaacgga aagagctgtg tagcgaaaat ggggagctgt taactcagta 1441 ttcttggata ttgtggggat tactgactaa actacacacc attaatgtcg aattatttga 1501 cattagcggt atgtcacgtc gagaatgcgc cacgctataa tgcatactat gccggagaga 1561 ttgtctactc tcgctagctg gaatgattta tgcgagcttg aagatgatgt aatttcctat 1621 aataagggaa tgtgtaacga ggttggagcg tctcgataat tcttcttaat ctgctggtat 1681 tggttactgc cataacttaa tattggtcca tgctagaata gtcatacgct acgatctgtt 1741 gctatatatg actatcgcca aactgttaaa ccgcgaagaa tatatttcat ataaacctaa 1801 gggcccctca gtctgatttt ttgtgaaaac gtgtatacat gaagttttac tgcataatcc 1861 gtttcatgat catagcgaat ctttattcat cttaccaaat atcgcttcca ggcacatatc 1921 catcgcaaat attgcttgac atgaagaact cgccgctcgt acgctttaat atatcgacgc 1981 gtgattataa agacgagaca ctctggatac ggaaaaattc gacatttgtt tatatcgat // LOCUS LACHSP 3095 bp ds-DNA BCT 29-JUN-1990 DEFINITION Plasmid pTR2030 (from L.lactis) aborts phage infection (hsp) gene, complete cds. ACCESSION M30192 KEYWORDS . SOURCE Plasmid pTR2030 (from Lactococcus lactis lactis (strain ME2)) DNA. ORGANISM Lactococcus lactis Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; Lactobacillaceae. REFERENCE 1 (bases 1 to 3095) AUTHORS Hill,C., Miller,L.A. and Klaenhammer,T.R. TITLE Sequence of HSP gene from pTR2030 JOURNAL Unpublished (1989) North Carolina State Univ., Raleigh, NC 27695 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.Hill, 29-NOV-1989. 
FEATURES from to/span description pept 1105 2991 hsp gene (aborts phage infection) BASE COUNT 1129 a 439 c 518 g 1009 t ORIGIN 1 ataagagtca gaatgtgacc gtcataatga ttacgaggat agctgcaacg cttggtttcg 61 agttaattgt cgtgcaacga ttgacggaga attgaaaaat ttcaaagttc acgatgttga 121 gacttacgat aaaaagaaaa atagatttca tcggagattg tctgatgcct tggtcccaat 181 aatatcttca gaagatgttg aatttgaagc cgagcaattt ttgaaattgt atttcccagt 241 agctatggaa attcctcagc gtattgaccc gcttttgata gctgaaaaaa tgggacttac 301 ggttgagtac catgaaattt cagaagatgg gaacattttt ggtcaaattt attttcatga 361 tgctcttctt gacgggaaag aaatcaaggc aaaaactatc ctgattgatc ctcgtgtcat 421 cgaaagccgt gggattggtg gacttaataa cacaataatg cacgagtgtg tccattggca 481 taaacatagg cttgcatttg aattggttag gcttttccaa ccagagcttt cgaatattac 541 cacaactaaa gaagagttcg atgggctcat tgagaaaaat atgactccaa cagattggct 601 ggagattcaa gctagaaaga ttacgcctaa aatcttgatg cctaagaaga tgtttaaaca 661 agaagttgaa acgtttatgc gtcctgatgg tggaagtgga atagtggacc agctcctcat 721 aatcgaggga actatttcag agcttgctag ttttttcacc gtatcaaaat tatctgcaaa 781 aattcggatg gttgagctgg gatatgagat tgcgataggt gcttcaaact atgttgatgg 841 gcatcctgta cccccacact cttggaagca gggagctgta tctgctaatc agacttattc 901 aattggtttt gttgatgcaa ctatagagac attaaagaac ccaagattac tcgtcgcaat 961 aaaaaaagaa cttaacctca tatttcatag ggattgaagt tcttgttttt tatttttggt 1021 agctaaaatg gtatccaatt aataattagc tattttattt tggtaaaata gaataaatag 1081 atttttagaa cagggagtag gtaaatgata actttgcaac accaagattg ggaacgtgca 1141 gtcaatatga ttaaaaacat tccaccaagt gccaaaaata aatactttca aacttttcct 1201 ttctttttgt taagtgaaac cagctgggaa gagttacttt ctgaaaattt tttctactca 1261 tatataaagt caggagagtt tctgacttat caagaaaacc tttcattcta cgataggaca 1321 attcaaaaga gccatggagc ataccgacaa actcgaattg tatctccaat catctatata 1381 tttttaatag caattgcttc tcaagtggaa agaatatatg ttgaaaagcg tactaatgat 1441 atgtcagtat atttttcagg ttcttttgaa aaagaaaaaa acactgcaca ttataaacaa 1501 agttataaca cttatatgac tgaacttaat gcctgtcaag aagagtttga ttattatttt 1561 caaacagatt tttcaacatt ttttcatcta 
gtagatacag acaacttatt caataaaata 1621 gaccgattag atccgaaaag tgcattagtc tactcttccc taataaaaat gattggacaa 1681 ggaagaatgc ccatagttga tggaaattca ggtctttctt ttttaaatac tgtagtttat 1741 ctagatgact ttgataaaga aattattgat tccttaaaaa cgatagttga aatcgaaagt 1801 tttaaactag ttcgatatgt agatgactta catattttca taaaatgtgc aaataaagat 1861 ttagattttt taaattataa agtatataat ttgttatgcg aaaaagcaac gaaacatcat 1921 cttgagataa attcttcaaa aacaaaatca tttactccca caagtgaact aagcactaaa 1981 atgaatacag atttgtataa tttctttgtg tataacgaag atgttgattt tgagcaatat 2041 ttttcaaaaa ataccttaat agaatttcta gataaattaa ataatatgtc tgttaatgct 2101 gatttttcag agtacgaaaa agaagtcctg tatacattgg aaaaccctga aatagtatct 2161 gacggaagtt atatattaaa cgctattgtt tacaataaaa gtacgtggtc acaggattat 2221 gatataaaaa ataaaattag tttattagta aatagtaatt atagaaaact aagatattct 2281 gcaaaagcac tcataactct tgtcttgaat acaagagatg gtgatatcat aaaaggacta 2341 cttaataatt tatttactac attcaaaaat ggtacaaatg atattattga tgaaattata 2401 ttaattgaat acttggtaca aagaaagttt aatcataaag atttaatgac tatcttgaaa 2461 gctgacgacc atggtattaa ggaatatatt aaggcatatc aaacttctga tttcataaaa 2521 agtctagaaa aaaataaagt tattttttat actaatcaga aagaagttta tcctttaata 2581 agtaaggata aaatactcaa tttcatatat tttagagcta agtattttga atccttagac 2641 ctagttcttg aatcctttgc atattataaa aactattttg ataggtttgt tgctcacgct 2701 atgttctgta caggaattga ctctggaaga aaacctaatt ataagctata ttatacagaa 2761 ggaaagttaa tagatgggtt aaaacaacta aattttttat cttctgatga gattacaaaa 2821 ataattaatg aagcacataa aattagaaat agtaaccccg taagtcactc tagtgctggc 2881 ttacttcaaa acgaagattt tagtagatat agagttaagt caagtcttaa tgacttgaaa 2941 attatcatag aacaactttc aacattgcta caaaataaaa acagattata aaatattatt 3001 atctgtttta tttttttcaa aagaatcaaa gtttattaga tttttcaact gtcctgttat 3061 ctcctcatcc gcatttggat aaagatgact ataaa // LOCUS PH15RER 2016 bp ds-DNA PHG 29-JUN-1990 DEFINITION Bacteriophage phi-15 right early region containing genes 17, 16.5-16.9, complete cds. ACCESSION M28830 KEYWORDS DNA replication protein. 
SOURCE Bacteriophage phi-15 DNA. ORGANISM Bacteriophage phi-15 Viridae; ds-DNA nonenveloped viruses; Podoviridae. REFERENCE 1 (bases 1 to 2016) AUTHORS Benes,V., Arnold,L., Smrt,J. and Paces,V. TITLE Nucleotide sequence of the right early region of Bacillus phage phi-15 and comparison with related phages: Reorganization of gene 17 during evolution JOURNAL Gene 75, 341-347 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 203 664 gene 17 peptide pept 666 992 gene 16.9 peptide (put.) pept 989 1306 gene 16.8 peptide (put.) pept 1308 1700 gene 16.7 peptide (put.) pept 1697 1861 gene 16.6 peptide (put.) pept 1858 1971 gene 16.5 peptide (put.) BASE COUNT 744 a 374 c 382 g 516 t ORIGIN 1 aaagtagggt acagcgacaa cataccccat ttccccattg accgactatc ttcgacaaca 61 atctaacaac taaaacacaa ctatatacct atactaaata acaacatcaa tttgtcgaaa 121 agggtagaca aactactgtt taacatggta tactataata gaagtaagga agtaagacaa 181 ccaatcatag gaggaattac acatgaacaa ctaccaatta actattaacg aggtaatcga 241 gatcatcaac cgcaatacag acattaataa gttagtcgct aaaaaggaca acttatatcc 301 aacagatttg tatgacttag acaaacaaca attaatcgct atcattctca acagcgattt 361 cgcattatct agcatcaaaa gagcattatt agaggtaact gtcgaagagc taagagaaca 421 ggacaatgat gacgatttag atgaaataga taacgagtta tacgagggcg cagaggcggc 481 tgatgttcca cgtgaaacaa tagtgaaagt attcgaggca gacaaatcaa tagtaacgtt 541 caacggtgag aaactaaaac attatgttaa tgttgatgac aatagctcat ctgttgatga 601 agtaaagaag atcgccaagg aaatatcaga acatgacttt aacgaaaaca aggagaacaa 661 gtaacatgag cgtacaactt aatgcattta cattcatctt agaacgccgt ggatggcgca 721 tggtatgcta cgaacaatta acaacgaacg ggacacgcat attacatttc tatctcaaag 781 ataacccaac gttctttgct acatactcat cacaattctt atcagacacc aaaatgataa 841 gacgctttgc atcatggagc gggcaattac ttgaaggttc aaactctgtg ttctggacta 901 acataacacc atttgagcca attgatgaag aaacagcaga agacatcaag aatctcgaca 961 aagtcgttga aggaatgaat tttacgttat gattgacatt attgtaaaag aggacaagcg 1021 cttaataact gttcaaacac cagagggaga cgaagtattt tacactttgt ctttctcaga 1081 cgagcataag ttactaaaac 
gatcaagtgc cagactaaga aacaacatat atgccattgg 1141 tgtagcaaat atcagatggg tgttagttga catggataac atgatactta gtgagtatat 1201 gcatcacgtt gacatcctaa aagacataga caggaaaatg agacaattag ggtatatagt 1261 catttcagaa tggcaacacg caaataaaaa agggacgagg agataatatg gaagctattt 1321 tgatgatcgg tgtaatcaca ttgtgcgtta tattcctttt atcaggacgc aataataaaa 1381 agaaacaaga aataagggag ctagaagatt accttgaaga cctcaatcaa agaatcgtgc 1441 aacgaacaca gatacttagt gagcttaacg aagtcatcac aaacagaagt gttgataagt 1501 cagtcaatat gtcagcttgt gaaatcgccg tccttgacct ctatgagcag tcaaatatcc 1561 gcattcctag tgatatcatc gaagatatgg ttaatcaacg tttacaatca gaacaagatg 1621 tgttaaacta tatagagaca cagcggacat actggaaact ggagaatcag aaaaaactat 1681 atcgggggtc attgaaatga agttgctcac tcacacgtgt cattactgta gcttctcttt 1741 cttcactcga aagtttgatg tatttggtgc aataaccaag aaagatactc ctgttgtttt 1801 ctgccccgca tgtgggaatc aatccctttc agtatcacac attgaggagg agatcatatg 1861 aatcagaaag aattccaagc cgttttagac tggatgttgt cacacactat tatacaattc 1921 cacgaataca actatatgct acaaaagagc ctaccgttcc tcagacgata ggcttttctt 1981 ttgattgttt cacgtggaac atcactgtat acgcat // LOCUS STLENV 4122 bp ss-RNA VRL 29-JUN-1990 DEFINITION Simian T-cell leukemia virus, pol-env-pX-3' LTR region. ACCESSION M11373 M11374 KEYWORDS env gene; pol gene. SOURCE Simian T-cell leukaemia virus genomic RNA. ORGANISM Simian T-cell lymphotropic virus type I Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 4122) AUTHORS Watanabe,T., Seiki,M., Tsujimoto,H., Miyoshi,I., Hayami,M. and Yoshida,M. 
TITLE Sequence homology of the simian retrovirus genome with human T-cell leukemia virus type I JOURNAL Virology 144, 59-65 (1985) STANDARD simple staff_entry FEATURES from to/span description pept 311 1777 env protein BASE COUNT 905 a 1456 c 741 g 1020 t ORIGIN 1 catttaaacg tattaaccca ttgccacaaa acccgatggc aacttcacca ctccctccgt 61 ctgcaaccta tcccagagac acactccccc atcagtaaac aaacccattg gtattatttc 121 aagcttcctg gtcttaatag ccgccagtgg aaaggaccac aggaagctct ccaagaagcc 181 gccggcgctg ctctcattcc ggtaagcgct aattctgccc agtggatccc gtggaggctc 241 ctcaagcaag ctgcatgccc aagacccgtc gggggccccg ccgatcccaa agaaaaagac 301 caccaacacc atgggtaagt ttcttaccac cttactttta ttcctccaac tccgcccccc 361 cattgtctgt gatcacagtc ccagctgctg cactctcaca gtcggagtct cctcatatca 421 ttctaaaccc tgcaatcccg cccaaccagt ctgctcatgg accctcgacc tgctagctct 481 ttcagcagac caagccctac aacccccctg ccctaatctg gtgggttact ccaactacca 541 tgctacctat tccctatatc tattccctca ttggattaaa aagccaaacc gaaatggcgg 601 aggctattat tcggcctctt attcagaccc ttgttcccta aagtgccctt acctaggatg 661 ccaatcatgg acctgtccct acacaggagc cgtctccagc ccctattgga aatatcagca 721 agatgtcaat ttcactcaag aagtctcacg cctcagcctt aatctccact tttcaaaatg 781 cggtttctcc ttctcccttc tggtcgatgc actaggatac gaccccatct ggttccttaa 841 caccgaacct agccaactgc cccccactgc ccttcctcta ctcccccatt ccaacttgga 901 ccacatcctc gagccttcca taccatggaa gtcaaaactc ctaactcttg tccaattaac 961 tctacaaagc actaactaca cttgtattgt ctgcatagat cgtgccagct tatctacttg 1021 gcacgtctta tactctccca acatctctgt tccatcctcc ccctctactc ccctccttta 1081 cccatcgtta gcgcttccag ccccccatct gacattacca ttcaactgga ctcactactt 1141 tgaccctcag attcaagcca tagtctcctc cccctgtcac aactccctca tcttgccccc 1201 cttttctctg tcacctgttc ccgccccagg gtcccgttcc cgccgggcag taccggtggc 1261 ggtctggctc gtctctgccc tggccatggg aaccggagtt gctggcggga ttaccggctc 1321 catgtccctc gcctcaggaa aaagtctcct acatgaagta gacaaagata tttcccaatt 1381 gacccaagca attgtcaaaa accacaaaaa tctactcaag attgcacaat atgctgccca 1441 aaacagacga ggccttgatc tcctgttttg ggagcaagga 
ggactatgta aagcactgca 1501 agaacaatgt tgtttcctaa atattactaa ttcccatgtc tcaatacttc aagaaagacc 1561 cccccttgaa aatcgagttc taactggttg gggcctcaac tgggaccttg gcctctcgca 1621 atgggcccga gaggccttac aaactggaat cacccttgtc gcattactcc ttctggtcat 1681 ccttgtaggg ccatgcatcc ttcgccaact acgacaactc ccctcgcgtg tcagataccc 1741 ccactactct cttataaacc ccgagtcctc cctataaacc tagcagttac tgcgaccaca 1801 ttgcttgcag cctcccctgc caatagttaa cctcccccat caaatcctcc ttttcctgca 1861 acaacctctt ccgttcggcc tccaaggagt ctacctcact ctccagctgt ctagcataac 1921 catcaactcc caactccttt attttttctt tcctagcatc acgctgtctc gccctcttag 1981 ccccttgttc tgccagtctc cgcttgcgct caagccgctt ctcctcctcc cgcctcctgc 2041 gccgagcctt cttctcttcc ttccttttca ggaactcggc aatctgttcc tcctcctctt 2101 tctcccgctc tctttttcgc ttcctcttct cctcagcccg tcgctgccga tcacgatgcg 2161 tccccccgcg aggtggcgtc tcctccctta gagggacccg ttgctgctgg ccattacatt 2221 cctcttcgag ggatagtaaa ccgtcaagca cagactcctc ctccgtcctt tgtttgttta 2281 agtcctcttc tagggataat agtccgtcca ccaagtcttc caccagcagg tcctccgggc 2341 agggaacagg taaacatcga aacggcccta cacatacaaa attaatcata cttattatca 2401 gcccacttcc caggttttgg acagagcctt ctttatggat acccagtcta cgtatttgga 2461 aactgtgtgc aaggcgactg gtgccccatc tctgggggac tatgctcggc ccgcctgcac 2521 cgtcacgccc tactggccac ctgtccagag catcagatca cctgggaccc caccgatgaa 2581 cgcgttatcg gctcagctct gcagttcctt atccctcgac tcccctccct ccccacccaa 2641 agaacctcta agaccctcaa ggtccttacc ccgccggcta ctcatacaac ccccaacatt 2701 ccaccctcct tcttccaggc cgtacgaaaa tactccccct tccgaaacgg atacatggag 2761 cccacccttg ggcaacaact cccaaccctg tcctttcccg accccggcct ccggccccaa 2821 aacttgtata ccctctgggg aaactccgtt gtctgcatgt acctctacca actttccccc 2881 cccatcacct ggcccctcct gccccatgta attttctgcc acccaggcca acttggggcc 2941 tttctcacca acgttcccta caaacgaatg gaagaacttc tctataaaat tttccttaat 3001 acaggggcca caatcattct acccgaggac tgcttgccca ccaccctatt ccgacccact 3061 agggcacccg cctcgctgac agccaggcaa aatggcctcc ttccgttcca gtcaaccctt 3121 actacgccag gccttatttg gacatttacc gatggcacgc ccatggtttc 
cgggccctgc 3181 cctagggacg gtcagccatc tttggtgttg cagtcctcct cctttatatt tcacaaattt 3241 caaaccaagg cctaccatcc ctctttccta ctctcacatg gcctcataca gtactcgtcc 3301 tttcataact tacatctcct ctttgaagag tacaccaata ttcccatttc tttacttttt 3361 aacaaagaag aggcaaatga tactgaccat gagccccaaa tgctccccgg gggcctaaaa 3421 cctcccaatg gaaaacattt ccgcgaaaca gatgtctgaa aaaggtcagg gcccagacta 3481 gggctctgac gtctcccccc ggagggcagc tcaacacaag gccctgacgt gtccccctgg 3541 ggacaaatca tgagcccaga cctccgggaa gccaccgaga accacccatt tcctccccat 3601 gtttgtcaag ccgtccccag gcgttgacga caacccctcg cctcaaaaac ttttcatggc 3661 acgcatacag ctcaataaaa caacaggagt ctataaaagc gtggagacag ttcaggaggg 3721 ggctagcttc tgctctacac gcgcccgccg ccctacctga ggccgccatc cacaccggtt 3781 gagttgcgtt ctgccgcctc ccgcctgtgg tgcctcctga actgcgaccg ccgtctgggt 3841 aagttcggag ctcgggtcga gaccgggcct ttgtccggcg ctcccttgga gcctacctag 3901 attcggccgg ctctccacgc tttgcctgac cctgcttgct taactccaca tctttgtttc 3961 gttttctgtt ccgcgccgct gcagatcgaa agttccaccc ctttcccttt cattcacgtc 4021 tgactgccgg cttggcccac ggccaagcac cggcatcctc actggctcgg agccagtagc 4081 agcccatttt acaactctct ccagagagaa gtttagtaca ca // LOCUS FIVCG 9474 bp ss-RNA VRL 29-JUN-1990 DEFINITION Feline immunodeficiency virus, Petaluma strain, complete genome. ACCESSION M25381 M25729 KEYWORDS complete genome; env gene; gag gene; pol gene. SOURCE Feline immunodeficiency virus, Petaluma strain, clones FIV-14 [1], [3], and 34TF10 [2]. REFERENCE 1 (bases 1 to 355) AUTHORS Olmstead,R.A., Barnes,A.K., Yamamoto,J.K., Hirsch,V.M., Purcell,R.H. and Johnson,P.R. TITLE Molecular cloning of feline immunodeficiency virus JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 2448-2452 (1989) STANDARD full automatic REFERENCE 2 (bases 1 to 9472) AUTHORS Talbott,R.L., Sparger,E.E., Lovelace,K.M., Fitch,W.M., Pedersen,N.C., Luciw,P.A. and Elder,J.H. TITLE Nucleotide sequence and genomic organization of feline immunodeficiency virus JOURNAL Proc. Natl. Acad. Sci. U.S.A.
86, 5743-5747 (1989) STANDARD full automatic REFERENCE 3 (bases 1 to 9474) AUTHORS Olmstead,R.A., Hirsch,V.M., Purcell,R.H. and Johnson,P.R. TITLE Nucleotide sequence analysis of feline immunodeficiency virus: genome organization and relationship to other lentiviruses JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 8088-8096 (1989) STANDARD full automatic REFERENCE 4 (bases 7175 to 7182; revision) AUTHORS Elder,J.H. and Scripps Institute LaJolla,CA. JOURNAL Unpublished (1990) STANDARD full automatic COMMENT [2] sequence kindly provided in computer-readable form by John Elder, Scripps Institute, La Jolla, CA. [3] sequence kindly provided in computer-readable form by Robert Olmstead, NIAID, NIH, Bethesda, MD. [1] contains the LTR sequence from the complete genome in [3]. The sequence from [3] is shown (with differences from [2]); the substantive difference from [2] is at position 6123 where the latter possesses an early stop codon in orf2, which is thought to encode the transactivator protein. Clone 34TF10 is, nevertheless, biologically active as described in [2]. A difference earlier reported at 7175-7182 turns out to have been a typo [2,4]. Orf1 is weakly homologous (PLSEARCH) with the Q orf of visna and has no detectable similarity to the vif protein of HIV and SIV. This is consistent with the clustering of FIV with the ungulate lentiviruses [2],[3]. Several "urfs" are identified by [2] and [3] of which only orf3, possibly analogous to the L orf of visna, is annotated below; a small, possibly associated, orf is found at 8955-9170. For a fuller discussion of the reading frames, see [2] and [3]. 
FEATURES from to/span description pept 628 1980 gag polyprotein pept / 1869 5243 pol polyprotein (NH2 terminus uncertain) (AA at 1869) pept 5236 5991 orf1 pept 5992 6228 orf2 pept 6266 8836 env polyprotein pept 6712 6918 orf3 LTR 1 355 5' LTR LTR 9120 9474 3' LTR rpt 217 281 R repeat 5' copy rpt 9336 9400 R repeat 3' copy binding 357 377 primer binding site variant 12 12 t in [3]; c in [2] variant 122 122 c in [3]; t in [2] variant 146 146 g in [3]; t in [2] variant 373 374 ga in [3]; a in [2] variant 498 501 ggac in [3]; agca in [2] variant 1575 1575 g in [3]; a in [2] variant 1840 1840 g in [3]; c in [2] variant 1977 1977 a in [3]; g in [2] variant 2028 2028 g in [3]; a in [2] variant 2033 2033 c in [3]; t in [2] variant 2288 2288 a in [3]; g in [2] variant 2464 2464 g in [3]; a in [2] variant 2490 2490 c in [3]; t in [2] variant 2580 2580 c in [3]; t in [2] variant 2631 2631 t in [3]; c in [2] variant 2735 2735 a in [3]; g in [2] variant 3733 3733 a in [3]; g in [2] variant 3743 3743 a in [3]; t in [2] variant 3812 3812 g in [3]; a in [2] variant 4202 4202 g in [3]; a in [2] variant 4211 4211 c in [3]; t in [2] variant 4271 4271 c in [3]; t in [2] variant 4493 4493 a in [3]; g in [2] variant 4540 4540 g in [3]; t in [2] variant 4706 4706 c in [3]; t in [2] variant 4891 4891 g in [3]; a in [2] variant 5332 5332 g in [3]; a in [2] variant 5497 5497 t in [3]; c in [2] variant 5559 5559 t in [3]; c in [2] variant 5986 5986 t in [3]; c in [2] variant 6000 6000 c in [3]; a in [2] variant 6123 6123 g in [3]; a in [2] variant 6149 6149 t in [3]; c in [2] variant 6179 6179 a in [3]; g in [2] variant 6238 6238 g in [3]; t in [2] variant 6245 6248 attt in [3]; ttt in [2] variant 6306 6306 t in [3]; c in [2] variant 6430 6430 c in [3]; t in [2] variant 6548 6548 a in [3]; c in [2] variant 6739 6739 c in [3]; t in [2] variant 6753 6753 t in [3]; c in [2] variant 6758 6758 a in [3]; g in [2] variant 6764 6764 a in [3]; g in [2] variant 6779 6779 a in [3]; g in [2] variant 
6782 6783 aa in [3]; gg in [2] variant 7175 7182 acaggagg in [3]; caggagga in [2] variant 7381 7381 a in [3]; g in [2] variant 7484 7484 g in [3]; a in [2] variant 7512 7513 ag in [3]; ca in [2] variant 7618 7618 a in [3]; c in [2] variant 7625 7625 g in [3]; a in [2] variant 7670 7670 a in [3]; g in [2] variant 7704 7708 aaaaa in [3]; caaat in [2] variant 7750 7750 t in [3]; c in [2] variant 7808 7810 atg in [3]; gta in [2] variant 7824 7828 aaaag in [3]; caaaa in [2] variant 7996 7996 g in [3]; a in [2] variant 8054 8054 g in [3]; c in [2] variant 8129 8129 g in [3]; a in [2] variant 8222 8222 g in [3]; a in [2] variant 8238 8238 a in [3]; g in [2] variant 8329 8329 g in [3]; a in [2] variant 8377 8377 t in [3]; c in [2] variant 8394 8394 c in [3]; t in [2] variant 8548 8550 gaa in [3]; aac in [2] variant 8588 8588 g in [3]; a in [2] variant 9131 9131 t in [3]; c in [2] variant 9241 9241 c in [3]; t in [2] variant 9265 9265 g in [3]; t in [2] BASE COUNT 3571 a 1370 c 2119 g 2414 t ORIGIN 5' terminus of 5' LTR 1 tgggatgagt attggaaccc tgaagaaata gaaagaatgc ttatggacta gggactgttt 61 acgaacaaat gataaaagga aatagctgag catgactcat agttaaagcg ctagcagctg 121 cctaaccgca aaaccacatc ctatggaaag cttgctaatg acgtataagt tgttccattg 181 taagagtata taaccagtgc tttgtgaaac ttcgaggagt ctctttgttg aggacttttg 241 agttctccct tgaggctccc acagatacaa taaatatttg agattgaacc ctgtcgagta 301 tctgtgtaat cttttttacc tgtgaggtct cggaatccgg gccgagaact tcgcagttgg 361 cgcccgaaca gggacttgat tgagagtgat tgaggaagtg aagctagagc aatagaaagc 421 tgttaagcag aactcctgct gacctaaata gggaagcagt agcagacgct gctaacagtg 481 agtatctcta gtgaagcgga ctcgagctca taatcaagtc attgtttaaa ggcccagata 541 aattacatct ggtgactctt cgcggacctt caagccagga gattcgccga gggacagtca 601 acaaggtagg agagattcta cagcaacatg gggaatggac aggggcgaga ttggaaaatg 661 gccattaaga gatgtagtaa tgttgctgta ggagtagggg ggaagagtaa aaaatttgga 721 gaagggaatt tcagatgggc cattagaatg gctaatgtat ctacaggacg agaacctggt 781 gatataccag agactttaga tcaactaagg ttggttattt gcgatttaca 
agaaagaaga 841 gaaaaatttg gatctagcaa agaaattgat atggcaattg tgacattaaa agtctttgcg 901 gtagcaggac ttttaaatat gacggtgtct actgctgctg cagctgaaaa tatgtattct 961 caaatgggat tagacactag gccatctatg aaagaagcag gtggaaaaga ggaaggccct 1021 ccacaggcat atcctattca aacagtaaat ggagtaccac aatatgtagc acttgaccca 1081 aaaatggtgt ccatttttat ggaaaaggca agagaaggac taggaggtga ggaagttcaa 1141 ctatggttta ctgccttctc tgcaaattta acacctactg acatggccac attaataatg 1201 gccgcaccag ggtgcgctgc agataaagaa atattggatg aaagcttaaa gcaactgaca 1261 gcagaatatg atcgcacaca tccccctgat gctcccagac cattacccta ttttactgca 1321 gcagaaatta tgggtatagg attaactcaa gaacaacaag cagaagcaag atttgcacca 1381 gctaggatgc agtgtagagc atggtatctc gaggcattag gaaaattggc tgccataaaa 1441 gctaagtctc ctcgagctgt gcagttaaga caaggagcta aggaagatta ttcatccttt 1501 atagacagat tgtttgccca aatagatcaa gaacaaaata cagctgaagt taagttatat 1561 ttaaaacagt cattgagcat agctaatgct aatgcagact gtaaaaaggc aatgagccac 1621 cttaagccag aaagtaccct agaagaaaag ttgagagctt gtcaagaaat aggctcacca 1681 ggatataaaa tgcaactctt ggcagaagct cttacaaaag ttcaagtagt gcaatcaaaa 1741 ggatcaggac cagtgtgttt taattgtaaa aaaccaggac atctagcaag acaatgtaga 1801 gaagtgaaaa aatgtaataa atgtggaaaa cctggtcatg tagctgccaa atgttggcaa 1861 ggaaatagaa agaattcggg aaactggaag gcggggcgag ctgcagcccc agtgaatcaa 1921 atgcagcaag cagtaatgcc atctgcacct ccaatggagg agaaactatt ggatttataa 1981 attataataa agtaggtact actacaacat tagaaaagag gccagaaata ctcatatttg 2041 taaatggata tcctataaaa tttttattag acacaggagc agatataaca attttaaata 2101 ggagagattt tcaagtaaaa aattctatag aaaatggaag gcaaaatatg attggagtag 2161 gaggaggaaa gagaggaaca aattatatta atgtacattt agagattaga gatgaaaatt 2221 ataagacaca atgtatattt ggtaatgttt gtgtcttaga agataactca ttaatacaac 2281 cattattagg gagagataat atgattaaat tcaatattag gttagtaatg gctcaaattt 2341 ctgataagat tccagtagta aaagtaaaaa tgaaggatcc taataaagga cctcaaataa 2401 aacaatggcc attaacaaat gaaaaaattg aagccttaac agaaatagta gaaagactag 2461 aaagagaagg gaaagtaaaa agagcagatc caaataatcc atggaataca ccagtatttg 
2521 ctataaaaaa gaaaagtgga aaatggagaa tgctcataga ttttagagaa ttaaacaaac 2581 taactgagaa aggagcagag gtccagttgg gactacctca tcctgctggt ttacaaataa 2641 aaaaacaagt aacagtatta gatatagggg atgcatattt caccattcct cttgatccag 2701 attatgctcc ttatacagca tttactttac ctagaaaaaa taatgcggga ccaggaagga 2761 gatttgtgtg gtgtagtcta ccacaaggct ggattttaag tccattgata tatcaaagta 2821 cattagataa tataatacaa ccttttatta gacaaaatcc tcaattagat atttaccaat 2881 atatggatga catttatata ggatcaaatt taagtaaaaa ggagcataaa gaaaaggtag 2941 aagaattaag aaaattacta ttatggtggg gatttgaaac tccagaagat aaattacagg 3001 aagaaccccc atatacatgg atgggttatg aattacatcc attaacatgg acaatacaac 3061 agaaacagtt agacattcca gaacagccca ctctaaatga gttgcaaaaa ttagcaggaa 3121 aaattaattg ggctagccaa gctattccag acttgagtat aaaagcatta actaacatga 3181 tgagaggaaa tcaaaaccta aattcaacaa gacaatggac taaagaagct cgactggaag 3241 tacaaaaggc aaaaaaggct atagaagaac aagtacaact aggatactat gaccccagta 3301 aggagttata tgctaaatta agtttggtgg gaccacatca aataagttat caagtatatc 3361 agaaggatcc agaaaagata ctatggtatg gaaaaatgag tagacaaaag aaaaaggcag 3421 aaaatacatg tgatatagcc ttaagagcat gctataagat aagagaagag tctattataa 3481 gaataggaaa agaaccaaga tatgaaatac ctacttctag agaagcctgg gaatcaaatt 3541 taattaattc accatatctt aaggccccac ctcctgaggt agaatatatc catgctgctt 3601 tgaatataaa gagagcgtta agtatgataa aagatgctcc aataccagga gcagaaacat 3661 ggtatataga tggaggtaga aagctaggaa aagcagcaaa agcagcctat tggacagata 3721 caggaaagtg gcaagtgatg gaattagaag gcagtaatca gaaggcagaa atacaagcat 3781 tattattggc attaaaagca ggatcagagg agatgaatat tataacagat tcacaatatg 3841 ttataaatat tattcttcaa caaccagata tgatggaggg aatctggcaa gaagttttag 3901 aagaattgga gaagaaaaca gcaatattta tagattgggt cccaggacat aaaggtattc 3961 caggaaatga ggaagtagat aagctttgtc aaacaatgat gataatagaa ggggatggga 4021 tattagataa aaggtcagaa gatgcaggat atgatttatt agctgcaaaa gaaatacatt 4081 tattgccagg agaggtaaaa gtaataccaa caggggtaaa gctaatgttg cctaaaggat 4141 attggggatt aataatagga aaaagctcga tagggagtaa aggattggat gtattaggag 4201 
gggtaataga cgaaggatat cgaggtgaaa ttggagtaat aatgattaat gtatcaagaa 4261 aatcaatcac cttaatggaa cgacaaaaga tagcacaatt aataatattg ccttgtaaac 4321 atgaagtatt agaacaagga aaagtagtaa tggattcaga gagaggagac aatggttatg 4381 ggtcaacagg agtattctcc tcttgggttg acagaattga ggaagcagaa ataaatcatg 4441 aaaaatttca ctcagatcca cagtacttaa ggactgaatt taatttacct aaaatggtag 4501 cagaagagat aagacgaaaa tgcccagtat gcagaatcag aggagaacaa gtgggaggac 4561 aattgaaaat agggcctggt atctggcaaa tggattgcac acactttgat ggcaaaataa 4621 ttcttgtggg tatacatgtg gaatcaggat atatatgggc acaaataatt tctcaagaaa 4681 ctgctgactg tacagttaaa gctgtcttac aattgttgag tgctcataat gttactgaat 4741 tacaaacaga taatggacca aattttaaaa atcaaaagat ggaaggagta ctcaattaca 4801 tgggtgtgaa acataagttt ggtatcccag ggaacccaca gtcacaagca ttagttgaaa 4861 atgtaaatca tacattaaaa gtttggattc ggaaattttt gcctgaaaca acctccttgg 4921 ataatgcctt atctctcgct gtacatagtc tcaattttaa aagaagaggt aggataggag 4981 ggatggcccc ttatgaatta ttagcacaac aagaatcctt aagaatacaa gattattttt 5041 ctgcaatacc acaaaaattg caagcacagt ggatttatta taaagatcaa aaagataaga 5101 aatggaaagg accaatgaga gtagaatact ggggacaggg atcagtatta ttaaaggatg 5161 aagagaaggg atattttctt atacctagga gacacataag gagagttcca gaaccctgcg 5221 ctcttcctga aggggatgag tgaagaagat tggcaggtaa gtagaagact ctttgcagtg 5281 ctccaaggag gagtaaatag cgctatgcta tacatatcta ggctacctcc ggatgaaaga 5341 gaaaagtata aaaaagactt caagaaaaga ctttttgaca cagaaacagg atttataaag 5401 agactacgga aagctgaagg aataaaatgg agctttcata ctagagatta ttacatagga 5461 tatgtcagag aaatggtggc aggatccact acatcattaa gtctaaggat gtatatatat 5521 ataagtaacc cactatggca ttctcagtat cgtccaggtt tgaaaaattt caataaggaa 5581 tggccttttg taaatatgtg gataaaaaca ggatttatgt gggatgatat tgaaaaacaa 5641 aatatttgta taggaggaga agtttcacca ggatggggac cagggatggt aggtatagca 5701 ataaaagctt ttagttgtgg cgaaagaaag attgaggcta ctcctgtaat gattataaga 5761 ggagaaatag atccaaaaaa atggtgcgga gattgttgga atttaatgtg tcttagaaac 5821 tcacctccaa agactttaca aagactcgct atgttggcgt gtggcgtgcc ggctaagaag 5881 tggcgaggat 
gctgtaatca acgctttgtt tctccttaca gaacgcctgc tgatttagag 5941 gtcattcaat ccaagcccag ctggaacctg ttatggtcgg gagaattatg aatggaagac 6001 ataatagtat tattcaatag ggtcactgag aaactagaaa aagaattagc tatcagaata 6061 tttgtattag cacatcaatt agaaagggac aaagctatta gattactaca aggattattt 6121 tggagatata gatttaagaa accccgagta gattattgtt tatgttggtg gtgttgcaaa 6181 ttctattatt ggcagttgca atctacatta tcaataacta ctgcttagaa atatttagat 6241 taatatttca tttgcaacaa taagaatggc agaaggattt gcagccaata gacaatggat 6301 aggactagaa gaagctgaag agttattaga ttttgatata gcaacacaaa tgagtgaaga 6361 aggaccacta aatccaggag taaacccatt tagggtacct ggaataacag aaaaagaaaa 6421 gcaaaactac tgtaacatat tacaacctaa gttacaagat ctaaggaacg aaattcaaga 6481 ggtaaaactg gaagaaggaa atgcaggtaa gtttagaaga gcaagatttt taaggtattc 6541 tgatgaaagt gtattgtccc tggttcatgc gttcatagga tattgtatat atttaggtaa 6601 tcgaaataag ttaggatctt taagacatga cattgatata gaagcacccc aagaagagtg 6661 ttataataat agagagaagg gtacaactga caatataaaa tatggtagac gatgttgcct 6721 aggaacggtg actttgtacc tgattttatt tataggaata ataatatatt cacagacaac 6781 caacgctcag gtagtatgga gacttccacc attagtagtc ccagtagaag aatcagaaat 6841 aattttttgg gattgttggg caccagaaga acccgcctgt caggactttc ttggggcaat 6901 gatacatcta aaagctaaga caaatataag tatacgagag ggacctacct tggggaattg 6961 ggctagagaa atatgggcaa cattattcaa aaaggctact agacaatgta gaagaggcag 7021 aatatggaaa agatggaatg agactataac aggaccatca ggatgtgcta ataacacatg 7081 ttataatgtt tcagtaatag tacctgatta tcagtgttat ttagatagag tagatacttg 7141 gttacaaggg aaaataaata tatcattatg tctaacagga ggaaaaatgt tgtacaataa 7201 agttacaaaa caattaagct attgtacaga cccattacaa atcccactga tcaattatac 7261 atttggacct aatcaaacat gtatgtggaa tacttcacaa attcaggacc ctgaaatacc 7321 aaaatgtgga tggtggaatc aaatggccta ttataacagt tgtaaatggg aagaggcaaa 7381 agtaaagttt cattgtcaaa gaacacagag tcagcctgga tcatggttta gagcaatctc 7441 gtcatggaaa caaagaaata gatgggagtg gagaccagat tttgaaagta aaaaggtgaa 7501 aatatctcta cagtgcaata gcacaaaaaa cctaaccttt gcaatgagaa gttcaggaga 7561 ttatggagaa gtaacgggag 
cttggataga gtttggatgt catagaaata aatcaaaact 7621 tcatgctgaa gcaaggttta gaattagatg tagatggaat gtagggagta atacctcgct 7681 cattgataca tgtggaaaca ctcaaaaagt ttcaggtgcg aatcctgtag attgtaccat 7741 gtattcaaat aaaatgtaca attgttcttt acaaaacggg tttactatga aggtagatga 7801 ccttattatg catttcaata tgaaaaaggc tgtagaaatg tataatattg ctggaaattg 7861 gtcttgtaca tctgacttgc catcgtcatg ggggtatatg aattgtaatt gtacaaatag 7921 tagtagtagt tatagtggta ctaaaatggc atgtcctagc aatcgaggca tcttaaggaa 7981 ttggtataac ccagtggcag gattacgaca atccttagaa cagtatcaag ttgtaaaaca 8041 accagattac ttagtggtcc cagaggaagt catggaatat aaacctagaa ggaaaagggc 8101 agctattcat gttatgttgg ctcttgcagc agtattatct attgccggtg cagggacggg 8161 ggctactgct atagggatgg taacacaata ccaccaagtt ctggcaaccc atcaagaagc 8221 tgtagaaaag gtgactgaag ccttaaagat aaacaactta agattagtta cattagagca 8281 tcaagtacta gtaataggat taaaagtaga agctatggaa aaatttttgt atacagcttt 8341 cgctatgcaa gaattaggat gtaatcaaaa tcaatttttc tgcaaaatcc ctcctgagtt 8401 gtggacaagg tataatatga ctataaatca aacaatatgg aatcatggaa atataacttt 8461 gggggaatgg tataaccaaa caaaagattt acaacaaaag ttttatgaaa taataatgga 8521 catagaacaa aataatgtac aagggaagaa agggatacaa caattacaaa agtgggaaga 8581 ttgggtagga tggataggaa atattccaca atatttaaag ggactattgg gaggtatctt 8641 gggaatagga ttaggagtgt tattattgat tttatgttta cctacattgg ttgattgtat 8701 aagaaattgt atccacaaga tactaggata cacagtaatt gcaatgcctg aagtagaagg 8761 agaagaaata caaccacaaa tggaattgag gagaaatggt aggcaatgtg gcatgtctga 8821 aaaagaggag gaatgatgaa gtatctcaga cttattttat aagggagata ctgtgctgag 8881 ttcttccctt tgaggaaggt atgtcatatg aatccatttc gaatcaaatc aaactaataa 8941 agtatgtatt gtaaggtaaa aggaaaagac aaagaagaag aagaaagaag aaagccttca 9001 agaggatgat gacagagtta gaagatcgct tcaggaagct atttggcacg acttctacaa 9061 cgggagacag cacagtagat tctgaagatg aacctcctaa aaaagaaaaa agggtggact 9121 gggatgagta ttggaaccct gaagaaatag aaagaatgct tatggactag ggactgttta 9181 cgaacaaatg ataaaaggaa atagctgagc atgactcata gttaaagcgc tagcagctgc 9241 ctaaccgcaa aaccacatcc tatggaaagc 
ttgctaatga cgtataagtt gttccattgt 9301 aagagtatat aaccagtgct ttgtgaaact tcgaggagtc tctttgttga ggacttttga 9361 gttctccctt gaggctccca cagatacaat aaatatttga gattgaaccc tgtcgagtat 9421 ctgtgtaatc ttttttacct gtgaggtctc ggaatccggg ccgagaactt cgca // LOCUS ASTSSP12S 3341 bp ds-DNA PLN 29-JUN-1990 DEFINITION A.sativa 12S globulin seed storage protein gene, complete cds. ACCESSION J05485 KEYWORDS 12S globulin; storage protein. SOURCE A.sativa (genomic library lambda-Charon-34), clone lambda-OG1-E1. ORGANISM Avena sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 3341) AUTHORS Shotwell,M.A., Boyer,S.K., Chesnut,R.S. and Larkins,B.A. TITLE Analysis of seed storage protein genes of oats JOURNAL J. Biol. Chem. 265, 9652-9658 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.A.Shotwell, 17-APR-1990. FEATURES from to/span description pept 983 1310 12S globulin seed storage protein, exon 1 1428 1702 12S globulin seed storage protein, exon 2 1829 2344 12S globulin seed storage protein, exon 3 2445 2882 12S globulin seed storage protein, exon 4 pre-msg 945 > 3341 12S globulin seed storage protein mRNA and introns IVS 1311 1427 12S globulin seed storage protein intron A IVS 1703 1828 12S globulin seed storage protein intron B IVS 2345 2444 12S globulin seed storage protein intron C signal 885 890 CAT box signal 917 921 TATA box signal 2915 2920 polyA addition signal signal 2965 2970 polyA addition signal BASE COUNT 1044 a 677 c 606 g 1014 t ORIGIN 1 bp upstream of EcoRI site. 
1 gaattctttt tggaaagtca ttttgcctcc tgaactccag tgtttcctat ttattaaaaa 61 aaactaaaaa ctatatttat aagtttgaaa aaatcatgaa acaaaattgt aaaaattgct 121 agtgatatat cccacaaacg tgcaaaatct caatttgaag tgctttgtat ttcgagctac 181 acaaaaatga caaagtgtga ctttttatgt gatttgaaat cactatacta cagatctaca 241 attttgttct ttttgtgaag ctataaatac acattatttt gagttgagat ttttcatgtt 301 tgtgctatga atcataggct acatcctgat ttatttttag aattttttgg aacctaaaat 361 atgttctaga ttatttttta aaaagtggga tcattgatgc ccatacacac gaaatctcca 421 ctcaattctt ttatacatta tctttctata tctactaacg tggattatac atcatagtaa 481 gtttcttact acatgtgctt tcttgttaca atgtggacat gactcttcca cttttgggct 541 ttatgttgta tggatatact catgacatgg aattttgtcc acacacgtag aaccatccat 601 atatattgtt gtgcatagaa caaaacacaa gcaagccatg aaaaaggagt cacaggtgcc 661 acaaactgtt gtaggaagta taactagtat gaggcctttt atttgacgtc ggacaatggc 721 caagagctac atataaaaga tggtgctaga tttgtgagtc agcaccagct gtaggcagaa 781 aacaacacat atcttttgag acaaagttat atctattcac atttaaaacc atgatctgtt 841 gagtcaccat gaatatcttc ttatctatgt taataattac atgtcatcat gtttatcctg 901 gactactttt tatggctata aaatcaaact tacaataagg aaactagcac caatccacct 961 tctacaatct cttcaaacaa tcatggcaac tactaggttt ccatcattgt tgttttactc 1021 ctatattttt ctcttgtgca atgggtcaat ggctcagcta ttcgggcaga gctttacccc 1081 atggcaaagc tctcgacaag gaggtttaag ggggtgcaga tttgataggc tacaagcatt 1141 tgaaccactt cgacaagtgg ggtcacaagc gggtatcatt gagtactttg atgagcagaa 1201 tgagcaattt cgttgtgcag gtgtatccgt catccgtcgt gttattgagc ctcaaggcct 1261 cttgttacct caataccaca acgctcctgg cttggtgtac atccttcaag gttagtgtct 1321 aattgattat aaaaattgcc tttgttatac ttcacttagg atttagatgt gccaaatgtt 1381 acaccgttca tattttttaa caatgaaaca aatgtgtttt cttttaggta ggggattcac 1441 agggttgact ttccctggat gcccggcgac cttccaacaa cagttccaac catttgatca 1501 agcccagttt gctgaaggtc aaagccaaag ccaaaatctt aaggatgaac accaaagagt 1561 tcaccacatc aaacaaggag atgttgttgc tctaccggct ggcatagtac actggtgcta 1621 caacgatggt gatgcaccga ttgtagctgt ctatgtcttc gacgtaaaca acaacgctaa 1681 tcagcttgaa ccaaggcaaa 
aggtaactat acaacttaat gcacacaaaa tatatacaac 1741 tatttacaag tgatccaact atgatttgaa gtatacatta gtgggatatt aatgaactct 1801 gtttaacttc atcgatataa aattttagga gttcttgttg gctggtaaca acaagagaga 1861 gcaacagttt ggacaaaaca tattcagtgg attcagtgtc caacttctta gtgaggccct 1921 tggtataagt cagcaagtag cacaaaagat tcagagtcaa aatgaccaaa gaggtgagat 1981 aattcgtgtg agtcaaggcc ttcaattctt gaagcctttt gtttcccaac aaggaccagt 2041 agagcatcaa gcctaccaac caattcaaag tcaagaagaa caatcaaccc aataccaggt 2101 agggcaatca ccacaatatc aagaaggaca atcaactcaa taccagccag gacagtcatg 2161 ggaccaaagt ttcaatggtt tggaggagaa tttctgttca ttggaggcaa ggcaaaacat 2221 cgaaaacccg aaacgtgccg acacgtacaa cccacgtgct ggcaggataa cacatctcaa 2281 tagcaagaat tttcccaccc ttaacctggt gcaaatgagt gctacaagag taaatttata 2341 ccaggtattt atgatactac attcaacaca ctatcttatt tttagatatt ctaagcttca 2401 tacaaccgat taataatatg gcatacaaat aattgctatt gcagaatgct attctttcac 2461 catactggaa cattaatgct cacagtgtca tgcacatgat ccaaggacgt gctcgagttc 2521 aagttgtcaa taaccatggt cagaccgtat tcaatgacat tcttcgtcgc ggacaactgc 2581 taatcatacc acaacactat gttgttctca agaaggcaga gcgtgaagga tgccagtata 2641 tttcattcaa gaccaacccc aattccatgg ttagccagat cgcaggaaag acctccatcc 2701 tacgtgcatt gcccgttgat gtcctcgcca atgcataccg catttctaga caggaagccc 2761 aaaacctcaa aaataaccgt ggagaagagt ttgatgcatt cacccctaag tttacacaaa 2821 cgggctccca gagttaccag gacgaggggg agtcatcttc gactgagaag gcatccgagt 2881 gaataagtga gtgtaatgga aactagtata gtgaaataaa ggcatcgcat gtttgcagcc 2941 tagtggtata taaccgctta tctcaataaa aaagtttctc cgtgttatat tgtttgcttg 3001 tttcttgtac tcttcttaat tttatctttt atgatcaacc aacttacctc tcttcttcaa 3061 ttttctgcca ctcttcacat atgcatatca aggagacgta tagtccactt gtactagatt 3121 tattttttgg tactatcttt tgtggtttga gtatttctta ggtccgataa aaacattcaa 3181 tttactgatg ctaaattttt tcagcaaacc tagccaccta aaattgtgaa tcacatacct 3241 tcaaatttct atgccaaagt gtgcgatttt tatgttctat agatccaagc ttgggctgca 3301 ggtcgactct agaggatcga tccccgggcg agctcgaatt c // LOCUS ASTSSPA 1548 bp ds-DNA PLN 29-JUN-1990 DEFINITION 
A.sativa seed storage protein gene, complete cds. ACCESSION J05486 KEYWORDS avenin; prolamine; storage protein. SOURCE A.sativa (genomic library lambda-Charon-34), clone lambda-AV45-X1. ORGANISM Avena sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 1548) AUTHORS Shotwell,M.A., Boyer,S.K., Chesnut,R.S. and Larkins,B.A. TITLE Analysis of seed storage protein genes of oats JOURNAL J. Biol. Chem. 265, 9652-9658 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.A.Shotwell, 17-APR-1990. FEATURES from to/span description pept 484 1113 seed storage protein mRNA 411 > 1548 seed storage protein mRNA signal 343 347 CAT box signal 380 384 TATA box signal 1189 1194 polyA addition signal signal 1252 1258 polyA addition signal BASE COUNT 484 a 375 c 323 g 366 t ORIGIN 1 tctagaacaa tccttcctgt tactagttta atctacctta tacacagaca agtttacaaa 61 ctttgcaagc catgacaata tggtttatgc ctaacagcct gtagaaaaac acaaacttag 121 tttcagaaaa tgatgcaata tagatcagtg attgacatgt aaagcgaaaa agatgagaca 181 tattgccaac tatcaagctt tcttgatagt cgaaaggtga tacatgtaat ggcaaagcaa 241 cttttatgat cgatccaaaa ataacatttg taggtagtgc caccaaacac aacataccaa 301 ataatcagtt cgagaagcat acaagcattt tttaaggaaa agcaaatgca aagatgaaaa 361 gaaactatgc catggcagat ataaatagac ctgcaccatg gagatcctcc tccatcattc 421 atccttcaca caccgagagc acaaacatta aaacctagca agcagtatca acacaaatct 481 accatgaaga acttcctcat ccttgccctc ctttccatgg cggcgaccat ggccactgcg 541 cagtttgacc ctagcgaaca ataccagcca tatcctgagc aacaacagcc aattctacaa 601 caacagttgc tgttgcagca acaacaacag atgttgttgc agcaacaacc attgttgcag 661 gttttgcagc aacagttgaa cccatgcagg cagttcctcg tgcaacagtg cagcccggtg 721 gcagcggtgt cattcctccg gtcgcaaatc ctgcaacaga gcagctgcca ggtgatgagg 781 caacaatgct gccgacggct ggagcagatc cccgagcagc tccggtgccc agccatccat 841 agcgtcgtcc aggccatcat tatgcagcaa caacaacaac agttcttcca gcctcagatg 901 caacaacagt tcttccagcc tcagatgcaa 
caggtgacac agggcatctt ccagcctcag 961 atgcaacagg tgacacaggg catcttccag actcagatgc aaggtcagat cgaggggatg 1021 agggcgtttg cgctgcaggc cctgccggcg atgtgcgatg tatacgtccc gccgcactgc 1081 cccgtcgcca ccaccccgct cagtggcttc taagaacact acaagagcta tagtactaca 1141 taaataccat cagcgtttag ccgatggacc gatcttgtag cggtgacaaa taaaataaaa 1201 gggtcatgca ctaacatgtg tgacagccga cccgtactag tccaaacttg gaaataaaag 1261 aatgcaaagt tcttgtctgc ataacattga ttgtcgtttt tccattcatg tttataccta 1321 accataagct catgcattaa ctagctgctt atgtagcctt cattgatgtg aacataataa 1381 cagagtgggt gtaaaagatg gatttgaatt gaggcactat taaagtgagg agtaagaacc 1441 aggtgcagat gatagctctt taatggtgag gcttgtgagt tgtgacctta ttggctgaca 1501 tgcattggat tctaaagagt acatgttgat gctttcaaca atgtcgac // LOCUS DOGSPC18A 847 bp ss-mRNA MAM 29-JUN-1990 DEFINITION Dog microsomal signal peptidase complex (SPC 18) mRNA, complete cds. ACCESSION J05466 KEYWORDS ER membrane protein; signal peptidase complex. SOURCE Dog liver, cDNA to mRNA, lambda-18-[50,51]. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 847) AUTHORS Shelness,G.S. and Blobel,G. TITLE Two subunits of the canine signal peptidase complex are homologous to yeast sec-11 protein JOURNAL J. Biol. Chem. 265, 9512-9519 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.S.Shelness, 04-APR-1990. 
FEATURES from to/span description pept 39 578 microsomal signal peptidase complex BASE COUNT 212 a 162 c 216 g 257 t ORIGIN 1 cctctccgct gcccgtgtcc tccttcaccg gccgcgccat gctgtctcta gattttttgg 61 acgatgtgcg gcgaatgaac aagcggcagc tgtattatca agtcctaaat tttggaatga 121 ttgtctcctc ggcactaatg atctggaagg gactgatggt cataactgga agtgaaagtc 181 caattgtagt ggtgctcagt ggcagcatgg agccagcatt tcatagagga gatcttctct 241 ttttaacaaa tcgagttgaa gatcccatcc gagtgggaga aattgttgtt tttaggatag 301 aaggaagaga gattcctata gttcaccgag tcttgaagat tcatgaaaag caaaatggac 361 atatcaagtt tttgaccaaa ggagataata atgccgttga tgaccgaggc ctctataaac 421 aaggacaaca ctggctagag aagaaggatg tcgtggggag agcaagggga tttgttcctt 481 atattggaat tgtgacgatc ctcatgaatg actatccgaa atttaagtat gccgtcctct 541 tcttgctggg tttattcgtg ctggtccatc gtgagtaaga agtctacctt gctgttcctg 601 gaagatgccg tacttttcgt tcctgactgt ttggagtaga tattggtcta tgattggtgg 661 agtgcagaac gtacatgttg gcgcttcttg gtagcaatgg tttgcattag tttctgtttc 721 cacgccaggg tctgtgtggg cgggtgcacc aagatgcgca ccgaggggac tctcaatcac 781 aggatttcat atgttgtcat tgtcacactt tcacattttt gtacatcagt gaatttttta 841 tattaaa // LOCUS DROCRPA 7226 bp ss-mRNA INV 29-JUN-1990 DEFINITION D.melanogaster crumbs protein mRNA, complete cds. ACCESSION M33753 KEYWORDS crumbs protein. SOURCE D.melanogaster (strain Oregon R) 3-12 hr embryo, cDNA to mRNA, clone lambda-gt10. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 7226) AUTHORS Tepass,U., Theres,C. and Knust,E. TITLE Crumbs encodes an EGF-like protein expressed on apical membranes of Drosophila epithelial cells and required for organization of epithelia JOURNAL Cell 61, 787-799 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by U.Tepass, 17-APR-1990. 
FEATURES from to/span description pept 214 6633 crumbs protein BASE COUNT 1880 a 1860 c 1879 g 1607 t ORIGIN Chromosome 95F9-13. 1 aaaaaaaata aaaataacct gttctaagcc aagtgccgat tgtgtgtgtg tgtgtgtgtg 61 cgtatgaagt gtttgcgtgg agcagagaga atgggagaat gcaattacca acaacaacaa 121 gctcaatgag tatcgtttga atagcaataa tatctagcca aaagcaaccc tttccaatac 181 gaaaactcga cttgagccaa caaatcagcg atcatggcta aaatcgccaa tgcgtcactg 241 tcgcaacaac aaaaacagcg gcaagcagaa acagcaacga ccacaacaac aacagtggcg 301 gcatcaggta gaaacagcaa caacaacagc aagaagtcga gaccgcacga aaagcgccac 361 gtcaaattac gctcgcattt gctcaagcgc cgcccatttc agtttattcg tcgccctcaa 421 tggataccgc tcttcatact tatttactta gcaacagatg tcgcctcagt ggcggtgccg 481 acgaaggagg cgtactttaa tggctccact tacctccgcc tcaccacgcc gatgcccatt 541 tgggatcact cggcgattag tttccgctcg tgccgcggcg gcgagatcct cgcccagcag 601 tacaacaaga actccattgt aatctcagtg ctcaatgact ttctgcaaat ctcactggct 661 ggacccgccg tccatgggcc caacaaccgg ctggatgtca agctgcccta ccaactgctg 721 gacaaccgct ggcatacgct gcagttcaag tacgagtacg gaaatctcta cctgcatgtg 781 gatcgcgcgg caagcatatt tgccaactcc acgtacaaca gtcagttcct gactaaccag 841 gacattggct acaaggacgc catcttgata ctgggaaact ccttctccgg ttgccttctg 901 gatggaccag gtctgcaatt tgtgaacaac tcgacggtgc agaatgtggt ctttggtcac 961 tgtcccttga ccccgggtcc ctgcagcgat cacgatcttt tcactcgact gccagacaac 1021 ttttgtctga atgatccctg tatgggccat ggaacctgct cgtccagtcc agagggatac 1081 gaatgtcggt gcacggcacg ttactcggga aaaaattgcc agaaggacaa tggctcgccg 1141 tgtgccaaaa atccatgtga gaacggtggt tcctgcctgg agaattccga ggggaattac 1201 cagtgcttct gtgatcccaa ccacagtggt cagcactgcg agacggaggt gaacatccac 1261 ccactctgcc agacgaaccc ctgcctgaac aatggagcat gtgtggtgat tggcggcagt 1321 ggagcactca cctgcgagtg tcccaaagga tacgctggtg ccaggtgcga ggtggacacg 1381 gatgagtgcg cctcacagcc gtgtcagaac aacggaagct gcatagaccg tatcaatgga 1441 ttcagctgcg attgcagcgg caccggttac acgggtgcgt tctgccagac gaatgtggat 1501 gagtgcgaca agaatccgtg cctgaacggc ggcagatgct tgcataccta cggatggtac 1561 acctgccagt gtctggatgg ttggggcggc 
gagatttgtg atcgacccat gacatgccag 1621 acgcagcagt gtttcaacgg tggcacctgc ctggacaagc ccatcggctt ccagtgcctc 1681 tgtccgccgg agtacactgg tgaactgtgc caaatcgcac ccagttgcgc acagcagtgc 1741 cccatcgatt cggagtgcgt aggcggcaaa tgcgtttgta agccaggctc atcgggttac 1801 aactgtcaaa cgagcacggg cgatggagct tctgctttgg ccctgacacc catcaactgc 1861 aatgccacca atggaaagtg tctcaatgga ggaacttgct ccatgaacgg aacccactgc 1921 tattgcgccg tcggctattc gggagatcgt tgcgagaagg ccgagaactg ttcgccactg 1981 aactgccagg aaccgatggt atgtgtccag aatcagtgcc tctgcccgga gaacaaggtg 2041 tgcaaccagt gcgccacaca accgtgccag aatggcggtg aatgtgtgga cctaccgaat 2101 ggagactacg agtgcaagtg cacccgtgga tggactggac gcacctgtgg caacgacgtg 2161 gacgagtgca ccctgcatcc gaagatctgc ggcaatggta tctgcaagaa cgagaaggga 2221 tcgtacaaat gctactgtac gcctggattt accggagtcc actgcgattc cgatgtggac 2281 gaatgcctta gctttccctg cctcaacgga gccacgtgcc acaacaagat aaatgcctac 2341 gagtgcgttt gccaaccagg atatgaggga gagaactgcg aagtagatat cgatgagtgc 2401 ggcagtaatc cctgctcgaa cggatccacc tgcatcgaca ggatcaacaa cttcacctgc 2461 aactgcattc cgggaatgag gggccgcata tgtgacatcg acattgacga ctgtgtggga 2521 gatccctgct tgaatggtgg tcagtgcatc gaccaattgg gtggcttccg ttgtgactgc 2581 agtggcaccg gttatgaggg cgagaattgt gagctgaaca tagacgaatg cctctcgaat 2641 ccgtgcacga atggtgccaa gtgcctggac agggttaagg actacttctg cgattgccac 2701 aacggctaca agggaaagaa ctgtgagcag gacatcaacg agtgcgagag taatccctgc 2761 cagtacaatg gcaactgcct agagcgttcg aacataacgc tttaccaaat gagtaggatc 2821 acggatctgc ccaaggtgtt tagccagccc tttagctttg agaatgccag cggctacgag 2881 tgcgtctgtg tgcccggcat cattggaaag aactgcgaga tcaacattaa tgagtgcgac 2941 agtaatccat gcagcaagca cggaaattgc aatgacggga ttggaaccta tacctgcgaa 3001 tgcgaacctg gcttcgaggg cacccactgt gagatcaaca tagatgaatg tgatcgctat 3061 aatccctgcc agaggggcac ttgctatgac cagatagacg actacgactg cgactgcgat 3121 gcgaactatg gaggcaaaaa ctgttccgtt cttctcaaag gctgtgacca gaatccttgt 3181 ttaaatggcg gtgcctgctt gccgtatttg atcaacgagg taactcatct gtacacctgc 3241 acctgcgaga acggcttcca gggcgataaa tgcgaaaaga 
ccactactct gtccatggtg 3301 gccactagcc tgatttcggt taccacggaa cgcgaggaag gctacgacat taatcttcaa 3361 tttaggacca cactgcccaa cggagttttg gctttcggaa ccaccggcga aaagaatgaa 3421 ccagttagct atattttgga gctgattaat ggacgactga atctacattc ctcgctccta 3481 aataagtggg agggcgtgtt tattggatcg aagctgaacg atagcaactg gcacaaggtg 3541 tttgtggcca taaacacttc gcatctagtc ctttcggcca acgacgagca ggctattttt 3601 ccggttggct cctatgaaac ggccaataat agccagcctt cattcccgcg cacatacctc 3661 ggaggcacca ttcccaatct gaagtcctat ctgcgccacc tcacccacca accatcagct 3721 tttgtgggct gcatgcaaga tataatggtc aatggtaaat ggatctttcc cgatgagcag 3781 gatgcgaata tcagttatac caaactggaa aatgtccaga gcggttgtcc gcgcaccgaa 3841 caatgtaaac cgaatccttg ccattccaac gttgaatgca cggacctttg gcacaccttc 3901 gcttgtcact gtcccagacc attctttggg cacacatgtc agcacaatat gactgctgcc 3961 acttttggcc acgagaacac aacccattcg gccgtgattg tggagaccac agatgtggct 4021 aggcgagcta ttagatccat cctagacatt tccatgttta ttcgtacacg cgagccaact 4081 ggtcaagtct tttacttggg cactgatcca cgcaaagccc ccactaaaaa tattggcgac 4141 tcatatgtgg cggccaaatt acacggcggc gagctgctgg taaagatgca gttcagcggt 4201 actccggagg cctacaccgt tggtggccag aaattggaca acggctataa ccacctgatc 4261 gaggtggtgc gcaatcaaac gctcgtgcag gtcaagctca atggcaccga gtacttccgc 4321 aagacgctgt ccacgacggg tctgctggac gcacagttgc tttacttggg cggacctgca 4381 cccacacgcg agtcacttct gggcgcgacc acagaaccag gaataattcc agttccggga 4441 gcaggaatac ccattgaaga cacaacagtg cccaaggagg cggacgacag cagggactac 4501 tttaagggca taatccagga cgtgaaggtt agcaatggct cgctcaacct gattgtggag 4561 atgtattccc taaacgtgac agatgtccaa gtgaatgcca aacctctggg cgctgtaacc 4621 atcgatcgtg cctccgtgtt acctggtgag gtgtccgatg atctgtgccg aaagaatgcc 4681 tgcctccata atgccgagtg caggaacaca tggaatgact acacttgcaa atgcccaaat 4741 ggttacaagg gaaagaagtg tgccaggaga tcagaatttt gccaacacgt tacatgtccg 4801 ggacagagct tgtgccaaaa tctggacgat ggctatgagt gtgttacgaa caccacattt 4861 actgggcagg aacgcagtcc tctcgccttc ttctacttcc aggagcagca atctgacgat 4921 attgtcagtg aagcgtctcc caagcaaact cttaaaccgg taattgatat 
tgccttccga 4981 ctcgtgctgg aggtactctg cctgtacata gacaatgtgg acggattctt tgaaattgga 5041 gtaaacggag gacgagtgac catcacctgg aagctaagtg cactgcattt tggcgagtct 5101 gcacgctttg agaaggagaa cactgacgga gaatggagtc gcatttacct cagggcacac 5161 aacagcaaac tggagggtgg atggaaggga tgggaatcga tggtggatcc gacgccagct 5221 ttttccacgg acatcgacca agcggccttc cagtccctga tcgctacaag tacccaggtt 5281 tacttgggtg gcatgccaga gtcgcgacaa gcacgaggat ccactttgtc tgcccagcag 5341 ggctctcagt tcaagggctg tgtgggagag gcaaggctcg gcgatctttt actgccctac 5401 ttctccatgg cggaactgta ttcgcgcacc aatgtttcag tacagcaaaa ggctcaattc 5461 cgtctaaatg ccacacgacc tgaggagggc tgcatcctgt gcttccagtc agactgcaaa 5521 aatgacggct tctgtcaatc tccttcagat gagtacgcct gcacctgtca gcctggattc 5581 gagggcgatg attgcggcac ggacatcgac gagtgtctta acacggaatg cttgaacaac 5641 ggcacctgca tcaaccaggt tgcagctttc ttctgccagt gtcagccagg attcgagggt 5701 cagcactgtg agcaaaacat cgacgagtgt gcggatcagc cgtgccacaa cggtggcaac 5761 tgcacggatc ttatcgcatc gtacgtgtgc gactgccctg aggactatat gggcccgcag 5821 tgcgacgtgc tgaagcaaat gacctgcgag aacgagccat gtcggaatgg atcaacctgc 5881 cagaatggat tcaatgcttc tactggcaat aactttacat gtacatgcgt gcccggcttc 5941 gagggtccac tgtgtgacat acccttctgt gaaataacgc cttgcgataa cggtggcctc 6001 tgcctgacca ctggagcggt accgatgtgc aaatgtagtc tgggatacac tggtcgcctg 6061 tgcgagcagg acattaacga gtgcgaatcg aatccatgcc agaacggtgg tcagtgtaag 6121 gacctcgtcg gcaggtacga gtgcgattgc agggcacgga ttcgagggat tcgctgtgaa 6181 aatgacatcg acgagtgcaa catggaggga gattactgcg gcggattggg ccggtgtttc 6241 aacaagcccg gatccttcca gtgcatctgc cagaaaccct attgcggagc ctactgcaac 6301 tttacggatc cctgcaacgc tacggacctc tgtctaaacg gcggtcgctg cgtagagtcc 6361 tgcggcgcca aaccggacta ctactgcgag tgtccggaag gtttcgcggg aaagaattgc 6421 acagcaccga ttacggccaa ggaggacggg ccttcgacca cagacattgc catcattgta 6481 atacccgtag tggtggtgct gctgctgatc gcgggagccc tcctgggcac cttcctggtg 6541 atggccagga acaagcgagc aaccaggggc acctatagcc cgagcgtcgt caagagtact 6601 gcaacccacg gctggaaatg gacaacgtac tgaagccacc gccggaagag cgactaattt 
6661 agttttgagt tttgagcatg aacgacgatt agcaaagcaa acaaaagata tttttaaatc 6721 cgcccatata cacctagctg taggagtaac tcaatgtttt gtactaagtt ggcccctagt 6781 tatggtttac atcttaaggt gctcaaagca atagcagcag cagcagcagc caggcttcca 6841 cctcgactcg tcccttatcc cctgcctacg ctttagttag ttaataatgc cgttgtctat 6901 ttattctagt agttagatga catacgtacc gccctatagt cgttatgtag ttatgttccc 6961 gatagtttag attccgtatt cgatttctcg tatatgtaat cctaaagctg cgaagaaact 7021 tgagctccga actgtcgatt tcccctttgc attccacaag gaagcccatc aaatgtattg 7081 tatgagtatt tttgtaaata agattgtttg ccgactctta aagtaaatta gctgagctaa 7141 aatcgaaatt aaaaacagaa aatcctatat gcctatatat cgcgagtata tatcagtaaa 7201 tgtggcctta taacaaaaaa aaaaaa // LOCUS ECOINCRNA 263 bp ss-RNA BCT 29-JUN-1990 DEFINITION E.coli antisense Inc RNA. ACCESSION M33752 KEYWORDS antisense RNA. SOURCE E.coli clinical isolate RNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 263) AUTHORS Lopez,X.X. and Rodriguez,J.C. TITLE Isolation and nucleotide sequence of an incompatibility gene from the incFv1 plasmid pSU401: comparison to other IncF genes JOURNAL Unpublished (1990) See COMMENT for author address STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.C.Rodriguez, 17-APR-1990. 
Univ de Cantabria, Depto Biologia Molecular, c/Cardenal Herrera Oria s/n, 39011 Santander, SPAIN FEATURES from to/span description signal 179 185 -10 promoter signal 205 211 -35 promoter BASE COUNT 70 a 54 c 71 g 68 t ORIGIN 1 gatcgtttaa ggaattttgt ggctggccac gccgtaaggt ggcaaggaac tggttctgat 61 gtggatgtac aggagccaga aaagcgaaaa ccccgataat cttctctaac tttggcgagt 121 gcagaaagat taccggggcc atctaaaaac cgtatagcca acaattcagc tatgcgggga 181 gtatagttat atgcccggaa aagttcaaga cttctttctg tgctcgctcc ttctgcgcat 241 tgtaagtgca ggatggtgtg act // LOCUS FLASL287HA 1095 bp ss-RNA VRL 29-JUN-1990 DEFINITION Influenza A/SL/2/87, A(H1N1) virus hemagglutinin (seg 4) RNA, 5' end. ACCESSION M33748 KEYWORDS hemagglutinin. SOURCE Influenza type A/SL/2/87 passed in rhesus monkey kidney cells. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1095) AUTHORS Rajakumar,A., Swierkosz,E.M. and Schulze,I.T. JOURNAL Unpublished (1990) See COMMENT for author address STANDARD full staff_review REFERENCE 2 (bases 33 to 1095) AUTHORS Rajakumar,A., Swierkosz,E.M. and Schulze,I.T. TITLE Sequence of an influenza virus hemagglutinin determined directly from a clinical sample JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by I.T.Schulze 17-APR-1990. St. Louis School of Med, Dept of Microbiology, 1402 S Grand Blvd St. 
Louis Missouri 63104 FEATURES from to/span description pept 33 > 1095 hemagglutinin precursor protein sigp 33 83 signal peptide matp 84 1061 hemagglutinin 1 (HA1) matp 1066 > 1095 hemagglutinin 2 (HA2) BASE COUNT 381 a 233 c 237 g 244 t ORIGIN 1 agcaaaagca ggggaaaata aaaacaacca aaatgaaagc aaaactacta gtcctgttat 61 gtgcatttac agctacagat gcagacacaa tatgtatagg ctaccatgcg aacaactcaa 121 ccgacactgt tgacacagta cttgaaaaga acgtgacagt gacacactct gtcaacctac 181 ttgaggacag tcacaacgga aaactatgtc gactaaaagg aatagcccca ctacaattgg 241 gtaattgcag tattgccgga tggattttag gaaacccaga atgcgaatca ctgttttcta 301 agaaatcatg gtcctacatt gcagaaacac caaactccga gaatggaaca tgttacccag 361 ggtatttcgc tgactatgag gaactgaggg agcaattgag ttcagtatca tcattcgaga 421 gattcgaaat attccccaaa gaaagctcat ggcccaacca caccgtaacc aaaggagtaa 481 cggcatcatg ctcccataag gggaaaagca gtttttacag aaatttgcta tggctgacgg 541 agaagaatgg ctcgtaccca aatctgagca agtcctatgt gaacaacaaa gagaaagaag 601 tccttgtact atggggtgtt catcacccgt ctaacatagg ggaccaaagg gccatctatc 661 atacagaaaa tgcttatgtc tctgtagtgt cttcacatta taacaggaga ttcaccccag 721 aaatagcaaa aagacccaaa gtaagagatc aagaaggaag aattaactac tactggactc 781 tgctggaacc cgaggacaca ataatatttg aggcaaatgg agatctaata gcgccatggt 841 atgcttttgc actgagtaga ggctttgggt caggaatcat cacctcaaac gcatcaatgg 901 atgaatgtga cgcgaagtgt caaacacccc agggagctat aaacagtagt cttcctttcc 961 agaatgtaca cccagtcaca ataggagagt gcccaaagta tgtcaggagt acaaaattaa 1021 gaatggttac aggactaagg aacatcccat ccattcaatc cagaggtttg tttggagcca 1081 ttgccggttt cattg // LOCUS TREKPMDA 999 bp ds-DNA ORG 29-JUN-1990 DEFINITION T.evansi kinetoplast DNA minicircle. ACCESSION M33751 KEYWORDS . SOURCE T.evansi bloodstream forms DNA, clone AnTat 3/3. ORGANISM Kinetoplast Trypanosoma evansi Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae; Trypanosoma evansi. REFERENCE 1 (bases 1 to 999) AUTHORS Bajyana Songa,E., Paindavoine,P., Viseshakul,N., Muyldermans,S., Steinert,M. 
and Hamers,R. TITLE Evidence for kinetoplast and nuclear DNA homogeneity in T.evansi JOURNAL Mol. Biochem. Parasitol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by E.Bajyana Songa 17-APR-1990. FEATURES from to/span description site 367 516 kinetoplast minicircle common region (ORI) BASE COUNT 391 a 78 c 203 g 327 t ORIGIN 1 gaatcagtgt cttttgaggg aagtaaagta atataataga tagaaacata ataataattt 61 aatttgatag tatatacata tcaacaacga caaagagtca gtgaaattag agataaagtt 121 attgtagtta tataattaaa ttaatctatc tattatttat tttcttttat agaggagagg 181 gaataagagg gaaaattcat ggagatacta gggtgagaga gttaatagag taattgtagt 241 tgggaatatg gagtagttat aattatattg gtgaaaggaa agggctaaaa gtcgtgtagt 301 agaatagagg ttgataggaa taagtggtgg aatttgtaga aggggttggt aaaaatctat 361 aaaaattgtt aaaattggct aaaaatcggg ctgaaaaaac ggaaaatctt atgggcggtg 421 caaattttca ccatacacaa acacgtgcta ttttcggggg tttttaggtc cgaggtactt 481 cgagaggggt tggtgtaata cacacacggt ttttctcagg gtttttgagg caattcgcag 541 ttttcctgag gttctcagta cacttaattt ggatttaatt gattctatag agaaaaatag 601 aataatagat aagtaagtaa ttcatgaata tagatatata taattgtaca tataccaaca 661 aacagaataa ctaatgcaca gtgatgatga tagttaatta attatatata aagttctaat 721 ctatctatta ttatatttaa ttgagtgatg tgagaataag gtgatatttc aatcctaaac 781 aaaagaaatg tgtaatagat agaagataat gagaagttaa ttataaatat atcatacaaa 841 ataacaatga tcacagataa tgagtgaata tagatagaga atttaattat tattattgtg 901 tatattgaat tacatattta ttattttatt ttagtatata ggacgcagaa atagcagtat 961 aaaataatgg ataaaagagt ttataggtga agttgaagt // LOCUS RATCASPAT 1535 bp ss-mRNA ROD 29-JUN-1990 DEFINITION Rat cytosolic aspartate aminotransferase mRNA, complete cds. ACCESSION D00252 KEYWORDS cytosolic aspartate aminotransferase. SOURCE Rat (strain Wistar), cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. 
REFERENCE 1 (bases 1 to 1535) AUTHORS Horio,Y., Tanaka,T., Taketoshi,M., Nagashima,F., Tanase,S., Morino,Y. and Wada,H. TITLE Rat cytosolic aspartate aminotransferase: Molecular cloning of cDNA and expression in Escherichia coli JOURNAL J. Biochem. 103, 797-804 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Wada, 07-SEP-1989. FEATURES from to/span description pept 43 1284 cytosolic aspartate aminotransferase (EC 2.6.1.1) mRNA 1 1535 cytosolic aspartate aminotransferase mRNA BASE COUNT 360 a 409 c 410 g 356 t ORIGIN 1 agctcccagc tcgtgctcga ggtcccctca gattccatcg cgatggcccc tccatcattc 61 tttgcccagg ttccacaggc cccgccggtt ctggtcttta agctcattgc ggacttccgg 121 gatgatcccg atccccgcaa ggttaacctc ggcgtgggag cgtaccgcac agatgactct 181 cagccctggg ttttgccagt agtgaggaag gtcgaacaga agattgctaa cgaccacagt 241 ctcaaccacg agtacttgcc catcctgggc ctggcggagt tccggagctg tgcttctcag 301 ctagtacttg gggacaacag cccagctctc agggagaatc gggttggggg tgtgcagtct 361 ttgggaggga ccggtgcact tcgaattgga gctgacttct tagggcgatg gtacattggc 421 acagacaaca agaacacgcc cgtctacgta tcatcgccga cctgggagaa ccataatggc 481 gtgttttctg ccgctggttt taaagacatt cggtcctatc gctactggga tgcagagaag 541 agaggacttg atctccaggg tttcctgaat gatctggaga atgctcctga gttctccatc 601 tttgtcctcc acgcctgtgc acacaaccca acggggaccg acccaactga agaggagtgg 661 aagcagatcg ccgccgtcat gaagcgccgt tttctgttcc ccttctttga ctcagcctat 721 cagggctttg catctggaga cctagagaaa gatgcctggg ctattcgcta ttttgtgtct 781 gaaggcttcg agctcttctg tgcccagtcc ttctccaaga acttcgggct ctacaatgag 841 agagtgggga atctgaccgt ggtcggaaaa gagcatgaca gcgtcctgcg ggtcctttcc 901 cagatggaga agattgtacg aatcacctgg tccaatcccc ctgcccaggg agctcggatc 961 gtggccacca ccctctccaa ccctgagctc tttaaggagt ggaaaggaaa cgtgaagaca 1021 atggctgacc ggattctgac catgagatcc gaactcaggg cgcgactaga agctctcaag 1081 actcccggga cttggtctca catcactgag cagattggaa tgttcagctt tactgggttc 1141 aaccccaagc aggtcgagta tttggtcaac gagaagcaca tctatctgat gccgagcggt 1201 
cggatcaaca tgtgcggctt gaccaccaag aacctagatt atgtggctac ctccatcaat 1261 gaagctgtca ccaaattcca gtgaagaaac accgagtagt tcataccacc aaagcagttc 1321 ctgtcacagc tttcctgcct gcgcaaacct agccgtacat gttgtttatt agagatgacc 1381 accatgggga ggcagccgct gtttagctgg ccccacaaga gaagacattt cttgaacatg 1441 aacctgggtc gggtgggggg atgactgggg ttagggcctt ttggaaacca gagcagatta 1501 aagttattta agaataaaaa aacccgttga tatgc // LOCUS RATGL5 599 bp ds-DNA ROD 29-JUN-1990 DEFINITION Rat NICER element (GL5-14) 3' long terminal repeat. ACCESSION M33534 KEYWORDS long terminal repeat. SOURCE Rat (strain NEDH) liver NGS-TC12 induced pheochromocytoma cell line DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 599) AUTHORS Cho,K.-O., Minsk,B. and Wagner,J.A. TITLE NICER elements: A family of nerve growth factor inducible cAMP-extinguishable retrovirus-like elements JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3778-3782 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.A.Wagner, 03-APR-1990. 
FEATURES from to/span description rpt 21 575 3' viral long terminal repeat recomb 575 576 viral DNA end/rat DNA start signal 345 349 CAAT box signal 402 406 TATA box signal 481 486 Poly-A signal BASE COUNT 174 a 154 c 139 g 132 t ORIGIN 1 tggtagaaga agaggggaat ggaaaggaaa attatacaga tttaaggttt aaaaatatga 61 agttaaaaga gtatgtttca actcaggact aaacactgtg aaaagcaagt ccagacagcc 121 ccgccctgcc gctagaacta acagaccata aaaggaaagg aatgcagaac agaccaggag 181 taccggacct gactcacagg ccacctggca ggaagagata agcccccagc ccccgacatc 241 caggacgccc caaagctgcc aatgtgtgta gctataactt attacctcat catgtgaaat 301 agccaatcat atgtgaacat gtctttgtgc ctcgtttgaa tccaccaatc cccgtaacta 361 tgcatctgct tctgtacgcc cacttctgct tccccaatcc ctataaaatc cccatgctgg 421 agctgctggg tgcgcaagtc ctccgaagag tctgtgtgcc cgcaggtacc tgtgttttcc 481 aataaaccct cttgctgatg gcatcgcagt ggactcggct cggtcattgg ggcttggact 541 cctcctgagg gaaggtcctc tccgggggtc tttcagaaga agagaattaa cagaccaaa // LOCUS RATGL5A 599 bp ds-DNA ROD 29-JUN-1990 DEFINITION Rat NICER element (GL5-14) 5' long terminal repeat. ACCESSION M33535 KEYWORDS long terminal repeat. SOURCE Rat (strain NEDH, isolate GL4-14) liver NGS-TC12 induced pheochromocytoma cell line DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 599) AUTHORS Cho,K.-O., Minsk,B. and Wagner,J.A. TITLE NICER elements: A family of nerve growth factor inducible cAMP-extinguishable retrovirus-like elements JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3778-3782 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.A.Wagner, 03-APR-1990. 
FEATURES from to/span description recomb 18 19 rat DNA end/viral DNA start rpt 19 584 5' viral long terminal repeat signal 344 348 CAAT box signal 401 405 TATA box mRNA 439 > 599 mRNA signal 479 485 Poly-A signal binding 579 586 primer binding site BASE COUNT 160 a 158 c 138 g 143 t ORIGIN 1 ttattatgtg tccttgaagg gaaaggaaaa ttatacagat ttaaggttta aaaatatgaa 61 gttaaaagag tatgtttcaa ctcaggacta aacactgtga aaagcaagtc cagacagccc 121 cgccctgccg ctagaactaa cagaccataa aaggaaagga atgcagaaca gaccaggagt 181 accggacctg actcacaggc cacctggcag gaagagataa gcccccagcc tccgacatcc 241 aggacgcccc aaagctgcca atgtgtgtag ctatacctta ttacctcatc atgtgaaata 301 gccaatcata tgtgaacatg tctttgtgcc tcgtttgaat ccaccaatcc ccgtaactat 361 gcatctgctt ctgtacgccc acttctgctt ccccaatccc tataaaagcc ccatgctgga 421 gctgctgggt gcgcaagtcc tccgaagaga ctgtgtgccc gaaggtacct gtgttttcca 481 ataaaccctc ttgctgattg catcgagtgg actcggctcg gtcattgggc gcttggactc 541 ctcctgaggg aaaggtcctc tccgggggtc tttcattatg gggctcgccc cgggatttg // LOCUS XELHMGCOA 2970 bp ss-mRNA VRT 29-JUN-1990 DEFINITION X.laevis 3-hydroxy-3-methylglutaryl-coenzyme A reductase mRNA, complete cds. ACCESSION M29258 KEYWORDS 3-hydroxy-3-methylglutaryl-coenzyme A reductase. SOURCE X.laevis liver, cDNA to mRNA, clone 2. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 2970) AUTHORS Chen,H. and Shapiro,D.J. TITLE Nucleotide sequence and estrogen induction of Xenopus laevis 3-hydroxy-3-methylglutaryl-coenzyme A reductase JOURNAL J. Biol. Chem. 265, 4622-4629 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Chen, 20-OCT-1989. 
FEATURES from to/span description pept 115 2766 3-hydroxy-3-methylglutaryl-coenzyme A reductase mRNA < 1 2970 3-hydroxy-3-methylglutaryl-coenzyme A reductase BASE COUNT 821 a 676 c 714 g 759 t ORIGIN 1 ggcgggaggt agaacatctg tgtgtgtcat ctccgtatcc ttgtggtatt tctctgccgc 61 cgccgggtca gtcaccagcc agcatcggcc aaagatatcc ttaatagagc caacatgctg 121 tccagactat ttcgcatgca tgggcagttt gtggcctctc atccctggga ggtcatcgtt 181 gggactgtga ctctcacaat ctgcatgatg tcaatgaaca tgttcactgg gaatgacaaa 241 atctgtggct ggaattatgc ttgccccaag tttgaagaag atgttctgag cagtgacatc 301 atcatcctga caatcacaag atgcatcgcc atcctttaca tttatttcca gttccaaaac 361 ctcagacagc tcggctcaaa atacatttta gggattgctg ggcttttcac catcttctca 421 agctttgttt ttagcacagt ggttattcac ttcctggata aagaactgac aggcctaaac 481 gaagctcttc catttttcct gcttttgatc gatctttcca aggctagtgc tttagccaaa 541 tttgctctaa gttccaactc acaggatgaa gtccgagaca atattgcccg tggcatggca 601 attctaggac ctacctttac ccttgaggcg cttgtagaat gtcttgtgat tggagtcggt 661 acaatgtcag gtgtgcggca gttagaaatc atgtgttgct tcggctgcat gtctgtcctt 721 gccaattatt ttgccttcat gaccttcttt ccagcttgtg tttcattggt gttagagctt 781 tctagagaga gccgagaggg ccgtccaatc tggcagctca gtcagtttgc tagtgtacta 841 gaagaagagg aggataataa acccaatcct gttactcaga gagtgaaaat gattatgtct 901 ttgggtttgg tccttgttca tgctcatagt cggtggataa gtgaaccctc ttcccaaaac 961 agcacctcaa tcagtgacca tgaagttacc accatgctgg atgatatgat gccaaagaga 1021 gttgagccta gcatgcctct ctggcagttc tatctttcta gaatggtgac catggatgta 1081 gaacagatca ttactcttgg tcttgctctc ctccttgccg tgaagtacat attctttgag 1141 caaacagaga cagaatcaac cttctctatg aagaacccga taatctcccc cgtagcggtg 1201 caaaagaagc aaatagagag ttgctgcaga cgggaacctg aacaggaaaa aactgtccat 1261 gtttccacaa cagaagaagc ttcatcaaag gaggaaactg aggcagttat aaagccatta 1321 ccccttgaga catcgcccaa agccaagttt atagtgggag attcttctcc actggaattg 1381 tccccagagg ataaaaatac catgtttgat ctaccagagg aaccccgtcc acttgatgaa 1441 tgtgttcgta tactcaagaa tccagataaa ggcgcacagt atctgactga tgcagaagtc 1501 attagcctgg tgaatgccaa acacatccca 
gcctacaaac tggaaaccat gatggaatct 1561 ccccgagagg gcgtggccat tcgccggcaa atgttgtctg acaagctccc ccagcgatca 1621 gcactgcaga gcttgcctta taagaactac aactactctt tagtaatggg tgcttgttgt 1681 gagaatgtca tcgggtacat gcccatccct gtaggtgtag ctggacctct cttgctgaac 1741 aacaaagagt atcaggtgcc catggcaaca actgaaggat gtcttgtggc cagcacaaac 1801 agaggttgca gagccataat gctgggagga ggtgccaaaa gtcgtgttct ggctgatggg 1861 atgactcgtg gaccggttgt taggttacca actgcctgtg atgcagcaga agtaaaggcc 1921 tggctagaca gcgcagaggg attcaaagtc ataaaagatg catttgatag cacaagcaga 1981 tttgcacgac tcgggagact acaaaactgc gttgctggca ggaacctgta catccgtttc 2041 cagtcaaaaa caggagatgc catgggaatg aacatgattt cgaaggtcac tgaacaagct 2101 cttgctagac tacaagaaga gttccctgat ctccatgttt tggctgtgag cggcaactac 2161 tgtacggata agaagccagc tgctattaac tggatagagg gaagaggaaa gtcagtggtt 2221 tgtgaggcca tcattccagc caaagttgta cgggaggttc taaagagctc aacagaagcg 2281 ttagttgaag tgaatatcaa taagaatttt attggttctg caatggcggg aagcatcgga 2341 gggtacaatg cacatgctgc aaacattgtg actgccattt acattgcttg cggacaggat 2401 gctgctcaga atgttgggag ctccaattgc atcacaataa tggaggctac cggtcccacg 2461 tatgaagatt tgtacatcag ctgcacaatg ccctctatag aaattggaac tgtaggaggt 2521 ggaactaact tggctcccca gcaagcttgc ttacagatgc tgggagtgca aggagcaagc 2581 acagagacac cagggaaaaa cgcctgccag cttgctcaga tagtgtgtag cacagttatg 2641 gccggggaac tgtcactaat ggcggcttta gcagcgggtc acctcgtcaa gagtcacatg 2701 gtccacaaca ggtcaaaaat taaccttcag gacctgccag gcacgtgcac aaagaaagct 2761 gcctgattgg ccgtaggaag agccgaatct taaaatgaac ttctctgaga aacggtgctg 2821 tatctcatca caacacaagg tggcgccaca tctgcctgag aaatggaatt gcctcatcta 2881 catatggatt cccaggcccg agcttcacaa tcaccatgtt aatccagtct tttattgatg 2941 atggcatctt aaaaagaaac ttacataatt // LOCUS HUMPFKMIX 1139 bp ss-mRNA PRI 29-JUN-1990 DEFINITION Human muscle phosphofructokinase (PFKM) alternatively spliced mRNA, partial cds. ACCESSION J05533 KEYWORDS phosphofructokinase. SOURCE Human (34 week old) placenta, cDNA to mRNA, clone HPFKM1. 
ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 62; 1110 to 1139) AUTHORS Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A. TITLE Alternative splicing of the transcript encoding the human muscle isoenzyme of phosphofructokinase JOURNAL J. Biol. Chem. 265, 9006-9010 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 1139) AUTHORS Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A. JOURNAL Unpublished (1990) STANDARD full staff_review FEATURES from to/span description pept < 1 > 1139 muscle phosphofructokinase (AA at 3) (EC 2.7.1.11) /nomgen="PFKM" /map="1cen-q32" /hgml_locus_uid="LL0081L" BASE COUNT 285 a 262 c 337 g 255 t ORIGIN 1 gaaaaccaat cacctcagaa gacatcaaga atggcagcag gatgggtgtg gaagcagtga 61 tggcactttt ggaggggacc ccagataccc cagcctgtgt agtgagcctc tctggtaacc 121 aggctgtgcg cctgcccctc atggaatgtg tccaggtgac caaagatgtg accaaggcca 181 tggatgagaa gaaatttgac gaagccctga agctgagagg ccggagcttc atgaacaact 241 gggaggtgta caagcttcta gctcatgtca gacccccggt atctaagagt ggttcgcaca 301 cagtggctgt gatgaacgtg ggggctccgg ctgcaggcat gaatgctgct gttcgctcca 361 ctgtgaggat tggccttatc cagggcaacc gagtgctcgt tgtccatgat ggtttcgagg 421 gcctggccaa ggggcagata gaggaagctg gctggagcta tgttgggggc tggactggcc 481 aaggtggctc taaacttggg actaaaagga ctctacccaa gaagagcttt gaacagatca 541 gtgccaatat aactaagttt aacattcagg gccttgtcat cattgggggc tttgaggctt 601 acacaggggg cctggaactg atggagggca ggaagcagtt tgatgagctc tgcatcccat 661 ttgtggtcat tcctgctaca gtctccaaca atgtccctgg ctcagacttc agcgttgggg 721 ctgacacagc actcaatact atctgcacaa cctgtgaccg catcaagcag tcagcagctg 781 gcaccaagcg tcgggtgttt atcattgaga ctatgggtgg ctactgtggc tacctggcta 841 ccatggctgg actggcagct ggggccgatg ctgcctacat ttttgaggag cccttcacca 901 ttcgagacct gcaggcaaat gttgaacatc tggtgcaaaa gatgaaaaca actgtgaaaa 961 ggggcttggt gttaaggaat gaaaagtgca atgagaacta taccactgac ttcattttca 1021 acctgtactc tgaggagggg aagggcatct 
tcgacagcag gaagaatgtg cttggtcaca 1081 tgcagcaggg tgggagccca accccatttg ataggaattt tgccactaag atgggcgcc // LOCUS J05525 20 bp ds-DNA BAD 29-JUN-1990 DEFINITION Human muscle phosphofructokinase (PFKM). ACCESSION J05525 REFERENCE 1 (bases 1 to 20) AUTHORS Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A. TITLE Alternative splicing of the transcript encoding the human muscle isoenzyme of phosphofructokinase JOURNAL J. Biol. Chem. 265, 9006-9010 (1990) STANDARD simple staff_entry COMMENT Intron/exon boundary sequence. McLachlan promises to send us full exon sequence before it is published in August 1990 using the same accession numbers. BASE COUNT 4 a 8 c 3 g 5 t ORIGIN 1 ctcatctcag atacctggcc // LOCUS J05526 20 bp ds-DNA BAD 29-JUN-1990 DEFINITION Human muscle phosphofructokinase (PFKM). ACCESSION J05526 REFERENCE 1 (bases 1 to 20) AUTHORS Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A. TITLE Alternative splicing of the transcript encoding the human muscle isoenzyme of phosphofructokinase JOURNAL J. Biol. Chem. 265, 9006-9010 (1990) STANDARD simple staff_entry COMMENT Intron/exon boundary sequence. McLachlan promises to send us full exon sequence before it is published in August 1990 using the same accession numbers. BASE COUNT 5 a 6 c 5 g 4 t ORIGIN 1 actcagcgag gtacttgcac // LOCUS J05527 20 bp ds-DNA BAD 29-JUN-1990 DEFINITION Human muscle phosphofructokinase (PFKM). ACCESSION J05527 REFERENCE 1 (bases 1 to 20) AUTHORS Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A. TITLE Alternative splicing of the transcript encoding the human muscle isoenzyme of phosphofructokinase JOURNAL J. Biol. Chem. 265, 9006-9010 (1990) STANDARD simple staff_entry COMMENT Intron/exon boundary sequence. McLachlan promises to send us full exon sequence before it is published in August 1990 using the same accession numbers. BASE COUNT 7 a 8 c 3 g 2 t ORIGIN 1 tctcaaccag acaaggaccc // LOCUS J05528 20 bp ds-DNA BAD 29-JUN-1990 DEFINITION Human muscle phosphofructokinase (PFKM). 
ACCESSION   J05528
REFERENCE   1  (bases 1 to 20)
  AUTHORS   Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A.
  TITLE     Alternative splicing of the transcript encoding the human muscle
            isoenzyme of phosphofructokinase
  JOURNAL   J. Biol. Chem. 265, 9006-9010 (1990)
  STANDARD  simple staff_entry
COMMENT     Intron/exon boundary sequence. McLachlan promises to send us full
            exon sequence before it is published in August 1990 using the
            same accession numbers.
BASE COUNT        7 a      3 c      4 g      6 t
ORIGIN
        1 catcaagaat gttcgtatga
//
LOCUS       J05529         20 bp ds-DNA             BAD       29-JUN-1990
DEFINITION  Human muscle phosphofructokinase (PFKM).
ACCESSION   J05529
REFERENCE   1  (bases 1 to 20)
  AUTHORS   Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A.
  TITLE     Alternative splicing of the transcript encoding the human muscle
            isoenzyme of phosphofructokinase
  JOURNAL   J. Biol. Chem. 265, 9006-9010 (1990)
  STANDARD  simple staff_entry
COMMENT     Intron/exon boundary sequence. McLachlan promises to send us full
            exon sequence before it is published in August 1990 using the
            same accession numbers.
BASE COUNT        2 a      5 c      6 g      7 t
ORIGIN
        1 gtcccttcag ctggtggtta
//
LOCUS       J05530         20 bp ds-DNA             BAD       29-JUN-1990
DEFINITION  Human muscle phosphofructokinase (PFKM).
ACCESSION   J05530
REFERENCE   1  (bases 1 to 20)
  AUTHORS   Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A.
  TITLE     Alternative splicing of the transcript encoding the human muscle
            isoenzyme of phosphofructokinase
  JOURNAL   J. Biol. Chem. 265, 9006-9010 (1990)
  STANDARD  simple staff_entry
COMMENT     Intron/exon boundary sequence. McLachlan promises to send us full
            exon sequence before it is published in August 1990 using the
            same accession numbers.
BASE COUNT        6 a      4 c      5 g      5 t
ORIGIN
        1 cagaattctg gtaagtcacg
//
LOCUS       J05531         20 bp ds-DNA             BAD       29-JUN-1990
DEFINITION  Human muscle phosphofructokinase (PFKM).
ACCESSION   J05531
REFERENCE   1  (bases 1 to 20)
  AUTHORS   Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A.
  TITLE     Alternative splicing of the transcript encoding the human muscle
            isoenzyme of phosphofructokinase
  JOURNAL   J. Biol. Chem. 265, 9006-9010 (1990)
  STANDARD  simple staff_entry
COMMENT     Intron/exon boundary sequence. McLachlan promises to send us full
            exon sequence before it is published in August 1990 using the
            same accession numbers.
BASE COUNT        4 a      6 c      8 g      2 t
ORIGIN
        1 gctcctgcag ggcagcagga
//
LOCUS       J05532         20 bp ds-DNA             BAD       29-JUN-1990
DEFINITION  Human muscle phosphofructokinase (PFKM).
ACCESSION   J05532
REFERENCE   1  (bases 1 to 20)
  AUTHORS   Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A.
  TITLE     Alternative splicing of the transcript encoding the human muscle
            isoenzyme of phosphofructokinase
  JOURNAL   J. Biol. Chem. 265, 9006-9010 (1990)
  STANDARD  simple staff_entry
COMMENT     Intron/exon boundary sequence. McLachlan promises to send us full
            exon sequence before it is published in August 1990 using the
            same accession numbers.
BASE COUNT        4 a      3 c      7 g      6 t
ORIGIN
        1 tgtccaggtg gtaagtactg
//