Path: utzoo!attcan!uunet!bionet!root From: GenBank-Updates@genbank.bio.net Newsgroups: bionet.molbio.genbank.updates Subject: Database Update Message-ID: Date: 13 Jul 90 12:00:06 GMT Sender: root@genbank.BIO.NET Distribution: bionet Lines: 1487 Approved: lear@genbank.bio.net Checksum: 01061 91 LOCUS TMGMPCPA 1413 bp ss-RNA VRL 13-JUL-1990 DEFINITION Tobacco mild green mosaic virus movement and coat protein genes, complete cds. ACCESSION M34236 KEYWORDS coat protein; movement protein. SOURCE Tobacco mild green mosaic virus (strain PV 228), cDNA to viral RNA. ORGANISM Tobacco mild green mosaic virus Viridae; ss-RNA nonenveloped viruses; Rod-shaped ss-RNA viruses; Tobamovirus. REFERENCE 1 (bases 1 to 1413) AUTHORS Nejidat,A., Cellier,F., Holt,C.A., Gafny,R., Eggenberger,A.L. and Beachy,R.N. TITLE Examination of movement function complementation between two tobamoviruses in a recombinant virus JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Nejidat, 11-MAY-1990. FEATURES from to/span description pept 51 851 movement protein pept 857 1336 coat protein BASE COUNT 456 a 214 c 331 g 412 t ORIGIN 1 tgttaacaaa tttttgtgtg ataaattttt atttagaact ttgtttttaa atggctgtta 61 gtctcagaga tactgtcaaa attagcgagt tcattaatct ttcgaaacag gatgagatac 121 ttccggcatt catgactaag gtcaagagtg ttagaatatc gactgtggac aagattatgg 181 ctgttaagaa tgatagtctt tctgatgtag atttacttaa aggtgttaag ttagttaaga 241 atggttatgt atgcttagct ggtttggtag tgtctgggga gtggaatctc ccggataact 301 gccgtggtgg tgtcagtgtt tgtattgtag acaagagaat gaaaagaagt aatgaggcaa 361 cgctgggtgc ttatcacgcc cctgcttgca aaaagaattt ttcctttaag ctaatcccta 421 attattcaat aacatccgag gatgctgaga agaacccgtg gcaagtgtta gtgaatatca 481 aaggagtggc tatggaagaa ggatactgtc ctttatcttt ggagttcgtt tcaatttgtg 541 tagtacataa aaataatgta aagaaaggtt tgagggaacg tattttgaga gtaacagacg 601 actcgccaat tgaactcact gaaaaagttg ttgaggagtt cgtggatgaa gtaccaatgg 661 ctgtgaaact cgaaaggttc cggaaaacaa aaaagggaaa aaaaagaaaa aaagaaaaaa 721 aaaagagagt ggtaggtaat agtgttaata ataagaaaat aaataatagt ggtaagaagg 781 gtttgaaagt tgaggaaatt gaggataatg taagtgatga cgagtctatc gcgtcatcga 841 gtacgtttta atcaacatgc cttatacaat caactctccg agccaatttg tttacttaag 901 ttccgcttat gcagatcctg tgcagctgat caatctgtgt acaaatgcat tgggtaacca 961 gtttcaaacg caacaagcta ggacaacagt ccaacagcaa tttgcggatg cctggaaacc 1021 tgtgcctagt atgacagtga gatttcctgc atcggatttc tatgtgtata gatataattc 1081 gacgcttgat ccgttgatca cggcgttatt aaatagtttt gatactagaa atagaataat 1141 agaggttgat aatcaacccg caccgaatac tactgaaatc gttaacgcga ctcagagggt 1201 agacgatgcg actgtagcta taagggcttc aatcaataat ttagctaatg aactggttcg 1261 tggaactggc atgttcaatc aagcaggctt tgagactgct agtggacttg tctggaccac 1321 aactccggct acttagctat tgttgtgaga tttcctaaaa taaagtcgct gatactttaa 1381 attcagggtg gctgatacca aaatcagcag tgg // LOCUS CLOFTHFSA 1965 bp ds-DNA BCT 13-JUL-1990 DEFINITION C.thermoaceticum formyltetrahydrofolate synthetase (FTHFS) gene, complete cds. ACCESSION J02911 KEYWORDS forymltetrahydrofolate synthetase. SOURCE C.thermoaceticum DNA. ORGANISM Clostridium thermoaceticum Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1965) AUTHORS Lovell,C.R., Przybyla,A. and Ljungdahl,L.G. TITLE Primary structure of the thermostable formyltetrahydrofolate synthetase from Clostridium thermoaceticum JOURNAL Biochemistry 29, 5687-5694 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Lovell, 13-APR-1990, for release after publication. FEATURES from to/span description pept 56 1735 formyltetrahydrofolate synthetase (FTHFS) (ttg start codon) (EC 6.3.4.3) mRNA < 1 1760 FTHFS mRNA binding 41 44 ribosomal binding site (put.) signal 1761 1785 pot. transcription termination signal BASE COUNT 459 a 576 c 532 g 398 t ORIGIN 1 aatatgtgac gtgcaaccgg atgttcactt tttcgaatga gaggagtgga ttacattgtc 61 caaggtaccc agtgatattg agattgccca ggcagccaaa atgaaaccgg tcatggaact 121 ggcccgggga ctgggcatcc aagaggacga ggtcgagctt tatggtaagt acaaggccaa 181 gatctccctc gatgtctatc gtcgcctcaa agacaagcct gacgggaaac taatcctggt 241 aaccgccatt acccctactc cggccggcga agggaaaact actaccagtg tcggtctcac 301 cgatgccctg gctcgcctgg ggaaaagggt gatggtctgc ctgcgggagc cctccctggg 361 acccagcttt ggtatcaaag gcggtgccgc cggcggtggt tatgcccagg tagtacccat 421 ggaagatatc aacctgcact tcaccggcga tatccacgcc gtcacctatg cccacaacct 481 gctggcggcc atggtggata accacctgca gcagggtaac gtcctgaata ttgatccccg 541 taccatcacc tggcgccggg tcatcgacct taatgaccgg gctctgagga acatagtcat 601 cggcctgggt ggcaaagcca acggcgtacc gcgggagaca gggtttgaca tctccgttgc 661 ctcggaggtt atggcctgcc tgtgcctggc cagcgacctc atggatctca aggaacgttt 721 cagccgcatt gttgtcggct acacctatga cggcaaaccg gtcaccgccg gcgatctgga 781 ggcccagggt tccatggctc ttctcatgaa ggacgccatt aaacccaacc tggtccaaac 841 cctggagaat acgccggcct ttatccacgg tggtcccttc gccaatatcg cccacggttg 901 caacagcatt atcgcaacca agacggccct gaaactggcg gattatgtcg tgacggaagc 961 cggtttcggt gccgacctgg gtgccgagaa gttctatgac gttaaatgcc gttatgccgg 1021 ctttaaaccc gatgccacag tcatcgtggc taccgtccgc gccctcaaga tgcacggcgg 1081 cgtacccaaa tcagacctgg ccactgaaaa cctggaagcc ctgcgggaag gctttgccaa 1141 cctggagaaa cacatcgaaa atatcggcaa gttcggcgta ccggcagtcg tggccatcaa 1201 tgccttcccc accgataccg aggccgagct aaatctcctc tacgagttgt gcgccaaagc 1261 tggggccgaa gttgccctct cggaagtctg ggctaagggc ggcgaaggcg gtctggaact 1321 tgcccggaag gtgttgcaga ccctggagag caggccatcc aacttccatg tcctctacaa 1381 cctggacctg agtattaaag acaaaattgc caaaatcgcc accgagatct acggggccga 1441 cggcgtcaac tatacggccg aagccgacaa agctatccag cgttatgaat ccctgggcta 1501 cggcaacctg ccggtggtca tggccaagac ccaatactcc ttttccgatg acatgaccaa 1561 gctcgggcgg ccgcggaact ttaccatcac cgtgcgcgag gtgcgcctct cggccggagg 1621 caggcttatc gtccccatca ccggcgccat aatgaccatg cccgggctgc ccaaacgccc 1681 ggcggcctgc aacatcgaca tcgatgccga cggcgtcatt accggtcttt tctagtatgc 1741 ctggagtaaa gcctatccct accccactcc tccagggagt ggggtttttg ctgtctatat 1801 gagaatgata taactaagaa gctggttttc caagccgtac ctatcatagg ggctgcgcca 1861 ccgccatgaa ccacgaaaat gcaaaagata tcattcatgt tagaactctg ttaccgagag 1921 gaacgattat gtctggcatg acgagataga gtataccgta gcgat // LOCUS HUMGLIBSA 176 bp ds-DNA PRI 13-JUL-1990 DEFINITION Human zinc finger protein (GLI) DNA binding region A1. ACCESSION M34816 KEYWORDS GLI protein binding site. SOURCE Human DNA, clone A1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 176) AUTHORS Kinzler,K.W. and Vogelstein,B. TITLE The GLI gene encodes a nuclear protein which binds specific sequences in the human genome JOURNAL Mol. Cell. Biol. 10, 634-642 (1990) STANDARD simple staff_entry FEATURES from to/span description binding 31 53 human zinc finger protein binding site binding 94 116 human zinc finger protein binding site binding 157 176 human zinc finger protein binding site BASE COUNT 68 a 50 c 36 g 22 t ORIGIN 1 aagaacacag gaaccaactg caagagctcc cgaagaccac ccacaatgat ggttgtatgt 61 cagaaggaca caggaaccaa ctgcaagagc tcccaaagac cacccacaat gatggttgta 121 tgtcagaagg acacaggaac caactgcaag agctcccaaa aaccacccac aatgat // LOCUS HUMGLIBSB 166 bp ds-DNA PRI 13-JUL-1990 DEFINITION Human zinc finger protein (GLI) DNA binding region A2. ACCESSION M34817 KEYWORDS GLI protein binding site. SOURCE Human DNA, clone A2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 166) AUTHORS Kinzler,K.W. and Vogelstein,B. TITLE The GLI gene encodes a nuclear protein which binds specific sequences in the human genome JOURNAL Mol. Cell. Biol. 10, 634-642 (1990) STANDARD simple staff_entry FEATURES from to/span description binding 27 49 human zinc finger protein binding site binding 90 112 human zinc finger protein binding site BASE COUNT 60 a 47 c 37 g 22 t ORIGIN 1 acacaggaac caactgcaag agctcccgaa gaccacccac aatgatggtg gtatgttgaa 61 aggacacagg aaccaactgc aagagctccc aaagaccacc cacaatgatg gttgtgtgtc 121 aaaaggacac aggaaccaac tgtaagagct cctgaagacc acccac // LOCUS HUMGLIBSC 313 bp ds-DNA PRI 13-JUL-1990 DEFINITION Human zinc finger protein (GLI) DNA binding region A3. ACCESSION M34818 KEYWORDS GLI protein binding site. SOURCE Human DNA, clone A3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 313) AUTHORS Kinzler,K.W. and Vogelstein,B. TITLE The GLI gene encodes a nuclear protein which binds specific sequences in the human genome JOURNAL Mol. Cell. Biol. 10, 634-642 (1990) STANDARD simple staff_entry FEATURES from to/span description binding 1 23 human zinc finger protein binding site binding 64 86 human zinc finger protein binding site binding 127 149 human zinc finger protein binding site binding 190 212 human zinc finger protein binding site binding 253 275 human zinc finger protein binding site BASE COUNT 110 a 79 c 75 g 49 t ORIGIN 1 caaagaccac ccacaatgat ggttgtatgt cagaaggaca caggaaccaa ctgcaagagc 61 tcccaaaaac cacccacaat gatggtggta tgtcagatgg acacaggaac caactgcaag 121 agctcccgaa gaccacccac aatgatggtg gtatgttgaa aggacacagg aaccaactgc 181 aagagctccc aaagaccacc cacgatgatg gttgtgtgtc aaaaggacac aggaaccaac 241 tgtaagagct cctgaagacc acccacaatg atggttttat gtcagaagga tacaggaacc 301 agctgcaagg gct // LOCUS HUMGLIBSD 184 bp ds-DNA PRI 13-JUL-1990 DEFINITION Human zinc finger protein (GLI) DNA binding region A4. ACCESSION M34819 KEYWORDS GLI protein binding site. SOURCE Human DNA, clone A4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 184) AUTHORS Kinzler,K.W. and Vogelstein,B. TITLE The GLI gene encodes a nuclear protein which binds specific sequences in the human genome JOURNAL Mol. Cell. Biol. 10, 634-642 (1990) STANDARD simple staff_entry FEATURES from to/span description binding 32 54 human zinc finger protein binding site binding 95 117 human zinc finger protein binding site binding 158 180 human zinc finger protein binding site BASE COUNT 67 a 46 c 41 g 30 t ORIGIN 1 aaaagacaca ggaaccaact gcaagagctc ccaaagacca cccacaatga tggttgtgtg 61 tcaaaaggac acaggaacca actgtaagag ctcctgaaga ccacccacaa tgatggtttt 121 atgtcagaag gatacaggaa ccagctgcaa gagctcctga agaccaccca caatgatggt 181 tgta // LOCUS HUMGLIBSE 62 bp ds-DNA PRI 13-JUL-1990 DEFINITION Human zinc finger protein (GLI) DNA binding region B1. ACCESSION M34820 KEYWORDS GLI protein binding site. SOURCE Human DNA, clone B1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 62) AUTHORS Kinzler,K.W. and Vogelstein,B. TITLE The GLI gene encodes a nuclear protein which binds specific sequences in the human genome JOURNAL Mol. Cell. Biol. 10, 634-642 (1990) STANDARD simple staff_entry FEATURES from to/span description binding 11 33 human zinc finger protein binding site BASE COUNT 18 a 26 c 11 g 7 t ORIGIN 1 aaaccagacg cgtggaccac ccaagacgaa attcacactc gaccctagcc ccgccagccc 61 tt // LOCUS HUMGLIBSF 67 bp ds-DNA PRI 13-JUL-1990 DEFINITION Human zinc finger protein (GLI) DNA binding region C1. ACCESSION M34821 KEYWORDS GLI protein binding site. SOURCE Human DNA, clone C1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 67) AUTHORS Kinzler,K.W. and Vogelstein,B. TITLE The GLI gene encodes a nuclear protein which binds specific sequences in the human genome JOURNAL Mol. Cell. Biol. 10, 634-642 (1990) STANDARD simple staff_entry FEATURES from to/span description binding 16 29 human zinc finger protein binding site BASE COUNT 15 a 26 c 15 g 11 t ORIGIN 1 agacccaggc acacagatag accacccagc ttcaggtggg ggaccaccct tgcatcccct 61 ctctgct // LOCUS MLVU3RA 383 bp ss-RNA VRL 13-JUL-1990 DEFINITION Murine leukemia virus CWM-T-15.4 U3 region genomic RNA. ACCESSION M34722 KEYWORDS U3 region; promoter region. SOURCE Murine leukemia virus (strain CWM-T-15.4), cDNA to genomic RNA, from mice (strain CWB). ORGANISM Murine leukemia virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Mammalian type C oncoviruses; Murine leukemia viruses. REFERENCE 1 (bases 1 to 383) AUTHORS Thomas,C.Y., Coppola,M.A., Holland,C.A. and Massey,A.C. TITLE Oncogenicity and U3 region sequences of class II recombinant MuLVs of CWD mice JOURNAL Virology 176, 166-177 (1990) STANDARD simple staff_entry FEATURES from to/span description signal 112 178 MuLV promoter region binding 131 144 CCAAT-binding transcription factor binding 147 152 Moloney leukemia virus binding protein b binding 156 167 enhancer core binding protein binding 350 362 CCAAT-binding transcription factor binding 363 372 glucocorticoid response element BASE COUNT 127 a 105 c 84 g 67 t ORIGIN 1 aatgaaagac cccttcataa ggcttagcca gctaactgca gtaacgccat cttgcaaggc 61 atggaaaaat accagagctg atgttctcag aaaaacaaga acaaggaagt acagagaggc 121 tggaaagtac cgggactagg gccaaacagg atatctgtgg tcaagcacta gggccctagc 181 taaaacaaca acagtttcaa gagacccaga aactgtctca aggttcccca gatgaccggg 241 gatcaacccc aagcctcatt taaactaacc aatcagctcg cttctcgctt ctgtacccgc 301 gcttattgct gcccagctct ataaaaaggg taaaaacccc acactcggcc ggcccagggc 361 caagaacaga tggtccccag aaa // LOCUS MLVU3RB 348 bp ss-RNA VRL 13-JUL-1990 DEFINITION Murine leukemia virus CWM-T-15 U3 region genomic RNA. ACCESSION M34723 KEYWORDS U3 region; promoter region. SOURCE Murine leukemia virus (strain CWM-T-15), cDNA to genomic RNA, from mice (strain CWB). ORGANISM Murine leukemia virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Mammalian type C oncoviruses; Murine leukemia viruses. REFERENCE 1 (bases 1 to 348) AUTHORS Thomas,C.Y., Coppola,M.A., Holland,C.A. and Massey,A.C. TITLE Oncogenicity and U3 region sequences of class II recombinant MuLVs of CWD mice JOURNAL Virology 176, 166-177 (1990) STANDARD simple staff_entry FEATURES from to/span description signal 77 177 MuLV promoter region binding 96 109 CCAAT-binding transcription factor binding 112 117 Moloney leukemia virus binding protein b binding 121 132 enhancer core binding protein binding 142 154 CCAAT-binding transcription factor binding 155 164 glucocorticoid response element BASE COUNT 115 a 97 c 76 g 60 t ORIGIN 1 ctgcagtaac gccatcttgc aaggcatgga aaaataccag agctgatgtt ctcagaaaaa 61 caagaacaag gaagtacaga gaggctggaa agtaccggga ctagggccaa acaggatatc 121 tgtggtcaaa cacctgggcc ccggcccagg gccaagaaca gatggtcccc agaaatagct 181 aaaacaacaa cagtttcaag agacccaaaa actgtctcaa ggttccccag atgaccgggg 241 atcaacccca agcctcattt aaactaacca atcagctcgc ttctcgcttc tgtacccgcg 301 cttattgctg cccagctcta taaaaagggt aaaaacccca cactcggc // LOCUS MLVU3RC 445 bp ss-RNA VRL 13-JUL-1990 DEFINITION Murine leukemia virus CWM-T-25a U3 region genomic RNA. ACCESSION M34724 KEYWORDS U3 region; promoter region. SOURCE Murine leukemia virus (strain CWN-T-25a), cDNA to genomic RNA, from mice (strain CWB). ORGANISM Murine leukemia virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Mammalian type C oncoviruses; Murine leukemia viruses. REFERENCE 1 (bases 1 to 445) AUTHORS Thomas,C.Y., Coppola,M.A., Holland,C.A. and Massey,A.C. TITLE Oncogenicity and U3 region sequences of class II recombinant MuLVs of CWD mice JOURNAL Virology 176, 166-177 (1990) STANDARD simple staff_entry FEATURES from to/span description signal 77 274 MuLV promoter region binding 96 109 CCAAT-binding transcription factor binding 168 179 CCAAT-binding transcription factor binding 238 252 CCAAT-binding transcription factor binding 138 143 Moloney leukemia virus binding protein b binding 209 214 Moloney leukemia virus binding protein b binding 147 158 enhancer core binding protein binding 218 229 enhancer core binding protein binding 180 189 glucocorticoid response element binding 252 263 glucocorticoid response element BASE COUNT 142 a 126 c 104 g 73 t ORIGIN 1 ctgcagtaac gccattttgc aaggcatgaa aaagtaccag agctgaagtt ctcagaaaaa 61 caagaacaag gaagtacaga gaggctaaaa agtacccggc ccagggccaa gaacagatgg 121 tcccagaccg ctaacgacag gatatctgtg gttaagcact agggccccgc ccagggccaa 181 gaacagatgg tccccagacc gctaacgaca ggatatctgt ggttaagcac tagggccccg 241 gcccagggcc aagaacagat ggtccccaga aatagctaaa acaacaacag tttcaagaga 301 cgcagaaact gtctcaaggt tccccagatg accggggatc aaccccaagc ctcatttaaa 361 ctaaccaatc agctcgcttc tcgcttctgt acccgcgctt attgctgccc agctctataa 421 aaagggtaag aaccccacac tcggc // LOCUS MLVU3RD 409 bp ss-RNA VRL 13-JUL-1990 DEFINITION Murine leukemia virus CWM-T-25b U3 region genomic RNA. ACCESSION M34725 KEYWORDS U3 region; promoter region. SOURCE Murine leukemia virus (strain CWN-T-25b), cDNA to genomic RNA, from mice (strain CWB). ORGANISM Murine leukemia virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Mammalian type C oncoviruses; Murine leukemia viruses. REFERENCE 1 (bases 1 to 409) AUTHORS Thomas,C.Y., Coppola,M.A., Holland,C.A. and Massey,A.C. TITLE Oncogenicity and U3 region sequences of class II recombinant MuLVs of CWD mice JOURNAL Virology 176, 166-177 (1990) STANDARD simple staff_entry FEATURES from to/span description signal 112 238 MuLV promoter region binding 131 144 CCAAT-binding transcription factor binding 174 179 Moloney leukemia virus binding protein b binding 183 194 enhancer core binding protein binding 204 215 CCAAT-binding transcription factor binding 216 225 glucocorticoid response element BASE COUNT 133 a 114 c 91 g 71 t ORIGIN 1 aatgaaagac cccttcataa ggcttagcca gctaactgca gtaacgccat tttgcaaggc 61 atgggaaaat accagagctg atgttctcag aaaaacaaga acaaggaagt acagagaggc 121 taaaaggtac ccggcccagg gccaagaaca gatggtcccc agaccgctaa cgacaggata 181 tctgtggtta agcactaggg ccccggccag ggccaagaac agatggtccc cagaaatagc 241 taaaacaaca acagtttcaa gagacccaga aactgtctca aggttcccca gatgaccggg 301 gatcaacccc aagcctcatt taaactaacc aatcagctcg cttctcgctt ctgtacccgc 361 gcttattgct gcccagctct ataaaaaggg taagaacccc acactcggc // LOCUS TTHISOR 6434 bp ds-DNA BCT 13-JUL-1990 DEFINITION T.thermophilus insertion sequences Is1000A and Is1000B. ACCESSION M33159 KEYWORDS insertion sequence. SOURCE T.thermophilus (strain HB8, ATCC 27634), clones pNZ1557 and 1559. ORGANISM Thermus thermophilus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 6434) AUTHORS Ashby,M.K. and Bergquist,P.L. TITLE Cloning and sequence of IS1000 from Thermus thermophilus HB8 JOURNAL Plasmid (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.K.Ashby, 22-MAR-1990. FEATURES from to/span description pept 666 106 (c) ORFA pept < 706 1347 ORFB pept < 1251 1961 ORFC (AA at 1251) pept < 2007 3026 ORFD (AA at 2007) pept 3711 4664 ORF 1A pept 5173 6126 ORF 1B trns 3516 4711 IS1000A trns 4978 6173 IS1000B BASE COUNT 1060 a 2367 c 2037 g 970 t ORIGIN 1 ctcgccgccc tcctcctcgg cgggggccaa gcccacctgc tcttcgccct cgccctcagc 61 ctcggcctcc tttggccccc ccggtgagct cggccaggac cgccttcaag gccctttcca 121 ggcctcgagg ctccccgtgt tctccaagac ccaggtcgcc cgcttgcgct tctcctcctc 181 gggcatctgg gcccgctccc gggccaggac ctcctcccgg gaaagccccg agcgcgccat 241 cacccgcctc acccgctcct ccaggggggc cgccaccaag agggtcccgt ggagcctccc 301 ctcccacccc ttctcaaaga ggagggggat ctccagaaag acgagggggg cctctaggcg 361 ggaaagctcc tccataagaa gccgcctgac ctccgggtgg accacggcct cgagggcctt 421 gagcctctcc gggtcggaaa agacgagccg ggcgagggcc ctccggtcca gcctcccccc 481 caccaccgcc tcggggaaga ggcgcttcag ctcctcctcc ttgttctccc gggcccgggc 541 ggcgagcgcg tccaggtcca agacggggta gccccaggac ctgaggaggg cggccacggt 601 gctcttgccg ctgccgatgt tcccggtgat gccgataata atggggtgct tcgcctcgtg 661 gcccatatgg acttcccctt ttataccccc aaggacgcct tccccgtggg cggggcggtg 721 cgggacctcc tcctgggaag aaggccaacg gacctggact acgccgcctt ggaccccgaa 781 ggggcggccg aggaggcaaa gaggcgcctc gggggaagcc tcttccccct ggaccccaag 841 cggggccact accgcctggt ggtgggggaa aggaccctgg acttcacccc cttggagggg 901 aggctggagg aggacctcct ccggcgggac taccgggtga acgccctcct ctggaagggg 961 ggcgccgtct tcggcctaaa gggggtggag gaggacctga ggcggcgcct cctcgtcccc 1021 gtgcgggagg aaaacctcta ccaggaccac ctgcgaagcc tccggggggt ccgcctcgcc 1081 gccaccttgg gcttcgggct tccccagagg acccgggagg cccttggccg ccacgcccgg 1141 ttcctccagg cccaccccga ggccctcccc gcgcgggaaa gggtcaagga ggagcttgcc 1201 cggctcctcc tctccccgag ggcggcttcg gcctgcgcct tctggagagg gtggggctcc 1261 ttggggtcta cctccccgag ctcgccctcc tcgtgggcct ccatcagggc ggggtgcacc 1321 accttcccgc ctgggaccac accctgagcg cggtcttcca cctcctctgg ctctggcccg 1381 aggcccccct cgaggcccgc ctcgccgccc tcttccacga cgtgggcaag cccctcaccc 1441 gccgctttga ccccgaggtg ggccgcttcc gcttcctggg gcacgccgag gtgggggcgg 1501 agatcgcccg ggcaagcctc ttttggctcc gcttccccaa ggaggtggtg gagcgggcgg 1561 cgggcctggt gcgccgccac atggaccgcc tccccgagga gaggaaggcc ctgcgccgct 1621 tcttcctgag gcgccaggac ctcctcccgg acctcgtcta cctcatggcg gcggaccgcc 1681 tggccacgag gggcgtggag cgcgaggcct gggaggtgct cgggcgctac gaagaggtcc 1741 tgaaagaccc cctgccgcaa aggcccctcc tctccgggga ggaggtgatg gcccttctgg 1801 gcctccaaga aggccccgag gtgggaaggg ccctaaaggc cctcctcgag gcccaggccg 1861 aaggccgggt ggggaccaag gaggaggccc gggcctttct cctatattgg aggggtggaa 1921 gggaggctca ggcttcggga accccagatc acccccattg aggggggctt cctggtctcc 1981 gaccctacgg ggtctacgag aagcccctgg ccctcaccga gggcgggctt ttcctcctct 2041 ccctcatgga ggggaggacc ctggaggagg tgcaggagga ggtgttcaag cgccacgggg 2101 tcctggtgcc gaagaaggag ctggaagacc tggcgaaggc cctggaggag gcgggcctcc 2161 tcctcacgga gaaggtggag gcgaggctca aagaggagga ggaaaagctc aagcgggagc 2221 ggcccatgcg cctcgccggg ctctcctatc ccgagggcga gcgggaggcc cgggccttcc 2281 tcgaggcctt ccgggcgagc tacccggggg aaggcgagga ggcccgggtc ctcctcatgc 2341 cccacctgga gccctcccgc gtccccgagg tctacggggc ggccctcgcc gccctggaga 2401 agaccccgcc ccctgagcgc atctacctgg tgggggtcgc ccacaggccg ctgaaggaga 2461 aggccgccgc ccttcccgtg cccttccaga cccccttcgg ccccgccctg ccggaccttc 2521 ccgccctcca ggccttagac gccctccttc cctttgagct cttcaacacg cccctcgcct 2581 tccgggagga gcacagcctg gagcttcccc tcttcttcct gaaggggcgg ttccccgagg 2641 cccgcgtcct ccccctcctc gtggccaggc gaagcccgga gctcggggag gccctgaagg 2701 tggtcctgcg ggacttcccg ggccttctcg tcctcgccgt ggacctctcc cacgtggggc 2761 cccgcttcgg ggacaccccc ctcacccgca ccctggcgga ggaggcgagg aggcgggacc 2821 tgggcttcct ggaaaggctc gcggaagggg agccggaggc cgccctcgcc ttcctggggg 2881 ccaaccccac ccgcatagac ggggtggagg tggtggcaag cctcctcccc ctcctccggg 2941 aaaggaaggg gaaggtcctg gcccaccgcc tggacctcga ggcccccacc ctaagcgccg 3001 tgggggcggg caccctggtc ctctaaagag ccccagattt ggaacgacgc gaaggcccgg 3061 gcgccgaacc gcccgggccc cttttctggg agcgcccctt tgggaaaggg cccggaaggt 3121 gggtgggcag gcgcgccggg gcctgcgtga aagcctccta gcgcgagggg tacaggcctt 3181 cacccacgac aaaggggtcg cttgatgggc aagacagtaa tccccttacg gggctcaatc 3241 ccttgcaacc cttaggagct acctttttgc gcttcccgtg acgagcgtcg caatcccctt 3301 acggggctca atcccttgca acttccttct aaggaggaat gggcatcatt cacaatcagc 3361 aagtcgcaat ccccttacgg ggctcaatcc cttgcaacgg attcgcctcg gcccgcctag 3421 ggactggatc tatgaccccg tcgcaatccc cttacggggc tcaatccctt gcaaccattg 3481 gtggaggcgc gtgataagcg agtggaggtg aggtggtcgc atgtatcttg ccccctgagg 3541 gaggccccac cgccccggtt gccgggccac cacgcccgcg ggattgatcg gtagccaccc 3601 accccggcct ccccgccgca cagtatcccg gtcgtcaaag accgcatcca catggaaggc 3661 ccacagggtc gggtggccac ctccctcagg ttaccaaaag gagagagccc atgaccttcg 3721 ccggcattga cgtcagcaaa acccacctgg acctggccct cgtctccaac tcccccaaac 3781 ccacccgcct ccgcttcccc aactcccctg aaggccgtca agccctcctc gccgccctcg 3841 cccaccacaa ccccgcctgg gtcgccctgg agcccacggg cgcttaccac ctccccctcc 3901 tcaagctcct ggcagaaaac cgcctccagg tggccctggt caacccctac catctcgccg 3961 ccttccgcaa ggccaaggga gaacgccaga agaccgaccg ccaagacgcc ctcctcctcg 4021 cccgctacgc ccaggtctac cacggggagc tctgggccta caccctcccc ccagaagccc 4081 tccgtgagct caaagccctg gtgggctacc gggaggactt ggccgggcgg gaaagggcca 4141 tcctcaacca gatggaggcg gcggagtggg cggggagcaa ggaggtcctc gccctcctcc 4201 aaaaggagct ggcctgcgtg aaggggcttc tcggggaggt ggaggccagg atccaggccc 4261 tcctcgccac cctccccgag gccgaggtcc tgatggccct gcccggggtg gggccccagg 4321 tggcggcagc ggtgctggcc ctcctgcccc cagagctctg gggccgggcg aagagggcgg 4381 cctcctacgc ggggctcatc cccgagcggg aggagtcggg aaagagcgtg gagaggagtc 4441 ggctctccaa aaaagggcct cccctcctgc ggcgaaagct ctacatgggc gccctggtgg 4501 cggtgcgcca tgacccggag atgcgggcct tctaccaccg cctgctctcg cggggaaaga 4561 gaaagaagca ggcgttggtg gccgtggccc acaagctcct caggcggatg atgggaaggc 4621 tcagggagta ctacgcgacc cagctagatc aaggggtcgc ttgacaggca agacagtatc 4681 aatcccctta cggggctcaa tcccttgcaa ctccgacggc aagatcatcg gggtgcggcg 4741 gctcctcgcc gtcgcaatcc ccttacgggg ctcaatccct tgcaacgcca gtctatgacc 4801 aagcccctag gctccttcac tttgaagtgt cgcaatcccc ttacggggct caatcccttg 4861 caacacctca tgggggagta agataggcgg gaaaggaggg aggtcgcaat ccccttacgg 4921 ggctcaatcc cttgcaacca cgtgaagacg tccctggcct cggtcgtgag ggacgtggtc 4981 gcatgtatct tgccccctga gggaggcccc accgccccgg ttgccgggcc accacgcccg 5041 cgggattgat cggtagccac ccaccccggc ctccccgccg cacagtatcc cggtcgtcaa 5101 agaccgcatc cacatggaag gcccacaggg tcgggtggcc acctccctca ggttaccaaa 5161 aggagagagc ccatgacctt cgccggcatt gacgtcagca aaacccacct ggacctggcc 5221 ctcgtctcca actcccccaa acccacccgc ctccgcttcc ccaactcccc tgaaggccgt 5281 caagccctcc tcgccgccct cgcccaccac aaccccgcct gggtcgccct ggagcccacc 5341 agcgcctacc acctccccct cctcaagctc ctggcagaaa accgcctcca ggtggccctg 5401 gtcaacccct accacctcgc cgccttccgc aaggccaagg gagaacgcca gaagaccgac 5461 cgccaagacg ccctcctcct cgcccgctac gcccaggtct accacgaaga cctccgggcc 5521 tacaccctac ccccagaaac cctccgggag ctcaaagccc tggtgggcta ccgggaggac 5581 ttggccgggc gggaaaggac catcctcaac cagatggagg cggcggagtg ggcggggagc 5641 aaggaggtcc tcgccctcct ccagaaggag ctggcctgcg tgaaggggct tctcggggag 5701 gtggaggcca ggatccaggc cctcctcgcc accctccccg aggccgaggt cctgatggcc 5761 ctgcccgggg tggggcccca ggtggcggca gcggtgctgg ccctcctgcc cccagagctc 5821 tggggccggg cgaagagggc ggcctcctac gcggggctca tccccgagcg ggaggagtcg 5881 ggaaagagcg tggagaggag tcggctctcc aaaaaagggc ctcccctcct gcggcgaaag 5941 ctctacatgg gcgccctggt ggcggtgcgc catgacccgg agatgcgggc cttctaccac 6001 cgcctgctct cgcggggaaa gagaaagaag caggcgttgg tggccgtggc ccacaagctc 6061 ctcaggcgga tgatgggaag gctcagggag tactacgcaa cccagctaga tcaaggggtc 6121 gcttgacagg caagacagta tcaatcccct tacggggctc aatcccttgc aacgaacctc 6181 ctcgggccca ccgagggggt aatcctcgca gagtcgcaat ccccttacgg ggctcaatcc 6241 cttgcaacct gaaccagaag gccctctggg cggcggcgta cgccgtggcg tcgcaatccc 6301 cttacggggc tcaatccctt gcaacgcgat tggcgtcgcc ctccccgtta tcgcttacga 6361 gtcgcatgta tcttgccccc tgagggaggc cccaccgccc cggttgccgg gccaccacgc 6421 ccgcgggatt gatc // LOCUS MLUDNAA 4171 bp ds-DNA BCT 13-JUL-1990 DEFINITION M.luteus ribonuclease P (rnpA), 50S ribosomal subunit protein L34 (rpmH), DNA biosynthesis initiation protein (dnaA), and DNA polymerase III beta subunit (dnaN) genes, complete cds. ACCESSION M34006 KEYWORDS 50S ribosomal subunit protein L34; DNA biosynthesis initiation protein; DNA polymerase III beta subunit; ribonuclease P. SOURCE M.luteus (strain IF03333) DNA. ORGANISM Micrococcus luteus Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; Micrococcaceae. REFERENCE 1 (bases 1 to 4171) AUTHORS Fujita,M.Q., Yoshikawa,H. and Ogasawara,N. TITLE Structure of the dnaA region of Micrococcus luteus: Conservation and variations among eubacteria JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by N.Ogasawara, 05-MAY-1990. FEATURES from to/span description pept 435 37 (c) ribonuclease P (rnpA) pept 601 464 (c) 50S ribosomal subunit protein L34 (rpmH) pept 944 2491 DNA biosynthesis initiation protein (dnaA) pept 3242 > 4171 DNA polymerase III beta subunit (dnaN) signal 843 848 dnaA -35 signal signal 866 871 dnaA -10 signal BASE COUNT 672 a 1558 c 1361 g 580 t ORIGIN 1 ctcgagcacg acgaagggcg agggcgtgac ggtcatctac gcgtgctccg aggcggcggg 61 caggtgcggc ttcagggcct tgccgagcgc gccgacggtc tcgcggcgca gcagcgcgta 121 gtccgcctcg gcggcggcag gcagcgcgcg gacctgcacc agaaccggca ggtcgcgcag 181 cgggggcagg cgcatctgct ccgcgacgac ggcgcgcagc cgccgcttca cccggttgcg 241 ggtcaccgcg ttccccacgg ccttggacac gacgaacccg gcccgcggcc gaggggcgct 301 cggcgacgtg gaccgggtct gatcggggtc cgtggcgacg ctcaccacga cggtgcgccg 361 acccgcgcgg gtgccggtgc gaccgaggtg acggaactcc gcgggcgtgc ggacgcgacg 421 gtcacggggc agcacggggc gcgccgtccc tggaccgagg tcctcaggcg gacagctcgg 481 cgcggttctt gccgcggcga gcggacagga tggcgcggcc ggcgcgggtg cgcatgcggg 541 cacggaagcc gtgcttgcgg gcacgacggc ggttgttggg ctggaacgtg cgcttggtca 601 ctgtgatctc tccacgacgt gttgtgtgcc ttcgaccgtc gacctgagca ggtggaccag 661 ggggaaggac cggtgaggct ctgacatgaa atccccggcg ggcacgatca accgacaccc 721 gggggaccag acaactctag tggccggggt acgcaggggt caaacgggtg cgcgcggatc 781 ccggacgccc gtgggcgccc gggacggtcg tccacaggat gataggggtt gtccaccgtc 841 tgtgcacaac cctgtggaca accgttagac tccgagcacc gtcgtccgtc tcgcgcagcc 901 gcccccgtcc ccggggacgt ctgccccagg aaaggaagcc tccgtggtgg cagaccaggc 961 cgtgctcagc tcgtggcgct ccgtcgtggg ctccctcgag gacgacgcgc gggtcagcgc 1021 ccgcctcatg ggcttcgtct acctggccca gccgcagggc ctcatcggca atacgctcct 1081 gctggccgtg ccgaacgaga ccacccgcga gacgctccag ggcacccagg tggccgacgc 1141 cctcacggac gccctgaccc aggagttccg cgaggagatc ctgctggcca tctcgatcga 1201 cgcgaacctg cagccgccgc ggaccccctc ctccgaggcc cgccgctcct ccctcgccgg 1261 cgggccgtcc ggcgccgcgg cgccggacgt cgagctgccc ccggccgcga ccgccgccac 1321 ctcccgccgc gccgtcgccg aggagctgcc gggtttccgc atcgagccgc cggccgacgt 1381 cgtgcccgcc gcgaacgcgg ccccgaacgg gaacggcaag ccgacccccg cgccgccgtc 1441 gacgtccgcg gagaccagcc gcctcaacga ccgctaccac ttcgagacgt tcgtgatcgg 1501 ctcgtccaac cggttcgcgc atgcggccgc gaacgcggtg gccgaggcgc cggcgaaggc 1561 gtacaacccg ctgttcatct acggcgagtc cggcctgggc aagacgcacc tgctgcacgc 1621 gatcgggcac tacgcccgcc gcctctaccc gggcttgcgg gtgcggtacg tgaactcgga 1681 ggagttcacc aacgacttca tcaactccat ccgccacgac gagggcgcct cgttcaagca 1741 ggtctaccgc aacgtggaca tcctgctgat cgacgacatc cagttcctgg cggacaagga 1801 ggcgacggtc gaggagttct tccacacctt caacacgctc tacaacaaca acaagcaggt 1861 ggtcatcacc tcggacctgc cgcccaagca gctctccggg ttcgaggacc ggctgcgctc 1921 ccgcttcgag tgggggctga tcacggacat ccagccgccg gacctcgaga cgcgcatcgc 1981 gatcctccgg aagaaggcgg aggccgaggg gctcgtggcc ccgccggagg cgctggagta 2041 catcgcctcg cgcatctcca cgaacatccg cgagctcgag ggtgcgctga tccgcgtgac 2101 cgcgttcgcc tcgctcaacc ggcagaccgt ggacatcgag ctggccgagc atgtgctcaa 2161 ggacctgatc acggacgaga cggcgcacga gatcacgccg gagctgatcc tgcacgccac 2221 gggggagtac ttcaatctca ccctcgagga gctgaccagc aagtcccgca cccgcacgct 2281 ggtgacggcg cggcagatcg ccatgtacct gctgcgcgag ctgaccgaga tgtcgctgcc 2341 caagatcgga caggtcctgg gcggccgcga ccacaccacc gtcatccacg cggaccgcaa 2401 gatccgcgag ctgatggccg agcggcggac gatctacaac caggtcaccg agctcaccaa 2461 cgagatcaag cggaaacagc gcggcgcctg agccacccgc cgtcccaccc tcgagtcccg 2521 gttcgaacct tcgaccggga ctcgagcctt cgtccacatt gtccaccggt gtggacaact 2581 ctgtggagaa cccccgtcgc tgtggacaac ctccgcccct gtggatggcg gaatcaggcc 2641 gctcacgccg cgtccacacc cctgggggcg acgacggtgc acaggctggt gaatcacacg 2701 gatgccattc ggcggtccac acgtatccac agatctccac agccgaaaac cgcatgatct 2761 cgcgggttcc gacgggtggg gacgcaccct gtggacagga atgtgggcaa ctggggggtg 2821 gatccgggga cgagtctgga caactccggt cgaacctgtg cagagcaggg ggaggacctc 2881 cggggctcgt ccacaccgga tgcacacccc gaacgagcgg gcgtccacac ggttccccca 2941 accggatccg catgctgacg cggcagaacg gcgttctcca caggatccac acccgctacc 3001 actccttctg atcccccttc agagattccg ccaagcaaca tcatctccgc cgcctcgcac 3061 cgagctgagg gccccgagag cccgggctgc gcccgagcac tcagaatcct tggaatcaca 3121 cgcttgtcat tcgcttccga cctgacaccc gccgcgcgga ccgcctatgc tgaggcccgc 3181 gtgtcgccgt tcccccgggc ccgcaacccc cgtcatcacc cccagagagg cagtgcagct 3241 cgtgaagttc accgtcgaac gcgacatcct caccgacgcc gtctcctggg ccgcccgctc 3301 cctgtccccg cgcccgccgg tgccggtgct ctcgggtctg ctgatcacgg ccgaggccgg 3361 cgtcgtgtcc atcgcgagct tcgactacga gacctccgcg cgcctggaga tcgaggcgga 3421 cgtcgagacc gcgggccagg tgctcgtctc cggccgtctg ctgaacgaca tcgtccgctc 3481 cctgccgcag gcccaggtca cggtggagct cgacggcggc aaggtgctcg tcacctgccg 3541 atcctcccgc ttctccctgg ccacgatgcc cgtgggcgac tacccggccc tgcccgagct 3601 gccggccccc gcaggcaccg tggacggggc cgcgttcgcc cacgccgtcg cccaggtcac 3661 cgtcgccgcc tccaaggacg acaccctgcc gatcctcacc gccgtgaagg tggagatcga 3721 gggcgacacc ataaccttcc tggccacgga ccgctaccgg ctggccatga aggagatccg 3781 ctggaccccg gcggacccgt ccatctcgac gtcgctgctc atcaaggcgc gcacgctcac 3841 cgaggtcgcc aagtccctcg gctccggcgg cgacctggag atcctcctcg gtcagaccgc 3901 cgacctcgtg ggcttcgcct ccggtggccg ccgcaccacc tccgtgctcg tggacggcga 3961 gtaccccaag atccgctcgc tcttccccga gtccagcccc atccaggccg tggtggacac 4021 cgccgccctg gtcgaggcgt cccgccgcgt ggccctcgtg gccgagcgca acaccgccct 4081 gcgcatggtg ttcacggagg gccaggtcac cctggacgcc ggcaccggcg atgacgcgag 4141 cgccaacgag tccgtgccgt gcacgctcga g // LOCUS CE1CG 6646 bp ds-DNA circular BCT 13-JUL-1990 DEFINITION Plasmid ColE1, complete genome. ACCESSION J01566 M33100 KEYWORDS cea gene; colicin; colicin E1; colicin E1 immunity; complete genome; kil gene; origin of replication. SOURCE Plasmid ColE1 DNA. ORGANISM Plasmid Colicin E1 Prokaryota; Bacteria. REFERENCE 1 (bases 1008 to 1370) AUTHORS Tomizawa,J.-I., Ohmori,H. and Bird,R.E. TITLE Origin of replication of colicin E1 plasmid DNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 74, 1865-1869 (1977) STANDARD full staff_review REFERENCE 2 (bases 1410 to 1599) AUTHORS Bastia,D. TITLE Determination of restriction sites and the nucleotide sequence surrounding the relaxation site of ColE1 JOURNAL J. Mol. Biol. 124, 601-639 (1978) STANDARD full staff_review REFERENCE 3 (bases 1 to 1613; 6577 to 6646) AUTHORS Oka,A., Nomura,N., Morita,M., Sugisaki,H., Sugimoto,K. and Takanami,M. TITLE Nucleotide sequence of small ColE1 derivatives: Structure of the regions essential for autonomous replication and colicin E1 immunity JOURNAL Mol. Gen. Genet. 172, 151-159 (1979) STANDARD full staff_review REFERENCE 4 (bases 623 to 832) AUTHORS Morita,M. and Oka,A. TITLE The structure of a transcriptional unit on colicin E1 plasmid JOURNAL Eur. J. Biochem. 97, 435-443 (1979) STANDARD full staff_review REFERENCE 5 (bases 1 to 192; 384 to 519; 6644 to 6646) AUTHORS Patient,R.K. TITLE Characterization of in vitro transcription initiation and termination sites in Col E1 DNA JOURNAL Nucleic Acids Res. 6, 2647-2665 (1979) STANDARD full staff_review REFERENCE 6 (bases 4783 to 5353) AUTHORS Ebina,Y., Kishi,F., Miki,T., Kagamiyama,H., Nakazawa,T. and Nakazawa,A. TITLE The nucleotide sequence surrounding the promoter region of colicin E1 gene JOURNAL Gene 15, 119-126 (1981) STANDARD full staff_review REFERENCE 7 (bases 1 to 6646) AUTHORS Chan,P.T., Ohmori,H., Tomizawa,J.-I. and Lebowitz,J. TITLE Nucleotide sequence and gene organization of ColE1 DNA JOURNAL J. Biol. Chem. 260, 8925-8935 (1985) STANDARD full staff_review REFERENCE 8 (bases 57 to 123) AUTHORS Zhiang,S., Yan,L. and Zubay,G. TITLE Regulation of gene expression in plasmid ColE1: Delayed expression of the kil gene JOURNAL J. Bacteriol. 170, 5460-5467 (1988) STANDARD simple staff_entry REFERENCE 9 (bases 639 to 753) AUTHORS Tomizawa,J.-I. TITLE Control of ColE1 plasmid replication: The process of binding of RNA I to the primer transcript JOURNAL Cell 38, 861-870 (1984) STANDARD full staff_review REFERENCE 10 (bases 5046 to 5110) AUTHORS Parker,R.C. TITLE Mitomycin C-induced bidirectional transcription from the colicin E1 promoter region in plasmid ColE1 JOURNAL Biochim. Biophys. Acta 868, 39-44 (1986) STANDARD full staff_entry COMMENT The L (light) strand is shown. This sequence was kindly supplied over computer network by J.Lebowitz (04-APR-1985). The open reading frames noted by [7] are named in FEATURES by the molecular mass of their predicted products or by the names given by [7]. FEATURES from to/span description pept 400 59 (c) colicin E1 immunity protein (imm) pept 448 585 kil protein pept 1609 1746 mob1 protein (gtg start codon) pept 1803 1612 (c) RNA I inhibition modulator protein (rom; gtg start codon) pept 4362 3943 (c) entry exclusion protein 2 (exc2) pept 4862 4443 (c) entry exclusion protein 1 (exc1) pept 5140 62 colicin E1 protein (cea) RNA 641 > 641 II RNA RNA 751 645 (c) I RNA ((3' end +/- 1 bp) [4] mRNA 1449 > 3936 mob mRNA (5' end putative) [5],[7] mRNA 5056 > 585 colE1-kil mRNA [7] mRNA 409 < 59 (c) imm mRNA [5],[7] mRNA 4887 < 3943 (c) exc mRNA [5],[7] binding 5063 5093 LexA binding site [7] ORF 605 895 10.2 kd ORF [7] (ttg start codon) ORF 1241 870 (c) 13.8 kd ORF [7] (gtg start codon) ORF 1189 1350 6.0 kd ORF [7] ORF 1408 1238 (c) 6.4 kd ORF [7] ORF 1347 1526 6.9 kd ORF [7] (gtg start codon) ORF 1709 1500 (c) 7.6 kd ORF [7] ORF 1609 1746 mob1 ORF (gtg start codon) ORF 1843 2190 mob2 ORF ORF 2180 3733 mob3 ORF ORF 2214 2399 mob4 ORF (gtg start codon) ORF 2868 3386 mob6 ORF ORF 3393 3626 mob7 ORF (gtg start codon) ORF 3657 3818 mob8 ORF ORF 3766 3936 mob9 ORF (gtg start codon) ORF 4258 4647 14.5 kd ORF [7] ORF 5435 5653 8.8 kd ORF [7] ORF 5852 6013 6.7 kd ORF [7] conflict 83 84 aa in [3],[7]; a in [5] conflict 99 100 aa in [3],[7]; a in [5] conflict 135 135 a in [3],[7]; g in [5] conflict 144 144 a in [3],[7]; g in [5] conflict 174 174 a in [3],[7]; g in [5] conflict 484 485 aa in [3],[7]; a in [5] conflict 491 492 tt in [3],[7]; t in [5] conflict 506 512 ctaacta in [3],[7]; caca in [5] conflict 517 519 cct in [3],[7]; cgg in [5] conflict 528 528 c in [7]; g in [3] conflict 652 653 aa in [3],[7]; aaa in [4] conflict 1111 1111 g in [7]; t in [1],[3] orgrpl 1197 1197 replication origin (+/- 1 bp) [1] conflict 1366 1366 a in [3],[7]; g in [1] conflict 1416 1417 cc in [3],[7]; c in [2] conflict 1454 1454 c in [3],[7]; t in [2] cutss 1466 1465 (c) relaxation (nic) cut site [2] conflict 1581 1582 aa in [3],[7]; a in [2] conflict 4784 4787 ctcc in [7]; cgc in [6] BASE COUNT 1760 a 1421 c 1798 g 1667 t ORIGIN 2 bp downstream of EcoRI site. 1 ttctatgctc ctatattgat aagaataaac ttaatactat aaatgaggtg ttagggattt 61 aattattctt tattgatata aaaagtccta gcaatccaaa tgggattgct aggaccaaac 121 aaagtagatt atatagcata aataggttta attttgctac gggggcgtta tttaggtttt 181 ttcttctttc gaaaaaatct ttctttatga agttaaaagc tatgtattca atagcatatt 241 ttgaatatgg acatagaata gtgcttatca ctattgcata tagcatctta tctgacacaa 301 ggaaataata cccttcgctg ttttttgtta taaggtatat atatataagt gtgcagtaca 361 ggccaaataa aatatttttt atgtagtatc ttaagctcat aaattaaacc tcgccatata 421 ttcttttcat tttataagga tcgagttatg aggaaaagat tttttgtggg aatattcgcg 481 ataaacctcc ttgttggatg tcaggctaac tatatacctg atgttcaggg agggaccatc 541 gcaccatcct cctcttctaa actgacgggg atcgcggttc agtagaaaag attaaaggat 601 cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc 661 taccaacggt ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg 721 gcttcagcag agcgcagata ccaaatactg tccttctagt gtagccgtag tcgggccact 781 acttcaagaa ctctgtagca ccgtttgtgc catcatcgct ctgctaatcc ggttaccagt 841 ggctgctgcc agtggcgtta aggcgtgcct taccgggttg gactcaagac gatagttacc 901 ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg 961 aacgacctac accgaactga gataccaaca gcgtgagcta tgagaaagcg ccacgcttcc 1021 cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac 1081 gagggagctt ccagggggaa acgcctggta gctttatagt cctgtcgggt ttcgccacct 1141 ctgacttgag cgtctatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc 1201 ctgctacgtg gccttcttcc tgttcctggt cttttgctca catgttcttt ccggccttat 1261 cccctgattc tgtggataac tgtgttaccg tttttgtgtg agtcagtacc gctcgccgca 1321 gtcgaacgac cgagcgtagc gagtcagtga gcgaggaagc ggaaaagcgc ctggacgtgc 1381 attttctcct tacgcatctg tgcggcattt cacacccggc atggcgtact tttcatacaa 1441 tccgcactga tgccgcatgg ttaagccagt atacactccg ctatcgctac gtgactgggt 1501 cagggctgcg ccccgacacc cgctaaaacc tgctgacgcg ccctgacggg cttgtcagct 1561 cccggcatcc gctcacagac aagctgtgac cgtctccggg agctgcatgt gtcagaggtt 1621 ttcaccgtca tccccgaaac gtgcgaggca gctgcggtaa agctcatcgg cgtggtcgtg 1681 aagcgattca caaatatcgg cctgttcatc tgcgtccagt tcgttgagct tctccagcag 1741 cgttaatgtc tggcttctga taaagcgggc catgttaagg gcggtttttt cctgtttagt 1801 cactgatgcc tccgtgtaag ggggatttct gttcatgggg taatgatacc gatgaaacgc 1861 gagaggatgc tcacaatacg ggttactgat gatgaacatg cccggttact tgaacgctgt 1921 gagggtaaac aactggcggt atggatgcgg cgggtctgcc tgggggagcc ggttgcccgt 1981 tccggaaaac tgccgacact ggcaccgccg ttactgcgtc agctggccgc catcggaaat 2041 aacctgaatc agacagcccg taaggtgaac agcgggcagt ggtcttccgg tgaccgggtt 2101 caggtggtgg ccgcactgat ggccatcggg gatgagctgc gccggctgcg tctggctgtc 2161 agggaacagg gggcgcggga tgatagttaa atttcatgcc aggggaaaag gtggtggcag 2221 tggtccggtt gattacctgc tggggaggga gcgtaaccgc gaaggcgcaa cggtgcttca 2281 gggtataccg gaagaagtcc gggaactcat cgatgccacg ccatttgcga agaaatacac 2341 gtccggtgtt ctgtcgttcg cggagaagga gctgccgccg ggaggacgtg aaaaagtgat 2401 ggcgagcttt gagcgtgtac tgatgcccgg tctcgaaaaa aatcagtaca gcatcctgtg 2461 ggtggagcac caggacaagg gacggcttga gctgaatttt gtcattccga acatggagct 2521 acagaccgga aaacgcctcc agccgtacta cgaccgcgca gacaggccta gaattgatgc 2581 ttggcagacg ctgttaaatc accattacgg gctgcatgac ccgaacgccc cggagaaccg 2641 caggacgctg acactccctg ataacctgcc tgaaacgaaa caggcgcttg ctgagggcgt 2701 cacgcgaggt atagatgcac tttaccatgc cggagagata aaaggccgtc aggatgtgat 2761 tcaggcgctc actgaggcgg ggctggaagt ggtcagggtg acgcgaagca gtatcagcat 2821 tgcagatccg aacggcggga agaatatcag gctgaaagga gcattttatg agcaatcttt 2881 tgcagacggg cgcggagttc gagaaaaagc tgaaagagag agccgaatct acagagaaaa 2941 tgctgaacaa cgagttcagg aggctcggcg aatctgtaag cgaggctgtg acatcaaacg 3001 agacgaaaat cagagacgct atagccctgt tcacagcctc gacagaggaa tcgctggaaa 3061 aacaccggga aggggtgaaa gaggcgatga tgcagcacag gagggacgtg ttaaagctgg 3121 cagggaatac gggcatgatg ttactgggga tagtctttct cctgtttacc gcgagtggcg 3181 ggacgctctg gtatcttgga gggaggatac aggcgaacct ggaagaaatc aggaagcagg 3241 aagagacatt gcagaaactg aacgcgaaga catggggcgt ggagtttgtg caggacggga 3301 acaggaaatt ccttgtcctt ccgtacggga aatcagcgga ggtgattccc tttcagggga 3361 aagagtgggt acatctgaag gagtgacaca gagtgacaga gctggaaaca catttgctga 3421 acgccttaga gcagctgcaa caggactata tgcagcggct gagcgaatgg gagagcgcct 3481 tcgtggaatt gcagaagatg ttttcgctta cgcaacggga caacgcgatg ctgaacgagc 3541 gggtcatgca gttgagtcag caggtgcaac acttgagcga gcagacagaa cgcttgagcc 3601 agttatacag cgagaactgg agataagaga ggaacggctg atacaggagc gcgaacatgt 3661 gttatcccta gaacgggagc gtcagccgga aatacaggaa cgcacgctgg atggcccttc 3721 gctgggatgg tgaaaccatg aaaaatggca gcttcagtgg attaagtggg ggtaatgtgg 3781 cctgtaccct ctggttgcat aggtattcat acggttaaaa tttatcaggc gcgatcgcgc 3841 agtttttagg gtggtttgtt gccattttta cctgtctgct gccgtgatcg cgctgaacgc 3901 gttttagcgg tgcgtacaat taagggatta tggtaaatcc acttactgtc tgccctcgta 3961 gccatcgaga taaaccgcag aaatcgtgtc agccagcagc ctggattttt tcggggtaag 4021 gtctttgctg cggtaatcag tacccgcaaa ttgagttgtg ccccggagtg agtttaaaaa 4081 ttcttggctg tgaaactgct gaatgcgttg ttgcacatca ctccgtgatt ttcctgtcgc 4141 tttgtcttta gctcccatat cccagaactg ccgaaaaaaa ggcaccatca ttcgtgttgt 4201 gtcggctttt tgggtggaaa agttgggatc aaaaccgtca tcagaagcat atacgtaatg 4261 ccgggtatgt tgttccacgc tgccctttgg ggtacaggct gtgagagcta ctgttgctgt 4321 tatcatggcg ataagacaaa gtttgtttcc ggtacgtctc atgagtgtct ccctacctta 4381 aagtatttat tctcggtgag ttattatccg aagactattt tatcatcagt ctctgacagc 4441 gttcagcgac cgaacaacct ggaccagaat ccccgtttct tttccagttc cagtgcctgc 4501 ctgagctggg ctatctcatt ttgtagctgt tcccgttctg cttcctggcg tctgcgatcc 4561 atatcctgtg cctgtttatc ctcaagcatc agcgtcaggc attgtttcag ctcattcagt 4621 tcccggagaa tgcgttctgt ctgctgatca tgtggatttt ctgcatgtcc ctcactgtgc 4681 ctttctggtg tctcattctg ctttaattcg ccgtatgccc ggatcagttc actggtttca 4741 aactcccgtc gtccgtccct gccggtacgg taactgacaa ggcctccgcg catatcacgg 4801 taaaactgtg aacgcgatct gcctgtcatt tttagtgcgt cccggagtgt gtgccatgcc 4861 ataaagtgac agtgtcccat agatgtctca tctcatagtt tcagtaaaac ataatgaggt 4921 ctgagaacgg taatgtttgt gctggttttt gtggcatcgg gcgagaatag cgcgtggtgt 4981 gaaagactgt ttttttgatc gttttcacaa aaatggaagt ccacagtctt gacagggaaa 5041 atgcagcggc gtagctttta tgctgtatat aaaaccagtg gttatatgta cagtatttat 5101 ttttaactta ttgttttaaa agtcaaagag gattttataa tggaaaccgc ggtagcgtac 5161 tataaagatg gtgttcctta tgatgataag ggacaggtaa ttattactct tttgaatggt 5221 actcctgacg ggagtggctc tggcggcgga ggtggaaaag gaggcagtaa aagtgaaagt 5281 tctgcagcta ttcatgcaac tgctaaatgg tctactgctc aattaaagaa aacacaggca 5341 gagcaggctg cccgggcaaa agctgcagcg gaagcacagg cgaaagcaaa ggcaaacagg 5401 gatgcgctga ctcagcgcct gaaggatatc gtgaatgagg ctcttcgtca caatgcctca 5461 cgtacgcctt cagcaacaga gcttgctcat gctaataatg cagctatgca ggcggaagca 5521 gagcgtttgc gccttgcgaa agcagaagaa aaagcccgta aagaagcgga agcagcagaa 5581 aaggcttttc aggaagcaga acaacgacgt aaagagattg aacgggagaa ggctgaaaca 5641 gaacgccagt tgaaactggc tgaagctgaa gagaaacgac tggctgcatt gagtgaagaa 5701 gctaaagctg ttgagatcgc ccaaaaaaaa ctttctgctg cacaatctga agtggtgaaa 5761 atggatggag agattaagac tctcaattct cgtttaagct ccagtatcca tgcccgtgat 5821 gcagaaatga aaacgctcgc tggaaaacga aatgaactgg ctcaggcatc cgctaaatat 5881 aaagaactgg atgagctggt caaaaaacta tcaccaagag ccaatgatcc gcttcagaac 5941 cgtccttttt ttgaagcaac cagacgacgg gttggggccg gtaagattag agaagaaaaa 6001 caaaaacagg taacagcatc agaaacacgt attaaccgga taaatgctga tataactcag 6061 atccagaagg ctatttctca ggtcagtaat aatcgtaatg ccggtatcgc tcgtgttcat 6121 gaagctgaag aaaatttgaa aaaagcacag aataatctcc ttaattcaca gattaaggat 6181 gctgttgatg caacagttag cttttatcaa acgctgactg aaaaatatgg tgaaaaatat 6241 tcgaaaatgg cacaggaact tgctgataag tctaaaggta agaaaatcgg caatgtgaat 6301 gaagctctcg ctgcttttga aaaatacaag gatgttttaa ataagaaatt cagcaaagcc 6361 gatcgtgatg ctatttttaa tgcgttggca tcggtgaagt atgatgactg ggctaaacat 6421 ttagatcagt ttgccaagta cttgaagatt acggggcatg tttcttttgg atatgatgtg 6481 gtatctgata tcctaaaaat taaggataca ggtgactgga agccactatt tcttacatta 6541 gagaagaaag ctgcagatgc aggggtgagt tatgttgttg ctttactttt tagcttgctt 6601 gctggaacta cattaggtat ttggggtatt gctattgtta caggaa // LOCUS COGTUB1A 2672 bp ds-DNA PLN 13-JUL-1990 DEFINITION C.graminicola beta-tubulin-1 (TUB1) gene, complete cds. ACCESSION M34491 KEYWORDS beta-tubulin-1. SOURCE C.graminicola DNA. ORGANISM Colletotrichum graminicola Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Diaporthales; Diaporthaceae. REFERENCE 1 (bases 1 to 2672) AUTHORS Panaccione,D.G. and Hanau,R.M. TITLE Characterization of two divergent beta-tubulin genes from Colletotrichum graminicola JOURNAL Gene 86, 163-170 (1990) STANDARD simple staff_review FEATURES from to/span description pept 662 673 beta-tubulin-1 (TUB1), exon 1 740 763 beta-tubulin-1, exon 2 818 843 beta-tubulin-1, exon 3 905 946 beta-tubulin-1, exon 4 1014 1068 beta-tubulin-1, exon 5 1124 1908 beta-tubulin-1, exon 6 1962 2355 beta-tubulin-1, exon 7 IVS 674 739 TUB1 intron A IVS 764 817 TUB1 intron B IVS 844 904 TUB1 intron C IVS 947 1013 TUB1 intron D IVS 1069 1123 TUB1 intron E IVS 1909 1961 TUB1 intron F BASE COUNT 632 a 754 c 660 g 626 t ORIGIN 1 ggatccggct cggcgttacg gactggaaac ccaaagggca acaatgagta gagcacttag 61 caaccgtgca acaacaacac catagccagg cagctgtgta tccaggggaa cgaacccatt 121 ggtccaaggt gttaccaagc ctctaggcaa taatctccca gttacggcca tacaaacctg 181 attcatgatg cctagaattc tgagcaaagc aaaacagagc atccatgtac caatcatttc 241 tgtggatata tactcaacgc ggactagctt ggtctctttt gggcgccgcc tccgagagaa 301 tgggcatcag ccataggtcc ctttcttttt tccttttccc atccccctcc cctccctacc 361 catcactctc tcttccaaag agtaagcaaa actgtgctgc cggagtaggc agccgtcgaa 421 tcagaaaggc agccattcaa cgatgcgcaa acatgtgggg attggaaatc cagaccgcaa 481 agttacactc tgctcgggcc tatccgatac gcagcgataa gttgtcggtc cttggatagg 541 ggatttgttg tgatttacag gcatatgtat ctataagatg accacctgtc ttgagtagcg 601 gggaaaatct ctgctttcca ccatcaactt ttgacaatcc acaccattcc caaagttcat 661 aatgcgtgag attgtgagtt gtaccttgtc cacgtaccta ttgaggtatt gattctcagc 721 tgactcgggg atccaacaga tacacctcca aaccggccaa tgtgtacgta tctttcgctg 781 ccgaatggat gagccctgac tgacgaagaa tctgtagggc aaccaagtcg gtacagcatt 841 ctggtatgag gatgggaccg tcgatgcacc ttcaagagat gagactgatg ttggtcacta 901 ccaggcaaac cattcacggc gagcacgggc ttgatcaaga tggagtgtga gtctcaccgc 961 catgtaacgc gacgtcacct cgctcatcga gcgtcaagtt gacattttca tagattccgc 1021 ggctctgatg agcagcagtc cgagaggctg agcgtctact tcaccgaggt aactcgatac 1081 agtcgtcgcg cacttctccc tattcctttc tgacacgatg caggctgcca aacaaaagta 1141 cgttcctcgt gcagtcctgg ttgaccttga gcccgcaacc atggatgcaa tccgttccgg 1201 tccccttggt gacttcttcc gtcccgacaa catggtctac ggacagtccg gcgcgggtaa 1261 caactgggca aagggccact acaccgaagg cgccgagctg gttgaccaag tcctcgacgt 1321 cgtgcgccgc gaagccgaag cctgcgacag tctgcagggt ttccagatca cgcactccct 1381 cggtggcggc accggctctg gcatgggcac cctcctcatc gcaaaggtcc gcgaggaatt 1441 ccctgatcgt atgatggcga ccttctctgt cctgccgtct gccaaggtct ctgaggtcgt 1501 tgtcgagccg tacaatgcga ccttgtcgat tcatcagctc gtcgaaaact cggacgagac 1561 gttctgtatc gacaacgagg ccctgtatga tatctgccgc cgcaccctga agcaagcgca 1621 cccgtcctac ggacatctca accacctggt gtccagggtc atgtccggct tgacgacggg 1681 cttccgcttc ccggggcaat tgaacgccga cctccgcaag ctggccgtga acctcgtccc 1741 cttccccaga cttcacttct tcaccgtcgg attcgcgccg ctcacctcct cggcctcctt 1801 ctcaaacctg ggtatcgccg agctcacgca gcagatgttc gacccgaaga acgtcatgtt 1861 ggcgtctgat ttccgtgacg gacgcttctt gacttgctcc accatgttgt gagttggtcc 1921 cctcatgtcg cttgatgaat tctttgctaa cgtaccttta gccgtggaaa ggtgtccatg 1981 aagcaggtcg aggaacagat ccaggccatc aagaacaaga actcagccaa cttcgtggag 2041 tggatcccga ataacatcca gactgctcat tgctctgtgc cgcccaaggg actggacgta 2101 tcatcgacgt tcattggaaa ctccaccgca atccagaaca gtttccgcag agtcggcgac 2161 cagttcagcc tcatgttccg gagaaaggct ttcttgcatt ggtacactgg agagggcatg 2221 gatgagatgg aattcactga ggccgagtct aacatgaacg atctggtatc cgagtaccag 2281 cagtaccagg atgcaggcat ggatgacgag tacggcgaag agtacgaaga cgaggctcct 2341 gctgaggagg aatgagaagg caagacaact cttcgatgtc cttcccgagg tgaagttact 2401 ttatattcaa ggtagttgat ggtctttgaa ccttttccaa tcactgtaca cacccaatcc 2461 accatcatag tccgagcagc tcacttgaag ttattttccc tttcgactga gaaggcaata 2521 gagttgttct atctcgttgc ttgtgcagac aacgaagtgg atcgttacgc aacccctagt 2581 cttgacttgc aaatcatatc gtcccaagat ccagatggct attagttcat ggatcggtac 2641 ccttgtagat ccacctgaag tggctccccg gg // LOCUS COGTUB1B 2614 bp ds-DNA PLN 13-JUL-1990 DEFINITION C.graminicola beta-tubulin-2 (TUB2) gene, complete cds. ACCESSION M34492 KEYWORDS beta-tubulin-2. SOURCE C.graminicola DNA. ORGANISM Colletotrichum graminicola Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Diaporthales; Diaporthaceae. REFERENCE 1 (bases 1 to 2614) AUTHORS Panaccione,D.G. and Hanau,R.M. TITLE Characterization of two divergent beta-tubulin genes from Colletotrichum graminicola JOURNAL Gene 86, 163-170 (1990) STANDARD simple staff_review FEATURES from to/span description pept 441 452 beta-tubulin-2 (TUB2), exon 1 655 678 beta-tubulin-2, exon 2 738 763 beta-tubulin-2, exon 3 840 881 beta-tubulin-2, exon 4 942 996 beta-tubulin-2, exon 5 1068 1858 beta-tubulin-2, exon 6 1917 2310 beta-tubulin-2, exon 7 IVS 453 654 TUB2 intron A IVS 679 737 TUB2 intron B IVS 764 839 TUB2 intron C IVS 882 941 TUB2 intron D IVS 997 1067 TUB2 intron E IVS 1859 1916 TUB2 intron F BASE COUNT 561 a 802 c 607 g 644 t ORIGIN 1 cccgggtctt ggatcggaga gaacagagga gcagaatagg gattattggg taggcaggta 61 cgtgcccagg taacacgtcg tgcctgccaa tcaacgccga tctgagtcct cgacttgttg 121 cgacccacca tcaaatagtg cctgggaatg gtctttcccc aatcaggatc gtgcacggat 181 cccaatagta aacaagcgag ctgcaccctt ttcctctctg gcctgtctct gggcacctgg 241 cctgtggcgg tgagcgaaat cacgtttacc cgcaaaacaa aaatcaacaa cctttcccct 301 acctaaccac accttaatct catccacttt ccaaccacgt ccaccttgga agcttcgcgc 361 agctctcaag cacctcccac tcgtcctctc ctttgctcca tcgtcggcct agtcagctca 421 agagcttttc tccattcaca atgcgtgaga ttgtaagtcc ttcccctcaa tcattcgtaa 481 caaataaacc tgcgaccgac gcgtttggcg acgaatcgtc ggccttgccc ctgaacgtac 541 cccgccgaca tttccaccca acactggtcc tcaccgaaga cgaccacgat tgccatcacc 601 gacagtatgc accttgggga tatatcgctg accattgatt gttatactcg ataggttcac 661 ctccagaccg gccagtgcgt aagtcttctc tgatcccaac caacaatcca aggtgcgggg 721 ctaacttctt tgaatagggt aaccagattg gtgctgcctt ttggtgcgta gccagaccga 781 catcatcgac ttcggcgaga ttggcctcga aaagacattg gatattaata cgggcacagg 841 caaaacatct ctggcgagca cggcctcgac agcaatggcg tgtatgttgc caacctccag 901 atctggccac ttcctcgagt tcaccgctaa tttctcaaca gttacaatgg cacctctgag 961 ctccagctcg agcgcatgag cgtctacttc aacgaagttt gttatcctag ccccccccag 1021 gaagcagaca aacctattga tgaatactga ccttgtcacg tacccaggct tccggcaaca 1081 agtatgtccc tcgcgccgtc ctcgtcgact tggagcccgg taccatggat gctgttcgcg 1141 ccggcccctt cggccagctt ttccgccccg acaacttcgt ctttggccag tccggtgccg 1201 gcaacaactg ggccaagggt cactacaccg aaggagctga gcttgtcgac caggtccttg 1261 acgtcgtccg ccgcgaggct gagggctgcg actgccttca gggcttccag attacccact 1321 cccttggtgg aggtactggt gccggtatgg gtactctgtt gatctccaag attcgcgagg 1381 agttccccga ccgcatgatg gctacctttt ccgtcgttcc ctcccctaag gtttccgaca 1441 ccgttgttga gccctacaac gccactctct ccgtccacca gctggttgag aactccgatg 1501 agaccttctg cattgataac gaggctctct acgacatctg catgcgtact cttaagctct 1561 ctaacccctc gtacggcgac ctgaaccatc tcgtctctgc cgtcatgtcc ggtgtcacta 1621 cctgcctgcg tttccctggt cagctgaact ctgacctgcg taagctggcc gtcaacatgg 1681 ttcctttccc ccgtcttcac ttcttcatgg tcggattcgc tcccctgacc agccgtggtg 1741 cccactcttt ccgcgctgtc agcgttcctg agctcaccca gcagatgttc gaccccaaga 1801 acatgatggc tgcctctgac ttccgcaacg gtcgctacct gacctgctct gccatcttgt 1861 aagtgtcatt tccagccaac ctacaatgtc agcacttgct aacagctgcc tctcagccgt 1921 ggtaaggtcg ccatgaagga cgtcgaggac cagatgcgca acgtcctgaa caagaactct 1981 tcttacttcg tcgagtggat ccccaacaat gtccagaccg ccctctgctc cattcctccc 2041 cgcggcctca agatgtcctt cacctttgtc ggtaactcta ccgccatcca ggagctcttc 2101 aagcgtgtcg gtgagcagtt cactgccatg ttccgtcgca aggctttctt gcattggtac 2161 actggtgagg gtatggacga gatggagttc actgaggctg agtccaacat gaacgacttg 2221 gtctctgagt accagcagta ccaggacgct ggtgttgatg aggaggagga ggagtacgag 2281 gatgacgccc ccctggagga ggaggtttaa gcgttgtctg aaaatgctgt gccaccttgg 2341 ccatgtcttc acccaacccg tctgcggtgg catttcgctt cactattcca gctctgcact 2401 ggaaatgggc ttctagatat acctctctta gtagttcgcc tggcgtatca aaatgagtac 2461 gaagaatcag agattactct gtacaaatta ttggcaacat caaatgcata gttttatggc 2521 aattgcgaca cctctaatct tgccagagtt caagtattcg tatcttcttt cgtgactgac 2581 agattatcta tgtttaaacg tggacaccct cgag // LOCUS DDICNPA 6372 bp ds-DNA INV 13-JUL-1990 DEFINITION D.discoideum cyclic nucleotide phosphodiesterase gene, complete cds. ACCESSION M23449 KEYWORDS cyclic nucleotide phosphodiesterase. SOURCE D.discoideum (strain NC4; cell line AX3-K) DNA, clones pGP-1 and pC34. ORGANISM Dictyostelium discoideum Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; Dictyosteliidae. REFERENCE 1 (bases 1 to 6372) AUTHORS Podgorski,G.J., Franke,J., Faure,M. and Kessin,R.H. TITLE The cyclic nucleotide phosphodiesterase gene of Dictyostelium discoideum utilizes alternate promoters and splicing for the synthesis of multiple mRNAs JOURNAL Mol. Cell. Biol. (1989) In press STANDARD full staff_review REFERENCE 2 (bases 3380 to 4379) AUTHORS Faure,M., Franke,J., Hall,A.L., Podgorski,G.J. and Kessin,R.H. TITLE The cyclic nucleotide phosphodiesterase gene of Dictyostelium discoideum contains three promoters specific for growth, aggregation, and late development JOURNAL Mol. Cell. Biol. 10, 1921-1930 (1990) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by R.H.Kessin, 23-MAR-1989. FEATURES from to/span description pept 4127 4146 cyclic nucleotide phosphodiesterase, first expressed exon (EC 3.1.4.17) 4297 5635 cyclic nucleotide phosphodiesterase, exon 2 pre-msg 1163 6060 CNP mRNA and introns IVS 1780 2266 CNP intron A (no splice consensus at 2266) IVS 2377 4077 CNP intron B IVS 4147 4296 CNP intron C signal 6015 6020 poly-A signal BASE COUNT 2615 a 750 c 504 g 2503 t ORIGIN 1 tgatcatttt gtaaatcttt caattaaatt tcaacttaca cattaataat aataaaaata 61 ataataataa taatatcaga cattcacagg atttttataa atccaatttt atttttcatt 121 gatgtcgttt tattaattga cattcaaatt gtgaataact gacaataaat aaaacagttc 181 atatatttat ttgtttgaaa aaaaaaaaaa aaaaaaaaaa tcaaagttgc atgtgcgagt 241 gtaaattggg gtttttttta taaatctaaa ttagttattt tacaatttta gaaccactac 301 caaattacaa aaaaaaaaaa aaaaaaacca ctcatttatt tttttttttt tttttttttt 361 ttttatcaca tagattccaa taaaattgtg aaacataaag aggaaaacga taatatggat 421 tattagaaaa taatcaatac tttttttttt tttttttttt tgtgttttat tggtttttaa 481 ttaaaagttt tgggggtttt ttattttaat tttttttttt tttgaaaaac atactcactc 541 acacaaaacg caatcaataa atctattttt tgaaaataaa aaaaatcttt tttttttttt 601 tttttttttt ttttctagtt ttcaaaaaat tcatttaatc gcattacatt ataaaataaa 661 aaaattcggt gtcttaaatc tcccaaaatc ttgaatgatt tgtgtgtgtt aaaataataa 721 tgaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaatt ccaactttta gatccccaat 781 cacataggtg tgatatatct aagttattga aaattgtgtt gatattgttt ttcttacttt 841 tttttttttt ttaaaataac cactttactt tctaaatatt gcatcgcaat gggggtatta 901 aaagttttgt tatttttatt tacacatctt gatttcaaga taaaaatttt tagtgtaaat 961 atatcaaaca ccatagagac ttttcacatt atcaagtgtt tttgactctc tatcacactc 1021 acaaatctga tttttatttt tatttttatt tttttttttt ttttatttta tttttttttt 1081 attttatttt atttttctcc aaccacaatt gatttcaaaa aaaaaaaagt ctataaaatc 1141 aaaaaaaaat ttttttttta attaatcaaa aacaaaaaca aaaaacaaaa ataaataaaa 1201 aaaaaaaaaa aaaacttgta acattaataa taataatttg taattttaaa taccaacttc 1261 aattccatat tacttttttt gtatcattat ttatatatct caatctattt taaatcacca 1321 ttgtatcact actaaaaaaa aaaaaaaaaa aaaaatcaaa aatcaaaaat ccaaatttat 1381 aaaaaaaaaa aaaaaaatca aaattgtaaa aaaataacaa tatcaccctt ttttctgtat 1441 tattacttgt atttgtaaca ctataaatta aataaataaa aatttaattt tgtatcatta 1501 ccacactatt aatttttttt ttacaatttt tgatcaataa taaatcgtta aaaaaaaaaa 1561 aaaaaaaaat ttttaataat aaaataaatt aataccaaaa ataatcaacc aaccaaccaa 1621 aaattattta attatttatt aaccatttta tcataattaa taaataaaaa aaaaaaaaaa 1681 ataaaaaaaa aacatcaaaa aaaaaaaata aaaaaacaat accatatcaa attatctctc 1741 gttctgtact attactcctt aaaaaattta ttaaaaaagg tatgtaaata atataattat 1801 tattatttta tttattatta tgattccatt taaatgtttc aaaatttggt ttttttattt 1861 tttccttatt ttcgttcgct ttgaccaatc attttttttt attattttaa tttttaatta 1921 tttttcgggt tcattccaac atgacaatgg aaaaaattaa aaaagggtat aggaaaagcg 1981 caaatcaaaa ggaaaaaaaa aaatatcttg cgtttgccaa tttttatttt tttttatttt 2041 tttttttatt tttttttttt ttgaaatcaa tcaaaatttc caaccttata ataatccaat 2101 ttaatcacaa atccagtgat tttatcgttg tgtgtgtgtg gtctgatttt aaaatcacca 2161 actgtttttt tttataatta tttttttttt attttatttt tcaatttctt ttcttttttt 2221 tttttttatt ttttaatatc ttttattttt tttttttttt ttttttttat tattattaat 2281 aatcataaaa ctgtaatagt acttaaattg taacaaaata aaaatataaa aataataata 2341 ataatttaac aataagaata gcaacaatta ttcattgttt gtatttaatc aatatttaga 2401 atttttatta tttttttttt atatttatta tttattatct atctagaata caagtagaaa 2461 atataattat ataattttat aatacccatt ataatatatt ttattttatt ttattttatt 2521 ttattttttt ttctcctaaa atcacccaaa caccccacac acaattttat gtaatattca 2581 ataacaacat tgattttcac accactcact gcccaattca attcaaacga tagtgagtaa 2641 ttaattttgt gtttagatgg tagatagaga tgatgtattt tattttttta tatcatccca 2701 agttccatat aaatcaacat tttatgatgt ttatttgttt ttcaatataa catttttatt 2761 taataaaaaa atttaattaa atcatatata acactactta ataaaaacat caataatcac 2821 accacccact accaataata aaaaataaat aaaaaaaata aataaataaa aaattaatta 2881 aatgtttata tttttataac accacacata cgcccacact ccctctttct ccccactact 2941 aaacttgata tttttattgt tttttttttt atacaataaa acattgattg ttatattttt 3001 aattttatga ttaaaaaaaa aaaaaaaaaa caaaaacaaa aactatatat aaaaataccc 3061 aaaaataaaa tatttttaag ggtagagtga taataattaa tcatacacaa aataatgaaa 3121 gtcacaagta gttataaaaa aataaaaaaa aaaaaaaaaa aaaggaaaaa aaaaaaaagt 3181 aataataata ataagaataa taaaaataat aaaaataatt taaaccctat atattgacat 3241 aatggatagc tctataatgt ataatagtag tatatttatt ttaaattata tataaaaata 3301 atttacgatt tcaaagatcg aaaccaatta taacactact gttatgaaac aaatgtgtat 3361 attttacact aacctcgtcg atatctattg tgtgtaacaa ttaggggttt tattatttcc 3421 ttttttttta aaaaaatttt tttttttttt ttatttacta ctatttttaa aaaaaaaaaa 3481 aaaaaaaaaa aaaaaaaaaa gataatttat tttgatcttt tatgtgttga tacacctttt 3541 catgtttaca cacaaacaaa aaacactcga taaattttat tattattttt tatttgtttt 3601 taatttaatt gatattaaaa aaaaaaaaat taaaaaaaaa aaaaaaaaaa aattttaagt 3661 gatatataat ttttaaaatt ttttatttat ttattttttt aaaaaattaa tttttcaaac 3721 aataatacat tcaaaattaa aacaataata acagattcaa aatatcattt ggttttgttt 3781 tttttttttt ttttatttaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaattata 3841 catataaata tttataacaa taacccacaa acgccacaca ctcacttatt ttttttaatt 3901 ttaaatttta tttattttat ttatttattt atttatttaa tcaaaaaaaa aaaaaaaaaa 3961 aaaactaaaa ttaaaattac taacttattt ttttttttat aaatatatat aaaaaaaaaa 4021 aaaaaataaa attttttttt tttccaatat ctttatcttt tttttttatt ataaaagaag 4081 aaagaattaa tatagttcga ataactacaa aaaatataca aaaaaaatgg cattaaataa 4141 aaaattgtaa gtggaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaattta aaaatagaaa 4201 ttgatttgtt gtttatacta attttttttt tttttttttg gaattttttt tttggaaatt 4261 tttatttttt attttttttt tttttcccaa aaatagaatt agtttattac ttttaatttt 4321 tataatttta aatattgtaa attcccatca acaagaagat tgtgatgatg acgatgaaga 4381 tataggaatt tcagcagaga gatcagaaag aagatcagtt aaaaacagta acgatggtag 4441 taatttttat aatttaaatg attattatac accagagaat tggaatcatt atagtggatc 4501 atttgcaact aaagattgta gggatgcatc atacataacc ataccattgg gtacaactgg 4561 tggattggat gaaggtaatc tctcatcatt cctcttgaca aagaaaggta gtaatctttt 4621 cattgcattg gatgcaggta cagtatggca aggtgttaga agattaacca ccttcaaata 4681 tttcaataca ctcttcaata ttacctatcc atcatgggca gttttaccag agcaaagaac 4741 ctcatggttc ctcaagaaac acgttatgag ttatttcatt ggtcacagtc atttagatca 4801 tgtaggtggt ttaattttag tttcaccaga ggattattta gcaaagaatt ggatcgatgt 4861 tcaaccacca atcaacaatg gtatcatggg tctcattcgt aaattagggt tcaaaccaac 4921 tgatttcacc tcttcaagta tcctccaaaa gaaaaccatc atgggtttac catcaaccat 4981 caattcaatc tctactaatc ttttcaataa tcaagtttgg ccaaatcttc catcatttgg 5041 tcgttatcaa tactttagtt tagcaagtgg cattgaatat ccattcacag agttggtccc 5101 atacaatgcc accactatgt ctttggtcgc caatgaattc cctttctctg tcaaagttaa 5161 accatttgaa ttgtgtcacg ataatttaat atcaacttct ttcctcttca ctgatagtat 5221 ctctggtgaa caaatcgctt tcttctctga tactggtgtt ccatcctctg tcgcttgtga 5281 ttgggaaggt aaaatctatg ccgtctggaa acaaattaaa atcgataaat taaaagctat 5341 ctacatcgaa acttctttcc caaataatac tccagactct gctatgtttg gtcatctccg 5401 tccaagagat gtcatgaaat taatggatca actcttggtt caatccattc aaacctctcc 5461 accaatgact aatctcaaac atgttaaatt aatcatcgaa cacattaaac cacaagttgc 5521 tgaagatcca aatggttgga caactcaaag agtaatctat caacaattaa aagaagcaaa 5581 taataatggt gttagaatta ttattccaaa tcaaggtgat ccaatttgta tttaaataat 5641 taattaatat aatatataat aattaaacat atttatttat ttattccacc aaaaaaaaaa 5701 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa attaaaaaaa aaaaaaaaaa aaaaaaatta 5761 aataataata ataactaaaa tagaaaacaa taccaaaaaa aaaaacaaaa aaaaaaaaaa 5821 aaaccaatcg acaatttttt tttttttttt tctctaaaac aaatacataa aaaaaatcat 5881 cttataaata actcctcctc ctttcccctc cttctataat tattcagttt cattaattgt 5941 atagattttt tttttttaag tcaaccaacc tccttttttt ttaaaccaca aaaaaaaaaa 6001 aaaaaaaaaa aaaaaataaa ccattttatt tttttttttc caaaaaaatt ttccagtttt 6061 tttttttttt tattttagaa acattttatt tttactttta aaaaatatgg gtgtttattt 6121 tgagaaatat attttttatt taataacaac accctttaag gaaagataga taataattaa 6181 ttatattttt attttacata atgcattttt tttttttttt ttgggtttta ttttattatt 6241 ttttttttta ttgttttaat attattattt ttttttcata aaactttttg agaatcacta 6301 tggaagtcac ttaaattatt attttctctt tcgttatcat cttcttggaa atcaacttgt 6361 tcatcttgat ca // LOCUS HUMCPGISL 2520 bp ds-DNA PRI 13-JUL-1990 DEFINITION Human nested gene protein gene, complete cds. ACCESSION M34677 KEYWORDS coagulation factor VIII; nested gene protein. SOURCE Human 4X B cell line (ACTT GM1202) DNA, clone p542. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2520) AUTHORS Levinson,B., Kenwrick,S., Lakich,D., Hammonds,G.Jr. and Gitschier,J. TITLE A transcribed gene in an intron of the human factor VIII gene JOURNAL Genomics 7, 1-11 (1990) STANDARD simple staff_review FEATURES from to/span description pept 585 1268 CpG island protein, exon 1 1271 1681 CpG island protein, exon 2 IVS < 1 > 2520 coagulation factor VIII, intron 22 mRNA 528 2265 CpG island protein mRNA (5' end = end of cDNA) binding 523 532 Sp1 binding site binding 543 552 AP4 binding site BASE COUNT 418 a 843 c 761 g 498 t ORIGIN Chromosome X. 1 cccgggaaag tcctcagcat gatggtgttc gttggggcgg gtgggcctca agaccatgac 61 tgcagtgttg cgacaggacg gacttctaag gcaaccgggg tggggacgga gaggtggacc 121 cggtgcttgg caaggggccc tggaccggaa gacggaggac tgagtagtcg gggcgggggg 181 ggggggggac aacagtctag aaatgcgcag acatggaatt agggtcacga gtttcgtttg 241 ggtcatgttc agatggagag gctgtgggat gcctttagca acctagacca cttgcattgc 301 agccgagcta gcagcagctg gtgccccgcc aggccccgcg taccttccca gggtcttgga 361 tccgcatccc cgagaccctc gatttgcata cgccgctcac agcccggcca ggccccgcct 421 ccgatcccgc tctttgcacc tcatcagggc aaggggcagg ctcgggctgg ggagccggcg 481 cgggcacggc ggcagcccgc cgagggggac gcagggcacg tcgccccgcc ccggcccgcc 541 cgccagctgc ggcggcgcgt cgggaagtgc ttggggcggc gagcatggcg gcagcggctg 601 caggcctggg cggcggcggc gccggcccgg gacccgaggc cggggacttc ctggcccgct 661 accggctggt atcgaacaag ctgaagaagc ggttcctgcg gaagccgaac gtggcggagg 721 ccggcgagca gttcggacag ctgggccggg agctgcgcgc ccaggagtgt ctgccctacg 781 cggcctggtg ccagctggcg gtggcgcgct gccagcaggc gctcttccac gggcccgggg 841 aggcgctggc cctcaccgag gccgcccgcc tcttcctgcg gcaggagcgc gacgcgccag 901 cgctggtctg ccccgccgcc tacggggagc cgctgcaggc cgccgccagc gccctgggcg 961 ccgcggtgcg tctgcacctc gagctgggcc agccggccgc cgccgccgcc ctctgcctcg 1021 agctggccgc cgccctgcgc gacctgggcc agccggccgc cgccgccggt cacttccagc 1081 gcgccgccca gctccagctg ccccagctgc ccctggccgc gctgcaggcg cttggcgagg 1141 ccgcctcctg ccagctgctg gcgcgcgact acaccggcgc cctggcggtc ttcacgcgca 1201 tgcagcgcct ggcgcgggag cacggcagcc acccggtgca gtcactgccg ccgcccccgc 1261 cgccggcacc ccagcccggg cccggggcga cgcccgccct accggccgcg ctgcttcctc 1321 cgaactccgg ctcggcggcg ccctctcccg cgccctgggc gccttctcgg acgtgctggt 1381 ccgctgcgag gtgtcccgcg tgctgctgct gctcctcctg caaccaccgc ccgccaagct 1441 gctgccggag cacgcccaga ccctggagaa gtactcctgg gaggcttttg acagccacgg 1501 gcaggagagc agcggccagc ttcccgagga gctctttctg ctgctccagt ctttggtcat 1561 ggctacccac gaaaaggaca cggaagccat caagtcgctg caggtggaga tgtggccact 1621 gttgactgct gagcagaacc acctccttca cctcgttctg caagaaacca tctcccccta 1681 ggacagggag tctgatccat cccattcacc cagtgacttc tttttgccca ggcctggact 1741 ttttgcatca gtcacgttaa ccagatgact ttgcctgtta ccaaacctca tgcatccacg 1801 tttgcgtctg gggaggaata aaaagacatc gttcccgctt ctgcgttttg ttattcctac 1861 tgccgccata ggaattattt cgttggctga acgttaccag caccccgaga acacattttg 1921 atagaatcag agtagaggac atggctgtct tctaaaaagc cacgacatga aaatgacaat 1981 ccctttcgtc tccttcctcc gctgcttcca cctaacgcag cctcctgcct ccgcctttgt 2041 ttcatagtga ggattttatt ttgcacggcg ccctccctct aaatacctac cctagatgat 2101 ttcatcctgc ccctcacttc ttaacatatt cctgtgtctt tgtaatggcc aaatttctcc 2161 ttccactcgt ccgcacggta cgtcttcatg ggagtcattt tattccttac agcttccttg 2221 taaccacagc cctaaatcca tgataaagct actcttgtac tcttcactca tgtgtggttt 2281 tggtgtccgt cgtctataaa atgtgactct cttacctact tgtcaggagg ccagcagcag 2341 aaccagatag ctgtttgggc tccaggatgt attcgttcct ttgaaatttg cgctatgcct 2401 cctatgtggg caacgaacct gcagagatca tctatgatgt cctccgtact cgggtgactg 2461 caatgcagat acatccagtg cagcgggtct ggtgccgatg catcgatttg cattgaattc // LOCUS HUMIGHCY 372 bp ss-mRNA PRI 13-JUL-1990 DEFINITION Human Ig active mu-chain mRNA Vv-D-J6-region, partial cds. ACCESSION M34678 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; mu-immunoglobulin; variable region. SOURCE Human CD5+ EBV transformed B-cell, cDNA to mRNA, clone EBV 383. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 372) AUTHORS van der Heijden,R.W.J., Bunschoten,H., Pascual,V., Uytdehaag,F.G.C.M., Osterhaus,A.D.M.E. and Capra,J.D. TITLE Nucleotide sequence of a human monoclonal anti-idiotypic antibody specific for a rabies virus-neutralizing monoclonal idiotypic antibody reveals extensive somatic variability suggestive of an antigen-driven immune response JOURNAL J. Immunol. 144, 2835-2839 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 372 Ig mu-cahin Vv-D-J6-region (AA at 1) /hgml_locus_uid="LW0029F" /nomgen="IGHM" /map="14q32.33" recomb 294 295 Vv-region end/D-region start recomb 316 317 D-region end/J6-region start BASE COUNT 84 a 107 c 108 g 73 t ORIGIN 1 gaggtgcagc tggtgcagtc tggagcagaa gtgaaaaagc cgggggagtc tctgaagatc 61 tcctgtaagg gttctggata cagctttagc acctcctgga tcggctgggt gcgccagatg 121 cccgggaaag gcctcgagtg gctggggacc atccatcctg gtgactctga cacaagaaat 181 agcccctcct tccaaggcca ggtcaccatc tcagtcgaca agtccatcag cacggcctcc 241 ctgcagtggg gcagcctgaa ggcctcggac accggtatat attactgtgc gagactgata 301 acaggtcctc gacactacta ctactacgct atggacgtct ggggccaagg gaccacggtc 361 accgtctcct ca // LOCUS HUMIGHCZ 306 bp ds-DNA PRI 13-JUL-1990 DEFINITION Human Ig expressed mu-chain gene Vv-region. ACCESSION M34680 KEYWORDS immunoglobulin heavy chain; mu-immunoglobulin; variable region. SOURCE Human CD5+ EBV transformed B-cell DNA, clone EBV 383ex. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 306) AUTHORS van der Heijden,R.W.J., Bunschoten,H., Pascual,V., Uytdehaag,F.G.C.M., Osterhaus,A.D.M.E. and Capra,J.D. TITLE Nucleotide sequence of a human monoclonal anti-idiotypic antibody specific for a rabies virus-neutralizing monoclonal idiotypic antibody reveals extensive somatic variability suggestive of an antigen-driven immune response JOURNAL J. Immunol. 144, 2835-2839 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 305 Ig mu-chain Vv-region precursor (AA at 1) /hgml_locus_uid="LW0029F" /nomgen="IGHM" /map="14q32.33" sigp < 1 12 Ig mu-chain Vv-region signal peptide matp 13 > 305 Ig mu-chain Vv-region BASE COUNT 66 a 81 c 97 g 62 t ORIGIN 1 ggagtctgtg ccgaggtgca gctggtgcag tctggagcag aggtgaaaaa gcccggggag 61 tctctgaaga tctcctgtaa gggttctgga tacagcttta gcaccttctg gatcggctgg 121 gtgcgccaga tgcccgggaa aggcctggag tggctgggga ccatccatcc tggtgactct 181 gacacaagaa atagcccgtc cttccaaggc caggtcacca tctcagtcga caagtccatc 241 agcacggcct tcctgcagtg gggcagcctg aaggcctcgg acaccggtat atattactgt 301 gcgaga // LOCUS HUMIGHVV 306 bp ds-DNA PRI 13-JUL-1990 DEFINITION Human Ig germline mu-chain gene Vv-region. ACCESSION M34679 KEYWORDS immunoglobulin heavy chain; mu-immunoglobulin; variable region. SOURCE Human CD5+ EBV transformed B-cell DNA, clone 251. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 306) AUTHORS van der Heijden,R.W.J., Bunschoten H Pascual,V., Uytdehaag,F.G.C.M., Osterhaus,J.D.M.E. and Capra,J.D. TITLE Nucleotide sequence of a human monoclonal anti-idiotypic antibody specific for a rabies virus-neutralizing monoclonal idiotypic anti- body reveals extensive somatic variability suggestive of an antigen-driven immune response JOURNAL J. Immunol. 144, 2835-2839 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 305 Ig mu-chain Vv-region precursor (AA at 1) /hgml_locus_uid="LW0029F" /nomgen="IGHM" /map="14q32.33" sigp < 1 12 Ig mu-chain Vv-region signal peptide matp 13 > 305 Ig mu-chain Vv-region BASE COUNT 67 a 83 c 95 g 61 t ORIGIN 1 ggagtctgtg ccgaggtgca gctggtgcag tctggagcag aggtgaaaaa gcccggggag 61 tctctgaaga tctcctgtaa gggttctgga tacagcttta ccagctactg gatcggctgg 121 gtgcgccaga tgcccgggaa aggcctggag tggatgggga tcatctatcc tggtgactct 181 gataccagat acagcccgtc cttccaaggc caggtcacca tctcagccga caagtccatc 241 agcaccgcct acctgcagtg gagcagcctg aaggcctcgg acaccgccat gtattactgt 301 gcgaga // LOCUS MUSPLII 874 bp ss-mRNA ROD 13-JUL-1990 DEFINITION Mouse placental lactogen II (PL-II) mRNA, complete cds. ACCESSION M14647 KEYWORDS placental lactogen II. SOURCE Mouse (strain Balb/c) placenta, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 874) AUTHORS Jackson,L.L., Colosi,P., Talamantes,F. and Linzer,D.I.H. TITLE Molecular cloning of mouse placental lactogen cDNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 8496-8500 (1986) STANDARD simple staff_review FEATURES from to/span description pept 60 728 placental lactogen II (mPL-II) precursor sigp 60 152 placental lactogen II (mPL-II) signal peptide matp 153 725 placental lactogen II (mPL-II) mRNA < 1 874 mPL-II mRNA BASE COUNT 234 a 212 c 190 g 238 t ORIGIN 1 agcactcggg gaacagcagc cttctggtgt tgtcagcact tcaagagaac tcctcagaga 61 tgaagctgtc tttgagtcaa ccatgctcct tctctggggc actcctgttg ctggcagtgt 121 caaacctact tgtctgggag aaggtgactt ccttaccaaa ttatcgatta cccactgaaa 181 gcctatacca acgtgtgatt gtggtgtcac acaacgccca tgatcttgct tcaaaagcct 241 tcatggaatt tgaaatgaag tttggtagga cagcttggac ctatggcctg atgttaagcc 301 cctgccacac tgctgcaatc cttactccag aaaacagcga gcaagtccac cagacaacat 361 cggaagacct tctgaaagtg tccatcacta ttttacaagc ctgggaagag cctctgaaac 421 acatggtggc agcagtggct gctcttccac atgtacctga tactctgctg tcaagaacaa 481 aggagttgga ggaaagaatt caagggcttc tggaaggact gaagatcata ttcaataggg 541 tttacccagg agctgttgca agtgactata ctttctggtc tgcatggtca gatttgcagt 601 catctgatga atccactaag aacagtgctc ttagaacctt atggcggtgc gtgcgcaggg 661 atacacataa agttgacaat tacctcaagg tcctgaagtg tcgtgatgtt cataacaaca 721 actgctgaac atctatccct cttctctgtc tcttagaagg tccctcgtga tctagacctt 781 caaagcacca atgaatttcc ccttctttgg tgcctttcta gattgagttg tcatcttacc 841 cccaaataaa ctgattcttt gcaaatgcta aatt // LOCUS MUSRARGA 2715 bp ss-mRNA ROD 13-JUL-1990 DEFINITION Mouse retinoic acid receptor gamma (mRAR-gamma-A) mRNA, complete cds. ACCESSION M34476 KEYWORDS retinoic acid receptor gamma. SOURCE Mouse embryo, cDNA to mRNA, clone lambda-mE13.20. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2715) AUTHORS Giguere,V., Shago,M., Zirngibl,R., Tate,P., Rossant,J. and Varmuza,S. TITLE Identification of a novel isoform of the retinoic acid receptor gamma expressed in the mouse embryo JOURNAL Mol. Cell. Biol. 10, 2335-2340 (1990) STANDARD simple staff_review FEATURES from to/span description pept 253 1629 retinoic acid receptor gamma mRNA 1 2715 RAR-gamma-A mRNA BASE COUNT 578 a 839 c 741 g 557 t ORIGIN 1 cggggctctg cggccttcct tcccccgccc tcccctccag cagtttccac caggtccctc 61 acctcagcct ggcccagtat gtaggaggga atctctgcag aggccagagg gatccttgga 121 aacccactgg acagaccagg cagggtgggc acggagcctc ccaggccagg gcagtgggca 181 tgggcggggg ctgtagctga agaccacccc cgcctgctgc agagtccaag ggattcccac 241 gccgcagcta ccatggccac caataaggag agactctttg cgcccggtgc cctggggcct 301 ggatctggtt acccaggagc aggcttccca ttcgccttcc caggtgcact cagagggtcg 361 ccaccatttg agatgctgag ccctagcttc cggggcctgg gccagcctga cctccccaag 421 gagatggctt ctctctcggt ggagacacag agcaccagct cggaggagat ggtacccagc 481 tctccctcac ccccaccacc tcctcgggtc tataagccat gctttgtatg caatgacaag 541 tcttctggct accactatgg ggtcagctcc tgtgaaggct gcaagggctt cttcagacgc 601 agcattcaga aaaacatggt gtatacatgt caccgtgaca aaaactgtat catcaacaag 661 gtcaccagaa atcgatgcca gtactgcagg ctacaaaagt gtttcgaagt gggcatgtcc 721 aaggaagctg taaggaacga tcgaaacaag aagaaaaagg aggtaaaaga ggagggctcg 781 cccgacagct atgaactgag tccacagtta gaggaactca tcaccaaggt cagcaaagcc 841 caccaggaga cttttccctc actctgccag ctgggcaagt acaccacgaa ctccagtgca 901 gatcaccggg tgcagctgga cctggggctg tgggacaagt tcagcgagct ggccaccaaa 961 tgcatcatca agattgtgga gtttgcgaag cggctgcctg gttttacagg gctcagcatt 1021 gccgaccaga tcacgctgct caaggctgct tgtctggaca tcctaatgct gcggatctgt 1081 acaaggtata ccccagagca ggacactatg acattctcgg atgggctgac cctgaaccga 1141 acccagatgc acaatgctgg ctttgggccc cttacagacc tcgtctttgc ctttgccggg 1201 cagctgctgc ccctggagat ggatgacacc gagactgggc tacttagtgc tatctgcctc 1261 atctgtggag accgaatgga cctggaagag cccgagaagg tggacaagct gcaggagccc 1321 ctgctggaag ccctgaggct ctatgcccgg cgacgagacc cagccaaacc ctacatgttc 1381 ccaaggatgc tgatgaaaat caccgacctc cggggcatca gcactaaggg agcagaaagg 1441 gctataaccc tgaagatgga gattccaggc ccgatgccac ccctgatccg agagatgctg 1501 gagaacccgg agatgtttga ggacgactcc tcgaagcctg gcccccaccc caaggcttcc 1561 agtgaggacg aagctccagg gggccagggc aaaaggggcc aaagtcccca acctgaccag 1621 gggccctgac ctaccccgtt gtggggttgg gccccaggca gcagactgac catttcccag 1681 ataccgccag tgactggggg aggacctgcc ccgcccactc tccacccttt taatgagctc 1741 gttattttgc caaagtttct aggggtgcct gtgttcatcc ctgtcctgtt ctaactggct 1801 ccctctacag tcccgcgggg atctgctgca cacctaccag aagagctgga gaaggggcga 1861 gcctgggtct agactctaaa atctcagcac tgcctttcgg attcaggcca caggctcccg 1921 acgcaagagg aagccccgcc ttcccatagc cttttcctct gccaggtgct tgggcctctg 1981 ggagcaaaca ggaacactag agaccaaaag gggggccccc cccgagggga gggctgagcc 2041 acccttgtgc cccaccctga gcgcctgact ctgtgtgatg aatctgccgc gctcagtgcc 2101 cccggtgccc catcctaatc ctcctgccgg gtgtaggggc aggctggggc ctgcgttttc 2161 tagggtgggg cctgagggga agggagaatg tgagcccgag cccgtctgta cctctgctga 2221 agtacagcac tcatcctggg ccctccatta tataagcctc tcccttactt ctgtctcgtg 2281 ccttggcctc ccgccctcat ctcagccatc aggcagggcc ctcctacact acagaggggc 2341 caggggtccc tctcctagtg ccttctaccc ttgatccccc cagagcagct tggccccggg 2401 gaggggagat gctgcttagc tgatcccacc ttgacccaga ggaagcctct atttatttat 2461 tagcttttgt ttacaccctg gaaatgaccc cttcctccag gggtcttggg agggggagcc 2521 cagggcctct gtgacccctt cctttctttc cccaagcccc cagtttatat ttagctgcca 2581 aataagattc ctgcaggctc ccccatttct ctggggggtt agggtgctgt cccctcccct 2641 ctgtttatat ctcctacccc gctgttcgca tattgctgag ttttctattt ttgcaaaata 2701 aagtgatgga aactc // LOCUS MUSRARGB 2496 bp ss-mRNA ROD 13-JUL-1990 DEFINITION Mouse retinoic acid receptor gamma (RAR-gamma-B) mRNA, complete cds. ACCESSION M34475 KEYWORDS retinoic acid receptor gamma. SOURCE Mouse embryo, cDNA to mRNA, clone lambda-mE7.1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2496) AUTHORS Giguere,V., Shago,M., Zirngibl,R., Tate,P., Rossant,J. and Varmuza,S. TITLE Identification of a novel isoform of the retinoic acid receptor gamma expressed in the mouse embryo JOURNAL Mol. Cell. Biol. 10, 2335-2340 (1990) STANDARD simple staff_review FEATURES from to/span description pept 67 1410 retinoic acid receptor gamma mRNA 1 2496 RAR-gamma-B mRNA BASE COUNT 533 a 751 c 684 g 528 t ORIGIN 1 aggaggggga atgggctgcg ggtccttggt gttctagcac ccagtttctc tccaagccgg 61 gtcgcgatgt acgactgcat ggaatcgttt gtcccgggtc cgcgacggct gtatggggcg 121 gccgggcccg gggccggctt actacgcaga gccactggca gctcctgctt cgccggactt 181 gagtcttttg cctgggcaca gcctgccagt ctacaatcgg tggagacaca gagcaccagc 241 tcggaggaga tggtacccag ctctccctca cccccaccac ctcctcgggt ctataagcca 301 tgctttgtat gcaatgacaa gtcttctggc taccactatg gggtcagctc ctgtgaaggc 361 tgcaagggct tcttcagacg cagcattcag aaaaacatgg tgtatacatg tcaccgtgac 421 aaaaactgta tcatcaacaa ggtcaccaga aatcgatgcc agtactgcag gctacaaaag 481 tgtttcgaag tgggcatgtc caaggaagct gtaaggaacg atcgaaacaa gaagaaaaag 541 gaggtaaaag aggagggctc gcccgacagc tatgaactga gtccacagtt agaggaactc 601 atcaccaagg tcagcaaagc ccaccaggag acttttccct cactctgcca gctgggcaag 661 tacaccacga actccagtgc agatcaccgg gtgcagctgg acctggggct gtgggacaag 721 ttcagcgagc tggccaccaa atgcatcatc aagattgtgg agtttgcgaa gcggctgcct 781 ggttttacag ggctcagcat tgccgaccag atcacgctgc tcaaggctgc ttgtctggac 841 atcctaatgc tgcggatctg tacaaggtat accccagagc aggacactat gacattctcg 901 gatgggctga ccctgaaccg aacccagatg cacaatgctg gctttgggcc ccttacagac 961 ctcgtctttg cctttgccgg gcagctgctg cccctggaga tggatgacac cgagactggg 1021 ctacttagtg ctatctgcct catctgtgga gaccgaatgg acctggaaga gcccgagaag 1081 gtggacaagc tgcaggagcc cctgctggaa gccctgaggc tctatgcccg gcgacgagac 1141 ccagccaaac cctacatgtt cccaaggatg ctgatgaaaa tcaccgacct ccggggcatc 1201 agcactaagg gagcagaaag ggctataacc ctgaagatgg agattccagg cccgatgcca 1261 cccctgatcc gagagatgct ggagaacccg gagatgtttg aggacgactc ctcgaagcct 1321 ggcccccacc ccaaggcttc cagtgaggac gaagctccag ggggccaggg caaaaggggc 1381 caaagtcccc aacctgacca ggggccctga cctaccccgt tgtggggttg ggccccaggc 1441 agcagactga ccatttccca gataccgcca gtgactgggg gaggacctgc cccgcccact 1501 ctccaccctt ttaatgagct cgttattttg ccaaagtttc taggggtgcc tgtgttcatc 1561 cctgtcctgt tctaactggc tccctctaca gtcccgcggg gatctgctgc acacctacca 1621 gaagagctgg agaaggggcg agcctgggtc tagactctaa aatctcagca ctgcctttcg 1681 gattcaggcc acaggctccc gacgcaagag gaagccccgc cttcccatag ccttttcctc 1741 tgccaggtgc ttgggcctct gggagcaaac aggaacacta gagaccaaaa ggggggcccc 1801 ccccgagggg agggctgagc cacccttgtg ccccaccctg agcgcctgac tctgtgtgat 1861 gaatctgccg cgctcagtgc ccccggtgcc ccatcctaat cctcctgccg ggtgtagggg 1921 caggctgggg cctgcgtttt ctagggtggg gcctgagggg aagggagaat gtgagcccga 1981 gcccgtctgt acctctgctg aagtacagca ctcatcctgg gccctccatt atataagcct 2041 ctcccttact tctgtctcgt gccttggcct cccgccctca tctcagccat caggcagggc 2101 cctcctacac tacagagggg ccaggggtcc ctctcctagt gccttctacc cttgatcccc 2161 ccagagcagc ttggccccgg ggaggggaga tgctgcttag ctgatcccac cttgacccag 2221 aggaagcctc tatttattta ttagcttttg tttacaccct ggaaatgacc ccttcctcca 2281 ggggtcttgg gagggggagc ccagggcctc tgtgacccct tcctttcttt ccccaagccc 2341 ccagtttata tttagctgcc aaataagatt cctgcaggct cccccatttc tctggggggt 2401 tagggtgctg tcccctcccc tctgtttata tctcctaccc cgctgttcgc atattgctga 2461 gttttctatt tttgcaaaat aaagtgatgg aaactc //