Path: utzoo!attcan!uunet!snorkelwacker!bionet!root From: GenBank-Updates@genbank.bio.net Newsgroups: bionet.molbio.genbank.updates Subject: Database Update Message-ID: Date: 10 Aug 90 12:00:24 GMT Sender: root@genbank.BIO.NET Distribution: bionet Lines: 3892 Approved: lear@genbank.bio.net Checksum: 09407 223 LOCUS HUMMPRCI 9081 bp ss-mRNA PRI 10-AUG-1990 DEFINITION Human cation-independent mannose 6-phosphate receptor mRNA, complete cds. ACCESSION J03528 KEYWORDS mannose 6-phosphate receptor. SOURCE Human placenta, cDNA to mRNA (library of J.E.Sadler), clones PP[4,112,188), RB3 and RC5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 9081) AUTHORS Oshima,A., Nolan,C.M., Kyle,J.W., Grubb,J.H. and Sly,W.S. TITLE The human cation-independent mannose 6-phophate receptor: Cloning and sequence of the full-length cDNA and expression of functional receptor in COS cells JOURNAL J. Biol. Chem. 263, 2553-2562 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by R.Miller, 04-JAN-1988. FEATURES from to/span description pept 148 7623 mannose 6-phosphate receptor precursor /hgml_locus_uid="LY0166H" /nomgen="M6PR" /map="12" sigp 148 267 mannose 6-phosphate receptor signal peptide matp 268 7620 mannose 6-phosphate receptor mRNA < 1 9081 MPR-CI mRNA BASE COUNT 2243 a 2233 c 2476 g 2129 t ORIGIN Chromosome 12; 475 bp upstream of EcoRI site. 1 gagcccagtc gagccgcgct cacctcgggc tcccgctccg tctccacctc cgcctttgcc 61 ctggcggcgc gaccccgtcc cgggcgcggc cccagcagtc gcgcgccgtt agcctcgcgc 121 ccgccgcgca gctccgggcc cggcgcgatg ggggccgccg ccggccggag cccccacctg 181 gggcccgcgc ccgcccgccg cccgcagcgc tctctgctcc tgctgcagct gctgctgctc 241 gtcgctgccc cggggtccac gcaggcccag gccgccccgt tccccgagct gtgcagttat 301 acatgggaag ctgttgatac caaaaataat gtactttata aaatcaacat ctgtggaagt 361 gtggatattg tccagtgcgg gccatcaagt gctgtttgta tgcacgactt gaagacacgc 421 acttatcatt cagtgggtga ctctgttttg agaagtgcaa ccagatctct cctggaattc 481 aacacaacag tgagctgtga ccagcaaggc acaaatcaca gagtccagag cagcattgcc 541 ttcctgtgtg ggaaaaccct gggaactcct gaatttgtaa ctgcaacaga atgtgtgcac 601 tactttgagt ggaggaccac tgcagcctgc aagaaagaca tatttaaagc aaataaggag 661 gtgccatgct atgtgtttga tgaagagttg aggaagcatg atctcaatcc tctgatcaag 721 cttagtggtg cctacttggt ggatgactcc gatccggaca cttctctatt catcaatgtt 781 tgtagagaca tagacacact acgagaccca ggttcacagc tgcgggcctg tccccccggc 841 actgccgcct gcctggtaag aggacaccag gcgtttgatg ttggccagcc ccgggacgga 901 ctgaaggtgg tgcgcaagga caggcttgtc ctgagttacg tgagggaaga ggcaggaaag 961 ctagactttt gtgatggtca cagccctgcg gtgactatta catttgtttg cccgtcggag 1021 cggagagagg gcaccattcc caaactcaca gctaaatcca actgccgcta tgaaattgag 1081 tggattactg agtatgcctg ccacagagat tacctggaaa gtaaaacttg ttctctgagc 1141 ggcgagcagc aggatgtctc catagacctc acaccacttg cccagagcgg aggttcatcc 1201 tatatttcag atggaaaaga atatttgttt tatttgaatg tctgtggaga aactgaaata 1261 cagttctgta ataaaaaaca agctgcagtt tgccaagtga aaaagagcga tacctctcaa 1321 gtcaaagcag caggaagata ccacaatcag accctccgat attcggatgg agacctcacc 1381 ttgatatatt ttggaggtga tgaatgcagc tcagggtttc agcggatgag cgtcataaac 1441 tttgagtgca ataaaaccgc aggtaacgat gggaaaggaa ctcctgtatt cacaggggag 1501 gttgactgca cctacttctt cacatgggac acggaatacg cctgtgttaa ggagaaggaa 1561 gacctcctct gcggtgccac cgacgggaag aagcgctatg acctgtccgc gctggtccgc 1621 catgcagaac cagagcagaa ttgggaagct gtggatggca gtcagacgga aacaaagaag 1681 aagcattttt tcattaatat ttgtcacaga gtgctgcagg aaggcaaggc acgagggtgt 1741 cccgaggacg cggcagtgtg tgcagtggat aaaaatggaa gtaaaaatct gggaaaattt 1801 atttcctctc ccatgaaaga gaaaggaaac attcaactct cttattcaga tggtgatgat 1861 tgtggtcatg gcaagaaaat taaaactaat atcacacttg tatgcaagcc aggtgatctg 1921 gaaagtgcac cagtgttgag aacttctggg gaaggcggtt gcttttatga gtttgagtgg 1981 cacacagctg cggcctgtgt gctgtctaag acagaagggg agaactgcac ggtctttgac 2041 tcccaggcag ggttttcttt tgacttatca cctctcacaa agaaaaatgg tgcctataaa 2101 gttgagacaa agaagtatga cttttatata aatgtgtgtg gcccggtgtc tgtgagcccc 2161 tgtcagccag actcaggagc ctgccaggtg gcaaaaagtg atgagaagac ttggaacttg 2221 ggtctgagta atgcgaagct ttcatattat gatgggatga tccaactgaa ctacagaggc 2281 ggcacgccct ataacaatga aagacacaca ccgagagcta cgctcatcac ctttctctgt 2341 gatcgagacg cgggagtggg cttccctgaa tatcaggaag aggataactc cacctacaac 2401 ttccggtggt acaccagcta tgcctgcccg gaggagcccc tggaatgcgt agtgaccgac 2461 ccctccacgc tggagcagta cgacctctcc agtctggcaa aatctgaagg tggccttgga 2521 ggaaactggt atgccatgga caactcaggg gaacatgtca cgtggaggaa atactacatt 2581 aacgtgtgtc ggcctctgaa tccagtgccg ggctgcaacc gatatgcatc ggcttgccag 2641 atgaagtatg aaaaagatca gggctccttc actgaagtgg cttccatcag taacttggga 2701 atggcaaaga ccggcccggt ggttgaggac agcggcagcc tccttctgga atacgtgaat 2761 gggtcggcct gcaccaccag cgatggcaga cagaccacat ataccacgag gatccatctc 2821 gtctgctcca ggggcaggct gaacagccac cccatctttt ctctcaactg ggagtgtgtg 2881 gtcagtttcc tgtggaacac agaggctgcc tgtcccattc agacaacgac ggatacagac 2941 caggcttgct ctataaggga tcccaacagt ggatttgtgt ttaatcttaa tccgctaaac 3001 agttcgcaag gatataacgt ctctggcatt ggtaagattt ttatgtttaa tgtctgcggc 3061 acaatgcctg tctgtgggac catcctggga aaacctgctt ctggctgtga ggcagaaacc 3121 caaactgaag agctcaagaa ttggaagcca gcaaggccag tcggaattga gaaaagcctc 3181 cagctgtcca cagagggctt catcactctg acctacaaag ggcctctctc tgccaaaggt 3241 accgctgatg cttttatcgt ccgctttgtt tgcaatgatg atgtttactc agggcccctc 3301 aaattcctgc atcaagatat cgactctggg caagggatcc gaaacactta ctttgagttt 3361 gaaaccgcgt tggcctgtgt tccttctcca gtggactgcc aagtcaccga cctggctgga 3421 aatgagtacg acctgactgg cctaagcaca gtcaggaaac cttggacggc tgttgacacc 3481 tctgtcgatg ggagaaagag gactttctat ttgagcgttt gcaatcctct cccttacatt 3541 cctggatgcc agggcagcgc agtggggtct tgcttagtgt cagaaggcaa tagctggaat 3601 ctgggtgtgg tgcagatgag tccccaagcc gcggcgaatg gatctttgag catcatgtat 3661 gtcaacggtg acaagtgtgg gaaccagcgc ttctccacca ggatcacgtt tgagtgtgct 3721 cagatatcgg gctcaccagc atttcagctt caggatggtt gtgagtacgt gtttatctgg 3781 agaactgtgg aagcctgtcc cgttgtcaga gtggaagggg acaactgtga ggtgaaagac 3841 ccaaggcatg gcaacttgta tgacctgaag cccctgggcc tcaacgacac catcgtgagc 3901 gctggcgaat acacttatta cttccgggtc tgtgggaagc tttcctcaga cgtctgcccc 3961 acaagtgaca agtccaaggt ggtctcctca tgtcaggaaa agcgggaacc gcagggattt 4021 cacaaagtgg caggtctcct gactcagaag ctaacttatg aaaatggctt gttaaaaatg 4081 aacttcacgg ggggggacac ttgccataag gtttatcagc gctccacagc catcttcttc 4141 tactgtgacc gcggcaccca gcggccagta tttctaaagg agacttcaga ttgttcctac 4201 ttgtttgagt ggcgaacgca gtatgcctgc ccacctttcg atctgactga atgttcattc 4261 aaagatgggg ctggcaactc cttcgacctc tcgtccctgt caaggtacag tgacaactgg 4321 gaagccatca ctgggacggg ggacccggag cactacctca tcaatgtctg caagtctctg 4381 gccccgcagg ctggcactga gccgtgccct ccagaagcag ccgcgtgtct gctgggtggc 4441 tccaagcccg tgaacctcgg cagggtaagg gacggacctc agtggagaga tggcataatt 4501 gtcctgaaat acgttgatgg cgacttatgt ccagatggga ttcggaaaaa gtcaaccacc 4561 atccgattca cctgcagcga gagccaagtg aactccaggc ccatgttcat caacgccgtg 4621 gaggactgtg agtacacctt tgcctggccc acagccacag cctgtcccat gaagagcaac 4681 gagcatgatg actgccaggt caccaaccca agcacaggac acctgtttga tctgagctcc 4741 ttaagtggca gggcgggatt cacagctgct tacagcgaga aggggttggt ttacatgagc 4801 atctgtgggg agaatgaaaa ctgccctcct ggcgtggggg cctgctttgg acagaccagg 4861 attagcgtgg gcaaggccaa caagaggctg agatacgtgg accaggtcct gcagctggtg 4921 tacaaggatg ggtccccttg tccctccaaa tccggcctga gctataagag tgtgatcagt 4981 ttcgtgtgca ggcctgaggc cgggccaacc aataggccca tgctcatctc cctggacaag 5041 cagacatgca ctctcttctt ctcctggcac acgccgctgg cctgcgagca agcgaccgaa 5101 tgttccgtga ggaatggaag ctctattgtt gacttgtctc cccttattca tcgcactggt 5161 ggttatgagg cttatgatga gagtgaggat gatgcctccg ataccaaccc tgatttctac 5221 atcaatattt gtcagccact aaatcccatg cacggagtgc cctgtcctgc cggagccgct 5281 gtgtgcaaag ttcctattga tggtcccccc atagatatcg gccgggtagc aggaccacca 5341 atactcaatc caatagcaaa tgagatttac ttgaattttg aaagcagtac tccttgctta 5401 gcggacaagc atttcaacta cacctcgctc atcgcgtttc actgtaagag aggtgtgagc 5461 atgggaacgc ctaagctgtt aaggaccagc gagtgcgact ttgtgttcga atgggagact 5521 cctgtcgtct gtcctgatga agtgaggatg gatggctgta ccctgacaga tgagcagctc 5581 ctctacagct tcaacttgtc cagcctttcc acgagcacct ttaaggtgac tcgcgactcg 5641 cgcacctaca gcgttggggt gtgcaccttt gcagtcgggc cagaacaagg aggctgtaag 5701 gacggaggag tctgtctgct ctcaggcacc aagggggcat cctttggacg gctgcaatca 5761 atgaaactgg attacaggca ccaggatgaa gcggtcgttt taagttacgt gaatggtgat 5821 cgttgccctc cagaaaccga tgacggcgtc ccctgtgtct tccccttcat attcaatggg 5881 aagagctacg aggagtgcat catagagagc agggcgaagc tgtggtgtag cacaactgcg 5941 gactacgaca gagaccacga gtggggcttc tgcagacact caaacagcta ccggacatcc 6001 agcatcatat ttaagtgtga tgaagatgag gacattggga ggccacaagt cttcagtgaa 6061 gtgcgtgggt gtgatgtgac atttgagtgg aaaacaaaag ttgtctgccc tccaaagaag 6121 ttggagtgca aattcgtcca gaaacacaaa acctacgacc tgcggctgct ctcctctctc 6181 accgggtcct ggtccctggt ccacaacgga gtctcgtact atatgaatct gtgccagaaa 6241 atatataaag ggcccctggg ctgctctgaa agggccagca tttgcagaag gaccacaact 6301 ggagacgtcc aggtcctggg actcgttcac acgcagaagc tgggtgtcat aggtgacaaa 6361 gttgttgtca tgtactccaa aggttatccg tgtggtggaa ataagaccgc atcctccgtg 6421 atagaattga cctgtacaaa gacggtgggc agacctgcat tcaagaggtt tgatatcgac 6481 agctgcactt actacttcag ctgggactcc cgggctgcct gcgccgtgaa gcctcaggag 6541 gtgcagatgg tgaatgggac catcaccaac cctataaatg gcaagagctt cagcctcgga 6601 gatatttatt ttaacctgtt cagagaatct ggggacatga ggaccaatgg ggacaactac 6661 ctgtatgaga tcctactttc ctccatcaca agctccagaa acccggcgtg ctctggagcc 6721 aacatatgcc aggtgaagcc caacgatcag cacttcagtc ggaaagttgg aacctctgac 6781 aagaccaagt actaccttca agacggcgat ctcgatgtcg tgtttgcctc ttcctctaag 6841 tgcggaaagg ataagaccaa gtctgtttct tccaccatct tcttccactg tgaccctctg 6901 gtggaggacg ggatccccga gttcagtcac gagactgccg actgccagta cctcttctct 6961 tggtacacct cagccgtgtg tcctctgggg gtgggctttg acagcgagaa tcccggggac 7021 gacgggcaga tgcacaaggg gctgtcagaa cggagccagg cagtcggcgc ggtgctcagc 7081 ctgctgctgg tggcgctcac ctgctgcctg ctggccctgt tgctctacaa gaagaagagg 7141 agggaaacaa tgataagtaa gctgacctct tgctgtagga gaagttccaa cgtgtcctac 7201 aaatactcaa aggtgaataa ggaagaagag acagatgaga atgaaacaga gtggctgatg 7261 gaagagatcc agctgcctcc tccacggcag ggaaaggaag ggcaggagaa cggccatatt 7321 accaccaagt cagtgaaagc cctcagctcc ctgcatgggg atgaccagga cagtacggat 7381 gaggttctga ccatcccaga ggtgaaagtt cactcgggca ggggagctgg ggcagagagc 7441 tcccacccag tgagaaacgc acagagcaat gcccttcagg agcgtgagga cgatagggtg 7501 gggctggtca ggggtgagaa ggcgaggaaa gggaagtcca gctctgcaca gcagaagaca 7561 gtgagctcca ccaagctggt gtccttccat gacgacagcg acgaggacct cttacacatc 7621 tgactccgca gtgcctgcag gggagcacgg agccgcggga cagccaagca cctccaacca 7681 aataagactt ccactcgatg atgcttctat aattttgcct ttaacagaaa ctttcaaaag 7741 ggaagagttt ttgtgatggg ggagagggtg aaggaggtca ggccccactc cttcctgatt 7801 gtttacagtc attggaataa ggcatggctc agatcggcca cagggcggta ccttgtgccc 7861 agggttttgc cccaagtcct catttaaaag cataaggccg gacgcatctc aaaacagagg 7921 gctgcattcg aagaaaccct tgctgcttta gtcccgatag gtatttgacc ccgatatatt 7981 ttagcatttt aattctctcc ccctatttat tgactttgac aattactcag gtttgagaaa 8041 aaggaaaaaa aaacagccac cgtttcttcc tgccagcagg ggtgtgatgt accagtttgt 8101 ccatcttgag atggtgaggc tgtcagtgta tggggcagct tccggcggga tgttgaactg 8161 gtcattaatg tgtcccctga gttggagctc attctgtctc ttttctcttt tgctttctgt 8221 ttcttaaggg cacacacacg tgcgtgcgag cacacacaca tacgtgcaca gggtccccga 8281 gtgcctaggt tttggagagt ttgcctgttc tatgccttta gtcaggaatg gctgcacctt 8341 tttgcatgat atcttcaagc ctgggcgtac agagcacatt tgtcagtatt tttgccggct 8401 ggtgaattca acctgcccaa agattgattt gtgtgtttgt gtgtgtgtgt gtgtgtgtgt 8461 gtgtgtgtgt gagtggagtt gaggtgtcag agaaaatgaa ttttttccag atttggggta 8521 taggtctcat ctcttcaggt tctcatgata ccacctttac tgtgcttatt tttttaagaa 8581 aaaagtgttg atcaaccatt cgacctataa gaagccttaa tttgcacagt gtgtgactta 8641 cagaaactgc atgaaaaatc atgggccaga gcctcggccc tagcattgca cttggcctca 8701 tgctggaggg aggctgggcg ggtacagcgc ggaggaggag ggaggccagg cgggcatggc 8761 gtggaggagg tgggaggccg ggcggtcaca gcatggagga ggagggaggc gctgctggtg 8821 ttcttattct ggcggcagcg cctttcctgc catgtttagt gaatgacttt tctcgcattg 8881 tagaattgta tatagactct ggtgttctat tgctgagaag caaaccgccc tgcagcatcc 8941 ctcagcctgt accggtttgg ctggcttgtt tgatttcaac atgagtgtat tttttaaaat 9001 tgatttttct cttcattttt ttttcaatca actttactgt aatataaagt attcaacaat 9061 ttcaataaaa gataaattat t // LOCUS HUMMRA 5185 bp ss-mRNA PRI 10-AUG-1990 DEFINITION Human mannose receptor mRNA, complete cds. ACCESSION J05550 KEYWORDS mannose receptor. SOURCE Human placenta, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 5185) AUTHORS Taylor,M.E., Conary,J.T., Lennartz,M.R., Stahl,P.D. and Drickamer,K. TITLE Primary structure of the mannose receptor contains multiple motifs resembling carbohydrate-recognition domains JOURNAL J. Biol. Chem. 265, 12156-12162 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.Drickamer, 18-MAY-1990. FEATURES from to/span description pept 104 4474 mannose receptor precursor sigp 104 157 mannose receptor signal peptide matp 158 4471 mannose receptor BASE COUNT 1602 a 999 c 1198 g 1386 t ORIGIN 1 gggaacttgg attaggtgga gaggcagttg gggggcctcg ttgttttgcg tcttagttcc 61 gccctcctgt ccatcaggag aaggaaagga taaaccctgg gccatgaggc tacccctgct 121 cctggttttt gcctctgtca ttccgggtgc tgttctccta ctggacacca ggcaattttt 181 aatctataat gaagatcaca agcgctgcgt ggatgcagtg agtcccagtg ccgtccaaac 241 cgcagcttgc aaccaggatg ccgaatcaca gaaattccga tgggtgtccg aatctcagat 301 tatgagtgtt gcatttaaat tatgcctggg agtgccatca aaaacagact gggttgctat 361 cactctctat gcctgtgact caaaaagtga atttcagaaa tgggagtgca aaaatgacac 421 acttttgggg atcaaaggag aagatttatt ttttaactac ggcaacagac aagaaaagaa 481 tattatgctc tacaagggat cgggtttatg gagcaggtgg aagatctatg gaaccacaga 541 caatctgtgc tccagaggtt atgaagccat gtatacgcta ctaggcaatg ccaatggagc 601 aacctgtgca ttcccgttca agtttgaaaa caagtggtac gcagattgca cgagtgctgg 661 gcggtcggat ggatggctct ggtgcggaac cactactgac tatgacacag acaagctatt 721 tggatattgt ccattgaaat ttgagggcag tgaaagctta tggaataaag acccgctgac 781 cagcgtttcc taccagataa actccaaatc cgctttaacg tggcaccaag cgaggaaaag 841 ctgccaacaa cagaacgctg agctcctgag catcacagag atacatgagc aaacatacct 901 gacaggatta accagttcct tgacctcagg actctggatt ggacttaaca gtctgagctt 961 caacagcggt tggcagtgga gtgaccgcag tcctttccga tatttgaact ggttaccagg 1021 aagtccatca gctgaacctg gaaaaagctg tgtgtcacta aatcctggaa aaaatgctaa 1081 atgggaaaat ctggaatgtg ttcagaaact gggctatatt tgcaaaaagg gcaacaccac 1141 tttaaattct tttgttattc cctcagaaag tgatgtgcct actcactgtc ctagtcagtg 1201 gtggccgtat gccggtcact gttacaagat tcacagagat gagaaaaaaa tccagaggga 1261 tgctctgacc acctgcagga aggaaggcgg tgacctcaca agtatccaca ccatcgagga 1321 attggacttt attatctccc agctaggata tgagccaaat gacgaattgt ggatcggctt 1381 aaatgacatt aagattcaaa tgtactttga gtggagtgat gggacccctg taacgtttac 1441 caaatggctt cgtggagaac caagccatga aaacaacaga caggaggatt gtgtggtgat 1501 gaaaggcaag gatgggtact gggcagatcg gggctgtgag tggcctcttg gctacatctg 1561 caagatgaaa tcacgaagcc aaggtccaga aatagtggaa gtcgaaaaag gctgcaggaa 1621 aggctggaaa aaacatcact tttactgcta tatgattgga catacgcttt caacatttgc 1681 agaagcaaac caaacctgta ataatgagaa tgcttattta acaactattg aagacagata 1741 tgaacaagcc ttcctgacta gtttcgttgg cttaaggcct gaaaaatatt tctggacagg 1801 actttcagat atacaaacca aagggacttt tcagtggacc atcgaggaag aggttcggtt 1861 cacccactgg aattcagata tgccagggcg aaagccaggg tgtgttgcca tgagaaccgg 1921 gattgcaggg ggcttatggg atgttttgaa atgtgatgaa aaggcaaaat ttgtgtgcaa 1981 gcactgggca gaaggagtaa cccacccacc gaagcccacg acgactcccg aacccaaatg 2041 tccggaggat tggggcgcca gcagtagaac aagcttgtgt ttcaagctgt atgcaaaagg 2101 aaaacatgag aagaaaacgt ggtttgaatc tcgagatttt tgtcgagctc tgggtggaga 2161 cttagctagc atcaataaca aagaggaaca gcaaacaata tggcgattaa taacagctag 2221 tggaagctac cacaaactgt tttggttggg attgacatat ggaagccctt cagaaggttt 2281 tacttggagt gatggttctc ctgtttcata tgaaaactgg gcttatggag aacctaataa 2341 ttatcaaaat gttgaatact gtggtgagct gaaaggtgac cctactatgt cttggaatga 2401 tattaattgt gaacacctta acaactggat ttgccagata caaaaaggac aaacaccaaa 2461 acctgagcca acaccagctc ctcaagacaa tccaccagtt actgaagatg ggtgggttat 2521 ttacaaagac taccagtatt atttcagcaa agagaaggaa accatggaca atgcgcgagc 2581 gttttgcaag aggaattttg gtgatcttgt ttctattcaa agtgaaagtg aaaagaagtt 2641 tctatggaaa tatgtaaaca gaaatgatgc acagtctgca tattttattg gtttattgat 2701 cagcttggat aaaaagtttg cttggatgga tggaagcaaa gtggattacg tgtcttgggc 2761 cacaggtgaa cccaattttg caaatgaaga tgaaaactgt gtgaccatgt attcaaattc 2821 agggttttgg aatgacatta actgtggcta tccaaacgcc ttcatttgcc agcgacataa 2881 cagtagtatc aatgctacca cagttatgcc taccatgccc tcggtcccat cagggtgcaa 2941 ggaaggttgg aatttctaca gcaacaagtg tttcaaaatc tttggattta tggaagaaga 3001 aagaaaaaat tggcaagagg cacgaaaagc ttgtataggc tttggaggga atctggtctc 3061 catacaaaat gaaaaagagc aagcatttct tacctatcac atgaaggact ccactttcag 3121 tgcctggact gggctgaatg atgtcaattc agaacacacg ttcctttgga cggatggacg 3181 aggagtccat tacacaaact gggggaaagg ttaccctggt ggaagaagaa gcagtctttc 3241 ttatgaagat gctgactgtg ttgttattat tggaggtgca tcaaatgaag caggaaaatg 3301 gatggatgat acctgcgaca gtaaacgagg ctacatatgc cagacacgat ccgacccttc 3361 cttgactaat cctccagcaa cgattcaaac agatggcttt gttaaatatg gcaaaagcag 3421 ctattcactc atgagacaaa aatttcaatg gcatgaagcg gagacatact gcaagcttca 3481 caattccctt atagccagca ttctggatcc ctacagtaat gcatttgcgt ggctgcagat 3541 ggaaacatct aatgaacgtg tgtggatcgc cctgaacagt aacttgactg ataatcaata 3601 cacttggact gataagtgga gggtgaggta cactaactgg gctgctgatg agcccaaatt 3661 gaaatcagca tgtgtttatc tggatcttga tggctactgg aagacagcac attgcaatga 3721 aagtttttac tttctctgta aaagatcaga tgaaatccct gctactgaac ccccacaact 3781 gcctggcaga tgcccggagt cagatcacac agcatggatt cctttccatg gtcactgtta 3841 ctatattgag tcctcatata caagaaactg gggccaagct tctctggaat gtcttcgaat 3901 gggttcctct ctggtttcca ttgaaagtgc tgcagaatcc agttttctgt catatcgggt 3961 tgagccactt aaaagtaaaa ccaatttttg gataggattg ttcagaaatg ttgaagggac 4021 gtggctgtgg ataaataaca gtccggtctc ctttgtcaac tggaacacag gagatccctc 4081 tggtgaacgg aatgattgtg tagctttaca tgcgtcttct gggttttgga gtaatattca 4141 ctgttcttcc tacaaaggat atatttgtaa aagaccaaaa attattgatg ctaaacctac 4201 tcatgaatta cttacaacaa aagctgacac aaggaagatg gacccttcta aaccgtcttc 4261 caacgtggcc ggagtagtca tcattgtgat cctcctgatt ttaacgggtg ctggccttgc 4321 cgcctatttc ttttataaga aaagacgtgt gcacctacct caagagggcg cctttgaaaa 4381 cactctgtat tttaacagtc agtcaagccc aggaactagt gatatgaaag atctcgtggg 4441 caatattgaa cagaatgaac actcggtcat ctagtacctc aatgcgattc tgagatattt 4501 gaatttcata aaattgtaac tgaaatttaa aatttttagt tcaatgtgat tgttttcttt 4561 aaaatgagta ctgaattgta ctggtctgtc cttttttcct ttgcctaatt gaagaaataa 4621 ttgcttgttt tctagcctgg caagatattt tcataaaaga gggataacaa tgctgattac 4681 taccttttaa aatattttag ataaatgcac agcaccacag caccacatct aagcattagt 4741 gatgggtagc tgatgtcagc ttcatgtgga ttttaagcac tctagaaaca atgaagcttc 4801 ttggcatatt ttaaggagct cccaaaatgt gttacctatt aaattgtaac tcagcaagta 4861 gaagaccatt tgaaaagtca ggtacaaatt tcctcaagtg gcataaaaat gtagtcagtt 4921 ttctctttta ccagttttta tttccactcc aattatttag aactttattt gtacatgtgc 4981 agaagaataa ggcagctgag aatcttgttt cccccaagag agttttacag gctgagtgtt 5041 gcaaatgtgt tctttgtcct gttatatgta tatcaggaat acaaggatgt gaaataaaac 5101 tgtaaatttg cataactgga tgtacttaga taatgtgaaa taaacattaa agacaaggtc 5161 tatttttaat aaaaaaaaaa aaaaa // LOCUS BFRACP 1788 bp ss-RNA PHG 10-AUG-1990 DEFINITION Bacteriophage fr maturation and coat protein genes, complete cds, and replicase gene, 5' end. ACCESSION M31635 KEYWORDS coat protein; maturation protein; replicase. SOURCE Bacteriophage fr RNA. ORGANISM Bacteriophage fr Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; Leviviridae. REFERENCE 1 (bases 1 to 1788) AUTHORS Berzin,V., Avots,A., Jansone,I., Gintnere,L. and Tsimanis,A. TITLE Sequence of the genes coding for the A-protein and coat protein of bacteriophage fr JOURNAL Nucleic Acids Res. 15, 6741-6741 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 121 1302 maturation protein (gtg start codon) pept 1328 1720 coat protein pept 1754 > 1788 replicase protein BASE COUNT 443 a 454 c 473 g 418 t ORIGIN 1 cccctttcgg ggtcctgctc gacttcctgt caagctaaat gccattttta atgtctttag 61 cgagacgcta ccatggctat cgctgtaggt agccgcaatt ccattgctag ggagcctcgt 121 gtgcgaaagt tcataccaac cgagagaatg agtaagagcc acgtggtctc cgtccgcgaa 181 tatgctgacg gggaactcga ggataactct ttaccgttga tttaccgatc caattggtcg 241 ccgggccagt acacaagtac gggtccgcgc acaaaggaat ggcactaccc atcgtcctac 301 tcacgtgggg cgataggaat caaggctctt gatcaaggta agtacgcacg attgggcaca 361 tcgtggggcc gcgagttcga agaacgtgcc ggttatggca tgtctatcga cgcacgtagt 421 tgttatagcc tattccccgt tagtcaaaac ttaacgtgga tagacgtacc aacgaacgta 481 gctaatcgcg ccacgactga ggtcttaggt aaggtcactc aaggtaattt taaccttggc 541 gtggcccttg ctgaagcccg gtctacggcc tcacaactgt cgacgcaaac catcgcgttg 601 attaaggcgt acaccgcggc tcgccgcggg aactggcgcc aggcactccg ctacttagcg 661 ctgaacgaga atcgaaaatt caattcgaag tccgtcgcaa gcaggtggct ggagttgcag 721 ttcgggtgga tgccgcttct cagcgatatc caaggtgcgt atgagatgct caccaaagtg 781 catcttaagg catttatgcc tatgcgtgcc gtgagtcaag tcggacaaaa cgtcagtttg 841 tctggccggc tcacttctcc ggctgcaagc tataagtcta cgtgcaacat atcacgacgc 901 attgtgatat ggttttacat aaacgatgca cgtctggctt ggctgtcctc cttagggatt 961 ttgaacccgc taggaatagt gtgggaaaag gtcccgttct ctttcctggt cgattggttg 1021 ctgccggttg ggaacatgct tgaggggctt accgccccga taggctgttc gtatcaatcg 1081 ggaacagtaa ccgacgtaat atcaggagag tcgacaataa tcgccgatga catctatggt 1141 tgggatacag tgcgacctgc aaccgctaag gtgcaaatca gtgctgtcca ccggggggta 1201 caaagcgtgt ggcccacaac gggcgtatac gttaagtcac ctttctcgat ggtccatacc 1261 ttagatgcct tggcactttt caggcaacgc ctctggaaat agaggcccta accgaaggga 1321 gagccacatg gcttcgaact ttgaagagtt cgttctcgtc gacaatggcg gaacgggaga 1381 tgtaaaagtc gctccgagca acttcgctaa cggggttgca gaatggatca gctcgaactc 1441 acgttctcag gcttacaaag tgacctgtag cgtgcgtcag agctctgcga acaatcggaa 1501 atacaccgtc aaggtcgagg tcccgaaagt ggcaactcag gtccaaggcg gcgttgagct 1561 tcctgttgcg gcgtggcgct cgtacatgaa tatggaatta actattccgg tattcgcgac 1621 gaacgacgac tgtgccttaa tcgttaaggc attgcaaggc acctttaaaa ctggtaaccc 1681 aattgcaaca gccatcgcag ccaactcggg aatctactaa gaaacccgtg ccattccaac 1741 atgaggaata cccatgtcaa aatcaacaaa gaagttcaac tctttatg // LOCUS HUMERCC1A 1098 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human excision repair protein (ERCC-1) mRNA, complete cds. cDNA clone pcDE. ACCESSION M28650 KEYWORDS alternative splicing; excision repair protein. SOURCE Human, cDNA to mRNA, clone pcDE. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1098) AUTHORS Hoeijmakers,J.H.J. TITLE Characterization of genes and proteins involved in excision repair of human cells JOURNAL J. Cell Sci. Suppl. 6, 111-125 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 143 1036 excision repair protein site 863 916 alternatively spliced out exon BASE COUNT 243 a 352 c 312 g 191 t ORIGIN 1 aagtgctgcg agccctgggc cacgctggcc gtgctggcag tgggccgcct cgatccctct 61 gcagtctttc ccttgaggct ccaagaccag caggtgaggc ctcgcggcgc tgaaaccgtg 121 aggcccggac cacaggctcc agatggaccc tgggaaggac aaagaggggg tgccccagcc 181 ctcagggccg ccagcaagga agaaatttgt gatacccctc gacgaggatg aggtccctcc 241 tggagtggcc aagcccttat tccgatctac acagagcctt cccactgtgg acacctcggc 301 ccaggcggcc cctcagacct acgccgaata tgccatctca cagcctctgg aaggggctgg 361 ggccacgtgc cccacagggt cagagcccct ggcaggagag acgcccaacc aggccctgaa 421 acccggggca aaatccaaca gcatcattgt gagccctcgg cagaggggca atcccgtact 481 gaagttcgtg cgcaacgtgc cctgggaatt tggcgacgta attcccgact atgtgctggg 541 ccagagcacc tgtgccctgt tcctcagcct ccgctaccac aacctgcacc cagactacat 601 ccatgggcgg ctgcagagcc tggggaagaa cttcgccttg cgggtcctgc ttgtccaggt 661 ggatgtgaaa gatccccagc aggccctcaa ggagctggct aagatgtgta tcctggccga 721 ctgcacattg atcctcgcct ggagccccga ggaagctggg cggtacctgg agacctacaa 781 ggcctatgag cagaaaccag cggacctcct gatggagaag ctagagcagg acttcgtctc 841 ccgggtgact gaatgtctga ccaccgtgaa gtcagtcaac aaaacggaca gtcagaccct 901 cctgaccaca tttggatctc tggaacagct catcgccgca tcaagagaag atctggcctt 961 atgcccaggc ctgggccctc agaaagcccg gaggctgttt gatgtcctgc acgagccctt 1021 cttgaaagta ccctgatgac cccagctgcc aaggaaaccc ccagtgtaat aataaatcgt 1081 cctcccaggc caggctca // LOCUS MZECPPETE 3916 bp ds-DNA ORG 10-AUG-1990 DEFINITION Maize chloroplast cytochrome b559 alpha- and beta-subunit (psbE and F), cytochrome b6-f complex (petE), and photosystem II polypeptide, complete cds, Pro-tRNA and Trp-tRNA. ACCESSION J04502 KEYWORDS cytochrome; photosystem II protein; transfer RNA-Pro; transfer RNA-Trp. SOURCE Maize (strain FR9 cms + FR37) DNA, clone pZmc503. ORGANISM Chloroplast Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae; Zea mexicana. REFERENCE 1 (bases 2565 to 2725) AUTHORS Haley,J. and Bogorad,L. TITLE A 4-kDa maize chloroplast polypeptide associated with the cytochrome b-6-f complex: Subunit 5, encoded by the chloroplast petE gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 1534-1538 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 3916) AUTHORS Haley,J. and Bogorad,L. TITLE Expression of the maize chloroplast petE gene JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer readable sequence for [1] kindly submitted by L.Bogorad, 07-MAY-1989. Author address: J.Haley Department of Cellular and Developmental Biology Harvard University Biological Laboratories 16 Divinity Ave Cambridge, MA 02138 Ph: 617-495-4260 FEATURES from to/span description pept 454 332 (c) ORF40 pept 696 580 (c) photosystem II protein (psbL) pept 838 719 (c) cytochrome b559 beta subunit (psbF) pept 1100 849 (c) cytochrome b559 alpha subunit (psbE) pept 2337 2432 ORF31 pept 2596 2709 cytochrome b6-f complex subunit 5 (petE) pept 3499 3627 ORF42 tRNA 2903 2830 (c) Trp-tRNA tRNA 3115 3042 (c) Pro-tRNA BASE COUNT 1242 a 684 c 658 g 1332 t ORIGIN 1 bp upstream of BamHI site. 1 ggatcctctc ccctctttct tgttgcttca taagagtgaa tccattttat gggcgaaagg 61 cgggggcttt aaatcgaccg atggtttgct tcactcacta acatcattaa caaacaaaag 121 aataaatgga gggattccaa ccatcagagc aaaggttttc tctttgttat ttttacaaat 181 agaaataggt aaccaatttc tagattatgg aacaaaaccg cgttataaca ataagaattc 241 cgcgggccct ttccgctcta atcagataaa ggggagtaag gacccgctaa gttcctactt 301 tttcatgttt acaatctgga tcctccgatt actatagaga tgaacccaat ccagaatacg 361 aaccgtaaaa gaaaaaacct attaaaccaa tcacaagaat accagttaca gtacctatca 421 gccaaagagg aattcttcca gtagtatcgg ccatttcccc tactttcctc cacattttct 481 caagtggtca tgctagagac aaaaacagtc atggatagtt ataaagatgg tatccttcca 541 aatgggataa gagaattctt actactctct ttctttctct caattgaaga agtaattgga 601 aaataaaaca gcaagtacaa aaatgagtaa taaaccccag tatagactgg tacgattcaa 661 ttcaacattt tgttcattcg ggtttgattg tgtcatagtt ctatagttgg aatttggttt 721 atcgttggat gaactgcatt gctgatattg atcccaagaa aaaaaccgtg ggtacagcta 781 atccgtgaac agccagccat cgcactgtaa aaataggata ggttcgatct atggtcattg 841 ggggcctcct aaaaggatct actaaattca tctagttgtt ctaaagaatc aaaacggtcg 901 gttattaatg gaattccttg tcggctttcc gtgaaatact cgtttggtcg aggacttcca 961 aacacgtcat aagctaaacc cgtactgaca aataaccaac ccgcaatgaa tagggaaggt 1021 atagtaatgc tatgaataac ccagtatcga atactggtaa taatatcagc aaaagaacgt 1081 tctcccgtgc ttccagacat gctgagctcc caaaattttt attcaaaaaa ggaattgatt 1141 ccgtaaaaga tgggatccac cagtaaatag aaaattactg atatttcatc cttgtgagat 1201 tgtcaatttt gtaccaaagg tgtattttga gtataccgaa ttagtatagc tatccttcct 1261 atggcacagc aatccagttt tgcttggtcc cgaaacagaa ttcctttttt ctcttttttg 1321 ttccttgtct ataggaaaat tacatgttat tcaaggcatc aatagaaccc cacaattttt 1381 tgggtcctac ttattttcat tgtcttcgga atagtagaat aatttaattt ggaatagcgg 1441 ccaagatctt gggaaaatct aagttaatga ttaataagat aaagaattta ggaaggatat 1501 tctcatattg acgcaataca aagataagta tatgcaaaat cgatcccttt ttagttaaag 1561 gaatttaatt ggttagcata aaataatatc taataaatag aaaatcgaat agcggataat 1621 ccgttatgaa agaaacggaa tacattcttt gaagaatcaa gattcgtaat caatccttgt 1681 cttgtttgtt ggattaggtc taattttctt gaccaaacag caagcatgga actttacaag 1741 aagaacttaa aagaaaagga taattgaagt aactttgaat ccacttttgt tggggttcaa 1801 aaaacgaata aaaataaata aaaaaaagta aatttaagga atagttccct ttttgagggg 1861 gccctcgggg gtcgtggaat gcttttcttc tcctcttatt ccatatggaa tacaatgagt 1921 taaaatttgc tccaaaaaga ggatgaaatc catcctatta aaaaagaaat aatccgaatc 1981 cgaaatagaa agaatttttt ttttttcaaa ttcaattctt tatttatctt ttattccaaa 2041 attctcccga aaatccaatt tcatttttca atggggttag atgatctagt tcttaatatt 2101 attagtttac ttaactgaca gattccacaa caaatctctt gattcggaat tagggactca 2161 tgttccgtct gatgaatcga ttttctcttt tttttacact tctgtatctc actctatctt 2221 gttttttagt attatctaaa ataaccgatg aattatgaat tttccataac ttaaacttag 2281 gtaagtgctt taccaacata tgtagtgtag taaaaaaaat ggaattgaac cccttcatgc 2341 ttactataac tagttatttc ggttttctac tggctgcttt aactataacc ccagctctat 2401 ttattagctt gaacaagata cgtcttattt gaaatgaatt gaatagaaga atctttcttt 2461 tggattcttg gtattctaga ctcttttcca cactaattac caattctttt cttggtcatt 2521 gagattcgtg ggtagtttag actattattt agagatagat cgtacctctt tttttatccc 2581 ctcgaacaaa tcgaaatgat tgaagttttt ctatttggaa tcgtcttagg cctaattcct 2641 attactttag cgggattatt cgtgactgcg tatttgcaat acaggcgtgg ggatcagttg 2701 gatctttgat tgagtaatat ttcttttttg attgacctcc tctctggtct ggaggaggtc 2761 aaattggagt tgcaattcta ctttgttttt ttttaagtta ttttagtttc gacataagat 2821 atatggaatc acgctctgta ggatttgaac ctacgacatc gggttttgga gacccgcgtt 2881 ctaccaaact gaactaagag cgctttcaaa acaaaaaaaa aaaagaaaat ccttttctat 2941 tcctaacgtg tctcacgtcc gtatagtatc cacaaattca agttataccc actttaatcg 3001 atctccccac tactgcccat aacgaagaga gaattaatag gtagggatga caggatttga 3061 acctgtgaca ttttgtaccc aaaacaaacg cgctaccaag ctgcgctaca tcccttttcc 3121 aaattgttgt acaatgccat tgtacacaat tcctttcttg ttttccacat cgtaattttc 3181 ttctatttct ttatccatat agaactttct tgtcatttct tgtttttggt ctcatataag 3241 tcaaggaagg ggtataaact aaaatccagt caaatttcac ctataaaaga aagactacta 3301 ttccttagta atgtatagga agaaggggtc atctttttta gggataggaa aatctcgtct 3361 atacggttca ttctatatat atataatatt tattttgttt ttttagttag gaatttcgcc 3421 taaacaaaag aaatacaaag gatcttgggc aagagtatct gatcatatat gtattccaat 3481 acggaaggag gattttcaat gcgggatata aaaacatatc tctctgtagc acccgtgcta 3541 agtactctat ggtttggggc tttagcaggt ttattgatag aaatcaatcg tttattccca 3601 gatgctttgt cattcccttt tttttaatta tagttattcc tatgcgagag atagaattct 3661 tcgtgacatg acgaaaattc cccctttttg aattcttttt tagtatatga agcaaaaaga 3721 aagaaaagat ggataaggat tgtattcttt aattatttct ctatgtttta ttacttaatt 3781 tacgaatttc aaaaattttg tattctattg gattggattt gttcgagaat tcgaagaatt 3841 acaacaaaat ctttagaaat cacattttta gttaggaact tctatggatt ttattcttct 3901 tcttttcttt ggatcc // LOCUS RATLOX 2672 bp ss-mRNA ROD 10-AUG-1990 DEFINITION Rat aorta lysyl oxidase mRNA, complete cds. ACCESSION J02903 KEYWORDS lysyl oxidase. SOURCE Rat neonatal aorta, cDNA to mRNA, (library of Clontech), clones 7, 13, 8-1 and IIB. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2672) AUTHORS Trackman,P.C., Pratt,A.M., Wolanski,A., Tang,S.-S., Offner,G.D., Troxler,R.F. and Kagan,H.M. TITLE Cloning of rat aorta lysyl oxidase cDNA: Complete codons and predicted amino acid sequence JOURNAL Biochemistry 29, 4863-4870 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by P.C.Trackman, 07-FEB-1990. FEATURES from to/span description pept 287 1516 lysyl oxidase precursor sigp 287 349 pot. lysyl oxidase signal peptide matp 350 1513 lysyl oxidase site 2252 2274 region of dyad symmetry BASE COUNT 650 a 763 c 622 g 637 t ORIGIN 192 bp upstream of TaqI site. 1 cttctacttc agacactgtg cgctctcccg gaccgtcgtg cgccgctccc cgtcgccttc 61 caggactggg aaaggggaga ggaggacggt gccacgtccg acggcctcct gggctggggg 121 cagggtctgc cgctcgccct tggcaccagt ccctgcgacc catccccgag cctcgccctc 181 ctcctccctg ctcgaagagg tctccctcct tcgcgggatc tgagtccctg tcttcatttt 241 tctcctagcc acgtccctcc ccgagaaggg acgagccggg agcatcatgc gtttcgcctg 301 gaccgtgctc tttctgggac agctgcagtt ctgtcccctt ctccgctgcg ccccgcaggc 361 cccgcgcgag cctcccgccg cccccggtgc ctggcgccag acaatccaat gggagaacaa 421 cgggcaggtg ttcagtctgt tgagcctggg ggcgcagtac cagcctcagc gacgccgcga 481 ctccagcgcc actgccccga gagccgacgc aacgctgcag cacagccacg cacgcccatt 541 ctgctgctgc gtgacaaccg cactgcctct gcccgtgcga ggactccaag cccatctggg 601 gtcgccgcgg gtcgtccccg gcccgcagcc cgccactggt tccaagttgg tttctcgccg 661 tcgggggccg gcgatggagc ctcaaggcgc gcagaaccgg actgcgtcgc cacagcctcc 721 gcagctcagt aatctgaggc cacccagcca cgtagatcgc atggtggcga cgacccctac 781 aatccctaca agtactccga cgacaacccc tattataact actatgacac ttatgagaga 841 ccggtccggg agcaggcacc gacctggata tggcaccggt tacttccagt acggtctccc 901 ggacctggta cccgatccct actacatcca ggcatccacg tacgtacaaa agatgtctat 961 gtacaacctg agatgcgctg cggaagaaaa ctgcctggcc agttcagcat atagggcgga 1021 tgtcagagac tatgaccaca gggtactgct acgatttcct cagagagtga aaaaccaagg 1081 gacgtctgac ttcttaccaa gccgcccccg ctactcctgg gagtggcaca gctgccacca 1141 acattaccac agcatggatg aattcagcca ctacgacctg ctggatgcca gcacacagag 1201 gagagtggcc gagggccaca aagcaagctt ctgtctggag gacacttcct gtgattatgg 1261 gtaccacaga cgatttgcct gtactgcaca cacacagggg ttgagtcccg gatgttatga 1321 tacttatgca gcagacatag actgccagtg gattgatatt acagatgtac aacccggaaa 1381 ttacattcta aaggtcagtg taaaccccag ctacctggtg cctgaatcag actacagtaa 1441 caatgtcgta cgctgtgaaa ttcgctacac aggacatcac gcctatgcct caggctgcac 1501 catttcaccg tattagaaag aagctcacct tcccaaagga tgaagcagta cctggtgttt 1561 ggacctatga aaaccgtaga ttagcttaag taggaagact tagatatttt aaaaggcaaa 1621 cggaaaaaca acaaagaagg ttttgtttgg actctttcac aacaaatcac ataactggat 1681 tttgagtgtt taaatcagca ttagattggc acattttaaa tacttattca tgttgcttta 1741 tgaagtaatg gtgtttcaat tctgtgggtg catagtgggc tctttcaaag aattctgaat 1801 ttcttacctt cttttgaaat tatagtgcaa aaagaagagg atattttaat gaatgagcca 1861 caatttgaac tgattacttt ctaaattgcc agacccatga gacaatgatg atgggtttgt 1921 atttgcctca acatagattc gctttttaaa aagggtgttc ctattgtata ggcaaaaatg 1981 gatacacttg gtgctgagga agggtcaaat actaactatt gttgtcacga aatataggtc 2041 tacagcagag agatggtgag tatatattca gatagttaca tccctatata aactatgttt 2101 acattttaga tgcttttctt tctgttaatt gcttaatctc actctgactt gaggtacaac 2161 ttctgttttg gaatgaatta gataattcca gattctggtt tgataattgt tgacattccc 2221 ccatgctact ttttctgagg gcagaaacgt ctaatgtgac gactcttcac attaccatta 2281 cgaggataca cagcacagcg aaatcattcc gatgacaggt gtgatagatg gagagctaac 2341 atgcaactgc cgagtgtttc actgttagcc agaactaagt cacttgcccc acacagcaat 2401 tacaccatga atctctaaca tcacaacctt ctttcaaata cccacggact catccatcct 2461 tccatccgtc atccatccat ccgtccgtcc gtccgtcctg actgcctagt gccactgtct 2521 ggctaggcac acccactatc aacctggttc acctgtcatg gcagcctgta cccacccccg 2581 ccacacaccc cgacgctggc ctatagtgca aaggttgtgc gggctggtcc ttcccacaat 2641 gcagtactgt aatccccgtc cctcctggag cc // LOCUS RHMDCTABA 764 bp ds-DNA BCT 10-AUG-1990 DEFINITION R.meliloti dctA and dctB genes, 5' end. ACCESSION M33555 KEYWORDS regulatory protein. SOURCE R.meliloti DNA. ORGANISM Rhizobium meliloti Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae. REFERENCE 1 (bases 1 to 764) AUTHORS Wang,Y.-P., Birkenhead,K., Boesten,B., Manian,S. and O'Gara,F. TITLE Genetic analysis and regulation of the Rhizobium meliloti genes controlling C-4-dicarboxylic acid transport JOURNAL Gene 85, 135-144 (1989) STANDARD simple staff_review FEATURES from to/span description pept 726 > 764 dctA protein BASE COUNT 149 a 239 c 252 g 124 t ORIGIN 1 cccgggccgc ttgctgaccg tgcccatggc gaaatgttcg gccatgccgt cggcggacgg 61 cgagccggaa ataatcgcgg aaggcatagt cgttgccgac gaagctcgtc ggctcctgcc 121 agttgctggc cgcgacggcg acgccgctgc ggtcgatcag ataaatgacc gcggcttcgg 181 cgcttgtcgc cagggcctcg agcttacggt tgatgcggtc gagcgacggc ctgtccggcg 241 aaagcaatgc gccacgaatg gctgcgtcgt cggcgaggac aagcggcagg gcgcgctgcc 301 gttccacgac tgctcgaaga agcgaggctt tcaggctggc gtcgatccgg ctctgaccgg 361 caaggccggc gagcgcctgc gaccggccgt agtctcgcgc aagaagaagg ccggctgcca 421 gaaggaccag cgcgactgcc gcgaaaacga gccaggaccg tcgggcccgg ctgcgaacgc 481 atgcgatcgc tcgcttctgc aggaagtttg accatgcgaa cattgtgcat gttttcgccc 541 aggacgccag cacttctgtg cggaaatccg cacatatcca cgaacggcaa gcgagcaccg 601 ctcccaaaaa tgtcatggat tcaatcgcat aggccttccg cggcaaactg gcacgcatgt 661 tgctgacaag ctccacaagg cagccactgc tgtcgatctt cggaaagcgg cccgggaggc 721 ccggcatgtt gccggactgg gcctgccacg tggaggatat catg // LOCUS BSUCOMGA 6278 bp ds-DNA BCT 10-AUG-1990 DEFINITION B.subtilis comG-[1,2,3,4,5,6,and 7] proteins in comG operon, complete cds. ACCESSION M29691 M22854 KEYWORDS exogenous DNA-binding. SOURCE B.subtilis (strain 168) DNA, clone pED4. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 6278) AUTHORS Albano,M., Breitling,R. and Dubnau,D.A. TITLE Nucleotide sequence and genetic organization of the Bacillus subtilis comG operon JOURNAL J. Bacteriol. 171, 5386-5404 (1989) STANDARD full staff_entry COMMENT Draft entry and computer readable sequence for [1] kindly provided by D.A.Dubnau, 02-MAR-1989. ORF1 is required for the ability of competent cultures to resolve into two populations with different cell densities on Renografin gradients, as well as for full expression of comE, another late competent locus. ORF1 shows significant similarity to the virB ORF11 protein from Agrobacterium tumefaciens, which is probably involved in T-DNA transfer. The N-terminal sequences of comG ORF3 and, to a lesser extent, the comG ORF4 and 5 proteins are similar to a class of pilin proteins from members of the genera Bacteroides, Pseudomonas, Neisseria and Moraxella. All of the comG proteins, except ORF1, possess hydrophobic domains that are potentially capable of spanning the bacterial membrane and may be part of the DNA transport machinery. FEATURES from to/span description pept 1298 2368 comG-1 protein (ttg start codon) pept 2420 3391 comG-2 protein pept 3405 3701 comG-3 protein pept 3691 4122 comG-4 protein (ttg start codon) pept 4106 4453 comG-5 protein pept 4863 5237 comG-7 protein ORF 4365 4862 comG-6 protein (put.) ORF < 1 886 ORF1 (AA at 2) ORF 6126 6276 ORF2 mRNA 1278 > 6278 comG mRNA signal 1100 1124 pot. transcription termination signal signal 5501 5533 transcription termination signal signal 5696 5715 transcription termination signal signal 1243 1248 comG -35 region signal 1266 1271 comG -10 region BASE COUNT 1960 a 1122 c 1383 g 1813 t ORIGIN 1 bp upstream of BglII site. 1 agatctaatc catttttctc actggcctca gtgtgaaaag tggtttgaaa ataaccatca 61 cgttaatttt ttgcgagtag atacaactga aacggaaaat gaagcagtat ttgggtcgat 121 tgtttatgat caggggcttg gtgaagaaaa agaccatact gtttttcact tttatatcac 181 cagacaatat ttttttacaa tcaactttga cttttcaatt ttgagagaga ttaaaggcaa 241 agaagttgtt cggcaaatgg aaagagcgga caatgcgata gaggggtttt taattcttct 301 cggcgaacta atgaatgcgt atttaatcgg tgttgatgaa tttgaagtca agctgagaaa 361 gctcagatgg caaattaaag acgacaatag caaaagcatt ttaaaccgcg tccatctcct 421 gcgccatgaa ctgatgattt ggaaaaattt gatattaagc gctaaaaaaa ttgaaatggc 481 gttgaaagaa acctttttac ctcaaaatga agggaaaaag gattatcagc ggacacaact 541 gaagattgac aggggattta catacatcag cgaatttgaa ggggagctta acaatctgct 601 gcattcagag gaagtcatta cctcacatag ggggaatgaa attgtaaaag cgctgaccat 661 tttcacgacg ctttttactc cgattacagc tctgggtgcc ttatggggga tgaacttttc 721 agtgatgccg gaactgaatt ggaaatacgg atatctcttt tccctcttat tgattgtcac 781 atctacagtt ctgatctatc tctatttgag aaaaaaaggc tggacgggag atatgctgca 841 ggagcggaag aagaaaaaga aacctcgaaa aaggcggact ctataggatg tttcatattt 901 tgtgcagcgt gccccgcttt ttcaccagac atatcagggt gaccggatac gatgtcaagg 961 ggcttatgac agagcattaa atccgcagtt tatcgattct tgaaaatgac caaatgaccg 1021 gtattgttgc attaggcgat ctttccgttg agaaagatac tggtcaataa gcgaaaacag 1081 cataatgaaa atggaatcta gcaggcatgg tgaccatgtc tgctttttta tttataggga 1141 aaattataat gacaggggta cattcagttg aaagtctttt ttcttgccag aaagaattgg 1201 tttttcagca tataacatct cacaaaatca cgttttccct gtttgattac cttttcttct 1261 ttttctacaa tatgcgttga aaggagaggg aatcaaattg gattcaatag aaaaggtaag 1321 caaaaacttg attgaagagg catatctaac aaaggcttct gatattcaca ttgtgccgag 1381 ggagcgggac gctatcattc attttcgggt cgatcatgcc ttgctgaaaa aaagggacat 1441 gaaaaaagaa gagtgcgtaa gactgatttc acattttaaa tttctttcag caatggatat 1501 aggtgaaagg cgaaagccgc aaaacggttc gcttacgtta aagttgaaag agggaaatgt 1561 tcatttaaga atgtcaacgc tgcccacaat taatgaagaa agcctcgtga tcagagtgat 1621 gccccaatac aatatccctt cgattgataa attgtcgcta tttccgaaga caggagccac 1681 attactctcg tttttaaaac attcccatgg catgctcatt tttaccgggc cgactggttc 1741 agggaagact accacattat actctctcgt tcaatatgca aaaaaacact ttaatcgaaa 1801 tattgtcaca ttagaggacc ctgttgaaac aagggacgaa gatgttcttc aggttcaggt 1861 gaatgaaaaa gccggtgtaa cttattccgc aggtctgaaa gcaattttgc gccatgaccc 1921 cgatatgatt attttaggtg agatcagaga cgcggaaaca gctgaaattg cggtgcgggc 1981 agcgatgacg ggacatctgg tactaacgag ccttcatacg agagacgcaa agggcgcaat 2041 ttacagactg cttgaattcg gtatcaatat gaatgaaatc gaacagactg tcattgcaat 2101 agcggctcag cgcttggttg atttggcttg cccgttttgt gaaaacggat gttcatcagt 2161 gtattgccga cagtcacgaa atactaggag agctagcgtt tatgagcttc tatacgggaa 2221 aaatcttcag caatgtatcc aggaggcaaa aggaaatcat gcaaattacc aatatcaaac 2281 gcttcgtcaa attatcagaa aaggaattgc gctcggctat ttaacgacaa acaactatga 2341 ccggtgggtt tatcatgaaa aagattagaa agtctggttg ttaaaggatc aagccaggtt 2401 attaaagagg ctcggtgaaa tgactgcggg cggatataca cttctggatg gattacgcct 2461 gatggaactt cagatgaata agaggcaggc ggctgacttg actgattcgg tcacttgttt 2521 gagggaaggg gctccgtttt atcaagtact aaagagtttg tcatttcata aggaagccgt 2581 aggtatttgt tattttgctg aaacacatgg tgaactgcct gcttcaatga tccagagcgg 2641 agagctgctg gaacgaaaaa ttgcacaggc agaccagctg aaaagagtgc tgcgctatcc 2701 gcttttcctc atctttacgg tcgctgtcat gttttatatg ttacagtcca tcatcattcc 2761 tcagttttcc ggtatctatc aatcgatgaa tatggaaacc tcacgttcaa ccgatatgct 2821 ttttgctttt tttcagcata ttgatcttgt gatcattttg cttgttcttt ttacagcagg 2881 tatcgggatt tattattggc ttgtgtttaa gaaaaaatca cctgcccggc aaatgctgat 2941 ttgtatcagg attcctttgg ttggaaagct tgtaaagctg tttaacagct actttttttc 3001 tttgcagcta agcagccttt taaaatcagg cctctcaatt tatgacagcc ttaatgcatt 3061 taaacatcaa acgtttctcc ctttctaccg ctgcgaggct gaacaattga ttgaacggct 3121 aaaagccggt gagtcaattg aatccgctat ttgtggaagc cttttttatg aaactgattt 3181 atcaaaagtc atatctcacg gccagctgag cggccgattg gatcgggagc ttttcacata 3241 cagccaattc atattacagc ggctggaaca caaagcgcaa aaatggacag gcatccttca 3301 gccaatgatt tatggatttg ttgcagcgat gatcttactt gtgtatttat ctatgcttgt 3361 gcctatgtat cagatgatga atcaaatgtg aaaggaagag gctgatgaat gagaaaggat 3421 ttacacttgt tgaaatgtta atcgtgctct ttattatttc gattttgctt ttaattacga 3481 taccgaacgt cacgaaacat aatcaaacca ttcaaaaaaa gggctgtgaa ggcttacaaa 3541 acatggttaa ggcacaaatg actgcatttg agcttgatca tgaaggacaa actccgagcc 3601 ttgccgattt acagtcagag ggctatgtga aaaaggatgc tgtctgtcca aatggtaagc 3661 gcattatcat caccggcgga gaagttaagg ttgaacatta aattaaacga ggagaagggg 3721 tttacccttt tagaaagttt gcttgtgtta agccttgcct ctatcctcct ggtggccgtc 3781 ttcactacac ttcctcctgc ttatgacaat acagctgtcc gacaggcagc aagtcagctg 3841 aaaaatgata ttatgctcac acagcagact gctatttccc gtcaacaaag aacaaaaatt 3901 ctctttcata aaaaagaata tcaattagtc attggtgata cggttattga acgtccgtat 3961 gcaacgggac tttctataga actgctgaca ttaaaagacc gtttggaatt taatgagaaa 4021 gggcacccga atgcaggcgg aaaaatacga gtaaaaggcc atgccgttta tgacataaca 4081 gtttatctag ggagcgggag agtcaatgtg gagagaaaat aaaggttttt ctacaataga 4141 aacaatgtct gcgctaagcc tgtggctgtt tgtgctgctg acagtcgtcc ccttgtggga 4201 caagctgatg gctgatgaaa aaatggcgga atcacgagaa attggctatc agatgatgaa 4261 tgagagcatt agcaaatatg tcatgagtgg tgaaggagcc gcgtcaaaaa cgattacaaa 4321 gaacaatcat atctatgcaa tgaagtggga ggaggagggc gaatatcaaa acgtatgtat 4381 caaagccgca gcttataaag aaaaatcatt ttgcctcagc attttgcaga cagaatggct 4441 acacgcttct taacgtatta ttttcgctct cagtcttttt gctcatatca ggatcgttag 4501 ctgcgattat ccatctgttt ttgtctcgac agcaggaaca tgacggtttc acacagcagg 4561 aatggatgat ttcgatagaa cagatgatga atgaatgcaa ggaatcacag gcagttaaga 4621 cagccgagca tgggagcgtg ttaatctgca ccaatctttc cggacaagac atccgttttg 4681 acatttatca ttcaatgata agaaaaagag tggatggcaa agggcatgtt ccgattttag 4741 atcatattac tgccatgaaa gctgatattg aaaatggtgt tgttttgctg aaaattgaga 4801 gtgaagacca aaaagtgtat caaactgctt ttccagtcta ttcgtattta ggaggggggt 4861 gaatgtatcg tacaagaggg tttatttatc cagctgttct ttttgtgtca gcgcttgtgc 4921 tgttaatcgt gaactttgtt gctgctcaat atatttcacg ctgcatgttt gagaaggaaa 4981 caaaagagtt atacatagga gagaatttgc ttcaaaatgg ggtgcttctt tcgattcggc 5041 atgttctaga ggaacggaaa ggccaggagg gtacgcagca atttctatat ggacgggttt 5101 cttattacat tcatgataca tcgataaaag aacaaaaaga aatcaactta agagtgtcaa 5161 cggattcggg aacagaaaga actgcacaga tcgtgtttga ccaaaaacag aaaaaactgc 5221 tgagatggac agaataaaac agtgtaaagg gtataaaaaa agtcatgtga gacaacactc 5281 ataataattg aatgatgagg tgatcacgtg aaaacgaatg attatgttaa atatatgacg 5341 cagcaatttg tcaaatatat agatactccg agagatgagc gaaaagaacg aaaagaggtg 5401 cggaaagaaa caaaaacgcc tgtttcccag cagtggttcg gtattttacc ctatggcttc 5461 cgactttggc tgaaacggaa aaaataaccg caaataaacg aataaggtcc ttcaaaaaat 5521 ggaggacctt attgatattc ttctaatatg gcaattttat tgaccttttg gctataagga 5581 tcaaatgaaa tcgtcacaaa aacgccgaat tcttttgacc cttccctcag agttaaatgg 5641 tattgcttca ctgcttcatc ttttctttta cggtcccata ctttttgttt gaacagtacc 5701 tgtgcgagcg ggtacctttt ttttgcttct tttacagcaa tctcttccca tttggacatg 5761 tggcgggcgg ttacaagcgg tgtttcttct gcgtgagcgg ctgtggtgcc aaagacgaga 5821 agagatagac aaatcacaca ttgtttgatc atcatgctgt cacctttctt tgtttattat 5881 taccaaataa taatgggata tgcatttaac ttctcacata acaatcccaa aaatttctaa 5941 aaaattgaaa aaatgagcaa tactgagcaa gactttgtaa tatgatgaaa acattctttt 6001 aaacgaacaa aatgagcgat ttcggtgttt ttaaatctat aaatcgttga ttatactcta 6061 tttgtgaagt tctttaaaga gaacgattgt catatcaagt tacagtgttt tacaggaggt 6121 aagatatgtt tcgattgttt cacaatcagc aaaaggcgaa gacgaaactg aaagttctgc 6181 ttatctttca gctttcagtc attttcagtc tgactgccgc aatatgctta caattttcga 6241 tgatacaagc gctgcttttc atgatattga aacatttg // LOCUS CELTC1A 234 bp ds-DNA INV 10-AUG-1990 DEFINITION C.elegans Tc1(1664) element with a DNA insert. ACCESSION M22301 M29977 KEYWORDS Tc1 element; Tc1 repetitive element. SOURCE C.elegans (var. Bristol, strain N2) DNA, clone pCeh45. ORGANISM Caenorhabditis elegans Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; Spirurida; Spirurina; Filarioidea; Filariidae. REFERENCE 1 (bases 1 to 234) AUTHORS Harris,L.J. and Rose,A.M. TITLE Structural analysis of Tc1 elements in Caenorhabditis elegans var. Bristol (strain N2) JOURNAL Plasmid 22, 10-21 (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by L.J.Harris, 18-JAN-1989. FEATURES from to/span description rpt < 1 > 234 Tc1(1664) insertion element rpt 85 94 direct repeat copy A rpt 107 116 direct repeat copy B rpt 128 137 direct repeat copy C recomb 87 88 Tc1(1664) DNA end/inserted DNA start recomb 141 142 inserted DNA end/Tc1(1664) DNA start BASE COUNT 67 a 36 c 32 g 99 t ORIGIN 3 bp upstream of EcoRV site. 1 gatatccact tttggttttt tgtgtgtaac ttttttctca agcatccatt tgacttgaat 61 ttttccgtgt gcataaagcg aaatgtttcg caaattaagc aaataaagtt tcgcaaagaa 121 ttaaattgtt tcgcaaacga aacgcaaatt tgcggaccaa acattacatg attatcgatt 181 ttttctgaat tttatttcaa ttttttgatt ttttcgtttt tccaattttc atta // LOCUS HUMDEF1A 448 bp ss-mRNA PRI 10-AUG-1990 DEFINITION Human neutrophil peptide (defensin) 1 mRNA, complete cds. ACCESSION M21130 M22160 KEYWORDS cytotoxic protein; defensin 1; microcidal protein; neutrophil peptide. SOURCE Human leukocyte (myeloid cell line HL-60), cDNA to mRNA, clones HNP1a and HNP1b. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 448) AUTHORS Daher,K.A., Lehrer,R.I., Ganz,T. and Kronenberg,M. TITLE Isolation and characterization of human defensin cDNA clones JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 7327-7331 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by K.Daher, 25-NOV-1988. Submission form for [1] kindly provided by M.Kronenberg, 01-JAN-1989. FEATURES from to/span description pept 51 335 neutrophil peptide 1 precursor /nomgen="DEF1" /map="8p23" /hgml_locus_uid="LR0109A" sigp 51 107 neutrophil peptide 1 signal peptide matp 243 332 neutrophil peptide 1 mRNA < 1 448 neutrophil peptide 1 mRNA BASE COUNT 106 a 127 c 110 g 105 t ORIGIN Unreported. 1 tctgccctct ctggtcaccc tgcctagcta gaggatctgt gaccccagcc atgaggaccc 61 tcgccatcct tgctgccatt ctcctggtgg ccctgcaggc ccaggctgag ccactccagg 121 caagagctga tgaggttgct gcagccccgg agcagattgc agcggacatc ccagaagtgg 181 ttgtttccct tgcatgggac gaaagcttgg ctccaaagca tccaggctca aggaaaaaca 241 tggcctgcta ttgcagaata ccagcgtgca ttgcaggaga acgtcgctat ggaacctgca 301 tctaccaggg aagactctgg gcattctgct gctgagcttg cagaaaaaga aaaatgagct 361 caaaatttgc tttgagagct acagggaatt gctattactc ctgtaccttc tgctcaattt 421 cctttcctca tcccaaataa atgccttg // LOCUS MUSMIPAA 753 bp ss-mRNA ROD 10-AUG-1990 DEFINITION Mouse macrophage inflammatory protein (MIP)-1a mRNA, complete cds. ACCESSION X12531 M19382 KEYWORDS lymphokine; macrophage inflammatory protein. SOURCE Mouse cell line RAW264.7., cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 753) AUTHORS Davatelis,G., Tekamp-Olson,P., Wolpe,S.D., Hermsen,K., Luedke,C., Gallegos,C., Coit,D., Merryweather,J. and Cerami,A. TITLE Cloning and characterization of a cDNA for murine macrophage inflammatory protein (MIP), a novel monokine with inflammatory and chemokinetic properties JOURNAL J. Exp. Med. 167, 1939-1944 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by G.Davatelis, 27-MAY-1988. FEATURES from to/span description pept 71 349 macrophage inflammatory protein precursor sigp 71 139 macrophage inflammatory protein signal peptide matp 140 346 macrophage inflammatory protein mRNA < 1 753 MIP mRNA signal 753 753 polyA signal BASE COUNT 190 a 203 c 143 g 217 t ORIGIN 1 aagcagcagc gagtaccagt cccttttctg ttctgctgac aagctcaccc tctgtcacct 61 gctcaacatc atgaaggtct ccaccactgc ccttgctgtt cttctctgta ccatgacact 121 ctgcaaccaa gtcttctcag cgccatatgg agctgacacc ccgactgcct gctgcttctc 181 ctacagccgg aagattccac gccaattcat cgttgagtat tttgaaacca gcagcctttg 241 ctcccagcca ggtgtcattt tcctgactaa gagaaaccgg cagatctgcg ctgactccaa 301 agagacctgg gtccaagaat acatcactga cctggaactg attgcctgag agtcttggag 361 gcagcgagga accccccaaa cctccatggg tcccgtgtag agcaggggct tgagccgaac 421 attcctgcca cctgcatagc tccatctcct ataagctgtt tgctgccaag tagccacatc 481 gagggactct tcacttgaaa ttttatttaa tttaatccta ttggtttaat actatttaat 541 tttgtaattt attttattgt catacttgta tttgtgacta tttattctga aagacttcag 601 gacacgttcc tcaaccccca tctccctccc agttgttcac actgtttggt gacagctatt 661 ctaggtagac atgatgacaa agtcatgaac tgacaatgta caatagatgc tttgtttata 721 ccagagaagt aataaatatg ccctttaaca agt // LOCUS ONGSSDNA 154 bp ds-DNA INV 10-AUG-1990 DEFINITION O.volvulus species-specific DNA. ACCESSION J03990 M22037 KEYWORDS . SOURCE O.volvulus (strain Liberian) DNA, clone C1A1-2. ORGANISM Onchocerca volvulus Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Similiidae. REFERENCE 1 (bases 1 to 154) AUTHORS Harnett,W., Chambers,A.E., Renz,A. and Parkhouse,R.M.E. TITLE An oligonucleotide probe specific for Onchocerca volvulus JOURNAL Mol. Biochem. Parasitol. 35, 119-126 (1989) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by W.Harnett, 18-JAN-1989. BASE COUNT 50 a 23 c 27 g 54 t ORIGIN 1 gaattcaaaa aaatatagtc gaatattttt cttaggaccc agtttgaagg tacgtacccg 61 ttttttgaaa ttatatgtca taggtcatct gttaataatc aattttgcaa aatgcgtttt 121 tcgccgaaaa aatcgccgtg taaatgtgga attc // LOCUS RHMDCTAB 2300 bp ds-DNA BCT 10-AUG-1990 DEFINITION R.meliloti C-4-dicarboxylate carrier protein (dctA) gene, complete cds, and dctB protein gene, partial cds. ACCESSION M26399 M29729 KEYWORDS C4-dicarboxylate carrier protein; membrane protein; transport protein. SOURCE R.meliloti (strain 2011) DNA, clone pRmSC121. ORGANISM Rhizobium meliloti Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae. REFERENCE 1 (bases 1 to 2300) AUTHORS Engelke,T., Jording,D., Kapp,D. and Puehler,A. TITLE Identification and sequence analysis of the Rhizobium meliloti dctA gene encoding the C4-dicarboxylate carrier JOURNAL J. Bacteriol. 171, 5551-5560 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Jording, 13-OCT-1989. FEATURES from to/span description pept 750 2111 C4-dicarboxylate carrier A (put.) pept 786 2111 C4-dicarboxylate carrier A' (put.) pept 555 < 1 (c) dctB protein signal 582 597 activator sequence signal 673 689 5' promoter region BASE COUNT 388 a 758 c 707 g 447 t ORIGIN 1 gtcgacgcgc cgggaaatat aaagcccggg ccgattgctg accgtgccca tggcgaaatg 61 ttcggccatg ccgtcgcgga cggcgagccg gaaataatcg cggaaggcat agtcgttgcc 121 gacaaagctc gtcggctcct gccagttgct ggccgcgacg gcgacgccgc tccggtcgat 181 cagataaatg accgcggctt cggcgcttgt cgccagggcc tcgagcttac ggttgatgcg 241 gtcgagcgac ggcctgtccg gcgaaagcaa tgcgccacga atggctgcgt cgtcggcgag 301 gacaagcggc agggcgcgct gccgttccac gactgctcga agaagcgagg ctttcaggct 361 ggcgtcgatc cggctctgac cggcaaggcc ggcgagcgcc tgcgaccggc cgtagtctcg 421 cgcaagaaga aggccggctg ccagaaggac cagcgcgact gccgcgaaaa cgagccagga 481 ccgtcgggcc cggctgcgaa gcgcatgcgg atcgctcgct tctgcaggaa gtttgaccat 541 gcgaacatgg tgcatgtttt cgcccaggac gccagcactt ctgtgcggaa atccgcacat 601 atccacgaac ggcaagcgag caccgctccc aaaaatgtca tggattcaat cgcataggcc 661 ttccgcggca aactggcacg catgttgctg acaagctcca caaggcagcc actgctgtcg 721 atcttcggaa agcggcccgg gaggccggca tgttgccgga ctgggcctgc cacgtggagg 781 atatcatgat catcgaacat tccgcggagg tccgcggcaa gacacccctt taccgccatc 841 tctatgttca ggtgctggcg gcgatcgccg cgggcatcct gctcgggcat ttctatcctg 901 atatcggcac ggagctcaaa ccgcttggcg acgccttcat caggctcgtg aagatgatca 961 tcgcgccggt gatcttcctg acggtcgcga ccgggattgc cggcatgacc gatctcgcca 1021 aggtcggccg cgtcgccggc aaggcgatga tctactttct cgccttctcc accctcgcgc 1081 ttgtcgtcgg cctcgtcgtc gcaaacgtgg tgcagccggg cgcgggcatg catatcgatc 1141 cggcctcgct ggacgccaag gcggtcgcga cctatgccga gaaagcgcat gagcagtcga 1201 tcaccggctt cctgatgaac atcatcccga cgacgctcgt cggcgccttc gccgaaggcg 1261 acatcctgca ggtcctgttc atctcggtgc tcttcggtat ctcgctggcg atcgtcggca 1321 agaaagccga gcccgtggtc gatttcctgc aggcgctgac gctgccgatc ttccggctcg 1381 ttgcgatcct gatgaaggcc gccccgatcg gcgccttcgg cgccatggcc ttcaccatcg 1441 gcaagtacgg tatcgcctcg atcgccaatc tcgccatgct gatcggcacc ttctatctga 1501 cgtcgtttct cttcgtcttc atcgttctcg gcgcggtcgc acgctataac ggcttctcga 1561 tcctctcgct catccgctac atcaaggagg agctgctgct ggtgctcggg acgtcctctt 1621 cggaggcggc tcttccgggc ctcatgaaca agatggagaa ggccggctgc aagcgctcgg 1681 tcgtcggtct cgtcattccg accggttatt ccttcaacct ggacggcacc aatatctaca 1741 tgacccttgc ggccctgttc atcgcccagg cgaccgatac gccgctctca tacggcgacc 1801 agattctgct gctcctcgtc gcaatgctga gctcgaaggg tgcggccggc attaccggcg 1861 ccggcttcat cacgcttgcc gcaacgctct cggtcgttcc ctccgtgccg gtcgccggca 1921 tggcgctgat cctcggcatc gaccgcttca tgtcggaatg ccgcgccctg accaatttcg 1981 tcggcaacgc ggttgcgacg atcgtggtgg cgaagtggga gggcgagctc gatcaggcgc 2041 agctttccgc agctctcggc ggcgaggcgt ccgtcgaggc catcccggcg gtcgtccagc 2101 ccgccgaata agcttgcctc ccagggcgag accgcacgct ggtgcgcttg gcgcggtccg 2161 gttctccggg ccgcgctttt ttatcgccgg ccaggcctga gggggtgggc actcggcaca 2221 agatcagatg ccgtgcagca cctgcgtcgc cttcactgcc gccgaggccc ggttttcgac 2281 gccgagcttc acatagatct // LOCUS HUMMYOHP 6016 bp ss-mRNA PRI 10-AUG-1990 DEFINITION Human perinatal myosin heavy chain mRNA, complete cds. ACCESSION M36769 M35250 Y00821 KEYWORDS myosin; myosin heavy chain. SOURCE Human fetal muscle, cDNA to mRNA, clone pSMHCP. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 2651 to 6016) AUTHORS Leinwand,L.A. TITLE ; JOURNAL Unpublished (1989) see COMMENT for author address STANDARD simple automatic REFERENCE 2 (bases 2651 to 6016) AUTHORS Feghali,R. and Leinwand,L.A. TITLE Molecular genetic characterization of a developmentally regulated human perinatal myosin heavy chain JOURNAL J. Cell Biol. 108, 1791-1797 (1989) STANDARD simple automatic REFERENCE 3 (bases 1 to 2650) AUTHORS Karsch-Mizrachi,I., Feghali,R., Shows,T.B. and Leinwand,L.A. TITLE Generation of a full-length human perinatal myosin heavy-chain-encoding cDNA JOURNAL Gene 89, 289-294 (1990) STANDARD full staff_review COMMENT Author address: L.A.Leinwand Albert Einstein College of Medicine 1300 Morris Park Avenue Bronx, N.Y. 10461, USA. Submitted (12-APR-1989) on tape to the EMBL data library. Draft entry and computer-readable sequence for [3] kindly submitted by I.Karsch-Mizrachi, 18-JUN-1990. FEATURES from to/span description pept 74 5887 perinatal myosin heavy chain mRNA < 1 6016 perinatal myosin heavy chain mRNA BASE COUNT 1953 a 1300 c 1586 g 1177 t ORIGIN 1 gtggaacact tctgaacctg catttttatc tggaactcca gaagcagaat cctttgctaa 61 ataaatcgca gccatgagtg cgagctcaga cgctgagatg gctgtttttg gcgaagctgc 121 tccctacctt cgaaaatcag aaaaggagcg gattgaggcc caaaacaagc cgtttgatgc 181 taaaacatct gtctttgtgg cggagcccaa ggaatcctat gtgaagagca ctatacaaag 241 caaagaagga gggaaagtaa ccgtaaagac tgaaggtgga gcaactctaa ctgtcaggga 301 agaccaagtc ttccctatga accctccgaa atatgacaaa attgaggaca tggccatgat 361 gactcatcta cacgagcctg gagtgctgta caacctcaaa gagcgctatg cagcctggat 421 gatctacacc tactcaggcc tcttctgtgt caccgtcaac ccctacaagt ggctgccggt 481 gtacaagccc gaggtggtgg ctgcctacag aggcaaaaag cgccaggagg ccccgcccca 541 catcttctcc atctctgaca atgcctatca gttcatgttg actgatcgag agaatcagtc 601 catcctgatc accggagaat ctggtgccgg aaagactgtg aacaccaagc gtgtcatcca 661 atactttgca acaattgcag ttactggaga gaagaagaag gatgaatctg gcaaaatgca 721 ggggactctg gaagatcaaa tcatcagcgc caatccccta ctggaggcct ttggcaatgc 781 caaaaccgtg aggaatgaca actcctctcg ctttggtaaa ttcattagaa tccactttgg 841 tactacaggg aagctggcat ctgctgatat agaaacatat cttttagaaa agtccagagt 901 tactttccag ctaaaggcgg aaagaagcta ccatattttt tatcagatca cttccaataa 961 gaagccagat ctaattgaaa tgctcctgat caccaccaac ccatatgact atgccttcgt 1021 cagtcagggg gagatcacag ttcccagtat tgatgaccaa gaagagttga tggccactga 1081 tagtgccatt gacatcctgg gcttcactcc tgaagagaaa gtgtccatct ataaactcac 1141 aggggctgtg atgcattatg ggaacatgaa attcaagcaa aagcagcgtg aggagcaagc 1201 tgagccagat ggcacagaag tcgctgacaa ggcagcctat ctccagagtc tgaactctgc 1261 agacctactc aaagccctct gctaccctag ggtcaaggtt ggcaatgagt atgtcaccaa 1321 aggccagact gtgcagcagg tgtacaatgc cgtgggtgct ctggccaaag ccgtctacga 1381 gaagatgttc ctgtggatgg tcacccgcat caaccagcag ctggacacca agcagcccag 1441 gcagtacttc atcggggtct tggacattgc tggctttgaa atctttgatt ttaacagcct 1501 ggagcagctg tgcatcaact tcaccaacga gaaactgcaa cagtttttca accaccacat 1561 gtttgtgcta gagcaggagg agtacaagaa ggaaggcatc gagtggacgt tcattgactt 1621 tgggatggac ctggctgcct gcattgagct cattgagaag ccactgggca tcttctccat 1681 cctggaagag gagtgcatgt tccctaaggc cacggacacc tccttcaaga acaagctgta 1741 tgaccagcac ctgggcaagt ctgccaactt ccagaagccc aaggtggtca aaggcaaggc 1801 tgaggcccac ttctctctga ttcactatgc tggcactgtg gactacaaca ttactggctg 1861 gctggacaaa aataaggacc ccctgaatga tactgtggtt gggctgtacc agaagtctgc 1921 aatgaagact ctagccagtc tcttttccac gtatgctagt gctgaagcag atagcagcgc 1981 gaagaaaggt gctaagaaaa agggctcttc tttccagact gtgtctgccc ttttcaggga 2041 aaatttaaat aaattgatga cgaatctgag gagcacacac cctcacttcg tacggtgtat 2101 cattcccaat gaaaccaaaa ctcctggggc aatggaacat gaacttgtgt tgcaccagct 2161 gaggtgtaat ggtgtgctgg aaggcatccg catctgtagg aaaggattcc caagcagaat 2221 cttatatggt gatttcaaac aaagatacaa ggttttaaat gcaagtgcta ttccagaggg 2281 acagttcatt gacagcaaga aggcttctga gaaacttctt gcatctattg atattgatca 2341 tactcaatat aaatttggac ataccaaggt tttcttcaaa gctggacttc tgggtcttct 2401 ggaagaaatg agagatgaaa aattagccca aattataaca agaacacaag ctgtctgtag 2461 gggattccta atgagggtag aatatcagaa gatgttgcaa aggagagaag cacttttctg 2521 catccagtat aatgtccgtg ccttcatgaa cgtcaagcac tggccctgga tgaaactctt 2581 tttcaagatt aagcccctcc tcaagagtgc agagaccgag aaagagatgg ccaccatgaa 2641 ggaagaattc cagaaaacca aagatgaact cgccaagtca gaggcaaaac ggaaggagct 2701 agaggaaaaa atggtcactc tcttaaaaga gaaaaatgac ctgcaactcc aggttcaatc 2761 tgaagcagat agcttggctg atgcagagga aaggtgtgag caactgatta aaaacaaaat 2821 ccaacttgag gccaaaatca aagaggtgac tgaaagagct gaggaggagg aagagatcaa 2881 tgctgagctg acagccaaga agagaaaact ggaggatgaa tgttcagaac tcaagaaaga 2941 cattgatgac cttgagctga cactggccaa ggttgagaag cagaaacatg ccacggagaa 3001 caaggtgaaa aatcttacag aagagatggc aggcctggat gaaaccattg caaaactgtc 3061 caaggagaag aaggctctcc aagagaccca ccagcagacc ctggatgacc tgcaggcaga 3121 ggaggacaaa gtcaacatcc tgaccaaagc taaaaccaag ctagaacagc aagtggatga 3181 tcttgaaggg tctctggaac aagaaaagaa gcttcgaatg gatctagaaa gagcaaagcg 3241 gaaactggag ggtgacctca aattggccca agaatccaca atggatatgg aaaatgacaa 3301 acagcaactt gatgaaaagc ttgaaaagaa agaatttgaa atcagcaatt tgataagcaa 3361 aattgaagat gagcaagctg tagaaattca actacagaag aagatcaaag agttgcaggc 3421 ccgcattgag gagctggggg aagaaatcga ggcagagagg gcgtcccgag ccaaagcgga 3481 gaagcagcgc tctgacctct cccgggaact ggaggagatc agcgagaggc tggaagaagc 3541 cggtggggca acttctgctc aggtggaatt gaacaagaag cgggaggctg agtttcagaa 3601 actgcgcagg gacctggagg aggccaccct gcagcatgaa gctatggtgg ctgctcttcg 3661 gaagaagcac gcagacagta tggctgagct tggggagcag attgacaact tgcagcgggt 3721 caaacagaag ctggagaagg agaagagtga gctgaagatg gagactgatg acctcagcag 3781 taacgcagag gccatttcca aagccaaggg ccaccttgaa aagatgtgcc gctctctaga 3841 agatcaagtg agtgggctta agaccaagga agaggagcag cagcggctga tcaatgacct 3901 cacagcacag agagcgcgcc tgcagacaga agcgggtgaa tattctcgac aattagatga 3961 gcaagatgct ttagtctctc agctttcaag gagcaagcaa gcatctactc agcagattga 4021 agagctgaaa catcaactag aggaagaaac taaagccaag aacgccctgg cacacgccct 4081 gcagtcctcc cgccatgact gcgacctgct gcgggaacag tatgaggaag agcaggaagg 4141 caaagctgag ctgcagaggg cgctgtccaa ggccaacagt gaggttgccc agtggagaac 4201 caaatacgag acggatgcca tccagcgcac agaggagctg gaggaggcca agaaaaagtt 4261 ggcccagcgc ctgcaagaag ctgaggaaca tgtagaagct gtgaacgcca aatgtgcttc 4321 ccttgagaag acgaagcagc ggctccagaa tgaagttgaa gacctcatgc ttgatgtgga 4381 aaggtctaat gcagcctgtg cagcccttga taagaagcaa aggaactttg acaaggtcct 4441 atcagaatgg aagcagaagt atgaggaaac tcaggctgaa cttgaggcct cccagaagga 4501 gtcacgttct cttagcactg agctgttcaa ggtgaagaat gtctatgagg aatccctgga 4561 tcaactcgaa acgctaagaa gagcacataa gaacttgcaa caggagattt ctgacctcac 4621 tgagcagatt gcagagggag gaaagcaaat tcatgaattg gagaaaataa agaagcaagt 4681 agaacaagag aaatgtgaaa ttcaggctgc tttagaggaa gcagaggcat ctcttgaaca 4741 tgaagaagga aagattctgc gtatccagct tgagttaaac caagtcaagt ctgaagttga 4801 tagaaaaatc gcagaaaagg atgaggaaat tgaccagctg aagagaaacc acactagagt 4861 cgtggagaca atgcagagca cgctggatgc agagattaga agcagaaatg atgctctgag 4921 agtcaagaag aaaatggaag gagatctgaa tgaaatggaa atccagctga accatgccaa 4981 tcgcttagct gcagagagtt taaggaacta caggaacacc caaggaatcc tgaaggaaac 5041 ccagctccac ctggatgatg ctctccgggg ccaggaggac ctcaaggaac agctggcaat 5101 tgtggagcgc agagccaacc tgctgcaggc tgagatcgag gagctgtggg ccactctgga 5161 acagacagag agaagcagga aaatcgccga acaggagctc ctggatgcca gtgagcgtgt 5221 ccagctcctc cacacccaga ataccagtct cattaacacc aagaagaaat tagaaaatga 5281 cgtttcccaa ctccaaagtg aagtggaaga agtaatccaa gaatcacgca atgcagaaga 5341 gaaagccaag aaggccatca ctgatgctgc catgatggct gaggagctga agaaggaaca 5401 ggacaccagc gcccacctgg agcggatgaa gaagaacctg gagcagacgg tgaaggacct 5461 gcagcatcgt ctagatgagg ccgagcagct ggcgctgaag ggtgggaaga agcagatcca 5521 gaaactggag gccagggtac gtgagcttga aggagaggtt gaaaatgaac agaaacgtaa 5581 tgcagaggct gttaaaggtt tacggaaaca tgaccgacga gtaaaagaac tcacctacca 5641 gactgaagaa gatcgcaaga atgttctcag gctgcaggac ttggtagata aattacaggc 5701 gaaggtgaaa tcatacaaga gacaagctga ggaggctgag gaacaatcca atgctaatct 5761 atctaaattc cgcaaactcc agcatgagct ggaggaggcc gaggaacggg ctgacattgc 5821 tgagtcccag gtcaacaaat tgcgagtgaa gagccgagag gttcacacaa aaatcagtgc 5881 agagtaaaca cacctgcctg atgctatcaa gaggctgaag aaaggcacaa aatgtgctat 5941 ttttggtcac ttgctttatg acgtttattt tcctgttaaa gctgaataaa taaaaactac 6001 agtaaatgta tacatt // LOCUS CDCMANABD 2155 bp ds-DNA BCT 10-AUG-1990 DEFINITION C.saccharolyticum beta-D-mannanase (manA) gene, complete cds. ACCESSION M36063 KEYWORDS beta-D-mannanase. SOURCE C.saccharolyticum DNA, clone lambda-NZP2. ORGANISM Caldocellum saccharolyticum Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 2155) AUTHORS Leuthi,E., Bhana Jasmat,N., Grayling,R.A., Love,D.R. and Bergquist,P.L. TITLE Cloning, sequence analysis and expression in Escherichia coli of a gene coding for a beta-mannanase from the extremely thermophilic bacteria Caldocellum saccharolyticum JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.Luethi, 03-JUL-1990. Author address: E.Luethi University of Auckland Department of Cellular and Molecular Biology Private Bag Auckland NEW ZEALAND email:SYSTEM@CELVAX.AUKUNI.AC.NZ FEATURES from to/span description pept 841 1881 beta-D-mannanase precursor (E.C. 3.2.11.78) sigp 841 963 beta-D-mannanase signal peptide matp 964 1878 beta-D-mannanase pept < 1 685 ORF1 (AA at 2) binding 818 830 manA ribosome binding site BASE COUNT 721 a 334 c 590 g 510 t ORIGIN 1 gggtagcaat acatggtttg gattccaggc atggtcgatg cagagggtag cggagtatta 61 ctatgtgaca ggagataaag atgcaggggc actgcttgag aagtgggtaa gctggataaa 121 gagtgtagtg aagttgaata gtgatggtac atttgcgata ccatcgacgc ttgattggag 181 tgggcagcca gacacatgga atgggacata tacaggtaat ccgaacttgc atgtgaaggt 241 agtagattat gggacggatt taggaataac ggcatcactt gcgaatgcac tactttatta 301 cagtgcaggg acgaagaagt atggggtatt tgatgaggaa gcgaagaatt tagcgaagga 361 attgctggac aggatgtgga agttatacag ggatgagaaa ggtttatcgg cgccagagaa 421 gagagcggac tacaagaggt tctttgagca agaggtatac attccggcag gatggacagg 481 gaagatgccg aatggagatg taataaagag cggagttaag tttatagaca taaggagcaa 541 gtacaaacaa gatcctgatt ggccgaagtt agaggcggca tacaagtcag ggcaggtacc 601 ggagttcaga tatcacaggt tctgggcaca gtgtgacata gcaattgtta atgcaacata 661 tgaaattctg ttcggtaatc aataatgagt aggtaaatgg aaatttagcg gggtggcaca 721 tctataagtt tggtgtgctg cctcgctaaa atcctgtatg gaagtgttcg aaaaatagta 781 caaaaaaatg gcgaggtaaa atggcaaaaa tgcataaaaa agagggggtt aggttttaag 841 atgagactaa aaacaaaaat aagaaagaaa tggttgagtg tactatgtac agttgttttt 901 ttattgaaca ttttgtttat agcaaatgta acgattttac ccaaagttgg tgcggctaca 961 tctaatgatg gagtagtgaa gatagatacg agtacgttaa taggaaccaa tcacgcacat 1021 tgctggtaca gagatagact tgatacggca ttgcgtggaa ttaggtcatg gggtatgaac 1081 tctgtgaggg tagtgctgag taatggttac cgatggacga agataccagc aagtgaagtg 1141 gcaaatatta tatcattgtc aagaagtctt ggtttcaaag ctattatatt agaagtgcac 1201 gacacaacag gatatggaga agatggggca gcatgttcat tggcacaagc agtggaatat 1261 tggaaggaga taaagagcgt attagacggt aacgaagatt ttgtaattat aaacattggt 1321 aatgagccgt atgggaacaa taactatcaa aactgggtta atgacacgaa gaacgctata 1381 aaagcactta gagatgcagg attcaagcac acgataatgg tggatgcgcc gaactggggt 1441 caggattggt ctaatactat gagagataat gcccagagca taatggaagc agatccgctg 1501 cgcaatttgg tattttcgat tcatatgtat ggcgtataca atacagcaag caaggtcgaa 1561 gagtacatca aatcatttgt tgataagggg ttaccattgg ttattggaga atttggacat 1621 cagcacacag atggtgaccc tgatgaagaa gctattgtca ggtatgcaaa acagtacaag 1681 ataggattat ttagttggtc gtggtgtgga aattcgagct atgttgggta tttggacatg 1741 gtaaacaact gggaccccaa taatccaact ccatggggac aatggtataa aactaatgca 1801 attggtacat cttctacacc aacacctacg tcgacagtaa cgccaacgcc acccccacgc 1861 caacaccaac accgacagta acagcgacac caacaccaac acctacacct gttagcacac 1921 ctgcgacaag tgggcagata aaggtactgt atgctaacaa ggagacaaac agcactacaa 1981 acacgataag gccgtggttg aaggtagtga atagtggtag cagtagcata gatttgagca 2041 gggtaacgat aaggtactgg tacacggtag atggtgagag ggcacagact ggatatcaga 2101 ctgggcacag ataggagcaa gcaatgtcaa cattcaagtt tgtgaagctg agagt // LOCUS MUSMLHRCA 1431 bp ss-mRNA ROD 10-AUG-1990 DEFINITION Mouse lymph node homing receptor mRNA, complete cds. ACCESSION M36005 KEYWORDS lymph node homing receptor. SOURCE Mouse cell line EL-4/MEL-14hi, cDNA to mRNA, clone Ly-22+(C3H). ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1431) AUTHORS Siegelman,M.H., Cheng,I.C., Weissman,I.L. and Wakeland,E.K. TITLE The mouse lymph node homing receptor is identical with the lymphocyte cell surface marker Ly-22: Role of the EGF domain in endothelial binding JOURNAL Cell 61, 611-622 (1990) STANDARD simple staff_review FEATURES from to/span description pept 52 1170 lymph node homing receptor precursor sigp 52 165 lymph node homing receptor signal peptide matp 166 1167 lymph node homing receptor BASE COUNT 412 a 323 c 362 g 334 t 0 others ORIGIN Chromosome 1. 1 ggtggaggag gctgaggctg cagagagact tgcagagaga cccagcaagc catggtgttt 61 ccatggagat gtgagggtac ttactggggc tcgaggaaca tcctgaagct gtgggtctgg 121 acactgctct gttgtgactt cctgatacac catggaactc actgttggac ttaccattat 181 tctgaaaagc ccatgaactg ggaaaatgct agaaagttct gcaagcaaaa ttacacagat 241 ttagtcgcca tacaaaacaa gagagaaatt gagtatttag agaatacatt gcccaaaagc 301 ccttattact actggatagg aatcaggaaa attgggaaaa tgtggacatg ggtgggaacc 361 aacaaaactc tcactaaaga agcagagaac tggggtgctg gggagcccaa caacaagaag 421 tccaaggagg actgtgtgga gatctatatc aagagggaac gagactctgg gaaatggaac 481 gatgacgcct gtcacaaacg aaaggcagct ctctgctaca cagcctcttg ccagccaggg 541 tcttgcaatg gccgtggaga atgtgtggaa actatcaaca atcacacgtg catctgtgat 601 gcagggtatt acgggcccca gtgtcagtat gtggtccagt gtgagccttt ggaggcccct 661 gagttgggta ccatggactg catccacccc ttgggaaact tcagcttcca gtccaagtgt 721 gctttcaact gttctgaggg aagagagcta cttgggactg cagaaacaca gtgtggagca 781 tctggaaact ggtcatctcc agagccaatc tgccaagtgg tccagtgtga gcctttggag 841 gcccctgagt tgggtaccat ggactgcatc caccccttgg gaaacttcag cttccagtcc 901 aagtgtgctt tcaactgttc tgagggaaga gagctacttg ggactgcaga aacacagtgt 961 ggagcatctg gaaactggtc atctccagag ccaatctgcc aagagacaaa cagaagtttc 1021 tcaaagatca aagaaggtga ctacaacccc ctcttcattc ctgtagccgt catggtcacc 1081 gcattctcgg ggctggcatt tctcatttgg ctggcaaggc ggttaaaaaa aggcaagaaa 1141 tctcaagaaa ggatggatga tccatactga ttcatccttt gtgaaaggaa agccatgaag 1201 tgctaaagac aaaacattgg aaaataacgt caagtcctcc cgtgaagatt ttacacgcag 1261 gcatctccca cattagagat gcagtgtttg ctcaacgaat ctggaaggat ttcttcatga 1321 ccaacagctc ctcctaattt cccctcgctc attcatccca ttaaccctat cccataatgt 1381 gtgtctatac agagtagtat tttatcatct tttctgtgga ggaacaagca a // LOCUS MUSMLHRCB 1431 bp ss-mRNA ROD 10-AUG-1990 DEFINITION Mouse lymph node homing receptor mRNA, complete cds. ACCESSION M36058 KEYWORDS lymph node homing receptor. SOURCE Mouse cell line EL-4/MEL-14hi, cDNA to mRNA, clone Ly-22-(SJL). ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1431) AUTHORS Siegelman,M.H., Cheng,I.C., Weissman,I.L. and Wakeland,E.K. TITLE The mouse lymph node homing receptor is identical with the lymphocyte cell surface marker Ly-22: Role of the EGF domain in endothelial binding JOURNAL Cell 61, 611-622 (1990) STANDARD simple staff_review FEATURES from to/span description pept 52 1170 lymph node homing receptor sigp 52 165 lymph node homing receptor signal peptide matp 166 1167 lymph node homing receptor BASE COUNT 412 a 322 c 362 g 335 t 0 others ORIGIN Chromosome 1. 1 ggtggaggag gctgaggctg cagagagact tgcagagaga cccagcaagc catggtgttt 61 ccatggagat gtgagggtac ttactggggc tcgaggaaca tcctgaagct gtgggtctgg 121 acactgctct gttgtgactt cctgatacac catggaactc actgttggac ttaccattat 181 tctgaaaagc ccatgaactg ggaaaatgct agaaagttct gcaagcaaaa ttacacagat 241 ttagtcgcca tacaaaacaa gagagaaatt gagtatttag agaatacatt gcccaaaagc 301 ccttattact actggatagg aatcaggaaa attgggaaaa tgtggacatg ggtgggaacc 361 aacaaaactc tcactaaaga agcagagaac tggggtgctg gggagcccaa caacaagaag 421 tccaaggagg actgtgtgga gatctatatc aagagggaac gagactctgg gaaatggaac 481 gatgacgcct gtcacaaacg aaaggcagct ctctgctaca cagcctcttg ccagccaggg 541 tcttgcaatg gccgtggaga atgtgtggaa actatcaaca atcacacgtg catctgtgat 601 gcagggtatt acgggcccca gtgtcagtat gtggtccagt gtgagccttt ggaggcccct 661 gagttgggta ccatggactg catccacccc ttgggaaact tcagcttcca gtccaagtgt 721 gctttcaact gttctgaggg aagagagcta cttgggactg cagaaacaca gtgtggagca 781 tctggaaact ggtcatctcc agagccaatc tgccaagtgg tccagtgtga gcctttggag 841 gcccctgagt tgggtaccat ggactgcatc caccccttgg gaaacttcag cttccagtcc 901 aagtgtgctt tcaactgttc tgagggaaga gagctacttg ggactgcaga aacacagtgt 961 ggagcatctg gaaactggtc atctccagag ccaatctgcc aagagacaaa cagaagtttc 1021 tcaaagatca aagaaggtga ctataacccc ctcttcattc ctgtagccgt catggtcacc 1081 gcattctcgg ggctggcatt tctcatttgg ctggcaaggc ggttaaaaaa aggcaagaaa 1141 tctcaagaaa ggatggatga tccatactga ttcatccttt gtgaaaggaa agccatgaag 1201 tgctaaagac aaaacattgg aaaataacgt caagtcctcc cgtgaagatt ttacacgcag 1261 gcatctccca cattagagat gcagtgtttg ctcaacgaat ctggaaggat ttcttcatga 1321 ccaacagctc ctcctaattt cccctcgctc attcatccca ttaaccctat cccataatgt 1381 gtgtctatac agagtagtat tttatcatct tttctgtgga ggaacaagca a // LOCUS RATMHRT1B 562 bp ds-DNA ROD 10-AUG-1990 DEFINITION Rat MHC class II A-beta RT1.B-b-beta gene, partial cds. ACCESSION M36151 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Rat (strain BUF-(RT-b)) DNA. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 562) AUTHORS Figueroa,F., Guenther,E. and Klein,J. TITLE MHC polymorphism pre-dating speciation JOURNAL Nature 335, 265-267 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 562 MHC A-beta RT1.B-b-beta cell surface glycoprotein (AA at 1) BASE COUNT 122 a 171 c 173 g 96 t ORIGIN 1 agagactccc caagggattt cgtgtaccag ttcgagggcc agtgctacta caccaccggg 61 acgcagcgca tgcggctcgt gaccagacac atctacaacc gggaggagta cgtgcgcttc 121 gacagcgacc tgggcgagta ccgcgcgctg accgagctgg ggcggccctc agccgagtac 181 tggaataagc agtacctcga gcagacgcgg gccgagctgg acagggtctg cagatacaac 241 tacgaggggc cgggggctct cacctccctg agacggcttg agcagcccaa tgtggccatc 301 tccctgtcca ggacagaggc ccttaaccac cacaacctgc tggtctgctc agtgacagat 361 ttctacccag cccagatcaa agtgcgctgg ttccggaatg gccaggagga gacgacgggg 421 gtcgtgtcca cacagcttat taggaatggg gactggacct tccagatcct ggtcatgctg 481 gagatcacgc ctcagcgggg agatgtgtac acctgccatg ttgaccaccc cagccttcag 541 agccctgtca cagtggagtg gc // LOCUS WHTMTTGPAA 1743 bp ds-DNA ORG 10-AUG-1990 DEFINITION Wheat mitochondrial Pro-tRNA gene. ACCESSION M36018 KEYWORDS transfer RNA-Pro. SOURCE Wheat mitochondrial DNA, clone H-P2. ORGANISM Mitochondrion Triticum aestivum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae; Triticum aestivum. REFERENCE 1 (bases 1 to 1743) AUTHORS Joyce,P.B.M., Spencer,D.F. and Gray,M.W. TITLE Multiple sequence rearrangements accompanying the duplication of a Pro-tRNA gene in wheat mitochondrial DNA JOURNAL Plant Mol. Biol. 11, 833-843 (1988) STANDARD simple staff_review FEATURES from to/span description tRNA 831 905 Pro-tRNA BASE COUNT 483 a 396 c 414 g 450 t ORIGIN 1 aagcttcatg aaggaagggt ccaaggacct aagcgacaac ctctacgaaa aagaaaaacc 61 tatatataga tagtggtatg tgttcgctga cactaaatag acttggctcg gggcagaact 121 gagtggctag gctaagttcc ctagtcttct attggcctac ccacccctac gacatggagg 181 cgcttaccct tctccaacaa gcgttccacc acgcggcgat tgattaccgc tttccgtctc 241 aatcagttca gaccaagccc cttctctgat cgggggaagg tcattctaaa acatcagatc 301 tcccattgat gctgatcaaa aataagttta tccacctaga atatatgata ggtaccgaaa 361 gattggatgt catttctaaa caccaggcgg gacgagcagc cctataccac gtgtagccac 421 actctagtgt ccttttctac ttagttggac agatcacttc agaaaatcgt ataaaaatca 481 agcaagaaaa cggatgcgct aacgcgcaac ggctttcgcg ctagttgctc aaaaaatcgt 541 ataaaaatca agcaagaaaa aggttctggc aggctgcgtg ggactgtaaa tcctctttcg 601 ctggctgggc cctttggact cgaaatccaa acggagtgag tggttcgatt ccactctcag 661 aacgagactg aacgaaataa aatgcaagtg aaacgagacg agaatcaaat tgtaggcttc 721 tttcctaaaa gcggtggttc tcgcctcccc gtgcccaaag cggggtgggc gacacagcgc 781 tgcggttctt ttctttatcg atcgggtaga tccatatgtt ctgaggggga cgaggtgtag 841 cgcagtctgg tcagcgcatc tgttttgggt acagagggcc ataggttcga atcctgtcac 901 cttgatgtgg tattcacaca atggggccga agtgcaaagc cccgcagcct atccgtggtc 961 gggaaggcag gggtattaaa agtgaaactt ggcttgcgtg tgcgcgcaca caaaggagac 1021 tgagttggat tgcaaagcct ctcagacgga aggaacctgt tttaaaaacc cctttctaca 1081 atcttgagaa atctcacgca agcaactaca cgccgcaagt cttcggtatt gtcgcaagaa 1141 ttgaaaagat gactccgctt tttctttcga gcatgcaagg cctcagtaga gcgagtctgt 1201 gcatgagtag cgcagtttac aaactagtga atggtcaaat gaataggctc gtttcatcag 1261 gggtaaccaa ggagtaagaa tgatttgata aagtaaacat tccctttgag tgcggttcct 1321 tcagaacaga ctcttccctt ttcagattgc ttctttgggc cacttgagca tcgactgggc 1381 aaggttggat tccctctact tcggggtggc atctatctca catgttttac attgatccat 1441 agccctccgt tagataggaa cagctttgct aaatactgag aactctcgaa tagaatatga 1501 gaagggaaag atcctttaca taaggaacta ttgatcatag gcgtacactc tgaaaagaat 1561 tgaatgaaac tctctgtttc gtcactttga tatcattagc tgtctctgta gctggctctc 1621 atgtctggtg ttccaagtaa aaacttttca actagctggg tcaccgctgt tttaaaccct 1681 gacttggaat cagagccata aacgagtgat tcggtcggac ggacgattga gttcttaaag 1741 ctt // LOCUS YSCRSDSBA 76 bp ss-tRNA PLN 10-AUG-1990 DEFINITION S.cerevisiae Lys-tRNA. ACCESSION M36150 KEYWORDS transfer RNA-Lys. SOURCE S.cerevisiae tRNA, clone alpha-S288c. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 76) AUTHORS Madison,J.T., Boguslawski,S.J. and Teetor,G.H. TITLE Nucleotide sequence of a lysine transfer ribonucleic acid from bakers' yeast JOURNAL Science 176, 687-689 (1972) STANDARD simple staff_review FEATURES from to/span description tRNA 1 76 Lys-tRNA anticdn 34 36 anticodon stt modified 1 1 p modified 9 9 m2a modified 16 16 d modified 17 17 d modified 20 20 d modified 26 26 m22g modified 27 27 p modified 28 28 p modified 37 37 t6a modified 46 46 m7g modified 47 47 d modified 48 48 m5c modified 55 55 p modified 58 58 m1a modified 67 67 p BASE COUNT 15 a 18 c 22 g 16 t 5 others ORIGIN 1 tccttgttag ctcagddggd agagcgttcg gctsttaacc gaaatgdcag gggttcgagc 61 cccctatgag gagcca // LOCUS TNCPOLY 3660 bp ss-RNA VRL 10-AUG-1990 DEFINITION Tobacco necrosis virus RNA dependent RNA polymerase and coat protein genes, complete cds. ACCESSION M33002 KEYWORDS RNA dependent RNA polymerase; coat protein; polymerase. SOURCE Tobacco necrosis virus (strain A, isolate TNV-A-FM1B) Wk6 cell line, cDNA to viral RNA, clones pFM2[0,1,2,3]. ORGANISM Tobacco necrosis virus Viridae; ss-RNA nonenveloped viruses. REFERENCE 1 (bases 1 to 3660) AUTHORS Meulewaeter,F., Seurinck,J. and van Emmelo,J. TITLE Genome structure of tobacco necrosis virus strain A JOURNAL Virology 177, 699-709 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.Meulewaeter, 20-MAR-1990, for release after publication. FEATURES from to/span description pept 60 665 RNA dependent RNA polymerase (put.) /transl_except=(666..668) 669 2234 RNA dependent RNA polymerase pept 2218 2436 ORF 3 pept 2440 2610 ORF 4 pept 2613 3443 coat protein pept 3467 3646 ORF 6 site 666 668 readthrough codon BASE COUNT 1034 a 890 c 895 g 841 t ORIGIN 1 agtattcata ccaagaatac caaataggtg caaggcctta ctcagctaaa gagtctaaaa 61 tggagctacc aaaccaacac aagcaaacgg ccgccgaggg tttcgtatct ttcctaaact 121 ggctatgcaa cccatggaga cgacagcgaa cagtcaacgc tgcagttgcg ttccaaaaag 181 atcttctcgc cattgaggat tccgagcatt tggatgacat caatgagtgt ttcgaggagt 241 ctgctggggc acaatctcag cgaactaagg ttgtcgccga cggagcatat gcccccgcaa 301 aatccaacag gacccgccga gttcgtaagc agaagaagca caagtttgta aaatatcttg 361 tcaacgaagc tcgtgccgag tttggattgc ccaaaccaac tgaggcaaac agacttatgg 421 tccaacattt cttgctcaga gtgtgcaagg attggggcgt tgttactgcc cacgtacacg 481 gcaatgttgc actagctttg ccactggtgt tcatcccaac ggaagatgat ctgctatcac 541 gagcattgat gaacacacat gctactagag ccgctgtacg aggcatggac aatgtccaag 601 gggaggggtg gtggaacaat aggttgggga ttgggggcca ggtcggactg gccttccggt 661 ccaaataggg gtgccttgaa aggaggccag gattctccac gtccgtttcg cgtggggaac 721 atcctgatct ggtggtcata ccatcagggc gccctgagaa acagcgtcag ttgttacgct 781 atagtggtat aggcggccat ttattaatcg gcatccacaa caactctctt tccaacctgc 841 gtaggggctt gatggaaaga gtattctatg tcgaggggcc caatgggctt caagacgccc 901 ctaagcccgt caagggagct tttcgaaccc ttgataagtt tcgtgatctc tatactaaaa 961 atagttggcg tcatacccct gtaactagtg aacaattcct aatgaattac acgggcagga 1021 aactgactat ttacagagag gcggttgata gtttgtcgca tcaacccctt agctcacgag 1081 atgcgaaact aaagacattc gtgaaggccg aaaaattaaa tctttctaag aagcctgacc 1141 ctgctcccag ggtcatccaa cctagatcgc ctcggtataa cgtttgtttg ggcaggtacc 1201 tccgacatta tgagcatcac gcgtttaaaa ccattgccaa gtgctttggg gaaatcacgg 1261 tcttcaaagg gtttactctg gagcaacaag gggaaatcat gcgctcgaag tggaataaat 1321 atgttaatcc cgtcgcagtc ggactcgacg ccagtcgttt cgaccaacac gtgtctgttg 1381 aagcactcga gtatgagcat gaattttacc tcagagacta cccaaatgat aaacagctaa 1441 aatggctgct aaagcagcaa ttgtgcaacg taggaacggc attcgccagt gacggcatta 1501 taaaatacaa gaagaagggt tgtagaatga gcggagacat gaacacgagc ttgggcaact 1561 gcattctaat gtgcgccatg gtctacgggt tgaaagaaca cttaaacatc aatttgtccc 1621 ttgcaaataa tggggatgac tgcgtcattg tctgtgagaa agcggattta aagaaattga 1681 caagcagcat cgagccatat ttcaagcagt ttggattcaa gatggaagtg gaaaaacccg 1741 tggatatatt tgagcgcata gaattttgcc aaacccaacc tgtgttcgat ggatcccagt 1801 acatcatggt acgcaaacct tctgtggtaa catctaaaga cgtcactagc cttatcccat 1861 gtcaaacgaa agcacaatac gcagaatggc tgcaagctgt aggtgagtgt ggcatgagca 1921 ttaacggtgg gattcctgtc atgcagaatt tctaccaaaa gctccaaact ggcatccgcc 1981 gcacaaaatt caccaagacc ggcgagttcc agacgaacgg attggggtat cactctagat 2041 atatgcatag agtggcccgg gttccttcgc ctgaaacccg tttatccttc tatctagctt 2101 tcggtatcac accagacctc caagaagcat tggagatctt ctatgatacc cacaagcttg 2161 agttggatga tgttatccca actgatacct accaagtgtc aggagagcat ttgatcaatg 2221 gattaccaaa ctgatgtaac ggaggacaat gtgcaaatac gcggtcgggc taggagcgtt 2281 gagggtaaga aacacaatgg ttcgggatta actggcgtta agcgtcacgc ggtgagcgaa 2341 acatctcaga aatcacagca aggtactggc aatggaacta tgaccaatat agccgaagaa 2401 cagaccatta ccgtgacata caactttaac ttttaagtta tggctgcgtg tcgctgttgt 2461 gatacttcac caggtattac actattccct tactttgcaa ttctcatcct tatattggca 2521 atacttgttg tagggactcc caatcaacaa tatcaccatt ctccaagcac ttacgagtac 2581 aagactcaac acatttcgat cgcaaaatag acatggcagg aaagaagaac aacaacaacg 2641 gtcagtatat aatactgcgt actccagagc aacaggtgga gatagaccag cgcaacgccc 2701 gtcgtgctca aatgggtcgc atgaagaagg ctagacagcc cgttcagcga tacttacagc 2761 aacacgggtt gcgaaacgga ttgtccggta gagggggcta catagtggct cccacctccg 2821 ggggggttgt cactcgaccc atagtgccga aattctccaa caggggagat tccactatag 2881 tccgtaacac tgagattttg aacaaccaaa tcttagcggc gctaggcgca ttcaatacaa 2941 caaactccgc actgattgca gcagcaccat catggctggc tagcatcgct gatctttaca 3001 gtaaatacag atggctctca tgtgagatca tctacattcc aaaatgcccc accaccacca 3061 gtggatcaat tgccatggct ttcacatacg acagaaatga cgctgcaccc accgcaaggg 3121 ctcagctgtc acaatcttac aaggccatca attttccacc gtatgcggga tacgacggag 3181 cagcatattt gaattcgaac cagggagctg ggtcagccat cgccgttcaa cttgatgtta 3241 ccaagttgga caagccatgg taccccacta tctcctctgc cggcttcggg gcgctcagcg 3301 tcctcgatca gaaccaattc tgccccgcgt cccttgtggt cgctagcgat gggggacccg 3361 ctactgctac tccagcaggg gaccttttca tcaagtacgt gattgagttc attgaaccaa 3421 tcaacccaac aatgaacgtc tagttctttg tactgtaact tggctaatgc ctaaggtgga 3481 gtcacaccat tggagacgga gacggatcct gggaaacagg cttgacgggc ggggggtggt 3541 gcccccgacg acgcatcact ccggatacca atggtacacc actatggcag ggtctgccaa 3601 ggtcttgtgc accaagaacc cctggaaacg ggggggaggg gggtagcaca tatcatccag // LOCUS BMORGA1 120 bp ds-DNA INV 10-AUG-1990 DEFINITION B.mori 5.8S ribosomal RNA upstream region. ACCESSION M36169 KEYWORDS 5.8S ribosomal RNA; ribosomal RNA. SEGMENT 1 of 2 SOURCE B.mori DNA. ORGANISM Bombyx mori Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Bombycoidea; Bombycidae. REFERENCE 1 (bases 1 to 120) AUTHORS Fujiwara,H., Maekawa,H., Ninaki,O., Takada,N., Miyajima,N. and Ishikawa,H. TITLE Comparative studies on the rDNA of the silkworm, Bombyx mori and its presumed ancestor JOURNAL Comp. Biochem. Physiol. 88, 761-766 (1987) STANDARD simple staff_review BASE COUNT 34 a 15 c 20 g 51 t ORIGIN 1 cgaaacaaaa actcacagac gttttagttt tcttaacgtg tgtgtgcgtg cgttgtacca 61 ccgtgtttta tatgtttatt attatatatt ttgagattcg aatattataa tatttatgta // LOCUS BMORGA2 354 bp ds-DNA INV 10-AUG-1990 DEFINITION B.mori 5.8S ribosomal RNA. ACCESSION M36302 KEYWORDS 5.8S ribosomal RNA; ribosomal RNA. SEGMENT 2 of 2 SOURCE B.mori DNA. ORGANISM Bombyx mori Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Bombycoidea; Bombycidae. REFERENCE 1 (bases 1 to 354) AUTHORS Fujiwara,H., Maekawa,H., Ninaki,O., Takada,N., Miyajima,N. and Ishikawa,H. TITLE Comparative studies on the rDNA of the silkworm, Bombyx mori and its presumed ancestor JOURNAL Comp. Biochem. Physiol. 88, 761-766 (1987) STANDARD simple staff_review FEATURES from to/span description rRNA 139 305 5.8S rRNA BASE COUNT 103 a 74 c 74 g 103 t ORIGIN About 50 bp after segment 1. 1 gctcgtgtct ctctcgtcgt cgttgacggt ttcgtaaatt aacacaaaat acattttatt 61 ttatttattg tttttttttc gacaagaaaa aaaaaaaaga aattttttat ttttatttaa 121 acattacaga aaaaacgaaa atcattaccc tggacggtgg atcacttggc tcgcgggtcg 181 atgaagaacg cagttaactg cgcgtcatag tgtgaactgc aggacacatt tgaacatcga 241 catttcgaac gcacattgcg gtccgtggag acacatccag gaccactcct gtctgagggc 301 cggctgcata aaaaacatta atcacattgc gtggctcgtg gcgcgctcgt gatc // LOCUS BMORGB1 120 bp ds-DNA INV 10-AUG-1990 DEFINITION B.mandarina 5.8S ribosomal RNA upstream region. ACCESSION M36170 KEYWORDS 5.8S ribosomal RNA; ribosomal RNA. SEGMENT 1 of 2 SOURCE B.mandarina DNA. ORGANISM Bombyx mandarina Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Bombycoidea; Bombycidae. REFERENCE 1 (bases 1 to 120) AUTHORS Fujiwara,H., Maekawa,H., Ninaki,O., Takada,N., Miyajima,N. and Ishikawa,H. TITLE Comparative studies on the rDNA of the silkworm, Bombyx mori and its presumed ancestor JOURNAL Comp. Biochem. Physiol. 88, 761-766 (1987) STANDARD simple staff_review BASE COUNT 34 a 15 c 20 g 51 t ORIGIN 1 cgaaacaaaa actcacagac gttttagttt tcttaacgtg tgtgtgcgtg cgttgtacca 61 ccgtgtttta tatgtttatt attatatatt ttgagattcg aatattataa tatttatgta // LOCUS BMORGB2 138 bp ds-DNA INV 10-AUG-1990 DEFINITION B.mandarina 5.8S ribosomal RNA. ACCESSION M36303 KEYWORDS 5.8S ribosomal RNA; ribosomal RNA. SEGMENT 2 of 2 SOURCE B.mandarina DNA. ORGANISM Bombyx mandarina Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Bombycoidea; Bombycidae. REFERENCE 1 (bases 1 to 138) AUTHORS Fujiwara,H., Maekawa,H., Ninaki,O., Takada,N., Miyajima,N. and Ishikawa,H. TITLE Comparative studies on the rDNA of the silkworm, Bombyx mori and its presumed ancestor JOURNAL Comp. Biochem. Physiol. 88, 761-766 (1987) STANDARD simple staff_review FEATURES from to/span description rRNA < 1 89 5.8S rRNA BASE COUNT 35 a 39 c 34 g 30 t ORIGIN About 50 bp after segment 1. 1 ctgcaggaca catttgaaca tcgacatttc gaacgcacat tgcggtccgt ggagacacat 61 ccaagaccac tcctgtctga gggccggctg cataaaaaac attaatcaca ttgcgtggct 121 cgtggcgcgc tcgtgatc // LOCUS BOVINTERA 1003 bp ss-mRNA MAM 10-AUG-1990 DEFINITION Bovine interleukin-1 alpha mRNA, complete cds. ACCESSION M36182 KEYWORDS interleukin-1 alpha. SOURCE Bovine macrophage, cDNA to mRNA. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 1003) AUTHORS Leong,S.R., Flaggs,G.M., Lawman,M. and Gray,P.W. TITLE The nucleotide sequence for the cDNA of bovine interleukin-1 alpha JOURNAL Nucleic Acids Res. 16, 9053-9053 (1988) STANDARD simple staff_review FEATURES from to/span description pept 33 839 interleukin-1 alpha precursor sigp 33 368 interleukin-1 alpha signal peptide matp 369 836 interleukin-1 alpha mRNA < 1 1003 interleukin-1 alpha mRNA BASE COUNT 313 a 218 c 215 g 257 t ORIGIN 1 tgttgctagc tcggttcagc aaagaagtga agatggccaa agtccctgac ctctttgaag 61 acctgaagaa ctgttacagt gaaaatgaag actacagttc tgaaattgac cacctctctc 121 tcaatcagaa gtccttctat gatgcaagct atgagccact tcgtgaggac cagatgaata 181 agtttatgtc cctggatacc tcggaaacct ctaagacatc caagcttagc ttcaaggaga 241 atgtggtgat ggtggcagcc agtgggaaga ttctgaagaa gagacggttg agtttaaatc 301 agttcatcac cgatgatgac ctggaagcca ttgccaataa tacagaagaa gaaatcatca 361 agcccagatc agcacattac agcttccaga gtaacgtgaa atacaacttt atgagagtca 421 tccaccagga atgcatcctg aacgacgccc tcaatcaaag tataattcga gatatgtcag 481 gtccatacct gacggctact acattaaata atctggagga ggcagtgaaa tttgacatgg 541 ttgcttatgt atcagaagag gattctcagc ttcctgtgac tctaagaatc tcaaaaactc 601 aactgtttgt gagtgctcaa aatgaagacg aacccgtctt gctaaaggag atgcctgaga 661 cacccaaaat catcaaagat gagaccaacc tcctcttctt ctgggaaaag catggctcta 721 tggactactt caaatcagtt gcccatccaa agttgtttat tgccacaaag caagaaaaat 781 tggtgcacat ggcaagtggg ccgccctcga tcactgactt tcagatattg gaaaaatagc 841 cttgactgtg cactctactt acttgtaaag tggtgaccat ccgtatgtac tatgtacatg 901 aaggagtcga gcccttcact gttagtcact cgctgagcat gtgctgagct tttgtaattc 961 taaatgaatg tttactctct ttgtaagaga gaacacaaag tcc // LOCUS CHKCALMA 450 bp ss-mRNA VRT 10-AUG-1990 DEFINITION Chicken calmodulin mRNA, complete cds. ACCESSION M36167 KEYWORDS calmodulin. SOURCE Chicken, cDNA to mRNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 450) AUTHORS Iida,Y. TITLE cDNA sequences and molecular evolution of calmodulin genes of chicken and eel JOURNAL Bull. Chem. Soc. Jpn. 57, 2667-2668 (1984) STANDARD simple staff_review FEATURES from to/span description pept 1 450 calmodulin BASE COUNT 160 a 62 c 122 g 106 t ORIGIN 1 atggctgatc aactgacaga agagcagatt gcagaattca aagaagcttt ttcactattt 61 gacaaggatg gtgatggtac tataactaca aaggagttgg ggactgtgat gagatcactt 121 ggtcagaacc ccacagaagc agaattacag gacatgatca atgaagtaga cgctgatggc 181 aatggcacaa ttgacttccc agagtttctg acaatgatgg caagaaaaat gaaagataca 241 gatagcgaag aagaaattag agaagcgttc cgtgtgtttg acaaggatgg taatggttac 301 attagtgctg cagaacttcg tcatgtgatg acaaatcttg gggagaagct aacagatgaa 361 gaagttgatg aaatgattag ggaagcagac attgatggtg atggtcaagt aaactatgaa 421 gagtttgtac agatgatgac agcgaagtga // LOCUS FSBCALMA 450 bp ss-mRNA VRT 10-AUG-1990 DEFINITION E.electricus calmodulin mRNA, complete cds. ACCESSION M36168 KEYWORDS calmodulin. SOURCE E.electricus, cDNA to mRNA. ORGANISM Electrophorus electricus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Osteichthyes; Actinopterygii; Cypriniformes; Characoidei; Electrophoridae. REFERENCE 1 (bases 1 to 450) AUTHORS Iida,Y. TITLE cDNA sequences and molecular evolution of calmodulin genes of chicken and eel JOURNAL Bull. Chem. Soc. Jpn. 57, 2667-2668 (1984) STANDARD simple staff_review FEATURES from to/span description pept 1 450 calmodulin BASE COUNT 133 a 96 c 137 g 84 t ORIGIN 1 atggcagatc agctgactga ggaacagatt gctgagttca aggaggcgtt ttccctcttt 61 gacaaagatg gtgacggcac catcaccacc aaagagctgg gtactgtgat gcgctctctg 121 ggccaaaacc ccaccgaggc agagctgcag gacatgatca atgaagtgga tgctgacggc 181 aatggaacaa tagacttccc ggagttcctg accatgatgg ccaagaaaat gaaggacaca 241 gacagtgaag aagagatccg agaagccttc cgagtttttg acaaggacgg taatggctac 301 atcagtgcag ccgagttgcg acatgtcatg actaacttgg gcgagaagct gacggacgag 361 gaggtggatg agatgatccg agaggccgac atcgatggcg acggccaggt gaactatgaa 421 gagttcgtgc aaatgatgac tgcaaagtga // LOCUS HALTRL1L 88 bp ss-tRNA RNA 10-AUG-1990 DEFINITION H.volcanii Leu-tRNA-1. ACCESSION M35741 KEYWORDS transfer RNA; transfer RNA-Leu. SOURCE H.volcanii tRNA. ORGANISM Halobacterium volcanii Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Halobacteriales; Halobacteriaceae. REFERENCE 1 (bases 1 to 88) AUTHORS Gupta,R. TITLE Transfer RNAs of Halobacterium volcanii: Sequences of five leucine and three serine tRNAs JOURNAL Syst. Appl. Microbiol. 7, 102-105 (1986) STANDARD simple staff_review FEATURES from to/span description tRNA 1 88 Leu-tRNA anticdn 37 39 Leu-tRNA anticodon gag modified 29 29 m22g modified 40 40 m1g modified 42 42 m5c modified 59 59 m5c modified 65 65 m1g modified 66 66 p modified 67 67 p modified 68 68 cm modified 69 69 m1i BASE COUNT 16 a 29 c 29 g 13 t 1 others ORIGIN 5' end of mature tRNA. 1 gcgtgggtag ccaanccagg ccaacggcgc agcgttgagg gcgctgtcct gtagaggtcc 61 gccggttcaa atccggtccc acgcacca // LOCUS HALTRL2L 88 bp ss-tRNA RNA 10-AUG-1990 DEFINITION H.volcanii Leu-tRNA-2. ACCESSION M35742 KEYWORDS transfer RNA; transfer RNA-Leu. SOURCE H.volcanii tRNA. ORGANISM Halobacterium volcanii Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Halobacteriales; Halobacteriaceae. REFERENCE 1 (bases 1 to 88) AUTHORS Gupta,R. TITLE Transfer RNAs of Halobacterium volcanii: Sequences of five leucine and three serine tRNAs JOURNAL Syst. Appl. Microbiol. 7, 102-105 (1986) STANDARD simple staff_review FEATURES from to/span description tRNA 1 88 Leu-tRNA anticdn 37 39 Leu-tRNA anticodon cag modified 29 29 m22g modified 40 40 m1g modified 60 60 m5c modified 65 65 m1g modified 66 66 p modified 67 67 p modified 68 68 cm modified 69 69 m1i BASE COUNT 18 a 28 c 25 g 16 t 1 others ORIGIN 5' end of mature tRNA. 1 gcagggatag ccaantctgg ccaacggcgc agcgttcagg gcgctgtctc ataggagtcc 61 gcaggttcaa atcctgctcc ctgcacca // LOCUS HALTRL3L 87 bp ss-tRNA RNA 10-AUG-1990 DEFINITION H.volcanii Leu-tRNA-3. ACCESSION M35743 KEYWORDS transfer RNA; transfer RNA-Leu. SOURCE H.volcanii tRNA. ORGANISM Halobacterium volcanii Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Halobacteriales; Halobacteriaceae. REFERENCE 1 (bases 1 to 87) AUTHORS Gupta,R. TITLE Transfer RNAs of Halobacterium volcanii: Sequences of five leucine and three serine tRNAs JOURNAL Syst. Appl. Microbiol. 7, 102-105 (1986) STANDARD simple staff_review FEATURES from to/span description tRNA 1 87 Leu-tRNA anticdn 36 38 Leu-tRNA anticodon caa modified 28 28 m1i modified 39 39 m1g modified 41 41 p modified 59 59 m5c modified 64 64 m1g modified 65 65 p modified 66 66 p modified 67 67 cm modified 68 68 m1i BASE COUNT 20 a 27 c 25 g 14 t 1 others ORIGIN 5' end of mature tRNA. 1 gcgagggtag ctaantcagg aaaaagcggc ggactcaaga tccgctcccg taggggtccg 61 tgggttcaaa tccctcccct cgcacca // LOCUS HALTRL4L 86 bp ss-tRNA RNA 10-AUG-1990 DEFINITION H.volcanii Leu-tRNA-4. ACCESSION M35744 KEYWORDS transfer RNA; transfer RNA-Leu. SOURCE H.volcanii tRNA. ORGANISM Halobacterium volcanii Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Halobacteriales; Halobacteriaceae. REFERENCE 1 (bases 1 to 86) AUTHORS Gupta,R. TITLE Transfer RNAs of Halobacterium volcanii: Sequences of five leucine and three serine tRNAs JOURNAL Syst. Appl. Microbiol. 7, 102-105 (1986) STANDARD simple staff_review FEATURES from to/span description tRNA 1 86 Leu-tRNA anticdn 35 37 Leu-tRNA anticodon tag modified 27 27 m22g modified 38 38 m1g modified 48 48 m5g modified 53 53 m1g modified 54 54 p modified 55 55 p modified 56 56 cm BASE COUNT 16 a 27 c 28 g 14 t 1 others ORIGIN 5' end of mature tRNA. 1 gcgcgggtag ccaantggcc aaaggcgcag cgcttaggac gctgtggtgt agaccttcgc 61 aggttcgaac cctgtcccgc gcacca // LOCUS HALTRL5L 88 bp ss-tRNA RNA 10-AUG-1990 DEFINITION H.volcanii Leu-tRNA-5. ACCESSION M35745 KEYWORDS transfer RNA; transfer RNA-Leu. SOURCE H.volcanii tRNA. ORGANISM Halobacterium volcanii Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Halobacteriales; Halobacteriaceae. REFERENCE 1 (bases 1 to 88) AUTHORS Gupta,R. TITLE Transfer RNAs of Halobacterium volcanii: Sequences of five leucine and three serine tRNAs JOURNAL Syst. Appl. Microbiol. 7, 102-105 (1986) STANDARD simple staff_review FEATURES from to/span description tRNA 1 88 Leu-tRNA anticdn 37 39 Leu-tRNA anticodon taa modified 29 29 m2g modified 40 40 m1g modified 42 42 p modified 65 65 m1g modified 66 66 p modified 67 67 p modified 68 68 cm BASE COUNT 16 a 28 c 29 g 14 t 1 others ORIGIN 5' end of mature tRNA. 1 gcgggggtgg ctganccagg ccaaaagcgg cggacttaag atccgctccc gtaggggttc 61 gcgagttcga atctcgtccc ccgcacca // LOCUS HALTRS1S 88 bp ss-tRNA RNA 10-AUG-1990 DEFINITION H.volcanii Ser-tRNA-1. ACCESSION M35746 KEYWORDS transfer RNA; transfer RNA-Ser. SOURCE H.volcanii tRNA. ORGANISM Halobacterium volcanii Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Halobacteriales; Halobacteriaceae. REFERENCE 1 (bases 1 to 88) AUTHORS Gupta,R. TITLE Transfer RNAs of Halobacterium volcanii: Sequences of five leucine and three serine tRNAs JOURNAL Syst. Appl. Microbiol. 7, 102-105 (1986) STANDARD simple staff_review FEATURES from to/span description tRNA 1 88 Ser-tRNA anticdn 37 39 Ser-tRNA anticodon gct modified 29 29 m22g modified 40 40 t6a modified 60 60 m5c modified 61 61 m5c modified 65 65 m1g modified 66 66 p modified 67 67 p modified 68 68 cm BASE COUNT 15 a 31 c 27 g 14 t 1 others ORIGIN 5' end of mature tRNA. 1 gttgcggtag ccaancctgg cccaaggcgc tgggttgcta actcagtggc gtcaagcccc 61 cggggttcga atccccgccg caacgcca // LOCUS HALTRS2S 88 bp ss-tRNA RNA 10-AUG-1990 DEFINITION H.volcanii Ser-tRNA-2. ACCESSION M35747 KEYWORDS transfer RNA; transfer RNA-Ser. SOURCE H.volcanii tRNA. ORGANISM Halobacterium volcanii Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Halobacteriales; Halobacteriaceae. REFERENCE 1 (bases 1 to 88) AUTHORS Gupta,R. TITLE Transfer RNAs of Halobacterium volcanii: Sequences of five leucine and three serine tRNAs JOURNAL Syst. Appl. Microbiol. 7, 102-105 (1986) STANDARD simple staff_review FEATURES from to/span description tRNA 1 88 Ser-tRNA anticdn 36 38 Ser-tRNA anticodon cga modified 13 13 p modified 28 28 m22g modified 36 36 ac4c modified 41 41 p modified 60 60 m5c modified 65 65 m1g modified 66 66 p modified 67 67 p modified 68 68 cm modified 69 69 m1i BASE COUNT 20 a 28 c 22 g 17 t 1 others ORIGIN 5' end of mature tRNA. 1 gccgaggtag cctancccgg ccaaggcggt agattcgaaa tctactgtcc attcggacac 61 gtgagttcaa atctcaccct cggcgcca // LOCUS HALTRS3S 84 bp ss-tRNA RNA 10-AUG-1990 DEFINITION H.volcanii Ser-tRNA-3. ACCESSION M35748 KEYWORDS transfer RNA; transfer RNA-Ser. SOURCE H.volcanii tRNA. ORGANISM Halobacterium volcanii Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Halobacteriales; Halobacteriaceae. REFERENCE 1 (bases 1 to 84) AUTHORS Gupta,R. TITLE Transfer RNAs of Halobacterium volcanii: Sequences of five leucine and three serine tRNAs JOURNAL Syst. Appl. Microbiol. 7, 102-105 (1986) STANDARD simple staff_review FEATURES from to/span description tRNA 1 84 Ser-tRNA anticdn 33 35 Ser-tRNA anticodon gga modified 25 25 m22g modified 56 56 m5c modified 61 61 m1g modified 62 62 p modified 63 63 p modified 64 64 cm modified 65 65 m1i BASE COUNT 14 a 26 c 28 g 15 t 1 others ORIGIN 5' end of mature tRNA. 1 gccaggatgg ccgancggta aggcgcacgc ctggaaagcg tgttccctct gggatcgggg 61 gttcaaatcc ctctcctggc gcca // LOCUS HUMIGHVY 920 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human Ig germline H-chain gene D5-region. ACCESSION M35647 KEYWORDS diversity exon; germline; immunoglobulin heavy chain. SOURCE Human placental DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 920) AUTHORS Zong,S.Q., Nakai,S., Matsuda,F., Lee,K.H. and Honjo,T. TITLE Human immunoglobulin D segments: Isolation of a new D segment and polymorphic deletion of the D-1 segment JOURNAL Immunol. Lett. 17, 329-334 (1988) STANDARD simple staff_review FEATURES from to/span description pept / 604 / 633 Ig H-chain D-region (AA at 604) signal 456 462 heptamer recombination signal signal 488 497 nonamer recombination signal signal 575 583 nonamer recombination signal signal 596 602 heptamer recombination signal signal 634 640 heptamer recombination signal signal 654 662 nonamer recombination signal BASE COUNT 256 a 269 c 255 g 140 t ORIGIN 1 ggatccaggc agcaacatca cgggagaatg cagggctccc agacatccca accctcttgc 61 aggcctctcc tgggaagaga ctgcatccac caccaaacag catggagccc gctgcatagt 121 aactgagtca gtgactgacc tggagggcag gggagcagtg agccggagcc cagaccatag 181 ggaccgagac cagccgctga catcccgagc ccctcactgg tggccccaga aaatatggaa 241 acaggacgga cccacattcc cagctggacc agggcggaga ctgctgagcc ccgagcacca 301 gccccaagaa acaccaggca aaatcatcag aggaggctcc tgagaaagag aggaggggag 361 gtctccctca ccagcaaatg cttcccttaa ccaaacacag ggtccacgca actcccccca 421 ggacaaagga ggagccccct gtacagcact gggctcagag tcctctccaa gacaccctga 481 gtttcagaca aaaaccccct ggaaatcaca gtatcagcag gagaactggc cagagacagc 541 aaggggggac tccgtgacac ccgtggggac aggaggattt tgtgggggct tgtgtcactg 601 tgagaatatt gtaatagtac tactttctat gcccacagtg acacagcccc agtcccaaag 661 ccctgctgta aatgcttcca cttctggagc tgagggggag cgtctgggaa gtagggccta 721 gggtgtccac caatgccaaa acacaccaga cttcccccag acatcacccc caccagccag 781 cgagcagagt aaacagaaaa tgagaagcaa gctgggaact gctggcacag gccccaagga 841 aagagctggg gttgtgcaag agaagatgca ggcagagcta gcagggcctt ttactgtttc 901 tgctttctgg cagagaattc // LOCUS HUMIGHVZ 688 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human Ig germline H-chain gene D4-region. ACCESSION M35648 KEYWORDS diversity exon; germline; immunoglobulin heavy chain. SOURCE Human peripheral blood cell DNA, clone D31. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 688) AUTHORS Zong,S.Q., Nakai,S., Matsuda,F., Lee,K.H. and Honjo,T. TITLE Human immunoglobulin D segments: Isolation of a new D segment and polymorphic deletion of the D-1 segment JOURNAL Immunol. Lett. 17, 329-334 (1988) STANDARD simple staff_review FEATURES from to/span description pept / 70 / 100 Ig H-chain D5-region (AA at 71) /hgml_locus_uid="LR0220V" /nomgen="IGH@" /map="14q32.33" signal 42 50 nonamer recombination signal signal 63 69 heptamer recombination signal signal 101 107 heptamer recombination signal signal 121 129 nonamer recombination signal BASE COUNT 185 a 160 c 209 g 134 t ORIGIN 1 tcagcaggag aactagccag agacagcaag aggggacagg aggattttgt gggggctcgt 61 gtcactgtgt ggatattgta gtagtaccag ctgctatgcc cacagtgaca cagccccatt 121 cccaaagccc tgctgtaaac gcttccactt gctggagctg aggggctggg gggagcgtct 181 gggaagtagg cctaggtggc catcaatgcc caaaacgcac cagactcccc cccagacatc 241 accccactgg ccagtgagca gagtaaacag aaaatgagaa gcagctggga agcttccaca 301 ggcccaagga aagagctttg gcgggtgtgt aagaggggat gcgggcagag cgagcagggc 361 cttttgctgt ttctgctttc ctgtgcagag agttcagcag ctggtgttca agatcgatgg 421 ctgggagtga gccgagacgg acagtgtggg aagggcacag ggaaggagaa gcagccgcta 481 tcctacactg tcatctttca agagtttgcc ctgtgcccac aatgctgcat catgggatcg 541 ttaacactga tgtagacaca gctaaagaga gaatcagtga atggatttgc agcacagatc 601 tgaataaatt ctcagaatgt ggagccacac agaagcaagc acacggaaag tggcctgatc 661 gaatttcaag tacagtggca cttcagct // LOCUS HUMIGHWA 690 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human Ig germline H-chain gene D4-region. ACCESSION M35649 KEYWORDS diversity exon; germline; immunoglobulin heavy chain. SOURCE Human peripheral blood cell DNA, deletion clone D30. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 690) AUTHORS Zong,S.Q., Nakai,S., Matsuda,F., Lee,K.H. and Hongo,T. TITLE Human immunoglobulin D segments: Isolation of a new D segment and polymorphic deletion of the D-1 segment JOURNAL Immunol. Lett. 17, 329-334 (1988) STANDARD simple staff_review FEATURES from to/span description pept / 70 / 100 Ig H-chain D5-region (AA at 71) /hgml_locus_uid="LR0220V" /nomgen="IGH@" /map="14q32.33" signal 42 50 nonamer recombination signal signal 62 68 heptamer recombination signal signal 101 107 heptamer recombination signal signal 121 129 nonamer recombination signal BASE COUNT 186 a 160 c 212 g 132 t ORIGIN 1 tcagcaggag aactagccag agacagcaag aggggacagg aggattttgt gggggctcgt 61 gtcactgtgt ggatattgta gtagtaccag ctgctatgcc cacagtgaca cagccccatt 121 cccaaagccc tgctgtaaac gcttccactt gctggagctg aggggctggg gggagcgtct 181 gggaagtagg cctaggtggc catcaatgcc caaaacgcac cagactcccc cccagacatc 241 accccactgg ccagtgagca gagtaaacag aaaatgagaa gcagctggga agcttccaca 301 ggcccaagga aagagctttg gcgggtgtgt aagaggggat gcgggcagag cgagcagggc 361 cttttgctgt ttctgctttc ctgtgcagag agttcagcag ctggtgttcg agatcaatgg 421 ctgggagtga gccgagcagg acagcgtggg aagagcacag ggaaggagga gcagccgcta 481 tcctacactg tcatctttcg aaagtttgcc ttgtgcccac actgctgcat catgggatcg 541 ttaacactga tgtagacaca gctaaagaga gaatcatgga atggatttgc agcacagatc 601 tgaataaatt ctcagaatgt ggagcagcac agaagcaagc acacagaaag tggcctgatg 661 caaggacaag ttcagtggca ctagcatgct // LOCUS HUMIGHWB 681 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human Ig germline H-chain gene D1-region. ACCESSION M35650 KEYWORDS diversity exon; germline; immunoglobulin heavy chain. SOURCE Human peripheral blood cell DNA, clone D31. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 681) AUTHORS Zong,S.Q., Nakai,S., Matsuda,F., Lee,K.H. and Honjo,T. TITLE Human immunoglobulin D segments: Isolation of a new D segment and polymorphic deletion of the D-1 segment JOURNAL Immunol. Lett. 17, 329-334 (1988) STANDARD simple staff_review FEATURES from to/span description pept / 69 / 99 Ig H-chain D5-region (AA at 70) /hgml_locus_uid="LR0220V" /nomgen="IGH@" /map="14q32.33" signal 41 49 nonamer recombination signal signal 61 67 heptamer recombination signal signal 100 106 heptamer recombination signal signal 120 128 nonamer recombination signal BASE COUNT 187 a 156 c 203 g 135 t ORIGIN 1 ccagcaagat cgggctccag tgacaccgag acaggacgga ggattttgtg ggggctcgtg 61 tcactgtgag gatattgtac taatggtgta tgctataccc acagtgacac agccccattc 121 ccaaagccct actgcaaaga ttccacttgc tgggctgagg ggctggggag cgtctgggaa 181 atagggctag ggaccagcaa tgcccaaaac gcagattccg tcccatacat caccccattc 241 accagtcagc agagtaaaca gaaaatgaga agcagctggg aagcttccac aggcccaagg 301 aaagagcttt ggcgggtgtg taagagggga tgcgggcaga gcgagcaggg ccttttgctg 361 tttctgcttt cctgtgcaga gagttcagca gctggtgttc gagatcaatg gctgggagtg 421 agccgagcag gacagcgtgg gaagagcaca gggaaggagg agcagccgct atcctacact 481 gtcatctttc gaaagtttgc cttgtgccca cactgctgca tcatgggatc gttaacactg 541 atgtagacac agctaaagag agaatcatgg aatggatttg cagcacagat ctgaataaat 601 tctcagaatg tggagcagca cagaagcaag cacacagaaa gtggcctgat gcaaggacaa 661 gttcagtggc actagcatgc t // LOCUS HUMTCAGVR 508 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human T-cell receptor germline gamma-chain gene V4-region (V4; subgroup I), exons 1 and 2. ACCESSION M36285 KEYWORDS T-cell antigen receptor; T-cell receptor gamma-chain; germline; variable region. SOURCE Human EBV-transformed lymphoblastoid cell line DNA, clone 601. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 508) AUTHORS Font,M.P., Chen,Z., Bories,J.C., Duparc,N., Loiseau,P., Degos,L., Cann,H., Cohen,D., Dausset,J. and Sigaux,F. TITLE The V-gamma locus of the human T cell receptor gamma gene JOURNAL J. Exp. Med. 168, 1383-1394 (1988) STANDARD simple staff_review FEATURES from to/span description pept 1 43 T-cell receptor gamma-chain (V4), exon 1 /hgml_locus_uid="LE0226F" /nomgen="TCRGV4" /map="7p15" 159 > 508 T-cell receptor gamma-chain (V4), exon 2 IVS 44 158 T-cell receptor gamma-chain (V4) intron A BASE COUNT 128 a 119 c 122 g 139 t ORIGIN 1 atgcagtggg ccctagcggt gcttctagct ttcctgtctc ctggtgagtg cgctgcctac 61 agagaggatc atgggttttg ttttctttat tttcttcttt tgcaaggatt gccatactaa 121 ggaattcctc attatatttt gtgttgttcc cattgcagcc agtcagaaat cttccaactt 181 ggaagggaga acgaagtcag tcatcaggca gactgggtca tctgctgaaa tcacttgtga 241 tcttgctgaa ggaagtaccg gctacatcca ctggtaccta caccaggagg ggaaggcccc 301 acagcgtctt ctgtactatg actcctacac ctccagcgtt gtgttggaat caggaatcag 361 cccagggaag tatgatactt acggaagcac aaggaagaac ttgagaatga tactgcgaaa 421 tcttattgaa aatgactctg gagtctatta ctgtgccacc tgggatgggc acagtgattc 481 agatccgccc tacaccacac tgaaaacc // LOCUS HUMTCAGVS 509 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human T-cell receptor germline gamma-chain gene V5-region (V5; subgroup I), exons 1 and 2. ACCESSION M36286 KEYWORDS T-cell antigen receptor; T-cell receptor gamma-chain; germline; variable region. SOURCE Human EBV-transformed lymphoblastoid cell line DNA, clone 601. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 509) AUTHORS Font,M.P., Chen,Z., Bories,J.C., Duparc,N., Loiseau,P., Degos,L., Cann,H., Cohen,D., Dausset,J. and Sigaux,F. TITLE The V-gamma locus of the human T cell receptor gamma gene JOURNAL J. Exp. Med. 168, 1383-1394 (1988) STANDARD simple staff_review FEATURES from to/span description pept 1 43 T-cell receptor gamma-chain (V5), exon 1 /hgml_locus_uid="LF0226G" /nomgen="TCRGV5" /map="7p15" 160 > 508 T-cell receptor gamma-chain (V5), exon 2 IVS 44 159 T-cell receptor gamma-chain (V5) intron A BASE COUNT 131 a 117 c 122 g 139 t ORIGIN 1 atgcggtggg ccctactggt gcttctagct ttcctgtctc ctggtgagta cgctgcctac 61 agagaggctc acaggttggg ttttgttttg tttacttctt ttgaaagggg tgcgatacaa 121 aggaatacct cattaaattt tgtgttgttc ccattgcagc cagtcagaaa tcttccaact 181 tggaaggggg aacgaagtca gtcacgaggc cgactaggtc atctgctgaa atcacttgtg 241 accttactgt aataaatgcc ttctacatcc actggtacct acaccaggag gggaaggccc 301 cacagcgtct tctgtactat gacgtctcca actcaaagga tgtgttggaa tcaggactca 361 gtccaggaaa gtattatact catacaccca ggaggtggag ctggatattg atactacgaa 421 atctaattga aaatgattct ggggtctatt actgtgccac ctgggacagg cacagtgatt 481 cagacctgtc ctacaccaca ctgaaaatc // LOCUS MUSNM23A 664 bp ss-mRNA ROD 10-AUG-1990 DEFINITION Mouse tumor metastatic process-associated protein (NM23) gene mRNA, 3' end. ACCESSION M35970 KEYWORDS tumor metastatic process-associated protein. SOURCE Mouse cell line K-1735, cDNA to mRNA, clone pNM23. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 664) AUTHORS Steeg,P.S., Bevilacqua,G., Kopper,L., Thorgeirsson,U.P., Talmadge,J.E., Liotta,L.A. and Sobel,M.E. TITLE Evidence for a novel gene associated with low tumor metastatic potential JOURNAL J Natl Cancer Inst 80, 200-204 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 501 tumor metastatic process-associated protein NM23 (AA at 1) mRNA < 1 664 NM23 mRNA BASE COUNT 150 a 154 c 201 g 159 t ORIGIN 1 cagtcgcagc cggcggtaaa gccttgtcat ctgaagggga ccatggccaa cagtgagcgt 61 accttcattg ccatcaagcc tgatggggtc cagcgggggc tggtgggcga gatcatcaag 121 cggttcgagc agaaggggtt ccgccttgtt ggtctgaagt ttctgcaggc ttcagaggac 181 cttctcaagg agcactacac tgacctgaag gaccgcccct tctttactgg cctggtgaaa 241 tacatgcact caggaccagt ggttgctatg gtctgggagg gtctgaatgt ggtgaagaca 301 ggccgcgtga tgcttggaga gaccaacccc gcagactcta agcctgggac catacgagga 361 gacttctgca ttcaagttgg caggaacatc attcatggca gcgattctgt aaagagcgca 421 gagaaggaga tcagcttgtg gtttcagcct gaggagctgg tggagtacaa gagctgtgcg 481 cagaactgga tctatgagtg ataggacggt gccggttttc tacctgctta ctcttgttct 541 cacaggcagg ggaccagcaa ccctagatat ttctggaact tctttgacct ggaaggaacc 601 tttgggagcc gtgactccct gtgcagtgtt acgtgccact gttagattaa agtgtttaat 661 ctgt // LOCUS SYNHGHOA 582 bp ds-DNA SYN 10-AUG-1990 DEFINITION Synthetic human growth hormone gene, complete cds. ACCESSION M36282 KEYWORDS growth hormone. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 582) AUTHORS Ohtsuka,E., Taniyama,Y., Iwai,S., Kitano,K., Miyamoto,S., Ohgi,T., Sakuragawa,Y., Fujiyama,K., Ikari,T., Kobayashi,M., Miyake,T., Shibahara,S., Tokunaga,T., Ikehara,M., Ono,A., Ueda,T., Tanaka,T., Baba,H., Miki,T., Sakurai,A. and Oishi,T. TITLE The synthesis of human growth hormone gene JOURNAL Nucleic Acids Symp Ser 12, 79-82 (1983) STANDARD simple staff_review FEATURES from to/span description pept 1 579 growth hormone BASE COUNT 146 a 153 c 129 g 154 t ORIGIN 1 atgttcccaa ctattccact gagtcgcctg ttcgataacg cgatgctgcg tgcgcatcgt 61 ctgcaccaac tggctttcga cacttaccag gagttcgaag aagcatacat cccgaaagaa 121 cagaaataca gcttccttca gaacccacag acctcgttgt gtttctctga aagtatcccg 181 accccttcta accgcgaaga gacccagcag aaatcgaacc ttgaactgct tcgtatctcg 241 ctgcttctca ttcagtcgtg gctggagcca gtacagttcc tgcgttcggt tttcgcaaac 301 tcactggtat acggtgcgtc tgacagtaac gtttacgacc tgctgaaaga ccttgaagaa 361 gggatccaga ccctgatggg tcgcctggaa gatggttcac cacgcactgg tcagatcttc 421 aaacagactt actccaaatt cgatactaac tctcataacg atgatgctct gctgaaaaac 481 tacggcctgc tgtactgttt ccgtaaagat atggataaag ttgaaacttt cctgcgtatc 541 gttcagtgtc gttctgttga agggtcgtgt ggcttctaat ag // LOCUS AC2TFDC 803 bp ds-DNA BCT 10-AUG-1990 DEFINITION Plasmid pAC27 (from Pseudomonas putida) pyrocatechase (tfdC) gene, complete cds. ACCESSION M36279 KEYWORDS pyrocatechase. SOURCE Plasmid pAC27(from Pseudomonas putida, strain AC867) DNA. ORGANISM Plasmid pAC27 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 803) AUTHORS Ghosal,D. and You,I.-S. TITLE Nucleotide homology and organization of chlorocatechol oxidation genes of plasmids pJP4 and pAC27 JOURNAL Mol. Gen. Genet. 211, 113-120 (1988) STANDARD simple staff_review FEATURES from to/span description pept 21 803 pyrocatechase (tfdC) BASE COUNT 174 a 240 c 250 g 139 t ORIGIN 1 atcttgaaag gagacgagtc atggataaac gagttgccga ggtcgcaggc gcgatcgtcg 61 aggcagtacg caaaattttg ctggacaagc gcgtcacgga agccgaatac cgcgcgggtg 121 tcgactatct caccgaggtc gcacagacgc gggaaaccgc gctgcttctg gacgttttcc 181 tgaacagcac catcatcgaa ggcaaggcgc agcgctcgcg gacctctgcg cctgcgatcc 241 aggggccgta cttcctggaa ggtgctcctg tagttgaagg cgtcctcaag acctacgata 301 ccgacgacca caaaccgctg atcattcgcg gtacggtgcg ctcggacacg ggcgagttgc 361 tcgctggcgc tgtcatcgac gtgtggcact cgacgcctga tggcttgtac agcgggatcc 421 acgacaacat ccccgtggac tactaccgcg gaaaactcgt gacggattcc cagggcaact 481 atcgcgtgcg caccacgatg ccagtgccat accagatccc ctacgagggg ccgactgggc 541 gtctgctggg ccacctgggc agccatacct ggcgtccgcc gcacgtgcac ttcaaggtgc 601 gcaaggacgg tttcgaaccg ttgaccacgc aatactactt cgaagggggc aaatgggtgg 661 acgatgactg ctgtcacggc gtcacccccg acctgattac gcccgagacg atcgaggacg 721 gggtgcgggt catgaccctg gacttcgtaa tcgagcgtga gcaggccgag caacgcaagt 781 cggctacgga gacagtggca tga // LOCUS AFATFDC 788 bp ds-DNA BCT 10-AUG-1990 DEFINITION Plasmid pJP4 (from Alcaligenes eutrophus) pyrocatechase (tfdC) gene, complete cds. ACCESSION M36280 KEYWORDS pyrocatechase. SOURCE Plasmid pJP4 (from Alcaligenes eutrophus, strain JMP134) DNA. ORGANISM Plasmid pJP4 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 788) AUTHORS Ghosal,D. and You,I.-S. TITLE Nucleotide homology and organization of chlorocatechol oxidation genes of plasmids pJP4 and pAC27 JOURNAL Mol. Gen. Genet. 211, 113-120 (1988) STANDARD simple staff_review FEATURES from to/span description pept 21 788 pyrocatechase (tfdC) BASE COUNT 178 a 203 c 240 g 167 t ORIGIN 1 gttttcatga cggaggcaaa gtgaacaaaa gagtcaagga tgttgtcgat gcgatcgtcg 61 cggcggtaca gcgggtgctc gaccagaagg aggttactga ggcggaatac cgaactgcgg 121 ttcattacct catgcaagta gccgaacagc gtgaaaccgc tcttctttgc gatgtttttt 181 tcaatagcac cgtggctgcg acgaaggctc gtattagcga aggttcaacg cctgctatcg 241 agggacccta ctatcgcgac gacgcccctc tggtcgatga ccggctcaag acttacgaca 301 cggacgacca caagcctttg cttatccagg gaacggtcaa agcggtcgac gggagcgttg 361 tcgaggacgt gacgattgat gtctggcatt cgacgcccga tgggaagtac agcggtttcc 421 atgacgacat cccgactgat ttttatcgag ggaagctcag ggtgggcacc gatggcagct 481 tccgcgtgcg cacaacgatg ccggtgccgt atcagatccc ggatcagggt cccacgggcg 541 cattgctcga aaccatgggt ggtcactcgt ggcgtcccgc tcatgtacat ttcaaggtga 601 aggcgccggg atatgaaacg ttgaccacgc agtactactt cgaaggtgga gattggatca 661 cggacgactg ttgcaacggc gtccagtcta gtctgattac tcccgatatt gtggaagagg 721 gtgtccgcct gatgaacatc aacttcgtta tcgagcccgc gcgtgcgcag gcgggggcaa 781 acccgtga // LOCUS ECOHSEII 530 bp ds-DNA BCT 10-AUG-1990 DEFINITION E.coli heat-stable enterotoxin II (STII) gene, complete cds. ACCESSION M35729 KEYWORDS enterotoxin II. SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 530) AUTHORS Picken,R.N., Mazaitis,A.J., Maas,W.K., Rey,M. and Heyneker,H. TITLE Nucleotide sequence of the gene for heat-stable enterotoxin II of Escherichia coli JOURNAL Infect. Immun. 42, 269-275 (1983) STANDARD simple staff_review FEATURES from to/span description pept 166 381 enterotoxin II (STII) BASE COUNT 188 a 86 c 88 g 168 t ORIGIN 1 taaataccta caacgggtga ttgacactac actcattaac tatactgcaa gtagcattaa 61 aaatcttaat aaaggagagc ttcgtcacat ttttttgact tgactcatat aaaagcccac 121 tggtataagt tttattgctt atagcaataa ggttgaggtg attttatgaa aaagaatatc 181 gcatttcttc ttgcatctat gttcgttttt tctattgcta caaatgccta tgcatctaca 241 caatcaaata aaaaagatct gtgtgaacat tatagacaaa tagccaagga aagttgtaaa 301 aaaggttttt taggggttag agatggtact gctggagcat gctttggcgc ccaaataatg 361 gttgcagcaa aaggatgcta atatatttat caatagcatt cagcaccata tacacaaaaa 421 taatttttca taaaaagaac tctataaaat aaatattttt tgtgacaatg tcctaacgca 481 agacggacat tgtccatttc tcactgcagg taaatgatct gtaaatagtc // LOCUS RAT6P26B 1086 bp ss-mRNA ROD 10-AUG-1990 DEFINITION Rat 6-phosphofructo 2-kinase/fructose 2,6-bisphosphatase mRNA, 3' end. ACCESSION M15685 KEYWORDS 6-phosphofructo 2-kinase/fructose 2,6-bisphosphatase. SOURCE Rat liver, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1086) AUTHORS Colosia,A.D., Lively,M., El-Maghrabi,M.R. and Pilkis,S.J. TITLE Isolation of a cDNA clone for rat liver 6-phosphofructo 2- kinase/fructose 2,6-bisphosphatase JOURNAL Biochem. Biophys. Res. Commun. 143, 1092-1098 (1987) STANDARD simple staff_review FEATURES from to/span description pept < 1 915 6-phosphofructo 2-kinase/fructose 2,6-bisphosphatase (AA at 1) BASE COUNT 290 a 279 c 264 g 253 t ORIGIN 1 gcagaaaaca tcaagcaagt gaaacttggt agtcctgatt acatagactg tgaccaagaa 61 aaggttttgg aagactttct aaagagaata gagtgctatg agatcaacta ccaacctttg 121 gatgaggaat tggacagcca cctgtcctac atcaagatct tcgacgtggg cacacgctac 181 atggtaaatc gagtgcagga ccacgttcag agccgtacag cctactacct catgaacatc 241 catgtcacac ctcgatctat ctacctatgc cgccatggtg agagtgaact caaccttaga 301 ggccgcattg gaggtgactc tggcctctca gctcggggca agcagtatgc ctatgcacta 361 gccaacttca tccggtctca aggcatcagc tccctgaaag tatggactag ccacatgaag 421 aggaccattc agaccgctga agccctaggt gtcccctatg aacagtggaa ggccctgaat 481 gagattgatg cgggtgtctg tgaagagatg acctatgaag aaattcagga acactaccct 541 gaggaatttg cactacggga ccaggataaa tatcgttacc gctatccgaa gggagagtcc 601 tatgaggatc tggttcagcg tcttgaacca gttataatgg agctagaacg gcaagaaaat 661 gtactggtga tctgtcacca ggctgtcatg cggtgcctcc tggcatactt cctggataaa 721 agttcagatg agctgcccta tctcaagtgt cctctgcata ctgtgctcaa actcacacct 781 gtggcttatg gctgcagagt ggagtccatc tacctgaatg tggaggctgt gaacacacac 841 cgggacaagc ctgagaatgt ggacatcacc cgtgaagctg aggaagcctt ggacactgta 901 cctgcccatt actgagccct tttcaagtga tcagattgtc tctgttctca cccccttcct 961 cctgtaggag ctgctgccct tgttctcctt aagcagactc tggctatggc ctgagagtgc 1021 cctacctcca gtgaagaagt ccgtagcagc tcccaaacag gtctcaattc ctagccacaa 1081 ctaagg // LOCUS RATFABP 491 bp ss-mRNA ROD 10-AUG-1990 DEFINITION Rat fatty liver acid binding protein (FABP) mRNA, complete cds. ACCESSION M35991 KEYWORDS fatty acid binding protein. SOURCE Rat adult liver, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 491) AUTHORS Gordon,J.I. and Lowe,J.B. TITLE Analyzing the structures, functions and evolution of two abundant gastrointestinal fatty acid binding proteins with recombinant DNA and computational techniques JOURNAL Chem. Phys. Lipids 38, 137-158 (1985) STANDARD simple staff_review FEATURES from to/span description pept 40 423 fatty acid binding protein (FABP) mRNA < 1 491 fatty acid binding protein mRNA BASE COUNT 161 a 95 c 128 g 107 t ORIGIN 1 ctgttggtgg cagctgggaa aggaaacctc attgccacca tgaacttctc cggcaagtac 61 caagtgcaga gccaagagaa ctttgagccc ttcatgaagg cgatgggtct gcctgaggac 121 ctcatccaga aagggaagga catcaagggg gtgtcagaaa tcgtgcatga agggaagaaa 181 gtcaaactca ccatcaccta tgggtccaag gtgatccaca atgagttcac cttgggggag 241 gagtgcgaac tggagaccat gactggggaa aaggtcaagg cagtggttaa gatggagggt 301 gacaataaaa tggtgacaac tttcaaaggc ataaagtccg tgactgaatt caatggagac 361 acaatcacca ataccatgac actgggtgac atcgtctaca agagagtcag caagagaatt 421 tagacaaggc tgtatttcat attcttttac agtataaaat taatacaata aagttacctc 481 tgttttgaaa t // LOCUS RATFABPX 564 bp ss-mRNA ROD 10-AUG-1990 DEFINITION Rat intestinal fatty acid binding protein (FABP) mRNA, complete cds. ACCESSION M35992 KEYWORDS fatty acid binding protein. SOURCE Rat adult small intestinal epithelium, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 564) AUTHORS Gordon,J.I. and Lowe,J.B. TITLE Analyzing the structures, functions and evolution of two abundant gastrointestinal fatty acid binding proteins with recombinant DNA and computational techniques JOURNAL Chem. Phys. Lipids 38, 137-158 (1985) STANDARD simple staff_review FEATURES from to/span description pept 13 411 fatty acid binding protein (FABP) BASE COUNT 179 a 108 c 146 g 131 t ORIGIN 1 acagctgaca tcatggcatt tgatggcact tggaaagtag accggaatga gaactatgaa 61 aagttcatgg agaaaatggg cattaacgtg gtgaagagga agcttggagc tcatgacaac 121 ttgaaactga cgatcacaca ggaaggaaat aaattcacag tcaaagaatc aagcaacttc 181 cgaaacattg atgttgtgtt tgaactcggc gtcgactttg cctatagtct agcagatgga 241 acagaactca ctgggacctg gaccatggag ggaaataaac ttgttggaaa attcaaacgt 301 gtagacaatg gaaaggagct gattgctgtc cgagagattt ctggtaacga actaatccaa 361 acctacacat atgaaggagt ggaggccaag cggatcttta agaaggaata ggccagcttc 421 ttggagcctg gcgcaatgca gaagggctag cttggtgtcg gattcctttc tccatctcgc 481 tgataatgca ggctcactca ctgtgtcatg agcactgctt ccaagccttg tcagagccaa 541 agaagtaaaa gttgattagg actt // LOCUS BSTRR5SA 117 bp ds-DNA RNA 10-AUG-1990 DEFINITION B.stearothermophilus 5S RNA. ACCESSION M25591 KEYWORDS 5S ribosomal RNA; ribosomal RNA. SOURCE B.stearothermophilus RNA. ORGANISM Bacillus stearothermophilus Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 117) AUTHORS Zimmermann,J. and Erdmann,V.A. TITLE Binding sites of E.coli and B.stearothermophilus ribosomal proteins on B.stearothermophilus 5S RNA JOURNAL Nucleic Acids Res. 5, 2267-2288 (1978) STANDARD simple staff_entry COMMENT FEATURES from to/span description rRNA 1 117 5S ribosomal RNA BASE COUNT 25 a 36 c 38 g 18 t ORIGIN 1 cctagtggtg atagcggagg ggaaacaccc gttcccatcc cgaacacgga agttaagccc 61 tccagcgccg atggtagttg gggccagcgc ccctgcaaga gtaggccgct gctaggc // LOCUS CHKVIM1 2842 bp ds-DNA VRT 10-AUG-1990 DEFINITION Chicken vimentin gene, exons 1 and 2. ACCESSION M15850 J02759 M25923 KEYWORDS intermediate filament; vimentin. SEGMENT 1 of 3 SOURCE Chicken DNA, library of Dodgson. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 2842) AUTHORS Zehner,Z.E., Li,Y., Roe,B.A., Paterson,B.M. and Sax,C.M. TITLE The chicken vimentin gene: Nucleotide sequence, regulatory elements, and comparison to the hamster gene JOURNAL J. Biol. Chem. 262, 8112-8120 (1987) STANDARD full staff_review REFERENCE 2 (sites) AUTHORS Zehner,Z.E. and Paterson,B.M. TITLE The chicken vimentin gene: Aspects of organization and transcription during myogenesis JOURNAL Ann. N.Y. Acad. Sci. 455, 79-94 (1985) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by B.A.Roe, 11-MAY-1987. FEATURES from to/span description pept 824 1368 vimentin, exon 1 1458 + 1518 vimentin, exon 2 pre-msg 737 > 2842 VIM mRNA IVS 1369 1457 VIM cds intron A IVS 1519 > 2842 VIM cds intron B BASE COUNT 569 a 827 c 882 g 564 t ORIGIN 1 bp upstream of HindIII site. 1 aagcttgagg atttctgtcc tgcctttgag gctctttggg gatcacttct aatgaccttt 61 ttgtaagagc tctttcccga gacagtgacc gtgtgcctac agaaactaaa agcaaacaga 121 ccctcggtta aaaccgctcc tgccgctccc ccgctgacca ggagcgctgt gcccgaagca 181 aagcgatgcc cctcctgcag ccgtgcgagt gcgcagcgcc gtccgcaatg agcgcaactc 241 cgtcggcacc gagcaggctg ggcgggcccc gggagctgcg ggagcgcgct gagcccatga 301 gcacaggacg ggaggctgag gggggccgtc ggaagcagct aaccccaaac ccaccgacgg 361 ttcacgtttc tatctgtttg ctttggacgt tcggtagccc ttcgtgaata gcatcactcc 421 cccgatctcg gtgcctttca gagagctaag aaagaaggga cggggctcgc acaaagttca 481 acgcatgcat ttttaagcac tcggcccgtt ctaggagcgc tggcgagcac cgcagcccgt 541 cccgccctcg gaatcactgc gctgtcgaag gggcgtaggg cgcaaggcac ctcggcagcc 601 gcccgggtcc ctccgccttt gtcccagcgc cgccaccgcc gccctccctc cgcccctccc 661 cgggcccgac gcggcattgg cgacggacgg ggcggggcgg cgcggcggcg ggcggggcgg 721 cggccccata agaggggccc cggggtgggg acgccgctct tcttcgccgc cgcgctccga 781 gcccgtcgct cccggattac aaagccgctc cgttcctcgc gccatgagct tcaccagcag 841 caagaactcc tcgtaccgcc gcatgttcgg cgggggcagc cggcccagca gcggcacccg 901 ctacatcacg tccagcaccc gctattccct gggcagcgcc ctgcggccca gcagcgcccg 961 ctacgtgtcc gcctcgcccg gcggcgtgta ccgcaccaag gcgacgtcgg tgcggctgcg 1021 gagcagcatg ccgcccatgc ggatgcacga cgccgtggac ttcaccctgg cggacgccat 1081 caacacggag ttcaaggcga accgcaccaa cgagaaggta gagctgcagg agctcaacga 1141 ccgcttcgcc aactacatcg acaaggtgcg cttcctggag cagcagaaca agatcctgct 1201 ggccgagctg gagcagctca agggcaaagg cacgtcccgc ttgggcgacc tgtacgagga 1261 ggagatgcgg gacgtgcggc gccaggtgga ccagctgacc aacgacaagg cccgcgtcga 1321 ggtggagcgc gacaacctgg ccgacgacat catgcgcctg cgggagaagt gagtggggcc 1381 gcgggccggg gccgggaagc gggcgggcac ccacccctcc ctcacctcac ctcacctcac 1441 cctgtcgctg cttgcaggtt gcaggaggag atgctgcagc gggaggaggc cgagagcacc 1501 ctgcagtcct tccgacaggt gggcgcggcg ccggggcggt gggggcggaa ggtcggggcg 1561 gggctgggct gggggctgct ctgcccgcac aaaggcgctc gcgcagcctc ttgttctccc 1621 cgttgttttc tttagggggg tgccgcagtc cgggggggag ggtgaggcta ttgtcccgct 1681 ttatccggtc tggccggctc cacaaaagcg ctccgacgtg gaatcgcttt ctgatccaat 1741 agttgtgaaa acaggccatc tgcgcgccgg gctgaagggt attaatggtt tctatgggat 1801 tcacagagga aatgcagatc caggcattat ggccggtgtg gtggcagcag attgaaataa 1861 tggatccctt tgtgtcggag gggagggtgc cgtgggctgc attcctacca aatgcgtaat 1921 ggtgggggca tattcctggg ggagactgtg tgaaatacag aactccttct ggatgagcca 1981 cgtcggtctg cgctctcaga agactgtccc acatttgaag tatttcctat cagtggtgtc 2041 tgccgatcgg aagcagacaa tgggctcctt gggatgagct cgcagcagct gtctgcgtcc 2101 ccttgacgtg cattaaaaat gtagaattta atcctatata agcaaagatg tgaatgtaag 2161 aaacagtgca ctggagatga gtgtgtagga agctgcggtg tcacacagat gaaccaaagc 2221 cgatgtgaca gccaaaacca actgtgaagc ctgtgaaaag ttgtgagctc aagttctgat 2281 ttgtgcaaaa acagctctcg gcttggattg agggctgttt ggactagcag atgtttagga 2341 atgtgccttc tcaaaacaag ataacttcct gaaaactgct ggtgtgaaac aacactgctg 2401 tcactcctct gagctggaac gagctgccct gtatctccag acagctgggc gcaggctgac 2461 cgtcccatag agctctccat aggcaatagg tgtcaccatg caggaagttg tcatttgaat 2521 tttagcattt tccctagcgg tcttagtgag acagtgtggg agcagacaca tggcagggtc 2581 tgggaatgag gtgtccactg acgttctggg ggctgaggct tgaaatagca gctccaggta 2641 ctgcctttaa cctgtgctgc tgtgttacca ggctgagccg atccaggttg ttgccaagtc 2701 agtgcagatg catttccctt tttttgtttt aaattactag agttcaagca caagctcctt 2761 tgacttaaag ctagtttaat catgcgtgct tctctggtgc gcttcaaggc atgtcttcct 2821 gtccctccaa ccgatgaagc tt // LOCUS CHKVIM2 685 bp ds-DNA VRT 10-AUG-1990 DEFINITION Chicken vimentin gene, exons 3 and 4. ACCESSION M15851 J02759 M25923 KEYWORDS intermediate filament; vimentin. SEGMENT 2 of 3 SOURCE Chicken DNA, library of J.B.Dodgson. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 685) AUTHORS Zehner,Z.E., Li,Y., Roe,B.A., Paterson,B.M. and Sax,C.M. TITLE The chicken vimentin gene: Nucleotide sequence, regulatory elements, and comparison to the hamster gene JOURNAL J. Biol. Chem. 262, 8112-8120 (1987) STANDARD full staff_review REFERENCE 2 (sites) AUTHORS Zehner,Z.E. and Paterson,B.M. TITLE The chicken vimentin gene: Aspects of organization and transcription during myogenesis JOURNAL Ann. N.Y. Acad. Sci. 455, 79-94 (1985) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by by B.A.Roe, 11-MAY-1987. FEATURES from to/span description pept + 7 102 vimentin, exon 3 246 + 407 vimentin, exon 4 pre-msg < 1 > 685 VIM mRNA IVS < 1 6 VIM cds intron B IVS 103 245 VIM cds intron C IVS 408 > 685 VIM cds intron D BASE COUNT 189 a 142 c 158 g 196 t ORIGIN About 3 kb after segment 1. 1 gatcaggatg ttgacaatgc ctctctggca ggccttgatc ttgagcgccc cgttgagtcc 61 ctgcaagaag aaattgtctt cttgaagaag cttcatgatg aggtaagctg aataatgagg 121 tctgtgtctg ggcaccgtca gtcaagtcaa caagtaaatt gtctctggtt ctggccccga 181 gtcctaacct gtgcattttc tgcatagtgt tgtttattaa ctccaagtgt ctacttctct 241 ctaaggaaat ccgggaactg caggctcaac tccaggaaca gcacatccaa atcgatatgg 301 atgtttctaa gcctgatctt actgctgccc tgcgcgatgt tcgtcaacaa tatgaaagcg 361 ttgctgctaa gaatcttcag gaagctgaag agtggtacaa gtccaaagta agtacaatgc 421 gttttaggat tatgcagatg cagtaggcat tcacatgaat tttaggtaac agctttgctg 481 tgttccttca gtactgaatg ttagcacagc cacagagtag ggtagtcttc aaactatcag 541 tcagggaaga atacagttgg aaaagtcatt tgtgccctaa gtgcagcatc agactaatgg 601 gctcatttcc tgttcttccc tattcttgta gctggaggca gttcttaagt atatacctaa 661 gaaaggaatg aacaaatgca ttaaa // LOCUS CHKVIM3 2857 bp ds-DNA VRT 10-AUG-1990 DEFINITION Chicken vimentin gene, exons 5-9. ACCESSION M15852 J02759 M25922 KEYWORDS intermediate filament; vimentin. SEGMENT 3 of 3 SOURCE Chicken DNA, library of J.B.Dodgson. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 2857) AUTHORS Zehner,Z.E., Li,Y., Roe,B.A., Paterson,B.M. and Sax,C.M. TITLE The chicken vimentin gene: Nucleotide sequence, regulatory elements, and comparison to the hamster gene JOURNAL J. Biol. Chem. 262, 8112-8120 (1987) STANDARD full staff_review REFERENCE 2 (sites) AUTHORS Zehner,Z.E. and Paterson,B.M. TITLE The chicken vimentin gene: Aspects of organization and transcription during myogenesis JOURNAL Ann. N.Y. Acad. Sci. 455, 79-94 (1985) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by by B.A.Roe, 11-MAY-1987. Polyadenylation signals are located at positions 2546-2551, 2594-2599, 2829-2834, and 2850-2855. Multiple mRNAs are found for vimentin, each using a different polyadenylation site. FEATURES from to/span description pept + 159 284 vimentin, exon 5 901 1121 vimentin, exon 6 1216 1259 vimentin, exon 7 1685 1770 vimentin, exon 8 2256 2297 vimentin, exon 9 pre-msg < 1 > 2297 VIM mRNA IVS < 1 158 VIM cds intron D IVS 285 900 VIM cds intron E IVS 1122 1215 VIM cds intron F IVS 1260 1684 VIM cds intron G IVS 1771 2255 VIM cds intron H BASE COUNT 816 a 536 c 611 g 892 t 2 others ORIGIN About 0.5 kb after segment 2. 1 tcagatagga aaggtcagtg taacgttcat gtttaaatat ttcaagcgct agaacttccc 61 tgtaatggct tcagttgcct ccaaatcctg ggtctagaag tggtccatta caccattttt 121 gtgaagtgtt atgctaacca atctgggtgt ttccacagtt tgcagatctc tccgaagctg 181 ctaataggaa caatgatgcc ctgcgccagg ccaaacaaga agctaatgaa tatcgcagac 241 agattcagtc tctcacctgt gaagttgatg cccttaaagg aagtgtaagt agacgttctt 301 cctgattcag ctttttaagc actatgctgt agtttagggg atgtgtgtgg ctgcagctca 361 tctcccacag aagacagcag tgcacccatc cttcctgaaa aagactctct gggctccatt 421 tgtctgcatg tactttgtag tgcactgctc agattgctac actggactgg tctttctcaa 481 ggtgtaaaca atacagaatg agctatacga agtggaaaag tgcagatctt gcacttccta 541 gaatacagtt gcctgaaagg ctggcttgaa atggaagtaa ctgtatagta ctcctgtaga 601 ttccctcacc aagtactgtc tgcatgctga cttgtactgt atgtgtttgg cttactgtgg 661 gggtttgctt tgctgtctgg tagaataaat gtgaaacatc tctcatcaga caccagaata 721 acacacctgg actttgcaag gggaggataa ggacatctcc ttatggggac atatttgagc 781 atgttcagaa aggacactgt cattcaactt ccctttgcgt ttttttctta agttatgaag 841 ggtcttgaca ttagaaagag tggctgaaat gtagttaaat ttgactttat tctctcctag 901 aatgaatccc tggagcgcca gatgcgtgaa atggaggaga attttgctgt tgaagctgct 961 aactaccagg acactattgg ccgcctgcag gatgagattc agaacatgaa ggaagaaatg 1021 gctcgccatc ttcgtgagta ccaggacctg ctgaatgtaa agatggctct tgatattgag 1081 attgctacct acagaaaact gctggaggga gaagagagca ggtaaaaatc acatgcaggc 1141 acctaccttc tagcagttac agtttagaga ttccttacca ttgtctgact tgatctattt 1201 cttcttttcc ttcaggatta acatgcctat tccaaccttt gcttctttga acctgagagg 1261 taagctgttt catttgggcc tgtgtaagca gctgttttta atctttagca aaaaggtcat 1321 ggatagttcc ttttgctgta gtatttgaac tggtgtaaat gcagttgtat atgaaagtca 1381 ttcttacaca caactgccag tatttgcagt tacttaattt tggatgagag gtggtacttt 1441 gttcctgggt gtacacgtaa gggagtatca atacataaca agtgacttgg atttctgttg 1501 aagctgcttt aagaggaata agaagaaaat aagcaaagct agtttataat tcattatcat 1561 gcaaaagaga agaggacttc tttgtataca aaactgggat tctgaaagta gatactgaca 1621 ttcctcagtt ctttgaggag gcacttcatc tagactcttg gtaacaatac attttcttct 1681 gcagaaacca acattgagtc tcagccaatt gttgacactc actcgaagag gacacttcta 1741 attaagaccg tggaaactag agatggacag gttggtggtt actttgattt tttttcattc 1801 tatctcagct ggttactttg cccagtgctg tagtctgggc cactcattgg tgcagcctgt 1861 atatggcaga cagtgtctgg cagtccaagc tgcatggctg aatgtgctga tgtagcaaat 1921 gtacttctgt ggaatagacc tcaaaacaga tctgagaaca gcttgagtga ctgacagccc 1981 gtgtttagag gagactttag gttcagttga gttgcaagtc ctcaagattt ctcccattgc 2041 ataagaaaaa tgttgcagga gaagcgttta ttaggggcag gtagtgtatt tgtactaaat 2101 tgaagtttag atggttaagc gtgtgcagct accatttaaa cttctcattc ataggcaaat 2161 gctgatacct ttgttcacat ctgtgctact atttgtttca gttctaactc tttataacaa 2221 ctgtcttaag cttnnatgcc tattttccaa cacaggttat taatgaaact tcccagcatc 2281 acgatgactt ggagtaaagt gaagatgcaa acttaatgca ggagaaattc ttaccagcaa 2341 ggttttaaaa agttcatgtc ttaaaggaag aaacagcttt caagtgcctt tctccagttt 2401 tccatgagcg caagattatt atgctaggaa ataggtctta gatcttgcaa actgactctc 2461 cctgaaggat tagagtttac aatggagtct agtttacaaa tagcaatatc ttgtgctgca 2521 atactgtttt taagtatctg aatttaataa aactgctttt tccagcacag tatgagcaac 2581 ctgtcgctac ttcaataaat ctttggaaaa tggctcttga tgtgttctaa tttaacttca 2641 tgactttctg caaagccata acttaatgct ggaattacta tacggttgac aactccagta 2701 ctgattgtgt gaaatattgt tttcagatta actagacaaa ctgtcttccc atttactgct 2761 taggttttgg aaccaattaa aatggactat aactggcaga tgcataatgt attgatactt 2821 atcagttgaa taaaatgata cttcaagcta ataaaaa // LOCUS CHKVIMN 591 bp ds-DNA VRT 10-AUG-1990 DEFINITION chicken intermediate filament protein vimentin gene, 3' end. ACCESSION X00185 J00915 KEYWORDS vimentin. SOURCE Chicken DNA and cDNA to embryonic muscle mRNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 591) AUTHORS Zehner,Z.E. and Paterson,B.M. TITLE Characterization of the chicken vimentin gene: Single copy gene producing multiple mRNAs JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 911-915 (1983) STANDARD simple staff_review REFERENCE 2 (bases 30 to 591) AUTHORS Zehner,Z.E. and Paterson,B.M. TITLE Vimentin gene expression during myogenesis: Two functional transcripts from a single copy gene JOURNAL Nucleic Acids Res. 11, 8317-8332 (1983) STANDARD simple staff_review COMMENT FEATURES from to/span description mRNA < 1 308 vimentin mRNA (alt.) mRNA < 1 367 vimentin mRNA (alt.) mRNA < 1 591 vimentin mRNA (alt.) BASE COUNT 193 a 101 c 105 g 192 t ORIGIN 1 cagcatcacg atgacttgga gtaaagctga agtgaagatg caaacttaat gcaggagaaa 61 ttcttaccag caacctttta aaaagttcat gtcttaaagg aagaaacagc tttcaagtgc 121 ctttctccag tttttccatg agcgcaagat tattatgcta ggaaataggt cttagatctt 181 gcaaactgac tctccctgaa ggattagagt ttacaatgga gtctagttta caaatagcaa 241 tatcttgtgc tgcaatactg tttttaagta tctgaattta ataaaactgc tttttccagc 301 acagtatgag caacctgtcg ctacttcaat aaatctttgg aaaatggctc ttgatgtgtt 361 ctaatttaac ttcatgactt tctggaaagc cataacttaa tgctggaatt actatacggt 421 tgacaactcc agtactgatt gtgtggaata ttgttttcag attaactaga caaactgtct 481 tcccatttac tgcttaggtt ttggaaccaa ttaaaatgga ctataactgg cagatgcata 541 atgtattgat acttatcagt tgaataaaat gatacttcaa gctaataaaa a // LOCUS ECORRD 1541 bp ss-rRNA RNA 10-AUG-1990 DEFINITION E.coli 16S ribosomal RNA. ACCESSION J01859 M24828 M24911 K02555 M24996 KEYWORDS 16S ribosomal RNA; ribosomal RNA. SOURCE Escherichia coli rRNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1541) AUTHORS Ehresmann,C., Stiegler,P., Carbon,P. and Ebel,J.-P. TITLE Recent progress in the determination of the primary sequence of the 16 S RNA of Escherichia coli JOURNAL FEBS Lett. 84, 337-341 (1977) STANDARD full staff_review REFERENCE 2 (bases 1 to 1541) AUTHORS Carbon,P., Ehresmann,C., Ehresmann,B. and Ebel,J.-P. TITLE The sequence of Escherichia coli ribosomal 16 S RNA determined by new rapid gel methods JOURNAL FEBS Lett. 94, 152-156 (1978) STANDARD full staff_review REFERENCE 3 (bases 1 to 1541) AUTHORS Carbon,P., Ehresmann,C., Ehresmann,B. and Ebel,J.-P. TITLE The complete nucleotide sequence of the ribosomal 16-S RNA from Escherichia coli: Experimental details and cistron heterogeneities JOURNAL Eur. J. Biochem. 100, 399-410 (1979) STANDARD full staff_review REFERENCE 4 (bases 1346 to 1541) AUTHORS Taylor,B.H., Prince,J.B., Ofengand,J. and Zimmermann,R.A. TITLE Nonanucleotide sequence from 16S ribonucleic acid at the peptidyl transfer ribonucleic acid binding site of the Escherichia coli ribosome JOURNAL Biochemistry 20, 7581-7588 (1981) STANDARD full staff_review REFERENCE 5 (bases 1442 to 1541) AUTHORS Schnare,M.N. and Gray,M.W. TITLE 3'-terminal sequence of wheat mitochondrial 18S ribosomal RNA: Further evidence of a eubacterial evolutionary origin JOURNAL Nucleic Acids Res. 10, 3921-3932 (1982) STANDARD full staff_review REFERENCE 6 (bases 1 to 1541) AUTHORS Ehresmann,C., Stiegler,P., Fellner,P. and Ebel,J.-P. TITLE The determination of the primary structure of the 16s ribosomal rna of escherichia coli: III. Further studies JOURNAL Biochimie 57, 711-748 (1975) STANDARD simple staff_entry REFERENCE 7 (bases 1 to 1541) AUTHORS Ehresmann,C., Stiegler,P., Fellner,P. and Ebel,J.-P. TITLE the determination of the primary structure of the 16S ribosomal RNA of Escherichia coli: (2) nucleotide sequences of products from partial enzymatic hydrolysis JOURNAL Biochimie 54, 901-967 (1972) STANDARD simple staff_entry REFERENCE 8 (sites) AUTHORS Zimmermann,R.A. and Singh-Bergmann,K. TITLE Binding sites for ribosomal proteins S8 and S15 in the 16S RNA of Escherichia coli JOURNAL Biochim. Biophys. Acta 563, 422-431 (1979) STANDARD simple staff_entry REFERENCE 9 (bases 558 to 890) AUTHORS Gregory,R.J., Zeller,M.L., Thurlow,D.L., Gourse,R.L., Stark,M.J.R., Dahlberg,A.E. and Zimmermann,R.A. TITLE Interaction of ribosomal proteins S6, S8, S15 and S18 with the central domain of 16S ribosomal RNA from Escherichia coli JOURNAL J. Mol. Biol. 178, 287-302 (1984) STANDARD simple staff_review COMMENT [2], [3] contain updated sequence data for the original work by the same laboratory [1]. There were too many discrepancies between [1] and [2], [3] to list each revision in our sites table. The sequence shown is from [3]. [1], [2], [3] point to a number of cistron heterogeneities. There is uncertainty, however, with regard to assigning these various heterogeneities to specific cistrons. The RNA method used by [1], [2], [3] gives the average of all the cistrons present in the cell [3]. The heterogeneities are classified by their relative proportions into major, minor and undetermined species. The sequence shown corresponds to the major species. The heterogeneities were annotated as variations in the sites table. It is not known which of the residues "c" (base 633) or "a" (base 641) undergoes a deletion, giving rise to the minor component "atctg". [3] suggests the existence of one or two mutated cistrons among the known seven cistrons of ribosomal RNA. With the exception of a single base deletion, this sequence is identical to the current 16S rDNA sequence for the E.coli rRNB gene. FEATURES from to/span description rRNA 1 1541 16S rRNA variant 76 76 g in major species; c in undetermined species variant 80 80 a in major species; c in undetermined species variant 88 89 tc in major species; tgc in undetermined species variant 90 92 ttt in major species; substitution of one of the t by c in undetermined species variant 179 179 t in major species; c in minor species variant 182 182 c in major species 1; t in major species 2 variant 193 194 ca in major species; cga in undetermined species variant 267 272 tcacct in major species; ccacct or tcaccc in minor species variant 283 285 ccc in major species; cc in minor species conflict 348 349 ag in [3]; ga in [2] modified 526 526 m7g = 7-methylguanosine variant 630 631 ct in major species; cct in minor species variant 632 634 gca in major species; ga in minor species (see comment) variant 640 642 tac in major species; tc in minor species (see comment) conflict 853 855 ttc in [3]; ctt in [2] revision 915 916 tg in [3]; tcg in [1],[2] (typing error) variant 965 965 g in undetermined species 1; 2-methylguanosine in species 2 conflict 1027 1029 ctt in [3]; ttc in [2] variant 1071 1072 gt in major species; gtt in undetermined species variant 1074 1074 t in major species; a in undetermined species modified 1099 1099 m5c = 5-methylcytidine modified 1206 1206 m2g = 2-methylguanosine variant 1280 1280 c in major species; t in minor species variant 1320 1322 tcg in major species; tg in minor species modified 1401 1401 m4c = n4-methylcytidine modified 1402 1402 cm = 2'-o-methylcytidine variant 1405 1405 t in major species; c in minor species variant 1406 1406 5-methylcytidine in major species; c in minor species conflict 1490 1493 gaag in [3]; gag in [4] modified 1497 1497 um = 2'-o-methyluridine modified 1517 1517 m66a = n6,n6-dimethyladenosine modified 1518 1518 m66a = n6,n6-dimethyladenosine BASE COUNT 389 a 352 c 487 g 313 t ORIGIN 5' end of mature rRNA. 1 aaattgaaga gtttgatcat ggctcagatt gaacgctggc ggcaggccta acacatgcaa 61 gtcgaacggt aacaggaaga agcttgctct ttgctgacga gtggcggacg ggtgagtaat 121 gtctgggaaa ctgcctgatg gagggggata actactggaa acggtagcta ataccgcata 181 acgtcgcaag accaaagagg gggaccttcg ggcctcttgc catcggatgt gcccagatgg 241 gattagctag taggtggggt aacggctcac ctaggcgacg atccctagct ggtctgagag 301 gatgaccagc cacactggaa ctgagacacg gtccagactc ctacgggagg cagcagtggg 361 gaatattgca caatgggcgc aagcctgatg cagccatgcc gcgtgtatga agaaggcctt 421 cgggttgtaa agtactttca gcggggagga agggagtaaa gttaatacct ttgctcattg 481 acgttacccg cagaagaagc accggctaac tccgtgccag cagccgcggt aatacggagg 541 gtgcaagcgt taatcggaat tactgggcgt aaagcgcacg caggcggttt gttaagtcag 601 atgtgaaatc cccgggctca acctgggaac tgcatctgat actggcaagc ttgagtctcg 661 tagagggggg tagaattcca ggtgtagcgg tgaaatgcgt agagatctgg aggaataccg 721 gtggcgaagg cggccccctg gacgaagact gacgctcagg tgcgaaagcg tggggagcaa 781 acaggattag ataccctggt agtccacgcc gtaaacgatg tcgacttgga ggttgtgccc 841 ttgaggcgtg gcttccggag ctaacgcgtt aagtcgaccg cctggggagt acggccgcaa 901 ggttaaaact caaatgaatt gacgggggcc cgcacaagcg gtggagcatg tggtttaatt 961 cgatgcaacg cgaagaacct tacctggtct tgacatccac ggaagttttc agagatgaga 1021 atgtgccttc gggaaccgtg agacaggtgc tgcatggctg tcgtcagctc gtgttgtgaa 1081 atgttgggtt aagtcccgca acgagcgcaa cccttatcct ttgttgccag cggtccggcc 1141 gggaactcaa aggagactgc cagtgataaa ctggaggaag gtggggatga cgtcaagtca 1201 tcatggccct tacgaccagg gctacacacg tgctacaatg gcgcatacaa agagaagcga 1261 cctcgcgaga gcaagcggac ctcataaagt gcgtcgtagt ccggattgga gtctgcaact 1321 cgactccatg aagtcggaat cgctagtaat cgtggatcag aatgccacgg tgaatacgtt 1381 cccgggcctt gtacacaccg cccgtcacac catgggagtg ggttgcaaaa gaagtaggta 1441 gcttaacctt cgggagggcg cttaccactt tgtgattcat gactggggtg aagtcgtaac 1501 aaggtaaccg taggggaacc tgcggttgga tcacctcctt a // LOCUS HUMRSALPA 320 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human alphoid repetitive DNA, subclone pRI-680-05, dimer 1. ACCESSION M28029 KEYWORDS alphoid repetitive sequence; repetitive DNA. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 320) AUTHORS Zaitsev,I.Z. and Rogaev,E.I. TITLE Structural analysis of alphoid DNA of primates JOURNAL Mol. Biol. 20, 530-538 (1986) STANDARD simple staff_entry COMMENT FEATURES from to/span description rpt 1 320 alphoid repeat, dimer 1 BASE COUNT 100 a 54 c 59 g 107 t ORIGIN 1 tgatgtgtgt attcaacgca cagagttgaa cgttccttta gacagagtag attggaaaca 61 ctctttttgt agaattttca agtggaggta tcaagcgctt tgaggccaat gatagaaaag 121 gaaatacctt cgtataataa ttagacagaa tcattctcag aaaccgcttt gcaatgtgtg 181 cgttcaactc acagtgttta acctttcttt tcatacagtt gtttcgaaac actcttttta 241 cagaatctac aagtaaatat ttggacctct ttgaagtctt cgttagaaat gagatttctt 301 catataatac tagctagaag // LOCUS HUMRSALPB 314 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human alphoid repetitive DNA, subclone pRI-680-05, dimer 2. ACCESSION M28030 KEYWORDS alphoid repetitive sequence; repetitive DNA. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 314) AUTHORS Zaitsev,I.Z. and Rogaev,E.I. TITLE Structural analysis of alphoid DNA of primates JOURNAL Mol. Biol. 20, 530-538 (1986) STANDARD simple staff_entry COMMENT FEATURES from to/span description rpt 1 314 alphoid repeat, dimer 2 BASE COUNT 89 a 51 c 65 g 109 t ORIGIN 1 ttctctcagt aacttgtatt tgctcggtgc tatcaactct cagagttgaa ctttccttta 61 gaaacagtag atttgaaact ctctttttgc ggaatttgta agtggagata tcaagagttt 121 tgaggccaat ggtagaaaag gaaatatctt cgtatgtaaa ctagatagaa tcattctcag 181 aaactacttt ggcacgtgtg cgttcaactc acagtgttta acctttcttt tcatagagca 241 gtttggaaac actcagtttg taaagtcagc aactggatat ttggatgtat ttgattcctt 301 cgttggaaat ggga // LOCUS HUMRSALPC 2550 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human alphoid repetitive DNA, subclone pHS53. ACCESSION M28031 M28033 M28032 KEYWORDS alphoid repetitive sequence; repetitive DNA. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2550) AUTHORS Zaitsev,I.Z. and Rogaev,E.I. TITLE Structural analysis of alphoid DNA of primates JOURNAL Mol. Biol. 20, 530-538 (1986) STANDARD simple staff_entry COMMENT FEATURES from to/span description rpt 1 850 alphoid pentamer rpt 851 1700 alphoid pentamer rpt 1701 2550 alphoid pentamer BASE COUNT 743 a 496 c 524 g 787 t ORIGIN 1 gaatctgcaa gtggatattt ggacttctct gaggatttcg ttggaaacgg gataaacttc 61 ccagaactac acggaagcat tgtgagaaac ttctttgtga tgtttgcatt caactcacag 121 agttgaacct tgctttcata gttcagcttt caaacactct ttttgtagaa tctgcaagtg 181 gatatttgga ccactttgtg gccttccttc gaaacgggta tatcttcaca tcaaacctag 241 accgaagcat tctcagaatg tttcctttga tgactgcatt caactcacag aggtgaacaa 301 tcctgctgat ggagcagttt tgaaactctc tttctttgga ttctgcaagt ggatatgtgg 361 acctctgtga agatttcttt ggaaacgggt tcatcttcac agaaaaacta aacagaagca 421 ttctcagaaa ctgctttgtg atgtttgtgt tccacttcag gaattaaact ttcctcttga 481 cagagcagct ctgaaaccct cttattctag aatctgcaag tggacatttg gaggactttg 541 aggcctgtgg tggaaaagga aaatcttcac ataaaaacta gatggaagca ttctcagaaa 601 ctactttgtg atgattgcat tcgactcaca gagttgaaca ttcctataga tagagcaggt 661 tgtaaacaat ctttttgtag aatctgcgat tggagatttg gactgctttg aggcctactg 721 tagtaaagga aataacttca tctaaaaacc aaacggaagc attcacagac aattcttagt 781 gataattgca ttgatctaac agagctgaac attcctttag atggcgtagt ttccaaacac 841 actttctgta gaatctgcaa gtggatattt ggacttctct gaggatttcg ttggaaacgg 901 gataaacttc ccagaactac acggaagcat tgtgagaaac ttctttgtga tgtttgcatt 961 caactcacag agttgaacct tgctttcata gttcagcttt caaacactct ttttgtggaa 1021 tctgcaagtg gatatttgga ccactttgtg gccttccttc gaaacgggta tatcttcaca 1081 tcaaacctag accgaagcat tctcagaatg tttcctgtga cgactgcatt caactcacag 1141 agatgaacaa tcctgctgat ggagcagttt tgaaactctc tttctttgga ttctgcaagt 1201 tgatatgtgg acctctgtga agatttcgtt ggaaacgggt tcatcttcac agaaaaacta 1261 aacagaagca ttctcagaaa ctgctttgtg atgtttgtgt tccacttcaa gaattgaact 1321 ttcctcttga cagagcagct ctgaaaccct ctttttctag aatctgcaag tggacatttg 1381 gagggctttg aggcctgtgg tgcaaaagga aaatcttcac ataaaaacta gatggaagca 1441 ttctcagaaa ctactttgtg atgattgcat tcgactcaca gagttgaaca ttcctataga 1501 tagagcaggt tgtaaacaat ctttttgtag aatctgcgat tggagatttg gactgctttg 1561 aggcctactg tagtaaagga aataacttca tctaaaatcc aaacggaagc attcacagac 1621 aattcttagt gataattgca ttgatctaac agagctgaac attcctttag atggcgtagt 1681 ttccaaacac actttctcta gaatctgcaa gtggatattt ggacttctct gaggatttcg 1741 ttggaaacgg gataaacttc ccagaactac acggaagcat tgtgagaaac ttctttgtga 1801 tgtttgcatt caactcacag agttgaacct tgctttcata gttcagcttt caaacactct 1861 ttttgtggaa tctgcaagtg gatatttgga ccactttgtg gccttccttc gaaacgggta 1921 tatcttcaca tcaaacctag acagaagcat tctcagaatg tttcctgtga cgactgcatt 1981 caactcacag aggtgaacaa ccctgctgat ggagcagttt tgaaactctc tttctttgga 2041 ttctgcaagt ggatatgtgg acctctgtga agatttcgtt ggaaacggtt tcatcttcac 2101 agaaaaacta aacaggagca ttctcagaaa ctgctttgtg atgtttgtgt tccacttcaa 2161 gaattgaact ttcctcttga cagagcagct ctgaaaccct ctttttctag aatctgcaag 2221 tggacatttg gagggctttg aggcctgtgg tggaaaagga aaatcttcac ataaaaacta 2281 gatggaagca ttctcagaaa ctactttgtg atgattgcat tcgactcaca gagttgaaca 2341 ttcctataga tagagcaggt tgtaaacaat gtttttgtag aatctgcgat tggagatttg 2401 gactgctttg aggcctactg tagtaaagga aataacttca tctaaaaacc aaacgcaagc 2461 attcacagac aattcttagt gatcattgga ttgaactaac agagctgaac attcctttag 2521 atggagaagt ttccaaacac actttctgca // LOCUS HUMRSALPD 134 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human alphoid repetitive DNA, subclone p340-11A. ACCESSION M28034 KEYWORDS alphoid repetitive sequence; repetitive DNA. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 134) AUTHORS Zaitsev,I.Z. and Rogaev,E.I. TITLE Structural analysis of alphoid DNA of primates JOURNAL Mol. Biol. 20, 530-538 (1986) STANDARD simple staff_entry COMMENT FEATURES from to/span description rpt 1 134 alphoid repeat BASE COUNT 47 a 23 c 27 g 37 t ORIGIN 1 tcacagagtt gaacgatcct ttacacagag cagacttgta acactctttt tgtggaattt 61 gcaagtggag atttcagccg ctttgaagtc aaaggtagaa aaggaaatat cttcctataa 121 aaactagaca gaat // LOCUS HUMRSALPE 146 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human alphoid repetitive DNA, subclone p340-23A. ACCESSION M28035 KEYWORDS alphoid repetitive sequence; repetitive DNA. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 146) AUTHORS Zaitsev,I.Z. and Rogaev,E.I. TITLE Structural analysis of alphoid DNA of primates JOURNAL Mol. Biol. 20, 530-538 (1986) STANDARD simple staff_entry COMMENT FEATURES from to/span description rpt 1 146 alphoid repeat BASE COUNT 46 a 28 c 26 g 43 t 3 others ORIGIN 1 ttttttccca actcacacga gttaacgatc ctttacacag agcagacttg tannnctctc 61 tttatggaat ttgcaagtgg agatttcagc cgctttgagg tcaatggtag aaaaggaaat 121 atcttcctat aaaaactaga cagaat // LOCUS HUMRSALPF 171 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human alphoid repetitive DNA, subclone p340-10A. ACCESSION M28036 KEYWORDS alphoid repetitive sequence; repetitive DNA. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 171) AUTHORS Zaitsev,I.Z. and Rogaev,E.I. TITLE Structural analysis of alphoid DNA of primates JOURNAL Mol. Biol. 20, 530-538 (1986) STANDARD simple staff_entry COMMENT FEATURES from to/span description rpt 1 171 alphoid repeat BASE COUNT 53 a 27 c 38 g 53 t ORIGIN 1 aattctcaga gtcttctttg tgatgtgtgc tttcaactca ccgagataaa gatttctctt 61 gatagagcaa attggaaaca ctctttttgt agaatttgca agggtacatt gagagcgctt 121 tcaggcctat ggtagaaaag ggaatatctt tccataaaag gtagacagaa g // LOCUS HUMRSALPG 137 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human alphoid repetitive DNA, subclone p340-11B. ACCESSION M28037 KEYWORDS alphoid repetitive sequence; repetitive DNA. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 137) AUTHORS Zaitsev,I.Z. and Rogaev,E.I. TITLE Structural analysis of alphoid DNA of primates JOURNAL Mol. Biol. 20, 530-538 (1986) STANDARD simple staff_entry COMMENT FEATURES from to/span description rpt 1 137 alphoid repeat BASE COUNT 33 a 27 c 30 g 47 t ORIGIN 1 gattctcaga aactcctttg tgatgtgtgc gttcaactca cagagtttaa cctttctttt 61 catagagcag ttaggaaaca ctctgtttgt gaagtctgca agtggatatt cagacctctt 121 tgaggccttc gttggaa // LOCUS HUMRSALPJ 158 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human alphoid repetitive DNA, subclone p340-03B. ACCESSION M28040 KEYWORDS alphoid repetitive sequence; repetitive DNA. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 158) AUTHORS Zaitsev,I.Z. and Rogaev,E.I. TITLE Structural analysis of alphoid DNA of primates JOURNAL Mol. Biol. 20, 530-538 (1986) STANDARD simple staff_entry COMMENT FEATURES from to/span description rpt 1 158 alphoid repeat BASE COUNT 44 a 31 c 40 g 43 t ORIGIN 1 gcaactcaga aactactttg tgatgtgtgc attcaactca ccgagtgcaa ccgagcgcaa 61 gatagagcag tttggaaaca ttgtgtctgt agaatctgca agtggatata tggaccgctt 121 tgaggccttc gttggaaacg ggatttcttc ctataaag // LOCUS HUMRSALPK 138 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human alphoid repetitive DNA, subclone p340-10B. ACCESSION M28039 KEYWORDS alphoid repetitive sequence; repetitive DNA. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 138) AUTHORS Zaitsev,I.Z. and Rogaev,E.I. TITLE Structural analysis of alphoid DNA of primates JOURNAL Mol. Biol. 20, 530-538 (1986) STANDARD simple staff_entry COMMENT FEATURES from to/span description rpt 1 138 alphoid repeat BASE COUNT 37 a 27 c 32 g 42 t ORIGIN 1 caatctcaga aactactttg tgatgtgtgc attcaactca ccgagtgcaa cattcctctt 61 gatagagcag tttggaaaca ttgtgtctgt agaatctgca agtggatata tggaccgctt 121 tgaggccttc gttggaaa // LOCUS HUMHBGF1 1082 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human heparin-binding growth factor 1 (hbgf-1), exon 1. ACCESSION M23017 KEYWORDS acidic fibroblast growth factor; heparin-binding growth factor 1. SEGMENT 1 of 3 SOURCE Human fetal liver DNA, clone ChIK117-1.1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1082) AUTHORS Wang,W.-P., Lehtoma,K., Varban,M.L., Krishnan,I. and Chiu,I.-M. TITLE Cloning of the gene coding for human class 1 heparin-binding growth factor and its expression in fetal tissues JOURNAL Mol. Cell. Biol. 9, 2387-2395 (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by I.-M.Chiu, 13-MAR-1989. FEATURES from to/span description pept 602 + 770 heparin-binding growth factor 1, exon 1 /nomgen="FGFA" /map="5q31.3-q33.2" /hgml_locus_uid="LD0150G" pre-msg 568 > 1082 heparin-binding growth factor 1 mRNA and intron IVS 771 > 1082 heparin-binding growth factor 1 intron A BASE COUNT 258 a 285 c 262 g 277 t ORIGIN 1 bp upstream of HindIII site. 1 aagcttccct taacatacta accctttact ttccctgttg tgtccctgaa aggcctcctg 61 tgcctttggc tgcaggtccc gaacgtccag gccatctgtg ctatctgctt cgcggtacct 121 caccaacgca acgtgagggt ggagggcaga accttggtcc tggcctctca gcttttgtgg 181 gtttcagcca gaccctaggt gttattttag tgcaactttg gtgtttaatt tgaggatgtg 241 tgtggaccag aaggagggac caaaacatga ttcttttccc catggtcaga tgattaaatt 301 tgaagttcta aaaaatgcag tttggtccaa agctgtgtcc aattgggaag agagaaaaat 361 gccctggaaa cccctcccag gcctgggacc atccttcctt aaccaccagc cacctcacag 421 gcccgcggac tgcgggcatc acctgggcag gctgtgctta ctcactaccc gggaaccctg 481 tgccctggag ctgtccttcc tctcttcaaa gtgcattttg tgcctttgct ggaagaaccg 541 actacaggtt tgttcaattt cttacagtct tgaaagcgcc acaagcagca gctgctgagc 601 catggctgaa ggggaaatca ccaccttcac agccctgacc gagaagttta atctgcctcc 661 agggaattac aagaagccca aactcctcta ctgtagcaac gggggccact tcctgaggat 721 ccttccggat ggcacagtgg atgggacaag ggacaggagc gaccagcaca gtaagcccat 781 ctctatggca ccccccttcc ctttctgaca tcttctgtag tcaaggtggg aggaaggtgc 841 acatttaagt acaggtactt gcttctccaa ggttctattc aggcatgaca cattcagagg 901 tggagtcaca taaatgcgta aaatgtctgg gaaatgaaaa tagggacttg tgggggccac 961 cacttaccca aacgtgtcct atttcaagtt ttttaaagca ctctctgctg acccaacaga 1021 acgggctgcc ggtgctcaat tgctgtatgt tttcccaggt ttctgtaact agtgaaagat 1081 ct // LOCUS HUMHBGF2 427 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human heparin-binding growth factor 1 (hbgf-1), exon 2. ACCESSION M23086 KEYWORDS acidic fibroblast growth factor; heparin-binding growth factor 1. SEGMENT 2 of 3 SOURCE Human adult liver DNA, clone ChIL208-1.4.1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 427) AUTHORS Wang,W.-P., Lehtoma,K., Varban,M.L., Krishnan,I. and Chiu,I.-M. TITLE Cloning of the gene coding for human class 1 heparin-binding growth factor and its expression in fetal tissues JOURNAL Mol. Cell. Biol. 9, 2387-2395 (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by I.-M.Chiu, 13-MAR-1989. FEATURES from to/span description pept + 186 + 289 heparin-binding growth factor 1, exon 2 /nomgen="FGFA" /map="5q31.3-q33.2" pre-msg < 1 > 427 heparin-binding growth factor 1 mRNA and introns IVS < 1 185 heparin-binding growth factor 1 intron A IVS 290 > 427 heparin-binding growth factor 1 intron B BASE COUNT 106 a 85 c 108 g 128 t ORIGIN Unknown number of bp after segment 1. 1 cagctttctt tggaaggcaa agaaaaaggg actgtatttc tatgttttga ttaatctgag 61 gctcatcctg agggctccgt gaaatgaatg agcagaattt tccatggcca actgtcctgg 121 ctgccgggtc ctatcggcaa aagcgtagtg tttatttact tttgctcgtg ttatttttat 181 tccagttcag ctgcagctca gtgcggaaag cgtgggggag gtgtatataa agagtaccga 241 gactggccag tacttggcca tggacaccga cgggctttta tacggctcag taagtatgaa 301 gctgacatgc ttccagacgt tggccaaggt ttgaggtttc cagaaatctt gttacatgga 361 gtgaggcaaa ctataaagca acaattagtc tctgtttgtt attttttcca gaaggattcc 421 caccctc // LOCUS HUMHBGF3 664 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human heparin-binding growth factor 1 (hbgf-1), exon 3. ACCESSION M23087 KEYWORDS acidic fibroblast growth factor; heparin-binding growth factor 1. SEGMENT 3 of 3 SOURCE Human fetal liver DNA, clone ChIK105a-2.1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 664) AUTHORS Wang,W.-P., Lehtoma,K., Varban,M.L., Krishnan,I. and Chiu,I.-M. TITLE Cloning of the gene coding for human class 1 heparin-binding growth factor and its expression in fetal tissues JOURNAL Mol. Cell. Biol. 9, 2387-2395 (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by I.-M.Chiu, 13-MAR-1989. FEATURES from to/span description pept + 304 498 heparin-binding growth factor 1, exon 3 /nomgen="FGFA" /map="5q31.3-q33.2" pre-msg < 1 > 498 heparin-binding growth factor 1 mRNA and intron IVS < 1 303 heparin-binding growth factor 1 intron A BASE COUNT 181 a 153 c 142 g 188 t ORIGIN Unknown number of bp after segment 2. 1 tgaggactct tagaagtgct cttatcagta gcatcttaat tactttacaa tggattttaa 61 atggaaagga agtttacaat aatagcaaat gcatattgac agctctttag tgcccggtgc 121 tgttctaagt ccttatgact accctgtgaa ataagttcca ccatgacccc aattttcctg 181 aaaaggagac tgaggcatgg agagctttag tattttgccc aatgtcacac agctagtaaa 241 tggggacccc catgtgaaac tactcactga ttgtcctact ctcttgtggt tttatctttt 301 tagcagacac caaatgagga atgtttgttc ctggaaaggc tggaggagaa ccattacaac 361 acctatatat ccaagaagca tgcagagaag aattggtttg ttggcctcaa gaagaatggg 421 agctgcaaac gcggtcctcg gactcactat ggccagaaag caatcttgtt tctccccctg 481 ccagtctctt ctgattaaag agatctgttc tgggtgttga ccactccaga gaagtttcga 541 ggggtcctca cctggttgac ccaaaaatgt tcccttgacc attggctgcg ctaaccccca 601 gcccacagag cctgaatttg taagcaactt gcttctaaat gcccagttca cttctttgca 661 gagc // LOCUS HUMASAT3 1741 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human alpha satellite and satellite 3 DNA. ACCESSION M21305 KEYWORDS alpha satellite; satellite 3; satellite DNA. SOURCE Human DNA, clone pTR9. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1741) AUTHORS Vissel,B., Earle,E. and Choo,K.H. TITLE A clone containing contiguous satellite 3 and alpha satellite sequences, separated by a direct repeat of 9 bp JOURNAL Unpublished (1989) Royal Children's Hospital, Victoria, Australia STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by B.Vissel 29-MAR-1989. Indirect evidence suggests that this sequence may represent a natural junction sequence between alpha satellite and satellite 3 sequences on chromosome 9. FEATURES from to/span description site 1 1482 human satellite 3 DNA site 1501 1741 human alpha satellite DNA rpt 1483 1491 direct repeat 1A rpt 1492 1500 direct repeat 1B signal 5 9 caat box (put.) signal 71 77 tata box (put.) ORF 38 207 ORF 1 (alt.) ORF 43 172 ORF 1 (alt.) signal 630 634 caat box (put.) signal 718 722 tata box (put.) ORF 752 1109 ORF 2 (alt.) ORF 757 925 ORF 2 (alt.) BASE COUNT 661 a 152 c 534 g 394 t ORIGIN 1 gaattcaata gaatggaatg gtatcagctc gagtggaatg gaatggaatg gaatggaatg 61 gcatggaatc gtataaagtg gaatggaatc aactcgagtg gaatggaatg gaatggaatg 121 gaatggaatg cagtacaatg caatagaatg gaatggaatg aactcgagtt gactggaatg 181 gaatggaatg gaatgcattt gaattgaatg gacctgaaga gaatggaatg gaatggaatg 241 gaatgaagtg gcctcgaatg caatagatta gattgcaacg gaatcatctt gagtggaatg 301 gagtgaaatg gaacggaatg gaatggaatg gaatggaata gaatggaatg gaatggaatg 361 gaatagaatg gaatggaatg gaatggaatg aaatgcaata gaatggaatg gtatcaactc 421 cagtggaatg gaatggaatg caatggaatg gaatggaatg gaatggaatg gaatcaattc 481 acctggaacg gaatggaatg caatggaatg gaatggaatg gaatgcagtg gaatggaatg 541 gaatagaatg gaatggaatg gaatggaatg gaatccactt gagtggaatg gaatggaatg 601 gaatggattc gaacggaatg gaatggaatg caatggaatg gaatgcatgt gattgaatgg 661 acacgaaaag aattgaatgg aaatgaatgg aatcgaatga agtggcctcg aatgcaatat 721 aatggaatgg aatcaagtcg agtggaatgg aatggaatgg gttggaatgg aatggaatgg 781 aatggaatgc attagaatgg aatggaatca actcgagtgg aatggaatgg aatggaatgg 841 aatggaatgg aatgcaatgg aatggaatgg aatggaatgg aatggaatgg aatggaatgc 901 aatggaatgg aatggaatgc aatagaatgg aatggaatca cctcgactgg aatggaatgg 961 aatggaatgc atttgaattg aatggacccg aagagaatgg aatggaatgg aatggaagga 1021 gtggcctcaa atgcaataga atgcaatgga atcatctcga gtggaatgga gtgaaatgga 1081 atggaatgga atggaatgga atgctataga atgaaatgga atgaagtcaa gtggaatgga 1141 atggaatgga atgggttgga agggaatgaa atggaatgca atagaatgga aaggaatgga 1201 atggaatgaa ctggccaatg caatagaatt gaatggaatc aactggagtg gaatggaatg 1261 gaatggaatg gattggaatg gaatgcaata gaatggaatg gaatcaactc gagtaaaatg 1321 gaatggaatg gaatggaatg gaatggaatg gaatggaatt gaatggaatg gaatggaatg 1381 catttgaatt gaatggacac gaaaagaatg gattggaatg gaatggaatg aagtagcctc 1441 gagtgcaatg atatggaatg gcatcaactc gagtggaatg gagtgaaaaa ggtgaaaaag 1501 gaaatacctt cacacaaaaa ctagacagaa gcattctcag aatcttcctt ttgatgtgtg 1561 cattcaactc acagagttga acctttcttt tgatagagca gttttgaaac actctctttg 1621 taggatctgc aactggacat ttggagctct ttgataccta cagtgaaaaa ggaaatatct 1681 tcacataaaa acttcacaga agcattctca aaaacatttt tcagatgtgt gcattgaatt 1741 c // LOCUS DROLINEJA 5154 bp ds-DNA INV 10-AUG-1990 DEFINITION D.melanogaster LINE element J-1. ACCESSION M22874 KEYWORDS LINE element; dispersed repetitive element; long interspersed element. SOURCE D.melanogaster DNA, clone J-1. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 5154) AUTHORS Priimaegi,A.F., Mizrokhi,L.J. and Ilyin,Y.V. TITLE The Drosophila mobile element jockey belongs to LINEs and contains coding sequences homologous to some retroviral proteins JOURNAL Gene 70, 253-262 (1988) STANDARD simple staff_entry COMMENT FEATURES from to/span description pept / 414 2165 ORF1, nucleic-acid binding protein (AA at 1) (put.) pept 2162 4912 ORF2, reverse transcriptase (put.) rpt 115 5134 jockey dispersed repetitive element BASE COUNT 1580 a 1226 c 1072 g 1276 t ORIGIN 1 catacccaac ctgaacacgt ttttctcaat atttgtgcca caaaatactt cgccagtaaa 61 ttaacatgaa gtcgctaaat gcgattacaa ttgacgacgc aatcatatac aaaaaaaaat 121 cattcacatg ggagatgagc aatcgagtgg acgtgttcac agaagtcgcg agataaaaca 181 aaaacgtaat tgtgatccat cacaaacatc tgcgcagatc gtgtgcttat ctcacaaaca 241 aaatctattt ttagtcactg cataacggtg acggcttcgg ttcgcgaaac ttatcagcaa 301 ctagcaattt ctaagctgtg ttgtttttgc ccctcgccct gcgcgctgcg caagcgggag 361 gttgttacaa tttaccttac aagtaaaccg gtaaatctta tcgtgtttag taaatatcaa 421 ttgcattata cggcataagt ataaagacaa ttgatataat ggagaattca tttgctcaat 481 cgcgacctag caatgggtgc gataaatttg agaaaatgag gaaagtagca ggtgttgagc 541 caggagaatt acgctcccaa ctccgcgcca gctgtgcagt tgtttcccct aacctggaag 601 gtatgccaac tcaatctgcg gtctccagct taatggtgac aatcagcagc aacaccaatg 661 caagtgttac ctgcactatt tctaacgtac aggccaacat gatctgtact cctacataca 721 ctgattgcac aaccgtgacc actagcattt gcccaactac gccttatgac aatggactgc 781 cgacacctct gtcatcactg cccaataagc catctaaagc gaattgcccc tttcaagcac 841 atgatcgtac tgtcaacagg aaacgaaaag gcgtgtctca gcccccatta cctatcctca 901 ccccttctcc aagccgtaaa actaaaaggc aggccactat gccactcaat gaggaggcct 961 ctacctccac tgcagcagca ttaaataaca atcgcttcgc gcttttgtcc gctgaagcgg 1021 agaatatgga gcaagacgtg tcggatgctg attctgacat tgaagactct gctgcccgag 1081 atggtggtgg acaatccgct aaatatagca aacccccagc catatgcgta ccaagtgtaa 1141 gcgatccggt caccttggaa cgggctctca atctgagcac cggctcctca aactactaca 1201 tccgcatttc tagatttggt gtatccagaa tctatacagc caaccctgat gctttccgca 1261 ccgctgtaaa agaactaaat aagttaaatt gtcaattctg gcatcaccaa cttaaagaag 1321 aaaaacccta cagagtagtg cttaaaggaa tccatgctaa tgttcctagt tcgcagatag 1381 aacaagcatt tagtgatcac ggctatgagg tccttaatat ctattgcccc agaaagtctg 1441 actggaagaa cattcaggta aacgaagatg ataatgaagc tacaaaaaac ttcaaaacta 1501 gacaaaattt gttttatatt aatcttaaac aaggcccgaa tgttaaagag tctcttaaga 1561 taactcgact tggcagatac agagtcactg ttgagcgcgc tacacgtaga aaagaactgc 1621 tacaatgtca aagatgccaa atttttggac actctaagaa ctattgcgcc caggatccta 1681 tttgtggtaa atgtagtggt ccccatatga ccgggttcgc tttgtgcata agtgacgtat 1741 gtctgtgtat aaattgtggt ggtgatcatg tctcgacaga caaaagctgc cctgtcagag 1801 cagagaaagc caagaagcta aaaccaaggt ccaggctacc gatgactaat aatattgcca 1861 cactcaaacc tccacaacgt tcttcaagcg gttacatacc agctgaggca ttaagaacca 1921 acatctctta tgctgatatt gctcgacgca acacgactca atctagggct cgtgctactg 1981 tgcaggctga agttatacca acgtcggaca atagccttaa caataaattt atgacgttag 2041 acaactccat tcgggccatc aatacgagaa tggacgaact atttaagctt atacacgaaa 2101 ctgtagaggc taataaagct ttcagagaac tggttcaggt tctaattaca cgtattccta 2161 aatgactcaa ccaaccttaa aaatcggatt gtggaacgct cgcggattaa caaggggctc 2221 tgaggagctt cggatattcc tcagcgatca cgatatagac gtaatgctta ccacggaaac 2281 acacatgcga gttggtcagc gcatctatct cccagggtat cttatgtatc acgcccacca 2341 ccccagtggt aacagtagag gtggctctgc agtcatcata aaatctagac tttgtcacag 2401 ccctctgaca cctatctcta ctaatgacag gcagatagcg agagtgcacc tgcaaacatc 2461 ggttgggacc gtcactgtag ctgctgttta tctacctcca gcagaaagat ggatagtaga 2521 tgacttcaaa tccatgtttg ctgcgttagg caacaaattt attgctggtg gtgattacaa 2581 tgccaaacat gcatggtggg ggaacccaag atcctgtcct agaggtaaaa tgttgcaaga 2641 agtcattgca catgggcaat accaagttct ggctacgggc gaacccactt tctactctta 2701 caaccctttg ttaacaccat cagcccttga tttttttata acctgtgggt acggcatggg 2761 caggctagat gtacaaactc tccaggaact ctcgtcggac catcttccta ttctggctgt 2821 attgcacgct acgccgttaa agaaaccaca acgcgtacga ctacttgccc ataatgctga 2881 cataaacata ttcaaaaccc atcttgaaca gctgagtgag gtaaatatgc aaattctgga 2941 ggcggtggac attgataatg ccacaagcct tttcatgagc aaactaagtg aggctgctca 3001 gcttgctgca ccgagaaatc ggcatgaagt agaggccttc agaccacttc aacttccttc 3061 cagtatattg gcactgctca ggctaaaacg aagagttcga aaagaatatg ctagaacagg 3121 tgatccccgc atgcaacaga tccacagtag actggccaac tgcctgcata aggcccttgc 3181 tcgaagaaag caggcccaaa tagatacctt cttggataac ttgggtgctg acgcgagcac 3241 aaattactca ctgtggcgta tcacgaaacg gttcaaagct cagcccaccc caaaatcagc 3301 aatcaaaaat ccgtctggtg gctggtgtcg cactagcttg gaaaaaactg aagtgttcgc 3361 taacaacctt gagcaacgtt ttacacccta taactatgca ccggaaagtc tctgtcgtca 3421 ggttgaagaa tacttggaat cgccctttca aatgagcctg cctctgagtg ctgtcacact 3481 ggaagaagtg aagaatttaa tagccaagct gccacttaag aaagctcctg gagaagatct 3541 tcttgataat agaaccatta gacttctccc agatcaagca ttgcagttcc ttgccttaat 3601 attcaacagc gttcttgatg ttggctactt tccgaaagct tggaaatcgg cgagcataat 3661 tatgatccat aagactggaa aaacaccgac agacgttgac tcgtacaggc ccaccagctt 3721 actcccatct ctgggtaaaa ttatggagag gctgatccta aacaggctgc tcacatgcaa 3781 ggatgttacc aaagcgattc ccaaatttca gtttggcttc cggttgcagc acggtactcc 3841 tgagcaacta catagagtag tgaactttgc tctggaagct atggaaaaca aggagtatgc 3901 agtaggtgcc tttcttgata ttcaacaggc atttgacaga gtctggcacc ctgggctcct 3961 gtacaaagcg aagaggctgt tcccgccgca gctatatttg gttgttaaaa gtttcctgga 4021 agaacgcaca ttccacgtct ctgttgatgg gtacaaatca tcaatcaagc caattgcagc 4081 tggagttcct caaggaagcg ttcttggccc aaccctatac tcagtttttg cttcggacat 4141 gcctactcac acaccagtca cagaggtaga cgaagaagat gtgctcatag ccacctacgc 4201 tgacgatact gctgtgctca cgaaaagtaa aagtatcctg gctgccactt ctggtctaca 4261 ggaatacctg gatgcattcc agcaatgggc tgagaactgg aatgtgcgca tcaacgctga 4321 gaagtgtgcc aatgtgacgt tcgccaaccg aacaggtagc tgtccgggtg tcagtctgaa 4381 tggaagactg atcagacacc atcaggctta taaatacctt ggtattaccc tcgataggaa 4441 gctcaccttc agcaggcaca tcacaaatat tcagcaagcg ttcaggacca aggttgctcg 4501 gatgtcttgg ctcattgcac cacgcaacaa actgtcgctt ggctgcaagg tcaatattta 4561 caagtccata ttggccccct gcctgttcta cggcctgcag gtatacggca ttgctgcgaa 4621 gagtcacctt aataagatcc ggattttaca ggcgaagacc ttaagaagaa tttcgggggc 4681 tccttggtat atgagaacaa gagacatcga acgcgacctc aaggtgccca aattaggaga 4741 caagctccag aacatcgccc aaaaatatat ggaaaggctt aatgtacacc ccaacagcct 4801 agcaaggaag ctaggaactg cagctgtggt caatgctgac cctcggacta gagtcaaaag 4861 aagactcaag cgacaccacc ctcatgacct ccctaacctg gttttgacct agaaagtctt 4921 agttttaaaa ttcattagaa taatcaaata aataataatt actatgttat atcaactatt 4981 ataattctcc ctatcatttt tagattaaaa atctgttagt cttaagtaac caagacacat 5041 tgtaaaataa aataatttaa gcagatcaaa ttaagttgcc gcatgggtaa cagtgcgttg 5101 atcaaataat aaaaacatca taaaaaaaaa aaaaatatac aaaattaatg atgg // LOCUS DROLINEJB 490 bp ds-DNA INV 10-AUG-1990 DEFINITION D.melanogaster LINE element J-1. ACCESSION M22875 KEYWORDS LINE element; dispersed repetitive element; long interspersed element. SOURCE D.melanogaster DNA, clone J-0. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 490) AUTHORS Priimaegi,A.F., Mizrokhi,L.J. and Ilyin,Y.V. TITLE The Drosophila mobile element jockey belongs to LINEs and contains coding sequences homologous to some retroviral proteins JOURNAL Gene 70, 253-262 (1988) STANDARD simple staff_entry COMMENT FEATURES from to/span description rpt < 1 490 jockey dispersed repetitive element BASE COUNT 218 a 64 c 51 g 157 t ORIGIN 1 tttgttatta aaattcatta gaataatcaa ataaataata attactatgt tatatcaact 61 attataattc tccctatcat ttttaggatt aaaaatctgt tagtcttaag taaccaagac 121 acattgtaaa ataaaataat ttaagcagat caaattaagt tgccgcatgg gtaacagtgc 181 gttgatcaaa taataaaaac atcatcatat aaagacctcc ctaacctggt tttgacctag 241 aaagtcttag ttttaaaatt cattagaata atcaaataaa taataattac tatgttatat 301 caactattat aattctccct atcattttta ggattaaaaa tctgttagtc ttaagtaacc 361 aagacacatt gtaaaataaa ataatttaag cagatcaaat taagttgccg catgggtaac 421 agtgcgttga tcaaataata aaaacatcat cgtatataat taaaaaaaaa aaaaaaaaaa 481 aaaaaaaaaa // LOCUS DROLINEJC 367 bp ds-DNA INV 10-AUG-1990 DEFINITION D.melanogaster LINE element J-1. ACCESSION M22876 KEYWORDS LINE element; dispersed repetitive element; long interspersed element. SOURCE D.melanogaster DNA, clone J-4. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 367) AUTHORS Priimaegi,A.F., Mizrokhi,L.J. and Ilyin,Y.V. TITLE The Drosophila mobile element jockey belongs to LINEs and contains coding sequences homologous to some retroviral proteins JOURNAL Gene 70, 253-262 (1988) STANDARD simple staff_entry COMMENT FEATURES from to/span description rpt < 1 367 jockey dispersed repetitive element BASE COUNT 135 a 76 c 53 g 103 t ORIGIN 1 ctatctcccg atctctattg tacaccccaa cagcctagca aggaagctag gaactgcagc 61 tgtggtcaat gctagccctc agcgctagag tcaaaagaag actcaagcga caccaccctc 121 atgacctccc taacctggtt ttgacctaga aagtcttagt tttaaaattc attagaataa 181 tcaaataaat aataattact atgttatatc aactattata attctcccta tcatttttag 241 gattaaaaat ctgttagtct taagtaacca agacacattg taaaataaaa taatttaagc 301 agatcaaatt aagttgccgc atgggtaaca gtgcgttgat caaataataa aaacatcatc 361 gtaaaaa // LOCUS HUMPOVRA 1254 bp ss-mRNA PRI 10-AUG-1990 DEFINITION Human poliovirus receptor mRNA, clone H20A. ACCESSION M24407 KEYWORDS oncogene; poliovirus receptor; transmembrane protein; tyrosine kinase; viral receptor. SOURCE Human Hela cell cDNA to mRNA, clone H20A. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1254) AUTHORS Mendelsohn,C.L., Wimmer,E. and Racaniello,V.R. TITLE Cellular receptor for poliovirus: Molecular cloning, nucleotide sequence, and expression of a new member of the immunoglobulin superfamily JOURNAL Cell 56, 855-865 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1254; revises [1]) AUTHORS Racaniello,V.R. JOURNAL Unpublished (1989) STANDARD full staff_entry COMMENT Draft entry and computer readable copy of sequence kindly provided by V.Racaniello, 27-APR-1989. FEATURES from to/span description pept 1 1254 poliovirus receptor revision 1020 1020 a in [1]; g in [1] BASE COUNT 255 a 389 c 362 g 248 t ORIGIN 1 atggcccgag ccatggccgc cgcgtggccg ctgctgctgg tggcgctact ggtgctgtcc 61 tggccacccc caggaaccgg ggacgtcgtc gtgcaggcgc ccacccaggt gcccggcttc 121 ttgggcgact ccgtgacgct gccctgctac ctacaggtgc ccaacatgga ggtgacgcat 181 gtgtcacagc tgacttgggc gcggcatggt gaatctggca gcatggccgt cttccaccaa 241 acgcagggcc ccagctattc ggagtccaaa cggctggaat tcgtggcagc cagactgggc 301 gcggagctgc ggaatgcctc gctgaggatg ttcgggttgc gcgtagagga tgaaggcaac 361 tacacctgcc tgttcgtcac gttcccgcag ggcagcagga gcgtggatat ctggctccga 421 gtgcttgcca agccccagaa cacagctgag gttcagaagg tccagctcac tggagagcca 481 gtgcccatgg cccgctgcgt ctccacaggg ggtcgcccgc cagcccaaat cacctggcac 541 tcagacctgg gcgggatgcc caatacgagc caggtgccag ggttcctgtc tggcacagtc 601 actgtcacca gcctctggat attggtgccc tcaagccagg tggacggcaa gaatgtgacc 661 tgcaaggtgg agcacgagag ctttgagaag cctcagctgc tgactgtgaa cctcaccgtg 721 tactaccccc cagaggtatc catctctggc tatgataaca actggtacct tggccagaat 781 gaggccaccc tgacctgcga tgctcgcagc aacccagagc ccacaggcta taattggagc 841 acgaccatgg gtcccctgcc accctttgct gtggcccagg gcgcccagct cctgatccgt 901 cctgtggaca aaccaatcaa cacaacttta atctgcaacg tcaccaatgc cctaggagct 961 cgccaggcag aactgaccgt ccaggtcaaa gagggacctc ccagtgagca ctcaggcata 1021 tcccgtaacg ccatcatctt cctggttctg ggaatcctgg tttttctgat cctgctgggg 1081 atcgggattt atttctattg gtccaaatgt tcccgtgagg tcctttggca ctgtcatctg 1141 tgtccctcga gtacagagca tgccagcgcc tcagctaatg ggcatgtctc ctattcagct 1201 gtgagcagag agaacagctc ttcccaggat ccacagacag agggcacaag gtga // LOCUS HUMPOVRB 1179 bp ss-mRNA PRI 10-AUG-1990 DEFINITION Human poliovirus receptor mRNA, clone H20B. ACCESSION M24406 KEYWORDS oncogene; poliovirus receptor; transmembrane protein; tyrosine kinase; viral receptor. SOURCE Human Hela cell cDNA to mRNA, clone H20B. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1179) AUTHORS Mendelsohn,C.L., Wimmer,E. and Racaniello,V.R. TITLE Cellular receptor for poliovirus: Molecular cloning, nucleotide sequence, and expression of a new member of the immunoglobulin superfamily JOURNAL Cell 56, 855-865 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1179; revises [1]) AUTHORS Racaniello,V.R. JOURNAL Unpublished (1989) STANDARD full staff_entry COMMENT Draft entry and computer readable copy of sequence kindly provided by V.Racaniello, 27-APR-1989. FEATURES from to/span description pept 1 1179 poliovirus receptor revision 1020 1020 a in [1]; g in [1] BASE COUNT 234 a 368 c 341 g 236 t ORIGIN 1 atggcccgag ccatggccgc cgcgtggccg ctgctgctgg tggcgctact ggtgctgtcc 61 tggccacccc caggaaccgg ggacgtcgtc gtgcaggcgc ccacccaggt gcccggcttc 121 ttgggcgact ccgtgacgct gccctgctac ctacaggtgc ccaacatgga ggtgacgcat 181 gtgtcacagc tgacttgggc gcggcatggt gaatctggca gcatggccgt cttccaccaa 241 acgcagggcc ccagctattc ggagtccaaa cggctggaat tcgtggcagc cagactgggc 301 gcggagctgc ggaatgcctc gctgaggatg ttcgggttgc gcgtagagga tgaaggcaac 361 tacacctgcc tgttcgtcac gttcccgcag ggcagcagga gcgtggatat ctggctccga 421 gtgcttgcca agccccagaa cacagctgag gttcagaagg tccagctcac tggagagcca 481 gtgcccatgg cccgctgcgt ctccacaggg ggtcgcccgc cagcccaaat cacctggcac 541 tcagacctgg gcgggatgcc caatacgagc caggtgccag ggttcctgtc tggcacagtc 601 actgtcacca gcctctggat attggtgccc tcaagccagg tggacggcaa gaatgtgacc 661 tgcaaggtgg agcacgagag ctttgagaag cctcagctgc tgactgtgaa cctcaccgtg 721 tactaccccc cagaggtatc catctctggc tatgataaca actggtacct tggccagaat 781 gaggccaccc tgacctgcga tgctcgcagc aacccagagc ccacaggcta taattggagc 841 acgaccatgg gtcccctgcc accctttgct gtggcccagg gcgcccagct cctgatccgt 901 cctgtggaca aaccaatcaa cacaacttta atctgcaacg tcaccaatgc cctaggagct 961 cgccaggcag aactgaccgt ccaggtcaaa gagggacctc ccagtgagca ctcaggcata 1021 tcccgtaacg ccatcatctt cctggttctg ggaatcctgg tttttctgat cctgctgggg 1081 atcgggattt atttctattg gtccaaatgt tcccgtgagg tcctttggca ctgtcatctg 1141 tgtccctcga gtgagcatca ccagagctgc cgtaattga // LOCUS MUSSNAP25 2040 bp ss-mRNA ROD 10-AUG-1990 DEFINITION Mouse SNAP-25 mRNA encoding synaptosomal associated protein 25, complete cds. ACCESSION M22012 KEYWORDS synaptosomal associated protein 25. SOURCE Mouse (strain BALB/c) adult neural brain cell, cDNA to mRNA, clones pMuBr8, p8.1, p8.51, p8.52, p8.53, and p8.71. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2040) AUTHORS Oyler,G.A., Higgins,G.A., Hart,R.A., Battenberg,E., Billingsley,M., Bloom,F.E. and Wilson,M.C. TITLE The identification of a novel synaptosomal associated protein, SNAP25, differentially expressed by neuronal subpopulations JOURNAL J. Cell Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by M.Wilson 18-JAN-1989 FEATURES from to/span description pept 164 784 synaptosomal associated protein 25 mRNA < 1 2037 synaptosomal associated protein 25 mRNA BASE COUNT 613 a 462 c 430 g 535 t ORIGIN Chromosome 2. 1 cccgaggttt ggagctgtct ttccttccct ccctacccgg cggctcctcc actcttgcta 61 cctgcaggga tcagcggaca gcatcctctg aagaagacaa ggttccttaa ctaagcacca 121 ctgacttgct ggccccggcg cccagcaacc ccccaccact accatggccg aggacgcaga 181 catgcgtaat gaactggagg agatgcagag gagggctgac cagctggctg atgagtccct 241 ggaaagcacc cgtcgcatgc tgcagctggt cgaagagagt aaagatgctg gcatcaggac 301 tttggttatg ttggatgagc aaggcgaaca actggaacgc attgaggaag ggatggacca 361 aatcaataag gatatgaaag aagcagaaaa gaatttgacg gacctaggaa aattctgcgg 421 gctttgtgtg tgtccctgta acaagcttaa atccagtgat gcttacaaaa aagcctgggg 481 caataatcag gatggagtag tggccagcca gcctgcccgt gtggtggatg aacgggagca 541 gatggccatc agtggtggct tcatccgcag ggtaacaaac gatgcccggg aaaatgaaat 601 ggatgaaaac ctagagcagg tgagcggcat catcggaaac ctccgtcata tggccctaga 661 catgggcaat gagattgaca cccagaatcg ccagattgac aggatcatgg agaaggctga 721 ctccaacaaa accagaattg atgaagccaa ccaacgtgca acaaagatgc tgggaagtgg 781 ttaaatctgc cgttctgctg tgctgtcctc caatgttgtt ggacaagaga gaagagagct 841 ccttcatgct tctctcatgg tattacctag taagacttac acacacacac acacacacac 901 acacacacac acacacacac acacacacac agagtagtca cccccattgt aaatgtctgt 961 gtggtttgtc agcttcccaa tgataccatg tgtcttttgt tttctccggc tctctttctt 1021 tgccaaaggt tgtacatagt ggtcatctgg tgactctatt tcctgactta agagttcttg 1081 ggtctctctc tttcttttct cagtggcgtt tgctgaatga caacaattta ggaatgctca 1141 atgtactgtt gatttttctc aatacacagt attgttcttg taaaactgtg acttaccaca 1201 gagctactac cacagtcctt tcttagggtg tcaggctctg aatctctcca aatgtgctct 1261 ctttggttcc tcagtgctat tctttgtctt tatgatttca taattagaca atgtgaaatt 1321 acataacagg cattgcacta aaagtgatgt gatttatgca tttatgcatg agaactaaat 1381 agacttttag atcctactta aacaaaaact tccatgacag tagcatactg acaagaaaac 1441 acacacaaca gcaacaataa caaagcaaca actacgcatg ctcagcattg ggacactgtc 1501 aagattaagt cataccagca aaacctgcag ctgtgtcacc ttcttctgtc aacatacaga 1561 ctgatcataa tgatcccttc tttacacaca cacacacaca cacacacaca cacacacaca 1621 caaatggaat ttaaccaact tcccagaatt gatgaagcaa atatatgttt ggctgaaact 1681 attgtaaatg ggtgtaatat agggtttgtc gaatgctttt gaaagctctg ttttccagac 1741 aatactcttg tgtggaaaac gtgaagatct tctaagtctg gctcttgtga tcaccaaacc 1801 ctggtgcatc agtacaacac tttgcgctaa tctagagcta tgcacaacca aattgctgag 1861 atgtttagta gctgataaag aaacctttaa aaaattatat aaatgaatga aatatagata 1921 aactgtgaga taaatatcat tacagcatgt atattaaatc cctcctgtct cctctgttgg 1981 tttgtgaagt gatttgacat tttgtagcta gtttaaaatt attaaaaatt atagatgtta // LOCUS CHKNCAMC1 26 bp ss-mRNA VRT 10-AUG-1990 DEFINITION Chicken cardiac neural cell adhesion (N-CAM) gene, exon 12. ACCESSION M24014 J04140 KEYWORDS neural cell adhesion molecule. SEGMENT 1 of 6 SOURCE Chicken embryonic day 10 cardiac muscle cDNA to mRNA, clone pEC101B. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 7 to 18) AUTHORS Prediger,E.A., Hoffman,S., Edelman,G.M. and Cunningham,B.A. TITLE Four exons encode a 93-base-pair insert in three neural cell adhesion molecule mRNAs specific for chicken heart and skeletal muscle JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 9616-9620 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1 to 26) AUTHORS Prediger,E.A. JOURNAL Unpublished (1989) STANDARD full staff_entry FEATURES from to/span description pept < 1 + 18 neural cell adhesion molecule, exon 12 (AA at 1) IVS 19 > 26 intron BASE COUNT 10 a 6 c 5 g 5 t ORIGIN 1 tcaagacaca gccagttcgt aagtaa // LOCUS CHKNCAMC2 29 bp ds-DNA VRT 10-AUG-1990 DEFINITION Chicken cardiac neural cell adhesion (N-CAM) gene, exon 12A. ACCESSION M23991 J04140 KEYWORDS neural cell adhesion molecule. SEGMENT 2 of 6 SOURCE Chicken embryonic day 10 cardiac muscle DNA, clone 6B1, and cDNA to mRNA, clone pEC101B. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 7 to 21) AUTHORS Prediger,E.A., Hoffman,S., Edelman,G.M. and Cunningham,B.A. TITLE Four exons encode a 93-base-pair insert in three neural cell adhesion molecule mRNAs specific for chicken heart and skeletal muscle JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 9616-9620 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1 to 6; 22 to 29) AUTHORS Prediger,E.A. JOURNAL Unpublished (1989) Rockefeller U., 1230 York Ave., NYC NY 10021 STANDARD full staff_entry FEATURES from to/span description pept + 7 + 21 neural cell adhesion molecule exon 12A IVS < 1 6 N-CAM intron IVS 22 > 29 N-CAM intron BASE COUNT 6 a 10 c 6 g 7 t ORIGIN 1 ctgcaggtat ccctcactca cgtaagtgc // LOCUS CHKNCAMC3 54 bp ds-DNA VRT 10-AUG-1990 DEFINITION Chicken cardiac neural cell adhesion (N-CAM) gene, exon 12B. ACCESSION M23992 J04140 KEYWORDS neural cell adhesion molecule. SEGMENT 3 of 6 SOURCE Chicken embryonic day 10 cardiac muscle DNA, clone 12AC3, and cDNA to mRNA, clone pEC101B. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 13 to 46) AUTHORS Prediger,E.A., Hoffman,S., Edelman,G.M. and Cunningham,B.A. TITLE Four exons encode a 93-base-pair insert in three neural cell adhesion molecule mRNAs specific for chicken heart and skeletal muscle JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 9616-9620 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1 to 12; 47 to 54) AUTHORS Prediger,E.A. JOURNAL Unpublished (1989) Rockefeller U., 1230 York Ave., NYC NY 10021 STANDARD full staff_entry FEATURES from to/span description pept + 13 + 46 neural cell adhesion molecule exon 12B IVS < 1 12 N-CAM intron IVS 47 > 54 N-CAM intron BASE COUNT 11 a 13 c 12 g 18 t ORIGIN 1 ttctttcttt agcaagtgct gctgccccca ccatgtttag cagagagtga gtat // LOCUS CHKNCAMC4 62 bp ds-DNA VRT 10-AUG-1990 DEFINITION Chicken cardiac neural cell adhesion (N-CAM) gene, exon 12C. ACCESSION M23993 J04140 KEYWORDS neural cell adhesion molecule. SEGMENT 4 of 6 SOURCE Chicken embryonic day 10 cardiac muscle DNA, clone 7C1, and cDNA to mRNA, clone pEC101B. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 13 to 54) AUTHORS Prediger,E.A., Hoffman,S., Edelman,G.M. and Cunningham,B.A. TITLE Four exons encode a 93-base-pair insert in three neural cell adhesion molecule mRNAs specific for chicken heart and skeletal muscle JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 9616-9620 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1 to 12; 55 to 62) AUTHORS Prediger,E.A. JOURNAL Unpublished (1989) Rockefeller U., 1230 York Ave., NYC NY 10021 STANDARD full staff_entry FEATURES from to/span description pept + 13 + 54 neural cell adhesion molecule exon 12C IVS < 1 12 N-CAM intron IVS 55 > 62 N-CAM intron BASE COUNT 18 a 20 c 12 g 12 t ORIGIN 1 atgtttccat agcaactcag cctctccctg catccgaaag cacagagcca ccaagtgagt 61 ag // LOCUS CHKNCAMC5 3 bp ss-mRNA VRT 10-AUG-1990 DEFINITION Chicken cardiac neural cell adhesion (N-CAM) gene, exon 12D. ACCESSION M23994 J04140 KEYWORDS neural cell adhesion molecule. SEGMENT 5 of 6 SOURCE Chicken embryonic day 10 cardiac muscle cDNA to mRNA clone pEC101B. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 3) AUTHORS Prediger,E.A., Hoffman,S., Edelman,G.M. and Cunningham,B.A. TITLE Four exons encode a 93-base-pair insert in three neural cell adhesion molecule mRNAs specific for chicken heart and skeletal muscle JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 9616-9620 (1988) STANDARD full staff_entry COMMENT Exon 12D represents a very small exon. FEATURES from to/span description pept + 1 + 3 neural cell adhesion molecule, exon 12D BASE COUNT 2 a 0 c 1 g 0 t ORIGIN 1 aag // LOCUS CHKNCAMC6 33 bp ss-mRNA VRT 10-AUG-1990 DEFINITION Chicken cardiac neural cell adhesion (N-CAM) gene, exon 13. ACCESSION M37649 J04140 KEYWORDS neural cell adhesion molecule. SEGMENT 6 of 6 SOURCE Chicken embryonic day 10 cardiac muscle cDNA to mRNA clone pEC101B. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 33) AUTHORS Prediger,E.A., Hoffman,S., Edelman,G.M. and Cunningham,B.A. TITLE Four exons encode a 93-base-pair insert in three neural cell adhesion molecule mRNAs specific for chicken heart and skeletal muscle JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 9616-9620 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1 to 33) AUTHORS Prediger,E.A. JOURNAL Unpublished (1989) STANDARD full staff_entry COMMENT FEATURES from to/span description pept + 13 > 33 neural cell adhesion molecule, exon 13 IVS < 1 12 intron BASE COUNT 9 a 11 c 7 g 6 t ORIGIN 1 ccattgtttc aggggaaccc agtgcaccca aac // LOCUS BADLDH 1147 bp ds-DNA BCT 10-AUG-1990 DEFINITION B.caldolyticus lactate dehydrogenase (LDH) gene, complete cds. ACCESSION M19394 M19386 KEYWORDS lactate dehydrogenase. SOURCE B.caldolyticus (strain DSM 405) DNA, clone pLDHY1. ORGANISM Bacillus caldolyticus Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1147) AUTHORS Zuelli,F., Weber,H. and Zuber,H. TITLE Nucleotide sequences of lactate dehydrogenase genes from the thermophilic bacteria Bacillus stearothermophilus, B. caldolyticus and B.caldotenax JOURNAL Biol. Chem. Hoppe-Seyler 368, 1167-1177 (1987) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by F.Zuelli, 27-MAY-1988. FEATURES from to/span description pept 146 1099 lactate dehydrogenase (AC 1.1.1.27) BASE COUNT 282 a 275 c 321 g 269 t ORIGIN 301 bp upstream of NcoI site. 1 tgctcccgtt gtttgccgcg gcgtctgtca tgaaatggac aaacaatagt caaacaatcg 61 ccacaatgcc gcatgcatcg cggtgcgcct ttcgcgtaaa atagtttgtg aaagtgttca 121 cattgtattg agggggatga atgcaatgaa aaacaacggt ggaacccgag tagtggtcat 181 cggcaccggg tttgtcggcg ccagttatgc gtttgcctta atgaatcaag ggattgccga 241 tgagatcgtg ctcatcgatg cgaatgaaag caaggccata ggcgatgcga tggacttcaa 301 ccatgggaaa gtatttgcgc cgaagccggc tgacatttgg cacggcgatt acgatgattg 361 ccgcgatgcc gatttggttg tcatttgcgc cggcgccaac caaaaaccgg gcgagacgcg 421 gcttgatctt gtggacaaaa acattgccat tttccgctcg atcgttgagt cggtcatggc 481 atccggattt caaggactgt ttctcgtcgc caccaatccg gtcgacattt taacgtacgc 541 gacgtggaaa ttcagcggcc tgccgcatga gcgggtgatc ggttcgggga cgattttaga 601 tacggcgcgg ttccgctttt tgttgggcga gtatttctct gtcgctccgc aaaatgttca 661 tgcctatatt attggggaac acggcgacac tgaactcccg gtctggagcc aggctgatat 721 cggcggcgtg ccgatccgca agctggtcga gtccaaaggg gaagaagcgc aaaaagagct 781 cgagcgcatt tttgtcaatg tgcgcgatgc cgcctaccaa attattgaga aaaaaggagc 841 gacgtactac ggaattgcga tgggacttgc ccgcgtgacg cgcgccattt tgcataacga 901 aaacgctatt ttgaccgtat cggcctacct cgatggccca tatggggagc gcgacgtcta 961 catcggcgtg ccggctgtca ttaaccgcaa tggcatccgc gaggtgatcg aaattgaatt 1021 gaatgatgac gaaaaaaatc gattccatca tagcgcagcc acattaaaaa gcgtgctagc 1081 ccgttttttt acgcgatgag gcaacaaacg aaaagttgtg ccaacagtct ttctcgtcca 1141 aacaggg // LOCUS BMELDHA 1423 bp ds-DNA BCT 10-AUG-1990 DEFINITION B.megaterium L-lactate dehydrogenase gene. ACCESSION M22305 KEYWORDS L-lactate dehydrogenase. SOURCE B.megaterium (strain DSM 090) DNA, clone pMH/pML. ORGANISM Bacillus megaterium Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1423) AUTHORS Waldvogel,S., Weber,H. and Zuber,H. TITLE Nucleotide sequence of the L-lactate dehydrogenase gene from the mesophilic bacterium B.megaterium. Preparation and properties of a hybrid lactate dehydrogenase comprising moieties of the B. megaterium and B.stearothermophilus enzymes JOURNAL Biol. Chem. Hoppe-Seyler 368, 1391-1399 (1987) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1423; revises [1]) AUTHORS Suter,F. JOURNAL Unpublished (1989) STANDARD full staff_entry COMMENT Draft entry and computer readable copy of sequence [1] kindly submitted by S.Waldvogel 02-OCT-1987, and [2] by F.Suter 16-MAY-1989. Author address [2] F.Suter ETH-Hoenggerberg Inst. f. Molekularbiologie und Biophysik CH 8093 ZUERICH Switzerland FEATURES from to/span description pept 250 1206 L-lactate dehydrogenase (AA at 1) BASE COUNT 449 a 263 c 277 g 434 t ORIGIN 1 taacatacaa atagagaatg ttaaactctt tagctttctg caaaacaagt tgcctttact 61 caacacttta aacgtttaag atctttctaa atgtggcact tcatgtagaa ggacattttt 121 ttaaatcgcc ctgtaaaatt gtttactatt ttttaaaggg tgtgattttt atcacagctg 181 aagctattta ttcttgttac actaacattg tgaaaaacat cacaaaataa aattcaaagg 241 atgatacaaa tgaaaacaca atttacacca aaaacacgaa aagttgccgt tatcggaact 301 ggttttgttg gctcaagcta cgctttttca atggtgaatc aaggtattgc caatgaatta 361 gtgttaatcg atatgaacaa agaaaaagca gaaggtgaag cacgtgatat caatcatgga 421 atgccatttg ccacaccgat gaaaatctgg gctggagatt ataaagactg tgctgacgct 481 gatttagcag ttattacagc gggcgctaat caagctccag gggaaacacg cttagatcta 541 gttgaaaaaa acgttaaaat tttcgaatgc attgtaaaag atattatgaa cagcggattt 601 gacggcatca ttttagtggc aacaaatcca gttgatattc tcgcacacgt tacacaaaaa 661 gtatcaggat taccaaacgg acgggtaatt ggttcaggaa cgattcttga cacagctcgc 721 ttccgctact tgttaagcga ctatttcgaa gtagattctc gcaacgtcca cgcttatatt 781 atgggggaac atggagatac ggaatttcct gtttggagcc acgcgcaaat tggcggtgtg 841 aagctcgaac attttatcaa tactgccgct attgaaaaag aaccggatat gcagcatcta 901 ttcgaacaaa cccgcgatgc ggcttaccat attattaatc gaaaaggagc gacttattac 961 ggaattgcaa tggggcttgt acgcattacc aaggctattt tagatgatga aaattctatt 1021 ttaacagtat ctgctttatt agaaggacaa tacggtattt ctgatgtgta tatcggcgta 1081 ccagctatca ttaataaaaa cggcgtgcgt caaattattg aattgaattt aactcctcac 1141 gaacagcagc agctcgagca ctctgctagc attcttaagc aaactcgcga cagagctttt 1201 gtgtaacatc taaagatttt tgcgggggac tcccccgcac cttgttttaa atatgtagta 1261 gaggtgtatg aatatgacgt ggactcaggt atataatcct ttagataata tttggctttc 1321 tgcactaatt gcactcattc ccattatctt tttctttatt gctttaactc ttttgaaatt 1381 aaaaggacac attgcttgcc ggtattacgg tgcttctttc tat // LOCUS BSTLDH 1150 bp ds-DNA BCT 10-AUG-1990 DEFINITION B.stearothermophilus lactate dehydrogenase (LDH) gene, complete cds. ACCESSION M19396 M19386 KEYWORDS lactate dehydrogenase. SOURCE B.stearothermophilus (strain NCIB 8924) DNA, clone pLDHY1. ORGANISM Bacillus stearothermophilus Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1150) AUTHORS Zuelli,F., Weber,H. and Zuber,H. TITLE Nucleotide sequences of lactate dehydrogenase genes from the thermophilic bacteria Bacillus stearothermophilus, B. caldolyticus and B. caldotenax JOURNAL Biol. Chem. Hoppe-Seyler 368, 1167-1177 (1987) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by F.Zuelli, 27-MAY-1988. FEATURES from to/span description pept 149 1102 lactate dehydrogenase (EC 1.1.1.27) BASE COUNT 283 a 270 c 321 g 276 t ORIGIN 304 bp upstream of NcoI site. 1 gtgctcccgt tgtttgccgc ggcgtctgtc atgaaatgga caaacaatag tcaaacaatc 61 gccacaatcg cgcatgcatt gcggtgcgcc tttcgcgtaa aatatttata tgaaagtgtt 121 cgcattatat tgagggagga tgaatgcaat gaaaaacaac ggtggagccc gagtagtggt 181 catcggcgcc gggtttgtcg gcgccagtta tgtgtttgcc ttaatgaatc aagggattgc 241 cgatgagatc gtgctcatcg atgcgaatga aagcaaggcc ataggcgatg cgatggactt 301 caaccatggg aaagtatttg cgccgaagcc ggttgacatt tggcacggcg attacgatga 361 ttgccgcgat gccgatttgg ttgtcatttg cgccggcgcc aaccaaaaac cgggcgagac 421 gcggcttgat cttgtggaca aaaacattgc cattttccgc tcgatcgttg agtcggtcat 481 ggcatccgga tttcaaggac tgtttctcgt cgccaccaat ccggtcgaca ttttaacgta 541 cgcgacgtgg aaattcagcg gcctgccgca tgagcgggtg atcggttcgg ggacgatttt 601 agatacggcg cggttccgct ttttgttggg cgagtatttc tctgtcgctc cgcaaaatgt 661 tcatgcctat attattgggg aacacggcga cactgaactc ccggtctgga gccaggctta 721 tatcggcgtc atgccgatcc gcaagctggt cgagtccaaa ggggaagaag cgcaaaaaga 781 tctcgagcgc atttttgtca atgtgcgcga tgccgcctac caaattattg agaaaaaagg 841 agcgacgtac tacgggattg cgatggggct tgcccgcgtg acgcgcgcca ttttgcataa 901 cgaaaacgct attttgaccg tatcagccta cctcgatggc ctatatgggg agcgcgacgt 961 ctacatcgga gtgccggctg tcattaaccg caatggcatc cgcgaggtga tcgaaattga 1021 attgaatgat gacgaaaaaa atcgattcca tcatagcgca gctacattaa aaagcgtgct 1081 agcccgtgct tttacgcgat gaggcaacaa acgaaaagtt gtgccgacag tctttctcgt 1141 ccaaacaggg // LOCUS MUSIGVDJA 351 bp ss-mRNA ROD 10-AUG-1990 DEFINITION Mouse Ig active H-chain mRNA variable region (V186-2 DFL16.1 J2) (AA at 1) anti-NP hybridoma, clone H1-7. ACCESSION M18548 KEYWORDS immunoglobulin; immunoglobulin heavy chain; joining exon; rearranged gene; variable region. SOURCE Mouse (strain C57BL/6J) anti-NP hybridoma mRNA, clone H1-7. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 351) AUTHORS Blier,P.R. and Bothwell,A. TITLE A limited number of B cell lineages generates the heterogeneity of a secondary immune response JOURNAL J. Immunol. 139, 3996-4006 (1987) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by A.Bothwell 03-MAY-1988. FEATURES from to/span description ORF < 1 > 351 Ig heavy-chain V-region (V-D-J2) (AA at 1) recomb 293 294 V(186-2) region end/ D(FL16.1) start recomb 308 309 D(FL16.1) region end/J2 start BASE COUNT 84 a 92 c 89 g 79 t 7 others ORIGIN Unreported. 1 caggtcaaac tgcagcagcc tggggctgag cttgtgaagc ctggggcttc agnnaagctg 61 tcctgcaagg cttctggcta caccttcacc acctncttga tncactgggt gaaacagagg 121 cctggacgag gccttgagtg gattggaagg attgatccta atggtggtgg tactaagtac 181 gatgnaagtt caagagcaag gccacactga ctgtagacga accctccagc acagcctaca 241 tgcagctcag cagcctgaca tctgaggact ctgcggncta ttattgtgca acctattact 301 acggtctcta ctttaactac tgggnaaggc accactctca cagtctcctc a //