Path: utzoo!attcan!uunet!tut.cis.ohio-state.edu!zaphod.mps.ohio-state.edu!usc!rutgers!bionet!root From: GenBank-Updates@genbank.bio.net Newsgroups: bionet.molbio.genbank.updates Subject: Database Update Message-ID: Date: 12 Jul 90 12:00:26 GMT Sender: root@genbank.BIO.NET Distribution: bionet Lines: 5279 Approved: lear@genbank.bio.net Checksum: 44026 321 LOCUS BOVB1A 781 bp ss-mRNA MAM 12-JUL-1990 DEFINITION Cow beta-crystallin (p-Beta 25/23) mRNA, complete cds. ACCESSION M33010 KEYWORDS beta-crystallin; crystallin. SOURCE Cow lens cortex, cDNA to mRNA, clone p-Beta 25/23. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 781) AUTHORS Gorin,M.B. and Horwitz,J. TITLE Cloning and characterization of a cow beta crystallin cDNA JOURNAL Curr. Eye Res. 3, 939-948 (1984) STANDARD simple staff_review FEATURES from to/span description pept 6 653 beta-crystallin BASE COUNT 217 a 185 c 187 g 192 t ORIGIN 1 tccagatgga gacccagact gtgcagcagg agctgaaatc ccttccaacc accaagatgg 61 ctcaaactaa ccccatgccg gggtctgtgg ggccatggaa gattaccatc tatgaccagg 121 agaacttcca gggcaagaga atggaattca ccagctcctg cccaaatgtc tctgagcgca 181 attttgacaa cgtccggtct ctcaaggtgg aatgtggcgc ctgggttggt tatgagcata 241 ccagcttctg tgggcaacag tttgtcctgg agagaggaga gtaccctcgc tgggatgcct 301 ggagcgggag taatgcctat cacattgagc gcctcatgtc cttccgcccc atctgttcag 361 ctaatcataa ggagtctaag attacaattt ttgagaaaga aaatttcatt ggacgccaat 421 gggaaatctg tgatgactac ccctccttgc aagccatggg ttggcccaac aacgaagttg 481 gctctatgaa gatacaatgt ggagcctggg tttgctacca gtatcctggg taccgtggct 541 atcagtatat cttggaatgt gaccatcatg gaggagacta caaacactgg agagagtggg 601 gttctcatgc ccagacttcc cagattcaat ccattcgccg tatccaacag tagtggatta 661 aaagctccaa gtaagaattc ctcaagcatg agaccttcct aaacaatcta gaataaaata 721 tatgttctgc tgatattgct tccaaatgtt agctgctgaa atccacaata aatgtcatta 781 a // LOCUS CFICENB 439 bp ds-DNA BCT 12-JUL-1990 DEFINITION 
C.fimi endoglucanase B (cenB) gene, 5' end. ACCESSION M33026 KEYWORDS endoglucanase; endoglucanase B. SOURCE C.fimi DNA. ORGANISM Cellulomonas fimi Prokaryota; Bacteria; Firmicutes; Irregular asporogenous rods. REFERENCE 1 (bases 1 to 439) AUTHORS Owolabi,J.B., Beguin,P., Kilburn,D.G., Miller,R.C.Jr. and Warren,R.A.J. TITLE Expression in Escherichia coli of the Cellulomonas fimi structural gene for endoglucanase B JOURNAL Appl. Environ. Microbiol. 54, 518-523 (1988) STANDARD simple staff_review FEATURES from to/span description pept 275 > 439 endoglucanase B (cenB) precursor sigp 275 373 endoglucanase B signal peptide matp 374 > 436 endoglucanase B BASE COUNT 58 a 173 c 154 g 54 t ORIGIN 1 ggatcccgcg cccggcgcga gcccgcaacc cacgcgccca cggatcgggc ctcacgagcc 61 cgacgttggc ggccgggccg gggggcgacc tcgagaccga ggagcccccg cgtgaggcga 121 cgttggccgc gcacgccgct ggtgagcggg ctgaatcgtt tagggcgttg acctgcggac 181 ggacccgtct ggacgatgcg ccaggcgtcg tgcgggtgcg actgcggaca gcacgggtcg 241 ccgaccacca ctcccgtgcc cggaagagga ccccatgctc cgccaagtcc cacgcacgct 301 cgtcgcgggt ggctccgccc tcgccgtcgc cgtcggggtg ctcgtcgccc cgctcgcgac 361 cggcgcggcc gccgcgccca cctacaacta cgccgaggcc ctgcagaagt cgatgttctt 421 ctaccaggcg cacggctcc // LOCUS RATLACTAS 250 bp ss-mRNA ROD 12-JUL-1990 DEFINITION Rat lactase-phlorizin hydrolase mRNA, partial cds. ACCESSION M34730 KEYWORDS lactase; lactase-phlorizin hydrolase. SOURCE Rat (strain Sprague-Dawley CD) newborn, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 250) AUTHORS Bueller,H.A., Kothe,M.J.C., Goldman,D.A., Grubman,S.A., Sasak,W.V., Matsudaira,P.T., Montgomery,R.K. and Grand,R.J. TITLE Coordinate expression of lactase-phlorizin hydrolase mRNA and enzyme levels in rat intestine during development JOURNAL J. Biol. Chem. 
265, 6978-6983 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 250 lactase-phlorizin hydrolase (AA at 2) BASE COUNT 66 a 62 c 66 g 56 t ORIGIN 1 agaaaggatc ttctaccaca aaacctatat caacgaggct ctgaaagcct acaagctgga 61 tggtgtggac cttcgagggt actctgcctg gacgctgatg gacgacttcg agtggctgct 121 tggctacacc atgagatttg gattgtatca cgttgacttt aatcatgtga gcagacctcg 181 cacagcaaga gcctcagcca gatactatgc agaggtcatt gccaacaatg gcatgcccct 241 ggccgggaag // LOCUS BOVARRB 1945 bp ss-mRNA MAM 12-JUL-1990 DEFINITION Cow beta-arrestin mRNA, complete cds. ACCESSION M33601 KEYWORDS beta-arrestin; inhibitor. SOURCE Cow adult brain cortex, cDNA to mRNA, clone pBARRESTIN-1/1. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (sites) AUTHORS Lohse,M.J., Benovic,J.L., Codina,J., Caron,M.G. and Lefkowitz,R.J. TITLE Beta-arrestin: A protein that regulates beta-adrenergic receptor function JOURNAL Science 248, 1547-1550 (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1945; for [1]) AUTHORS Lohse,M.J., Benovic,J.L., Codina,J., Caron,M.G. and Lefkowitz,R.J. JOURNAL Unpublished (1990) See COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.J.Lohse, 06-APR-1990, for release after publication. 
Author address [1]: M.J.Lohse Howard Hughes Medical Institute, Box 3821, Duke University Medical center Durham, NC 27710 FEATURES from to/span description pept 97 1353 beta-arrestin BASE COUNT 419 a 590 c 556 g 380 t ORIGIN 1 gttccgggaa ccggctggcc cgcgcccctc ctgtcggccg gggattttcc agcctgggcg 61 ctgacgccgc ggacctcccc gcggccgcct cggaccatgg gcgacaaagg gacgcgggtg 121 ttcaagaagg cgagccccaa tggaaagctc accgtctatc tgggaaagcg ggactttgtg 181 gaccacatcg acctcgtgga gcccgtggat ggagtggttc ttgtggatcc ggagtatctc 241 aaggagagga gagtctatgt gacgctgacc tgcgccttcc gctacggccg ggaggacctg 301 gatgtcctgg gcctgacctt tcgcaaggac ctgtttgtgg ccaacgtgca gtctttcccg 361 ccggcccctg aggacaagaa gcccctgacg cggctgcagg agcgcctcat caagaagctg 421 ggcgagcatg cctacccttt cacctttgag atccctccga acctcccatg ctctgtgact 481 ttgcagccgg gacctgaaga tacagggaag gcctgcggtg tggactacga agtgaaagcc 541 ttctgtgcgg agaacctgga ggagaagatc cacaagcgga attctgtgcg cctggtcatc 601 cggaaggttc agtatgcccc agagaggcct ggcccccagc ccacggccga gaccaccagg 661 cagttcctca tgtcagacaa gcccttgcat ctggaggcct ccctggacaa ggagatctac 721 taccacggag aacccatcag tgtcaacgtc catgtcacca acaacaccaa caagacggtg 781 aagaagatca agatctcggt gcgccagtat gcagacatct gtctgttcaa cacagcccag 841 tacaagtgcc ctgtggccat ggaagaggct gatgacacag tggcacccag ctctacgttc 901 tgcaaggtct acacgctgac ccccttcctg gccaacaatc gagagaagcg gggcctcgcc 961 ctggacggga agctcaaaca cgaggacacg aacctggcct ccagcaccct gttgagggaa 1021 ggagccaacc gggagatcct gggcatcatt gtttcctaca aagtgaaagt gaagctggtg 1081 gtgtctcgtg gcggcctgtt gggagatctt gcatccagtg atgtggccgt ggaactgcct 1141 ttcaccctaa tgcaccccaa gcccaaagag gaacccccac accgggaagt tccagagcac 1201 gagacgccgg tagataccaa tctcatagaa cttgacacca acgatgacga cattgtgttt 1261 gaggactttg cccgccagag actaaaaggc atgaaggatg acaaggagga agaggaggat 1321 ggtaccggct ctccgcggct caacgacaga tagactgggg ctgccctccc tccgggcagc 1381 tccaggtcca ctctcatgca ctaggatgct tgttcgtctt cttcctgtcc tggctccccc 1441 tcccctttgt tcttccagtt tctaccaggg ggccccagcg gtcttccagg tcacggtggc 1501 gaacccctgg 
cctcaggatt ggcccccatc accatgccaa cagggccaca ggcagcaccc 1561 tcaccctctc actgcatcac ttctccattc cccctctttt cctattgacc cccagacagg 1621 ccagcacagc tctggccttc ggatttgact cgggatgggg agcagaaagg ggaagatggg 1681 gcacaagggc ttggcgaggt ggggatgggg gctcaagacg cgtgagagga tgtggccact 1741 gtcccaggtg atgaatacag ttctggcagc taaaacatga ccgctttgaa ggccaccctc 1801 ctctggctgg gaggggacag acccatggat agattgtcca cacagatttg ctcgaagttc 1861 agacctacca aacagctgtc ttcttcttcc ctcgtccctg ccccctgttc ctctgtggct 1921 gacagtgacc ttggtgaagg tttgt // LOCUS BBVRNA3 389 bp ss-RNA VRL 12-JUL-1990 DEFINITION Black beetle virus RNA3 proteins B1 and B2 genes, complete cds. ACCESSION M33065 KEYWORDS . SOURCE Black beetle virus. ORGANISM Black beetle virus Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; Nodaviridae. REFERENCE 1 (bases 1 to 389) AUTHORS Guarino,L.A., Ghosh,A., Dasmahapatra,B., Dasgupta,R. and Kaesberg,P. TITLE Sequence of the black beetle virus subgenomic RNA and its location in the viral genome JOURNAL Virology 139, 199-203 (1984) STANDARD simple staff_review FEATURES from to/span description pept 10 318 B1 protein pept 20 340 B2 protein BASE COUNT 118 a 120 c 98 g 53 t ORIGIN 1 tcgttaccaa tgttaaacga tgccaagcaa actcgcgcta atccaggaac ttcccgaccc 61 cattcaaacg gcggtggaag cagccatggg aatgagctac caagacgcac cgaacaacgt 121 gcgcagggac ctcgacaacc tgcacgcttg cctaaacaag gcaaaactaa cggtaagtcg 181 gatggtaaca tcactgctgg agaaacccag cgtggtggca tacctagagg gaaaggcccc 241 cgaggaggca aaaccaacac tcgaagaacg cctccgaaag ctggagctca gccacagcct 301 tccaacaacc ggaagtgacc ccccacccgc aaaactgtag gtggctctta ggagcaccca 361 cacccgttct agcccgaaag ggcagaggt // LOCUS MUSURNAA 54 bp ss-uRNA ROD 12-JUL-1990 DEFINITION Mouse small nuclear RNA. ACCESSION M34036 KEYWORDS small nuclear RNA. SOURCE Mouse plasmacytoma cell line P301 small nuclear RNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. 
REFERENCE 1 (bases 1 to 54) AUTHORS Chernokhvostov,V.V. and Georgiev,G.P. TITLE Complexes of nuclear matrix DNA with proteins tightly bound to DNA contain a specific small-size RNA of a novel type JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by V.V.Chernokhvostov, 04-MAY-1990. Author address: V.V.Chernokhvostov . of Molecular Biology, USSR Acad. Sci. Vavilova str., 32 117984, Moscow USSR FEATURES from to/span description uRNA 1 54 small nuclear RNA BASE COUNT 19 a 12 c 13 g 10 t ORIGIN 1 agaagacacc ctgatttaac ttctggtatc ggaagatgca agagccgaac caga // LOCUS RATCYP2A1 18820 bp ds-DNA ROD 12-JUL-1990 DEFINITION Rat hepatic steroid hydroxylase IIA1 (CYP2A1) gene, complete cds. ACCESSION M33312 KEYWORDS B2 repetitive sequence; LINE repetitive sequence; cytochrome P450; hepatic steroid hydroxylase IIA1. SOURCE Rat (strain Sprague Dawley) DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 18820) AUTHORS Matsunaga,T., Nomoto,M., Kozak,C.A. and Gonzalez,F.J. 
TITLE Structure and in vitro transcription of the rat CYP2A1 and CYP2A2 genes and regional localization of the CYP2A gene subfamily on mouse chromosome 7 JOURNAL Biochemistry 29, 1329-1341 (1990) STANDARD simple staff_review FEATURES from to/span description pept 4573 4749 hepatic steroid hydroxylase IIA1 (CYP2A1), exon 1 5050 5212 hepatic steroid hydroxylase IIA1, exon 2 7638 7787 hepatic steroid hydroxylase IIA1, exon 3 8005 8165 hepatic steroid hydroxylase IIA1, exon 4 9386 9562 hepatic steroid hydroxylase IIA1, exon 5 12760 12898 hepatic steroid hydroxylase IIA1, exon 6 13340 13527 hepatic steroid hydroxylase IIA1, exon 7 13960 14101 hepatic steroid hydroxylase IIA1, exon 8 17010 17191 hepatic steroid hydroxylase IIA1, exon 9 pre-msg 4545 17380 CYP2A1 mRNA and introns IVS 4750 5049 CYP2A1 intron A IVS 5213 7637 CYP2A1 intron B IVS 7788 8004 CYP2A1 intron C IVS 8166 9385 CYP2A1 intron D IVS 9563 12759 CYP2A1 intron E IVS 12899 13339 CYP2A1 intron F IVS 13528 13959 CYP2A1 intron G IVS 14102 17009 CYP2A1 intron H rpt 3750 4026 B2 repeat BASE COUNT 5274 a 4402 c 3695 g 5449 t ORIGIN 1 bp upstream of EcoRI site. 
1 gaattctagt acggtagccc tggctttcat caactagtta gtgccaaata tttgagaaaa 61 gttacaggtt caagctaata aaagttgcag agagtataaa agaatgcaga ttagacaaga 121 aaaaattaat tagagccctt ctagccaaca aagcctcaga tccaggagaa aagactacca 181 tagaaatggc caaaggctta tttatcaaag aaactgggct cagtggcagc aggatgacca 241 ccttgcctgt gtttattgtt gccacagcac tgttggataa agatgcaaat aaattaactt 301 tgggacagaa gttgatcatg actgctcctc cccctgcccc cgcaattctg attgaggctc 361 agtaatgcct acatgcttca ttatcacact ttactaatca gccctggctg agatattttc 421 cagccacctg tttccctgaa ccctgcgact cttccaccca accctgactt gggctgtcca 481 cttcatcaat tcgatgaggt tcaggcccag atacacaata ccagacctta cttgaggaac 541 tctcatccat cagaaacaga gcatacctgg ttcacggaca gaagtagctt catccataag 601 ggtcagagga gaacaggggc agcaataaca acagaaggaa aggtaatctg gactcagtct 661 cttccttcca ggcacttcaa ctcaaaagaa caaactaaga acattaacac aagtcctcat 721 catgggaaaa ggactggctg ttagcatctg cagggacagc cagatatgca tttacaacta 781 ctcatgtgca cggagccatt tacaaggaac aacagctcct aacagcagaa ggaaaaactg 841 tcagaaataa agataaaact ctacaggaca tcaaaaagaa accaatggct agagataaca 901 gtctgactga taaggcttct aagaaggtag ccttaaagga aacagccaac actagattgg 961 ccattgtcct acctgaacca cctagagtaa ctgataaatg cagaaaaaga aattaaatgg 1021 gctggtgatg gtcaactgaa ggtaagcgaa tattgcccac ttctcagaaa gatccacagt 1081 cactcacttg ggagtaaaat gaatgacaaa ctttaaagtt tgccagctga ccaacactca 1141 caggaagccc aaacatccaa attcctgact gcgagttaag agacttggag cctactggga 1201 aaattgattt ttttcagaaa tcaagcaaga aagatatggc tcaaaatatc tgctgatatt 1261 tgtagacatt ttttttcgag atagatggat agatgtacta ggagagatct ttccgatgtt 1321 tggagcacct aaggtaacag gatcagacaa tgggcctgtc ttcatatctc aggtaagtca 1381 gggacttgct aagatcttgg ggactaattg gaaactccat tgttcatatc atccccagag 1441 ttcagggcag gtagaaagga tgaatagaac tctaaaagag accttaacag aattagcctt 1501 ggagactggt ggggactggg tgaggctctt tccctttgcc ctatattagg tgtacatggc 1561 attctagctc ccattgtatc tagcctacag ttggtagcta ttacagaact gaaaaatgat 1621 aatttaagat ttaaggtcag agctaccaaa taggctcatg aatttgtttg ggcctaaatt 1681 atgtaccttc tgtgaagcag 
gcctggttcc agaaccacac aagtcaaaag agactgggtc 1741 tctatgaaga gatttcacca aggtgcgact aaacccatgt gaaaatggca attcatcatc 1801 ctgttgatca tgatcaccac ctgggtgtac aacaaccaca ccagaccagt tcctccaatg 1861 aagaactctg cctggctcca gctgtaccaa aatagagggt tcaaaaggac accaagccct 1921 tcaagttaaa gttgactcag tctcagtcct gagtctcttg cccctgctaa ctctatgtct 1981 atatatactg tatgtcttag atccccccct gttaggaagg taccctagct ggatccttga 2041 taattttact tttatttctg acttttggcc cctgtatttt aagttgctta gtagtttata 2101 agagaattca gtcaagttaa ttatcttaag gcaacactat ctacagctgg aagcagggaa 2161 gcaagcatat gagttagaag actataagct tcaagatcaa agctatgcta aaagaaaagg 2221 ggggaatgaa aagccagagt tggggtcaat ctgaggccaa tgagaaaaac ccaccattaa 2281 catccaagca cagaacgacc cttctcttcc agaaagagta aagctagttt agttcctgga 2341 acagctacaa gccaaactgt tgaacaaagc cacatgtaac tccccatcca acctccagaa 2401 agtcccagaa tggcacactg accacaagtc attttggagg ttacttcacc ccactaatag 2461 tagtactctt cctagttact gttgtgcaaa ttctgcccca attgtttgta aggtatatac 2521 agacccagtt agagtctgct cagggtcttc tctttctgaa agggagtcaa ccccgacgca 2581 ttaaaataaa gctagtcttg gttttgcatt gattagcacc tccttgagtc tcactcaagg 2641 ggtcccggaa agggtcagat tagacctcat atacctctga gcacagcttg tatggtgact 2701 aagatacagg atacccacag gctgggatta gagagtttaa accaaagatc tttcatccat 2761 gtgctccatg cctgccctgt gcccaggggg aaacatggat tctaattaca gaagcctccc 2821 taaggatctt aatgggaacc aagtaggaga cttttccagt tagaagcctt ctgacaactg 2881 gggtttcccc atattggtag tttaggttgt tatttcacaa aactacaatt ccttcaccaa 2941 ctggagttct gagttattct cctctagtct ggaaaatgat ctgctaaaat atagctgtgg 3001 ttttctaccc ttttcaaagc catacataga cagggaaggt tgcccatcct tccctgaagt 3061 tgaagatcct tttagaagtc aatgcaccca tcagtggtga taaatgcctt taatcccagt 3121 atgcagcaaa ctctgtgagt ttgacgccaa attggtctac agagtgtgtt ctagaacagt 3181 cagagctaaa gagagaaaca ctctgtggaa aagaaagaaa gaaaaaagaa aggaaggaaa 3241 gaaggaagga aggaaggaag gaaggaagga aggaaggaag gaaggaaaaa gaaaggaagg 3301 aaggaaaaag gtacagagag agggaaagag ggagggagaa aaataacata tatgaagaca 3361 cagtacagga ccaatctggg ctcaggtgcc 
cactttagtc tcctactgga attttcatcc 3421 acttgtacca gaaactcagc acccacagat ccttcttgcc atgtgacctt ccagtccata 3481 gtttggaatc tttcctgttt tccttactaa tatttttctc ctaataaaaa gactaaacca 3541 tctagactct aggactccag agatgactct gtgggtaaga gcacttgttg ctcttgtaga 3601 agacccaggt tttattccta gaacccacat ggtggcttac aaccatgtgt gaccccattt 3661 ccaaaggatt cctctaaatc ttttggcatc tttggacagt gtgcccaatt gttacccaga 3721 cttaaatgga agagaaacct tcatattaca taaaaaatta acacaataag atccataaag 3781 aaatatataa aagaagataa tcttttttaa aaaggataca ttggatacat tgccaggcct 3841 ggagagatgg ctcagtggtt aagagcactg acagctcttc cagaggtcct gagttcaaat 3901 tcccagcaac cacatggtga ctcacaatta tctgtaatgg gattcaatgc ccactactgg 3961 tgtgtctgaa gacagtgaca gtgtactcat atacatgaaa gaaataaatg aatcttgaga 4021 acaacattta atgcctgaag ccatgtttcc tgtattgttc cagtcaaatc taagaatgtg 4081 aattctatca cagaccacaa catttacatc tatgagggct ttcttcatga gctcaaccat 4141 acaaatatag atttttagtt ctagatttga tctggtggac ccagaaatgg acagcctcct 4201 gataatagcc acagtcccca atacagcacc aatctcatca tagcatttga agagtgtatt 4261 atatgttggc ctgttcacct tgtccactaa aaccctcagc ttggtccacc aaagcctctt 4321 tgactgcatt gtatcaacac accaaagcac accgaggctt taagaatttt gaagtaagcc 4381 tgccacccag aggtcttcct atttgcccat gttgtgggtg ttgcaacaaa gacagggtca 4441 gtgttaggag ataggattgg agggtaaaag actcaactag acaaacagga gcaaaggcca 4501 tcctgtgtcc ctgggagtat aaaggtacta tctcagcctt ggctatcagc ctgtcaatcc 4561 tcactggcca ctatgctgga cacaggactg cttctggtgg tcatattggc ctccctgagc 4621 gtcatgctct tggtgtccct ctggcagcag aaaatcaggg ggagattgcc tccaggaccc 4681 actcctttgc ctttcattgg aaattatctg cagctgaata caaaagacgt atacagttcc 4741 atcacacagg tatcactgga tgaggggatg gatgggacat gggagcacaa gaggctgtga 4801 tgttttgcat gttttgtggc agaagattca tagaggaatc caaagtcttg tattagtgga 4861 gtttagaaag ataaggagct atttcaagtc tttggtttgt tgtttgttgt ttgttttgtt 4921 tgttgtttgt ttctttgttt ctttgttttt tcaatcattt atttgtagag taacacataa 4981 tctgacctct gtgtactggt ccagttcagt gaataagtca tctaacagcc cccatctacc 5041 ccacatcagc tcagtgagcg ctatggtcct gtgttcacca 
tccaccttgg gcctcgccgg 5101 gttgtggtgc tttatggata cgatgcagtc aaagaggctt tggtggacca agctgaggag 5161 ttcagtggac gaggcgaaca ggctacctac aatacactct tcaaaggcta tggtgaggag 5221 gataccacat tggggaacat gcccaaggac atttgttggc gtcatttaag tagccttcat 5281 actaactcat ctctccctca aggctgtaca gagttctctg aatttctctc catatccatg 5341 ttgaatgttg gctctcattg tgaccctccc tagcatttct gagattgaaa acagactttt 5401 gcaaattctg tgggttcttt cttccatcct tctctaccgt tttcttccgc cctttctacc 5461 acctatcact agataggaaa gaaaaggaga tagaggtgaa aggggacatt actgttagat 5521 tatttcctgc tgattaggag tgacgagctc cttagggaaa gttttatctt ctctgtcagg 5581 atatctaatt tcttcttgtt gttatttctt tacataagac tacttaacaa atcacaagca 5641 acagcaacta accaatagcc aaaaccaatt tctcagggtc cttgcattta cacaaccttg 5701 aggagtccca gtatcctgag tgtcacacac tctcagaaac tatctgcagc tggcaaaatc 5761 ataacctcct gctttggaca acctgaacca gccccatatg ccatacctgg gagtaaacag 5821 aaacatattt ctataatagt tctgtatttt tcaaagaaat caaatttctt actacatctg 5881 gccattgctg ctcttctctc tctctctctc tctctctctc tctctctctc tctcacacac 5941 acacacacac acacacacac aaacacacac acacaaacac acacacacaa acacacacac 6001 acaaacacac acacacacaa acacacacac acaaacacac acacacgcac gcacacacac 6061 acaacctctc ggcattctcc tagatggatg actccttttt aatttagctg atatttttat 6121 ccttcttaaa catttatcca cacacagagc atcagttgca ggtctcaggc attcactcct 6181 gatgcctctg gattggtttt ttagattctt tgttcttact tttccatcta tgggtgctgg 6241 gctctcaagc acatctctgc acagtgtgtg tgcctggtgc ccatggaagc aaaaagatgg 6301 agtcagatct cctgaactcc aggggttccc tgagttccag ggttatgagc tgccaggtga 6361 gtgctggggt acaagcacag gtcctctgca aggtcagcca gtgctcttga gtgcagagcc 6421 agctttgctg ccccccactg cctatatttt taaatgctgt tttacatact ccatgtgttg 6481 tccctaagat gtgtataatg cttatagaac gtcacagtct ggtaagtgct ggccaaagct 6541 acagaagtat aaaatggcct tgaacagcaa aacactggtt ataagcaaga aaggtcaaaa 6601 taaagagaaa atccacaaag agccaaatat ctttataaca ttaattctgt agttaaaatt 6661 taacacagag agtgtatctc gttccttgaa gaactgaagg acacacaaat gactacttct 6721 acctagggtc aaaatatagc ggtgactaca gctcaagaca cacaaaacca 
gagtcaagaa 6781 tcagggagtg gtaataaaat aataaaaaat cctggctcag ggtttcttcc cacctttccc 6841 tgatgaaagg cacacacagc ctttatattt tagtctgcct tatgcagcac aatagctggg 6901 cagctgccta ccctccatgc tgttagaatc cattttccta ttgaaagccc caagttaata 6961 ctttacaagt ttctttatac catatttgct attcttgacc caactgagga gcccttttgg 7021 ccacactgtc ttggcccata gcacatggtg tctctccttc taccttctgc tctttcttct 7081 tccatggctt ccacagaggc tcctcaatcc cattctcctt cctcatgctc tctagcccca 7141 gaaaactaag caccacaagt ctcttctccc agctattagc tgctgacatc tttatttacc 7201 aatcagaatg aactgcgggc aggatcactc agacaaacta cagactccaa atcttagagg 7261 ccaacactta ctgttatagg aaacaataaa agacaaaaac ctcaacacca gggtatgttt 7321 ctgggtaggc tgtccttgct ttaatgggga tttgctgttt tcagaaaatg ctcaatattg 7381 attgattttg ccatttccag gaccctttgc tgcattctgt ctgtaagtct ctttttattt 7441 gcctggctga cttgtttcaa ctttctttct ctgactgtgt ctgatgcaca gtctgtgttt 7501 gtgtcttttg tgtccttgcc atttctatcc aactttgtct cttttctttc ccccttagaa 7561 cccctttcca gggtgggcct catccatcct cagcctcagt ctacttctcc tgacccctta 7621 tatttatatc tctacaggcg tggcattcag cagtggggag cgggcaaaac aactcaggcg 7681 cctctctata gccacattga gagattttgg tgtgggcaag cgtggtgtag aggagcgtat 7741 cctggaggag gcaggctatt tgatcaagat gttgcagggc acttgtggta agcaagagac 7801 cattaagtgt ttgggcaaga gaaagaacat ccctgacacc tagaccctat gggttgtgga 7861 taagaagggc ggggaagacc gcctaccaaa ccatccccag aatctggtgc tgagagattg 7921 gtgcctcact ccaattccca caccatctgc taactcttct ccctcataat gccaatgtct 7981 tccaaacaat gtcacccctc tcaggagccc ccattgaccc caccatctac ctgagcaaaa 8041 cagtctccaa tgttattagc tccattgtct tcggggaacg cttcgactat gaggacacgg 8101 agttcctgtc actgctgcag atgatgggtc aaatgaacag atttgcagct tcacccacag 8161 ggcaggtaac agatccagct ctgccaattg tccttatagt gtcccacatt gaccatacca 8221 acaaagggca aggaccaccc tgactctcat ggctacaaac aaaagctccc ctcaaaaaca 8281 gaagctcccc tcaaaaccag cctttacttc agaaaactga acctttacat cagagcccac 8341 agaagctatc cagtgctcac aatctaatgt cctctggata tctcagtagc ctgagaacac 8401 agccctctgc ttgactctct tccctgggca ggtttctcca gcttaacctc taataaatcc 
8461 tctatgtggt cctcctgaaa atttagacaa ctgcccaagg gatacaagtg accacctctg 8521 gccccctcct ccaatcctga acacctacct agttctgcaa aactgtggtc agtaaagcta 8581 ttcagtccat acacccagtt ctccccaaag atcccactga cacaatggca caaaagtcac 8641 ctgttgtctc aggtaaattc aggaatgagt agacaggcac ctcaaccaag gcaaccaagc 8701 acagacctct ggatggactg tttccccaaa cacccatatg tctcccagct acacacaacc 8761 cacatcaaga caatatctga caggtgtgtc tcacacctta taacctgaac caccccacca 8821 tgaagacctg actatgtgaa aaaccgattc taatctcaaa caaatatcaa gacatctaat 8881 cttagccctc tcaaatgccc aaacatatag atacttgatt cactgcgaca ctcatgtcct 8941 gaatactaga aacctggagt aatggtctga tccaaaaatc agttaaataa ctgaatgtct 9001 actaatgttc ccttttgatc cagttcattg ggattgtaag acaatgacct tcattcttta 9061 aatcacctag aaaactgtgg tctctggggc ctctgacagt tcagtggttt aagagcatgc 9121 actgctcatc ctgaggaccg agttcagttc ccactaccta tgctgaacat ttcaaaactc 9181 tatgggagta cacctgcacc gtgcacataa ttaaaagtaa aatattcaaa cgaatataaa 9241 gagttctttc aagagtggag gtgctgtttg ttgcaattca tcctaacata aatacatgaa 9301 cacctggatg gatcccttga gactcgaccc actcccacgg gtgttgccac tgacaagcct 9361 tttcttttct cctcccaccc cccagctcta tgacatgttc cattcagtga tgaagtacct 9421 gcctggacca cagcaacaga tcatcaaggt tactcagaaa ctggaagact tcatgataga 9481 gaaagtgagg cagaaccata gtaccctgga ccccaattcc ccaaggaact tcattgactc 9541 ctttctcatc cgcatgcaag aggtgatccc aatcatggtg gatggaatgt ctaaaacagg 9601 gcagctctaa atcatcctag aaaaggagga ggaatatagg cccattaagt gcccatgatt 9661 ctcctcacag tcccggttat agttaaacct cactctttca cctgttgagc cttatccaag 9721 ccagggtatg ggttagcaaa ttaccatgac aaccgatatt ccagtgttcc cctatgagac 9781 actgttttca gtgttcaact acttagcatg cactgaagct actgtcgaag accctgtgga 9841 gcctaaactt cgcaaagagg gaaagtgtgc ccagacttgc atgctgactt tatggagaca 9901 gaaaactata cagccttgcc tctatggctc tcaggctttt actattagcc acatggtctc 9961 tagcatttca tatctctgtt aggaaataca catcagtaca catcagtggc ctaagacctg 10021 ggtttttttt tcttttgtct gttctagtaa tttttttatt gtttttcatt tttgtgtttt 10081 tttcttttat tggatttttt atttctattt cagatattat cccctttctt ggtttccctt 10141 
ccagaaacct gctatctcct catgcttcta tgaggattct ctcccaccca cacaacactc 10201 cctgccacct ccctgtgctg acattcccct acactggggc atcgagccca gacaggacca 10261 agggtctctc ctcccattga tacccaacaa ggccatcctc tgttatacat atggctgaag 10321 caataggtac atccctgtgt actcttggga tggtttagtc actgggagct ctggtgggtc 10381 tggttggtta atattgttgt tcttcttata gggtggcaaa ccccttcagc tccttcagtc 10441 ctttctctaa ctcctccata tgggaccatt ttctcagttc aatggttgac tgcaagcatc 10501 tgcctctgta attgtcacgc tctgcagagc ctctcaggag acagctatat gaggatcctg 10561 tcaacatata tttcttggca tccacaatat tgtgtgagtt tagaggatgt caatgggatg 10621 aatccacctg tagggcagtc tctgaatggc ctttccttca gactctgctc caaactttgt 10681 ctttgtattt ccttctttga gtatttttgt tccccctttc aagaaggact gaagcatact 10741 cacttgagtc tttcttcttc ttgagtttca tgtggtctct gaattctatc ttgggtattc 10801 caagtttttg gactaatatt tacttctcag tgagtgcata ccatgtgttg ggttacctca 10861 cttaggatga tattttttag ttccatccat ttgcctaaga atttcatgaa gtcattattt 10921 ttaatagcag tgtagtactc cattgtgtaa atttactata ttttttgtat atatttctct 10981 gttgaagaac atctagtttc tttccagctt ctggctatta taaataaggc tgttatgaac 11041 atagtggaga gtgtgtcttt gttatatgtt ggagcatctt ttgagtatat gcccaggaat 11101 ggtatagctg agtcctcaca taatactatg tccaattttc tgaggaacct ccaggatgat 11161 atccagagtg gttgtatcaa attacaatcc accaacaatg gaggagtgtt actctttctc 11221 cacatcctta ccagcatctg ttgtcacctt cgtttttgac ctttgccatt ctaactggtg 11281 tgaggtggaa tctcagagtt gttttgattt gcatttccct gatgactaag gaggttgaac 11341 atttctttag gtacttctca accatattcc taagctgaga attctttgct tagctcttta 11401 ctccattttt aatggggtta tttgattctc tggagtctaa cttcttgagt tctttgtata 11461 tatttaacat tagccctcta tcggatgtgg gattggtaaa gatcttttcc caatctgttg 11521 gttgtcgatt tgtcctaatg acagtgtcct ttgccttaca gaagctttgc aactttatga 11581 agtagtattt gtcaattctt gatcttagag cataagccat tggtgttttg tttaggaaac 11641 tctccctggt gcccatgtgt tcaagaccct ttcccacttt ctgttctatt agttccagtg 11701 tatctggttt tattttagtt taattttatt tttcttggat aattatgtat tacacatcaa 11761 atgttattcc ctttgtcccc tctctcatat ccccttcccc tccctctgcc 
tctatgggga 11821 tgctaccacc cccatccacc cactcccacc tcaaccccct agcattccct tacattgaga 11881 aaaagagcct tcactagacc aagggctttt cctcctattg atgctggaca atgccatcct 11941 ttgctacata tgcagctgaa gccacgggtc cttccatggg tacgctttgg ttggtggttt 12001 aggccctggg agctctcgtg gagtctggtt ggttagttga tattattctt ccatccctaa 12061 aatgaatgac agtcacctag acagagaaat gagcaaagct tctcatgcaa acccaagact 12121 gctaacacag cctggagatc tttttccaac gattggtctg gaccctatga gaactagatc 12181 caaaggaaat tgcagaagtg ctgcctattg catccctctc ctccatgagg aacttaatcc 12241 acagttgacg gctgtttaga gacgatgaaa taatattcct ttgcagtgtg gctactagta 12301 aattgacctt tctcaagtaa agaacccctc gcccatatgc atgcagccac acctaattat 12361 aagcagttac ccacaacacc cccaacaaac aggaaaatag gaaggagact tattaggaat 12421 aagaaatggt tcaaaaaaat ggaaagtaga aaataataga ggggaatacg tttaaagtgc 12481 atttcatgta tacgtctgaa aaataaggac tcaaggttca gtgggtatgg aaggggattc 12541 atctgggagg gtttggagga ggggtatgaa tatattcaca atacaataaa tgaaattctc 12601 aaagaattaa taaaattatt tataaaagaa ttactagaaa tgtttcagaa aattaaaacc 12661 cttaatgttc cccaaggatg acaaaatgat agatttatgc cctctctctc tctctctctc 12721 tctctctctc tctctctctc tctctctctc tctctgcagg agaaaaatgg caattcagag 12781 ttccacatga agaacctagt gatgacaaca ctaagcctct tctttgctgg gtctgagaca 12841 gtcagctcca cactacgcta cggcttcctt ctactcatga agcatccaga tgtggagggt 12901 gaggctggct atgtggcagg gaagttggga accgcagact ctccaactgc ttacaaccta 12961 acaatgaccc tcacttctcc caggttcctg gatgctcagt catgctcagc tatgcagaga 13021 caggggcata ttaaatgcat aaacacagtt ctcacaaact taaaatatta gacattccca 13081 aattgatttc actctgactt ccagatctct gctctctgtt ctcttccctg actcctgctt 13141 cttctcccca ccatgattct gtcacgaaaa ggataaaatg accctgtcca gcatttaggt 13201 atggatatat gtttaaatgg tttaaatgca tgttatttac agagacatgt aatacatgca 13261 gtggtacaca tgtgaactat tccacctgct ttgaggcctc tggattttta aaaatacccc 13321 atctccgctt gtctttcagc caaggtccat gaggaaattg agcaggtgat cggcaggaac 13381 cgacagcctc agtatgagga ccacatgaag atgccctaca cccaggctgt gatcaatgag 13441 atccaaagat tttctaactt ggctcccttg 
ggcattcctc gaaggattat caagaacaca 13501 accttccgtg gcttcttcct ccccaaggta gcagccatgc ccatccagga ggggcctcca 13561 gcccacttac tgatgcttca gggcttcttt ccatctgtag ctatctaact ccactctaat 13621 tcctccaacc aaagaattca tccacatgtc cccaaattct tgtccagctg ctttgaactc 13681 cattttctat ctactcttct gccttgctac cttccaatct ctcaactcct gggctagagg 13741 caaaggcctg ctgtcacact aacaccctat cttagcacat gatcccctgg agctcaaatc 13801 tccaattgct gatggcacat atcgtagccc ctcaaatctc ctattcccta atgccttttc 13861 ctgaggagac ctccaactct gtgccttgca gttgtctata tttggacatc ctttctccat 13921 caacccatct tctaaaatct cctttcttcc ctcttccagg gcaccgatgt gttccctata 13981 ttaggttctc tgatgacaga cccaaagttc ttccctagcc ccaaagactt cgacccccag 14041 aacttcctgg atgacaaggg acagttgaag aaaaatgctg ctttcctccc tttctccact 14101 ggtaaggaga cagtgggtta ttgaactact gttcacacca acatgggtag cacatgccag 14161 cttccctgtc tgtgatgctg cctagaatca ggctaaccag gtatagcccc tgcacctccc 14221 aagcaccaga catgctggat gcaggtgaga ggatccctgg gaccagtgat ctgtgtcaga 14281 gaccggggag gggttgggaa taccaacttt cctaggtgat gctcatgcaa gcaatttctt 14341 cacactcttt ctaatgcagc ttttaaataa ttgtttgttt ttctttattt tttaagtaat 14401 ttatttaatg tgcaatggtg tgaggttgtc agatgccttg gaactgaact tatagatgat 14461 tatgagctgc catgtggctg ctgagaattg aaccttggat cttcagaaga acagacagtg 14521 ctcttaacca atgagccatc tcccagcccc atcttcagac tcttaaaagt gggataacaa 14581 ccaggtggta taggtgcatg cctttaacca cagtactggt ggatatctga gttcaacacc 14641 agcctgggac tatagagtga gttacaggac aacccaggct acatggagga aaccatgact 14701 tcaaaaacta aaaataaata aataataggt aggtagatag atagatagat agatacatac 14761 atacatagat acatagatac atagatacat agatacatag atacatagat acatagatac 14821 atagatacat agatacatag atagatgcat agatagatac atagatagat agagacatag 14881 atagatgcat agatagatac atagatagat agatgcatag atagatagat acatagatag 14941 aaagatgcat gtatacatac atgcatgcat acatagataa atagatgact cataaaaaat 15001 taaaagaata aaaaaataaa caaggccaca gcagagcatc tacatttgag aggataatta 15061 ataattgata gaggaagcat ctgtactcca tattgctcca gcctaaaatg agttgtccca 15121 cgttgtgtgt 
agggacacca gggttttaag agggttagga gcctttccta atgatccctc 15181 atgctccagt atagcagccc cttctccttt tttttttctt tttttcttta ttaacttgag 15241 tatttcttat taacatttcg agtgttattc cctttcccgg tttccaggcc aacatccccc 15301 taatccctcc ccctcccctt ctttatgggt gttcccctcc ccaccctccc cccattgccg 15361 cgctcccccc aacaatcaca ttcacagggg gttcagtctt agcaggacca aggacttccc 15421 cttccattgg tgctcttact aggctattca ttgctaccta tgaggttgga gtccagggtc 15481 agtccatgta tagtctttag gtagtggctt agtccctgga agctctggtt ggttggcatt 15541 gttgttcata tggggtttcg agtcccttca agctcttcca gttctttctc tgattccttc 15601 aacgggggtc ctattctccc acccttcccc cactgccgcc ctccccccaa caatcacgtt 15661 cactggggct gaaccccatt tttaataggg ttatttgtct ccctgcggtc taacttcttg 15721 agttctttgt atattttgga tataagccct ctatctgttg taggattggt aaagatcttt 15781 tcccaatctg ttggttgccg ttttgtccta accacagtgt ctttgcctta cagaagcttt 15841 gcagttttat gagatcccat ttgtcgattc ttgatcttag agcataagcc attggtgttt 15901 tgttcaggaa attttctcca gtgcccatgt gttcaagatg cttccccact ttttttccta 15961 ttagtttgag tgtatctggt ttgatgtgga ggtccttgat ccacttggac ttaagctttg 16021 tacagcgtga taagcatgga tcaatctgca ttcttctaca tgttgacctc cagttgaacc 16081 agcaccattt gctgaaaatg ctatcttttt tccattgaat ggttttggcc cctttgtcaa 16141 aaatcaagtg accataggta ggtgggttca tttctgagtc ttcaattcta ttccattgat 16201 ctatctgtct gtctctgtac caataccatg cagtttttat cactattgct ctgtaatact 16261 gcttgagttc agggatagtg attccccctg aagtcctttt attgttgagg atagttttag 16321 ctatcctggg ttttttgtta ttctagatga atttgcaaat tgttctgtct aactctttga 16381 agaattggat tggtattttg atggggattg cattgaatct gtagatcgct tttggtaaaa 16441 tggtcatttt tactagatta atcctgccaa tccatgaaca tgggagatct ttccatcttc 16501 tgaggtcttc ttcaatttct ttcttcagcg tcttgaagtt cttattgtac agatctttta 16561 cttgcctggt taaagtcaca ccaaggtatt ttatattatt tgggactatt atgaagggta 16621 tcgtttccct aatttctttc tcggcttgtt tctcttttgt gtagaggaag gcaactgatt 16681 tatttgagtt aattttatac ccagccactt tgctgaagtt gtttatcagc tttagtagtt 16741 ctctggtgga acttttggga tcacttaaat acactatcat gtcatctgca aatagtgata 
16801 ttttgacttc ttcttttcca atctttatcc ccttgatctc cttttgttgt ctgattgctc 16861 tggcttgaac ttcaagaact atattgaata agtagggaga gagtgcagcc ccttctcttt 16921 aagagaacac agctttgcac ttggcactga ggcaaggcag cggtgagagc ttccttccca 16981 actgtgctcc ttccctctct cctcttcagg gaagcgattc tgcttgggag atggcctggc 17041 taagatggag ctcttcctgc tgctcaccac tattttacag aacttccgtt tcaagttccc 17101 aatgaaacta gaagacatca acgagtcccc caaacccttg gggtttacca ggatcatacc 17161 aaagtacacc atgagcttca tgcccatctg attctgagtt gaatcaaggt ggggcaagag 17221 ggagagagag cctgaagtgg ggccagggtg caggtggaga gaacagggga ggtgaagatg 17281 agggttaaga agggaccaca cccatggaag aaacacaaaa gacttctcac tttggtaaaa 17341 ttgtaacagt cctaataaaa agaaagaaat actcagtggg cagcagtaac aacaactgag 17401 actcatgggg caaaggtggc tcacctctgc agaagctgtc ctgtccttct ctcagtcctc 17461 tacacaagag cagcatgtcc ccaagtccaa cgtacaggtt gcaaagatgg aacttacaaa 17521 tttgaaccta aactgaggtg gaaaaaactc aagttagcta ggattgatgt tttggactct 17581 atcaccagca ttcaggaggg agggaacatg gctctctacc atgtctgcca ggactacaca 17641 gtgagagctt atctcaaaag aaaaaagaaa aaaagaaaaa aatttatata tatatatata 17701 tatatgtata tatatgtata tatatatatg gagagagaga gagagagaga gagagagaga 17761 gagagagaga gagagagaga gagtttgcat tgtacatgat cagggaaata ataaaaacta 17821 gtttgacagt cacataccag tgggttctaa tttatcaaac tccaccccca cccccactgc 17881 cactgctgcc ctatgaagga actgaacaga agcttaactt tccttgggcc atttcgacag 17941 ctgttgtgtc atcaaggctt ctgttttcct atggagacac tacacatggg acagagagga 18001 taacagggag ctcatgactg agagaccttc aggccaaagc acttgaacct ttgtttatcc 18061 tgtttattct gaattttctg cttctgggct ctcatttccc caccattaaa atgagaatat 18121 caatatttac agctgcactg catctctttt tggagtgatt cctggtaact aagaaataag 18181 tagaaaatgg aaggatgaaa tccaccagga ggtttgagta aattccactg tgggaaacac 18241 aggggactgt gggatggcaa ggatgagagc tggaaagaat gcaaggccac actatgtctc 18301 atgcatattt tatatctttt ttatattctt tatatctttg tagtgttttt attagcctac 18361 aaagaaatac atttctcact ggcaacttct tacatatata tcactaccta tgttctcatt 18421 cactttcctt cgctggtctt ggcctcttcg caaaattatt 
caccggtaat ttattcacac 18481 tttctaattt ttgagcatgg tgcattccag taagatttaa tctctgtggc catggtgttt 18541 cacagctctg taacactgaa gcacattcat catcaactgc actgaagtca tcaacttaag 18601 aagcaaagga ggattcttct ggtctccatc tgcgcccaga gctaagtctg ccccacaacc 18661 ctccagattc aaaacctccc cagacagagc tagtcctcca ggagtgctct cactactaag 18721 gccacaagtg agaccccatt tcccttcaat accgatccaa agaggagccc accagatacc 18781 aggtaccaaa gttaaatgag gatccgttga cctgcaggtc // LOCUS RATCYP2A21 7247 bp ds-DNA ROD 12-JUL-1990 DEFINITION Rat hepatic steroid hydroxylase IIA2 (CYP2A2) gene, exons 1 and 2. ACCESSION M33313 KEYWORDS LINE repetitive sequence; cytochrome P450; hepatic steroid hydroxylase IIA2. SEGMENT 1 of 3 SOURCE Rat (strain Sprague Dawley) DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 7247) AUTHORS Matsunaga,T., Nomoto,M., Kozak,C.A. and Gonzalez,F.J. TITLE Structure and in vitro transcription of the rat CYP2A1 and CYP2A2 genes and regional localization of the CYP2A gene subfamily on mouse chromosome 7 JOURNAL Biochemistry 29, 1329-1341 (1990) STANDARD simple staff_review FEATURES from to/span description pept 5556 5732 hepatic steroid hydroxylase IIA2 (CYP2A2), exon 1 6198 + 6360 hepatic steroid hydroxylase IIA2, exon 2 pre-msg 5528 > 7247 CYP2A2 mRNA and introns IVS 5733 6197 CYP2A2 intron A IVS 6361 > 7247 CYP2A2 intron B rpt < 1 649 LINE repeat rpt 1120 2122 LINE repeat rpt 5852 6037 dre repeat rpt 5852 5863 5' direct repeat rpt 6026 6037 3' direct repeat BASE COUNT 2306 a 1563 c 1530 g 1848 t ORIGIN 1 aactatcctc aacaataaaa ggacttctca gggaatcact atccctgaac tcaagcagta 61 ttacagagca atagtgatta aaaactgcat ggtattggta cagagacata cagatagacc 121 aatggaatag aactgaagac ccaaaaatga acccaagcac ctatggtcac ttgatttttg 181 acaaaggaac caaaaccatc caatggaaaa aagatagcat tttcagcaaa tggtgctggt 241 tcaactggag gtcagtatgt agaagaatgc agatcaatac attcttatca ccctgtacaa 301 agcttaagtc caaatggatc 
aaggacctcc acatcaaacc agatacactc aaactaatag 361 gagaaaaagt ggggaagcat ctcgaacaca tgggcactgg agaaaaatcc ctgaacaaaa 421 taccagtggc ctatgctcta agatcaagaa tcgacaaatg ggatttcata aaactacaaa 481 gcttctgtaa ggccaaggac actgttgtta ggacaaaacg gcaaccaaca gattgtgaaa 541 acatctttac caatcctaaa actgatagag gctcatatcc aaaatataca aagaactcat 601 gaagttagag tgcagggaga caaataaccc tattaaaaaa tggggttcat gggtgtagat 661 ctctcctgag agacacaccc agaatacagc atattcatat gcgaatgcca gcagcaatcc 721 actgaactga gaatgggacc cccgttgaag gaatcagaga aaggactgga agagcttgaa 781 ggggctcgag accccatatg aacaataatg tcaaccaacc agagcttcca gggactaagc 841 tattacccaa agactgtaca tggagtgacc ctgggctcca actgcataag tagcaatgaa 901 tagcctagta agagcacagt ggaaagggaa gcccttagtc ctgccaagac tgaaccccca 961 gtgaatgtga ttgttggggg gaggacagta atgggtggag gatggggagg ggaacaccaa 1021 tatagagggg agggggagga gttaggggga atgttggcct ggaaactggg aaagggagta 1081 acaatcgaaa tgtaaataag aaatactcaa gttgataaag ataaaaaaaa agtgaggttc 1141 agagctaaac aatgaattca cagctgagga atgccaaatg gctgagaagc accaaagaaa 1201 tgttcaacat ctttagtcat aagggaaatg caaatcaaaa caaccctgag attctacctc 1261 acaccagtca gaatggctaa gatcaaaaac tcaggtgaca ccaaatgctg gcgaggatgt 1321 ggagaaagag gaatactcct ccattgttgg taggattgca gactgctaca accattctgg 1381 aaatcagtct ggaggttcct cataaaattg gacatagatc tacctgagga cccagctcta 1441 cctctcttgg gcatataccc aaaagatgca ccaacatata acaaagacac atgctccact 1501 gtgttcatag cagccttatt tataatgggc agaagctgga aagaacccag atgcccttca 1561 acagaggaat ggatacagaa aatgtagtac atctacacaa tggaatacta ctccgctatc 1621 aaaaacaatg actttatgat attcataggc aaatggatgg aactcgaaaa tatcatcctt 1681 agtgaggtaa cccaatcaca gaaaaacata catggcatgc actcattggt aagtggatat 1741 tagcccaaat gctcaaatta ccctagatgc acagaacaca tgaaactcaa gaaggatgac 1801 caaaatgcgg atgcttcact ccttctttaa aacaggaaca agaataccct tgggagagga 1861 tagggaggca aagtttagaa cagaggcaga acgaacaccc attcagagcc tgcccacatg 1921 tggcccatac atatatagcc accaaactag ataagatgga tgaagcaaag aagtgcaggc 1981 tgacaggaga tctatgtaga tagatctctc ctgaaagaca 
cagccagaat acagcaaata 2041 cataggcgaa taccagcagc aaaccactga actgagaatg ggaccctgtt gaaggaatta 2101 gagaaaggac tgaatgttgt tgtaaaaata taaaaataaa gagtaatgtt ggtcttttac 2161 cccgctaggt atcttggcgg aaacacatcc cagccacgca ctttcctaca ctcaaaccct 2221 cacataaaag aacacacaac acaataatct ttgacccaat tggtaagata taattgccta 2281 cttaaacata caaagcccgg taccatccat cccttgagaa cattaataac aatttgtaaa 2341 tacacagagc agaatcttaa catcaccagc tatcttgtcc tgccacggct tctccgcccc 2401 tctctccctc ctgtctcttc ctctctccct tagtctcctc ctcttcctta aaacttctct 2461 cccgcccatc cttccttctc ctccaatgac aggcctcctt ctatcctgta cctgcccctc 2521 accagtactt tacaaattca gtggagaggt ggttctggtg aagtcacctg agttctgagt 2581 ccttgactag gcagctgtcc ttggggcagt ggaattagca tcaaaataca gtaacttcag 2641 ggcaaaccag aataactgaa agagcttgaa ggggcttgaa accccatatg aacaacaatg 2701 tcaaccaacc agagcttcca gggactaaga ctatacaagg actgaccctg ggctccaact 2761 gcataggtag caatgaatag cctagtaagg ccaccagtgg aaggggaagc ccttggtact 2821 gccaagactg aacccccagt gaatgtgatt gttgggggag gacggtaatg gggcgaggat 2881 ggggagggga acaccaatat agaggggagg gggagaggga gggggatgtt ggcctgaaaa 2941 cctggaaagg gaataacaat tgtaatgtaa ataagaaatg gctcagtggt taggagcact 3001 gactgctctc ccataggttc tgagttcaaa tcccagcaac cacatggtgg ctcacagcca 3061 tttgtatggg atccgattcc ctcttctggt gtgtctgaag acagcaacat tgtacttata 3121 aatgaataaa caaataaata aatctttaaa aaaaagaaat acccaattta ataaagatgg 3181 agaacaaaaa acaagaagat acattgctag ggctagagac atggctcagc agttaagagc 3241 actgactgct cttccagagg tcctgagttc aattcccaga accacatgat ggctcacaac 3301 aatctgcaat gggattcaaa gatcacttct ggtgtgtcta aagacagtga caatgtactc 3361 atatacatga aagaaagaat gaaatcttta aaactttcaa aagctgaaga catgctccct 3421 atattattcc aggcaaatcg aagaatttga attctatcac aaactacaat actcacatca 3481 atgagggttc ttttcatgtg ctcaaccaca caaatgtaga tttttagtta tggatttgat 3541 ctggggaacc tagacatgga cagtctccag ataatgccca cagttaccaa tacagcatcc 3601 ctctcaacat agcctttgaa gagtgtgcta taggttggcc tgttcacctc atccacttaa 3661 ctcctcagct tggtctccca aagcctcttt gactgcatcc cattcataaa 
ggaccacaac 3721 ccagtgaggc tttaagaatt ttgaagtact ggcagcagcc tatgccctgg ggacccctga 3781 gcatctcacc agttccaggt cggagactcg gctacatacg atggcaccga acccagatac 3841 tcactggaaa ggaccgtacc tggtgctgct gaccaccctg acagccatca actctcagcc 3901 ctcaccagcc gtgtactagc tgttggggct gagagctggg acctagagct gggaccagtt 3961 cttcaaaaag ctccctagac ttaatttcat gtttgccccg ggttttatca agataggtgt 4021 ggggataggc ttgatttcta ttacaaatga tgtaacattg catatgttag tactcctaac 4081 acttcttggg actgtgcctc agggatcaca atctgtataa gtttagaagt tctaaaagct 4141 agtcatgacc ttggtgtgta ggtttagata gtgtccagat tggaatcctg atgctaaaga 4201 cttagtaaga cacaaaaaaa ggagttgaga attacttagg gctaaggcta tctaggtgct 4261 gcaagggcag cacaaggaca tctgctgttg caatgcaagg cttatagaga attcagaact 4321 gccatttagg agtaattaaa gactccatga ataaacttag agaaaggtta gacaaaaggc 4381 agacagagaa gcgcatcagg gatggtttga gagctggttt agtagatctc cttggatgac 4441 tactctggta ttttccctta tgggaccctt cttagttttg cttctgcttc tgattatagg 4501 tccatgtgtg ttagagaaac tagttaatag gtttgactcc tacaaaaaga tagagacgct 4561 caacaaggtt ggtttgagtc ttggttcact cggtctccct ggatgactac cctactctct 4621 gctatatggc tgggccatta ctaataattt tcttggtttt agtttttgga ccctgcgtga 4681 caaacaggtt aattgctttt gttacaaatc gagtgagtgc tgtgcggttg gttctgagac 4741 aacagtacca gtcagttagg acaactggtg agaccaaata agagacttga tatcaaaatt 4801 ctaagattag aattacttag tagaagaaga ggggaatgaa aggaaaatta tacagattta 4861 aggtttaaaa atatgaagtt aaaagagtat gtttcaactc aggactaaac actgtgaaaa 4921 gcaagtccag gcagccccgc cctgccgcta gaactaacag accataaaag gaaaggaatg 4981 cagaacagac caggagtacc ggatctgact cacaggccac ctggcaggaa gagataagcc 5041 cccagccccc gacatccagg acgccccaaa cctgccaatg tgtgtagcta taccttatta 5101 cctcatcatg tgaaatagcc aatcatatgt gaacatgtct atgtgcctcg tttgaatcca 5161 ccaatccccg taactatgca tctgcttctg tacgcccgct tctgcttccc caatccctat 5221 aaaagcccca tgctggagct gctgggcgcg caagtcctcc gaagagactg tgtgcctgca 5281 ggtacctgtg ttttccaata aaccctcttg ctgattgcaa aaaaaaaaaa aaaaaaaaaa 5341 aaaaaaaaaa gaattttgaa gtaagcctgc cacctttctt cctatttgcc catgttgtgg 
5401 gtgttgcaac aaagactggg tcaatgttag aaaatagggt tgggaggcaa aagactcaac 5461 tagacaaaca ggagcaaagg ccatcctgtg tccctgggag tataaaggta ctatctcagc 5521 cttggctatc agtctgtcca tcctcactgg ctactatgct ggacacagga ctgctcctgg 5581 tggtcatact ggcctcccta agtgtcatgt tcttggtgtc cctctggcag cagaaaatca 5641 gggagagatt gcctccagga cccactcctt tgcctttcat tggaaattat ctgcagctga 5701 atatgaaaga cgtatacagt tccatcacac aggtatcact ggatgagggg atggatggga 5761 catgggagtc caagaagctg ggttgttttg catgttttgt ggcagaagat tcatagagta 5821 aatccaaagt cttgtattca tggagtttag aaagataagg agcgggctgg agagatggct 5881 cagcggttaa gagcaccatg tgctcttcca aaggtcctga gttcaaatcc cagtaaccac 5941 atggtggctc acaaccatct ataatgagat ctggtgccct cttcttgtat tcttaatcat 6001 aataaataaa taaatctaaa aaaataagat aaggagctat ttcaactctt tggtttgttg 6061 tttgcattcg tttgtttgtt tgtttgtttg tttgtttttc aatcatttat ttgtagaata 6121 acacataatc tgacctctgt gtactggtct agttcagtga ataagtcatc taacagcctc 6181 catctaccca acatcagctc agtgagcgct atggtcctgt gttcaccatt caccttgggc 6241 ctcgacggat tgttgtgctt tatggatacg atgcagtcaa agaggctttg gtggaccaag 6301 ctgaggagtt cagtggacgt ggcgaactgc ctacctttaa tatactcttc aaaggctatg 6361 gtgaggagga taccacattg gggagcatgc ccaaggacat ttgttggcct catttaagta 6421 gccttcatcc taactcatct ttcccctcaa ggctgtacat agtcctctga tttttctctc 6481 catattcaag ttgaatgttg cttcttattg tgacccttcc tagtctttct atgattctct 6541 gtgggtgctt cctttcattc ttcttcaccc ttttcttcca ttctttaacc ctcataatac 6601 taggtaggag ataaaaagag atagaggaaa aaggggacac tattgttaga ctacttcctt 6661 ctgagaggta atgagttcct tagggcaagt ttgatcatct cagtcaggat atctaatttc 6721 ttcttcctgt tgttactttg cacaaggcga cttaacaaag cacagccaac agcaaccaac 6781 caacaaccaa aaccaatctc tcaaggccct tgcattaaaa taacctctga ggaatcccca 6841 gtatcctaag ggtcacactc tcagaaacta tctgcagtag gcaaaatcat acccctgcta 6901 gagcacaaaa taaatcatag gtctctgctt tggacaatct gattcatccc catattgcat 6961 acctggaatt aaaaaaacat attcctataa tatttctgta tttgtcaaaa aaaaacaaaa 7021 ttcttttttt tttatcttta agtaatactc caactttatt gaataaagga ataaatggag 7081 
ttttcaagtt ttcccatcat ggttattttt aaagccacct gatacatgac agtacttatc 7141 aaaacaagat gtttatctat ttttgtcatt tgtatttttg cttaatttta tattcataat 7201 atatttaaat taactaatag ttcatggtaa cacttggcca cacaggt // LOCUS RATCYP2A22 4753 bp ds-DNA ROD 12-JUL-1990 DEFINITION Rat hepatic steroid hydroxylase IIA2 (CYP2A2) gene, exons 3,4 and 5. ACCESSION M33325 KEYWORDS cytochrome P450; hepatic steroid hydroxylase IIA2. SEGMENT 2 of 3 SOURCE Rat (strain Sprague Dawley) DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 4753) AUTHORS Matsunaga,T., Nomoto,M., Kozak,C.A. and Gonzalez,F.J. TITLE Structure and in vitro transcription of the rat CYP2A1 and CYP2A2 genes and regional localization of the CYP2A gene subfamily on mouse chromosome 7 JOURNAL Biochemistry 29, 1329-1341 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 1815 1964 hepatic steroid hydroxylase IIA2 (CYP2A2), exon 3 2182 2342 hepatic steroid hydroxylase IIA2, exon 4 3567 + 3743 hepatic steroid hydroxylase IIA2, exon 5 IVS < 1 1814 CYP2A2 intron B IVS 1965 2181 CYP2A2 intron C IVS 2343 3566 CYP2A2 intron D IVS 3744 > 4753 CYP2A2 intron E BASE COUNT 1258 a 1204 c 911 g 1380 t ORIGIN About 5 kb after segment 1. 
1 aatcaatagt ttttaagcta ctaacccttt ctagagatga tgaaaataga aaactggaag 61 aatgcctagg tagcaaatga ccttggaagt tagggactaa aaatttaagt ccacatctgt 121 gcaagataaa aattaactct tagtttgcat aagctcttat ttttttcata agtcttattt 181 gtttttttat ctttattaac ttgagtattt cttatttaca tttcgattat tattcccctt 241 cccagtttct gggtggatga ctccttttta acttagctga tatttttatt cttcttaaac 301 atttatccac acacagagca tcagtcgcag gtctgaggca caccctgcta gtgcctctgg 361 attgttttta aagatcattt gctcttactt ttctatctat gggtgttttg cttatgtgta 421 tatgtgtaca caagtctggt gcccatggaa gcaaaaagat ggagtcagat ctcctgaact 481 ccaggggttc catgagttct ataagctgtc aggcgagtgc tggggttcaa gcacaggtcc 541 tctgcaaggt cagccagtgc tcttgagtgc agagccagct ttgctgtcca tccccccgcc 601 cccgcgcatg tatttttaaa tgttgtttta catatgtcat gtgttgtccc taagatgtgt 661 ataatgctta tagaacatta cagtctggta agtgctggcc aaagttacag aagtataaaa 721 tggccttgag cagcaaaaca ttggttataa gcaagaaagt tcaaaataaa gagaaaatcc 781 acaaagagcc aaatatcttt ataacattaa ttctgtggtt gcgatttaac accaaggggg 841 tatctgtttc cctgaactaa ggggcacaga aatggctact actacttagg gtcaaaatag 901 tgactacagc tcaggacaca taagcaaaac cagagccaaa gaccagggag tggtaataaa 961 ataataaaaa atcctggctc agggattcgt cccacctttc cctggtgaaa gacacacaca 1021 gcctttatat tttagtctgc cttatgcagc acaatagctg ggcagctgcc taccctccat 1081 gttgttagaa tccatttccc tatcaatagc cttgagttga tactttacaa atttccatat 1141 tccatttttg ctgttcttaa cccaatttaa cagccttctg ggccacaatc tcttggccct 1201 tagcacatgg tatctctcct ttgcccttct tctctttctt cttccttggc ttccacggaa 1261 gctcctcggt cccattctcc ttcctcatgc tctagccaag gaaacctaaa cccctcctat 1321 gtcccttctc cccagctatt agctgctggc atctttattt accaaccaaa gtaaatgggg 1381 gcagagtccc ccaggctaag ggcagattcc aaatcttaga aggcagcacg aagcagtata 1441 gtaaacagta aaagaaaaaa acgcaacacc agagtacgtt tctatgtatg ctgtccttgc 1501 tttaatgtgg agtttctgtt ttcagaaaat gctcaaattt ggttctttta gccatgtcag 1561 cgacctggag cagcattctg agtctctctg cttctgtctg taactctctg tttccttgcc 1621 tggctgactt gttccaactt tcttactctg actgtgtctg ctgcagagcc tctgttcgtt 1681 tcttcagtgt tcttgccatc 
tcaatcccat ctttgtctct tttctttcct ctaagaaggc 1741 ctttccagca tgggcctggg ccttcctcag cctcagacta cctcacccca acacccatgt 1801 tcatgtctct acaggttttt cattgagcaa tgtggaacag gccaagcgta tcaggcgctt 1861 caccatagcc acattgagag attttggtgt gggcaagcgt gatgtacagg agtgtatcct 1921 ggaggaggca ggctatttga tcaagacgtt gcagggcact tgtggtaagc aagagaccat 1981 taagtgtttg ggcaagagaa agaacatccc tgacacctag accctatggg ttgtggagaa 2041 ggaggacggc gaagaccgcc taccaaacca tctccagaat ctggtgctga gagattggtg 2101 cctcactcca attcccacac catctgctaa ctcttctccc tcataatgcg aatgtcatcc 2161 aaacaatgtc acccctctca ggagccccca ttgacccttc catctacctg agcaaaacag 2221 tctccaatgt cattaactcc attgtcttcg ggaaccgctt cgactatgag gacaaagagt 2281 tcttgtcact gttggagatg atcgatgaaa tgaatatatt tgcagcctca gccacagggc 2341 aggtaaaaga ttccagctct gccaattgtg cttataatgt cctacattgg ccataccgac 2401 aaagggcaag gactacccca acgctcatgt ccacaaacat tcccctcaaa aacagaagct 2461 cccctcaaaa ccaaccttta ccttcagaaa actgaacctt tacatcagag cccacaggag 2521 ctatccagtg ctcacaatct aatgacctct ggatatctca agggcctgag aacaaagccc 2581 tctgcttggc tctcttccct gggcaggttt cccccgctta aattctgaca gatcctctgt 2641 gtggtcgtcc tgaaagttga gacacctgcc caagggagac aagtgatcac ctcaggcccc 2701 ctcctccaat cctgagcacc tacctggttc tgcaaaacta tggccagtaa agtcattcac 2761 actggacaca ctgctctccc aaaagatctc actggcacca tgacacgaga gtcacctgct 2821 tgtctcaggt aaattcagga atgagtagac aggaacctca accaaggcaa ccaagcacag 2881 acctctagat ggactgtttc cccaaacacc catacgactg ccaaccagcc acacacagtc 2941 caattcaaaa aggtctgaca ggtgtgtccc acaccttata acccgaacca tcttatcctg 3001 aatactttac tatgtggaaa acagattcta atctcaaaca aatatcaaga gatctaaatt 3061 cagccttctt tggtgcccaa acatctaaat acttgagtca ctgtgataac cctggcctga 3121 acacaggaaa cctggattaa tggtctaatc aaaaaatcaa ttgaatagtt gaatgtctgc 3181 taatgtcccc ttttgatcca gctcatccag attgtaggac aatgaccctc attctttaaa 3241 tcaactagaa aattgcagtc tctggggctt cagactgttc agtagtttaa gagcatgtac 3301 tgctcatcct gaggacctga gttcagttcc cagtacgtat gctggacatt gcacagctca 3361 aggggagtac acctgcactc gtgcacataa 
ttaaaagtaa aatattcaaa tgaatataaa 3421 gagttctttc aagagtggag gtgctgtttg ttgcaattca tcctaacata aatacatgaa 3481 cacctggatg aatgacttaa tacaagtgcc actcccactc aatgttgcca ctgacaagcc 3541 ttttcttttc tcctcccacc ccccagctct atgacatgtt ccattcagtg atgaagtacc 3601 tgcctggacc acagcaacag atcatcaagg ttactcagaa actggaagac ttcatgatag 3661 agaaagtgag gcagaaccat agtaccctgg accccaattc cccaaggaac ttcattgact 3721 cctttctcat ccgcatgcaa gaggtgatcc caatcatggt ggatggaatg tctaagactg 3781 agcagctgga aatcacccta gaaaaggagg aggaatataa gcccattaag tgcccatgat 3841 tctcctcaca gtcccggtta tagttaaacc tcactctttc acctgttgag ccttatccaa 3901 gccagggtat gggttagcaa attaccatga caaccgatat tccagtgttc ccctatgaga 3961 cactgttttc agtattcaac tacttagcat gcactgaagc aactgtcgaa gaccctgtgg 4021 agcctaaatt tcgcaaggag ggaaagtgtg cccagacttg catgctaact tcatgcagac 4081 agaaaactgc ttgcctctat ggctctcagg attttactat tagccacctg gactctagca 4141 tttcatatct ctgttagaaa atacatatca atacacaacc ctgaactggg caacctgggt 4201 tgttgtattt tttcttctat tatctgctct agtaattatg tattgttttt tattttaatg 4261 ttgtttttct tttttttttc atctttatta aattgaagat ttcttattta catttaaatt 4321 gttattcccc ttcccggttt ccaggccaac attctctaac ccctcccctt ccccttctat 4381 atgggcttcc ccttcatatc ctccccccat taccaccctt cccccaacaa tcacgttcac 4441 tgggtgttca gtcttggcag gacccggggc ttccccttcc actggtgctc ttacaagcct 4501 cattgcttcc tatgaggttg gagcccaggg tcagtccatg tgtagtcgtc gggtagtggc 4561 ttagtccctg gaagctctgg ttgcttagca ttgttgttca tatagggtct cgaccccttc 4621 aagctcttac actcctttcg ctgattcctt caacgggggt cccgttctca gttcagtggt 4681 ttgctcctgg catttgccta tgtatttgct gtattctggc tgtgtctctc aggagagatc 4741 cgttgacctg cag // LOCUS RATCYP2A23 5080 bp ds-DNA ROD 12-JUL-1990 DEFINITION Rat hepatic steroid hydroxylase IIA2 (CYP2A2) gene, exons 6,7,8 and 9. ACCESSION M34392 KEYWORDS cytochrome P450; hepatic steroid hydroxylase IIA2. SEGMENT 3 of 3 SOURCE Rat (strain Sprague Dawley) DNA. 
ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 5080) AUTHORS Matsunaga,T., Nomoto,M., Kozak,C.A. and Gonzalez,F.J. TITLE Structure and in vitro transcription of the rat CYP2A1 and CYP2A2 genes and regional localization of the CYP2A gene subfamily on mouse chromosome 7 JOURNAL Biochemistry 29, 1329-1341 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 1322 1460 hepatic steroid hydroxylase IIA2 (CYP2A2), exon 6 1876 2063 hepatic steroid hydroxylase IIA2, exon 7 2496 2637 hepatic steroid hydroxylase IIA2, exon 8 3636 3817 hepatic steroid hydroxylase IIA2, exon 9 pre-msg < 1 4588 CYP2A2 mRNA and introns IVS < 1 1321 CYP2A2 intron E IVS 1461 1875 CYP2A2 intron F IVS 2064 2495 CYP2A2 intron G IVS 2638 3635 CYP2A2 intron H BASE COUNT 1470 a 1191 c 970 g 1449 t ORIGIN About 15 kb after segment 1. 1 gaattctttg tatatattgg acaatagccc tctatcagat gtacaattgg taaagagctt 61 ttcccaatct gttggttgtc gttttgtcct aataacagtg tcctttgcct tacagaagct 121 ttgcaatttt atgaagtccc atttgttgat tcttgatctt agagcataag cccttggtgt 181 tctcttcagg aaattctccc atgtgccctg tgttcaaggc tcttacccgc tttctcttct 241 attagtttca gtgcatctgg ttttatttta attttgtttt atttttcttg tatatttttg 301 tacttacact tcaaatgcta tctcctttgt acattctctg atatctcctc cctgtcccca 361 tgcttctatg aggatgctct cacttccacc cacccactcc cacctcaatg ccttgacatt 421 cacctacatt ggggaaatgg gcctttactg gaccaaggac ttttcctcct attaatgatg 481 gacaatgcca tcctctgcta tatatacagc tgaagccatg cttccctcca tttgtactct 541 ttggttgggg gtttagtctc tgggagctct gagggaagag tctggttggt tgataatttt 601 gctcttccag ccatgaaatg aaagacagtc acctatacag agaaacaagc aaagcttctc 661 ctgcaaacca aagattccaa acacaacctg gacattgctt ttccaaccat tggtctggac 721 actttgagaa ctagatacaa agaaaattcc agaagtgctg ccacttgggt ccatttctga 781 ggaatttaat ccacagttga tggctgctta gagatgatga aatcatattc ctttgcagtg 841 tggctactag taaattgccc tttctcaagt gaagaaccac tcacccatat gcatgcagcc 
901 acacctaatt ataagcagat ctccccccaa ataaaaacag gaaaatatga ggaagactta 961 ttagaaatta gaaatggttc aataaaataa aaatagagat aatggagggg aatatgttta 1021 aggtgcattt cacatatatg tctgaaaaat gaagactcaa gattcagtgg gtatggaatg 1081 ggattcatct gggagggctt gagggagggg tgtgaatgta ttcacagtac aataaatgaa 1141 attctcaaag aaataataaa aatatttata caataatgac tagaaatgtt ttagaaaatt 1201 aaaaccctta gtgttcccca aaaggagtac aaaatgataa atagatttgc gttctctctc 1261 tctctgtctc tgtctctgtc tctgtctctc tgtctctctc tctctctctc tcccccccca 1321 ggagaaatat gttaattcag aattccacat gaacaaccta gtgatgtcat cattaggcct 1381 cctctttgct gggactgggt cagtcagctc cacgctatac catggtttcc tgctactcat 1441 gaagcatcca gatgtggaag gtgaggctgg ctgtgtggca aggaagttgg gaaccccaga 1501 ttctccaacc tgacaatgac cctcacctct cccagatccc tggatgctca gacatcctga 1561 ctatgcagac acagaggcat attaaatgca taaacagagt actaagttaa aatattaaac 1621 attctgaaat tgatttccca ctgactgcca gatccctgtt ctctgttccc tgacttctcc 1681 ttctccccac catgatttgg tcatgaaaag gataaaatga tcctggccag catttaggta 1741 tggatgtatg tatagatggt ctaaatgcat gtttacagag acatgtaata catacagtgg 1801 tacacatgtg aactattcca catgctttga ggtctctgga tttttagaaa cagcccatct 1861 tcctttgtct tccagccaag gtccatgagg aaattgagcg agtgatcggc aggaaccgac 1921 agcctcagta tgaggaccac atgaagatgc cctacaccca ggctgtgatc aatgagatcc 1981 aaagattttc taacttggct cccttgggca ttcctcgaag gattatcaag aacacaacct 2041 tccgtggctt cttcctcccc aaggtgcagc caggcccacc caagtagggg cctccaaccc 2101 actccctgat gcttcagggc ctctttccat ctacagccat ctaactcaac tctaattcct 2161 ccaaccaaag aattcaccca catgtcccca acttcttgtc acactgcttt gaactccaag 2221 ttctatctga tcttctgcct tactactatc caatctctca actcctgggc taacacacta 2281 acacattatc tcagaacatg attcccctag agctcaaatc tccaatttct ggtggcacgc 2341 atcacagccc ctcaaaactc ctattcccta atgccctttc ctcaggagac ccccaactct 2401 gtgcctttcc gttctcttca tttggacact agcaccactt ggggtccttt ctccatcaac 2461 ccatcttctc aaatttcctt tctttcctct tccagggcac cgatgtattc cctataatag 2521 gttctctgat gacagaacca aagttcttcc ctaaccacaa agacttcaac ccccagcact 2581 
tcctggatga caagggacag ttgaagaaga atgctgcatt tctccctttt tccattggta 2641 aggagacagt gggttattag accactgctc ataccaacag ggataactca tgccagttcc 2701 catctctgtg attctgccta gcatcaggct aaccaggtac aatccctgca cctcccaagc 2761 accacgactc aggtcaaagt atcaatgaga tcagtgatct ctttcagaga ctgggaagcg 2821 gttcagaaca ccaaatttcc caggtcatgc tcatgcaagc aatttcttca tactcttttt 2881 aaagcagttt taaatgattt ttttgttatt ttttaataat tcatctaatg tgcattggtg 2941 tgaggttgtc agattcatta gaactggact tatagacatt ttatctgcca tgtgggtgct 3001 gagaattgaa ccttggttct tcagaagagc agacagtgct cttaaccagt gagccatctc 3061 ccagccccat attcaaattt taaaagggga taacaaccag gtggtggtgg tacatgtctt 3121 taaacccagt actcaagaag cagaagcagg tggatatcta agttcaatgc cagctggatc 3181 tatagagtaa gttagaagaa aacccagact aaatggagga aaccctgact taaaaaacta 3241 aaaataaata aataatagat agatagatgc atgcatgtat acatacatat atgcatacct 3301 acatgcatgc atacatagat acatagatga ctcagagata attagatgaa taaataaata 3361 aacaagacca cagcaggcat ccacatctga gaataaaatt aataattggt agaggaagca 3421 tctggactcc atattgcttc agcctacaat gagttgcccc actttgtgtg tagggacact 3481 ggggttctga gagggttagg aacctttcct aatgatcact catgctccag gttagcaccc 3541 cttttcccta agagaacaag gctgctcact gggtactgag ggaaagaagt gagatcttgc 3601 tccaagtctg tgctccttac ttctctcctc tttaggaaag cgattctgct tgggagatag 3661 cctggctaaa atggagctct tcctgctgct caccaccatc ttgcagaact tccgttttaa 3721 gttcccaatg aatctagaag acatcaacga gtaccccagt cccatagggt ttaccaggat 3781 cataccaaat tacaccatga gcttcatgcc catctgattc tgagttgaat caaggtgggg 3841 caagagggag ggagagcctg aagtggggcc agggtgcagg tggagagaac agagaagatg 3901 aagatgaggg ttaagaaggg accacaccca tggaagaaac acaaaagact tctcagtttg 3961 gtaaaattgt aacagtccta ataaaaagaa agaaacaccc agtaggcagc agtaacaaca 4021 actgagactc atggggcaaa ggtggctcac ctctgcagaa gctgtcctgc ccttctctca 4081 ctcagtcctc tacacaagag cagcatgtcc ccaagcccaa cgtacaggtt caaaagatag 4141 aacttaaaaa atttgaacct aaactgaggt ggaaaagaca cagttagcta ggattgacac 4201 attggactct atcaccagca ttcaggaggg agggaacatg gctccctagg aggcctgcca 4261 gaattacaaa 
gtgaaactca tctcaaaaaa ggaacaacag aaaataaaat ttcaaattga 4321 tttctcttag accataagag tccagatctg tatccaaagc tatttggtta tattttttgt 4381 tattgttgtt ttgtttacac attgtgtttt tctttcggtt tgtaagtctg tttgggatat 4441 ttaatttaca tttactgatt agtgtgggtg gtagggcata ccatggctca aatgtggaaa 4501 ccaaagaaaa gcttttggaa gtgtcatctc ccttacaata cgtgtgtcca agaactcaaa 4561 ttcagacaat aaagcttgat agcaagcact tctacctact gagacatcta actggccaat 4621 ttagggagtt tattttaatt tatttactta ctaatttata tgaatataag tcctctatct 4681 gcatggccac ctgcgtggca gacgaaggca tcagatcact ttacagaagg ttgagtccac 4741 ccagtggtgg atggaaattg aactcaggac ttctagaagc cgtcaaattt tgagccacct 4801 cttcaacccc ttaaacaagt ttcttaaggt caccctttcc tcaaatgaaa caacaaggac 4861 ttggaatatt ttaacataac ctgagtcctc ctacctgagg tgttgtttct acaagcctgg 4921 caggcaactg atctacctcc aacatacact ttccaacagt cttgctttct catccacacc 4981 ttaatcacct gacacctgtt ggcctcagcc cctgtgccag gtaagtccat tttgtctgac 5041 tcagtcagtc tgggagacaa aaatcccttt gacagaattc // LOCUS ECOUGRE 108 bp ds-DNA SYN 12-JUL-1990 DEFINITION Synthetic uteroglobin (UG) mRNA expressed in E.coli, 5' end. ACCESSION M34596 KEYWORDS uteroglobin. SOURCE E.coli DNA, clone pLE103-1. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 108) AUTHORS Miele,L., Cordella-Miele,E. and Mukherjee,A.B. TITLE High level bacterial expression of uteroglobin, a dimeric eukaryotic protein with two interchain disulfide bridges, in its natural quaternary structure JOURNAL J. Biol. Chem. 265, 6427-6435 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 91 > 108 synthetic uteroglobin binding 77 81 ribosomal binding site (put.) signal 10 26 phi-10 promoter BASE COUNT 37 a 23 c 19 g 29 t ORIGIN 1 gatccaaatt aatacgactc actataggga gaccacaacg gtttccctct agaaataatt 61 ttgtttaact ttaagaagga gatatacacc atggctgcag ccaagctt // LOCUS HCVCG3PE 12283 bp ss-RNA VRL 12-JUL-1990 DEFINITION Hog cholera virus polyprotein mRNA, complete cds. ACCESSION M31768 KEYWORDS envelope glycoprotein E1. 
SOURCE Hog cholera virus (strain Brescia), cDNA to viral RNA, passed in SK-6 cells. ORGANISM Hog cholera virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Togaviridae; Mucosal disease virus group. REFERENCE 1 (bases 1 to 12283) AUTHORS Moormann,R.J.M., Warmerdam,P.A.M., van der Meer,B., Schaper,W.M.M., Wensvoort,G. and Hulst,M.M. TITLE Molecular cloning and nucleotide sequence of Hog cholera virus strain Brescia and location in the genome of the sequence encoding envelope protein E1 JOURNAL Virology 177, 184-198 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Moormann,R.J.M., 01-FEB-1990, for release after publication. FEATURES from to/span description pept 361 12057 hog cholera virus protein precursor matp 2428 3538 envelope glycoprotein E1 (put.) BASE COUNT 3850 a 2559 c 3182 g 2692 t ORIGIN 1 agttcattct cgtgtacatg attggacaaa tcaaaatctc aatttggttc agggcctccc 61 tccagcgacg gccgagctgg gctagccatg cccacagtag gactagcaaa cggagggact 121 agccgtagtg gcgagctccc tgggtggtct aagtcctgag tacaggacag tcgtcagtag 181 ttcgacgtga gcagaagccc acctcgagat gctatgtgga cgagggcatg cccaagacac 241 accttaacct agcgggggtc gttagggtga aatcacacca tgtgatggga gtacgacctg 301 atagggtgct gcagaggccc actattaggc tagtataaaa atctctgctg tacatggcac 361 atggagttga atcattttga acttttatac aaaacaaaca aacaaaaacc aatgggagtg 421 gaggaaccgg tatacgatgt aacggggaga ccattgtttg gagacccaag tgaggtacac 481 ccacaatcaa cattgaagct accacatgat agggggagag gcaacatcaa aacaacactg 541 aagaatctac ctaggagagg tgactgcagg agtggcaacc acctaggccc ggttagtggg 601 atatatgtaa agcccggccc tgtcttttat caggactaca tgggcccagt ctatcataga 661 gcccctctgg agttttttga cgaagcacag ttctgtgagg tgaccaaaag gataggtagg 721 gtgacaggta gtgacggaaa gctttaccat atatacgtgt gcatcgatgg ttgcatcctg 781 ctgaagctag ccaagagggg cgagccaaga accctgaagt ggattagaaa tctcaccgac 841 tgtccattgt gggttaccag ttgttctgat gatggtgcaa gtgcaagtaa agagaagaaa 901 ccagatagga tcaacaaggg taaattaaag atagccccaa aagagcatga 
gaaggacagc 961 aggactaagc cacctgatgc tacgattgta gtggaaggag taaaatacca ggtcaaaaag 1021 aaaggtaaag ttaagggaaa gaatacccaa gacggcctgt accacaacaa gaataaacca 1081 ccagaatcta ggaagaaatt agaaaaagcc ctattggcat gggcagtgat agcaattatg 1141 ttataccaac ctgttgcagc cgaaaatata actcaatgga acctgagaga caacggtacc 1201 aatggtatcc agcacgctat gtaccttaga ggagtcagca gaagcttgca tgggatctgg 1261 ccagaaaaaa tatgcaaagg agtccccacc tacctggcca cagacacgga actgagagaa 1321 atacagggaa tgatggtagc cagcgagggg acaaactata cgtgctgtaa gttacagaga 1381 catgaatgga acaaacatgg atggtgtaac tggtataaca tagacccctg gatacagtta 1441 atgaatagaa cccaagcaaa cttggcagaa ggccctccga gcaaggagtg cgccgtgact 1501 tgcaggtacg ataaaaatgc tgacattaac gtggtcaccc aggccagaaa caggccaacc 1561 accctaactg gctgcaagaa agggaccaat ttttcttttg cgggtacagt tatagagggc 1621 ccatgtaatt tcaacgtttc tgtcgaggat atcttatatg gggatcatga gtgtggcagt 1681 ctactccagg atacggctct atacctagta gatggaatga ccaacactat agagagagcc 1741 aggcagggag ccgcgagggt gacatcttgg ctagggaggc aactccgcat tgccgggaag 1801 aggttggagg gcagaagcaa aacctggttt ggtgcctatg ccctatcacc ttattgtaat 1861 gtgacaacga aaatagggta catatggtac actaacaact gtaccccggc ttgcctcccc 1921 aaaaatacaa agataatagg ccccggtaaa tttgacacta acgcggaaga cggaaagatt 1981 ctccatgaga tggggggcca cctatcagaa tttctgctgc tctctctggt cgttctgtct 2041 gacttcgccc ctgaaacagc cagcgcgtta tacctcattt tgcactacgt gatccctcaa 2101 tcccatgaag aacctgaagg ctgtgacaca aaccagctga atttaacagt ggaactcagg 2161 actgaagacg tgataccatc atcagtctgg aatgttggca aatatgtgtg tgttagacca 2221 gactggtggc catatgaaac caaggtggct ttgttatttg aagaggcagg acaggtcgta 2281 aagttagcct tgcgggcact gagggattta accagggtct ggaatagcgc atcaaccacg 2341 gcattcctca tctgcttgat aaaagtatta agaggacagg tcgtgcaagg tgtgatatgg 2401 ctgttactgg taactggggc acaaggccgg ctagcctgca aggaagatca caggtacgct 2461 atatcaacaa ccaatgagat agggctacat ggggccgaag gtctcactac cacctggaaa 2521 gaatacaacc acaatttgca actggatgat gggaccgtca aggccatctg catggcaggt 2581 tcctttaaag tcacagcact taatgtggtt agtaggaggt atctggcatc attacataag 
2641 gacgctttac ccacttccgt gacattcgag ctcctgttcg acgggaccag cccattgacc 2701 gaggaaatgg gagatgactt cgggttcgga ctgtgtccgt atgatacgag ccctgtagtc 2761 aagggaaaat acaacacaac cttgttgaat ggtagtgcat tctacctagt ttgcccaata 2821 gggtggacgg gtgttataga gtgcacggca gtgagcccga caactctgag aacagaagtg 2881 gtaaagacct tcagaagaga gaaacccttt ccgtacagaa gggattgtgt gaccactaca 2941 gtggaaaatg aagatctatt ctactgtaaa tgggggggca attggacatg tgtgaaaggt 3001 gaaccagtga cctacacggg ggggccagta aaacaatgca gatggtgtgg cttcgacttc 3061 aatgagcctg acggactccc acactacccc ataggtaagt gcattttggc aaatgagaca 3121 ggttacagaa tagtggattc aacggactgt aacagagatg gcgttgtaat cagcacagag 3181 gggagtcatg agtgcttgat tggtaacaca actgtcaagg tgcatgcatt agatgaaaga 3241 ctaggcccta tgccatgcag gcctaaggag atcgtctcta gtgcgggacc tgtaaggaaa 3301 acttcctgta cattcaacta cgcaaaaact ctgaggaaca ggtattatga gcccagggac 3361 agctatttcc aacaatatat gctcaagggc gagtatcagt actggtttga tctggatgtg 3421 accgaccgcc actcagatta cttcgcagaa ttcattgtct tggtggtggt ggcactgttg 3481 ggaggaagat atgtcctgtg gctaatagtg acctacatag ttctaacaga acaactcgcc 3541 gctggtctac agttaggcca gggtgaggta gtgttaatag ggaacttaat cacccacaca 3601 gatattgagg ttgtagtata tttcttactg ctctatttgg tcatgagaga tgagcctata 3661 aagaaatgga tactactgct gttccatgct atgaccaaca atccagttaa gaccataaca 3721 gtggcactgc tcatggttag cggggttgcc aagggtggaa agatagatgg tggttggcag 3781 cggctgccgg agaccaactt tgatatccaa ctcgcgctga cagttatagt agtcgctgtg 3841 atgttgctgg caaagaaaga tccgactacc gtccccttgg ttataacggt ggcaaccctg 3901 agaacggcta agataactaa tggacttagt acagatctag ccatagctac agtgtcaaca 3961 gctttgctaa cctggaccta cattagtgac tattataaat acaagacctt gctacagtac 4021 cttattagca cagtgacagg tatcttcttg ataagggtac tgaagggggt aggtgagtta 4081 gatttacaca ccccaacctt accatcttac agacccctct tcttcatcct cgtgtacctc 4141 atttccactg cagtggtaac aagatggaat ctggacatag ccggattgct gctgcagtgt 4201 gtcccaaccc ttttaatggt tttcacgatg tgggcagaca tccttaccct gatcctcata 4261 ctgcctactt acgagttgac aaaactatat tacctcaagg aagtgaagat tggggcagaa 4321 
aggggctggt tgtggaagac caacttcaag agggtaaatg acatatacga agttgaccaa 4381 gctggtgagg gggtgtacct tttcccatca aaacaaaaga caggtacaat aacaggtact 4441 atgttgccac tgatcaaagc catactcata agttgcatca gcaataagtg gcaatttata 4501 tatctattgt acttgatatt cgaagtgtct tactaccttc acaagaagat catagatgaa 4561 atagcaggag ggaccaactt catctcgaga cttgtagccg ctctgatcga agccaattgg 4621 gcctttgaca acgaagaagt tagaggttta aagaagttct tcttgctgtc tagtagggtt 4681 aaagaactga tcatcaaaca caaagtgagg aatgaagtga tggtccactg gtttggcgac 4741 gaagaggtct atgggatgcc gaagctggtt ggcttagtca aggcagcaac actgagtaaa 4801 aataaacatt gtattttgtg caccgtctgt gaaaacagag agtggagagg agaaacctgc 4861 ccaaaatgcg gccgttttgg gccaccagtg acctgtggca tgaccctagc cgactttgaa 4921 gaaaaacact ataagaggat tttctttaga gaggatcaat cagaagggcc ggttagggag 4981 gagtatgcag ggtatctgca atatagagcc agagggcaat tattcctgag gaatctcccg 5041 gtgctagcaa caaaagtcaa gatgctcctg gtcggaaatc ttgggacgga ggtgggggat 5101 ttggaacacc ttggctgggt gctcagaggg cctgccgttt gcaagaaggt taccgaacat 5161 gagaaatgca ccacatccat aatggacaaa ttaactgctt tcttcggtgt tatgccaagg 5221 ggcaccacac ctagagcccc tgtgagattc cccacctctc tcttaaagat aagaaggggg 5281 ctggaaactg gctgggcgta cacacaccaa ggtggcatca gttcagtgga ccatgtcact 5341 tgtgggaaag acttactggt atgtgacact atgggccgga caagggttgt ttgccaatca 5401 aataacaaga tgacagacga gtccgagtat ggagttaaaa ctgactccgg atgcccggag 5461 ggagctaggt gttacgtgtt caaccgagag gcagttaata tatccgggac taaaggagct 5521 atggtccact tacaaaaaac tggaggagaa ttcacctgtg tgacagcatc agggactccg 5581 gccttctttg atctcaagaa cctcaaaggc tggtcagggc taccgatatt tgaggcatca 5641 agtggaagag tagtcggcag ggttaaggtc gggaagaatg aggactctaa accaaccaag 5701 cttatgagtg gaatacaaac agtctccaaa agtaccacag acttgacaga aatggtaaag 5761 aaaataacaa ccatgaacag gggagaattc agacaaataa cccttgccac aggtgccgga 5821 aaaaccacgg aactccctag atcagtcata gaagagatag gaaggcataa gagggtcttg 5881 gtcttgatcc ctctgagggc ggcagcagag tcagtatacc aatatatgag acaaaaacac 5941 ccaagcatag cattcaactt gaggataggg gagatgaagg aaggggacat ggccacaggg 6001 ataacctatg 
cctcatatgg ttacttctgt cagatgccac aacctaagct gcgagccgcg 6061 atggttgagt actccttcat attccttgat gagtaccact gttccacccc cgaacaattg 6121 gctatcatgg gaaagatcca cagattttca gagaacctgc gggtagtagc catgaccgca 6181 acaccagcag gcacggtaac aactacaggg caaaaacacc ctatagaaga atacatagcc 6241 ccagaagtga tgaaggggga agacttaggt ccagagtact tggacatagc tggactaaag 6301 ataccagtag aggagatgaa gagtaacatg ctggtctttg tgcccacaag gaacatggct 6361 gtagagacgg caaagaaact gaaagctaag ggttataact caggctacta ttatagtgga 6421 gaggatccat ctaacctgag ggtggtaaca tcacagtccc cgtacgtggt ggtagcaacc 6481 aacgcaatag aatcaggtgt tactctccca gacttggatg tggtcgtcga cacagggctt 6541 aagtgtgaaa agaggatacg gctgtcacct aagatgccct tcatagtgac gggcctgaag 6601 agaatggctg tcacgattgg ggaacaagcc cagagaaggg ggagagttgg gagagtgaag 6661 cctgggagat actacaggag tcaagaaacc cccgttggtt ccaaagatta ccattacgac 6721 ctactgcaag cacagaggta cggtatagaa gatgggataa acatcaccaa atcttttaga 6781 gagatgaatt atgattggag cctttatgag gaggatagtc tgatgattac acaattggaa 6841 atcctcaaca atctgttgat atcagaagag ctaccaatgg cagtaaaaaa tataatggcc 6901 aggactgacc acccagaacc aatccaactg gcgtacaaca gctacgaaac gcaggtgcca 6961 gttctattcc caaaaataaa aaatggagag gtgactgaca gttacgataa ctataccttc 7021 ctcaacgcaa gaaagctggg ggatgatgta ccaccctacg tgtatgccac agaggatgag 7081 gacttagcgg tagagctgct gggcttagac tggccggacc ctgggaacca aggaaccgtg 7141 gaggctggta gagcactaaa acaagtagtt ggtctatcaa cagctgagaa cgccctgtta 7201 gtagctttat tcggctatgt aggatatcag gcactctcaa agaggcatat accagtagtc 7261 acagacatat attcaattga agatcacagg ttggaagaca ccacacacct acagtatgcc 7321 ccgaatgcta tcaagacgga ggggaaggag acagaattga aggagctagc tcagggggat 7381 gtgcagagat gtatggaagc tatgactaat tatgcaagag atggcatcca attcatgaag 7441 tctcaggcac tgaaagtgaa agaaaccccc acttacaaag agacaatgga caccgtggcg 7501 gactatgtaa agaagttcat ggaggcactg gcggacagca aagaagacat cataaaatat 7561 gggttgtggg ggacgcacac agccttatat aagagcatcg gtgctaggct tgggaacgag 7621 actgcgttcg ctaccctggt cgtgaaatgg ctggcatttg ggggagaatc aatagcagac 7681 catgtcaaac aagcggccac 
agacttggtc gtttactata tcatcaacag acctcagttc 7741 ccaggagaca cggagacaca acaggaagga aggaaatttg tagccagcct actggtctca 7801 gccctggcta cttacactta caaaagctgg aattacaata atctgtccaa gatagttgaa 7861 ccggctttgg ctactctgcc ctatgccgcc acagctctca agctattcgc ccccactcga 7921 ttggagagcg ttgtcatact gagtaccgca atctacaaaa cctacctatc aatcaggcgc 7981 ggaaaaagcg atggtttgct aggcacaggg gttagtgcgg ctatggaaat catgtcacaa 8041 aacccagtat ctgtgggtat agcggtcatg ctaggggtgg gggccgtagc ggcccacaat 8101 gcaatcgaag ccagtgagca gaagagaaca ctactcatga aagtttttgt aaagaacttc 8161 ttggatcagg cagccactga tgaattagtc aaggagagcc ctgagaaaat aataatggct 8221 ttgtttgaag cagtgcagac agtcggcaac cctcttagac tggtatacca cgtttacgga 8281 gttttttaca aagggtggga ggcaaaagag ttggcccaaa ggacagccgg taggaatctt 8341 ttcactttga taatgtttga ggctgtggaa ctactgggag tagatagcga aggaaagatc 8401 cgccagctat caagcaatta catactagag ctcctgtata agttccgtga cagtatcaag 8461 tccagcgtga ggcagatggc aatcagctgg gcccctgccc cttttagttg tgattggaca 8521 ccgacggatg acagaatagg gcttccccaa gataatttcc tccgagtgga gacaaaatgc 8581 ccctgtggtt acaagatgaa agcagttaag aattgtgctg gggagttgag actcttagag 8641 gaggaaggct catttctctg caggaataaa ttcgggagag gttcacggaa ctacagggtg 8701 acaaaatact atgatgacaa tctatcagaa ataaagccag tgataagaat ggaaggacat 8761 gtggaactct actacaaggg agccactatt aaactggatt tcaacaacag taaaacaata 8821 ttggcaaccg ataaatggga ggtcgatcac tccactctgg tcagggtgct caagaggcac 8881 acaggggctg gatattgtgg ggcatacctg ggtgagaaac cgaaccacaa acatctgata 8941 gagagggact gcgcaaccat caccaaagat aaggtttgtt ttctcaagat gaagagaggg 9001 tgtgcattta cttatgactt atcccttcac aaccttaccc ggctgattga attggtacac 9061 aagaataact tggaagacaa agagattcct gccgttacgg tcacaacctg gctggcttac 9121 acatttgtaa atgaagatat agggaccata aaaccagcct tcggggagaa aataacacca 9181 gagatgcagg aggagataac cttgcagcct gctgtattgg tggatgcaac tgacgtgacc 9241 gtgaccgtgg taggggaaac ccctactatg actacagggg agaccccaac aacgttcacc 9301 agctcaggtc cagacccgaa aggccaacaa gttttaaaac tgggtgtagg tgaaggccaa 9361 taccccggga ctaatccaca gagagcaagc 
ctgcacgaag ccatacaaag cgcagatgaa 9421 aggccctctg tgctgatatt ggggtctgat aaagccacct ctaatagagt gaaaactgta 9481 aagaatgtga aggtatacag aggcagggac ccactagaag tgagagatat gatgaggagg 9541 ggaaagatcc tagtcatagc cctgtctagg gttgataatg ctctattgaa atttgtagat 9601 tacaaaggca cctttttaac tagagagacc ctggaggcat taagtttggg taggccaaaa 9661 aagaaaaaca taaccaaggc agaagcacag tggttgctgc gcctcgaaga ccaaatggaa 9721 gagctacccg attggttcgc agccggggaa cccatttttc tagaggccaa tattaaacat 9781 gacaggtatc atctggtagg ggatatagct actatcaaag agaaagccaa acaattgggg 9841 gctacagact ctacaaagat atccaaggag gttggtgcaa aagtatattc tatgaaattg 9901 agtaattggg tgatgcaaga agaaaacaaa cagagcaact tgaccccctt atttgaagag 9961 ctcctacagc agtgtccacc cggaggccaa aacaaaactg cacatatggt ctctgcttac 10021 caactagctc aagggaactg gatgccaacc agctgccatg tttttatggg gaccatatct 10081 gccagaagga ctaagaccca tccatatgaa gcatatgtca agttaaggga gttggtagag 10141 gaacacaaga tgaaaacatt gtgtcccgga tcaagtctgc gtaacgacaa tgaatgggta 10201 attggcaaga tcaaatacca gggcaacctg aggaccaaac acatgttgaa ccccggcaag 10261 gtggcagagc aactgcacag agaaggacac agacacaatg tgtataacaa gacaataggc 10321 tcagtgatga cagctactgg catcaggttg gagaagttgc ccgtggttag ggcccagaca 10381 gacacaacca acttccacca agcaataagg gataagatag acaaggaaga gaatctacag 10441 accccgggtt tacataagaa actaatggaa gttttcaatg cattgaaacg acccgagtta 10501 gagtcctcct atgacgctgt ggaatgggag gaattggaga gaggaataaa cagaaagggt 10561 gctgctggtt tctttgaacg caaaaacata ggggagatat tggattcaga gaaaattaaa 10621 gtagaagaga ttattgacaa tctgaaaaag ggtagaaata tcaaatacta tgaaaccgca 10681 atcccaaaaa atgaaaagag ggatgtcaat gatgactgga ccgcaggtga ctttgtggac 10741 gagaagaaac ccagagtcat acaataccct gaagcaaaaa caaggctggc catcaccaag 10801 gtgatgtata agtgggtgaa gcagaagcca gtagtcatac ccgggtatga agggaagaca 10861 cctctgttcc aaatttttga caaagtaaag aaggaatggg atcaattcca aaatccagtg 10921 gcagtgagct tcgacactaa ggcgtgggac acccaggtga ccacaaatga tctggagctg 10981 ataaaggaca tacaaaagta ctacttcaag aagaaatggc ataaatttat tgacaccctg 11041 actatgcata tgtcagaagt 
acccgtaatc actgctgatg gggaggtgta tataaggaaa 11101 gggcaaagag gtagtggaca gcccgacaca agcgcaggca acagcatgct aaatgtgtta 11161 acaatggttt atgccttctg cgaggccaca ggggtaccct acaagagttt tgacagggtg 11221 gcaaaaattc atgtgtgtgg ggacgatggt ttcctgatca cagagagagc tctcggcgag 11281 aaattcgcaa gcaagggagt ccaaatcctg tatgaagctg ggaagcccca gaagatcact 11341 gaaggggaca aaatgaaagt ggcctaccaa tttgctgata ttgagttttg ctcccataca 11401 ccaatacaag taaggtggtc agataacact tctagctaca tgccagggag aaatacaacc 11461 acaatcctgg ctaaaatggc cacaaggtta gattccagtg gtgagagggg taccatagcg 11521 tacgagaaag cagtagcatt cagcttcctg ctaatgtatt cctggaaccc actaatcaga 11581 aggatttgct tattggtact atcaactgaa ctgcaagtga aaccagggaa gtcaaccact 11641 tactattatg aaggggaccc gatatctgcc tacaaggaag tcatcggcca caatcttttc 11701 gatctcaaga gaacaagctt cgagaagctg gccaagttaa atctcagcat gtccgtactc 11761 ggggcctgga ctagacacac cagcaaaaga ctactacaag actgtgtcaa tatgggtgtt 11821 aaagagggca actggttagt caatgcagac agactggtga gtagtaagac tggaaatagg 11881 tatgtacctg gagaaggcca caccctgcaa gggagacatt atgaagaact ggcgttggca 11941 agaaaacaga tcaacagctt ccaagggaca gacaggtaca atctaggccc aatagtcaac 12001 atggtgttaa ggaggctgag agtcatgatg atgaccctga tagggagagg ggtatgagtg 12061 cgggtgaccc gcgatctgga cccgtcagta ggaccctatt gtagataaca ctaatttttt 12121 atttatttag atattactat ttatttattt atttatttat tgaatgagta agaactggta 12181 caaactacct catgttacca cactacactc attttaacag cactttagct ggaaggaaaa 12241 ttcctgacgt ccacagttgg actaaggtaa tttctaacgg ccc // LOCUS HUMC6A2A1 2159 bp ss-mRNA PRI 12-JUL-1990 DEFINITION Human alpha-2 collagen type VI, alpha-2 collagen type VI-a, and alpha-2 collagen type VI-a' gene, exons 6,5,4 and 3. ACCESSION M34571 KEYWORDS alpha-2 collagen type VI; alternative splice. SEGMENT 1 of 3 SOURCE Human leukocyte DNA, clone D1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2159) AUTHORS Saitta,B., Stokes,D.G., Vissing,H., Timpl,R. 
and Chu,M.-L. TITLE Alternative splicing of the human alpha-2(VI) collagen gene generates multiple mRNA transcripts which predict three protein variants with distinct carboxyl termini JOURNAL J. Biol. Chem. 265, 6473-6480 (1990) STANDARD simple staff_entry FEATURES from to/span description pept / 75 120 alpha-2 collagen type VI, exon 6 273 425 alpha-2 collagen type VI, exon 5 590 1042 alpha-2 collagen type VI, exon 4 1307 + 1345 alpha-2 collagen type VI-a, exon 3 pept / 75 120 alpha-2 collagen type VI-a, exon 6 273 425 alpha-2 collagen type VI-a, exon 5 590 1042 alpha-2 collagen type VI-a, exon 4 1307 + 1345 alpha-2 collagen type VI-a', exon 3 pept / 75 120 alpha-2 collagen type VI-a', exon 5 273 425 alpha-2 collagen type VI-a', exon 4 590 1042 alpha-2 collagen type VI-a', exon 3 1307 + 1345 alpha-2 collagen type VI-a', exon 2 pre-msg < 1 > 2159 alpha-2cVI mRNA and introns IVS < 1 74 intron E IVS 121 272 intron D IVS 426 589 intron C IVS 1043 1306 intron B IVS 1346 > 2159 intron A BASE COUNT 391 a 653 c 749 g 366 t ORIGIN 1 tgtccggacc ccagccagac tgctgtgaac tcttctgggc ccggggactg ccctgcctgc 61 cgtgtgcatt gcaggagtgt gacgtcatga cctacgtgag ggagacctgc gggtgctgcg 121 gtgaggcact gcccacggca gggtcggggc ccatgcaccg ggtggagggc gggagtgcag 181 cagggctggg tcatcgctgg gtcctgcatg tgcacgtgac cctagggtct gaggtctccc 241 ggtacccccc gatgaccctg ccaccccccc agactgtgag aagcgctgtg gcgccctgga 301 cgtggtcttc gtcatcgaca gctccgagag cattgggtac accaacttca cactggagaa 361 gaacttcgtc atcaacgtgg tcaacaggct gggtgccatc gctaaggacc ccaagtccga 421 gacaggtcac ggggcagggc gggtgcagca ttgcgggggg ccgcgggcgc gtgggaggcg 481 atgagatggg agaagtccag acgcgtccct ccaacgaggg cctctgcatg gctggggatg 541 ccccagaccc cgaggcctct ggcaacgacc tcacgcgtgc ggcttgcagg gacgcgtgtg 601 ggcgtggtgc agtacagcca cgagggcacc tttgaggcca tccagctgga cgacgaacat 661 atcgactccc tgtcgagctt caaggaggct gtcaagaacc tcgagtggat tgcaggtggc 721 acctggacac cctcagccct caagtttgcc tacgaccgcc tcatcaagga gagccggcgc 781 cagaagacac gtgtgtttgc ggtggtcatc acggacgggc 
gccacgaccc tcgggacgat 841 gacctcaact tgcgggcgct gtgcgaccgc gacgtcacag tgacggccat cggcatcggg 901 gacatgttcc acgagaagca cgagagtgaa aacctctact ccatcgcctg cgacaagcca 961 cagcaggtgc gcaacatgac gctgttctcc gacctggtcg ctgagaagtt catcgatgac 1021 atggaggacg tcctctgccc gggtgtacgt gtgggcgcgg ggcagtcagg ccgaggagca 1081 gcaggcccca gccgcgtcta gcgtgaccgc cagggacacc cctcacctga gggacgaatg 1141 tgcagcccaa ggatcttggg ctgtgggtgg gaaggggtcg gcctctcggg ctgcagggca 1201 gacgcgccag ctcgaccctg agcctgtcta ggcagatcag tgaacggccg ctgagggttc 1261 gctagggact gaccctggcc tggccggcct ctctcctctc ttccagaccc tcagatcgtg 1321 tgcccagacc ttccctgcca aacaggtaat gcagggacct gagccaccac cccagactag 1381 caaagcagcc ctggtgtcct tcctcctcga gggccgggct gggggagggg ccgtgcaggg 1441 acccgggggc ggcggacgac tgcggaggct gctccttagg gagatggccc caggatggca 1501 gcacagggga ggaggggctt ggggaaggca ggctcccagg aacgcaggaa cagcatcacg 1561 aggccatgag gtgggtgctg ctagcctggc gctgtgctcg gcatgtggcc actggtcttg 1621 aaggcccacc atgggcttgc agtctccctc agctgccgcc cagctcccat gggctggccg 1681 tgcatgtgcc accggaggaa gccctggatc agtgagtgaa accatcccgg ggtggaagca 1741 ctgacacccc ccagcaccag caggtcttgc tccaaccctg gcctgcctcg atcgagctgc 1801 agctgcggct ctcatctctg ggagtggggg agcccatgtc cggatgattg gcccagcgtg 1861 gtgtgaagct ggagctgggg gtgccgttca gctgctgctg gactggtgct gcccccatgg 1921 tgcactgctg caaccgttgc tgggcccaca ggaggtcccc gggggcggtt atgtagctga 1981 gtccccctca ttgagccgtc cccttccagg agtgtgaggg tggggatgcc atggagacag 2041 ggtgggaggg tccagactga gaggaccaca gggtaggaaa cctccaaggg tctgctggta 2101 ctaagtcagc ccttctcagc actcgggatc gcgatgtgcg atcgagagtc catggggag // LOCUS HUMC6A2A2 1348 bp ss-mRNA PRI 12-JUL-1990 DEFINITION Human alpha-2 collagen type VI and alpha-2 collagen type VI-a gene, exons 2a and 2b. ACCESSION M34572 KEYWORDS alpha-2 collagen type VI; alternative splice. SEGMENT 2 of 3 SOURCE Human leukocyte DNA, clone D1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. 
REFERENCE 1 (bases 1 to 1348) AUTHORS Saitta,B., Stokes,D.G., Vissing,H., Timpl,R. and Chu,M.-L. TITLE Alternative splicing of the human alpha-2(VI) collagen gene generates multiple mRNA transcripts which predict three protein variants with distinct carboxyl termini JOURNAL J. Biol. Chem. 265, 6473-6480 (1990) STANDARD simple staff_entry FEATURES from to/span description pept + 437 462 alpha-2 collagen type VI, exon 2a pept + 730 1025 alpha-2 collagen type VI-a, exon 2b pre-msg < 1 1336 alpha-2cVI mRNA and introns IVS < 1 436 intron A (alt. splice site) IVS < 1 729 intron A (alt. splice site) signal 1285 1291 polyA signal BASE COUNT 238 a 457 c 398 g 255 t ORIGIN About 1.0 kbp downstream from segment 1. 1 tctggctact ggtgacacac tgctgtgcct gccctggcct tctccagaca gccctgtcca 61 cccaaagccc agccaccctg gcctgcagca ggcctgtgga gttctcagtt gcgtggggac 121 cagagggtgc tggagaaaca aaccagacgc agctgaaggc agtcagggca gggcgcaatc 181 agcgataaga gctgcatagg ggccacagcg taacctgagc tccagtcggt ggaaagaaaa 241 ggcagagacg ttgcagaggc caggtctgct caggggaaga cagttctggg tgtagaggac 301 tcacatccca gagaggctga ggaagggttt accacgcaag cttctcattc gggactcttg 361 aggggtggct ggggtcttcc tggcgacggg ctgcggcact gaagccctac tggagtttgg 421 cctgtctccg gcacaggttt ggacggagct gttttgtgct gaaaggtttt ctcggggtcc 481 gtggtgtccc ccaaaggtgc caccgtgcgg gtctcctagc tccctgccag cttcctgtcc 541 ctgtgctcac tgcccccacg cctcctgcca aggccgagcc acacacccgc tccacctgca 601 tttcctctac cgactcgcca gcccaaatgc cgctcttcac tctggcctcg ctgagcggct 661 gcccgaggag gagctctagg ccgacgccca ccgcaggcct tacagtcgtc tctggacgct 721 cccttgcaga tgcaccgtgg cctggcggcg agcccccggt caccttcctc cgcacggaag 781 aggggccgga cgccaccttc cccaggacca ttcccctgat ccaacagttg ctaaacgcca 841 cggagctcac gcaggacccg gccgcctact cccagctggt ggccgtgctg gtctacaccg 901 ccgagcgggc caagttcgcc accggggtag agcggcagga ctggatggag ctgttcattg 961 acacctttaa gctggtgcac agggacatcg tgggggaccc cgagaccgcg ctggccctct 1021 gctaaagccc gggcacccgc ccagccgggc tgggccctcc ctgccacact agcttcccag 1081 ggctgccccc gacaggctgg 
ctctcagtgg aggccgagag atctggaatc ggggtcagcg 1141 gggctacagt ccttccaggg gctctggggc agctcccagc ctcttcccat gctggtggcc 1201 accgtgtccc ttgctgcggc tgcatcttcc agtctctcct ccgtcttcca gtggccgctc 1261 tctttataag aaccctggtc attgaattta aggcccaccc caagtccaga atgacctcgc 1321 aagaccctta actcactccc gtctgcag // LOCUS HUMC6A2A3 1174 bp ss-mRNA PRI 12-JUL-1990 DEFINITION Human alpha-2 collagen type VI-a' gene, exon 1. ACCESSION M34573 KEYWORDS alpha-2 collagen type VI; alternative splice. SEGMENT 3 of 3 SOURCE Human leukocyte DNA, clone D1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1174) AUTHORS Saitta,B., Stokes,D.G., Vissing,H., Timpl,R. and Chu,M.-L. TITLE Alternative splicing of the human alpha-2(VI) collagen gene generates multiple mRNA transcripts which predict three protein variants with distinct carboxyl termini JOURNAL J. Biol. Chem. 265, 6473-6480 (1990) STANDARD simple staff_entry FEATURES from to/span description pept + 140 738 alpha-2 collagen type VI-a', exon 1 pre-msg < 1 1028 alpha-2cVI mRNA and introns IVS < 1 139 intron A (alt. splice site) signal 1010 1015 polyA signal BASE COUNT 189 a 439 c 364 g 182 t ORIGIN About 2.1 kbp downstream of segment 2. 
1 ctgcagaaac gccccgcaga gcccagtggt ctgtgaggtt gcaggcaggg tgcgaatgga 61 agggacaggt gcggggctgg cacctgcccg gtcctgccca cctctcctcc gcccagcccg 121 cacctgcggt ctcccacaga gctgtccgtg gcacagtgca cgcagcggcc cgtggacatc 181 gtcttcctgc tggacggctc cgagcggctg ggtgagcaga acttccacaa ggcccggcgc 241 ttcgtggagc aggtggcgcg gcggctgacg ctggcccgga gggacgacga ccctctcaac 301 gcacgcgtgg cgctgctgca gtttggtggc cccggcgagc agcaggtggc cttcccgctg 361 agccacaacc tcactgccat ccacgaggcg ctggagacca cacaatacct gaactccttc 421 tcgcacgtgg gcgcaggcgt ggtgcacgcc atcaatgcca tcgtgcgcag cccgcgtggc 481 ggggcccgga ggcacgcaga gctgtccttc gtgttcctca cggacggcgt cacgggcaac 541 gacagtctgc acgagtcggc gcactccatg cgcaacgaga acgtggtacc caccgtgctg 601 gccttgggca gcgacgtgga catggacgtg ctcaccacgc tcagcctggg tgaccgcgcc 661 gccgtgttcc acgagaagga ctatgacagc ctggcgcaac ccggcttctt cgaccgcttc 721 atccgctgga tctgctagcg ccgccgcccg ggccccgcag tcgagggtcg tgagcccacc 781 ccgtccatgg tgctaagcgg gcccgggtcc cacacggcca gcaccgctgc tcactcggac 841 gacgccctgg gcctgcacct ctccagctcc tcccacgggg tccccgtagc cccggccccc 901 gcccagcccc aggtctcccc aggccctccg caggctgccc ggcctccctc cccctgcagc 961 catcccaagg ctcctgacct acctggcccc tgagctctgg agcaagccca ataaaggctt 1021 tgaacccatt gcgtgcctgc gagcttctgt gcgcaggaga gacctcaaag gtgtcttgtg 1081 gccaggaggg aaacactgca gctgtcgctc gcccaccagg gtcaatggct cccccgggcc 1141 cagcctgacc tcctaggaca tcaactgcag gtgc // LOCUS HUMC6A2AA 888 bp ss-mRNA PRI 12-JUL-1990 DEFINITION Human alpha-2 collagen type VI mRNA, 3' end. ACCESSION M34570 KEYWORDS alpha-2 collagen type VI. SOURCE Human fibroblast, cDNA to mRNA, clone F221. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 888) AUTHORS Saitta,B., Stokes,D.G., Vissing,H., Timpl,R. and Chu,M.-L. 
TITLE Alternative splicing of the human alpha-2(VI) collagen gene generates multiple mRNA transcripts which predict three protein variants with distinct carboxyl termini JOURNAL J. Biol. Chem. 265, 6473-6480 (1990) STANDARD simple staff_entry FEATURES from to/span description pept < 1 54 alpha-2 collagen type VI BASE COUNT 136 a 329 c 249 g 174 t ORIGIN 1 gtgtgcccag accttccctg ccaaacaggt ttggacggag ctgttttgtg ctgaaaggtt 61 ttctcggggt ccgtggtgtc ccccaaaggt gccaccgtgc gggtctccta gctccctgcc 121 agcttcctgt ccctgtgctc actgccccca cgcctcctgc caaggccgag ccacacaccc 181 gctccacctg catttcctct accgactcgc cagcccaaat gccgctcttc actctggcct 241 cgctgagcgg ctgcccgagg aggagctcta ggccgacgcc caccgcaggc cttacagtct 301 tctctggacg ctcccttgca gatgcaccgt ggcctggcgg cgagcccccg gtcaccttcc 361 tccgcacgga agaggggccg gacgccacct tccccaggac cattcccctg atccaacagt 421 tgctaaacgc cacggagctc acgcaggacc cggccgccta ctcccagctg gtggccgtgc 481 tggtctacac cgccgagcgg gccaagttcg ccaccggggt agagcggcag gactggatgg 541 agctgttcat tgacaccttt aagctggtgc acagggacat cgtgggggac cccgagaccg 601 cgctggccct ctgctaaagc ccgggcaccc gcccagccgg gctgggccct ccctgccaca 661 ctagcttccc agggctgccc ccgacaggct ggctctcagt ggaggcccag agatctggaa 721 tcggggtcag cggggctaca gtccttccag gggctctggg gcagctccca gcctcttccc 781 atgctggtgg ccaccgtgtc ccttgctgcg gctgcatctt ccagtctctc ctccgtcttc 841 cagtggccgc tctctttata agaaccctgg tcattgaatt taaggccc // LOCUS PPH47CG 7726 bp ds-DNA VRL 12-JUL-1990 DEFINITION Human papillomavirus type 47 (HPV-47) +-sense strand. ACCESSION M32305 KEYWORDS . SOURCE Human papillomavirus type 47 DNA, clone pTZ18R. ORGANISM Human papillomavirus Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Papillomavirus. REFERENCE 1 (bases 1 to 7726) AUTHORS Kiyono,T., Adachi,A. and Ishibashi,M. 
TITLE Genome organization and taxonomic position of human papillomavirus type 47 inferred from its DNA sequence JOURNAL Virology 177, 401-405 (1990) STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by T.Kiyono, 23-FEB-1990, for release after publication. FEATURES from to/span description pept 966 981 E1/E4 fusion protein, exon 1 3324 4000 E1/E4 fusion protein, exon 2 pept 208 678 ORF E6 pept 668 979 ORF E7 pept 966 2783 ORF E1 pept 2725 4245 ORF E2 pept 3086 4000 ORF E4 pept 4334 5890 ORF L2 pept 5903 7447 ORF L1 pre-msg 198 4465 HPV-47-1 mRNA and intron IVS 982 3323 HPV-47-1 intron pre-msg < 1 4465 HPV-47-2 mRNA and intron IVS 1359 2677 HPV-47-2 intron signal 4424 4429 polyA signal BASE COUNT 2369 a 1517 c 1727 g 2113 t ORIGIN 1 aacggtaagt ttgcattaat gtaccaggtg cggtacagat catttcacaa tggatattat 61 tgttgccaac taccatagtc ataatcaagt tcttgcctgt atcgttttcg taccttacct 121 acagtatttt atattaatat ataaataaat aaatatataa atgtgtattt atttctcagg 181 ctcagttctt tgcaattatt aagacaaatg gctcagaagg ctttggaaca gactacagtt 241 aaagaggaaa agctagaact acctactact attagaggct tagctcaatt gttagacata 301 cctttagtag attgtttgct accttgcaac ttttgtggca gatttcttga ctatttagaa 361 gtttgtgaat ttgattataa aaagcttact ttaatttgga aagactacag tgtttatgcc 421 tgctgccgtt tgtgctgctc agcaactgcc acatatgaat ttaatgtttt ttatcaacaa 481 acagtgttag gtagagatat tgagctagct acaggccttt ccatttttga gattgacata 541 aggtgtcata cctgcctgtc atttcttgac attattgaaa agttagatag ctgtggaaga 601 ggacttccct ttcacaaagt aagaaacgcc tggaagggtg tttgtaggca gtgtaagcat 661 ttttacaatg attggtaaag aggtcaccgt gcgagatatt gttctggagt taagtgaggt 721 tcaacctgaa gtattaccag ttgacctgtt ttgcgacgag gaattaccaa atgaacaaca 781 ggcggaggag gagctagaca tcgacagagt cgttttcaaa gtgattgcac cgtgcggttg 841 cagctgctgc gaggtcaagc ttcgcatttt tgtgaacgca acaaaccgtg gcatcaggac 901 atttcaggaa cttttgactg gtgatctgca gctcctctgc ccagagtgcc gtgggaactg 961 caaacatggc ggattctaaa ggtagtacat ctaaagaagg gtttggtgat tggtgtattt 1021 tggaagctga ctgtagtgat 
gttgaggatg atttgggaca attatttgag agagatacag 1081 actcagatat ctcggacctg ttagacaatt gtgacctgga tcagggcaat tcacgggaac 1141 tatttcatca acaggagtgt aagcaaagcg aggagcaatt acaaaaacta aaacgaaagt 1201 atcttagtcc aaaagctgtc gcgcagctta gtccgcgtct tgagtcaatt tcattgtcac 1261 ctcagcagaa atccaagaga aggctctttg cagagcaaga cagcggactc gagttaacct 1321 ttaacaatga agctgaagat gttactcctg aggtggaggt accggctata gactctcggc 1381 cggatgatga tgagggagga tcaggggatg tagatattca ttatacagca ttgttgcgtt 1441 ccagcaacca aaaggccaca ttactggcaa aattcaaaca agcgtttggg gtaggcttta 1501 atgaattgac aagacaattc aaaagctaca aaacctgctg taatcattgg gttgtatccg 1561 tatatgcagt ccatgatgat ctatttgaaa gctcaaagca gctgttgcaa cagcattgtg 1621 actatatatg ggtccgtggg atagatgcaa tgtcattata tctattgtgt tttaaggcgg 1681 gaaaaaatcg tgggacagtt cataagctaa ttaccacaat gttaaatgtg catgagcaac 1741 agatattgtc tgagcctcca aagttaagaa atacagctgc tgcattattt tggtacaaag 1801 gatgtatggg acctggagtg ttcacccacg gtccttaccc tgaatggatt gcacaattaa 1861 ccattttggg ccataagagt gctgaggcaa gtgcgtttga tctgtcagtc atggttcaat 1921 gggcatttga taacaatctg tttgaggagg cagacattgc atacggatat gcaagactgg 1981 caccagagga tagcaatgca gttgcatggc ttgcacataa taaccaagct aaatatgtta 2041 gagaatgtgc tatgatggtt cgatactaca aaaaggggca aatgagagat atgagcatgt 2101 ctgagtggat atatacaagg atacatgaag tagagggaga aggacagtgg tctagcattg 2161 ttaaattttt aagatatcaa gaaataaatt ttatttcatt tttggctgct ttaaaagatt 2221 tattacattc agtacctaaa cgcaattgta ttttattcca tggccctcca aatacaggaa 2281 agtcatcgtt tggaatgtcc ttaataaaag ttctaagggg gagagtatta tcatttgtaa 2341 actccaaaag tcagttttgg ttgcagcctc ttggagaatg taaaatagca ttattagatg 2401 atgttacaga tccatgttgg gtgtatatgg atcaatattt aagaaatggg ttagatgggc 2461 attttgtgtc tttggattgt aaatatagag cacccatgca aacaaagttt ccacctttaa 2521 tacttacatc taatattaat gtacatgcag agaccaatta tagataccta catagtagaa 2581 ttaagggttt tgaatttaaa aatccatttc ctatgaaagc agataataca cctcaatttg 2641 agttaactga ccaaagctgg aaatcttttt ttacaaggct ttggacacac ttagacctga 2701 gtgaccaaga agacgagggc gaacatggag 
aatctcagcg agcgtttcaa tgctctgcaa 2761 gaacagctaa tgaacattta tgaagctgca gaacagacat taaaggcaca aattttacat 2821 tggcagacat tgcgaaaaga agctgtgaca ctctactttg ctaggcagaa aggcataaat 2881 aggttgggat accaaccagt gcctgcatta gcaatatctg aggcaagggc caaagaggct 2941 atatatatgg tgttgcagtt agagtcgcta caaaaatcag cgtttgcttt ggagccttgg 3001 accttagtgg acactagtac agagactttt aagagtgctc cagaaaatca ttttaaaaag 3061 gggcctgtac ctgtggaggt gatatatgac aaagatgaag caaatgctaa tttgtatact 3121 atgtggacat ttgtgtatta catggattca gatgatgtgt ggcataagac aacaagtggg 3181 gtcaatcaaa ctggcattta ctacctatat ggaacattta aacactatta tgtgttattt 3241 gctgatgatg caaagagata tagtgctact ggagaatggg aagttaaagt taataaggaa 3301 actgtgttta ctcctgtcac tagctccaca ccaccagggt caccaggagg acaaacagac 3361 ccagacacct cctccaagac ccccaccacc accacagccg ccactgacac ctcgcccaga 3421 cgccaatcca tcaataaaca gtcacaacaa accgaaacca aacgaagagg gtacggacgg 3481 agaccatcaa gcagaacaag gcgaccgcaa acgcaccaaa ggcgatccag atccagatcc 3541 cggtcgcggt ccagttctca aacccactct tccaccacca ccaccaccac cacctacagg 3601 tccaggtcta cgtcgctcaa caagactcgt gctcgttcca ggtcaaggtc cacctccaga 3661 tctaccagca ccaccagtag aaggggaggt agagggtcat ccacaaggca aagatcgcga 3721 tcaccctcca cctacacctc aaaacggtca cgggaaggaa acacaagggg cagagggagg 3781 gggagacaag ggagagcagg gagcagtggg gggagagagc agcgacggag aaggagatca 3841 ttctcaacct cccctgactc ctccaaacga gtcagacggg agtctcctaa ataccgtggc 3901 gtgtctccta gcgaggtggg aaagcaactt cgatcagttg gtgcaaaaca ttcagggcga 3961 cttggaaggt tattggagga agctagggac cccccagtaa ttcttgtgcg aggggacgca 4021 aacacattaa aatgctttcg caacagagca aggaacaaat atagagggct ttttagatca 4081 ttcagcacta cattttcctg ggtagctgga gatagcattg agcgtctagg caggtccaga 4141 atgctcatta gcttttcctg cctcactcag agaagggatt ttgatgatgc tgtcaaatat 4201 ccaaaaggag tcgagtggtc atatggtagt cttgatagcc tttaacaagc attaacgctg 4261 ctttgctact aactgctatt aacaaccaca gctttttttt tacgtttttt tattttactg 4321 attttgtact gcaatggcgc gtgctagaag ggtcaaacgt gactctgtaa cacatatata 4381 tcagacctgc aaacaggcag gcacttgccc ctcggacgtt 
gttaataaag ttgagcaaac 4441 aacagttgct gacaatattt tgaaatatgg cagtgctggt gtcttttttg gaggccttgg 4501 cataggaaca ggccgaggga ctgggggtgc tactgggtac gtgccacttg gggaaggtcc 4561 tggtgtccgt gtgggaggaa ccccaacggt tgtaaggcct tctcttgttc ctgaagcaat 4621 tggaccagtt gatattttac ccattgacac aatcgcacct gtcgagccta ctgcttcatc 4681 tttagtccca ttaacagagt cgtctggtgc tgatttactt cccggtgaag ttgaaactat 4741 agccgaaata catcctattc ctgaaggtcc gacaatcgac tcccctgtag tcaccacaac 4801 gacaggttcc agtgctgttc tggaagtggc tccagaacct gtacccccta cacgtgttag 4861 aattgctaga acacaatatc ataatccctc ttttcagata ctcactgaat caacacctgc 4921 gcagggcgag agttctcttg ctgaccatat tttggtcacc tcagggtctg gtggacaaag 4981 gataggcggt gatataacag acgaaattga acttactgag tttccaagca gatatacatt 5041 tgaaatagaa gaacccaccc ctccacgaaa aagtagcaca ccattacaaa ctgtagcctc 5101 tgcagtaagg cgacggggct tctcattaac aaatagaaga ttggtacaac aagtagctgt 5161 agacaatcct ttatttttaa gtcaaccttc taagatggta agattctcat ttgacaatcc 5221 agcttttgaa gaagaggtta ccaatatttt tgaacaggat gttaacagct ttgaagaacc 5281 tccagacagg gattttcttg atattaaaca attgggccgt cctcaatatt ctacaacacc 5341 agcaggttat attagggtaa gcagactagg aactcgaggc accattcgca ctcgttctgg 5401 tgcacaaata ggttctcagg tacactttta tagagattta agttctataa atactgagga 5461 tccaatagaa ctacagcttt tagggcagca ttctggagat gctactattg ttcaaggtcc 5521 tgtagaaagc acatttatag atatggacat tgctgaaaac cctttatctg aaacaataga 5581 tgcttcatct aatgatttac ttttggatga gactgtggag gattttagtg ggtcccaatt 5641 agtaattgga aatcgaagga gtacaacatc atatactgtt cccagatttg agactactag 5701 aagtagttcc tattatgttc aagacacaga tggttattat gttgcttacc cagagtcacg 5761 ggacactatt gatattattt accctacacc tgaattacct gtagttgtca ttcacaccca 5821 tgacaattct ggagactttt acttacatcc tagtcttaga aggcgtaagc gtaaaagaaa 5881 atatttgtga tttgcattgc agatggcagt gtggcactcg gctaacggta aagtatacct 5941 tcctccatca acaccagtgg ccagggttca aagcacggat gaatacatac aaaggactaa 6001 tatctattat catgcaaata ctgaccgcct tttaacagta ggacatccat atttcaatgt 6061 atacaataat aatggaacta cattagaggt tccaaaagta tcaggtaatc 
agcatagggt 6121 gtttcgctta aaattgccag atcctaatag atttgctcta gcggacatgt ctgtatacaa 6181 ccctgacaaa gaacgcttgg tgtgggcctg caggggtcta gaaattggaa ggggtcaacc 6241 tttaggtgtt ggcagtactg gtcacccata ttttaataag gtaaaagata cagaaaacag 6301 taattcctat atcacaaact caaaagatga cagacaagac acctcttttg atcctaaaca 6361 aatacagatg tttattgtgg gctgcactcc atgtattggc gaacactggg ataaggcaga 6421 gccttgtggg gaacagcaaa ctggtctttg tcctcctatt gaattaaaaa acacatacat 6481 tcaggatggc gacatggcag acattggttt tggcaacatt aatttcaagg ccttacaaca 6541 cagtaggtct gatgttagtc ttgacattgt aaatgaaact tgcaagtacc cggattttct 6601 caaaatgcaa aatgatgttt atggggatgc ttgctttttt tatgctcgta gagagcaatg 6661 ttatgccaga catttttttg ttagaggggg aaaaacaggt gatgacatac caggagcaca 6721 ggttggcaat ggtaatatga aaaatcaatt ttacattcct ggtgctacgg gtcaggctca 6781 gagcactata ggtaatgcca tgtatttccc aactgtcagt ggctcactag tctctagtga 6841 tgctcaactg tttaacaggc cattctggct ccaaagggct cagggtcata ataatggcat 6901 tctgtgggct aatcaaatgt ttgtcacagt tgtagacaac acaagaaata caaatttcag 6961 catctctgtt tactctcagg caggggacat aaaggatata caggattata atgcagacaa 7021 ttttagagag tatcaaagac atgtggagga atatgaaatt tctgtaatat tacaattgtg 7081 caaagttcct ttaaaagcag aagttttagc acaaattaat gccatgaatt cgtctctttt 7141 agaggaatgg cagttaggat ttgtgcctac tccagacaac cctattcagg atacatatag 7201 atatctagaa tctttggcca ctaggtgtcc tgaaaagtct cctccaaaag agaaggttga 7261 cccctacaaa ggtttaaact tttgggatgt cgatatgaca gagcgccttt ccctggattt 7321 agatcaatat tcattaggta gaaagttctt attccaggct ggattacagc agacgaccgt 7381 aaacggtaca aaaacaactc cttacagggg gtccatcaga ggaacaaagc gcaaacgaaa 7441 aaattgaaga tgaccgtttt cggtacagat tgtttaactt ttacacagta ttcaaggaat 7501 gtctgtttac tgtgactaag tgtaactctg ccaaagaaac aaccgcaccc ggtacacgta 7561 ttcagcttgt tgccaaaaca gataagcttg gcagtcagaa cacaccgtgt tcgtcgcaac 7621 acgctcggat taggtcttct gccaaaagaa atttaatctt gttatcgttt ttggcgatca 7681 catttggcac cgcgggcagc tgttttggca ctacaagaca accgtt // LOCUS RUBCG 9755 bp ss-RNA VRL 12-JUL-1990 DEFINITION Rubella virus complete genome 
encoding nonstructural protein, capsid protein, glycoproteins E1 and E2, complete cds. ACCESSION M15240 M18901 M32735 KEYWORDS C protein; glycoprotein; glycoprotein E1; glycoprotein E2; hemagglutinin. SOURCE Rubella virus (strain Therien) cDNA to genomic RNA and cDNA to mRNA, clones pRUB1025[1010,1012,1002,1006,1015,1001]. ORGANISM Rubella virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Rubivirus. REFERENCE 1 (bases 8155 to 9754) AUTHORS Frey,T.K., Marr,L.D., Hemphill,M.L. and Dominguez,G. TITLE Molecular cloning and sequencing of the region of the rubella virus genome coding for glycoprotein E1 JOURNAL Virology 154, 228-232 (1986) STANDARD full staff_review REFERENCE 2 (bases 5917 to 9754; revises [1]) AUTHORS Frey,T.K. and Marr,L.D. JOURNAL Unpublished (1987) STANDARD full staff_review REFERENCE 3 (bases 5247 to 8366) AUTHORS Frey,T.K. and Marr,L.D. TITLE Sequence of the region coding for virion proteins C and E2 and the carboxy terminus of the nonstructural proteins of rubella virus: comparison with alphaviruses JOURNAL Gene 62, 85-99 (1988) STANDARD full staff_review REFERENCE 4 (bases 1 to 9755) AUTHORS Dominguez,G., Wang,C.-Y. and Frey,T.K. TITLE Sequence of the genome RNA of rubella virus: Evidence for genetic rearrangement during togavirus evolution JOURNAL Virology 177, 225-258 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable copy of sequence in [2] kindly provided by T.K.Frey, 01-JUN-1987. Draft entry and computer-readable sequence for [4] kindly submitted by G.Dominguez, 09-MAR-1990, for release after publication. Glycoprotein E1 contains the viral hemagglutinin activity. Multiple copies of the C protein comprise the nucleocapsid.
FEATURES from to/span description pept 39 6656 nonstructural polyprotein precursor pept 6505 9696 structural polyprotein precursor matp 6505 7404 capsid protein (C) matp 7405 8250 glycoprotein E2 matp 8251 9693 glycoprotein E1 mRNA 6428 9755 subgenomic RNA BASE COUNT 1457 a 3781 c 3007 g 1510 t ORIGIN 1 atggaagcta tcggacctcg cttaggactc ccattcccat ggagaaactc ctagatgagg 61 ttcttgcccc cggtgggcct tataacttaa ccgtcggcag ttgggtaaga gaccacgtcc 121 gatcaattgt cgagggcgcg tgggaagtgc gcgatgttgt taccgctgcc caaaagcggg 181 ccatcgtagc cgtgataccc agacctgtgt tcacgcagat gcaggtcagt gatcacccag 241 cactccacgc aatttcgcgg tatacccgcc gccattggat cgagtggggc cctaaagaag 301 ccctacacgt cctcatcgac ccaagcccgg gcctgctccg cgaggtcgct cgcgttgagc 361 gccgctgggt cgcactgtgc ctccacagga cggcacgcaa actcgccacc gccctggccg 421 agacggccag cgaggcgtgg cacgctgact acgtgtgcgc gctgcgtggc gcaccgagcg 481 gccccttcta cgtccaccct gaggacgtcc cgcacggcgg tcgcgccgtg gcggacagat 541 gcttgctcta ctacacaccc atgcagatgt gcgagctgat gcgtaccatt gacgccaccc 601 tgctcgtggc ggttgacttg tggccggtcg cccttgcggc ccacgtcggc gacgactggg 661 acgacctggg cattgcctgg catctcgacc atgacggcgg ttgccccgcc gattgccgcg 721 gagccggcgc tgggcccacg cccggctaca cccgcccctg caccacacgc atctaccaag 781 tcctgccgga caccgcccac cccgggcgcc tctaccggtg cgggccccgc ctgtggacgc 841 gcgattgcgc cgtggccgaa ctctcatggg aggttgccca acactgcggg caccaggcgc 901 gcgtgcgcgc cgtgcgatgc accctcccta tccgccacgt gcgcagcctc caacccagcg 961 cgcgggtccg actcccggac ctcgtccatc tcgccgaggt gggccggtgg cggtggttca 1021 gcctcccccg ccccgtgttc cagcgcatgc tgtcctactg caagaccctg agccccgacg 1081 cgtactacag cgagcgcgtg ttcaagttca agaacgccct gtgccacagc atcacgctcg 1141 cgggcaatgt gctgcaagag gggtggaagg gcacgtgcgc cgaggaagac gcgctgtgcg 1201 catacgtagc cttccgcgcg tggcagtcta acgccaggtt ggcggggatt atgaaaggcg 1261 cgaagtgcgc cgccgactct ttgagcgtgg ccggctggct ggacaccatt tgggacgcca 1321 ttaagcggtt cctcggtagc gtgcccctcg ccgagcgcat ggaggagtgg gaacaggacg 1381 ccgcggtcgc cgccttcgac cgcggccccc tcgaggacgg cgggcgccac ttggacaccg 1441 
tgcaaccccc aaaatcgccg ccccgccctg agatcgccgc gacctggatc gtccacgcag 1501 ccagcgaaga ccgccattgc gcgtgcgctc cccgctgcga cgtcccgcgc gaacgtcctt 1561 ccgcgcccgc cggccagccg gatgacgagg cgctcatccc gccgtggctg ttcgccgagc 1621 gccgtgccct ccgctgccgc gagtgggatt tcgaggctct ccgcgcgcgc gccgatacgg 1681 cggccgcgcc cgccccgccg gctccacgcc ccgcgcggta ccccaccgtg ctctaccgcc 1741 accccgccca ccacggcccg tggctcaccc ttgacgagcc gggcgaggct gacgcggccc 1801 tggtcttatg cgacccactt ggccagccgc tccggggccc tgaacgccac ttcgccgccg 1861 gcgcgcatat gtgcgcgcag gcgcgggggc tccaggcttt tgtccgtgtc gtgcctccac 1921 ccgagcgccc ctgggccgac gggggcgcca gagcgtgggc gaagttcttc cgcggctgcg 1981 cctgggcgca gcgcttgctc ggcgagccag cagttatgca cctcccatac accgatggcg 2041 acgtgccaca gctgatcgca ctggctttgc gcacgctggc ccaacagggg gccgccttgg 2101 cactctcggt gcgtgacctg cccgggggtg cagcgttcga cgcaaacgcg gtcaccgccg 2161 ccgtgcgcgc tggcccccgc cagtccgcgg ccgcgtcacc gccacccggc gaccccccgc 2221 cgccgcgccg cgcacggcga tcgcaacggc actcggacgc tcgcggcact ccgccccccg 2281 cgcctgcgcg cgacccgccg ccgcccgccc ccagcccgcc cgcgccaccc cgcgctggtg 2341 acccggtccc tcccattccc gcggggccgg cggatcgcgc gcgtgacgcc gagctggagg 2401 tcgcctgcga gccgagcggc ccccccacgt caaccagggc agacccagac agcgacatcg 2461 ttgaaagtta cgcccgcgcc gccggacccg tgcacctccg agtccgcgac atcatggacc 2521 caccgcccgg ctgcaaggtc gtggtcaacg ccgccaacga ggggctactg gccggctctg 2581 gcgtgtgcgg tgccatcttt gccaacgcca cggcggccct cgctgcaaac tgccggcgcc 2641 tcgccccatg ccccaccggc gaggcagtgg cgacacccgg ccacggctgc gggtacaccc 2701 acatcatcca cgccgtcgcg ccgcggcgtc ctcgggaccc cgccgccctc gaggagggcg 2761 aagcgctgct cgagcgcgcc taccgcagca tcgtcgcgct agccgccgcg cgtcggtggg 2821 cgtgtgtcgc gtgccccctc ctcggcgctg gcgtctacgg ctggtctgct gcggagtccc 2881 tccgagccgc gctcgcggct acgcgcaccg agcccgtcga gcgcgtgagc ctgcacatct 2941 gccaccccga ccgcgccacg ctgacgcacg cctccgtgct cgtcggcgcg gggctcgctg 3001 ccaggcgcgt cagtcctcct ccgaccgagc ccctcgcatc ttgccccgcc ggtgacccgg 3061 gccgaccggc tcagcgcagc gcgtcgcccc cagcgacccc ccttggggat gccaccgcgc 3121 ccgagccccg 
cggatgccag gggtgcgaac tctgccggta cacgcgcgtc accaatgacc 3181 gcgcctatgt caacctgtgg ctcgagcgcg accgcggcgc caccagctgg gccatgcgca 3241 ttcccgaggt ggttgtctac gggccggagc acctcgccac gcattttcca ttaaaccact 3301 acagtgtgct caagcccgcg gaggtcaggc ccccgcgagg catgtgcggg agtgacatgt 3361 ggcgctgccg cggctggcat ggcatgccgc aggtgcggtg caccccctcc aacgctcacg 3421 ccgccctgtg ccgcacaggc gtgccccctc gggcgagcac gcgaggcggc gagctagacc 3481 caaacacctg ctggctccgc gccgccgcca acgttgcgca ggctgcgcgc gcctgcggcg 3541 cctacacgag tgccgggtgc cccaagtgcg cctacggccg cgccctgagc gaagcccgca 3601 ctcatgagga cttcgccgcg ctgagccagc ggtggagcgc gagccacgcc gatgcctccc 3661 ctgacggcac cggagatccc ctcgaccccc tgatggagac cgtgggatgc gcctgttcgc 3721 gcgtgtgggt cggctccgag catgaggccc cgcccgacca cctcctggtg tcccttcacc 3781 gtgccccaaa tggtccgtgg ggcgtagtgc tcgaggtgcg tgcgcgcccc gaggggggca 3841 accccaccgg ccacttcgtc tgcgcggtcg gcggcggccc acgccgcgtc tcggaccgcc 3901 cccacctctg gcttgcggtc cccctgtctc ggggcggtgg cacctgtgcc gcgaccgacg 3961 aggggctggc ccaggcgtac tacgacgacc tcgaggtgcg ccgcctcggg gatgacgcca 4021 tggcccgggc ggccctcgca tcagtccaac gccctcgcaa aggcccttac aatatcaggg 4081 tatggaacat ggccgcaggc gctggcaaga ctacccgcat cctcgctgcc ttcacgcgcg 4141 aagaccttta cgtctgcccc accaatgcgc tcctgcacga gatccaggcc aaactccgcg 4201 cgcgcgatat cgacatcaag aacgccgcca cctacgagcg ccggctgacg aaaccgctcg 4261 ccgcctaccg ccgcatctac atcgatgagg cgttcactct cggcggcgag tactgcgcgt 4321 tcgttgccag ccaaaccacc gcggaggtga tctgcgtcgg tgatcgggac cagtgcggcc 4381 cacactacgc caataactgc cgcacccccg tccctgaccg ctggcctacc gagcgctcgc 4441 gccacacttg gcgcttcccc gactgctggg cggcccgcct gcgcgcgggg ctcgattatg 4501 acatcgaggg cgagcgcacc ggcaccttcg cctgcaacct ttgggacggc cgccaggtcg 4561 accttcacct cgccttctcg cgcgaaaccg tgcgccgcct tcacgaggct ggcatacgcg 4621 catacaccgt gcgcgaggcc cagggtatga gcgtcggcac cgcctgcatc catgtaggca 4681 gagacggcac ggacgttgcc ctggcgctga cacgcgacct cgccatcgtc agcctgaccc 4741 gggcctccga cgcactctac ctccacgagc tcgaggacgg ctcactgcgc gctgcggggc 4801 tcagcgcgtt cctcgacgcc 
ggggcactgg cggagctcaa ggaggttccc gctggcattg 4861 accgcgttgt cgccgtcgag caggcaccac caccgttgcc gcccgccgac ggcatccccg 4921 aggcccaaga cgtgccgccc ttctgccccc gcactctgga ggagctcgtc ttcggccgtg 4981 ccggccaccc ccattacgcg gacctcaacc gcgtgactga gggcgaacga gaagtgcggt 5041 acatgcgcat ctcgcgtcac ctgctcaaca agaatcacac cgagatgccc ggaacggaac 5101 gcgttctcag tgccgtttgc gccgtgcggc gctaccgcgc gggcgaggat gggtcgaccc 5161 tccgcactgc tgtggcccgc cagcacccgc gcccttttcg ccagatccca cccccgcgcg 5221 tcactgctgg ggtcgcccag gagtggcgca tgacgtactt gcgggaacgg atcgacctca 5281 ctgatgtcta cacgcagatg ggcgtggccg cgcgggagct caccgaccgc tacgcgcgcc 5341 gctatcctga gatcttcgcc ggcatgtgta ccgcccagag cctgagcgtc cccgccttcc 5401 tcaaagccac cttgaagtgc gtagacgccg ccctcggccc cagggacacc gaggactgcc 5461 acgccgctca ggggaaagcc ggccttgaga tccgggcgtg ggccaaggag tgggttcagg 5521 ttatgtcccc gcatttccgc gcgatccaga agatcatcat gcgcgccttg cgcccgcaat 5581 tccttgtggc cgctggccat acggagcccg aggtcgatgc gtggtggcag gcccattaca 5641 ccaccaacgc catcgaggtc gacttcactg agttcgacat gaaccagacc ctcgctactc 5701 gggacgtcga gctcgagatt agcgccgctc tcttgggcct cccttgcgcc gaagactacc 5761 gcgcgctccg cgccggcagc tactgcaccc tgcgcgaact gggctccact gagaccggct 5821 gcgagcgcac aagcggcgag cccgccacgc tgctgcacaa caccaccgtg gccatgtgca 5881 tggccatgcg catggtcccc aaaggcgtgc gctgggccgg gattttccag ggtgacgata 5941 tggtcatctt cctccccgag ggcgcgcgca gcgcggcact caagtggacc cccgccgagg 6001 tgggcttgtt tggcttccac atcccggtga agcacgtgag cacccctacc cccagcttct 6061 gcgggcacgt cggcaccgcg gccggcctct tccatgatgt catgcaccag gcgatcaagg 6121 tgctttgccg ccgtttcgac ccagacgtgc ttgaagaaca gcaggtggcc ctcctcgacc 6181 gcctccgggg ggtctacgcg gctctgcctg acaccgttgc cgccaatgct gcgtactacg 6241 actacagcgc ggagcgcgtc ctcgctatcg tgcgcgaact taccgcgtac gcgggggcgc 6301 ggcctcgacc acccggccac catcggcgcg ctcgaggaga ttcagacccc ctacgcgcgc 6361 gccaatctcc acgacgccga ctaacgcccc tgtacgtggg gcctttaatc ttacctactc 6421 taaccaggtc atcacccacc gttgtttcgc cgcatctggt gggtacccaa cttttgccat 6481 tcgggagagc cccagggtgc ccgaatggct 
tctactaccc ccatcaccat ggaggacctc 6541 cagaaggccc tcgaggcaca atcccgcgcc ctgcgcgcgg aactcgccgc cggcgcctcg 6601 cagtcgcgcc ggccgcggcc gccgcgacag cgcgactcca gcacctccgg agatgactcc 6661 ggccgtgact ccggagggcc ccgccgccgc cgcggcaacc ggggccgtgg ccagcgcagg 6721 gactggtcca gggccccgcc ccccccggag gagcggcaag aaactcgctc ccagactccg 6781 gccccgaagc catcgcgggc gccgccacaa cagcctcaac ccccgcgcat gcaaaccggg 6841 cgtgggggct ctgccccgcg ccccgagctg gggccaccga ccaacccgtt ccaagcagcc 6901 gtggcgcgtg gcctgcgccc gcctctccac gaccctgaca ccgaggcacc caccgaggcc 6961 tgcgtgacct cgtggctttg gagcgagggc gaaggcgcgg tcttttaccg cgtcgacctg 7021 catttcacca acctgggcac ccccccactc gacgaggacg gccgctggga ccctgcgctc 7081 atgtacaacc cttgcgggcc cgagccgccc gctcacgtcg tccgcgcgta caatcaacct 7141 gccggcgacg tcaggggcgt ttggggtaaa ggcgagcgca cctacgccga gcaggacttc 7201 cgcgtcggcg gcacgcgctg gcaccgactg ctgcgcatgc cagtgcgcgg cctcgacggc 7261 gacagcgccc cgcttccccc ccacaccacc gagcgcattg agacccgctc ggcgcgccat 7321 ccttggcgca tccgcttcgg tgccccccag gccttccttg ccgggctctt gctcgccacg 7381 gtcgccgttg gcaccgcgcg cgccgggctc cagccccgcg ctgatatggc ggcacctcct 7441 acgctgccgc agcccccctg tgcgcacggg cagcattacg gccaccacca ccatcagctg 7501 ccgttcctcg ggcacgacgg ccatcatggc ggcaccttgc gcgtcggcca gcattaccga 7561 aacgccagcg acgtgctgcc cggccactgg ctccaaggcg gctggggttg ctacaacctg 7621 agcgactggc accagggcac tcatgtctgt cataccaagc acatggactt ctggtgtgtg 7681 gagcacgacc gaccgccgcc cgcgaccccg acgcctctca ccaccgcggc gaactccacg 7741 accgccgcca cccccgccac tgcgccggcc ccctgccacg ccggcctcaa tgacagctgc 7801 ggcggcttct tgtctgggtg cgggccgatg cgcctgcgcc acggcgctga cacccggtgc 7861 ggtcggttga tctgcgggct gtccaccacc gcccagtacc cgcctacccg gtttggctgc 7921 gctatgcggt ggggccttcc cccctgggaa ctggtcgtcc ttaccgcccg ccccgaagac 7981 ggctggactt gccgcggcgt gcccgcccat ccaggcgccc gctgccccga actggtgagc 8041 cccatgggac gcgcgacttg ctccccagcc tcggccctct ggctcgccac agcgaacgcg 8101 ctgtctcttg atcacgccct cgcggccttc gtcctgctgg tcccgtgggt cctgatattt 8161 atggtgtgcc gccgcgcctg tcgccgccgc ggcgccgccg 
ccgccctcac cgcggtcgtc 8221 ctgcaggggt acaacccccc cgcctatggc gaggaggctt tcacctacct ctgcactgca 8281 ccggggtgcg ccactcaagc acctgtcccc gtgcgcctcg ctggcgtccg ttttgagtcc 8341 aagattgtgg acggcggctg ctttgcccca tgggacctcg aggccactgg agcctgcatt 8401 tgcgagatcc ccactgatgt ctcgtgcgag ggcttggggg cctgggtacc cgcagcccct 8461 tgcgcgcgca tctggaatgg cacacagcgc gcgtgcacct tctgggctgt caacgcctac 8521 tcctctggcg ggtacgcgca gctggcctct tacttcaacc ctggcggcag ctactacaag 8581 cagtaccacc ctaccgcgtg cgaggttgaa cctgccttcg gacacagcga cgcggcctgc 8641 tggggcttcc ccaccgacac cgtgatgagc gtgttcgccc ttgctagcta cgtccagcac 8701 cctcacaaga ccgtccgggt caagttccat acagagacca ggaccgtctg gcaactctcc 8761 gttgccggcg tgtcgtgcaa cgtcaccact gaacacccgt tctgcaacac gccgcacgga 8821 caactcgagg tccaggtccc gcccgacccc ggggacctgg ttgagtacat tatgaattac 8881 accggcaatc agcagtcccg gtggggcctc gggagcccga attgccacgg ccccgattgg 8941 gcctccccgg tttgccaacg ccattcccct gactgctcgc ggcttgtggg ggccacgcca 9001 gagcgccccc ggctgcgcct ggtcgacgcc gacgaccccc tgctgcgcac tgcccctgga 9061 cccggcgagg tgtgggtcac gcctgtcata ggctctcagg cgcgcaagtg cggactccac 9121 atacgcgctg gaccgtacgg ccatgctacc gtcgaaatgc ccgagtggat ccacgcccac 9181 accaccagcg acccctggca tccaccgggc cccttggggc tgaagttcaa gacagttcgc 9241 ccggtggccc tgccacgcac gttagcgcca ccccgcaatg tgcgtgtgac cgggtgctac 9301 cagtgcggta cccccgcgct ggtggaaggc cttgcccccg ggggaggcaa ttgccatctc 9361 accgtcaatg gcgaggacct cggcgccgtc ccccctggga agttcgtcac cgccgccctc 9421 ctcaacaccc ccccgcccta ccaagtcagc tgcgggggcg agagcgatcg cgcgaccgcg 9481 cgggtcatcg accccgccgc gcaatcgttt accggcgtgg tgtatggcac acacaccact 9541 gctgtgtcgg agacccggca gacctgggcg gagtgggctg ctgcccattg gtggcagctc 9601 actctgggcg ccatttgcgc cctcccactc gctggcttac tcgcttgctg tgccaaatgc 9661 ttgtactact tgcgcggcgc tatagcgcct cgctagtggg cccccgcgcg aaacccgcac 9721 taggccacta gatccccgca cctgttgctg tatag // LOCUS XEL68KSA 2009 bp ss-mRNA VRT 12-JUL-1990 DEFINITION X.laevis 68 kDa serum albumin mRNA, complete cds. ACCESSION M18350 KEYWORDS serum albumin. 
SOURCE X.laevis adult liver hepatocyte (lambda-ZAP library), cDNA to mRNA, clone pX1A14. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 2009) AUTHORS Moskaitis,J.E., Sargent,T.D., Smith,L.H.Jr., Pastori,R.L. and Schoenberg,D.R. TITLE Xenopus laevis serum albumin: Sequence of the cDNAs encoding the 68 and 74 kDa peptides, relationship of the 74 kDa albumin to alpha-fetoprotein, and the regulation of albumin gene expression by thyroid hormone during development JOURNAL Mol. Endocrinol. 3, 464-473 (1989) STANDARD full staff_review REFERENCE 2 (bases 1842 to 2009) AUTHORS Schoenberg,D.R., Moskaitis,J.E., Smith,L.H. and Pastori,R.L. TITLE Extranuclear estrogen-regulated destabilization of Xenopus laevis serum albumin mRNA JOURNAL Mol. Endocrinol. 3, 805-814 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by D.Schoenberg, 14-NOV-1988. Draft entry and computer-readable sequence for [2] kindly provided by D.Schoenberg, 18-DEC-1989. FEATURES from to/span description pept 39 1859 68 kDa serum albumin precursor sigp 39 110 68 kDa serum albumin signal peptide matp 111 1856 68 kDa serum albumin signal 1986 1994 poly-A signal site 1 1 cap site BASE COUNT 699 a 388 c 393 g 529 t ORIGIN 87 bp upstream of HinfI site. 
1 aggcttctca gaggtcccca cccaatacat ctccagtcat gaagtggatc accctcattt 61 gtctgttaat tagctccact ttaatagaat caagaataat tttcaaaaga gatacagatg 121 tagaccatca caagcatatt gctgacatgt acaatttatt gactgagcgg accttcaaag 181 gacttacatt ggctattgtc tcacagaatc tccagaaatg ttcattggag gagctgtcta 241 aactggtgaa tgaaattaat gactttgcca aatcctgtac aggaaacgac aaaactcctg 301 agtgtgaaaa acccataggc accctgtttt atgacaaact ctgcgcagat ccaaaagtgg 361 gtgttaatta tgagtggagc aaagagtgct gttctaagca agatccagag agagcacagt 421 gcttcagggc acatagagtt tttgaacata atccagtaag gcctaaacct gaggaaactt 481 gtgcattatt caaagaacac cctgatgatc ttctctcagc attcatacat gaagaggcga 541 gaaaccatcc agacctttat cccccagcag tactattatt aacacagcaa tatggcaaac 601 ttgttgaaca ttgttgtgaa gaagaagaca aggataaatg ctttgcagaa aagatgaagg 661 aactgatgaa acacagtcat tctattgaag ataagcaaaa acatttctgc tggattgtaa 721 ataattatcc tgaaagagtt attaaagcac taaatttggc cagagtgagc cacagatatc 781 ctaagcctga tttcaagctt gcccataaat ttaccgagga gactacacac ttcattaagg 841 attgttgtca tggggacatg tttgaatgca tgacagagag gctggagctt tctgagcata 901 cctgtcaaca taaagatgag ttatcaacaa aacttgaaaa atgctgtaac ttacctttgc 961 ttgagcgtac atactgcatt gtcaccttgg aaaatgatga cgttcctgct gaattatcaa 1021 agccaattac agaatttaca gaggaccctc atgtttgtga gaagtatgct gagaataaaa 1081 gtttcttaga gatatctcca tggcagagtc aagaaacacc agaattgtct gaacaattcc 1141 ttttgcaatc tgcaaaagaa tatgaatctt tgctgaacaa gtgctgcttt tcagacaatc 1201 ctcctgaatg ctacaaggat ggagctgaca gatttatgaa tgaagccaag gagagatttg 1261 catatttgaa acaaaactgt gatatcttgc atgaacatgg agaatatctc tttgaaaatg 1321 aattgctcat aagatacaca aagaaaatgc cccaagtgtc agatgaaaca ttgattggaa 1381 tagcacacca aatggcagat attggtgagc actgctgtgc cgtacctgaa aatcaaagga 1441 tgccatgtgc agaaggagac cttaccattc tcattggaaa aatgtgtgaa aggcaaaaga 1501 agacatttat aaataaccac gttgctcatt gctgcactga ctcatattct gggatgcgtt 1561 catgctttac tgctcttggt ccagatgagg actatgtacc acccccagtt actgatgaca 1621 catttcactt tgacgacaag atatgcactg ctaatgataa agaaaaacag catatcaaac 1681 agaaattcct tgtgaagctg 
attaaagtta gtcctaaatt ggaaaaaaat cacattgatg 1741 aatggctgct ggaattcctt aagatggtac agaaatgctg tactgcagat gaacaccagc 1801 catgttttga tacagagaaa ccagtactga ttgaacactg tcaaaaactc catccataag 1861 agtccataag agcaaagacc agtcttcaaa ctcactgagg aacaccttcc atctctcaaa 1921 cacaagaaaa aaaagttcct tcagctgaaa agagcatttg cttagagcat tcaactgtgt 1981 gttgtaataa ataaagcatt ttaaaaaat // LOCUS XEL74KSA 1957 bp ss-mRNA VRT 12-JUL-1990 DEFINITION Xenopus laevis 74 kDa serum albumin mRNA, complete cds. ACCESSION M21442 KEYWORDS serum albumin. SOURCE X.laevis adult liver hepatocyte, cDNA to mRNA, clone pX1A74.1. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 1957) AUTHORS Moskaitis,J.E., Sargent,T.D., Smith,L.H.Jr., Pastori,R.L. and Schoenberg,D.R. TITLE Xenopus laevis serum albumin: Sequence of the complementary deoxyribonucleic acids encoding the 68- and 74-kilodalton peptides and the regulation of albumin gene expression by thyroid hormone during development JOURNAL Mol. Endocrinol. 3, 464-473 (1989) STANDARD full staff_review REFERENCE 2 (bases 1801 to 1957) AUTHORS Schoenberg,D.R., Moskaitis,J.E., Smith,L.H. and Pastori,R.L. TITLE Extranuclear estrogen-regulated destabilization of Xenopus laevis serum albumin mRNA JOURNAL Mol. Endocrinol. 3, 805-814 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by D.Schoenberg, 14-NOV-1988. Draft entry and computer-readable sequence for [2] kindly provided by D.Schoenberg, 18-DEC-1989. 
FEATURES from to/span description pept < 1 1818 74 kDa serum albumin (AA at 1) sigp < 1 66 74 kDa serum albumin signal peptide matp 67 1815 74 kDa serum albumin signal 1935 1944 poly-A signal BASE COUNT 667 a 383 c 382 g 525 t ORIGIN 1 tggatcaccc tgatttgtct gttaattagc tcctctttca ttgaatcaag gatacttttc 61 aaaagagata cagatgcaga ccatcacaag catattgctg atgtatacac cgcattgact 121 gagcggacct tcaaaggact tacattggct attgtctctc agaatctcca gaaatgttcg 181 ttggaggagt tatctaagct ggtgaatgaa ataaatgact ttgccaaatc ctgtattaat 241 gacaaaactc ctgagtgtga aaaaccagtg ggcaccctgt tttttgacaa actctgtgca 301 gatccagcag tgggtgttaa ttatgagtgg agcaaagagt gctgtgccaa gcaagatcca 361 gagagggctc agtgcttcaa ggcgcacaga gatcatgaac atacttcaat aaagcctgaa 421 cctgaggaaa cctgcaaatt actcaaagaa caccctgatg atcttctctc agcgttcatt 481 catgaagagg caagaaacca tccagacctt tatccaccag cagtattagc attaaccaag 541 caatatcaca aacttgctga acattgttgt gaagaagaag acaaggaaaa atgcttctca 601 gaaaagatga agcaacttat gaaacaatct cattccattg aagataagca acatcatttc 661 tgctggattc tggataattt tcctgaaaaa gttcttaaag cactaaattt ggccagagtg 721 agccacagat atcctaaagc tgaattcaag cttgcccata attttactga ggaggttaca 781 cactttatta aagattgttg ccatgacgac atgtttgaat gcatgactga gaggctggag 841 cttactgagc atacctgtca acataaagat gagttatcat caaaacttga aaaatgctgt 901 aatatacctt tgcttgagcg tacatactgc attgtcacct tggaaaatga tgacgttcct 961 gctgaattgt ctcagccaat tacagaattt acagaggacc ctcatgtgtg tgagaagtat 1021 gctgagaata acgaagtttt cttaggaaga tatctccatg ctgtgtcaag aaaacaccag 1081 gaattgtctg aacaattcct tttgcaatct gcaaaagaat atgaatcttt gctgaacaag 1141 tgctgcaaaa cagacaatcc tcctgaatgc tacaaggatg gagctgacag atttatgaat 1201 gaagccaagg agagatttgc atatttgaaa caaaactgtg atatcttgca tgaacatgga 1261 gaatatctct ttgaaaatga attgctcata agatacacaa agaaaatgcc ccaagtgtca 1321 gatgaaacat tgattggaat agcacaccaa atggcagata ttggtgagca ctgctgtgcc 1381 gtacctgaaa atcaaaggat gccatgtgca gaaggagacc ttaccattct cattggaaaa 1441 atgtgtgaaa ggcaaaagaa gacatttata aataaccacg ttgctcattg ctgcactgac 1501 
tcatattctg ggatgcgttc atgctttact gctcttggtc cagatgagga ctatgtacca 1561 cccccagtta ctgatgacac atttcacttt gacgacaaga tatgcactgc taatgataaa 1621 gaaaaacagc atatcaaaca gaaattcctt gtgaagctga ttaaagttag tcctaaattg 1681 gaaaaaaatc acattgatga atgttctgct gaattcctta agatggtaca gaaatgctgt 1741 actgcagatg aacaccagcc atgttttgat acagagaaac cagtactgat tgaacactgt 1801 caaaaactcc atccataaga gtccattaga gcaaaggcca gccttcaaac tcactgagga 1861 acatcttcca tctctcacat gaaaaaagtt tcctccatct gaaaagaaaa tttgttcatt 1921 caactgtctg ttgaaataaa taaagcgttt aaaatat // LOCUS MUSHOX28A 216 bp ds-DNA ROD 12-JUL-1990 DEFINITION Mouse homeobox protein gene Hox-2.8, partial cds. ACCESSION M34004 KEYWORDS homeobox protein. SOURCE Mouse (strain CBA) DNA, clone YNOTHOX-2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 216) AUTHORS Rubock,M.J., Larin,Z., Cook,M., Papalopulu,N., Krumlauf,R. and Lehrach,H. TITLE A yeast artificial chromosome containing the mouse homeobox cluster Hox-2 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4751-4755 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Krumlauf, 04-MAY-1990, for release after publication. FEATURES from to/span description pept < 1 > 216 Hox-2.8 homeobox protein (AA at 1) site 34 216 homeobox Hox-2 BASE COUNT 51 a 68 c 66 g 31 t ORIGIN Chromosome 11D. 1 ggccccggat tgccagaatg cggcggcagc ggctcccgca gactgcgcac ggcctacacc 61 aacacgcaac tgctggagct ggagaaggag ttccacttca ataagtacct gtgccggccg 121 cgtcgcgtcg agatcgctgc cttgctggac ctcaccgaaa ggcaggtcaa agtctggttc 181 cagaaccgac gcatgaaaca caagcggcag acggag // LOCUS MUSHOX29A 183 bp ds-DNA ROD 12-JUL-1990 DEFINITION Mouse homeobox protein gene Hox-2.9, partial cds. ACCESSION M34005 KEYWORDS homeobox protein. SOURCE Mouse (strain CBA) DNA, clone YNOTHOX-2. 
ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 183) AUTHORS Rubock,M.J., Larin,Z., Cook,M., Papalopulu,N., Krumlauf,R. and Lehrach,H. TITLE A yeast artificial chromosome containing the mouse homeobox cluster Hox-2 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4751-4755 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Krumlauf, 04-MAY-1990, for release after publication. FEATURES from to/span description pept < 1 > 183 Hox-2.9 homeobox protein (AA at 1) site 1 183 homeobox Hox-2 BASE COUNT 50 a 53 c 54 g 26 t ORIGIN Chromosome 11D. 1 cccggcggtc tccgcacaaa cttcaccacg cgccagctga cggagctgga gaaggaattt 61 catttcaaca aatacctgag ccgtgcccgg agggtggaga tcgccgccac cctggagctc 121 aatgaaacgc aggtgaagat ctggttccag aaccggcgca tgaagcagaa gaaacgcgag 181 cga // LOCUS BCCIPMD 1101 bp ds-DNA BCT 12-JUL-1990 DEFINITION B.coagulans 3-isopropylmalate dehydrogenase gene, complete cds. ACCESSION M33099 KEYWORDS 3-isopropylmalate dehydrogenase. SOURCE B.coagulans (ATCC 7051) DNA. ORGANISM Bacillus coagulans Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1101) AUTHORS Sekiguchi,T., Ortega-Cesena,J., Nosoh,Y., Ohashi,S., Tsuda,K. and Kanaya,S. TITLE DNA and amino-acid sequences of 3-isopropylmalate dehydrogenase of Bacillus coagulans. Comparison with the enzymes of Saccharomyces cerevisiae and Thermus thermophilus JOURNAL Biochim. Biophys. 
Acta 867, 36-44 (1986) STANDARD simple staff_review FEATURES from to/span description pept 1 1101 3-isopropylmalate dehydrogenase BASE COUNT 288 a 255 c 328 g 230 t ORIGIN 1 atgaaaatga aactggccgt actgcccggc gatgggatcg ggccggaagt gatggatgca 61 gcgatccgcg ttttaaaaac agtgttggac aatgacgggc atgaagccgt ttttgaaaat 121 gcgctgattg ggggcgccgc cattgatgaa gcggggacgc ccctaccgga agaaacgctt 181 gacatttgcc gcaggagcga tgccattttg ctcggcgcgg taggggggcc gaaatgggat 241 cataacccgg cttccctccg cccggaaaaa ggcctgctcg ggctccggaa agaaatgggg 301 ctgtttgcga acctgcgccc ggttaaagca tatgccacac ttttaaacgc atcgccttta 361 aaacgggaac gtgtggaaaa cgtcgatctt gttattgtcc gcgaactgac gggcggcctc 421 tattttgggc gcccgagtga aaggcgcggg ccgggcgaga atgaagtggt agacacgctt 481 gcctatacaa gggaagagat tgaaagaatt attgagaaag cattccagct tgcccaaatc 541 agaagaaaaa aactggcatc cgtcgataag gcgaatgtgc tggaatcaag cagaatgtgg 601 cgcgaaattg cggaagaaac cgcgaaaaag tatccggacg tggaattgag ccatatgctt 661 gtcgactcaa cttcgatgca gctgattgca aatccgggcc aatttgatgt cattgtaaca 721 gagaatatgt tcggcgatat tttaagcgat gaagcgtccg tgattaccgg cagcctcggc 781 atgttgccat ccgcaagcct ccgttccgac cggttcggca tgtatgaacc ggtccacggc 841 tccgcgccgg atattgccgg gcagggaaaa gccaacccgc tcgggacagt gctgtcagcg 901 gctttgatgc tccgttattc gttcgggctt gagaaagaag cggcggccat tgaaaaagca 961 gtggatgatg tgcttcaaga cggctattgt acaggcgatt tgcaggtggc aaacggaaaa 1021 gtggtcagta caattgagct cacagaccgg ctgatcgaaa aattaaataa cagcgcagcc 1081 ggtccgcgca tttttcaata a // LOCUS DROSGS3A 151 bp ds-DNA SYN 12-JUL-1990 DEFINITION D.melanogaster synthetic Sgs-3 glue protein gene/Adh gene, 5' flank. ACCESSION M34726 KEYWORDS alcohol dehydrogenase; glue protein. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 151) AUTHORS Roark,M., Raghavan,K.V., Todo,T., Mayeda,C.A. and Meyerowitz,E.M. TITLE Cooperative enhancement at the Drosophila Sgs-3 locus JOURNAL Dev. Biol. 
139, 121-133 (1990) STANDARD simple staff_review FEATURES from to/span description mRNA 131 > 151 synthetic Sgs-3 glue protein mRNA recomb 130 131 Adh DNA end/Sgs-3 synthetic DNA start BASE COUNT 51 a 25 c 36 g 39 t ORIGIN 1 gtcgacccaa aagtatcaaa caaaggggag aaggcttgtg tttgcataat cgaaatactg 61 actccatttt tagaattgca gtttcagtga aagcgtacct ataaaaaggt gaggtatccg 121 caagaaaagt atcagtttgt ggtaccgagc t // LOCUS MZESOD3A 1037 bp ss-mRNA PLN 12-JUL-1990 DEFINITION Z.mays manganese superoxide dismutase (SOD-3) mRNA, complete cds. ACCESSION M33119 KEYWORDS manganese superoxide dismutase; superoxide dismutase. SOURCE Z.mays (strain W64A), cDNA to mRNA, clone pSod3.1c. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 1037) AUTHORS White,J.A. and Scandalios,J.G. TITLE Isolation and characterization of a cDNA for mitochondrial manganese superoxide dismutase (SOD-3) of maize and its relation to other manganese superoxide dismutases JOURNAL Biochim. Biophys. 
Acta 951, 61-70 (1988) STANDARD simple staff_review FEATURES from to/span description pept 46 753 manganese superoxide dismutase (SOD-3) (EC 1.15.1.1) BASE COUNT 237 a 259 c 296 g 245 t ORIGIN 1 gaattccacg cacccaggag atacagcgag cgagcgacca aagccatggc tctccgcacc 61 ctggcatcga agaaggtcct atccttcccg ttcggcggcg cgggccggcc gttggcggcg 121 gcggcgtctg cgaggggggt gacgacggtc acactccccg acctctccta cgacttcggc 181 gcgctggaac cggccatctc gggggagatc atgcgcttgc accaccaaaa gcaccacgcc 241 acctacgtcg ccaactacaa caaggcgctg gagcagcttg aaactgccgt ctccaagggc 301 gacgcctccg ctgtcgtcca gctgcaggcg gcgatcaagt tcaacggcgg cggtcatgtg 361 aaccattcaa tcttctggaa gaacctcaag cccattagcg aaggtggcgg ggagccgcct 421 catgggaaac ttggctgggc catcgatgag gattttggtt cgtttgaggc acttgtaaag 481 aagatgaatg cagaaggcgc tgctttccaa gggtctggat gggtgtggtt agctttggat 541 aaagaggcaa aaaaggtttc agttgaaaca acagctaatc aggatcctct ggtgactaaa 601 ggtgcaagct tggttccgct gttggggatt gatgtctggg aacatgcata ctacctgcag 661 tacaagaatg ttaggccgga ttacctgaac aacatctgga aggtgatgaa ctggaaatat 721 gctggagagg tgtacgaaaa tgttcttgct tgaattgtct taacggacaa tacacatctg 781 cgcgcgcggg tttcggctgt ttgatcatgt gaaataaaga tggacctgtc tagcggctgg 841 accttgtgta catttcactg agatagacta atggacggcc tgccgatttt gttcgtcctg 901 cttgcgtgct actctgtctc tgctcctagt ttttggcatc atgtttatgt tgagcaaggt 961 gatgcccaag ggaagccatt cccactcttg tctccattaa taaaatcagc tgagcttccg 1021 atgtttgctt ggaattc // LOCUS RATA2UGLBA 300 bp ds-DNA ROD 12-JUL-1990 DEFINITION Rat alpha-2u-globulin gene, 5' end. ACCESSION M33213 KEYWORDS alpha-2u-globulin. SOURCE Rat DNA, clone 91. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 300) AUTHORS Kurtz,D.T., McCullough,L., Bishop,D.K. and Manos,M.M. TITLE DNA sequences required for hormonal induction of rat alpha-2u-globulin genes JOURNAL Cold Spring Harb. Symp. Quant. Biol. 
47, 985-988 (1983) STANDARD simple staff_review FEATURES from to/span description pept 292 > 300 alpha-2u-globulin mRNA 233 > 300 alpha-2u-globulin mRNA BASE COUNT 108 a 58 c 76 g 58 t ORIGIN 1 acccactaat ttttcgtggg aatatgtttt gcgaaatgta tgagtgatag aatcaatcca 61 taggagatga catcgccaag tttcaaaagg gcaggaacaa tcgtggcttc acatcagtac 121 atggaaaaca ttccacaaag cctgagaaga atggaaggcc catatgagaa ggaaaaaaaa 181 acaccgaaac ccagagagag tataaagacg agcaaagtgc tggaggtgga gtgtgggcac 241 catcagcaga gggattgtcc cgacagagag gcaattctat tccctaccaa catgaagctg // LOCUS SHRRGBA 409 bp ds-DNA INV 12-JUL-1990 DEFINITION Brine shrimp 5.8S ribosomal RNA gene. ACCESSION M33097 KEYWORDS 5.8S ribosomal RNA; ribosomal RNA. SOURCE Brine shrimp DNA. ORGANISM Artemia sp. Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Branchiopoda; Sarsostraca; Anostraca; Artimiidae. REFERENCE 1 (bases 1 to 409) AUTHORS Vaughn,J.C., Sperbeck,S.J. and Hughes,M.J. TITLE Molecular cloning and characterization of ribosomal RNA genes from the brine shrimp: Nucleotide sequence analysis and evolution of the 5.8 S rRNA gene region and its flanking nucleotides JOURNAL Biochim. Biophys. Acta 783, 144-151 (1984) STANDARD simple staff_review FEATURES from to/span description rRNA 177 339 5.8S ribosomal RNA BASE COUNT 104 a 82 c 116 g 107 t ORIGIN 1 ggtgaaaaat agtcatattg gggacgagag tggcttcttg tgattcaagg atcatggata 61 ccactccgcg agactaaagg gagtgaaggt gagcttgccc caacagagca tggcttgagg 121 tgtgcaaggg tgcaattgca ttggccttgt ttgagggaga atttgaaaca ttcaatagaa 181 tgacccttga ggatggatca cttggctcac attacgaaga cgaacgcagc tagacgcgtg 241 attccatgcg aactgcagga cacatggaac gtctatattt tgaacgcaaa ttgcatgtcc 301 agcctttgag cttggactac gtctggctga gagacggatg tttttatcat tcggtcatct 361 gggtataccg tcactgcgag gctccttgct tctatagggc cgttgatcg // LOCUS GLATSAA 3053 bp ss-mRNA INV 12-JUL-1990 DEFINITION G.lamblia trophozoite surface antigen (TSA 417) mRNA, complete cds. ACCESSION M33641 KEYWORDS major surface protein; trophozoite surface antigen. 
SOURCE G.lamblia (strain WB, ATCC 30957), cDNA to mRNA, clone pFDG417. ORGANISM Giardia lamblia Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Zoomastigophora; Diplomonadida; Diplomonadina; Hexamitidae. REFERENCE 1 (bases 1 to 3053) AUTHORS Gillin,F.D., Hagblom,P., Harwood,J., Aley,S.B., Reiner,D.S., McCaffery,M., So,M. and Guiney,D. TITLE Isolation and expression of the gene for a major surface protein of Giardia lamblia JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4463-4467 (1990) STANDARD full staff_entry COMMENT Draft entry and computer readable sequence for [1] kindly submitted by S.B.Aley, 08-APR-1990. FEATURES from to/span description pept 205 2346 trophozoite surface antigen protein precursor (TSA 417) sigp 205 255 trophozoite surface antigen protein signal peptide matp 256 2343 trophozoite surface antigen binding 188 193 ribosomal binding site (put.) signal 67 72 TATA box signal 2359 2364 misc. signal signal 2831 2837 polyA signal BASE COUNT 785 a 777 c 865 g 626 t ORIGIN 1 gaattcttac gctatgtacg gcttatattg acaggattgc tacaggctat gaatactatg 61 ctagagtata aacatgtatc cacggcgatc tgggggtctt ctcggagact agtggccagt 121 taccatggac acgcaagaag ctgtctgtgg tagcctggcc ccgggctttg cgttggaagc 181 gccacccagc aggtcggcgg cctaatgttc ggcagatttt tgctcgcgat cgtcatcctt 241 cagctggcac ggacagcctg cacccaagaa gctgacgatg gaaagtgtaa aacgtgtggc 301 gtcaccattg gtcaagacac ttggtgctct gagtgcaacg gagcaaacta cgcccccgtg 361 aacggccagt gtgtagacgt caacgctgag gggccaagca aaacgctttg tccgcaacat 421 agcgcaggga agtgcacgca gtgcggaggc aactcattca tgtacaagga cggctgttat 481 tccagcggag aaggccttcc tggacacagc ctgtgcttaa gttccgacgg agatggcgta 541 tgcaccgagg cggccccggg gtactttgct ccggtgggag cggcgaacac tgaacagtct 601 gtgatcgcat gtggcgatac aactggagta acaatagcag ctggcggaaa cacatacaag 661 ggcattgctg actgcgcaga atgcagcgcc cctgacgcaa cagccggcgc tgaggccggc 721 aaggttgcaa cgtgtaccaa gtgtggagtc agtaagtatc tcaaggataa cgtgtgcgta 781 gataaagccc aatgtaattc tggtagcact aataagttcg ttgcagttga tgattctgag 841 
aatggcaaca agtgtgtttc ttgcagcgat aacctcaatg gtggcgttgc caattgcgac 901 acctgtagct acgatgagca atctaagaag atcaagtgta caaaatgcac cgataacaac 961 tacctgaaaa ccacaagcga aggcacgtcg tgcgtacaaa aagaccaatg caaagacggc 1021 ttcttcccca aggatgacag cagtgcagga aataaatgcc tcccttgtaa tgacagcacc 1081 gacggaattg ccaattgcgc cacgtgtgct ctggttagtg gccgatcagg ggctgccctc 1141 gttacatgct ccgcctgcac ggatggatac aagcctagtg ccgacaaaac tacgtgcgag 1201 gcggtaagca actgcaagac ccccggatgc aaggcgtgca gcaacgaagg aaaggagaac 1261 gaggtctgca cagactgtga tggtagcaca tacctcacgc cgacaagcca gtgcatagac 1321 agctgcgcta agattggaaa ctactatgga gccaccgaag gagcaaagaa actctgtaaa 1381 gagtgcactg cggctaactg caagacttgc gatgatcagg ggcagtgcca agcatgcaac 1441 gacgggttct ataaaaacgg cgacgcgtgc tctccgtgcc acgaaagctg caagacatgc 1501 agcgcaggca ctgccagcga ctgcaccgag tgtcccaccg gaaaagcact caggtacggg 1561 gacgacggta ctaagggcac gtgcggagaa ggctgcacaa cgggcacagg agcaggagca 1621 tgcaagacgt gtgggctcac tatcgatggc gctagctact gctctgagtg cgccacaacg 1681 acagaatatc ctcaaaatgg cgtctgtgca ccaaaggcta gccgcgccac acctacgtgc 1741 aacgactcgc ctattcagaa tggtgtttgt ggaacgtgtg ccgataacta ctttaagatg 1801 aacggagggt gctatgaaac agtcaagtat cccggtaaga cggtttgcat tagtgcacca 1861 aatggtggta cgtgtcaaaa agctgcagat ggttacaagt tggattcagg tacccttaca 1921 gtttgttctg aagggtgtaa ggaatgtgct agcagtaccg actgtactac gtgtctggac 1981 ggatatgtaa agagtgcaag tgcgtgcaca aagtgtgacg ctagctgcga aacatgtaat 2041 ggagcagcta caacatgtaa ggcgtgtgct acgggatact acaagaccgc atcaggagaa 2101 ggtgcgtgca cgtcttgtga aagtgatagc aacggagtca ctggtattaa gggctgccta 2161 aactgcgccc ctccgcccaa caataaaggt tccgtcctct gctacctcat aaaggatagc 2221 ggtagcacca acaagagcgg gctctccact ggtgccatag cgggtatctc cgtcgctgtc 2281 atcgttgttg tcggcggcct catcggcttc ctctgctggt ggttcctctg cagggggaag 2341 gcgtagatgt acttagatag taaaccgtca tcgatgggtc tgctcggtgt ctgttcctgc 2401 tagcacagac agcagggtct cagccagtgc accaagcatc aggcgtgtgg atgaatgttt 2461 ggcttatcca gtagcgccct tgcgtgtcca cgggctcaca tgtgaccaac agtgctgtac 2521 aggtaggtag 
agaccagacc acggatccca tgcactgaat gcaactcctt tgcagccgtg 2581 atgggtcagt tgtggcaatt tataagacaa aacgagggcc ctgtccatcg cacagtccct 2641 tgcagcgctt ccagacgcgg agctggcggc ggtcctgcac tacctcgccg agttccgtgg 2701 gccagaggtc ttcggggact gccttcagac cttgctcagc tcgacgagcc ccgggacgag 2761 accctcaggc ttgcggcaca gaaagacata cgcggcttcc tcgaaaggat cgacagaaag 2821 gactcagctc aataaatgcc actcttcacg tcctcgttcc gcggtacatc gtgtagctgt 2881 acatccagtg gaacttttcg actcagaagt gaagttgatg agctctgtgg tagataactt 2941 ctcatggcct ggagtggcag acactgtgag cagctgattg gcatgcaatt cacaccctag 3001 acgcggtgga gagatacccc cgtccatcca ttacaaacaa gtcccacaag ctt // LOCUS HUMPLG01 1272 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 1. ACCESSION M33272 J05286 KEYWORDS plasminogen. SEGMENT 1 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1272) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. 
FEATURES from to/span description pept 1077 + 1125 plasminogen precursor, exon 1 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" sigp 1077 + 1125 plasminogen signal peptide pre-msg 957 > 1272 PLG mRNA and introns IVS 1126 > 1272 PLG intron A BASE COUNT 391 a 231 c 269 g 381 t ORIGIN 1 gaattccgca gacattccac ccaagaccat tgggctccca cctctactct tttgccagtt 61 aatgaatagg caggaatttc actgcctgga aagaggaaca atgctttctg gtccttattt 121 cacatctaaa atagagaggt caattgattt attcctaaat atctttgaac actaaaatag 181 aagttttaca gcatatatac tacctggttg ctctagactt aagccaggga aaagtacaga 241 ttcaacattt aaaattgaga tagacgcttt ccacttaatg ctaccagtct tgctttattt 301 catgagaatg agaatataat aatatggcat acgttcattt gggggaaaga ttgatgtctt 361 ataacataat ttataattac agaaaacatg tgagttcact gggaataaat aaattttgaa 421 gataataaga tactttcact tatgtcataa tttctatgtc atttggtgta ggatgtagag 481 atattaacgt ttacacctaa ctcaagtttg tcatctaaga cctgaaaggg ttttgtctat 541 cagctgcacc cctgggtaga gacacaacct tggggaaggc ctcagcccca tccctcgtac 601 agcaggaatg agaacagccc tgcctgttgg gaagcttgag ggaggctatg gacgtgcagc 661 gcttggcaga aggtctcgtc atggaaggtt ccagcaaatg tgagatactt ttatgatttc 721 attttctcca aaagaaaggg aataagagaa gaggggagga aataagacta attgcgagag 781 ataaagtaca agggtgaggg aaggaataag gagacatgac ggcagcgtgg agcagccgag 841 gggggagatt gctttcacca cttcccagca tctattgcag attccaccct caaacatttt 901 gtaaggactc tttattcaag gtaacgtttg aaccctgctg agccagtggc atgggtctct 961 gagagaatca ttaacttaat ttgactatct ggtttgtgga tgcgtttact ctcatgtaag 1021 tcaacaacat cctgggattg ggacccactt tctgggcact gctggccagt cccaaaatgg 1081 aacataagga agtggttctt ctacttcttt tatttctgaa atcaggtaag acatagtttt 1141 tttaaattat aataattatt ttttctccca caatgtagta aaaatacata tgccatggct 1201 ttatgtgcaa ttcatttaat ttttgattca tgaaacttcc agttgaaaat cttgtataag 1261 attgaggaat tc // LOCUS HUMPLG02 161 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, intron A (partial). ACCESSION M33273 J05286 KEYWORDS plasminogen. SEGMENT 2 of 24 SOURCE Human leukocyte and lung fibroblast DNA. 
ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 161) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review FEATURES from to/span description IVS < 1 > 161 plasminogen intron A /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" BASE COUNT 52 a 46 c 20 g 43 t ORIGIN Unknown number of base pairs after segment 1. 1 gaattcaccc atttaggcat acaatccaat ggatttcaag atattgagag ttgtgcagcc 61 accatcagaa taaattttaa aactattcat acccccaaaa acgcactcca ctctccttag 121 ctgttacccc aatctgcagc ttctggcaac cactaatcta c // LOCUS HUMPLG03 376 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 2. ACCESSION M33274 J05286 KEYWORDS plasminogen. SEGMENT 3 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 376) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept + 196 + 331 plasminogen (PLG) precursor, exon 2 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" sigp 196 203 plasminogen signal peptide matp 204 + 331 plasminogen IVS < 1 195 PLG intron A IVS 332 > 376 PLG intron B BASE COUNT 114 a 74 c 78 g 110 t ORIGIN Unknown number of base pairs after segment 2. 
1 tctttattta tgtccaaatg cccgactgtg tgttcttaac taaacatttt gattcatagc 61 tacccattct acttccagta aacagaaagt tttatttggt taatgctaac caaatagatt 121 aaaaggaagt catgacaatt agacattgac attgatttac tgaccattta ttccacttgg 181 atctcccacc tctaggtcaa ggagagcctc tggatgacta tgtgaatacc cagggggctt 241 cactgttcag tgtcactaag aagcagctgg gagcaggaag tatagaagaa tgtgcagcaa 301 aatgtgagga ggacgaagaa ttcacctgca ggtatttcca ttgtcgttgc acctacgcag 361 gaatctgtaa ttcaga // LOCUS HUMPLG04 291 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 3. ACCESSION M33275 J05286 KEYWORDS plasminogen. SEGMENT 4 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 291) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept + 46 + 152 plasminogen (PLG) precursor, exon 3 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 46 + 152 plasminogen IVS < 1 45 PLG intron B IVS 153 > 291 PLG intron C BASE COUNT 88 a 60 c 51 g 92 t ORIGIN Unknown number of base pairs after segment 3. 1 taaataaaga aaaatactta ttggatttcc tgcttcgttc tgcagggcat tccaatatca 61 cagtaaagag caacaatgtg tgataatggc tgaaaacagg aagtcctcca taatcattag 121 gatgagagat gtagttttat ttgaaaagaa aggtgagtac attttcttcc tcctcctcct 181 actgtcctcc ccatcctccc actcttcctc tttctctatt ctatctttaa tttatgagac 241 cagaggagga aggcactatc gtgttataaa actgaattct gagttaggac a // LOCUS HUMPLG05 69 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, intron C (partial). ACCESSION M33276 J05286 KEYWORDS plasminogen. 
SEGMENT 5 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 69) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review FEATURES from to/span description IVS < 1 > 69 plasminogen intron C /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" BASE COUNT 26 a 9 c 13 g 21 t ORIGIN Unknown number of base pairs after segment 4. 1 aagtgcagat taaatctaaa ctttatctgg tgaagttatt agttcttaca agtagcaagc 61 aaacggtaa // LOCUS HUMPLG06 57 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, intron C (partial). ACCESSION M33277 J05286 KEYWORDS plasminogen. SEGMENT 6 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 57) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review FEATURES from to/span description IVS < 1 > 57 plasminogen intron C /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" BASE COUNT 18 a 12 c 7 g 20 t ORIGIN Unknown number of base pairs after segment 5. 1 agtgcaacat ctacaataat tactttcctt atttttgaag tggaccatat ctcgaca // LOCUS HUMPLG07 341 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 4. ACCESSION M33278 J05286 KEYWORDS plasminogen. SEGMENT 7 of 24 SOURCE Human leukocyte and lung fibroblast DNA. 
ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 341) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept + 167 + 281 plasminogen (PLG) precursor, exon 4 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 167 + 281 plasminogen IVS < 1 166 PLG intron C IVS 282 > 341 PLG intron D BASE COUNT 89 a 73 c 77 g 102 t ORIGIN Unknown number of base pairs after segment 6. 1 tggctcagtt tactgcagcc tttttgcaga tgcaaaagat gatcttttag aaagcagaaa 61 cagggggtct ggtgcatgag atctttttct caacgtgact atgctgtgca gaccttcatg 121 tggtgtcttg tgaaagactt tgaccactgt gtggacttcc cttcagtgta tctctcagag 181 tgcaagactg ggaatggaaa gaattacaga gggacgatgt ccaaaacaaa aaatggcatc 241 acctgtcaaa aatggagttc cacttctccc cacagaccta ggtaagacat tccctttcat 301 ctttgtgttc atctactgta aagttgtccc tctgtgtctg t // LOCUS HUMPLG08 354 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 5. ACCESSION M33279 J05286 KEYWORDS plasminogen. SEGMENT 8 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 354) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 
265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept + 69 + 208 plasminogen (PLG) precursor, exon 5 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 69 + 208 plasminogen IVS < 1 68 PLG intron D IVS 209 > 354 PLG intron E BASE COUNT 101 a 83 c 72 g 98 t ORIGIN Unknown number of base pairs after segment 7. 1 ttctgccttg ctaatagcaa gctgattttt agaatatagt ctaagtgctt cttttccatc 61 ctccccagat tctcacctgc tacacacccc tcagagggac tggaggagaa ctactgcagg 121 aatccagaca acgatccgca ggggccctgg tgctatacta ctgatccaga aaagagatat 181 gactactgcg acattcttga gtgtgaaggt caggagtggt tctagaaaat gttttcattt 241 ctgcccttca cctgtaaaat aatttgttgt aaagcccctt cccacaggga tgttattaat 301 aattgagtaa cgtattcacc tctgggaaag aagcaaaacc ccagaattaa cctg // LOCUS HUMPLG09 206 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 6. ACCESSION M33280 J05286 KEYWORDS plasminogen. SEGMENT 9 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 206) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept + 56 + 176 plasminogen (PLG) precursor, exon 6 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 56 + 176 plasminogen IVS < 1 55 PLG intron E IVS 177 > 206 PLG intron F BASE COUNT 51 a 57 c 39 g 59 t ORIGIN Unknown number of base pairs after segment 8. 
1 ttcatccatt tcagttttct tcttcctctc tgtccttcct tcccactctg tccagaggaa 61 tgtatgcatt gcagtggaga aaactatgac ggcaaaattt ccaagaccat gtctggactg 121 gaatgccagg cctgggactc tcagagccca cacgctcatg gatacattcc ttccaagtaa 181 gtctcactgg gaaaaacatt ccatgt // LOCUS HUMPLG10 100 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, intron F (partial). ACCESSION M33281 J05286 KEYWORDS plasminogen. SEGMENT 10 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 100) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review FEATURES from to/span description IVS < 1 > 100 plasminogen intron F /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" BASE COUNT 29 a 17 c 26 g 28 t ORIGIN Unknown number of base pairs after segment 9. 1 ccaaaatgat aaggtcactg attctgttga gtgattttta cacatgtaaa ctgttagaaa 61 aacagtgctt ggcagccggg catggtggca catgctgtag // LOCUS HUMPLG11 247 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 7. ACCESSION M33282 J05286 KEYWORDS plasminogen. SEGMENT 11 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 247) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. 
FEATURES from to/span description pept + 68 + 186 plasminogen (PLG) precursor, exon 7 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 68 + 186 plasminogen IVS < 1 67 PLG intron F IVS 187 > 247 PLG intron G BASE COUNT 70 a 63 c 51 g 63 t ORIGIN Unknown number of base pairs after segment 10. 1 cttgaaaaag agtcttatcc atgaatgtaa atgttcagtg ctactaaaat ctttcttgtc 61 cattcagatt tccaaacaag aacctgaaga agaattactg tcgtaacccc gatagggagc 121 tgcggccttg gtgtttcacc accgacccca acaagcgctg ggaactttgc gacatccccc 181 gctgcagtga gtatgatgca cacccagatt ccaggatttg gacctgccct gttcttgaaa 241 tcaaaag // LOCUS HUMPLG12 244 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 8. ACCESSION M33283 J05286 KEYWORDS plasminogen. SEGMENT 12 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 244) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept + 47 + 209 plasminogen (PLG) precursor, exon 8 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 47 + 209 plasminogen IVS < 1 46 PLG intron G IVS 210 > 244 PLG intron H BASE COUNT 68 a 72 c 45 g 59 t ORIGIN Unknown number of base pairs after segment 11. 
1 ctcaaaaaat atatatattc attgtaactt attttgccca ttcaagcaac acctccacca 61 tcttctggtc ccacctacca gtgtctgaag ggaacaggtg aaaactatcg cgggaatgtg 121 gctgttaccg tgtccgggca cacctgtcag cactggagtg cacagacccc tcacacacat 181 aacaggacac cagaaaactt tccctgcaag taagtcccct ccagtctcat tctgctgcta 241 tgga // LOCUS HUMPLG13 217 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 9. ACCESSION M33284 J05286 KEYWORDS plasminogen. SEGMENT 13 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 217) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept + 36 + 181 plasminogen (PLG) precursor, exon 9 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 36 + 181 plasminogen IVS < 1 35 PLG intron H IVS 182 > 217 PLG intron I BASE COUNT 64 a 56 c 52 g 45 t ORIGIN Unknown number of base pairs after segment 12. 1 ttggaaagct aaactcacaa tcacttcttt ttcagaaatt tggatgaaaa ctactgccgc 61 aatcctgacg gaaaaagggc cccatggtgc catacaacca acagccaagt gcggtgggag 121 tactgtaaga taccgtcctg tgactcctcc ccagtatcca cggaacaatt ggctcccaca 181 ggtaagcaag ggtatgggag cttactgagg gcccaag // LOCUS HUMPLG14 409 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 10. ACCESSION M33285 J05286 KEYWORDS plasminogen. SEGMENT 14 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. 
REFERENCE 1 (bases 1 to 409) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept + 132 + 291 plasminogen (PLG) precursor, exon 10 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 132 + 291 plasminogen IVS < 1 131 PLG intron I IVS 292 > 409 PLG intron J BASE COUNT 116 a 102 c 73 g 118 t ORIGIN Unknown number of base pairs after segment 13. 1 tctgtctgct aatacagaaa agagaacagt cataattctc agaggctacc gtactgtttt 61 tgtcataaat tgcttcatgc ttcttttttt tcagtaattg ttaagcttga tttcttttat 121 tttaatttca gcaccacctg agctaacccc tgtggtccag gactgctacc atggtgatgg 181 acagagctac cgaggcacat cctccaccac caccacagga aagaagtgtc agtcttggtc 241 atctatgaca ccacaccggc accagaagac cccagaaaac tacccaaatg cgtatgtctt 301 tgatttttac tgtaagaggg gcatcagcca actgaaattt ctgttaaaag agccatgctt 361 catgcttcaa gccaacttcc taggaccaaa tttctcttag acccagaat // LOCUS HUMPLG15 266 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 11. ACCESSION M33286 J05286 KEYWORDS plasminogen. SEGMENT 15 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 266) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. 
FEATURES from to/span description pept + 60 + 241 plasminogen (PLG) precursor, exon 11 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 60 + 241 plasminogen IVS < 1 59 PLG intron J IVS 242 > 266 PLG intron K BASE COUNT 66 a 68 c 65 g 67 t ORIGIN Unknown number of base pairs after segment 14. 1 ctgggtgccc ctgaatattc tcccacctct tgtgacctgt attgttttgg aatttccagt 61 ggcctgacaa tgaactactg caggaatcca gatgccgata aaggcccctg gtgttttacc 121 acagacccca gcgtcaggtg ggagtactgc aacctgaaaa aatgctcagg aacagaagcg 181 agtgttgtag cacctccgcc tgttgtcctg cttccaaatg tagagactcc ttccgaagaa 241 ggtaagaaat ctgtggctgg acatct // LOCUS HUMPLG16 224 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 12. ACCESSION M33287 J05286 KEYWORDS plasminogen. SEGMENT 16 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 224) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept + 26 + 174 plasminogen (PLG) precursor, exon 12 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 26 + 174 plasminogen IVS < 1 25 PLG intron K IVS 175 > 224 PLG intron L BASE COUNT 57 a 56 c 56 g 55 t ORIGIN Unknown number of base pairs after segment 15. 
1 aatcatccat tttttccctg tacagactgt atgtttggga atgggaaagg ataccgaggc 61 aagagggcga ccactgttac tgggacgcca tgccaggact gggctgccca ggagccccat 121 agacacagca ttttcactcc agagacaaat ccacgggcgg gtctggaaaa aaatgtaagc 181 cactttgatt tggactcttt ggccttttgc tcaccaatct ttgc // LOCUS HUMPLG17 223 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 13. ACCESSION M33288 J05286 KEYWORDS plasminogen. SEGMENT 17 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 223) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept + 31 + 124 plasminogen (PLG) precursor, exon 13 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 31 + 124 plasminogen IVS < 1 30 PLG intron L IVS 125 > 223 PLG intron M BASE COUNT 56 a 44 c 58 g 65 t ORIGIN Unknown number of base pairs after segment 16. 1 gctggagctt acatgccttc ttgttttcag tactgccgta accctgatgg tgatgtaggt 61 ggtccctggt gctacacgac aaatccaaga aaactttacg actactgtga tgtccctcag 121 tgtggtaggt tgccttcttt ttggtaagga aactgcttac ttaatatgga tttgcaacaa 181 aaaaggaaaa gggcttctga gcagactgct tctggggagg aga // LOCUS HUMPLG18 296 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 14. ACCESSION M33289 J05286 KEYWORDS plasminogen. SEGMENT 18 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. 
REFERENCE 1 (bases 1 to 296) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept + 126 + 246 plasminogen (PLG) precursor, exon 14 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 126 + 246 plasminogen IVS < 1 125 PLG intron M IVS 247 > 296 PLG intron N BASE COUNT 74 a 72 c 69 g 81 t ORIGIN Unknown number of base pairs after segment 17. 1 atgattttac tatttagttc ggcctttaag atgtcaaaaa ctcagtgctt ggaatttgtc 61 tcgaattaca ccacaaaatt gctaccttgt ctcaaatggg atttctttcc caccttgtgc 121 cacagcggcc ccttcatttg attgtgggaa gcctcaagtg gagccgaaga aatgtcctgg 181 aagggttgta ggggggtgtg tggcccaccc acattcctgg ccctggcaag tcagtcttag 241 aacaaggtaa gaacaggccc agaaacgatt tatactgtcc ctccacgtaa gccctg // LOCUS HUMPLG19 361 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 15. ACCESSION M33290 J05286 KEYWORDS plasminogen. SEGMENT 19 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 361) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. 
FEATURES from to/span description pept + 66 + 140 plasminogen (PLG) precursor, exon 15 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 66 + 140 plasminogen IVS < 1 65 PLG intron N IVS 141 > 361 PLG intron O BASE COUNT 93 a 77 c 80 g 111 t ORIGIN Unknown number of base pairs after segment 18. 1 ttctgtacaa tggagcagaa caaagtatca atttaactaa aatttgaact aaatcctctt 61 tccaggtttg gaatgcactt ctgtggaggc accttgatat ccccagagtg ggtgttgact 121 gctgcccact gcttggagaa gtatgtttag gggacaattg acatgaagtc ttgtcttaaa 181 tactttttct gtccttcttt tcctcctttc ctcctttcct ttctcactct tcctcccttc 241 cttctctggc tgtgacacta gggaccaggc cagggcaatt ggataagaga gaagggaagg 301 gtttctagaa agaaactgca gaggaaagac acagtacaga tgattttgtg ggcctgaata 361 a // LOCUS HUMPLG20 331 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 16. ACCESSION M34272 J05286 KEYWORDS plasminogen. SEGMENT 20 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 331) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept + 41 + 181 plasminogen (PLG) precursor, exon 16 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 41 + 181 plasminogen IVS < 1 40 PLG intron O IVS 182 > 331 PLG intron P BASE COUNT 80 a 81 c 79 g 91 t ORIGIN Unknown number of base pairs after segment 19. 
1 ctggaccata ttttcctctt gacatcctca tcttttctag gtccccaagg ccttcatcct 61 acaaggtcat cctgggtgca caccaagaag tgaatctcga accgcatgtt caggaaatag 121 aagtgtctag gctgttcttg gagcccacac gaaaagatat tgccttgcta aagctaagca 181 ggtactcgtt cacctgtggt cttcacccca cgctggtgaa gatatttgct ttatgtctgg 241 gttttatggg ccatggcact gcatggcagt ggggaggaac tgtctatcac atgaaaggct 301 caagggcttt ggggacagca tcaatcttca a // LOCUS HUMPLG21 251 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 17. ACCESSION M34273 J05286 KEYWORDS plasminogen. SEGMENT 21 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 251) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept + 41 + 147 plasminogen (PLG) precursor, exon 17 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 41 + 147 plasminogen IVS < 1 40 PLG intron P IVS 148 > 251 PLG intron Q BASE COUNT 71 a 61 c 51 g 68 t ORIGIN Unknown number of base pairs after segment 20. 1 gcagagcagt caaacataac tgctgatgct tttctttcag tcctgccgtc atcactgaca 61 aagtaatccc agcttgtctg ccatccccaa attatgtggt cgctgaccgg accgaatgtt 121 tcatcactgg ctggggagaa acccaaggtg agataaattc cattgcccac ataacgaatt 181 ggttttgacc tacagtccat gtgacaaaat gatcattttg gagaaagctg tgcaaattcc 241 tatccatgaa t // LOCUS HUMPLG22 101 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, intron Q (partial). ACCESSION M34274 J05286 KEYWORDS plasminogen. SEGMENT 22 of 24 SOURCE Human leukocyte and lung fibroblast DNA. 
ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 101) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review FEATURES from to/span description IVS < 1 > 101 plasminogen intron Q /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" BASE COUNT 24 a 34 c 24 g 19 t ORIGIN Unknown number of base pairs after segment 21. 1 agaagggtgc tccctcacac aactacagca gtccaggtga tgcacccact gcccaatgct 61 tggtagtcaa gaggagcttc ctccctgcag ctctgcccag a // LOCUS HUMPLG23 254 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 18. ACCESSION M34275 J05286 KEYWORDS plasminogen. SEGMENT 23 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 254) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept + 69 + 214 plasminogen (PLG) precursor, exon 18 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 69 + 214 plasminogen IVS < 1 68 PLG intron Q IVS 215 > 254 PLG intron R BASE COUNT 61 a 54 c 63 g 76 t ORIGIN Unknown number of base pairs after segment 22. 
1 tgttctggaa tatcctcctg aatgtgtttt gggtgcagtt gccatttctt tcatcttttt 61 aaacacaggt acttttggag ctggccttct caaggaagcc cagctccctg tgattgagaa 121 taaagtgtgc aatcgctatg agtttctgaa tggaagagtc caatccaccg aactctgtgc 181 tgggcatttg gccggaggca ctgacagttg ccaggtaagc aaagatcaag agaccaaagt 241 tagtcttgtg ctct // LOCUS HUMPLG24 1236 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 19. ACCESSION M34276 J05286 KEYWORDS plasminogen. SEGMENT 24 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1236) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept + 41 202 plasminogen (PLG) precursor, exon 19 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 41 199 plasminogen pre-msg < 1 427 PLG mRNA and introns (alt.) pre-msg < 1 458 PLG mRNA and introns (alt.) pre-msg < 1 1184 PLG mRNA and introns (alt.) IVS < 1 40 PLG intron R BASE COUNT 365 a 233 c 297 g 341 t ORIGIN Unknown number of base pairs after segment 23. 
1 agcctaaccc tcacatgcat ttttctctcc ctctgtatag ggtgacagtg gagggcctct 61 ggtttgcttc gagaaggaca aatacatttt acaaggagtc acttcttggg gtcttggctg 121 tgcacgcccc aataagcctg gtgtctatgt tcgtgtttca aggtttgtta cttggattga 181 gggagtgatg agaaataatt aattggacgg gagacagagt gacgcactga ctcacctaga 241 ggctgggacg tgggtaggga tttagcatgc tggaaataac tggcagtaat caaacgaaga 301 cactgtcccc agctaccagc tacgccaaac ctcggcattt tttgtgttat tttctgactg 361 ctggattctg tagtaaggtg acatagctat gacatttgtt aaaaataaac tctgtactta 421 actttgattt gagtaaattt tggttttggt cttcaacatt ttcatgctct ttgttcaccc 481 caccaatttt aaatgggcag atggggggat ttagctgctt ttgataagga acagctgcac 541 aaaggactga gcaggctgca aggtcacaga ggggagagcc aagaagttgt ccacgcattt 601 acctcatcag ctaacgaggg cttgacatgc atttttactg tctttattcc tgacactgag 661 atgaatgttt tcaaagctgc aacatgcatg gggagtcatg cgaaccgatt ctgttattgg 721 gaatgaaatc tgtcaccgac tgcttgactt gagcccaggg gacacagagc agagagctgt 781 atatgatgga gtgaaccggt ccatggatgt gtaacacaag accaactgag agtctgaatg 841 ttattctggg gcacacgtga gtctaggatt ggtgccaaga gcatgtaaat gaacaacaag 901 caaatattga aggtggacca cttatttccc attgctaatt gcctgcccgg ttttgaaaca 961 gtctgcagta cacacggtga caggagaatg acctgtggga gagatacatg tttagaagga 1021 agagaaagga caaaggcaca cgttttacca tttaaaatat tgttaccaaa caaaaatatc 1081 cattcaaaat acaatttaac aatgcaacag tcatcttaca gcagagaaat gcagagaaaa 1141 gcaaaactgc aagtgactgt gaataaaggg tgaatgtagt ctcaaatcct caaagagctg 1201 tgtttatttc attgacaaat agattatttg tattca // LOCUS PARGANTI1 162 bp ds-DNA INV 12-JUL-1990 DEFINITION P.primaurelia G surface antigen gene, 5' end. ACCESSION M11194 KEYWORDS G surface antigen; surface antigen. SEGMENT 1 of 2 SOURCE P.primaurelia macronucleus DNA. ORGANISM Paramecium primaurelia Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; Hymenostomata; Hymenostomatida; Tetrahymenina; Glaucomidae. REFERENCE 1 (bases 1 to 162) AUTHORS Meyer,E., Caron,F. and Baroin,A. 
TITLE Macronuclear structure of the G surface antigen gene of Paramecium primaurelia and direct expression of its repeated epitopes in Escherichia coli JOURNAL Mol. Cell. Biol. 5, 2414-2422 (1985) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by F.Caron, 24-OCT-1985. FEATURES from to/span description pept 19 > 162 G surface antigen BASE COUNT 55 a 23 c 25 g 59 t ORIGIN 1 tgaattttaa tacttttaat gaataataaa ttcatcatat tctcattgtt gcttgcttta 61 gtagcaagtc aaacatacag tttaacatca tgcacatgtg cataattgtt atcagaagga 121 gattgcatca aaaatgtttc acttggatgt tcatgggata ca // LOCUS PARGANTI2 798 bp ds-DNA INV 12-JUL-1990 DEFINITION P.primaurelia G surface antigen gene, partial cds. ACCESSION M11193 KEYWORDS G surface antigen; surface antigen. SEGMENT 2 of 2 SOURCE P.primaurelia macronucleus DNA. ORGANISM Paramecium primaurelia Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; Hymenostomata; Hymenostomatida; Tetrahymenina; Glaucomidae. REFERENCE 1 (bases 1 to 798) AUTHORS Meyer,E., Caron,F. and Baroin,A. TITLE Macronuclear structure of the G surface antigen gene of Paramecium primaurelia and direct expression of its repeated epitopes in Escherichia coli JOURNAL Mol. Cell. Biol. 5, 2414-2422 (1985) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by F.Caron, 24-OCT-1985. 
FEATURES from to/span description pept < 1 > 798 G surface antigen (AA at 1) rpt 1 222 direct repeat 1 rpt 223 444 direct repeat 2 rpt 445 666 direct repeat 3 rpt 667 > 798 direct repeat 4 BASE COUNT 262 a 153 c 177 g 206 t ORIGIN 1 tgtgcttcaa ttactggaac aggattaacc actgctattt gtggaactta tgatgcaggt 61 tgtgtggcaa atgttaacgg aacagcttgt taagaaaaat tagcaacatg tgatttgtat 121 ttaactcaaa actcttgttc tacctcggca gctgcagcaa cagcagataa atgtgcatgg 181 agtggaaccg cttgccttgc agttacaact gttggtaccc attgtgctta tgttactgga 241 actggactta ctgatttaat atgtgcagca tataatgcaa attgtacagc taataaagct 301 ggaacagcat gtcaggagaa aaaggctact tgcaatttat acacaacaga agccacctgt 361 tcaacatcag cagctgcagc aacagcagat aaatgcgcat ggagtggagc agcttgcctt 421 gcagtaacaa ctgttgctac agagtgtgct tatgttactg gaactggact tactgattta 481 atatgtgcag catataatgc aaattgtaca gctaataaag ctggaacagc atgtcaggag 541 aaaaaggcta cttgcaattt atacacaaca gaagccacct gttcaacatc agcagctgca 601 gcaacagcag ataaatgcgc atggagtgga gcagcttgcc ttgcagtaac aactgttgct 661 acagagtgtg cttatgttac tggaactgga ctaacaaatg caatatgtgc agcatataat 721 gcaaattgta cagctaataa agctggaaca gcatgtcagg agaaaaaggc tacttgcaat 781 ttatacacaa cagaagcc // LOCUS BOVCASA 1123 bp ss-mRNA MAM 12-JUL-1990 DEFINITION Bovine alpha-s1-casein mRNA, complete cds. ACCESSION M33123 KEYWORDS alpha-s1-casein. SOURCE Bovine (strain Holstein) lactating mammary gland, cDNA to mRNA, clone p-alpha-s1 C228. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 1123) AUTHORS Nagao,M., Maki,M., Sasaki,R. and Chiba,R. TITLE Isolation and sequence analysis of bovine alpha-s1-casein cDNA clone JOURNAL Agric. Biol. Chem. 
48, 1663-1667 (1984) STANDARD simple staff_review FEATURES from to/span description pept 64 708 alpha-s1-casein precursor sigp 64 108 alpha-s1-casein signal peptide matp 109 705 alpha-s1-casein mRNA < 1 1123 alpha-s1-casein mRNA BASE COUNT 331 a 247 c 223 g 322 t ORIGIN 1 tcacttcgac catcaaccca gcttgctgtt cttcccagtc ttgggttcaa gatcttgaca 61 accatgaaac ttctcatcct tacctgtctt gtggctgttg ctcttgccag gcccaaacat 121 cctatcaagc accaaggact ccctcaagaa gtcctcaatg aaaatttact caggtttttt 181 gtggcacctt ttccagaagt gtttggaaag gagaaggtca atgaactgag caaggatatt 241 gggagtgaat caactgagga tcaagccatg gaagatatta agcaaatgga agctgaaagc 301 atttcgtcaa gtgaggaaat tgttcccaat agtgttgagc agaagcacat tcaaaaggaa 361 gatgtgccct ctgagcgtta cctgggttat ctggaacagc ttctcagact gaaaaaatac 421 aaagtacccc agctggaaat tgttcccaat agtgctgagg aacgacttca cagtatgaaa 481 gagggaatcc atgcccaaca gaaagaacct atgataggag tgaatcagga actggcctac 541 ttctaccctg agcttttcag acaattctac cagctggatg cctatccatc tggtgcttgg 601 tattacgttc cactaggcac acaatacact gatgccccat cattctctga catccctaat 661 cccattggct ctgagaacag tgaaaagact actatgccac tgtggtgaag agtcaagtga 721 attctgaggg actccacagt tatggtcttt gatgggtctg aaaattccat gctctacatg 781 tcgcctcatc tacatgtcaa accattcatc caaaggcttc aactgctgtt ttagaacagg 841 gcaatctcaa actgaggcac tccttgatgc tctactgtat tttagatagt gtaacatcct 901 taagtgaaat tgtcctaaca gcttgttacc taaattccag tagtatcatg ctggtataaa 961 ggccactgag tcaaagggaa ttaaagtctt cattaaattt ctgtatggaa aatgttttaa 1021 aagcctttga atcacttctc ctgtaagtgc catcatatca aataattgtg tgcattaact 1081 gagattttgt ctttcttctt ttcaataaat tacattttaa ggc // LOCUS BPHINTXIS 1741 bp ds-DNA PHG 12-JUL-1990 DEFINITION Bacteriophage phi-11 integrase (int) and excisionase (xis) genes, complete cds. ACCESSION M34832 KEYWORDS excisionase; integrase. SOURCE Bacteriophage phi-11 DNA. ORGANISM Bacteriophage phi-11 Viridae; Nonclassified viruses. REFERENCE 1 (bases 1 to 1741) AUTHORS Ye,Z.-H., Buranen,S.L. and Lee,C.Y. 
TITLE Sequence analysis and comparison of int and xis genes from Staphylococcal bacteriophages l54a and phi-11 JOURNAL J. Bacteriol. 172, 2568-2575 (1990) STANDARD simple staff_review FEATURES from to/span description pept 267 67 (c) excisionase (xis) pept 379 1425 integrase (int) BASE COUNT 650 a 264 c 263 g 564 t ORIGIN 1 cctatgccag caccagtgaa actctattat gcatggtatt aaaatcgaag agtacaattc 61 gataattcaa acattatttg acgaaatagc taagctgtct aatgtatata agtctcttaa 121 taaacagtaa gcaaaatcgg attcttcatt acataccgaa tattcatcat aaacactgac 181 tgcatcttct aagacatttt ttaaaattct aatgtcttca ttcgttaaaa ctaattcatt 241 gaaattatga ttgtttttaa atgtcataac atcacctact ttttatttta ttatatcaca 301 tttagtacct agtactaaat ttcgggtagc ccgcctaccc ttattatttt ttgccaattt 361 tgaggaggga gaagcaaaat gccagtatat aaggatgata atacaggtaa atggtatttt 421 tccattagat ataaagatgt atacggtaat aacaaacgaa aaatgaagcg tgggtttgaa 481 cgtaagaaag atgccaaact agctgaaagc gaatttatac aaaatgttaa atatggatac 541 tcggacaatc aaccctttga atatatattt tttgatcgtt taaaaaatga aaatctttct 601 gcacgctcaa tagaaaagcg aactacagaa tataatactc acataaaaga aaggttcgga 661 aatatcccta ttggcaaaat cactactacg caatgtactg ctttcaggaa ttatttgtta 721 aacgatgcag gtctttctgt tgactatgca cgatctgtgt gggcaggttt taaagcagtt 781 atcaattacg ccaaaaagca ttacaagctc ttatacgacc ccacattatc ggtaactcct 841 attcccagaa caaaaccaca agctaaattt atcactcgtg aagaatttga tgaaaaagta 901 gaacaaatca caaatgatac ttctcgtcag ctaactagac tgttatttta ttctggtctt 961 agaataggag aagctttagc tttgcagtgg aaagattacg ataaaataaa aggcgaaatt 1021 gacgtaaata agaaaatcaa tttaagtaat agaaaaattg aatataatct aaaaaaagaa 1081 agctctaaag ggataatacc tgtaccaaat ttaattagag agatgcttaa aaacatgtat 1141 aatgaatctt ctaaaagata taaatatttt gacgaaaact attttatatt cgggggttta 1201 gaacctatta gatacgttac ttattcgtat cattttaaat ctgtattccc gaatctaaaa 1261 atacaccatt taagacactc gtacgctagc tatttaatta ataatggtgt agatatgtat 1321 ttattaatgg aattaatgag gcattctaac attacagaaa caattcaaac gtactctcat 1381 ttatatactg ataaaaaaca tcaagctatg agcatatttg attaaacggt 
atcaaattgg 1441 tatcaaataa caattaagga gtttataaaa tgcgtaataa caagcctaaa ataagtattc 1501 aaaacgaccc atgggaagtg aaatttatat acatttaaat ttcatgagac aataaacgtt 1561 gatttaatgc gtttttttgc cttttttatt ttccttattt tttctgtttt acaacaaaat 1621 ggtatcaaaa atggtatcat ttgtagttat tttagcttca catattaaaa caaccacact 1681 cctaaattaa taggtggtgt ggttttgttg gttgtgtggg gataaaaata accgcatcag 1741 t // LOCUS BSTNPRAS 3510 bp ds-DNA BCT 12-JUL-1990 DEFINITION B.stearothermophilus neutral protease (nprS), and transcriptional activator (nprA) genes, complete cds. ACCESSION M34237 KEYWORDS neutral protease; transcriptional activator. SOURCE B.stearothermophilus (strain TELNE) DNA, clone pSP53. ORGANISM Bacillus stearothermophilus Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 3510) AUTHORS Nishiya,Y. and Imanaka,T. TITLE Cloning and nucleotide sequences of the neutral protease gene and its transcriptional activator gene from Bacillus stearothermophilus JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by Y.Nishiya, 11-MAY-1990. 
FEATURES from to/span description pept 181 1401 transcriptional activator (nprA) pept 1750 3405 neutral protease (nprS) precursor sigp 1570 2274 neutral protease (nprS) signal and propeptide matp 2275 3219 neutral protease (nprS) BASE COUNT 1177 a 581 c 737 g 1015 t ORIGIN 1 tacggtcttc agacatttct attcctatag cccaaatgag tagttccttt tggaggagaa 61 aatgtgtata atttttagta aatttatatt agtaaaaaat taagaaggag taggtattat 121 ttgaagattg gtgatcgctt aaaattttcc cgtatcaaac ataagttaac gcaagaggaa 181 gtggctgacg gaattatttc cgtatcatat ttatcaaaaa ttgaaaacaa tcaagtggtt 241 ccaagtgaag aagtgcttcg cctcctttgt caacggttgg gaatcaacaa tatcctgaaa 301 aatagacaag atgaattaac aagtaaattg ttattatggt acaaaacgat tacggataaa 361 aaccgacagg aagcagcccg gatgtacgag gaaatcaaac gaactttcga tgacgtccag 421 ggggcggaat ccatcgctta ctttctgttg tttgaaatgc gctatcactt gttattaaaa 481 gatattcata ctgtcgaagc gttgttgatc aaattaaggg aattgtatga cacctttgat 541 gatgtgatga agtattatta ttataaattt ttaggtctac tttactattg caaggaaaaa 601 tatgaagatg ctttggaata ttataaaaag gcggagcagc gatttcgaag ccaatcattt 661 gaaaaatggg aagaagctga tttgcattat ttactagcgc ttgtttatag ccggctctgg 721 agaatattag gctgtattaa ctatgcgcag catgctttag cgatttacca atccgaatac 781 gatttaaagc gaagcgctga atgccacatt ttacttggta tttgttacag aaggtacgga 841 gaagtagatc aagcgatcga atgctattca ttggcccata aaattgccca aatcattaat 901 gataccgaat tattaggtac gattgagcat aacctaggct acttaatgtc aatgaaacat 961 gagcattatg aagccattca gcattataag aagagtttgc tgtataagcg aaactcttca 1021 ttacaagcta gatttattac gttgttttct ctcatcaaag aatattatgt ttccaaaaac 1081 tataaaaaag cattagccaa tgtagaggaa agtttgcagc ttctcaagag ggaaaaagat 1141 gggatgacaa cgtattatga atattatctt catttcacag tttatcaata tttactatca 1201 gaagatattt cggaaaatga atttgaaaca tttatgaaag atcgagtgct cccttatttt 1261 caaaggttta aaaaatatga agatgttgca caatacgctg aatacttggc aatctattac 1321 gagaaacgtc ataagtataa actagcaagc aaattctata aaatgagtta tcaatttcta 1381 aaaaatatga taaatattta ggagggattt ttttgaaaaa gcttttatta ggaatcatga 1441 cgtttggtat tatgagttta cttgttctca 
ttggtagtga ccaagaacca aaatatgtgg 1501 caaaagacga acatccgcct ccaaccatca tcattgcagc gaaagatgaa catccaccag 1561 caacgattat ttgaagagga ataagcaaaa agacagctag ttttctagct gtcttttttc 1621 atgcatagga aaatgtgaaa aaaacgtagg gaattatcaa ctatatcaga ctctattttt 1681 cccaatacaa aatactgtaa aatattgtgt ttaatattct aaatacaaag aataaaggag 1741 gatgaaaaaa tgaaaaggaa aatgaaaatg aaattagtac gttttggtct tgcagcagga 1801 ctagcggccc aagtattttt tttaccttac aatgcgctgg cttcaacgga acacgttaca 1861 tggaaccaac aatttcaaac ccctcaattc atctccggtg atctgctgaa agtgaatggc 1921 acatccccag aagaactcgt ctatcaatat gttgaaaaaa acgaaaacaa gtttaaattt 1981 catgaaaacg ctaaggatac tctacaattg aaagaaaaga aaaatgataa ccttggtttt 2041 acgtttatgc gcttccaaca aacgtataaa gggattcctg tgtttggagc agtagtaact 2101 gcgcacgtga aagatggcac gctgacggcg ctatcaggga cactgattcc gaatttggac 2161 acgaaaggat ccttaaaaag cgggaagaaa ttgagtgaga aacaagcgcg tgacattgct 2221 gaaaaagatt tagtggcaaa tgtaacaaag gaagtaccgg aatatgaaca gggaaaagac 2281 accgagtttg ttgtttatgt caatggggac gaggcttctt tagcgtacgt tgtcaattta 2341 aactttttaa ctcctgaacc aggaaactgg ctgtatatca ttgatgccgt agacggaaaa 2401 attttaaata aatttaacca acttgacgcc gcaaaaccag gtgatgtgaa gtcgataaca 2461 ggaacatcaa ctgtcggagt gggaagagga gtacttggtg atcaaaaaaa tattaataca 2521 acctactcta cgtactacta tttacaagat aatacgcgtg gaaatgggat tttcacgtat 2581 gatgcgaaat accgtacgac attgccggga agcttatggg cagatgcaga taaccaattt 2641 tttgcgagct atgatgctcc agcggttgat gctcattatt acgctggtgt gacatatgac 2701 tactataaaa atgttcataa ccgtctcagt tacgacggaa ataatgcagc tattagatca 2761 tccgttcatt atagccaagg ctataataac gcattttgga acggttcgca aatggtgtat 2821 ggcgatggtg atggtcaaac atttattcca ctttctggtg gtattgatgt ggtcgcacat 2881 gagttaacgc atgcggtaac cgattataca gccggactca tttatcaaaa cgaatctggt 2941 gcaattaatg aggcaatatc tgatattttt ggaacgttag tcgaatttta cgctaacaaa 3001 aatccagatt gggaaattgg agaggatgtg tatacacctg gtatttcagg ggattcgctc 3061 cgttcgatgt ccgatccggc aaagtatggt gatccagatc actattcaaa gcgctataca 3121 ggcacgcaag ataatggcgg ggttcatatc aatagcggaa 
ttatcaacaa agccgcttat 3181 ttgattagcc aaggcggtac gcattacggt gtgagtgttg tcggaatcgg acgcgataaa 3241 ttggggaaaa ttttctatcg tgcattaacg caatatttaa caccaacgtc caactttagc 3301 caacttcgtg ctgccgctgt tcaatcagcc actgacttgt acggttcgac aagccaggaa 3361 gtcgcttctg tgaagcaggc ctttgatgcg gtaggggtga aataaagtgg tatctcatca 3421 gtgggggatt ttttcctcca ctgatgtttt gtttgtgatc ttttaatgat gtattggggt 3481 gcaaaatgcc caaaggctta taatgttgat // LOCUS HSEGP14 3347 bp ds-DNA VRL 12-JUL-1990 DEFINITION Equine herpesvirus type 1 glycoprotein 14 (gp14) gene, complete cds. ACCESSION M34861 KEYWORDS glycoprotein 14. SOURCE Equine herpesvirus type 1 DNA. ORGANISM Equine herpesvirus type 1 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 3347) AUTHORS Guo,P. TITLE Characterization of the gene and an antigenic determinant of equine herpesvirus type-1 glycoprotein 14 with homology to gB-equivalent glycoproteins of other herpesviruses JOURNAL Gene 87, 249-255 (1990) STANDARD simple staff_review FEATURES from to/span description pept 300 3239 glycoprotein 14 (gp14) BASE COUNT 885 a 891 c 851 g 720 t ORIGIN 1 tacaacggtt gaaacgtggt gtacgcatct caagagacta gctcgtttat gataactgcg 61 gctaaaggtg aattggtcaa ttagcgaagt ttcaaaggtt ttattgcttt gaagggagtg 121 acaggtgtga cggccacgca gcggctggcg tgaaatatat cggggagctc atcctagccg 181 ccgcagtatt ctcctcggtt ttccactgtg gagaggtgcc tcctgcgcgc agatcgtacc 241 tacccggact ccgcgccaca gtgctgcgtg agcggcattt acataaccta cgaggcgtca 301 tgtcctctgg ttgccgttct gtcggcggct ccacatgggg caattggcgc ggagacggtg 361 gtgatttacg acagcgacgt gttctctctc ctgtatgcag tgctccagca gctggctcct 421 ggatcgggag ccaactaggc aatgttggaa acttactcgc caccccccac ccgctgggaa 481 agccggcatc atcgagggtg ggcacaatag ttctagcctg tttgttgctt tttggaagct 541 gtgttgttag agccgtaccc accacgccaa gccccccaac tagtactccc acttccatgt 601 caacgcactc ccatgggaca gtagacccta cgctgctccc cacagaaacg cccgacccac 661 tcagactggc tgtgcgcgag tccggtatac tcgctgagga tggagacttt tacacctgcc 721 caccgcctac cggatccacc gtcgtacgca 
tcgaaccacc tagaacttgc cccaagtttg 781 accttgggag aaacttcacg gaggggattg ctgttatttt taaggaaaac atcgctccct 841 acaaattcag ggcaaacgta tactacaagg acatcgttgt aacacgtgtg tggaaaggat 901 acagccatac gtccctgtcc gacagataca atgacagggt tccggtttcg gtggaggaga 961 tcttcggtct catcgacagt aagggaaaat gttcgtcaaa ggccgagtac ctcagagata 1021 acatcatgca ccacgcgtac cacgacgacg aggacgaggt ggagcttgat ttggtgccgt 1081 ccaagtttgc aactccgggg gccagagcct ggcagaccac caacgatact acgtcttacg 1141 tggggtggat gccatggagg cactacacgt caacgtctgt caactgcatc gtcgaggagg 1201 tggaggcgcg gtccgtctac ccctacgact ccttcgccct gtccaccggt gatattgtgt 1261 acgcgtctcc gttttacggc ctgagggctg ccgctcgcat agagcacaat agctacgcgc 1321 aggagcgttt caggcaagtt gaagggtaca ggccccgcga cttagacagt aaactacaag 1381 ccgaagagcc ggttaccaaa aattttatca ctaccccgca tgtcaccgtc agctggaact 1441 ggaccgagaa gaaagtcgag gcgtgtacgc tgaccaaatg gaaagaggtc gacgaactcg 1501 tcagggacga gttccgcggg tcctacagat ttactattcg atccatctcg tcttacttta 1561 tcagtaacac tactcaattt aagttggaaa gtgcccccct tactgaatgt gtatccaaag 1621 aagcaaagga agccatagac tcgatataca aaaagcagta cgagtctacg cacgtcttta 1681 gcggtgatgt ggaatattac ctggcacgcg gggggttctt aattgcattc agacctatgc 1741 tctccaacga actcgccagg ctgtacctga acgagcttgt gagatctaac cgcacctacg 1801 acctaaaaaa tctattgaac cccaatgcaa acaataacaa taacaccacg cgaagacgca 1861 ggtctctcct gtcagtacca gaacctcagc caacccaaga tggtgtgcat agagaacaaa 1921 ttctacatcg cttgcacaaa cgagcagtgg aggcaacggc aggtaccgat tcttccaacg 1981 tcaccgccaa acagctggag ctcatcaaaa ccacgtcgtc tatcgagttt gccatgctac 2041 agtttgcata cgatcacatc caatcccacg tcaatgaaat gctaagtaga atagcaactg 2101 cgtggtgtcc cctccaaaac aaagagcggc ccctatggaa cgaaatggtg aagattaccc 2161 cgagcgccat agtctccgca acccttgacg agcgagttgc agcgagggtc ctgggggacg 2221 tgatagctat aacgcactgc gccaaaatag agggcaacgt gtacttgcaa aactccatgc 2281 gctcgatgga cagtaacact tgctactccc gcccccccgt aacatttaca attactaaga 2341 atgcaaacaa cagagggtcg atagaaggcc agctgggaga ggagaacgag attttcacgg 2401 agcgcaagct gatcgagccg tgcgccctca atcagaagcg 
ctactttaag tttggcaaag 2461 agtacgttta ctacgagaac tacacgttcg tccgcaaagt gccccccacg gaaatcgagg 2521 ttatcagcac gtacgttgaa ctaaacttga cccttttgga agaccgcgag tttctgcccc 2581 tggaggtgta cacgcgggct gagctggagg acaccggcct gctagactac agcgaaatac 2641 agcgccgcaa ccagctccac gctctcaggt tttacgacat cgacagcgtg gtcaacgtgg 2701 acaataccgc agtgattatc aggggatcgc cagctttttc aagggcctgg gtaaagtggg 2761 ggaggccgtg ggaacgctcg ttctcggcgc gcggcgctgt tgtttcaacc gtatctggaa 2821 tagcttgctt tttaaacaac ccatttgggg ggctagccat cggcctgctg gtaatcgccg 2881 gcctggtagc tgcgtttttt gcttacagat atgtaatgca gatccgcagt aaccccatga 2941 aagctctata ccccataaca acaaaggcct tgaaaaacaa agccaaaact tcctacggcc 3001 agaacgagga ggacgatggg agcgactttg atgaggccaa gcttgaagag gctcgcgaaa 3061 tgatcaaata catgtctatg gtttcggccc tggaaaagca ggaaaagaaa gctataaaga 3121 aaaacagtgg ggttggcctg atcgccagta acgtctcaaa gctggccctg cgaaggcgcg 3181 gtcccaaata tacccgactc caacagaacg ataccatgga aaatgaaaaa atggtttaaa 3241 catgtttaat aaatattatg acacgtactc aaagtgtgac ctcatatttg cataaccact 3301 tctagttccg gcccaaggat atttaagcct agtatctccg ccgaagg // LOCUS HUMHBGAA 1227 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human A-gamma-globin gene, 3' end. ACCESSION M33200 KEYWORDS A-gamma-globin. SOURCE Human (hereditary persistence of fetal hemoglobin individual II-1) DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1227) AUTHORS Gelinas,R.E., Rixon,M., Magis,W. and Stamatoyannopoulos,G. 
TITLE Gamma gene promoter and enhancer structure in Seattle variant of hereditary persistence of fetal hemoglobin JOURNAL Blood 71, 1108-1112 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 3 A-gamma-globin (AA at 1) /hgml_locus_uid="LK0092S" /nomgen="HBG1" /map="11p15.5" mut 794 794 t in wt; c in mutant mut 970 970 c in wt; a in mutant mut 1186 1186 a in wt; g in mutant BASE COUNT 366 a 204 c 294 g 363 t ORIGIN 1 tgagcctctt gcccatgatt cagagctttc aaggataggc tttattctgc aagcaataca 61 aataataaat ctattctgct gagagatcac acatgatttt cttcagctct tttttttaca 121 tctttttaaa tatatgagcc acaaagggtt tatattgagg gaagtgtgta tgtgtatttc 181 tgcatgcctg tttgtgtttg tggtgtgtgc atgctcctca tttattttta tatgagatgt 241 gcattttgtt gagcaaataa aagcagtaaa gacacttgta cacgggagtt ctgcaagtgg 301 gagtaaatgg tgtaggagaa atccggtggg aagaaagacc tctataggac aggacttctc 361 agaaacagat gttttggaag agatgggaaa aggttcagtg aagacctggg ggctggattg 421 attgcagctg agtagcaagg atggttctta atgaagggaa agtgttccaa gctttaggaa 481 ttcaaggttt agtcaggtgt agcaattcta ttttattagg aggaatacta tttctaatgg 541 cacttagctt ttcacagccc ttgtggatgc ctaagaaagt gaaattaatc ccatgccctc 601 aagtgtgcag attggtcaca gcatttcaag ggagagacct cattgtaaga ctctggggga 661 ggtggggact taggtgtaag aaatgaatca gcagaggctc acaagtcagc atgagcatgt 721 tatgtctgag aaacagacca gcactgtgag atcaaaatgt agtgggaaga atttgtacaa 781 cattaattgg aaggtttact taatggaatt tttgtatagt tggatgttag tgcatctcta 841 taagtaagag tttaatatga tggtgttacg gacctaatgt ttgtgtctcc tcaaaattca 901 catgctgaat ccccaactcc caactgacct tatctgtggg ggaggctttt gaaaagtaat 961 taggtttagc tgagctcata agagcagatc cccatcataa aattattttc cttatcagaa 1021 gcagagagac aagccatttc tctttcctcc cggtgaggac acagtgagaa gtccgccatc 1081 tgcaatccag gaagagaacc ctgaccacga gtcagccttc agaaatgtga gaaaaaactc 1141 tgttgttgaa gccacccagt cttttgtatt ttgttatagc accttacact gagtaaggca 1201 gatgaagaag gagaaaaaaa taagctt // LOCUS HUMHBQ1A 1114 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human theta-1-globin gene, complete cds. 
ACCESSION M33022 KEYWORDS theta-1-globin. SOURCE Human black female with alpha-thal-2 heterozygosity white cell DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1114) AUTHORS Gonzalez-Redondo,J.M., Han,I.S., Gu,Y.-C. and Huisman,T.H.J. TITLE Nucleotide sequence of the human theta-1-globin gene JOURNAL Biochem. Genet. 26, 207-211 (1988) STANDARD simple staff_review FEATURES from to/span description pept 359 453 theta-1-globin, exon 1 /hgml_locus_uid="LV0155X" /nomgen="HBQ1" /map="16p13.3" 538 742 theta-1-globin, exon 2 852 980 theta-1-globin, exon 3 IVS 454 537 theta-1-globin intron A IVS 743 851 theta-1-globin intron B BASE COUNT 166 a 386 c 393 g 169 t ORIGIN 1 atcccagtta ctcgggaggc tgaggcagga gaatcgtttg aacccgggag gcggaggttg 61 cagtgagccg gaatggcgcc actgcactca ccgcacccgg ccaatttttg tgtttttagt 121 agagactaaa taccatatag tgaacaccta agacgggggg ccttggatcc agggcgattc 181 agagggcccc ggtcggagct gtcggagatt gagcgcgcgc ggtcccggga tctccgacga 241 ggccctggac ccccgggcgg cgaagctgcg gcgcggcgcc ccctggaggc cgcgggaccc 301 ctggccggtc cgcgcaggcg cagcggggtc gcagggcgcg gcgggttcca gcggggggat 361 ggcgctgtcc gcggaggacc gggcgctggt gcgcgccctg tggaagaagc tgggcagcaa 421 cgtcggcgtc tacacgacag aggccctgga aaggtgcggc aggctgggcg cccccgcccc 481 caggggccct ccctccccaa gccccccgga cgcgcctcac ccacgttcct ctcgcaggac 541 cttcctggct ttccccgcca cgaagaccta cttctcccac ctggacctga gccccggctc 601 ctcacaagtc agagcccacg gccagaaggt ggcggacgcg ctgagcctcg ccgtggagcg 661 cctggacgac ctaccccacg cgctgtccgc gctgagccac ctgcacgcgt gccagctgcg 721 agtggacccg gccagcttcc aggtgagcgg ctgccgtgct gggcccctgt ccccgggagg 781 gccccggcgg ggtgggtgcg gggggcgtgc ggggcgggtg caggcgagtg agccttgagc 841 gctcgccgca gctcctgggc cactgcctgc tggtaaccct cgcccggcac taccccggag 901 acttcagccc cgcgctgcag gcgtcgctgg acaagttcct gagccacgtt atctcggcgc 961 tggtttccga gtaccgctga actgtgggtg ggtggccgcg ggatccccag gcgaccttcc 1021 ccgtgtttga 
gtaaagcctc tcccaggagc agccttcttg ccgtgctctc tcgaggtcag 1081 gacgcgagag gaaggcgccg cccctcccca agga // LOCUS HUMITIH1A 1149 bp ss-mRNA PRI 12-JUL-1990 DEFINITION Human inter-alpha-trypsin inhibitor heavy chain mRNA, partial cds. ACCESSION M33033 KEYWORDS inter-alpha-trypsin inhibitor heavy chain. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1149) AUTHORS Salier,J.-P., Diarra-Mehrpour,M., Sesbouee,R., Bourguignon,J. and Martin,J.-P. TITLE Human inter-alpha-trypsin inhibitor: Isolation and characterization of heavy (H) chain cDNA clones coding for a 383 amino-acid sequence of the H chain JOURNAL Biol. Chem. Hoppe-Seyler 369, 15-18 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 1149 inter-alpha-trypsin inhibitor heavy chain (AA at 1) /hgml_locus_uid="LE0221G" /nomgen="ITIH1" /map="3p21.2-p21.1" BASE COUNT 332 a 292 c 253 g 272 t ORIGIN 1 ggaggcacaa acatcaacga agcactccta cgggcaatct tcattttgaa tgaagccaat 61 aacttgggac tgttagaccc caactccgtc tcgctgatca ttttggtttc tgatggagat 121 ccaacagtgg gcgaactaaa actgtcaaaa attcagaaaa acgttaagga gaacatccaa 181 gacaatatct ccttgttcag tttgggcatg ggatttgatg tggactatga ttttttgaag 241 agactgtcca atgaaaacca tggaattgca caaaggattt atggaaacca ggacacgtct 301 tcccagctta agaaattcta caaccaggtc tccactccat tgctccggaa tgttcagttc 361 aactatcccc atacatcagt cacggacgtc actcaaaaca atttccataa ctactttgga 421 ggctcagaga ttgtggtggc aggaaaattt gaccctgcta aattggatca aatagagagc 481 gttatcacgg cgacttcggc taacacgcag ttagtcttgg agaccctggc ccagatggac 541 gacttgcagg attttctatc gaaagacaag catgcagatc ccgatttcac caggaaactg 601 tgggcctatc taaccatcaa ccaactgcta gctgaacgaa gcctggctcc tacagctgcc 661 gccaagagaa gaattacaag atcgatcctg cagatgtctc tagaccacca cattgtgact 721 ccgctgacct cgctggtgat cgagaacgag gctggggatg agcgcatgct ggcggatgcc 781 ccaccgcagg atccctcctg ctgctcaggg gccctgtatt acggcagcaa 
agtggttcca 841 gattccaccc cgtcttgggc caatccttca gcaacgcccg tgatctccat gctggcacaa 901 ggatctcagg tgctagagtc cacgccaccc ccacatgtga tgagagttga aaatgaccca 961 cattccatca tttatctacc aaaaagccaa aagaacattt gtttcaatat tgactcagaa 1021 cctggaaaaa tcctcgacct ggcttctgac ccagaatcag gaattgtagt caacggtcag 1081 cttgttggtg ccaagaagcc caacaatgga aaactaagca cctattttgg aaaactggga 1141 ttttatttc // LOCUS HUMPTHROM 327 bp ss-mRNA PRI 12-JUL-1990 DEFINITION Human thrombin mRNA, 5' end. ACCESSION M33031 KEYWORDS serine protease; thrombin. SOURCE Human, cDNA to mRNA, clone pIIH13. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 327) AUTHORS MacGillivray,R.T.A., Irwin,D.M., Guinto,E.R. and Stone,J.C. TITLE Recombinant genetic approaches to functional mapping of thrombin JOURNAL Ann. N.Y. Acad. Sci. 485, 73-79 (1986) STANDARD simple staff_review FEATURES from to/span description pept 28 > 327 thrombin precursor /hgml_locus_uid="LD0134L" /nomgen="F2" /map="11p11-q12" sigp 28 156 thrombin signal peptide matp 157 > 327 prothrombin BASE COUNT 60 a 97 c 109 g 61 t ORIGIN 1 ccgtagtgac ccaggagctg acacactatg gcccgcatcc gaggcttgca gctgcctggc 61 tgcctggccc tggctgccct gtgtagcctt gtgcacagcc agcatgtgtt cctggctcct 121 cagcaagcac ggtcgctgct ccagcgggtc cggcgagcca acaccttctt ggaggaggtg 181 cgcaagggca acctggagcg agagtgcgtg gaggagacgt gcagctacga ggaggccttc 241 gaggctctgg agtcctccac ggctacggat gtgttctggg ccaagtacac agcttgtgag 301 acagcgagga cgcctcgaga taagctt // LOCUS MUSCC3A 312 bp ss-mRNA ROD 12-JUL-1990 DEFINITION Mouse complement component C3 mRNA, partial cds. ACCESSION M33032 KEYWORDS complement component C3. SOURCE Mouse liver, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 312) AUTHORS Fey,G.H., Wiebauer,K. and Domdey,H.
TITLE Amino acid sequences of mouse complement C3 derived from nucleotide sequences of cloned cDNA JOURNAL Ann. N.Y. Acad. Sci. 421, 307-312 (1983) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 312 complement component C3 precursor (AA at 1) matp < 1 27 complement component C3-beta subunit (AA at 1) matp 40 273 complement component C3-alpha subunit matp 274 > 312 complement component C3-alpha' subunit BASE COUNT 90 a 77 c 90 g 55 t ORIGIN 1 gatcttgagt gcaccaagcc agcagcccgc cgccgtcgct cagtacagtt gatggaaaga 61 aggatggaca aagctggtca gtacactgac aagggtcttc ggaagtgttg tgaggatggt 121 atgcgggata tccctatgag atacagctgc cagcgccggg cacgcctcat cacccagggc 181 gagaactgca taaaggcctt catagactgc tgcaaccaca tcaccaagct gcgtgaacaa 241 cacagaagag accacgtgct gggcctggcc aggagtgaat tggaggaaga cataattcca 301 gaagaagata tt // LOCUS MUSN038A 1260 bp ss-mRNA ROD 12-JUL-1990 DEFINITION Mouse nucleolar protein N038 mRNA, complete cds. ACCESSION M33212 KEYWORDS nucleolar protein N038. SOURCE Mouse teratocarcinoma stem cell line F9, cDNA to mRNA, clone lambda-FML-185.19. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1260) AUTHORS Schmidt-Zachmann,M.S. and Franke,W.W. 
TITLE DNA cloning and amino acid sequence determination of a major constituent protein of mammalian nucleoli: Correspondence of the nucleoplasmin-related protein N038 to mammalian protein B23 JOURNAL Chromosoma 96, 417-426 (1988) STANDARD simple staff_review FEATURES from to/span description pept 79 957 nucleolar protein N038 mRNA < 1 1260 nucleolar protein N038 mRNA BASE COUNT 419 a 214 c 306 g 321 t ORIGIN 1 ggcgcgtctg ttctgtggaa caggaggcag ttgttttccg tccggcttct cccacaccga 61 agtgcgcgcc tccacctcat ggaagactcg atggatatgg acatgagtcc tcttaggcct 121 cagaactacc ttttcggctg tgaactaaag gctgacaaag actatcactt taaagtggat 181 aatgatgaaa atgagcacca gttgtcatta agaacggtca gtttaggagc aggggcaaaa 241 gatgagttac acatcgtaga ggcagaagca atgaactatg aaggcagtcc aattaaagta 301 acactggcaa ctttgaaaat gtctgtacaa ccaacagttt ccctaggggg ctttgaaatt 361 acaccacctg tggtcttacg gttgaagtgt ggttcagggc ctgtgcacat tagtggacag 421 catctagtag ctgtagagga agatgcagag tctgaagatg aagatgagga ggacgtaaaa 481 ctcttaggca tgtctggaaa gcgatctgct cctggaggtg gtaacaaggt tccacagaaa 541 aaagtaaaac ttgatgaaga tgatgaggac gatgatgagg acgatgagga tgatgaggat 601 gatgatgatg atgattttga tgaagaggaa actgaagaaa aggtcccagt gaagaaatct 661 gtacgagata ccccagccaa aaatgcacaa aaatcaaacc aaaatggaaa agacttaaaa 721 ccatcaacac cgagatcaaa gggtcaagag tccttcaaaa aacaggaaaa gactcctaaa 781 acaccaaaag gacctagttc tgtagaagac attaaggcaa aaatgcaagc aagtatagaa 841 aaaggcggtt ctcttcccaa agtggaagcc aagttcatta attatgtgaa gaattgtttc 901 cggatgactg accaggaggc tattcaagat ctctggcagt ggaggaaatc tctttaagaa 961 aagggtttaa acagtttgaa atattctgtc ttcatttctg taatagttaa tatctggctg 1021 tcctttttat aatgcaaagt gagaactttc cctactgtgt ttgataaatg ttgtccaggt 1081 tcacttgcca agaatgtgtt gtctaaaatg cctgtttagt tttcaaggat ggaactccac 1141 cctttacttg gttttaagta tgtatggaat gttatgatag gacatagtaa tagtggtcag 1201 atgtggaaat ggtagggaga caaatataca tgtgaaataa actcagtatt ttaataaagt // LOCUS RATPOS 1804 bp ss-mRNA ROD 12-JUL-1990 DEFINITION Rat type-2A protein phosphatase catalytic subunit mRNA, 
complete cds. ACCESSION M33114 KEYWORDS type-2A protein phosphatase catalytic subunit. SOURCE Rat liver, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1804) AUTHORS Kitagawa,Y., Tahira,T., Ikeda,I., Kikuchi,K., Tsuiki,S., Sugimura,T. and Nagao,M. TITLE Molecular cloning of cDNA for the catalytic subunit of rat liver type 2A protein phosphatase, and detection of high levels of expression of the gene in normal and cancer cells JOURNAL Biochim. Biophys. Acta 951, 123-129 (1988) STANDARD simple staff_review FEATURES from to/span description pept 114 1043 type-2A protein phosphatase catalytic subunit mRNA < 1 1804 type-2A protein phosphatase catalytic subunit mRNA BASE COUNT 482 a 391 c 437 g 494 t ORIGIN 1 ctggggccgc aggaagcacc ccggggagcg gcggcggcgt gtgcgtgtgg cccgggtgcg 61 ggcggcggcg cgggagcagc gcagagcggc agccggttcg ggcgggcggc atcatggacg 121 agaagttgtt caccaaggag ctggaccagt ggatcgagca gctgaacgag tgcaagcagc 181 tctccgagtc ccaggtcaag agcctctgcg agaaggctaa agaaatcctg acaaaagaat 241 ctaatgttca ggaggttcga tgtccagtca ctgtgtgtgg agatgtgcat gggcaatttc 301 atgacctcat ggaactcttt agaattggtg gtaaatcacc agatacaaat tacttgttta 361 tgggagacta tgtggacaga ggatattact cagttgaaac agttacactg cttgtagctc 421 ttaaggttcg ttaccgagag cgtatcacca tactccgagg gaatcacgag agcagacaga 481 tcacacaagt ttatggtttc tacgatgagt gtttaaggaa atacggaaat gcaaatgttt 541 ggaaatactt cacagacctt tttgactacc ttcctctcac tgccttggtg gatgggcaga 601 tcttctgtct acatggtggt ctttcaccat ccatagacac actggatcac atccgagcac 661 ttgatcgcct acaagaagtt cctcatgagg gtccaatgtg tgacttgctg tggtcagatc 721 cagatgaccg tggtggctgg gggatatctc ctcggggagc tggttatacc tttggccaag 781 atatttctga gacatttaat catgccaatg gcctcacgtt ggtgtccaga gctcaccagc 841 tggtgatgga gggatataac tggtgccatg accggaatgt agtaacaatt ttcagtgctc 901 caaactattg ctatcgttgt ggtaaccaag ctgcaatcat ggaacttgat gacactctta 961 agtattcttt cttgcagttc 
gatccagcac ctcgtagagg cgagccacat gtcactcgtc 1021 gtaccccaga ctacttcctg taatgaaagt ttaaccttgt acagtattgc catgaacacc 1081 gtctgttgac ctaatggaat cgggaagagc agcagtaact ccaaagtgtc agaaatagtt 1141 aacattcaaa cttgtttcca cacggaccaa aagatgtgcc atataaaata caaagcctct 1201 tgtcatcaac agccgtgacc actttagaat gaaccagttc attgcatgct gacgcgacat 1261 tgttggtcaa gaatccagtt tctggcatag cgctatttgt agttactttt gctttcttga 1321 gagactgcag atctaggatg taacattaac acctgtgagt ccagttgact tccacttagc 1381 tgtagcttac tcagcatgac tgtagatgag gatagcaaac aatcattgga gcttaatgaa 1441 catttttaaa tgagtaccaa ggcctcccct cttgttgtgt tctttcaggg atactattaa 1501 tttaattgta tgatttctct gcactcagtt tctcccttct caaatctcgg ccccgcgttg 1561 ttctttgtta ctgtcagaaa acctggtgag ttgttttgaa cagaactgtc tccctcctgt 1621 aagatgatgt actgcacaag tcaccgcagt gttttcataa taaacttgag aactgagaaa 1681 gtcaggtttg aattgtatca gtgggcacga ctggtgctgt ttattaaaca agataaatct 1741 attgatcaat ttcagaattt gtagaattcc aggtaaagaa aaataaagat caaggccact 1801 atat // LOCUS RATSCP2 1409 bp ss-mRNA ROD 12-JUL-1990 DEFINITION Rat sterol carrier protein-2 (SCP-2) mRNA, complete cds. ACCESSION M34728 KEYWORDS sterol carrier protein-2. SOURCE Rat liver, cDNA to mRNA, clone SP43. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1409) AUTHORS Billheimer,J.T., Strehl,L.L., Davis,G.L., Strauss,J.F.III. and Davis,L.G. TITLE Characterization of a cDNA encoding rat sterol carrier protein-2 JOURNAL DNA Cell Biol. 9, 159-165 (1990) STANDARD simple staff_review FEATURES from to/span description pept 307 1128 sterol carrier protein-2 (SCP-2) mRNA < 1 1409 sterol carrier protein-2 mRNA BASE COUNT 387 a 300 c 390 g 332 t ORIGIN 2 bp upstream of EcoRI site.
1 ggaattccga acaaaggttg aacactttgc aaaaattgga tggaaaaatc ataaacactc 61 agttaataac ccgtattccc agttccaaga tgaatacagc ttagatgaga taatgaaatc 121 aaggccagtt ttcgattttc tgactgtctt acaatgctgt cccacctcag atggtgccgc 181 agcagcaatt gtgtctagtg aggagtttgt gcagaagcat ggcctgcagt ccaaagctgt 241 ggaaattgtg gcacaggaga tggtgactga catgcccagt acatttgaag aaaaagtgtt 301 attaaaatgg ttggctatga tatgagtaaa gaagctgcca ggaagtgcta tgagaagtcc 361 ggcctgggtc ccagtgatgt cgacgtgata gagcttcacg attgcttctc taccaatgaa 421 ctcctgactt atgaagcact ggggctctgt ccagaaggac aaggtggagc actggtggac 481 agaggggaca acacttacgg aggaaagtgg gtcataaacc ctagtggagg cctcatctcc 541 aagggacacc cactgggtgc cacaggtctg gctcagtgcg cggagctctg ctggcagctg 601 agaggcgaag ccggaaagag gcaggttcct ggggcaaagg tggctctgca gcacaattta 661 ggccttggag gagctgctgt tgtcaccctc tacagaatgg gttttcccga agctgccagc 721 tccttcagaa cgcaccagat ttcagctgct cccaccagct ctgcagggga tggattcaag 781 gcaaatctca tttttaagga aatcgagaag aagcttgaag aggaagggga agagttcgtg 841 aagaaaatcg gtggcatttt tgccttcaaa gtgaaggatg gccccggggg caaagaagct 901 acgtgggtgg tggacgtgaa gaacggcaaa ggatcggtgc ttccggattc agataagaag 961 gctgactgca caatcaccat ggctgactca gacttgctgg ctttgatgac tggtaaaatg 1021 aaccctcagt cggccttctt tcaaggtaaa ctgaaaattg ccggtaacat gggcctggcc 1081 atgaaactgc aaagcctgca gcttcagccg gacaaagcta agctgtgaag agtccctttg 1141 gcaacctcag gacatcaaga tgagatgtgt ggatacgtag aaatccacgt ctccctgtca 1201 ggacttagac tgacacttcc tgaatagcat gagatagatt tcttgctagg tggctatggc 1261 caattgtatt tcccccaagc tgggggtgca aagggcctcc caggctacac tgctgctttg 1321 aggacttgca ttctactgtg cttcatgaag ctactatgtt aatgatggtt tggggtaaac 1381 ttgagtttca gaataaagtt cagaatagt // LOCUS SYNPSBAII 556 bp ds-DNA BCT 12-JUL-1990 DEFINITION Synechococcus sp. photosystem II D1 protein (psbAII) gene, 5' end. ACCESSION M34833 KEYWORDS D1 protein; photosystem II. SOURCE Synechococcus (strain PCC 7942) DNA. ORGANISM Synechococcus sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Chroococcales. 
REFERENCE 1 (bases 1 to 556) AUTHORS Bustos,S.A., Schaefer,M.R. and Golden,S.S. TITLE Different and rapid responses of four cyanobacterial psbA transcripts to changes in light intensity JOURNAL J. Bacteriol. 172, 1998-2004 (1990) STANDARD simple staff_review FEATURES from to/span description pept 81 425 ORF1 pept 527 > 556 photosystem II D1 protein (psbAII) mRNA 59 > 556 psbAII mRNA (alt.) mRNA 478 > 556 psbAII mRNA (alt.) BASE COUNT 136 a 145 c 142 g 133 t ORIGIN 1 ttccgtgacg gctactgcca gcatgccgag cctgatgtgt gacacctaag atcactccag 61 ttctctttgg aaactggctg atgagtgaag acaccatctt tggcaagatc atccggcgcg 121 agattccagc agacattgtt tatgaagatg atctctgtct ggcttttcga gatgtggcac 181 cccaagcgcc ggttcacatt ctggtgattc ccaagcaacc aattgccaac cttttggaag 241 cgacagcaga acatcaagcg ctgctgggtc atttgttgct gactgtaaag gcgatcgcgg 301 cccaagaagg actcaccgag ggctaccgca ccgtgattaa cacgggccct gcgggtgggc 361 aaaccgttta ccacctgcat attcacttac tgggcgggcg atcgctggct tggccgcccg 421 gctgagaaaa gtctgaaagt tctttacaaa actcaatctg cttgttagat tttactcacg 481 aggctattaa gtctcgtaaa tagttcaact aaggactcat cgcaaaatga cgactgcatt 541 gcagcggcgc gagagc // LOCUS ABCAARAA 1624 bp ds-DNA BCT 12-JUL-1990 DEFINITION A.aceti acetic acid resistance protein (aarA) gene, complete cds. ACCESSION M34830 KEYWORDS acetic acid resistance protein. SOURCE A.aceti (strain 10-8) DNA, clone pAR1611. ORGANISM Acetobacter aceti Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic rods and cocci; Azotobacteraceae. REFERENCE 1 (bases 1 to 1624) AUTHORS Fukaya,M., Takemura,H., Okumura,H., Kawamura,Y., Horinouchi,S. and Beppu,T. TITLE Cloning of genes responsible for acetic acid resistance in Acetobacter aceti JOURNAL J. Bacteriol. 172, 2096-2104 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 185 1495 acetic acid resistance protein (aarA) signal 1508 1545 transcription termination signal binding 171 176 ribosomal binding site (put.)
BASE COUNT 400 a 446 c 404 g 374 t ORIGIN 1 gcatgcattt gcacacattc gcgcgaccct aagcccaaaa aactgtggtt ttccaagcat 61 actcctttcc gataacgctt cgtttatcgc tggcaacctt ccggtttcct tttgaatgag 121 tgacaaagtg tgacgagcag gccgcagcag cgaccgtggc ccaaccatgc agaaggaaac 181 actaatgagc gcgtcgcaga aagaaggtaa gctatctacc gctaccattt cggttgatgg 241 aaaatccgcc gaaatgcctg tgctttcagg cactctggga ccggatgtta tcgacatccg 301 caaacttccg gcgcaactgg gcgttttcac gtttgaccca ggttacgggg aaacagcggc 361 ctgcaacagc aaaatcacct ttattgatgg tgataaaggc gttctgctgc accgtggtta 421 ccctattgcg cagctggacg aaaatgcttc ctacgaagaa gttatttatc tgcttttgaa 481 tggcgaactg cccaacaagg tgcagtacga caccttcacc aacaccctta caaaccatac 541 gctgctgcac gagcagatcc gtaacttctt taacggcttc cggcgtgatg cccacccaat 601 ggccattctg tgtggtacgg ttggggcttt gtctgccttc tacccagatg ccaacgatat 661 tgccattccc gccaatcggg atctggccgc catgcggctg attgccaaaa tcccaaccat 721 tgcggcatgg gcttacaaat acacgcaggg tgaagccttt atctacccgc ggaatgatct 781 gaactacgca gaaaacttcc tgtccatgat gttcgcgcgc atgtccgaac cttacaaggt 841 caaccctgtt ctggcccgcg ccatgaaccg gattctgatt ctgcatgccg atcatgagca 901 gaatgcctct acctccaccg tacgtctggc tggttctaca ggggccaatc cgtttgcctg 961 tattgctgcg ggcattgccg ctctgtgggg acctgcacat ggtggcgcaa acgaagctgt 1021 gctgaaaatg ctggcccgta ttggcaagaa agaaaatatt cctgccttta tcgcacaggt 1081 gaaggacaag aacagcggcg taaagctgat gggctttggc caccgcgttt acaagaactt 1141 cgacccacgt gcgaagatca tgcagcagac ctgccacgaa gtgctgacag aacttggcat 1201 taaggatgat ccgctgctgg atctggcggt tgagctggaa aagattgctc tgagcgatga 1261 ttacttcgtg cagcgcaaac tttacccgaa tgtggatttc tactctggca tcattctcaa 1321 ggccatgggc atccccacca gtatgtttac tgtgctgttt gccgtagccc gcaccaccgg 1381 ctgggtgagc cagtggaagg aaatgattga agaaccgggc cagcgtatca gccgccctcg 1441 ccagctttat attggcgcac cgcagcgtga ctatgtgccg cttgccaaac gctaaaacag 1501 actaacccaa aaagccgact tcccgtaagg aaagtcggct ttttgtttgc acgctgtttc 1561 caaaaaaata gggcggcaga gcgaataaac gctacctagc cttcaggcat aaaaaaacgc 1621 atgc // LOCUS BOVBADPTA 708 bp ss-mRNA MAM 12-JUL-1990 
DEFINITION Cow beta adaptin mRNA, partial cds. ACCESSION M34177 J05273 KEYWORDS beta adaptin. SOURCE Cow brain, cDNA to mRNA. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 708) AUTHORS Ponnambalam,S., Robinson,M.S., Jackson,A.P., Peiperl,L. and Parham,P. TITLE Conservation and diversity in families of coated vesicle adaptins JOURNAL J. Biol. Chem. 265, 4814-4820 (1990) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 708 beta adaptin (AA at 1) BASE COUNT 198 a 159 c 175 g 176 t ORIGIN 1 gctgtgaaga aagtgattgc tgctatgact gtggggaaag acgttagctc tctctttcca 61 gatgtagtga actgtatgca gacggataat ctggaactga agaagcttgt gtatctctac 121 ttgatgaact atgccaagag tcagccagac atggccatca tggctgtcaa cagctttgtg 181 aaggattgtg aagatcccaa tcctctgatt cgagctttgg cagtcagaac catggggtgc 241 atccgggtgg acaagataac agagtatctc tgtgagcccc tccgcaagtg cttaaaggat 301 gaagatccct acgtccggaa gacagcagca gtctgcgtgg caaaactcca tgacatcaat 361 gcccagatgg tggaagatca gggatttctg gattctctgc gggatctcat agcagattca 421 aatccaatgg tggtggctaa tgctgtagca gcactatctg aaatcagtga atctcacccc 481 aacagcaact tactcgatct gaatccacag aacattaata agctactgac agccctgaat 541 gagtgcaccg aatggggcca gattttcatc ctggactgct tatctaatta caatcctaaa 601 gatgaccggg aggctcagag catctgtgag cgggtaactc cccggttatc tcatgccaac 661 tcagcagtgg tgctttcagc agtaaaagtc ctaatgaaat ttttggaa // LOCUS BSUSENSA 1773 bp ds-DNA BCT 12-JUL-1990 DEFINITION B.subtilis transcription regulatory protein (senS) gene, complete cds. ACCESSION M34826 M30611 KEYWORDS transcription regulatory protein. SOURCE B.subtilis (strain DB2) DNA, clone pWL[77,80]. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1773) AUTHORS Wang,L.-F. and Doi,R.H. 
TITLE Complex character of senS, a novel gene regulating expression of extracellular-protein genes of Bacillus subtilis JOURNAL J. Bacteriol. 172, 1939-1947 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.H.Doi, 11-DEC-1989, for release after publication. FEATURES from to/span description pept 1486 1683 transcription regulatory protein senS binding 1470 1477 ribosomal binding site (put.) site 1455 1474 transcription termination signal site 1671 1698 rho-independent transcription terminator BASE COUNT 460 a 397 c 388 g 528 t ORIGIN 1 agttcttgga aattctgatt ttcgatatct ggcgaattta cgtagtctcc catcgtttct 61 ttcgaaaggg acgttctcag cccctcaatc cagcggacat tttgtctttt ttctccaggg 121 gatgtccagt ttgttaagta ttcctgggcg atgattgcgt cacgataata aaatgccgtt 181 tggtcgggag cgacccgtcc ggctgccccg ccgagtgctt gctgccagac actggcgttt 241 tgattcggag cgtgctctaa aaagtgtttt attgttgaga tcgcacgttc tgataatggc 301 ttttcaatga aagagccgga gcgtttcatt ttttgaggct gattgcctcc cgggctgtta 361 aaaaaggtta ccgcttcaat gaatggcgtt gtttttacca ttccgcttga cggacttcct 421 gctttcaata aaggctttaa cagttttttt aactctgttt ttggcccgac aaattggccg 481 agggcttcta tgcggtttac ttctttaggc caaaactcta ttgatgatgt aagccggtca 541 tctgtatacg gggcccagtt ctgccacgtg ttatatactt cctcaaaatc atcccatccc 601 catgtaatag aaaaaatcga cacttgagag atgggcactg ctttaaatgt catggaggtg 661 actatgccga aattgcctcc tccgcctccc tgagacgccc aaaatgtgga tgatttgaac 721 agctgactgt aatcagatca gcgccctctt tttcgtctgc tacgatcatc tcaagctgca 781 cgaggctgtc gcaagtaaga ccggcagccc ttgttaaaag tccaattccc cctccgagag 841 ttaaacctgt gagccctaca ttagcaatgg tgcctgcggg aagcgtcagg ccgtattgcc 901 agagtgtccg atagacttct cccaattcag cccccgcttc aatataggcc agctttttat 961 cctgattcac agttattttt ttcatctcgc ttaaatcaat aacaagaccg ttatttaaaa 1021 gggaaaagtt ctcatagctg tgtctgccgc ctctaatacg gaaaggcaca cggttttcac 1081 gcgcccattt cagcgcattg agtgcatcct gtttgttttg gcaaaacaca atgatgtcag 1141 atcctttcta agcttaggtt aatattggtt cttgcttcgt tatagtccgg 
atcatcccgt 1201 gtcacgatac gtccggtcaa ttttgtcttt tccacactcc cacatctctt tctctcgtat 1261 tctagtttct ctagcttatg cgtcagggga aaagagtgta taaggaaaaa gcggggatgc 1321 aatctgatac agtgtcaaca ccctcaaaaa atagttgaca ggtcggtatt gtatgaatta 1381 acatggtcag tacaaatttt tcaaatttat cgcgctgatc ggaacaccga aggctcttat 1441 cgtttagata agggcctttt ttgtatgaaa aaggggggat tattgatggg agtcaaaaaa 1501 gaaaagggga gaaaacgatt caggaagcga aaaacctacg ggaatcagat tttgccgctt 1561 gagctgctga ttgaaaaaaa caaacgagag attataaaca gcgcggaact catggaagaa 1621 atttatatga agattgatga gaagcatacg caatgtgtaa ctaaatataa aaaaacccgc 1681 tgactacaac gggtttttgc atttctccat taagaatctt ttttaatcgg caatccaagg 1741 ccttctgcca cgcgttttcc gtattcagga tcc // LOCUS CHITDNA 176 bp ds-DNA INV 12-JUL-1990 DEFINITION C.thummi telomeric DNA. ACCESSION M33211 KEYWORDS telomeric DNA. SOURCE C.thummi heat-shocked larvae, cDNA to mRNA, clone lambda-Cth5. ORGANISM Chironomus thummi Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Chironomidae. REFERENCE 1 (bases 1 to 176) AUTHORS Carmona,M.J., Morcillo,G., Galler,R., Martinez-Salas,E., de la Campa,A.G., Diez,J.L. and Edstroem,J.E. TITLE Cloning and molecular characterization of a telomeric sequence from a temperature-induced Balbiani ring JOURNAL Chromosoma 92, 108-115 (1985) STANDARD simple staff_entry BASE COUNT 63 a 34 c 27 g 52 t ORIGIN Chromosome III. 1 aattctagaa aaatcgagtt ttttcgaaaa catgaaaatt ttttttctct catcctagaa 61 caagtgtttt agacctcaaa acagatgtga acataaaagt gatgtattga caaaagttgc 121 tccaaactga gatgcatcca acgtgatatc gatatcccat gtacccccct atggaa // LOCUS ECOSUHBA 1017 bp ds-DNA BCT 12-JUL-1990 DEFINITION E.coli extragenic suppressor (suhB) gene, complete cds. ACCESSION M34828 KEYWORDS extragenic suppressor; suhB gene. SOURCE E.coli DNA, clone pRY61. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. 
REFERENCE 1 (bases 1 to 1017) AUTHORS Yano,R., Nagai,H., Shiba,K. and Yura,T. TITLE A mutation that enhances synthesis of sigma-32 and suppresses temperature-sensitive growth of the rpoH15 mutant of Escherichia coli JOURNAL J. Bacteriol. 172, 2124-2130 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 194 997 suhB protein signal 122 127 -35 region signal 145 150 -10 region BASE COUNT 244 a 278 c 259 g 236 t ORIGIN 55 min on K12 map. 1 catggcacgg gcaacagaac ccatattgcc ggtgtgtgac gtctccacca gcacaattcg 61 aatattttgc agcattgtct ttcttcatct aaagattatt cacgcatctt atcataaaac 121 gaagacagat gccgatctcg ctgctatact ctgcgccgtt ttcccgttct ttaacatcca 181 gtgagagaga ccgatgcatc cgatgctgaa catcgccgtg cgcgcagcgc gcaaggcggg 241 taatttaatt gccaaaaact atgaaacccc ggacgctgta gaagcgagcc agaaaggcag 301 taacgatttc gtgaccaacg tagataaagc tgccgaagcg gtgattatcg acacgattcg 361 taaatcttac ccacagcaca ccatcatcac cgaagaaagc ggtgaacttg aaggtactga 421 tcaggatgtt caatgggtta tcgatccact ggatggcact accaacttta tcaaacgtct 481 gccgcacttc gcggtatcta tcgctgttcg tatcaaaggc cgcaccgaag ttgctgtggt 541 atacgatcct atgcgtaacg aactgttcac cgccactcgc ggtcagggcg cacagctgaa 601 cggctaccga ctgctcggca gcaccgctcg cgatctcgac ggtactattc tggcgaccgg 661 cttcccgttc aaagcaaaac agtacgccac tacctacatc aacatcgtcg gcaaactgtt 721 caacgaatgt gcagacttcc gtcgtaccgg ttctgcggcg ctggatctgg cttacgtcgc 781 tgcgggtcgt gttgacggtt tctttgaaat cggtctgcgc ccgtgggact tcgccgcagg 841 cgagctgctg gttcgtgaag cgggcggcat cgtcagcgac ttcaccggtg gtcataacta 901 catgctgacc ggtaacatcg ttgctggtaa cccgcgcgtt gttaaagcca tgctggcgaa 961 catgcgtgac gagttaagcg acgctctgaa gcgttaatga ctcaggcggg tgatatc // LOCUS HUMBADPTA 5701 bp ss-mRNA PRI 12-JUL-1990 DEFINITION Human beta adaptin mRNA, complete cds. ACCESSION M34175 J05273 KEYWORDS beta adaptin. SOURCE Human fibroblast, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. 
REFERENCE 1 (bases 1 to 5701) AUTHORS Ponnambalam,S., Robinson,M.S., Jackson,A.P., Peiperl,L. and Parham,P. TITLE Conservation and diversity in families of coated vesicle adaptins JOURNAL J. Biol. Chem. 265, 4814-4820 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 178 2991 beta adaptin mRNA < 1 5701 beta adaptin mRNA signal 5683 5688 polyA signal BASE COUNT 1528 a 1373 c 1284 g 1516 t ORIGIN 1 ctgcccacca tctttgtccc tggcaaagtg ggttttgcgc agtggcttag acctagaaaa 61 gaatcgtgac gggcaggaaa ccattacacc accacctggg ctgtgctctc cggctcccgc 121 cgccaccccc gccctcgcct tcgcctccgc tccggtgcac attaaagatc caaagtcatg 181 actgactcca agtatttcac aaccaataaa aaaggagaaa tatttgaact aaaagctgaa 241 ctcaacaatg aaaagaaaga aaagagaaag gaggctgtga agaaagtgat tgctgctatg 301 accgtgggga aggatgttag ttctctcttt ccagacgtag tgaactgtat gcagactgac 361 aatctggaac taaagaagct tgtgtatctc tacttgatga actacgccaa gagtcagcca 421 gacatggcca tcatggctgt aaacagcttt gtgaaggact gtgaagatcc taatcctttg 481 attcgagcct tggcagtcag aaccatgggg tgcatccggg tagacaaaat tacagaatat 541 ctctgtgagc cgctccgcaa gtgcttgaag gatgaggatc cctatgttcg gaaaacagca 601 gcagtctgcg tggcaaaact ccatgatatc aatgcccaaa tggtggaaga tcagggattt 661 ctggattctc tacgggatct catagcagat tcaaatccaa tggtggtggc taatgccgta 721 gcggcattat ctgaaatcag tgagtctcac ccaaacagca acttacttga tctgaaccca 781 cagaacatta ataagctgct gacagccctg aatgaatgca ctgaatgggg ccagattttc 841 atcctggact gcctgtctaa ttacaaccct aaagatgatc gggaggctca gagcatctgt 901 gagcgggtaa ctccccggct atcccatgcc aactcagcag tggtgctttc agcggtaaaa 961 gtcctaatga agtttctaga attgttacct aaggattctg actactacaa tatgctgctg 1021 aagaagttag cccctccact tgtcactttg ctgtctgggg agccagaagt gcagtatgtc 1081 gccctgagga acatcaactt aattgtccag aaaaggcctg aaatcttgaa gcaggaaatc 1141 aaagtcttct ttgtgaagta caatgatccc atctatgtta aactagagaa gttggacatc 1201 atgattcgtt tggcatctca agccaacatt gctcaggttc tggcagaact gaaagaatat 1261 gctacagagg tggatgttga ctttgttcga aaagctgtgc gggccattgg acggtgtgcc 1321 atcaaggtgg agcaatctgc 
agagcgctgt gtaagcacat tgcttgatct aatccagacc 1381 aaagtgaatt atgtggtcca agaagcaatt gttgtcatca gggacatctt ccgcaaatac 1441 cccaacaagt atgaaagtat catcgccact ctgtgtgaga acttagactc gctggatgag 1501 ccagatgctc gagcagctat gatttggatt gtgggagaat atgctgaaag aattgacaat 1561 gcagatgagt tactagaaag cttcctggag ggttttcacg atgaaagcac ccaggtgcag 1621 ctcactctgc ttactgccat agtgaagctg tttctcaaga aaccatcaga aacacaggag 1681 ctagtccagc aggtcttgag tttggcaaca caggattctg ataatcctga ccttcgagac 1741 cggggctata tttattggcg ccttctctca actgaccctg ttacagctaa agaagtagtc 1801 ttgtctgaga agccactgat ctctgaggag acggacctta ttgagccaac tctgctggat 1861 gagctaatct gccacattgg ttctttggcc tctgtgtatc ataagcctcc caatgctttt 1921 gtggaaggaa gtcatggaat tcatcgtaaa cacttgccaa ttcatcatgg gagcactgat 1981 gcaggtgaca gccctgttgg cactaccact gcaacgaacc tggaacagcc tcaggttatc 2041 ccctctcaag gtgatcttct aggggatctt ttaaaccttg acctcggtcc cccagtcaat 2101 gtgccacagg tgtcctccat gcagatggga gcagtggatc tcctaggagg aggactagat 2161 agtctggtgg gacaatcctt catcccatca tcggtgcctg caacctttgc tccttcacct 2221 acacctgctg tggtcagcag tggactgaat gacctgtttg aactctccac agggataggc 2281 atggcacctg gtggatatgt ggctcctaag gctgtctggc tacctgcagt aaaggctaaa 2341 ggcttggaga tttccggaac atttactcac cgccaagggc acatctatat ggaaatgaac 2401 ttcaccaata aagctctgca gcacatgaca gattttgcaa tccagtttaa caaaaatagc 2461 tttggtgtca tccccagcac tcctctggcc atccatacac cactgatgcc aaaccagagc 2521 attgatgtct ccctgcctct caataccttg ggcccagtca tgaagatgga acctctgaat 2581 aacctccagg tggctgtgaa aaacaatatc gatgtcttct acttcagctg cctcatccca 2641 ctcaatgtgc tttttgtaga agatggcaaa atggagcgcc aggtcttcct tgcaacatgg 2701 aaggatattc ccaatgaaaa tgaacttcag tttcagatta aggaatgtca tttaaatgct 2761 gacactgttt ccagcaagtt gcaaaacaac aatgtttata ctattgccaa gaggaatgtg 2821 gaagggcagg acatgctgta ccaatccctg aagctcacta atggcatttg gattttggcc 2881 gaactacgta tccagccagg aaaccccaat tacacgctgt cactgaagtg tagagctcct 2941 gaagtctctc aatacatcta tcaggtctac gacagcattt tgaaaaacta acaagactgg 3001 tccagtaccc ttcaaccatg ctgtgatcgg 
tgcaagtcaa gaactcttaa ctggaagaaa 3061 ttgtattgct gcgtagaatc tgaacacact gaggccacct agcaaggtag taactagtct 3121 aacctgtgct aacattaggg cacaacctgt tggatagttt tagcttcctg tgaacatttg 3181 taaccactgc ttcagtcacc tcccacctct tgccacctgc tgctgctatc tgtccttact 3241 tgtgggcttc tccatgctgt gccaatggct ggctttttct acaccctctt ttgagtgtag 3301 tttggtattt tgtaattgag agctcatttc aaaagcagaa aaagacaaca aatattaaag 3361 caaggaaaag tgtaactgaa acactgcact ttactgtttt atacttttgt acatatgaga 3421 aatcaaggga ttagtgcaac cagtagaagg cattgaaatg actgtcatta accacacagt 3481 cctggaggca gagatgcagt tacctaccct agcttttgat gggttctctt acctgtagta 3541 gccttatccc tggtcatttg gattttcagt ttgctttttt ctttttttcc cctccaaact 3601 ccttttcctt ggccaagcct tcatgcttcc ccctttccat attataatct catttgattg 3661 ctctgcagtt gggaacggtg atcttcttga atgatgtttc agtgtgcaaa aactatagag 3721 cctgtcagca ccaaagctga cagaagttat accttactcc tttcctttcc cctgaacaaa 3781 cctgctaatc ccactaattc aggaatttga gtagagatgg ggaacaagaa cccagatgct 3841 gtcccctcac cccctctcct gtatttctca ggtccagttc aaatctaaaa ttctactttt 3901 agagttgaaa cagagtaata acttatctaa ccctcttttc ctacaaagga gaaagataaa 3961 aggcacaaag gttaccgcca aggcccgtca gctgtgtagt ggcaaagccg agaccgagtc 4021 tcctaagtcc ccgtcagtgt ggttttcacc acaggactgt ctcttgtcgt tttcccctaa 4081 tgccttctcc tgccttttct gtgcctagtt tttggctctt cacatattcc atattgattt 4141 tgacgctctg tatattggca tcaggtggca gctgaatatc ttttgaatta ctcgaaggta 4201 aagccagatg ccagaatgaa ggtgtagcca gtgtttccca tatgcccctg gagccccact 4261 tattgaggcc agcagaatag gtgcagagat gaagtgagct tagagatgtt gcaaatgctc 4321 tttatccctt cagctctctg atctgctctt tcttcatgat acttagtctg cagggcatat 4381 taagatcatc ccagaggttc aggcagttcc tgtcatctct gaaaagactg ggggatatga 4441 aatcttcccc ctaccccact taatgcgttg gatatgattt ttcaaagaat gcttcatgcc 4501 caaaatacca gcctgtttag cagtgttaca ctgtttgatc tgcgggcact tgttgcattg 4561 cctggcaccc aatattcagg gtccatgact aagactggtc ttctcagatg ccctgcttaa 4621 atcaggggca cttcaggctc cacaggcgtc atgttggact gagacctaac tcactggact 4681 cagaggagga atcgtggaaa acaagagcaa aactacccca 
cacccctatt tcatgtctga 4741 aataaccctg tttcatacca gttgcaaagc ttgtggggag cggtcccaca aagcactttc 4801 ttaaaccttg agaatctcca agagaaaaat atttggggaa ggagggagga aatatgtccc 4861 ttgcacacca cccctgaagc acatggcagt aggaaacagc ataggattgt atgtgggagg 4921 tggataggtc ggtgatgtgt ggagcggaaa agcaggttgg taaagttccc ttcttgggac 4981 ttattcctgg agtcagtgga tacaagtagt gcagaaggtt cacactgcaa atagtgttct 5041 catctcaaag caaactatca ttccagaagg aaaagtgtgt cagggcaagc agacaacaca 5101 atttcctatc agaatatgtc cctcaacccc cgaaacaagg cttctctcag cctccccacc 5161 agtgatggat aacagctcct attctcagct gacctgactg agccaaccca tgaactcttc 5221 actccttggg gaagccacct cccatcacac ccctgagcag agttagggag gaattctact 5281 tcccataaaa ggacctctcc tgagaggcaa aacctgttgc ctccaccacg gcttccctct 5341 tggctcattc caagcttggc caaattgggg aagtgggatg gaggttgccc tgcatccccc 5401 ctcctctgcc tgagtgtgtc tttgtaatgt cagctggcat catacaaaga gcaggagaag 5461 caaacaccca gaactctttt gctggtcaga gattccctga gtgtctgtcc tcacccaagc 5521 ctgctctgtg tctgtgttgt gaagcttgag actctggaaa gaaatgggga gggggggcag 5581 gggaaatgtt gccctaagaa tgcttctcat tcctctgttc ttattgggtc ctgtttttcg 5641 ggagggtggg ggttggggga agcttgacct tgtgtcttcg tcaataaact cacatttaca 5701 c // LOCUS HUMCD59A 1671 bp ss-mRNA PRI 12-JUL-1990 DEFINITION Human lymphocytic antigen CD59/MEM43 mRNA, complete cds. ACCESSION M34671 X15861 KEYWORDS CD59 antigen; cell surface antigen; integral membrane protein. SOURCE Human peripheral blood monocyte, cDNA to mRNA, clone R18. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 108 to 443) AUTHORS Sawada,R., Ohashi,K., Okano,K., Hattori,M., Minato,N. and Naruto,M. TITLE Complementary DNA sequence and deduced peptide sequence for CD59/MEM43 antigen, the human homologue of murine lymphocyte antigen Ly-6c JOURNAL Nucleic Acids Res.
17, 6728-6728 (1989) STANDARD simple staff_entry REFERENCE 2 (bases 1 to 1671) AUTHORS Sawada,R., Ohashi,K., Anaguchi,H., Okazaki,H., Hattori,M., Minato,N. and Naruto,M. TITLE Isolation and expression of the full-length cDNA encoding CD59 antigen of human lymphocytes JOURNAL DNA 9, 213-220 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer readable copy for sequence [1] kindly provided by Naruto,M., 17-JUL-1989. [1] Author address: Naruto,M. Basic Research Laboratories Toray Industries Inc 1111 Tebiro Kamakura 248, Japan. FEATURES from to/span description pept 30 416 antigen CD59 precursor (CD59) /hgml_locus_uid="LY0169B" /nomgen="CD59" /map="11pter-p13" sigp 30 104 CD59 signal peptide matp 105 413 CD59 protein mRNA < 1 1671 CD59 mRNA signal 527 532 polyA signal BASE COUNT 434 a 347 c 390 g 500 t ORIGIN 1 ggcgccgcca ggttctgtgg acaatcacaa tgggaatcca aggagggtct gtcctgttcg 61 ggctgctgct cgtcctggct gtcttctgcc attcaggtca tagcctgcag tgctacaact 121 gtcctaaccc aactgctgac tgcaaaacag ccgtcaattg ttcatctgat tttgatgcgt 181 gtctcattac caaagctggg ttacaagtgt ataacaagtg ttggaagttt gagcattgca 241 atttcaacga cgtcacaacc cgcttgaggg aaaatgagct aacgtactac tgctgcaaga 301 aggacctgtg taactttaac gaacagcttg aaaatggtgg gacatcctta tcagagaaaa 361 cagttcttct gctggtgact ccatttctgg cagcagcctg gagccttcat ccctaagtca 421 acaccaggag agcttctccc aaactccccg ttcctgcgta gtccgctttc tcttgctgcc 481 acattctaaa ggcttgatat tttccaaatg gatcctgttg ggaaagaata aaattagctt 541 gagcaacctg gctaagatag aggggctctg ggagactttg aagaccagtc ctgtttgcag 601 ggaagcccca cttgaaggaa gaagtctaag agtgaagtag gtgtgacttg aactagattg 661 catgcttcct cctttgctct tgggaagacc agctttgcag tgacagcttg agtgggttct 721 ctgcagccct cagattattt ttcctctggc tccttggatg tagtcagtta gcatcattag 781 tacatctttg gagggtgggg caggagtata tgagcatcct ctctcacatg gaacgctttc 841 ataaacttca gggatcccgt gttgccatgg aggcatgcca aatgttccat atgtgggtgt 901 cagtcaggga caacaagatc cttaatgcag agctagagga cttctggcag ggaagtgggg 961 aagtgttcca gatagcaggg catgaaaact tagagaggta 
caagtggctg aaaatcgagt 1021 ttttcctctg tctttaaatt ttatatgggc tttgttatct tccactggaa aagtgtaata 1081 gcatacatca atggtgtgtt aaagctattt ccttgccttt ttttattgga atggtaggat 1141 atcttggctt tgccacacac agttacagag tgaacactct actacatgtg actggcagta 1201 ttaagtgtgc ttattttaaa tgttactggt agaaaggcag ttcaggtatg tgtgtatata 1261 gtatgaatgc agtggggaca ccctttgtgg ttacagtttg agacttccaa aggtcatcct 1321 taataacaac agatctgcag gggtatgttt taccatctgc atccagcctc ctgctaactc 1381 ctagctgact cagcatagat tgtataaaat acctttgtaa cggctcttag cacactcaca 1441 gatgtttgag gctttcagaa gctcttctaa aaaatgatac acacctttca caagggcaaa 1501 ctttttcctt ttccctgtgt attctagtga atgaatctca agattcagta gacctaatga 1561 catttgtatt ttatgatctt ggctgtattt aatggcatag gctgactttt gcagatggag 1621 gaatttcttg attaatgttg aaaaaaaacc cttgattata ctctgttgga c // LOCUS HUMKER19PA 1586 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human keratin K19 pseudogene. ACCESSION M33101 KEYWORDS keratin K19; pseudogene. SOURCE Human, cDNA to mRNA, clone IF7. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1586) AUTHORS Savtchenko,E.S., Schiff,T.A., Jiang,C.-K., Freedberg,I.M. and Blumenberg,M. TITLE Embryonic expression of the human 40-kD keratin: Evidence from a processed pseudogene sequence JOURNAL Am. J. Hum. Genet. 
43, 630-637 (1988) STANDARD simple staff_entry FEATURES from to/span description pept.ps 120 1310 keratin 19 pseudogene signal 1412 1418 polyA signal BASE COUNT 377 a 405 c 470 g 334 t ORIGIN 1 attgataaac atataatctg atatttatgt aaagtagcta ttttttaaaa aaagtatggc 61 tcctccctcg aatcgcagcc tctgggacca gggtcgctcc atccgtcgtc cgcctcgcca 121 tgacttccta cacgtatcgc cagtcgtagg ccaagtagtc cttctggggc ctgggtggtg 181 gctccgtgag ttttgtggca gaggttgcct ttcgcgcgct cagcatgcac tgggcctctg 241 gaggccgcgg cgtgtccgtg tcctccgccc gcttcgtgtc tgtcctcgtc ctccttgggg 301 ggctacggcg gcgtcttggc cgtgtcctac gggctgctgg cgggcaacga gaagctcaat 361 atgcagaacc tcagcgaccc tctggcctcc tacctggaca aggtgggcgc cctggaggac 421 gccaacggca aactggaggt gaagatccgc gactggtacc agaagcaggg gcccgggcct 481 cccgtgacta cagccactct acaagactat ccaggacctg cggtacaaga ttcttggtgc 541 caccattgag aactccagga ttgtcctgga gatcgacaac gcccgtctgg ctgcagatga 601 cttccgaacc aagagtgaga cggagcaggc tctgcgcatg agcggaggcc gacatcaacg 661 gcctgcgcag ggtgctggac gagctgaccc tggccattac cgacctggag atgcagatct 721 aaggcctgaa ggaagagctg gcctacctga agaagaacca tgagaaggaa atcagtgggc 781 tgaggggcca agtgggaggc caggtcagtg gggaggtgga ttcggctcag ggcacctatc 841 tcgccaagat cctgagttac atgcgaacgc aatacgaggt catggcggac aacaactgga 901 aggatgctga agcctggttc accagccgga ctgaagaatt gaaccgggag gtcgctggcc 961 acacagatca gctccagatg agccggtcca aggtcgctga cctgcggcgc accctccagg 1021 gtcttgagct ggagctgcag tcacggctga gcatgaaagc cgccttggaa gccacactgg 1081 cagaaacgga ggcgcgcttt ggagtccact tggcgcagat ccagccgctg atcaactgta 1141 ttgaagccca gctgggcgat gtgcgagctg atagtgagcg gcagaatcag gattaacagc 1201 agttcatgga catcaagtcg cggctggagc aggagatctc cacctaccgc agcctgctcg 1261 agggccagaa ggatcactac aacaacctgt ccgcctccaa ggtcctctga ggcagcaggc 1321 taaggggctt ctactgtcct ttggagggtg tctcctgggt agggggatgg gaaggaaggg 1381 acccttaccc cctgctcttc ccctgatctg ccaataaaat tttatggtcc aaggggaaaa 1441 aaaaaaaaaa aaaaaatata tatatatata tatatatata tatatatgtg tgtgtgtgtg 1501 tgtgtgtgtg tgtatatata cgtgtgtgtg tatatatata 
tatatgaaaa acaatacatg 1561 ctcgttgtag aaatgtggaa acatgg // LOCUS HUMLOX15A 2671 bp ss-mRNA PRI 12-JUL-1990 DEFINITION Human 15-lipoxygenase mRNA, complete cds. ACCESSION M23892 KEYWORDS 15-lipoxygenase. SOURCE Human reticulocyte, cDNA to mRNA, clone 15LOX. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2671) AUTHORS Sigal,E., Craik,C.S., Highland,E., Grunberger,D., Costello,L.L., Dixon,R.A.F. and Nadel,J.A. TITLE Molecular cloning and primary structure of human 15-lipoxygenase JOURNAL Biochem. Biophys. Res. Commun. 157, 457-464 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 4 1992 15-lipoxygenase mRNA < 1 2671 15-lipoxygenase mRNA BASE COUNT 580 a 743 c 718 g 630 t ORIGIN 1 aagatgggtc tctaccgcat ccgcgtgtcc actggggcct cgctctatgc cggttccaac 61 aaccaggtgc agctgtggct ggtcggccag cacggggagg cggcgctcgg gaagcgactg 121 tggcccgcac ggggcaagga gacagaactc aaggtggaag taccggagta tctggggccg 181 ctgctgtttg tgaaactgcg caaacggcac ctccttaagg acgacgcctg gttctgcaac 241 tggatctctg tgcagggccc cggagccggg gacgaggtca ggttcccttg ttaccgctgg 301 gtggagggca acggcgtcct gagcctgcct gaaggcaccg gccgcactgt gggcgaggac 361 cctcagggcc tgttccagaa acaccgggaa gaagagctgg aagagagaag gaagttgtac 421 cggtggggaa actggaagga cgggttaatt ctgaatatgg ctggggccaa actatatgac 481 ctccctgtgg atgagcgatt tctggaagac aagagagttg actttgaggt ttcgctggcc 541 aaggggctgg ccgacctcgc tatcaaagac tctctaaatg ttctgacttg ctggaaggat 601 ctagatgact tcaaccggat tttctggtgt ggtcagagca agctggctga gcgcgtgcgg 661 gactcctgga aggaagatgc cttatttggg taccagtttc ttaatggcgc caaccccgtg 721 gtgctgaggc gctctgctca ccttcctgct cgcctagtgt tccctccagg catggaggaa 781 ctgcaggccc agctggagaa ggagctggag ggaggcacac tgttcgaagc tgacttctcc 841 ctgctggatg ggatcaaggc caacgtcatt ctctgtagcc agcagcacct ggctgcccct 901 ctagtcatgc tgaaattgca gcctgatggg aaactcttgc ccatggtcat ccagctccag 961 ctgccccgca caggatcccc accacctccc cttttcttgc 
ctacggatcc cccaatggcc 1021 tggcttctgg ccaaatgctg ggtgcgcagc tctgacttcc agctccatga gctgcagtct 1081 catcttctga ggggacactt gatggctgag gtcattgttg tggccaccat gaggtgcctg 1141 ccgtcgatac atcctatctt caagcttata attccccacc tgcgatacac cctggaaatt 1201 aacgtccggg ccaggactgg gctggtctct gacatgggaa ttttcgacca gataatgagc 1261 actggtgggg gaggccacgt gcagctgctc aagcaagctg gagccttcct aacctacagc 1321 tccttctgtc cccctgatga cttggccgac cgggggctcc tgggagtgaa gtcttccttc 1381 tatgcccaag atgcgctgcg gctctgggaa atcatctatc ggtatgtgga aggaatcgtg 1441 agtctccact ataagacaga cgtggctgtg aaagacgacc cagagctgca gacctggtgt 1501 cgagagatca ctgaaatcgg gctgcaaggg gcccaggacc gagggtttcc tgtctcttta 1561 caggctcggg accaggtttg ccactttgtc accatgtgta tcttcacctg caccggccaa 1621 cacgcctctg tgcacctggg ccagctggac tggtactctt gggtgcctaa tgcaccctgc 1681 acgatgcggc tgcccccgcc aaccaccaag gatgcaacgc tggagacagt gatggcgaca 1741 ctgcccaact tccaccaggc ttctctccag atgtccatca cttggcagct gggcagacgc 1801 cagcccgtta tggtggctgt gggccagcat gaggaggagt atttttcggg ccctgagcct 1861 aaggctgtgc tgaagaagtt cagggaggag ctggctgccc tggataagga aattgagatc 1921 cggaatgcaa agctggacat gccctacgag tacctgcggc ccagcgtggt ggaaaacagt 1981 gtggccatct aagcgtcgcc accctttggt tatttcagcc cccatcaccc aagccacaag 2041 ctgacccctt cgtggttata gccctgccct cccaagtccc accctcttcc catgtcccac 2101 cctccctaga ggggcacctt ttcatggtct ctgcacccag tgaacacatt ttactctaga 2161 ggcatcacct gggaccttac tcctctttcc ttccttcctc ctttcctatc ttccttcctc 2221 tctctcttcc tctttcttca ttcagatcta tatggcaaat agccacaatt atataaatca 2281 tttcaagact agaatagggg gatataatac atattactcc acacctttta tgaatcaaat 2341 atgatttttt tgttgttgtt aagacagagt ctcactttga cacccaggct ggagtgcagt 2401 ggtgccatca ccacggctca ctgcagcctc agcgtcctgg gctcaaatga tcctcccacc 2461 tcagcctcct gagtagctgg gactacaggc tcatgccatc atgcccagct aatatttttt 2521 tattttcgtg gagacggggc ctcactatgt tgcctaggct ggaaatagga ttttgaaccc 2581 aaattgagtt taacaataat aaaaagttgt tttacgctaa agatggaaaa gaactaggac 2641 tgaactattt taaataaaat attggcaaaa g // LOCUS MUSBPGALA 
334 bp ss-mRNA ROD 12-JUL-1990 DEFINITION Mouse beta-galactoside-binding lectin (L-14.5) mRNA, 5' end. ACCESSION M33214 KEYWORDS beta-galactoside-binding lectin. SOURCE Mouse (strain C57BL/6) 12 day old embryo melanoma cell line UV-2237-IP, cDNA to mRNA, clone L3. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 334) AUTHORS Raz,A., Carmi,P. and Pazerini,G. TITLE Expression of two different endogenous galactoside-binding lectins sharing sequence homology JOURNAL Cancer Res. 48, 645-649 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 16 > 334 14 kDa beta-galactoside-binding lectin (L-14.5) BASE COUNT 86 a 94 c 91 g 63 t ORIGIN 1 gaattgggta caatcatggc ctgtggtctg gtggatcagc aagctgaatc tcaaactggg 61 gcaatgtctc aaagttcggg gcagaggtgg acctcggacg acaggagctt tgtgctgacc 121 ctgggaaaag acagcaacaa ccgttgccta cacttcaatc ctcgcttcaa tgcccatgga 181 gacgccaaca ccattctgtg taacaccaag gaagatggga cctggggaac cgaacaccgg 241 gaacctgcct tccccttcca gcccgggagc atcacagagt gtgcatgcac ctttgaccag 301 gctgacctga ccatgcaagc tgccagacgg acat // LOCUS MUSBPGALB 621 bp ss-mRNA ROD 12-JUL-1990 DEFINITION Mouse beta-galactoside-binding lectin (L-34) mRNA, 3' end. ACCESSION M33215 KEYWORDS beta-galactoside-binding lectin. SOURCE Mouse (strain C57BL/6) 12 day old embryo melanoma cell line UV-2237-IP3, cDNA to mRNA, clone M5. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 621) AUTHORS Raz,A., Carmi,P. and Pazerini,G. TITLE Expression of two different endogenous galactoside-binding lectins sharing sequence homology JOURNAL Cancer Res. 
48, 645-649 (1988) STANDARD simple staff_entry FEATURES from to/span description pept < 1 420 34 kDa beta-galactoside-binding lectin (L-34) BASE COUNT 164 a 171 c 154 g 132 t ORIGIN 1 cccagggcaa cctggggcac ctggggccat ccccagtgct cctggaggct atcctgctgc 61 tggcccttat ggtgtccccg ctggaccact gacgtgccct atgacctgcc cttgcctgga 121 ggagtcatgc cccgcatgct gatcacaatc atgggcacag tgaaacccaa cgcaaacagg 181 attgttctag atttcaggag agggaatgat gttgccttcc actttaaccc ccgcttcaat 241 gagaacaaca gaagactcat tgtgtgtaac acgaagcagg acaataactg gggaaaggaa 301 gaaagacagt cagccttccc ctttgagagt ggaaaaccat tcaaaataca agtcctggtt 361 gcagctgacc attcaggttg cggtcacgat gctcactact gcagtacaac catcggatga 421 agaacctccg ggaaatcagc caactggcga tcagtggtga cataaccctg caccagcgct 481 gaaccagcgc catgatctaa gccagaaggg gcggcaccga aaccggccct gtgtgcctta 541 ggagtgggaa actttgcatt tctctctcct tatccttctt gtaagacatc catttaataa 601 agtctcatgc tgagagaaaa g // LOCUS MUSP32A 1510 bp ss-mRNA ROD 12-JUL-1990 DEFINITION Mouse tumor-induced 32 kD protein (p32) mRNA, complete cds. ACCESSION M33203 KEYWORDS tumor-induced protein. SOURCE Mouse (strain BALB/c) fibroblast cell line 3T3 A31, cDNA to mRNA, clone pMp32S. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1510) AUTHORS Kageyama,H., Hiwasa,T., Tokunaga,K. and Sakiyama,S. TITLE Isolation and characterization of a complementary DNA clone for a M-r 32,000 protein which is induced with tumor promoters in BALB/c 3T3 cells JOURNAL Cancer Res. 
48, 4795-4798 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 75 944 32 kD protein (p32) mRNA < 1 1510 p32 mRNA signal 1491 1496 poly-A signal BASE COUNT 352 a 420 c 381 g 357 t ORIGIN 1 ccgcgcagag ccgtctcgag catagcccgg agcctgaatc gagcagaacc agcctgaact 61 agcccagtcc ggtgatggag cgtccacagc ccgacagcat gccccaggat ttgtctgagg 121 ccttgaagga ggccaccaag gaggtacaca tccaagccga gaatgctgag ttcatgaaga 181 actttcagaa gggtcaggtg tccagagaag gctttaagct ggtgatggct tccttgtacc 241 atatctacac ggccctggaa gaggagatag agcgcaacaa gcagaaccca gtctatgccc 301 cactctactt ccctgaggag ctgcaccgaa gggctgccct ggagcaggac atggccttct 361 ggtatgggcc tcactggcag gaaatcatcc cttgcacgcc agccacacag cactatgtaa 421 agcgtctcca cgaggtgggg cgcactcacc ctgagctgct ggtggcccac gcatataccc 481 gctacctggg tgacctctca gggggtcagg tcctgaagaa gattgcacag aaggccatgg 541 ccttgcccag ctctggggag ggcctggctt tttttacctt cccgaacatc gacagcccca 601 ccaagttcaa acagctctat cgtgctcgaa tgaacactct ggagatgaca cctgaggtca 661 agcacagggt gacagaagag gctaagaccg ccttcctgct caacattgag ctgtttgagg 721 agctgcaggt gatgctgaca gaggaacaca aagaccagag tccctcacag atggcgtcac 781 ttcgtcagag gcctgctagc ctggtgcaag atactgcccc tgcagagaca ccccgaggga 841 aaccccagat cagcactagc tcatcccaga caccgctcct ccagtgggtc ctcactctca 901 gcttcctgtt ggcaacagtg gcagtgggaa tttatgccat gtaaatgcaa tactggcccc 961 caggggctgt gaactctgtc caatgtggcc ttctctctgt aagggagaat cttgcctggc 1021 tctcttctct tgggcctcta agaaagcttt tggggtccct agcccactcc ctgtgtttcc 1081 tttctctctg gaatggaggg agatacctga cacagttccc tcaccaaaag cacatccagc 1141 cagtggcctg aactttgaaa ccagcagccc caaatcctgc agcagagccc caaaactggc 1201 ctgtaaaagc agctgttctg agcccagtgc ccatggttgt aagcatccat gttgactgac 1261 cacgactgct gtcccccagt gccatggcca ctttgatatc cgtttccaga catttctgtc 1321 tcgtatttct gtcttgtttt ttattatttc cccagttcta ccagagtaat ggtattttgt 1381 tgttttgttt tgtcttgttt ttcctaacaa agtggggcta tcttttgagg ggtgggtggg 1441 aaagaattat ttaatagttg taaccttggt ctctaacttc tgtgtgaaat aataaatggc 1501 attatctaac // LOCUS 
PASLKTCABD 7742 bp ds-DNA BCT 12-JUL-1990 DEFINITION P.haemolytica leukotoxin gene cluster, complete cds. ACCESSION M24197 M34943 M34944 KEYWORDS LktA membrane protein; cytolysin; hemolysin; leukotoxin. SOURCE P.haemolytica (strain PHL101) DNA, clones lambda-sh132 and pSH224. ORGANISM Pasteurella haemolytica Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Pasteurellaceae. REFERENCE 1 (bases 1 to 7742) AUTHORS Highlander,S.K., Chidambaram,M., Engler,M.J. and Weinstock,G.M. TITLE DNA sequence of the Pasteurella haemolytica leukotoxin gene cluster JOURNAL DNA 8, 15-28 (1989) STANDARD full staff_review REFERENCE 2 (bases 46 to 276 and 3576 to 3813) AUTHORS Highlander,S.K., Engler,M.J. and Weinstock,G.M. TITLE Secretion and expression of the Pasteurella haemolytica leukotoxin JOURNAL J. Bacteriol. 172, 2343-2350 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.K.Highlander, 24-APR-1989. FEATURES from to/span description pept 216 719 leukotoxin (LktC) pept 735 3596 leukotoxin membrane protein (LktA) pept 3670 5796 leukotoxin (LktB) pept 5808 7244 leukotoxin (LktD) mRNA 186 > 3596 lktCA mRNA signal 173 178 -10 region promoter binding 205 209 lktC ribosomal binding site (put.) binding 722 726 lktA ribosomal binding site (put.) binding 3660 3665 lktB ribosomal binding site (put.) binding 5797 5802 lktD ribosomal binding site (put.) 
BASE COUNT 2573 a 1357 c 1568 g 2244 t ORIGIN 1 taatattaca atgtaattat tttgtttaat ttccctacat tttgtataac tttaaaacac 61 tcctttttct cttctgatta tataaaagac aaaaaataca atttaagcta caaaaaacaa 121 caaaaaacaa caaaaaacac gacaataaga tcgagtaatg attatattat gttataattt 181 ttgacctaat ttagaataat tatcgagtgc aaattatgaa tcaatcttat tttaacttac 241 taggaaacat tacttggcta tggatgaact cctccctcca caaagaatgg agctgtgaac 301 tactagcacg caatgtgatt cctgcaattg aaaatgaaca atatatgcta cttatagata 361 acggtattcc gatcgcttat tgtagttggg cagatttaaa ccttgagact gaggtgaaat 421 atattaagga tattaattcg ttaacaccag aagaatggca gtctggtgac agacgctgga 481 ttattgattg ggtagcacca ttcggacatt ctcaattact ttataaaaaa atgtgtcaga 541 aataccctga tatgatcgtc agatctatac gcttttatcc aaagcagaaa gaattaggca 601 aaattgccta ctttaaagga ggtaaattag ataaaaaaac agcaaaaaaa cgttttgata 661 catatcaaga agagctggca acacgactta aaaatgaatt taattttatt aaaaaataga 721 aggagacatc ccttatggga actagactta caaccctatc aaatgggcta aaaaacactt 781 taacggcaac caaaagtggc ttacataaag ccggtcaatc attaacccaa gccggcagtt 841 ctttaaaaac tggggcaaaa aaaattatcc tctatattcc ccaaaattac caatatgata 901 ctgaacaagg taatggttta caggatttag tcaaagcggc cgaagagttg gggattgagg 961 tacaaagaga agaacgcaat aatattgcaa cagctcaaac cagtttaggc acgattcaaa 1021 ccgctattgg cttaactgag cgtggcattg tgttatccgc tccacaaatt gataaattgc 1081 tacagaaaac taaagcaggc caagcattag gttctgccga aagcattgta caaaatgcaa 1141 ataaagccaa aactgtatta tctggcattc aatctatttt aggctcagta ttggctggaa 1201 tggatttaga tgaggcctta cagaataaca gcaaccaaca tgctcttgct aaagctggct 1261 tggagctaac aaattcatta attgaaaata ttgctaattc agtaaaaaca cttgacgaat 1321 ttggtgagca aattagtcaa tttggttcaa aactacaaaa tatcaaaggc ttagggactt 1381 taggagacaa actcaaaaat atcggtggac ttgataaagc tggccttggt ttagatgtta 1441 tctcagggct attatcgggc gcaacagctg cacttgtact tgcagataaa aatgcttcaa 1501 cagctaaaaa agtgggtgcg ggttttgaat tggcaaacca agttgttggt aatattacca 1561 aagccgtttc ttcttacatt ttagcccaac gtgttgcagc aggtttatct tcaactgggc 1621 ctgtggctgc tttaattgct tctactgttt ctcttgcgat 
tagcccatta gcatttgccg 1681 gtattgccga taaatttaat catgcaaaaa gtttagagag ttatgccgaa cgctttaaaa 1741 aattaggcta tgacggagat aatttattag cagaatatca gcggggaaca gggactattg 1801 atgcatcggt tactgcaatt aataccgcat tggccgctat tgctggtggt gtgtctgctg 1861 ctgcagccgg ctcggttatt gcttcaccga ttgccttatt agtatctggg attaccggtg 1921 taatttctac gattctgcaa tattctaaac aagcaatgtt gagcacgttg caaataaaaa 1981 ttcataacaa aattgtagaa tgggaaaaaa ataatcacgg taagaactac tttgaaaatg 2041 gttacgatgc ccgttatctt gcgaatttac aagataatat gaaattctta ctgaacttaa 2101 acaaagagtt acaggcagaa cgtgtcatcg ctattactca gcagcaatgg gataacaaca 2161 ttggtgattt agctggtatt agccgtttag gtgaaaaagt ccttagtggt aaagcctatg 2221 tggatgcgtt tgaagaaggc aaacacatta aagccgataa attagtacag ttggattcgg 2281 caaacggtat tattgatgtg agtaattcgg gtaaagcgaa aactcagcat atcttattca 2341 gaacgccatt attgacgccg ggaacagagc atcgtgaacg cgtacaaaca ggtaaatatg 2401 aatatattac caagctcaat attaaccgtg tagatagctg gaaaattaca gatggtgcag 2461 caagttctac ctttgattta actaacgttg ttcagcgtat tggtattgaa ttagacaatg 2521 ctggaaatgt aactaaaacc aaagaaacaa aaattattgc caaacttggt gaaggtgatg 2581 acaacgtatt tgttggttct ggtacgacgg aaattgatgg cggtgaaggt tacgaccgag 2641 ttcactatag ccgtggaaac tatggtgctt taactattga tgcaaccaaa gagaccgagc 2701 aaggtagtta taccgtaaat cgtttcgtag aaaccggtaa agcactacac gaagtgactt 2761 caacccatac cgcattagtg ggcaaccgtg aagaaaaaat agaatatcgt catagcaata 2821 accagcacca tgccggttat tacaccaaag ataccttgaa agctgttgaa gaaattatcg 2881 gtacatcaca taacgatatc tttaaaggta gtaagttcaa tgatgccttt aacggtggtg 2941 atggtgtcga tactatttac ggtaacgacg gcaatgaccg cttatttggt ggtaaaggcg 3001 atgatattct cgatggtgga aatggtgatg attttatcga tggcggtaaa ggcaacgacc 3061 tattacacgg tggcaagggc gatgatattt tcgttcaccg taaaggcgat ggtaatgata 3121 ttattaccga ttctgacggc aatgataaat tatcattctc tgattcgaac ttaaaagatt 3181 taacatttga aaaagttaaa cataatcttg tcatcacgaa tagcaaaaaa gagaaagtga 3241 ccattcaaaa ctggttccga gaggctgatt ttgctaaaga agtgcctaat tataaagcaa 3301 ctaaagatga gaaaatcgaa gaaatcatcg gtcaaaatgg cgagcggatc 
acctcaaagc 3361 aagttgatga tcttatcgca aaaggtaacg gcaaaattac ccaagatgag ctatcaaaag 3421 ttgttgataa ctatgaattg ctcaaacata gcaaaaatgt gacaaacagc ttagataagt 3481 taatctcatc tgtaagtgca tttacctcgt ctaatgattc gagaaatgta ttagtggctc 3541 caacttcaat gttggatcaa agtttatctt ctcttcaatt tgctagagca gcttaatttt 3601 taatgattgg caactctata ttgtttcaca cattatagat tgccgtttta ttttataaaa 3661 ggagacaata tggaagctaa ccatcaaagg aatgatcttg gtttagttgc cctcactatg 3721 ttggcacaat accataatat ttcgcttaat ccggaagaaa taaaacataa atttgatctt 3781 gacggaaaag ggctttcttt aactgcttgg cttttagctg caaaatcgtt agcgttgaaa 3841 gcgaaacaca ttaaaaaaga gatttcccgc ttacacttgg tgaatttacc ggcattagtt 3901 tggcaagata acggtaaaca ttttttattg gtaaaagtgg ataccgataa taaccgctat 3961 ttaacttaca atttggaaca agatgctcca caaattctgt caacagacga atttgaagcc 4021 tgctatcaag ggcagttaat tttggtcacg tccagagctt ccgtagtagg tcaattagca 4081 aagttcgatt tcacctggtt tattccggcg gtgatcaaat accgaaaaat ctttctagaa 4141 accttgattg tttcgatctt tttgcaaatt tttgccctaa ttacaccgct attcttccaa 4201 gttgttatgg ataaagtact ggtgcatcga ggtttttcaa ccttgaatat cattacggtt 4261 gccttagcta ttgtgatcat ctttgaaatt gtactaagtg gtttgagaac ctatgttttt 4321 tctcatagca ctagccgtat tgatgttgaa ttaggcgcta aattatttcg acatttatta 4381 tcactaccca tttcttattt tgaaaacaga cgagttggag atacagtcgc tagggttaga 4441 gaattagatc aaattcgtaa tttccttacc ggacaagcat taacctcggt gttagatctc 4501 ttattctctt ttatcttttt tgccgtaatg tggtattaca gcccaaaatt aaccttggta 4561 attcttggtt cattgccctg ctatatttta tggtcaattt ttattagtcc gattttaaga 4621 cggcgtttag atgagaaatt tgcccgaagt gctgataacc aagcattctt agttgagtcg 4681 gtaacagcca tcaatatgat taaagcgatg gcggttgctc cacaaatgac ggatacatgg 4741 gataaacagc tggcaagcta tgttttcatc agtttccgtg tcaccgtatt agcaaccatt 4801 gggcaacaag gtgtacaact tattcaaaaa accgttatgg tgattaacct ttggttaggg 4861 gcacacttag ttatttcagg cgatctgagt attgggcaat taattgcctt taatatgcta 4921 tcagggcaag tgattgcacc ggtgattcgg ctggctcagc tctggcaaga tttccaacaa 4981 gttgggattt ccgtcactcg cttaggtgat gttttaaact ctccaaccga acaatatcaa 
5041 ggcaaattat cactaccaga aataaaaggc gatatctcat ttaaaaatat ccgctttaga 5101 tataaaccag atgcaccaac tattttaaat aatgtgaatt tagaaattag gcaaggagaa 5161 gtgattggga ttgttggacg ttccggttca ggcaaaagta ctctgactaa attactgcaa 5221 cgtttttata ttcctgaaaa tgggcaggtt ttgattgatg gacatgatct agccttagct 5281 gatccaaact ggctacgccg tcaaataggt gtagtgctgc aagataatgt gttattaaac 5341 cgcagtatcc gagaaaatat tgcgctatca gatccaggaa tgccaatgga gcgagtaatt 5401 tatgcagcaa aattagcagg ggctcacgat tttatttcag aattgcgtga aggttatacc 5461 accattgtgg gtgaacaagg agcggggctt tcaggcgggc aacgccaacg gattgcgatt 5521 gctcgagctt tggtaaacaa cccgaaaatc ctgatttttg atgaggcaac cagtgccctc 5581 gattacgaat ctgagcatat tattatgcaa aatatgcaaa aaatatgcca aggcagaacc 5641 gtgattttga ttgcacatcg tttatcgacc gtcaaaaatg cggatcgaat tattgtgatg 5701 gaaaaggggg aaattgttga gcaaggcaag caccacgaat tactgcaaaa cagtaacgga 5761 ctttattcct acttacacca attacaactt aattaagaag gaaaacaatg aaaatatggc 5821 ttagtggtat ttatgaattt ttcctacgct ataaaaacat ttgggcagaa gtatggaaaa 5881 ttcgtaaaga attagaccac ccaaacagaa aaaaagacga aagtgaattt ttaccggcac 5941 atttagaact gattgaaacc ccggtttcta aaaaaccacg tctaattgct tatttgatta 6001 tgctattttt agttgtggca attgtgcttg ccagtgtaag caaagttgaa attgtggcga 6061 ctgctcccgg taaattaact tttagtggca gaagtaaaga aattaaaccg attgaaaacg 6121 ccattgtaca agaaattttc gttaaagatg ggcagtttgt ggaaaaaggg caattattag 6181 tcagcttaac tgcattgggt tctgatgcag atatcaaaaa gaccatggct tcactttctt 6241 tagctaaact ggagacctat cgctaccaaa ctttgcttac tgccattgaa aaagagtcct 6301 tgccggtgat tgatttatct agaaccgaat ttaaagattc atcggaagaa gatcgactac 6361 gtattaaaca cttaattgag gagcaataca ccacttggca aaaacaaaaa acacagaaaa 6421 ctttagcgta taagcgtaaa gaggctgaaa aacaaacaat atttgcctat gtccgtaaat 6481 atgaaggtgc aacacgtatt gaacaagaaa aattaaaaga ctttaaggca ctttataaac 6541 agaagtcttt atctaagcac gaacttcttg cgcaagaaaa taaattaatt gaggctcaga 6601 atgcagtagc tgtttatcgc tcaaaattaa atgaattaga aaatgatcta ctcaatgtaa 6661 aagaagaact tgaattgatc acgcaattct ttaaaagcga tgtgttggaa aaattaaagc 6721 
aacatattga aaatgaacgc caacttcggc tcgagttaga aaaaaataat caacgcagac 6781 aggcctcgat gatcagagca ccggtttccg gtacggttca gcaactgaaa attcacacta 6841 taggtggtgt tgttacgact gctgaaacct tgatgatcat tgtgccggaa gacgatgtgt 6901 tagaggccac cgctctggtt ccaaacaaag atatcggctt tgttgcagca gggcaggagg 6961 tgattattaa agtggaaact ttcccttata cacgctatgg ttatctaact ggtcgaatta 7021 aacatattag cccggatgcg attgaacaac ctaatgtagg cttagttttt aatgcaacta 7081 tagctataga taggaagaat ctaacatcgc ctgatgggcg aaaaattgat ttgagttcag 7141 gtatgacaat aactgctgaa atcaaaaccg gtgaacggag tgtaatgagt tatttactca 7201 gcccattaga agaatctgtc acagaaagtt taagggaacg ctaatcgaac caaaacaaag 7261 ccataaaagc cattttgagc ttttatggct ttatttttta gtccacaagc ggacaaaaaa 7321 gcccaatttt ttacactttt ataacaaatt gttctaacta aaaattacta attcttttct 7381 tttatagcga tctctatttc atttcattaa cattgactag aagggattat gagcctaagc 7441 attacgaatc tttctcttgg ctaccgcaaa aatcagcaaa ggcttatttg aaaagcacgg 7501 tgtcgaggtg gaaaaaccgg tgatgtttcg cagctgggct cagttggtgg aagcttttta 7561 agtggcaatg tgaacgtggt gcatctgctt tcgcctatga gtttgtgggc gaaatatgga 7621 gcaaatgctc cggtgaaagc ggtaatgtgg aatcacttgg caggttcggc tttaacggtt 7681 cgccctgaaa tcaacagtat tgccgaactc tccggcaaaa cggtagaact tccgttttgg 7741 ta // LOCUS RATBADPTA 3477 bp ss-mRNA ROD 12-JUL-1990 DEFINITION Rat beta adaptin mRNA, complete cds. ACCESSION M34176 J05273 KEYWORDS beta adaptin. SOURCE Rat lymphocyte, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3477) AUTHORS Ponnambalam,S., Robinson,M.S., Jackson,A.P., Peiperl,L. and Parham,P. TITLE Conservation and diversity in families of coated vesicle adaptins JOURNAL J. Biol. Chem. 
265, 4814-4820 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 72 2885 beta adaptin mRNA < 1 3477 beta adaptin mRNA BASE COUNT 914 a 851 c 822 g 890 t ORIGIN 1 cggggctgtg ctctctgact gccgccgcca ccccgcccct tgcctccggt tcacgctgaa 61 gatccagaat catgactgac tccaagtact tcacaaccaa taagaaggga gaaatctttg 121 aattaaaagc tgaactcaac aatgaaaaga aagaaaagag gaaggaggct gtgaagaaag 181 tgattgctgc tatgactgtg gggaaagacg ttagctctct cttcccagat gtggtgaact 241 gtatgcagac tgacaacctg gaactaaaga agcttgtgta cctctatctg atgaactatg 301 ccaagagtca gccagacatg gccatcatgg ctgtcaacag ctttgtgaag gattgtgaag 361 accccaatcc tttgattcga gccttggcag ttagaaccat gggatgcatc cgggtggaca 421 agattacaga gtatctctgt gaacccctcc gcaagtgctt gaaggatgaa gacccctatg 481 ttcggaaaac agcagcagta tgcgtggcaa aactccatga tatcaatgcc cagatggtgg 541 aagatcaggg atttctggat tctctgcggg atctcatagc agattcaaac ccaatggtgg 601 tggctaatgc tgtagcagca ttgtctgaga tcagtgagtc tcacccaaac agcaacttac 661 ttgatctgaa ccctcagaat atcaataagc tgctcacagc cctgaatgag tgcactgagt 721 ggggccagat tttcatcttg gactgcctgt ctaattacaa ccctaaagat gaccgggaag 781 ctcagagcat ctgtgagcga gtgacgcctc ggctctctca tgccaattct gcagtggtgc 841 tttcagcagt aaaagttctg atgaagtttc tagagttgtt acccaaggac tctgactact 901 acaatatgct gctaaagaag ctagcgcctc cacttgtcac tttgctctct ggggagccag 961 aagtgcagta tgttgccctg aggaacatca acctaattgt ccagaaaagg cctgaaatct 1021 tgaagcagga aatcaaggtc ttctttgtga agtacaatga tcctatctat gttaaactag 1081 agaagttaga catcatgatt cgtcttgcat cccaagccaa cattgctcag gttctggcag 1141 aactgaagga atatgccact gaagttgatg tggactttgt tcgcaaagct gtgagggcca 1201 ttggacggtg tgccatcaaa gtggagcaat cagcagaacg ctgtgtgagc acactgcttg 1261 atctaatcca gaccaaagta aattatgtgg tccaagaggc aattgttgtc atcagggaca 1321 tcttccgaaa ataccccaac aagtatgaga gcattatcgc cacgctgtgt gagaacttgg 1381 actccctgga tgaacccgat gcccgagcgg ctatgatttg gattgtagga gagtatgctg 1441 aaagaatcga taatgccgat gagttactag agagcttcct ggaaggtttt catgatgaaa 1501 gcacccaggt gcagctcacg ttgcttaccg ccatagtgaa actgtttctc 
aagaagccat 1561 cagaaacaca ggagctggtc caacaggtct tgagcttggc cacacaggat tctgataatc 1621 ctgaccttcg agatcggggt tatatttatt ggcgccttct ttcaactgac cctgtgacag 1681 ccaaagaagt agtgttgtct gagaagccat tgatctctga ggaaacagac ctcattgaac 1741 ctaccctcct ggatgagctc atctgccaca ttggttcttt ggcctccgtg taccataaac 1801 ctccgaatgc ttttgtggaa gggagccatg gcattcatcg caaacacttg ccaattcacc 1861 atgggagcac tgatgcaggt gatagccctg ttggcaccac cactgcaacc aacctggaac 1921 agcctcaggt catcccctct caaggtgacc ttctggggga tcttttaaat cttgacctgg 1981 gtcccccagt gaatgtcccg caagtgtcct ccatgcagat gggagcagtg gatcttttag 2041 gaggaggact ggatagcctg gtaggacagt ccttcatccc gtcatcagtg cctgcaacct 2101 tcgctccttc acctactcct gctgtggtca gcagtggtct gaatgacctg tttgagcttt 2161 ccactgggat aggcatggca cctggcggat atgtggctcc taaggcagtc tggctacctg 2221 ctgtaaaggc taaaggcttg gagatttcgg ggacgtttac tcaccgccaa gggcacatct 2281 atatggaaat gaacttcacc aacaaagctc tgcagcacat gacggatttt gccatccagt 2341 ttaacaagaa tagcttcggt gtcatcccga gcactccctt ggccatacat actccgctga 2401 tgccaaacca gagcattgat gtgtctctgc ctctcaacac cttgggccca gtcatgaaga 2461 tggagcctct gaataacttg caggtggctg ttaaaaacaa tattgatgtc ttctacttca 2521 gctgcctcat cccactcaat gtgctttttg tagaagatgg caaaatggaa cgccaggtct 2581 tccttgcgac gtggaaggat attcccaatg aaaatgagct ccaatttcag attaaggagt 2641 gtcatttaaa cgctgacaca gtttccagca agttgcaaaa caacaatgtt tacactatcg 2701 ccaagaggaa tgtggagggg caggacatgc tgtaccagtc cctgaagctc actaatggca 2761 tttggatttt ggcagagctg cggatccagc caggaaaccc caattatacg ctgtcgctga 2821 agtgtagagc ccctgaagtc tctcagtaca tctatcaggt ctacgacagc attttgaaaa 2881 actaataaat gggtccagtc agcctgtaat cagtgcaagc cacgaactct taactgaaag 2941 acactgtatt gttgtgtaga gcctgaaccc aaaccctgcg gtacccaccc cggtagtggc 3001 cagtcatttt gtgctgatat tagcactcac cccattggta ggttagcttc ccgtgacatc 3061 tccttccact atcgcccacc tctgccacct gccgctgctc tctgtcctta gttgtgagtt 3121 cctctgtgct gtgccaatgg ctagcctttt ctacaccctc ttttgagtgt agtttgatat 3181 tttgtaatcg aaagctcatt tcacaagcag aaaaaggcaa caagttaatt agagcgagga 
3241 agagtgtcac tgaaacatac actgcacctt attgttttat atttttgtac agatgagata 3301 gatattgagg tagaacgctg agtagaaagg gtgactgacc ctcctcagac acagtcttat 3361 tggagacata tggccctggc cccttctggg caaggagagg cgaccccact cctggtcttt 3421 tgcattttca ccttggccac gccttccagc tctcttatgc ccatgctctc tcatttg // LOCUS RATPSPB 1620 bp ss-mRNA ROD 12-JUL-1990 DEFINITION Rat pulmonary surfactant-associated glycoprotein A (SP-A) mRNA, complete cds. ACCESSION M33201 KEYWORDS pulmonary surfactant protein A. SOURCE Rat fetal lung, cDNA to mRNA, clone SP-A [0.9, 1.6]. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1620) AUTHORS Fisher,J.H., Emrie,P.A., Shannon,J., Sano,K., Hattler,B. and Mason,R.J. TITLE Rat pulmonary surfactant protein A is expressed as two differently sized mRNA species which arise from differential polyadenylation of one transcript JOURNAL Biochim. Biophys. Acta 950, 338-345 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 56 802 pulmonary surfactant protein A (56 could be 29) BASE COUNT 425 a 400 c 396 g 399 t ORIGIN 1 cagatatcca cacagcctgc aggtctgtat gtggaagcca ctggggatag tagccatgtc 61 actgtgttct ttggccttca ccctcttctt gactgttgtc gctggtatca agtgcaatgt 121 gacagacgtt tgtgctggaa gccctgggat ccctggagct cctggaaacc atggtctgcc 181 tggcagagac gggagagacg gtgtcaaagg agaccctgga cctccaggtc ccatgggccc 241 tcctggagga atgccaggtc ttcctggacg cgatgggctg cccggaggac ctggtgcacc 301 tggaggacgt ggagacaagg gagagcctgg agaaaggggc ctgccaggat ttccagctta 361 cctggatgag gagctccaga ctgaactcta tgagatcaaa catcagattc tgcaaacaat 421 gggagtcctc agcttgcaag gatccatgct gtcagtgggg gataaagtct tttccaccaa 481 tgggcagtca gtcaactttg ataccattaa agagatgtgt accagagcag gaggcaacat 541 tgctgtcccg aggactcctg aggagaacga ggccattgca agtattgcga agaagtacaa 601 caactatgtc tacttgggca tgattgaaga ccagactcct ggagacttcc actacctgga 661 tggggcttct gtgaactaca ccaactggta cccaggagaa cccaggggtc agggcaaaga 
721 aaagtgtgta gaaatgtata cagatgggac atggaatgat aggggctgcc tgcagtaccg 781 gctggctgtt tgtgaatttt gatcaagcaa ttagacgaaa agatgaaccc tcacactgcc 841 tctatcctga tgattcatct ggtctgtaaa accctgcaac tacctttact tgtggccttc 901 agtaattaga agcatctttt gtcacccccg ctcccacata gttcccaaac acttctccat 961 attcattagc aatcctgagt gtttccctag agtcccatct gagcgttcat tcaaggtagc 1021 cattgtaaac cttggccttg accatgagat ggatagatac ttcctttttc ctcactttat 1081 ccagtcttca tttataaatg gtggccatga agacccagca tggaaggacc ctctaactaa 1141 gtgctgccct ctgacctttc cacccttctg tagctcggtg tcccaggatt tagaagtcca 1201 ggttaaacat aggggatttc tgggaaagcc tagtatgtgg gtgcaggcca cattcatgcc 1261 atctgtatcc atggctttca aggcaaacat tgtctctaag aagccagaga accaggagaa 1321 ccaggtagga ccaggtagta ctgggggaac ataaactcac ttggtttggc atgtatggct 1381 cctccttggg tctggaggtg ccatcttgac cttgaactaa cagcagccac cctgggtttt 1441 gagagaacga ccttcccagc ccagacccca actcaagtaa tttcctgcta acagacacag 1501 cctcagttca ctttacatca ctgaggcatt catgatacga actgcaatct gttttctcct 1561 ctcgtgagtt caatcagcta ttcattaaag tcaactgcat tcaaaaaaaa aaaaaaaaaa // LOCUS FSBCRYGM1 613 bp ss-mRNA VRT 12-JUL-1990 DEFINITION Carp gamma-crystallin (gamma-m1) mRNA, complete cds. ACCESSION X12902 M33115 KEYWORDS crystallin; gamma-crystallin. SOURCE Cyprinus carpio lens, cDNA to mRNA. ORGANISM Cyprinus carpio Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Osteichthyes; Actinopterygii; Cypriniformes; Cyprinoidei; Cyprinidae. REFERENCE 1 (bases 1 to 613) AUTHORS Chang,T., Jiang,Y.-J., Chiou,S.-H. and Chang,W.-C. TITLE Carp gamma-crystallins with high methionine content: Cloning and sequencing of the complementary DNA JOURNAL Biochim. Biophys. Acta 951, 226-229 (1988) STANDARD simple staff_review COMMENT [1] Author address Chang W.-G., Institute of Biological Chemistry, Academia Sinica, P.O. Box 23-106, Taipeh 10098, Taiwan R.O.C.. Submitted (09-SEP-1988) on tape to the EMBL data library. 
FEATURES from to/span description pept 34 570 gamma-crystallin (gamma-m1) BASE COUNT 166 a 133 c 167 g 147 t ORIGIN 1 ctgaagcact gagataaaca accctctacc atcatgggca agatcatctt ctacgaggac 61 aggaacttcc agggccgcag ctatgactgc atgagcgact gctctgatat ctcctcttac 121 ctcagccgcg ttggttcaat cagggtggag agtggttgtt tcatggtcta tgagcgcaac 181 agctacatgg ggaaccagtt cttcctgagg aggggcgagt accatgatat gcagcgcatg 241 atgagcatgg gcatgatgtt tgacactatc agatcctgcc gcatgattcc tccatacagg 301 ggttcctaca gaatgaggat ctacgagagg gacaccttcg gaggacagat gcacgaggtg 361 atggatgact gtgacaacat catggaacgt taccgtatgt ctgactggca gtcttgtcat 421 gtgatggacg gccactggct cttctatgag cagccacact acagaggcag aatgtggtac 481 ttcaggcctg gagagtacag gagcttcaga gatatgggat acagcaacat gagattcatg 541 agcatgaggc gtatcactga tatgtgttaa actgctagaa tatagaagga attaaagtgt 601 tattctcaga act // LOCUS FSBCRYGM2 554 bp ss-mRNA VRT 12-JUL-1990 DEFINITION Carp gamma-crystallin (gamma-m2) mRNA, complete cds. ACCESSION X12903 M33116 KEYWORDS crystallin; gamma-crystallin. SOURCE Cyprinus carpio lens, cDNA to mRNA. ORGANISM Cyprinus carpio Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Osteichthyes; Actinopterygii; Cypriniformes; Cyprinoidei; Cyprinidae. REFERENCE 1 (bases 1 to 554) AUTHORS Chang,T., Jiang,Y.-J., Chiou,S.-H. and Chang,W.-C. TITLE Carp gamma-crystallins with high methionine content: Cloning and sequencing of the complementary DNA JOURNAL Biochim. Biophys. Acta 951, 226-229 (1988) STANDARD simple staff_review COMMENT [1] Author address Chang W.-G., Institute of Biological Chemistry, Academia Sinica, P.O. Box 23-106, Taipeh 10098, Taiwan R.O.C.. Submitted (09-SEP-1988) on tape to the EMBL data library. 
FEATURES from to/span description pept 7 528 gamma-crystallin gamma-m2 (AA 1 - 173) BASE COUNT 142 a 124 c 150 g 138 t ORIGIN 1 tggcccatga aggtcacctt ttatgaggac aggaacttcc agggtcgctc ttatgactgt 61 atgagcgact gtgccgattt ctcctcctac atgagccgct gtcactcttg cagagtgcac 121 agcggatgct ggatgatgta cgatcaaccc aactacatgg gaaatcagta tttctttagg 181 aggggagagt atgctgatta catgtctatg tttggaatga gcaactgcat caggtcctgc 241 cgtatgatcc ctatgcacag gggatcctac agaatgagga tctacgagag ggagaacttc 301 atgggccaga tgtacgaaat ggccgatgac tgtgacagta tcatggaccg ttaccgcatg 361 cctcactgcc agtcctgcca tgtgatggac ggccactggc tcatgtatga gcagccccac 421 tacagaggca ggatgtggta cttcaggcct ggagagtaca ggagcttcag caatatgggt 481 ggaatgagat tcatgagcat gaggcgtatc atggactcct ggtactagag tttatattaa 541 taaaataact cctc // LOCUS HUMIL2A1 940 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human interleukin 2 gene, exons 1 and 2. ACCESSION M33199 KEYWORDS interleukin; interleukin 2. SEGMENT 1 of 2 SOURCE Human DNA, clones Lm HIG[1,2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 940) AUTHORS Nishino,N., Obaru,K., Maeda,S., Shimada,K. and Onoue,K. TITLE Organization of the DNA regions flanking the human interleukin 2 gene JOURNAL Biomed. Res. 
6, 197-205 (1985) STANDARD simple staff_review FEATURES from to/span description pept 629 775 interleukin 2, exon 1 /nomgen="IL2" /map="4q26-q27" /hgml_locus_uid="LT0164X" 865 / 924 interleukin 2, exon 2 IVS 776 864 IL2 intron A IVS 925 > 940 IL2 intron B BASE COUNT 313 a 181 c 137 g 309 t ORIGIN 1 cttcaactca ataagcattt taagtattct aatcttagta tttctctagc tgacatgtaa 61 gaagcaatct atcttattgt atgcaattag ctcattgtgt ggataaaaag gtaaaaccat 121 tctgaaacag gaaaccaata cacttcctgt ttaatcaaca aatctaaaca tttattcttt 181 tcatctgttt actcttgctc ttgtccacca caatatgcta ttcacatgtt cagtgtagtt 241 ttatgacaaa gaaaattttc tgagttactt ttgtatcccc acccccttaa agaaaggagg 301 aaaaactgtt tcatacagaa ggcgttaatt gcatgaatta gagctatcac ctaagtgtgg 361 gctaatgtaa caaagaggga tttcacctac atccattcag tcagtctttg ggggtttaaa 421 gaaattccaa agagtcatca gaagaggaaa aatgaaggta atgttttttc agacaggtaa 481 agtctttgaa aatatgtgta atatgtaaaa cattttgaca cccccataat atttttccag 541 aattaacagt ataaattgca tctcttgttc aagagttccc tatcactctc tttaatcact 601 actcacagta acctcaactc ctgccacaat gtacaggatg caactcctgt cttgcattgc 661 actaagtctt gcacttgtca caaacagtgc acctacttca agttctacaa agaaaacaca 721 gctacaactg gagcatttac ttctggattt acagatgatt ttgaatggaa ttaatgtaag 781 tatatttcct ttcttactaa aattattaca tttagtaatc tagctggaga tcatttctta 841 taacaatgca ttatactttc ttagaattac aagaatccca aactcaccag gatgctcaca 901 tttaagtttt acatgcccaa gaaggtaagt acaatatttt // LOCUS HUMIL2A2 569 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human interleukin 2 (IL-2) gene, 3' flank. ACCESSION M33198 KEYWORDS Alu repetitive sequence; interleukin; interleukin 2. SEGMENT 2 of 2 SOURCE Human DNA, clones Lm HIG[1,2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 569) AUTHORS Nishino,N., Obaru,K., Maeda,S., Shimada,K. and Onoue,K. TITLE Organization of the DNA regions flanking the human interleukin 2 gene JOURNAL Biomed. Res. 
6, 197-205 (1985) STANDARD simple staff_review FEATURES from to/span description rpt 136 449 Alu-repeat /nomgen="IL2" /map="4q26-q27" /hgml_locus_uid="LT0164X" rpt 130 135 5' insertion target sequence rpt 450 455 3' insertion target sequence BASE COUNT 204 a 131 c 96 g 138 t ORIGIN Unknown number of bp after segment 1. 1 agcttcaata agatccaatg aatattctag attctatttg tcttctgaag acagcttaat 61 ctaatttaga taaaaataac atcatccaga gcctctacac tatttcagac acatgtagca 121 tcagcttaaa aattatgaaa cctactggct aacacgtgaa accttgtcac taccaaaaat 181 acaaaaaaaa aaaaattagc tgagtgtggt ggcgggcgcg tagtcccagc tactcaggag 241 gctgaggcag gagaatggcg tgaacttggt aggcagagct gcagtgagcc aagatcgtgc 301 cattgcactc cagcctgggt gacagagcaa gactccatct caaaaaaaaa aaaaaaaaaa 361 aagagacctg ctaacacaca cacacacaca cacacacaca ctctctctct ctctctctct 421 ctctctctct ctctctctct ctctctctca aattaagttg ggcggcaagg ggaaacaata 481 aacatctcca acataggatt caagtgtagt tataagatac agactttaac taatataata 541 tgttcaagaa aataaagcat catatctag // LOCUS TOMCPKA 103 bp ss-rRNA ORG 12-JUL-1990 DEFINITION Tomato chloroplast 4.5S ribosomal RNA. ACCESSION M33098 KEYWORDS 4.5S ribosomal RNA; ribosomal RNA. SOURCE Tomato (strain Mill) chloroplast ribosomal RNA. ORGANISM Chloroplast Lycopersicon esculentum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae; Lycopersicon esculentum. REFERENCE 1 (bases 1 to 103) AUTHORS Zhen-Qi,C., Xiao,X. and E,-Sheng.W. TITLE The nucleotide sequence of 4.5 S rRNA from tomato chloroplasts JOURNAL Biochim. Biophys. Acta 866, 89-91 (1986) STANDARD simple staff_review FEATURES from to/span description rRNA 1 103 4.5S ribosomal RNA BASE COUNT 29 a 19 c 32 g 23 t ORIGIN 1 gaaggtcacg gcgagacgag ccgtttatca ttacgatagg tgtcaagtgg aagtgcagtg 61 atgtatgcag ctgaggcatc ctaacagatc ggtagacttg aac // LOCUS EBOMAY 157 bp ss-RNA VRL 12-JUL-1990 DEFINITION Ebola virus 3' proximal protein gene, 5' end. ACCESSION M33062 KEYWORDS . SOURCE Ebola virus (strain MAY; Zaire 1976) RNA. 
ORGANISM Ebola virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae. REFERENCE 1 (bases 1 to 157) AUTHORS Kiley,M.P., Wilusz,J., McCormick,J.B. and Keene,J.D. TITLE Conservation of the 3' terminal nucleotide sequences of Ebola and Marburg virus JOURNAL Virology 149, 251-254 (1986) STANDARD simple staff_review FEATURES from to/span description pept 53 > 157 3'proximal protein BASE COUNT 56 a 22 c 31 g 48 t ORIGIN 1 gggcacacaa aaagaaagaa gaatttttag gatcttttgt gtgcgaataa ctatgaggaa 61 gattaataat ttcctctcat tgaaatttga tgatcggaat ttgaaattga aattgttgat 121 ctgtaatcac accgttgatt cagagccaca cacaagt // LOCUS ECOBISCASD 3337 bp ds-DNA BCT 12-JUL-1990 DEFINITION E.coli biotin sulfoxide reductase (bisC) gene, complete cds. ACCESSION M34827 KEYWORDS biotin sulfoxide reductase; bisC gene. SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 3337) AUTHORS Pierson,D.E. and Campbell,A. TITLE Cloning and nucleotide sequence of bisC, the structural gene for biotin sulfoxide reductase in Escherichia coli JOURNAL J. Bacteriol. 
172, 2194-2198 (1990) STANDARD simple staff_review FEATURES from to/span description pept 577 2757 biotin sulfoxide reductase (bisC) BASE COUNT 764 a 856 c 927 g 790 t ORIGIN 1 tatccccgct gcgggttacg ctaacaccag tgccgcgcat tttgtcgcgc agttcgcttc 61 ctgcacatcc atgtaataac caacgccgcc gcccagagct gcgcctgctg ctgcgccaat 121 cagcgcgcct ttaccgcgat ctttcttcga agaagagagc gcaccaatac ccgcgcccac 181 gagagagccc agacctgcgc cgatagcaga tttacctgct tcgcgttcgc cggtgtaagg 241 gttagttgtg cagccagata ccgccagagc gccactcact acggcggcaa taagataaac 301 acgtttcttc attgttaatc cttaataacc tttttattct ttgccacggg ttccgtggcg 361 ggagattatg ccgcgtgaac atgaagatta ttcctgggaa tactcggaaa tttgtaagta 421 atatttaact gctcaataca tctaaccttt caggagtctt cggtttggcc aactcatcct 481 cacgatattc cgttctgact gccgccattg ggggcccatg ctggttgaaa ccgacggcga 541 aaccgtgttt agctgcgtgg cgcgttagcc acaggaatgg aaaactcctt gcagagcgcg 601 gttcgcgacc aggttcacag caatacgcgg gtacgatttc caatggtgcg aaaaggcttt 661 cttgcgtcac cggaaaaccc gcaaggcatt cgtgggcagg atgaatttgt tcgcgtgagt 721 tgggatgagg cgctggatct tattcaccaa caacataaac gcattcgtga ggcttatggt 781 ccggcatcga tttttgctgg ttcctacggc tggcgttcaa acggcgtgct gcataaggcc 841 tcgacattat tacaacgcta tatggcgctg gcaggcggtt ataccgggca tctgggggat 901 tattcgaccg gcgcggcaca ggcgatcatg ccgtatgtcg tgggtggtag tgaagtttat 961 caacagcaga ccagttggcc gctggtgctg gaacatagcg atgtcgtggt gctgtggagt 1021 gctaacccac tcaatacgct gaaaattgcg tggaatgcat ccgatgagca ggggctttct 1081 tacttttctg cactgcgtga cagcgggaaa aagctgatct gcattgatcc aatgcgatcg 1141 gaaaccgtcg atttctttgg cgataaaatg gagtgggtgg caccgcacat gggcaccgat 1201 gttgcgctga tgctggggat cgcccatacg ctggtggaaa atggttggca cgacgaagcg 1261 tttctggcgc gttgcaccac aggttatgcc gtcttcgcct cttatttgct gggcgagagt 1321 gacggaatag cgaaaaccgc cgaatgggca gcagagattt gtggtgttgg cgcagcgaaa 1381 atccgcgagc tggcggctat tttccaccaa aataccacca tgctgatggc aggctgggga 1441 atgcagcgcc aacagtttgg tgagcaaaaa cactggatga tcgtcacgct ggcagcaatg 1501 ttggggcaaa tcggcacacc cggcggcggt tttggtcttt cttaccattt 
tgccaatggt 1561 ggtaacccca cgcggcgttc tgcggtgctc tcttccatgc agggcagctt gccgggtggc 1621 tgcgatgcgg tggataaaat ccctgttgcc cgcattgttg aagcactgga aaaccctggt 1681 ggcgcatatc aacacaacgg tatgaaccga catttcccgg atattcgttt tatctggtgg 1741 gcgggcggtg ccaactttac tcatcatcag gataccaatc gcctgatccg tgcctggcaa 1801 aaaccggagc tggtggtgat ctctgaatgc ttctggacgg cggcggcaaa acacgcggat 1861 atcgttctgc ctgcgactac ctcttttgag cgtaatgatc tcaccatgac cggtgattac 1921 agtaatcagc atctggtgcc gatgaagcaa gtggtgccgc cacgctatga agcgcgtaat 1981 gattttgatg tttttgccga gttaagtgaa cgctgggaga agggcggtta tgcacgtttt 2041 acggaaggaa aaagtgagct gcaatggctg gaaacgtttt ataacgttgc ccgacagcgc 2101 ggggcaagcc agcaggttga attgccgcca tttgctgagt tctggcaagc caaccagtta 2161 attgagatgc cggaaaaccc ggacagcgag cggtttattc gcttcgctgc atttttgccg 2221 cgatccgctg gcgatccgtt aaaaacgcag cgcaagattg aaatcttctc acagcgtatt 2281 gccgattacg gttacccgga ttgccctggg catccaatgt ggctggagcc ggacgaatgg 2341 cagggcaatg ccgaaccaga acagttgcag gtactttctg cccatccggc gcaccgcctg 2401 cacagccagc tgaattacag ttctctgcgc gaattgtacg cggtggcaaa tcgtgagcct 2461 gtcaccattc atcctgacga tgcccaggag cgcggcatac aagatggcga tactgttcgg 2521 ttgtggaacg cacgcgggca aattcttgcc ggagcggtca ttagcgaggg aattaaacct 2581 ggcgtgattt gcattcacga aggggcatgg ccggatctgg atttaaccgc tgacggtatt 2641 tgtaaaaacg gcgcagtgaa cgtgctgacc aaagatctcc ccagctcgcg gctgggaatg 2701 gctgtgcggg taatacggcg ctggcatggc tggaaaaata caacggtccg gaactgacac 2761 ttacagcgtt tgaaccaccg gccagctcat aatccatgtg ggtagttggg tttcatcctg 2821 ccatgcgcaa tcgacaatgt gaaaaccctg tgcctggtaa aaatttatcg ccggttgatt 2881 tttttgataa acctccagca tcaggtgggg atggcgctgc tgcacatact gcatcagcgc 2941 cttaccaata ccgcgcctga cggccttcgg tgcgacaaac atcgctgcca gaaatcggcc 3001 ttccataatg ctgacaaaac cgagaagctt accgtcttct tcccagaccc agttttgcgc 3061 gttggcaaga taggcatccg caccagcgga atgcagtcac gccagtaatt cgcttttata 3121 aagggatgcc cccaggttgt actttccagc cacagttcga ggatcgcggg gagttctgaa 3181 cgttgcgctt cccgaatcat ggtttatttc ccggatagca acagcagcca accacatgat 
3241 cattcaccag cccacatgcc tgcataaagg gagtaacaga ttgtggtgcc gacaaactta 3301 aaaccacgtt ttttcagtgc cttagatagg gcgttcg // LOCUS HUMARXA 1335 bp ss-mRNA PRI 12-JUL-1990 DEFINITION Human aldose reductase mRNA, complete cds. ACCESSION M34720 KEYWORDS aldehyde reductase; aldose reductase. SOURCE Human placenta, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1335) AUTHORS Grundmann,U., Bohn,H., Obermeier,R. and Amann,E. TITLE Cloning and prokaryotic expression of a biologically active human placental aldose reductase JOURNAL DNA Cell Biol. 9, 149-157 (1990) STANDARD simple staff_review FEATURES from to/span description pept 14 964 aldose reductase (EC 1.1.1.21) mRNA < 1 1335 aldose reductase mRNA BASE COUNT 319 a 347 c 356 g 313 t ORIGIN 1 gagcgcagca gccatggcaa gccgtctcct gctcaacaac ggcgccaaga tgcccatcct 61 ggggttgggt acctggaagt cccctccagg gcaggtgact gaggccgtga aggtggccat 121 tgacgtcggg taccgccaca tcgactgtgc ccatgtgtac cagaatgaga atgaggtggg 181 ggtggccatt caggagaagc tcagggagca ggtggtgaag cgtgaggagc tcttcatcgt 241 cagcaagctg tggtgcacgt accatgagaa gggcctggtg aaaggagcct gccagaagac 301 actcagcgac ctgaagctgg actacctgga cctctacctt attcactggc cgactggctt 361 taagcctggg aaggaatttt tcccattgga tgagtcgggc aatgtggttc ccagtgacac 421 caacattctg gacacgtggg cggccatgga agagctggtg gatgaagggc tggtgaaagc 481 tattggcatc tccaacttca accatctcca ggtggagatg atcttaaaca aacctggctt 541 gaagtataag cctgcagtta accagattga gtgccaccca tatctcactc aggagaagtt 601 aatccagtac tgccagtcca aaggcatcgt ggtgaccgcc tacagccccc tcggctctcc 661 tgacaggccc tgggccaagc ccgaggaccc ttctctcctg gaggatccca ggatcaaggc 721 gatcgcagcc aagcacaata aaactacagc ccaggtcctg atccggttcc ccatgcagag 781 gaacttggtg gtgatcccca agtctgtgac accagaacgc attgctgaga actttaaggt 841 ctttgacttt gaactgagca gccaggatat gaccacctta ctcagctaca acaggaactg 901 gagggtctgt gccttgttga gctgtacctc ccacaaggat taccccttcc atgaagagtt 961 
ttgaagctgt ggttgcctgc tcgtccccaa gtgacctata cctgtgtttc ttgcctcatt 1021 tttttccttg caaatgtagt atggcctgtg tcactcagca gtgggacagc aacctgtaga 1081 gtggccagcg agggcgtgtc tagcttgatg ttggatctca agagccctgt cagtagagta 1141 gaagtctctt ccagtttgct ttgcccttct ttctaccctg ctggggaaag tacaacctga 1201 ataccctttt ctgaccaaag agaagcaaaa tctaccaggt caaaatagtg ccactaacgg 1261 ttgagttttg actgcttgga actggaatcc tttcagcaag acttctcttt gcctcaaata 1321 aaaagtgctt ttgtg // LOCUS HUMARXB 652 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human aldose reductase gene, partial cds. ACCESSION M34721 KEYWORDS aldehyde reductase; aldose reductase. SOURCE Human placenta DNA, clone lambda-gt11-10. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 652) AUTHORS Grundmann,U., Bohn,H., Obermeier,R. and Amann,E. TITLE Cloning and prokaryotic expression of a biologically active human placental aldose reductase JOURNAL DNA Cell Biol. 
9, 149-157 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 27 aldose reductase, exon X (AA at 1) (EC 1.1.1.21) 626 > 652 aldose reductase, exon X+1 IVS 28 625 aldose reductase intron X BASE COUNT 142 a 164 c 184 g 162 t ORIGIN 1 gccaagcaca ataaaactac agcccaggta cagccacttc aggtgttgct gaccgtccac 61 aactgcctgc attcctgaca gtcctgttag ccaagaggag gaagtgactg agcctgttac 121 accctcacag gaagtatggt taggggtcct caagtacaga gtggaaaggg cacagatcgg 181 ggttttagaa gactctggca tgggctctta gattaatagt gcctgccccc actactgcaa 241 gggtgactgc cacgagggcc agcgcttgtt cattcatgtg gaacctcatc tgtacaaatg 301 taagagctct tagccgtgca gggaatgttc tttctcctga gtggtagtgt gcatttctag 361 ccagtggagg gcctcatgtg gtctcatgat atgcctgaga cactgaagcg tgtggcacag 421 tggctagcgc aggactctgg agtcagatct ggacctgaat gcgtcgccta cctgttgcta 481 gctgtgacct gacatcttgg agcccctctc tgatcacctg tggagttcta gcacgtcctt 541 ctgcaggttg tgtgtgtgag agactgagat gatgggtgcg agtgcctggc atgtatacac 601 actcactgtc tccttgggct cacaggtcct gatccggttc cccatgcaga gg // LOCUS MRV3TERM 59 bp ss-RNA VRL 12-JUL-1990 DEFINITION Marburg virus 3'terminal region of genome. ACCESSION M36065 KEYWORDS . SOURCE Marburg virus RNA. ORGANISM Marburg virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae. REFERENCE 1 (bases 1 to 59) AUTHORS Kiley,M.P., Wilusz,J., McCormick,J.B. and Keene,J.D. TITLE Conservation of the 3' terminal nucleotide sequences of Ebola and Marburg virus JOURNAL Virology 149, 251-254 (1986) STANDARD simple staff_review BASE COUNT 15 a 10 c 5 g 29 t ORIGIN 1 tctgtgtgtt ttgttctcta ctactaaaac acatagtata tttatttctt cttataatc // LOCUS RATQRED1 431 bp ds-DNA ROD 12-JUL-1990 DEFINITION Rat quinone reductase gene, exon 1. ACCESSION M33038 KEYWORDS quinone reductase. SEGMENT 1 of 2 SOURCE Rat DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 431) AUTHORS Bayney,R.M. 
and Pickett,C.B. TITLE Rat liver NAD(P)H:quinone reductase: Isolation of a quinone reductase structural gene and prediction of the NH2 terminal sequence of the protein by double-stranded sequencing of exons 1 and 2 JOURNAL Arch. Biochem. Biophys. 260, 847-850 (1988) STANDARD simple staff_review FEATURES from to/span description pept 244 + 250 quinone reductase, exon 1 pre-msg 170 > 431 quinone reductase mRNA and introns IVS 251 > 431 quinone reductase intron A BASE COUNT 91 a 125 c 119 g 96 t ORIGIN 1 taacttggta tcctcccccc agcgcctctg ggctggcaat ccagccccgc cctcgctggc 61 tgccctgcac agtgggctgg gccggaaaag caagatataa agcctgaaag tgctcagtac 121 agctcgcact agcctaggct gtggcacgca ggatctttcc gaagcatttc agggtcgtcc 181 tggcaaccag ctgctcagcc aatcagcgct tgacactacg atccgccccc aacttctgga 241 gccatggcgg gtgagtatgg ctccaactcc agcctaattc atcctgagga ggatgtaggg 301 gcttgctatg gggtttgttc cttgcctcga agttgaaaag tgtagagatt aggatcctgg 361 atgagcctcg gtgagtcccc ggaaggagag cttcttctca gaaccatagg tgcagattat 421 tctgcagccc c // LOCUS RATQRED2 410 bp ds-DNA ROD 12-JUL-1990 DEFINITION Rat quinone reductase gene, exon 2. ACCESSION M33039 KEYWORDS quinone reductase. SEGMENT 2 of 2 SOURCE Rat DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 410) AUTHORS Bayney,R.M. and Pickett,C.B. TITLE Rat liver NAD(P)H:quinone reductase: Isolation of a quinone reductase structural gene and prediction of the NH2 terminal sequence of the protein by double-stranded sequencing of exons 1 and 2 JOURNAL Arch. Biochem. Biophys. 
260, 847-850 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 108 / 272 quinone reductase, exon 2 IVS < 1 107 quinone reductase intron A IVS 273 > 410 quinone reductase intron B BASE COUNT 103 a 111 c 102 g 94 t ORIGIN 1 agaaactaag gtggggaacg tgtctggtcc caagcacttt tagattaggg actcacccgt 61 cctgtttgga ttttctttcc tcacctcctc acgtacgcct taaacagtga gaagagccct 121 gattgtattg gcccacgcag agaggacatc attcaactat gccatgaagg aggctgctgt 181 ggaggctctg aagaagaaag gatgggaggt ggtcgaatct gacctctatg ctatgaactt 241 taaccccctc atttccagaa acgacatcac aggtaagaat cgtctccctc cactgacagt 301 ggaccacgtg acccagcctc agcccctctt gcctcccaac aggggagccg aaggactcgg 361 agaactttca gtaccctgtt gagtcatctc tggcgtataa ggaaggccgc // LOCUS RHAFIXA 1040 bp ds-DNA BCT 12-JUL-1990 DEFINITION A.caulinodans nitrogen fixation protein (nifO and fixA) genes, complete cds and 5'end. ACCESSION M35122 KEYWORDS nitrogen fixation protein. SOURCE A.caulinodans (strain ORS571) DNA. ORGANISM Azorhizobium caulinodans Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae. REFERENCE 1 (bases 1 to 1040) AUTHORS Kaminski,P.A., Norel,F., Desnoues,N., Kush,A., Salzano,G. and Elmerich,C. TITLE Characterization of the fixABC region of Azorhizobium caulinodans ORS571 and identification of a new nitrogen fixation gene JOURNAL Mol. Gen. Genet. 
214, 496-502 (1988) STANDARD simple staff_review FEATURES from to/span description pept 328 657 nitrogen fixation protein (nifO) pept 939 > 1040 nitrogen fixation protein (fixA) BASE COUNT 181 a 349 c 328 g 182 t ORIGIN 1 gagctcggcc tctatgacat cgacgccagc gcggtgaacg tcgcgcacgt gcccgtcatt 61 ccggacgaga acgaggtgag cggcgtcgat atcgtcgtcc gcctgcgtcg cacgggccgc 121 tgagggacgc tccgcctgtc gccttcgggg cacccgcatc cgcgtagcag cgcggccgcc 181 tcccgcggac tggccatcgc cagtactggc acgggcattg cttggacctc atccgtgccc 241 cgacatcggg gcaacgggtt cgcccgccaa agcgaccgga tgagttcacc tcatccgatt 301 acgcaccaga ctttcaggag acggagcatg gcgaccgccg gcggcatcct cgatcagctc 361 aacaaggcat ccagcgcgga agacttcttc gcgctgctcg aggtcgatta cgatccccaa 421 gtggtgaatg tggtgcgcct gcatatcctg cggcgcatgg gccagtatct ggtcagcgag 481 aatttcgaag gccaggcgga tgacgccatc cgcgcccggt gcaaagaggt gctggaacag 541 gcctatgcgg acttcctcgc ctcctcgccc ttgcaggagc gggtgttcaa ggtgctgaag 601 gaggccgccc agccgccgaa gcccaagccg atggtatcgc tcaccgttct caagtgacgt 661 tccccccctc ccgcgtcctt caaggcggcc tgcacccggc aggccgccct tcgcgtttca 721 gggcgcgggc gggtggtgag gggccacggg caagacgcgc ctgtcgcatt ccgacgcggg 781 tggcggacgt tcctgtcggc ggcggagccg gggcggaaag cgcattgtgg catgccagac 841 agccctttga tttcatgcgc gttttcgggc tgaaagacag ttggtacgac acttgctcat 901 tcctccccaa gagcccaacc gttccgggag cgaacgcaat gcacatcgtc gtctgcatca 961 agcaggttcc tgactccgcg cagatccgcg tgcaccccgt gacgaacacc atcatgcgtc 1021 agggtgtgcc cacgatcatc // LOCUS STMPPG 200 bp ds-DNA BCT 12-JUL-1990 DEFINITION S.griseus brown pigment production gene, 5' flank. ACCESSION M35117 KEYWORDS brown pigment production protein. SOURCE S.griseus (strain TK21) DNA, clone pARC1. ORGANISM Streptomyces griseus Prokaryota; Bacteria; Firmicutes; Streptomycetaceae. REFERENCE 1 (bases 1 to 200) AUTHORS Horinouchi,S., Nishiyama,M., Nakamura,A. and Beppu,T. TITLE Construction and characterization of multicopy expression-vectors on Streptomyces spp JOURNAL Mol. Gen. Genet. 
210, 468-475 (1987) STANDARD simple staff_review FEATURES from to/span description mRNA 112 > 200 brown pigment production protein mRNA BASE COUNT 22 a 79 c 71 g 28 t ORIGIN 1 gatcgtccat ggtggccatc ccaccatccg ccgcgccggg gcggcgagcg cgtttcgctg 61 ggcggacacg ctccccttgc cggtgctagc gcgaccgcgc tagcgtggtc gggtgcccaa 121 gatccgtatg acgcccctga ccgaccggcg ttcggccggt tcctgaagca cgcccccgac 181 cgcgcggccg gccgggccgg // LOCUS SYNGPCNA 111 bp ds-DNA SYN 12-JUL-1990 DEFINITION Lymphocytic choriomeningitis virus nucleoprotein gene, 5' end. ACCESSION M35111 KEYWORDS nucleoprotein. SOURCE Synthetic DNA, clone pACRP1-LCM WE N. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 111) AUTHORS Matsuura,Y., Possee,R.D. and Bishop,D.H.L. TITLE Expression of the S-coded genes of lymphocytic choriomeningitis arenavirus using a baculovirus vector JOURNAL J. Gen. Virol. 67, 1515-1529 (1986) STANDARD simple staff_review FEATURES from to/span description pept 92 > 111 nucleoprotein BASE COUNT 37 a 17 c 21 g 36 t ORIGIN 1 tggagataat taaaatgata accatctcgc aaataaaccg gatcctaggc atttgattgc 61 gcttttattt ggaaattcat tgtgtgacaa aatgtctttg tccaaagaag t // LOCUS SYNGPCNB 111 bp ds-DNA SYN 12-JUL-1990 DEFINITION Lymphocytic choriomeningitis virus glycoprotein precursor gene, 5' end. ACCESSION M35112 SOURCE Synthetic DNA, clone pACRP1-LCM WE G. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 111) AUTHORS Matsuura,Y., Possee,R.D. and Bishop,D.H.L. TITLE Expression of the S-coded genes of lymphocytic choriomeningitis arenavirus using a baculovirus vector JOURNAL J. Gen. Virol. 
67, 1515-1529 (1986) STANDARD simple staff_review FEATURES from to/span description pept 109 > 111 glycoprotein precursor BASE COUNT 33 a 20 c 25 g 33 t ORIGIN 1 tggagataat taaaatgata accatctcgc aaataaaccg gatcctaggc tttttggatt 61 gcgctttcct ttaggacaac tgggtgctgg attctatcca gtaaaaggat g // LOCUS SYNGPCNC 131 bp ds-DNA SYN 12-JUL-1990 DEFINITION Lymphocytic choriomeningitis virus nucleoprotein gene, 5' end. ACCESSION M35113 KEYWORDS nucleoprotein. SOURCE Synthetic DNA, clone pACRP5-LCM WE N. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 131) AUTHORS Matsuura,Y., Possee,R.D. and Bishop,D.H.L. TITLE Expression of the S-coded genes of lymphocytic choriomeningitis arenavirus using a baculovirus vector JOURNAL J. Gen. Virol. 67, 1515-1529 (1986) STANDARD simple staff_review FEATURES from to/span description pept 112 > 131 nucleoprotein BASE COUNT 41 a 19 c 24 g 47 t ORIGIN 1 tggagataat taaaatgata accatctcgc aaataaataa gtattttact gttttcgccg 61 gatcctaggc atttgattgc gcttttattt ggaaattcat tgtgtgacaa aatgtctttg 121 tccaaagaag t // LOCUS SYNGPCND 131 bp ds-DNA SYN 12-JUL-1990 DEFINITION Lymphocytic choriomeningitis virus glycoprotein precursor gene, 5' end. ACCESSION M35114 KEYWORDS glycoprotein precursor. SOURCE Synthetic DNA, clone pACRP5-LCM WE G. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 131) AUTHORS Matsuura,Y., Possee,R.D. and Bishop,D.H.L. TITLE Expression of the S-coded genes of lymphocytic choriomeningitis arenavirus using a baculovirus vector JOURNAL J. Gen. Virol. 67, 1515-1529 (1986) STANDARD simple staff_review FEATURES from to/span description pept 129 > 131 glycoprotein precursor BASE COUNT 37 a 22 c 28 g 44 t ORIGIN 1 tggagataat taaaatgata accatctcgc aaataaataa gtattttact gttttcgccg 61 gatcctaggc tttttggatt gcgctttcct ttaggacaac tgggtgctgg attctatcca 121 gtaaaaggat g // LOCUS SYNGPCNE 155 bp ds-DNA SYN 12-JUL-1990 DEFINITION Lymphocytic choriomeningitis virus nucleoprotein gene, 5' end. 
ACCESSION M35115 KEYWORDS nucleoprotein. SOURCE Synthetic DNA, clone pACRP6-LCM WE N. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 155) AUTHORS Matsuura,Y., Possee,R.D. and Bishop,D.H.L. TITLE Expression of the S-coded genes of lymphocytic choriomeningitis arenavirus using a baculovirus vector JOURNAL J. Gen. Virol. 67, 1515-1529 (1986) STANDARD simple staff_review FEATURES from to/span description pept 136 > 155 nucleoprotein BASE COUNT 53 a 22 c 26 g 54 t ORIGIN 1 tggagataat taaaatgata accatctcgc aaataaataa gtattttact gttttcgtaa 61 cagttttgta ataaaaaaac cccggatcct aggcatttga ttgcgctttt atttggaaat 121 tcattgtgtg acaaaatgtc tttgtccaaa gaagt // LOCUS SYNGPCNF 155 bp ds-DNA SYN 12-JUL-1990 DEFINITION Lymphocytic choriomeningitis virus glycoprotein precursor gene, 5' end. ACCESSION M35116 KEYWORDS glycoprotein precursor. SOURCE Synthetic DNA, clone pACRP6-LCM WE G. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 155) AUTHORS Matsuura,Y., Possee,R.D. and Bishop,D.H.L. TITLE Expression of the S-coded genes of lymphocytic choriomeningitis arenavirus using a baculovirus vector JOURNAL J. Gen. Virol. 67, 1515-1529 (1986) STANDARD simple staff_review FEATURES from to/span description pept 153 > 155 glycoprotein precursor BASE COUNT 49 a 25 c 30 g 51 t ORIGIN 1 tggagataat taaaatgata accatctcgc aaataaataa gtattttact gttttcgtaa 61 cagttttgta ataaaaaaac cccggatcct aggctttttg gattgcgctt tcctttagga 121 caactgggtg ctggattcta tccagtaaaa ggatg // LOCUS XELD7 1051 bp ss-mRNA VRT 12-JUL-1990 DEFINITION X.laevis pot. developmental protein (D7) mRNA, complete cds. ACCESSION M35119 KEYWORDS developmental protein D7. SOURCE X.laevis, cDNA to mRNA, clones D7.1 and D7.0. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 1051) AUTHORS Smith,R.C., Dworkin,M.B. and Dworkin-Rastl,E. 
TITLE Destruction of a translationally controlled mRNA in Xenopus oocytes delays progesterone-induced maturation JOURNAL Genes Dev. 2, 1296-1306 (1988) STANDARD simple staff_review FEATURES from to/span description pept 27 863 pot. developmental protein D7 BASE COUNT 318 a 240 c 222 g 271 t ORIGIN 1 gaaaaccggg acgtttgggc tgcaatatgg aatttgatga gctgatgcag tgcccatatg 61 acaaaaatca tatgattcgg cccagccggt ttccctacca ccttgttaaa tgcagagaga 121 ataatcgtgc agcagctaaa attctagcaa cttgcccata taatgcccgc cacagagtcc 181 ctaaacagga gcttgatctg cacatggcca gctgtgaata cagggtgacc atggagccca 241 tttctgctgc attttcacat cagaaggtgg agacctcaac atggcaaagc cctccttgtg 301 aagaggtctg ggaaactgac gaagatcccg tgtcaaggcc aaagcccttt attttaaatg 361 attttactcc ttctcagcct tttaatatgt cagaaggtga tggaaatatg ccgtatactg 421 gaataagcag caactacaga cctgaagtcc aacctatgaa ttcagtcatg caagtaaagc 481 aaaatcaacc tgaacctgag ccttttacct ccagtgagcg aaactatgat ccacgatcca 541 aggaaccacc caatccaaag caacctgcag tgaatggcta caaacctgca actacaaata 601 caaacccatg gtgcaggcaa acgggaggat cgaggggagc tgctcctcca aagttgggtg 661 ctaaatcctc agatgagggg ccaagaaata aggaatttcc cactccaaag gcgaacttga 721 tgaatgagta cgtacctgta gcagcaaatg caaatccatg gtgcaggcaa ccaggagggt 781 ccagtgctgc ttcagaacct ttgggtgttg actccttcga tgagtggcca tgccttggac 841 gccagccatg ggttagaaag taaatcttca ctttaaaaac aggactttca tctgaacctg 901 ttcctgactt gtccaactcc tggattttta aaatttgttg tgaagttgcc atttagtatt 961 tttgtacaaa attttaacag ccttcatttt tacatattaa gctttttatc acaaatataa 1021 tactaattta cttgaatgtt atttgttaac c // LOCUS YSCNUP1 4986 bp ds-DNA PLN 12-JUL-1990 DEFINITION S.cerevisiae nucleoporin (NUP1) gene, complete cds. ACCESSION M33632 KEYWORDS nuclear pore complex protein; nucleoporin. SOURCE S.cerevisiae (strain S288C) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 4986) AUTHORS Davis,L.I. and Fink,G.R. 
TITLE The NUP1 gene encodes an essential component of the yeast nuclear pore complex JOURNAL Cell 61, 965-978 (1990) STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by L.I.Davis, 06-APR-1990, for release after publication. FEATURES from to/span description pept 1001 4231 nucleoporin (NUP1) (put.) BASE COUNT 1590 a 1036 c 991 g 1369 t ORIGIN Chromosome XV; 14 cm prox. to ADE2. 1 gaattcatca gtgaactctt catcattcaa aaacacccaa tcatagttga acttggagtt 61 aaatctatct tccacggatt taatagactc agccaacgaa tatagatctc tattacgcac 121 tagagtgaca aaagtggcct tttcccgagg accggtatac tttggtagtt ccactttcga 181 gtatttatag cccgagccgg aagatagcac ggtactgtac tttgggcttg gggcagagcc 241 tctcgcatac tgagctccat gaaagaacac atacacagta aacacggcga ccagaagaag 301 tcctaatttt ttgtaaacag gctgcttgct agctgggatc ataatcttcg ccattttggt 361 tattgactct atcccttaaa aactcttctg atggagtact ttacttcgat tgcttaacga 421 aatctttgtg aggaaaaaga tatctcttaa aattagaaag tacaatagtc tagcgtatta 481 tactaagaat ctgcaaaaaa gaagcaagaa ggcaccacct attatagacc tttgacacga 541 agtctctctg gagtgctttg gcctacgtgt gcggtactcg tttacatggg acaaccacgg 601 tttttttttt ggtgttactg gaggtataca gtgcgtatat ccacttgtac gacaagagat 661 ttacactaca ccgcgtaaag aaaacgccga caccaaatat aagtcacgtg tatgcaaagc 721 ctattttatg ccctaatttt caagccccgg tttttacgcc ctagttttta tatttagggt 781 ttgtcgttgc acgtgatcaa tggttcgtat tatgtgacat tgaaatgctt tttcatttta 841 atttttttct ttgacgaaat ttcgtaatgt caagaaacac ttaaagaaaa taagtgatga 901 ggaactcaat aaggacacta cgtagcggtg caaatacgat aggatattag cctcgaaagg 961 gttataggga cagagagtga gcgacaattt ttagtcattc atgtcttcaa acacttcttc 1021 tgtgatgtct tctccacgtg tcgaaaagag atcgttttct tccactttaa aatcattctt 1081 cacaaacccc aataaaaaac ggccatcgag caagaaagtt ttcagttcaa acctctcata 1141 cgcgaatcat ttggaggaat cagatgttga agacacactg catgttaata agagaaagag 1201 ggtgtccggt acatcacagc atagcgacag cttaactcag aacaacaata atgcgccaat 1261 tataatatat ggaaccgaaa acactgagag accgccgctt ttgccaattt tgcccattca 1321 aagactgagg 
ttattaaggg agaagcagag ggtgagaaat atgcgtgagc ttggattaat 1381 tcaatcaact gaatttccat ctattacatc gtcggttata ttgggctctc aaagtaaaag 1441 cgatgaggga ggatcgtacc tatgcacatc atctactcct tcccctatta aaaacggttc 1501 ttgcactagg cagttggccg gaaaaagcgg tgaagacacc aatgttggac tacccattct 1561 caaatcattg aaaaatagat ccaatagaaa aaggtttcat agtcagtcaa aggggaccgt 1621 gtggtcagca aattttgaat atgatttgtc agaatatgac gctatacaaa aaaaggataa 1681 caaggataag gaaggtaacg ctggcggtga tcagaagaca agcgagaata gaaataatat 1741 taagagtagt atttcaaatg gcaatctggc tacaggccct aacctgacaa gcgaaattga 1801 agacctacgt gcagacatca actctaatag gttatcgaat cctcaaaaaa atctactttt 1861 aaaaggacca gcttccacag ttgcaaaaac tgcccctatt caggagagct ttgttcccaa 1921 ttcagagcgc tctggtacgc ctacgttaaa gaaaaatatt gagcccaaaa aggacaaaga 1981 aagtattgtt ttgcccaccg taggttttga ctttatcaag gacaatgaga ctccatctaa 2041 gaaaacttct cctaaggcaa cttcttctgc aggtgcagtc tttaaatcga gtgtagaaat 2101 gggaaaaacc gataagtcaa cgaaaactgc cgaggcgcct accttatcat tcaattttag 2161 ccaaaaggct aataaaacta aggctgtcga caatactgtc ccttccacaa ctttattcaa 2221 ttttggtggt aaatcagata ccgttacttc tgccagtcaa ccttttaaat ttggaaagac 2281 atccgaaaaa agtgaaaatc atacagaatc agacgcgcct ccaaaatcaa ctgctccaat 2341 attttctttt ggtaaacaag aagagaatgg tgatgaaggt gatgatgaaa atgagcccaa 2401 aagaaaaagg cgtttacctg ttagcgagga tacaaacacc aagcctttat tcgatttcgg 2461 caagaccggt gatcaaaagg agaccaaaaa gggagagtca gaaaaggacg catcagggaa 2521 accaagcttt gtctttggtg caagtgataa gcaagctgaa ggtacaccat tatttacatt 2581 cggaaaaaaa gctgatgtaa caagcaatat tgactcctct gcacaattta cctttggtaa 2641 agccgccacc gcgaaagaaa cacacaccaa accatctgag acacctgcca caatagtcaa 2701 gaagcctact tttacttttg ggcagtcaac aagtgaaaat aagatctctg agggaagtgc 2761 gaaacctaca ttctctttct ctaagtcaga ggaggaacgt aagagtagtc caatttcaaa 2821 cgaagcagct aaaccctcgt tttcgtttcc gggcaagcct gttgatgttc aagcaccgac 2881 ggatgataag actctcaagc caactttttc ttttactgaa cctgctcaaa aagattcatc 2941 tgttgtttcg gaacctaaaa agccctcctt tacgtttgcg tcttcaaaaa cctcacaacc 3001 aaagccattg ttttcatttg 
gtaagtcaga tgcagctaaa gaaccaccag gctctaacac 3061 ctcattttct ttcactaaac ctcctgctaa tgagacagat aaaagaccta caccgccatc 3121 tttcaccttt ggcggttcca caacaaataa tacaacaacc actagcacaa aaccatcttt 3181 tagttttggg gctcccgagt cgatgaagtc gacagcaagt acagcggcag caaatacgga 3241 gaagctatca aatggctttt cctttacaaa gttcaatcac aataaagaaa agtcaaactc 3301 tccaacttct ttcttcgatg gttctgcttc ctcaacgccg attcctgtct tgggtaagcc 3361 aacagacgct actggtaata caacatctaa atctgcattt tcattcggta ctgctaacac 3421 caatggtacc aatgcctcag caaactccac atcattctcg tttaacgccc ctgctactgg 3481 taacggcaca actactactt ccaatacctc aggaaccaat atagcgggta catttaacgt 3541 aggaaaaccg gatcaaagta tcgctagtgg caatacgaat ggagcgggct cggcatttgg 3601 cttttcgagc tcaggaacag cagcaactgg tgcagcttct aatcaatctt catttaattt 3661 tggaaacaat ggtgcagggg gtctcaatcc ttttacatca gcaacttcgt caactaatgc 3721 taatgctggt ttattcaata aacctccttc cacgaatgca caaaatgtca atgttccctc 3781 tgcttttaat tttacgggaa ataattcaac gcctggtggc ggctctgtat ttaatatgaa 3841 cggcaacact aatgctaata cggtgtttgc cggctctaat aaccaaccac atcaatcgca 3901 aaccccatct ttcaatacaa acagctcatt cacgccatca acagttccta atattaattt 3961 tagcggattg aatggcggaa ttactaatac cgcgaccaat gcattaaggc caagtgatat 4021 atttggtgcg aatgctgcct ctggttccaa ttcaaacgta acaaatccat catccatttt 4081 tgggggggca ggtggtgtgc cgacaacttc ttttgggcag ccgcagtcag cccctaatca 4141 gatggggatg ggaacaaata atggcatgag catgggcggt ggtgttatgg cgaacagaaa 4201 gattgcaaga atgaggcact ctaaaaggta aatatcggtt atgtaattag gtattgtgtt 4261 gcttttctga aggatatata ttcctatatt ccttcaactt atacactgca tatgaaactt 4321 cttgagaagt tttaaataat tggtcttttt tttaatcggg tacaggcgag ataatggata 4381 cctgttctag ggtaggcaaa tttggaggca agagcccaaa attgttttgt tttgacctgg 4441 tttctctcag ttaacatatt atgaatagca tcttacaaat aatttacgag cgaaactttc 4501 gatctgcaaa gcgctccttt tttctttccg aaaaaataca gtaaaataag gtctagtaca 4561 cagcctgatc tacaattagt gttcttttct gtttacttcg atcagctgtt tagctttgcc 4621 acttttctag tagctccaca cgctgtgaaa gtaacgagac atagaagtca cggatggatt 4681 tgaaaaggga ttggttacga tggaaaatta 
caataggtag tgggcctggc agtattgtgc 4741 tggatttccc atcatttttg gtagggtgcg tattcacaac aatgatgggg ccaatattac 4801 agaagcttat aggtaaactt ttggtaggtt taataacggt atgcaaattc ttagttatta 4861 ttggaagcat tgtcttcgtt atcggagttg catcaaagaa atacacatat gacgatttca 4921 aagtaagtat aaaaagatca ggagagccag gggaatcaca tgatatgagg actgaaccaa 4981 aaagaa //