Path: utzoo!attcan!uunet!ns-mx!iowasp.physics.uiowa.edu!maverick.ksu.ksu.edu!rutgers!bionet!root From: GenBank-Updates@genbank.bio.net Newsgroups: bionet.molbio.genbank.updates Subject: Database Update Message-ID: Date: 7 Jul 90 12:00:06 GMT Sender: root@genbank.BIO.NET Distribution: bionet Lines: 733 Approved: lear@genbank.bio.net Checksum: 62748 46 LOCUS TRBMVAT5A 1544 bp ss-mRNA INV 07-JUL-1990 DEFINITION T.brucei variant surface glycoprotein (MVAT5) mRNA, complete cds. ACCESSION M33825 KEYWORDS variant surface glycoprotein. SOURCE Trypanosoma brucei rhodesiense, cell line WRATat1, cDNA to mRNA. ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 1544) AUTHORS Reddy,L.V., Hall,T. and Donelson,J.E. TITLE Sequences of three VSG mRNAs expressed in a mixed population of Trypanosoma brucei rhodesiense JOURNAL Biochem. Biophys. Res. Commun. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.E.Donelson, 23-APR-1990. BASE COUNT 497 a 396 c 366 g 285 t ORIGIN 1 tttctgtact atattgcaga agcaacactg agaactccac agatgatagg aaaagccttt 61 attattttat ctttacttaa cgagctgcca acgccgacgg cagcacaagc ggcacagggt 121 ggtgccctcg gaaaagacgt atggctacct ctcgctaaat tcacggcgac ggccgcgaaa 181 atcccaggca gggcggcaaa gctgcttcaa gacaggtcgg cccaaatagt taaccttatg 241 aaactccaag ttcaggcaga catatgcctc aacaaagcag cgtcagaggt gagcgcactt 301 gggtggcagg cgctcgctgt tgcaatagca gcagacatcg gcagcctgca aagcttgcaa 361 cagcagagga gtgaagaggc aatagcggcc gcggcagctg ccgaattcgc tcggggccac 421 gcagcggaat tcttcaaagt agctgcggca gtccaaagcg ccgccaatag cggctgcctg 481 acaacaaaca ataaaggtgg cgcagccggc agcgtgataa acggattctc gacactcggc 541 accgcggagc agccagcaat cggcgctaca tcgacggctc acgtcggcga cgacataacg 601 gcgataacaa caacagggtt cagcgaccta gcagcaacag acggcatacg caccgactca 661 ctaacagcgg acacaaactg cgttcttttc aagggaggca gcgatggacc actaacgaca 721 gcaggcataa aagtctacag ggacgcccac gccgccgcag cgaaaatacg cacagcggca 781 accttcggct cgaccttcac cgacttcaag aagctggacc aggctaagaa gtcagtccat 841 ttgcgcgcag cagtaaagaa cataattctc ggcaaacctg acggatccgt agacgacctt 901 tccggcgaaa tagacacaaa gataaaccag gtattcggcg aggaccaaga aacattccac 961 agcaggtttt gggatcaact aacaaaagta aaagtggaaa aggcggcgag tggacaagaa 1021 gaaacgacac tcgatgcaat cacttctttt gcagccttaa gccgagctcg gacttattac 1081 tccacgaaag tgatcaaagg tttgagagat aagatatcct cactagaaat taaaaattcc 1141 aaaacggaag ttaaagtcac tgacgccgac tgcaacaaac accaatcaaa agacaaatgc 1201 gcagccccat gcaaatggaa cgagaatacc actgacataa acaaaaaatg ctcattagat 1261 cccgtaaaag cgacagaaca gcaagcagcc cagacagcag gagcaggaga aggagctgca 1321 ggaacaacaa cagataaatg caaagataag aaaaaggatg actgcaaatc tccggactgc 1381 aaatgggagg gtgaaacttg caaagattcc tctattctcc taaacaaaca attcgcccta 1441 atggtttctg cagcctttgt ggccttgctt ttttaatttt ttccccctct ttttcttaaa 1501 gaatttttgc tactttaaaa acttctgata tattttaaca ccta // LOCUS TRBWRATATA 1544 bp ss-mRNA INV 07-JUL-1990 DEFINITION T.brucei variant surface glycoprotein (WRATat A) mRNA, complete cds. ACCESSION M33823 KEYWORDS variant surface glycoprotein. SOURCE Trypanosoma brucei rhodesiense, cell line WRATat1, cDNA to mRNA. ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 1544) AUTHORS Reddy,L.V., Hall,T. and Donelson,J.E. TITLE Sequences of three VSG mRNAs expressed in a mixed population of Trypanosoma brucei rhodesiense JOURNAL Biochem. Biophys. Res. Commun. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.E.Donelson, 23-APR-1990. FEATURES from to/span description pept 42 1457 variant surface glycoprotein (WRATat A) mRNA < 1 1544 WRATat A mRNA BASE COUNT 545 a 387 c 345 g 267 t ORIGIN 1 gaacagtttc tgtactatat tgcggacaaa tctagaaggc catgtccgtt ctgtttctgc 61 tcctagcaat aacacgaaca gcctcggtga aagcagcgga aggagaccag gcggctgatt 121 ttttgccttt atgcgaagcc tggcaggcaa ctaaagcgct agcaaatgcg gcgtataaac 181 tcccgccgtt tccaccagat ctgacagaca tactaaactt taacataact gtggctcccg 241 aggaatggaa agcaatcttt acagatggcg gatctgacaa cacatgggaa agattcgccg 301 aaggacacaa gaatactcta aatggcggca actggaaaac aagatgggaa catatcaagc 361 aagcaaggca agatacaaaa gaagcttcgt caccgtggaa cgcgttaaac agcaaattaa 421 taaacacagc cacagtcaat accaccagag cctacatagc aagcatagca gacgaagcct 481 tcgacctata ccaggggaca cagacacccc tacaaacacc caaagccttg gaagccgcca 541 gcctagcaga agcagcgaaa gcaatacttt gctcagaccc cctaaagcca acagccgacg 601 ggcaggcatg cacagatata acagcgacgc caagcaaagc ggcaacatgc ccaactggac 661 gaagcagcaa gggaggggcg ccaataggac tagatacggt ctgtctctgc tcaacaaaca 721 aaccaagtat gcatagcaga cgacgaaaag cggcagcagt gatgaccgac ggacaactaa 781 aagacggcat cctcaagaaa ttattagcgg cgtgcccaaa aaagccaacc ctaaatgaac 841 cagcagcagc cgcccgccac gcagtaacgg tactcgcaac acggctagct caaaaagttg 901 cgcgcgccga agaaggccaa ataattctcg gaaccagagc cgaaacggac tgcgctagtt 961 cgggatcagc ctgtgtagaa tatactaact ttttcaaaga tggcgatggc ttagcagctg 1021 ttccctgggt gaagaagctg ctggcggcgg cagattttta cgacacaatc gaaaagcgca 1081 aagaaagcga caaaaacgcc gcgacagcaa tagcagccct caaatctgct ttaatcaggg 1141 aatttagaag accaggacaa gaacaaacac tggcaacaac aggaactaaa agcagcagcc 1201 cccaaagcac ccaacaaaaa gcatccgaag ccgaagcaaa ttgcaatgac aaagccaaag 1261 aaactgaatg caactcccca tgcaaatggg ataaggaaga aaaggatgag aaaaaaaggt 1321 gcaagctgag tgaggaaggc aaacaagcag aaaaagaaaa ccaagaaggg aaagatggga 1381 aagcaaacac cacaggaagc agcaattctt ttgtcattaa aacttcccct cttttgcttg 1441 cagttttgct tctttaatcc ctccccctcc ctttaaaatt tttgataaaa atttttgcta 1501 cttgaaaaac tttctcatat attttaacac ctaaaagttt cccg // LOCUS TRBWRATATB 1585 bp ss-mRNA INV 07-JUL-1990 DEFINITION T.brucei variant surface glycoprotein (WRATat B) mRNA, complete cds. ACCESSION M33824 KEYWORDS variant surface glycoprotein. SOURCE Trypanosoma brucei rhodesiense, cell line WRATat1, cDNA to mRNA. ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 1585) AUTHORS Reddy,L.V., Hall,T. and Donelson,J.E. TITLE Sequences of three VSG mRNAs expressed in a mixed population of Trypanosoma brucei rhodesiense JOURNAL Biochem. Biophys. Res. Commun. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.E.Donelson, 23-APR-1990. BASE COUNT 535 a 384 c 376 g 290 t ORIGIN 1 aacagtttct gtactatatt gcagtttcgc gttcagctta atgtggataa tcttggcact 61 gctaacttta gctgggtccc gcgtcgccca tggggcaggt aagaatgtca acggcgttga 121 gttcaacctt ttttgtcaca tagctaacat gctaaacgcg gaaaagatcg aagacgacaa 181 aactgatggc ctagaccgcc aagctgccga ggcatggacg gcaatcgaca gcatatttac 241 agtaacagcc aacgaaagct actacagtga aggaccagcc agcgcagcaa atacgaccga 301 cgaaaaccag gatgccaagc cggaacgggt agcaaaatgg gtgcagaaac gcaaccaaat 361 agacaaaatc gcagctcctg gtaatgagaa aaacggaaaa tacgcgcgac gaccaaggga 421 cagaatgtca gcagcaacag gagcgaaact cgatacggtt ttcacactcg cttcggaggc 481 acgagtccga ctaatgcaga tagacacaga gatagcaaca aataaacaag aaatcaggca 541 gcagctagga ctgcattgct cggaggggca aggcaagggt cagagcagaa accagcatcc 601 ggataatgcc gcattcgcaa gcgactactc aactgcgtgc aaaggatcga caggaccagg 661 aaaaagtctt gcgaacgacc tagtatgtat ctgcagcact gacaccagcc aagcccaaag 721 cacactacag atgtgcacga gcatcgacga tgcgaacagc ttattcagta ccctacacaa 781 acgaagccaa tgccaaggcg attttccttg ccctcatcgg gtttgtgcta agacagccga 841 aacaagcgag ctgacggaaa ccaacataaa caactgtgta acggctttta cagcgacact 901 gggcagacat acaaagagtt cggccacaaa tgaaggggcc tatgtctttg ggagcggaca 961 gaacagcggc gacgagtgca acgggggagc agcaacaggg caatcctgtg tcagctatca 1021 cgacctcata acagctaaat ccggtacgac ataagcggcg caatcactcg gctaaagcaa 1081 ctacaaatcg ccaaagcaaa gctaaaagca agacggctac tgctgcaaaa cagggaacgg 1141 cagcaaacgc gacttatggc gctagcagac aagatgcaag aattgtacca agaggcctta 1201 catgacgagg ttcaactcag gaaggaagcg cagaacaaac ctcaagaaac accagattct 1261 gacaagcaaa aagcatgcga gaaatatcac aacaagtcaa aggaatgcaa agaaaatggt 1321 tgccaatgga gtggaactga agaaaccact aggaaagtgc gaagctaaac ccaaagcagg 1381 aacagaagcc gcaacaacgg gaccaggaga gagagatgca ggagccactg caaacaccac 1441 aggaagcagc aattcttttg tcattaaaac ttcccctctt ttgtttgcat ttttgctttt 1501 ttaatttttc ccctcaaatt tccccctctt ttttaaaatt tttctttcta cttggaaact 1561 tctggtatat tttaacacct ttaaa // LOCUS HUMMHB17W 3520 bp ds-DNA PRI 07-JUL-1990 DEFINITION Human MHC class I HLA-Bw58 gene, complete cds. ACCESSION M11799 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Human B lymphoblastoid cell line WT49 DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3520) AUTHORS Ways,J.P., Coppin,H.L. and Parham,P. TITLE The complete primary structure of HLA-Bw58 JOURNAL J. Biol. Chem. 260, 11924-11933 (1985) STANDARD simple staff_review FEATURES from to/span description pept 88 160 HLA-Bw58 of MHC, exon 1 /nomgen="HLA-B" /map="6p21.3" /hgml_locus_uid="LL0111R" 290 559 HLA-Bw58 of MHC, exon 2 806 1081 HLA-Bw58 of MHC, exon 3 1657 1932 HLA-Bw58 of MHC, exon 4 2037 2153 HLA-Bw58 of MHC, exon 5 2595 2627 HLA-Bw58 of MHC, exon 6 2734 2777 HLA-Bw58 of MHC, exon 7 pre-msg 67 3386 HLA-Bw58 of MHC, mRNA and introns IVS 161 289 HLA-Bw58 intron A IVS 560 805 HLA-Bw58 intron B IVS 1082 1656 HLA-Bw58 intron C IVS 1933 2036 HLA-Bw58 intron D IVS 2154 2594 HLA-Bw58 intron E IVS 2628 2733 HLA-Bw58 intron F BASE COUNT 665 a 981 c 1099 g 775 t ORIGIN Chromosome 6p21.3. 1 tctagagaag ccaatcagtg tcgccggggt cccagttcta aagtccccac gcacccaccc 61 ggactcagaa tctcctcaga cgccgagatg cgggtcacgg cgccccgaac cgtcctcctg 121 ctgctctggg gggcagtggc cctgaccgag acctgggccg gtgagtgcgg ggtcgggagg 181 gaaatggcct ctgtggggag gagcgagggg accgcaggcg ggggcgcagg acctgaggag 241 ccgcgccggg aggagggtcg ggcgggtctc agcccctcct cgcccccagg ctcccactcc 301 atgaggtatt tctacaccgc catgtcccgg cccggccgcg gggagccccg cttcatcgca 361 gtgggctacg tggacgacac ccagttcgtg aggttcgaca gcgacgccgc gagtccgagg 421 acggagcccc gggcgccatg gatagagcag gaggggccgg agtattggga cggggagaca 481 cggaacatga aggcctccgc gcagacttac cgagagaacc tgcggatcgc gctccgctac 541 tacaaccaga gcgaggccgg tgagtgaccc cggcccgggg cgcaggtcac gactccccat 601 cccccacgta cggcccgggg tcgccccgag tctccgggtc cgagatccgc ctccctgagg 661 ccgcgggacc cgcccagacc ctcgaccggc gagagcccca ggcgcgttta cccggtttca 721 ttttcagttg aggccaaaat ccccgcgggt tggtcggggc ggggcggggc tcgggggacg 781 gggctgaccg cggggccggg gccagggtct cacatcatcc agaggatgta tggctgcgac 841 ctggggcccg acgggcgcct cctccgcggg catgaccagt ccgcctacga cggcaaggat 901 tacatcgccc tgaacgagga cctgagctcc tggaccgcgg cggacaccgc ggctcagatc 961 acccagcgca agtgggaggc ggcccgtgtg gcggagcagc tgagagccta cctggagggc 1021 ctgtgcgtgg agtggctccg cagatacctg gagaacggga aggagacgct gcagcgcgcg 1081 ggtaccaggg gcagtgggga gccttcccca tctcctatag gtcgccgggg atggcctccc 1141 acgagaagag gaggaaaatg ggatcagcgc tagaatgtcg ccctcccttg aatggagaat 1201 ggcatgagtt ttcctgagtt tcctctgagg gccccctctt ctctctagga caattaaggg 1261 atgacgtctc tgaggaaatg gaggggaaga cagtccctag aatactgatc aggggtcccc 1321 tttgacccct gcagcagcct tgggaaccgt gacttttcct ctcaggcctt gttctctgcc 1381 tcacactcag tgtgtttggg gctctgattc cagcacttct gagtcacttt acctccactc 1441 agatcaggag cagaagtccc tgttccccgc tcagagactc gaactttcca atgaatagga 1501 gattatccca ggtgcctgcg tccaggctgg tgtctgggtt ctgtgcccct tccccacacc 1561 aggtgtcctg tccattctca ggctggtcac atgggtggtc ctagggtgtc ccatgagaga 1621 tgcaaagcgc ctgaattttc tgactcttcc catcagaccc cccaaagaca cacgtgaccc 1681 accaccccgt ctctgaccat gaggccaccc tgaggtgctg ggccctgggc ttctaccctg 1741 cggagatcac actgacctgg cagcgggatg gcgaggacca aactcaggac actgagcttg 1801 tggagaccag accagcagga gatagaacct tccagaagtg ggcagctgtg gtggtgcctt 1861 ctggagaaga gcagagatac acatgccatg tacagcatga ggggctgccg aagcccctca 1921 ccctgagatg gggtaaggag ggggatgagg ggtcatatct cttctcaggg aaagcaggag 1981 cccttctgga gcccttcagc agggtcaggg cccctcgtct tcccctcctt tcccagagcc 2041 atcttcccag tccaccatcc ccatcgtggg cattgttgct ggcctggctg tcctagcagt 2101 tgtggtcatc ggagctgtgg tcgctactgt gatgtgtagg aggaagagct caggtaggga 2161 aggggtgagg ggtggggtct gggttttctt gtcccactgg gggtttcaag ccccaggtag 2221 aagtgttccc tccctcatta ctgggaagca gcatccacac aggggctaac gcagcctggg 2281 accctgtgtg ccagcactta ctcttttgtg cagcacatgt gacaatgaag gacggatgta 2341 tcaccttgat ggttgtggtg ttggggtcct gatttcagca ttcatgagtc aggggaaggt 2401 ccctgctaag gacagacctt aggagggcag ttggtccagg acccacactt gctttcctcg 2461 tgtttcctga tcctgccttg ggtctgtagt catacttctg gaaattcctt ttgggtccaa 2521 gacgaggagg ttcctctaag atcttaaggc cctgcttcct cccagtcccc tcacaggaca 2581 ttttcttccc acaggtggaa aaggagggag ctactctcag gctgcgtgta agtggtgggg 2641 gtgggagtgt ggaggagctc acccacccca taattcctcc tgtcccacgt ctcctgcggg 2701 ctctgaccag gtcctgtttt tgttctactc cagccagcga cagtgcccag ggctctgatg 2761 tgtctctcac agcttgaaaa ggtgagattc ttggggtcta gagtgggtcg ggtggcgggt 2821 ctgggggtgg gtggggcaga ggggaaaggc ctgggtaatg gggattcttt gattgggatg 2881 tttcgcgtgt gtggtgggct gtttacagtg tcatcgctta ccatgactaa ccagaatttg 2941 ttcatgactg ttgttttctg tagcctgaga cagctgtctt gtgagggact gagatgcagg 3001 atttcttcac gcctcccctt tgtgacttca agagcctctg gcatctcttt ctgcaaaggc 3061 acctgaatgt gtctgcgtcc ctgttagcat aatgtgagga ggtggagaca cagcccaccc 3121 ttgtgtccac tgtgacccct gttcccatgc tgacctgtgt ttcctcccca gtcatctttc 3181 ctgttccaga gaggtggggc tggatgtctc catctctgtc tcaactttac gtgcactgag 3241 ctgcaacttc ttacttccct actgaaaata agaatctgaa tataaatttg ttttctcaaa 3301 tatttgctat gagaggttga tggattaatt aaataagtca attcctggaa tttgagagag 3361 caaataaaga cctgagaacc ttccagaatc tgcatgttcg ctgtgctgag tctgttgcag 3421 gtggggtgtg gagaaggctg tggggggccg agtgtggacg gggcctgtgc ccatttggtg 3481 ttgagtccat catgggcttt atgtggttag tcctcagctg // LOCUS HUMTGFB1B 5075 bp ss-mRNA PRI 07-JUL-1990 DEFINITION Human transforming growth factor-beta 1 binding protein mRNA, complete cds. ACCESSION M34057 KEYWORDS transforming growth factor-beta 1 binding protein. SOURCE Human fibroblast, cDNA to mRNA, clone BPA13. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 5075) AUTHORS Kanzaki,T., Olofsson,A., Moren,A., Wernstedt,C., Hellman,U., Miyazono,K., Claesson-Welsh,L. and Heldin,C.-H. TITLE TGF-beta-1 binding protein: A component of the large latent complex of TGF-beta-1 with multiple repeat sequences JOURNAL Cell 61, 1051-1061 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.-H.Heldin, 08-MAY-1990. FEATURES from to/span description pept 91 4275 transforming growth factor-beta 1 binding protein precursor sigp 91 150 transforming growth factor-beta 1 binding protein signal peptide matp 151 4272 transforming growth factor-beta 1 binding protein BASE COUNT 1353 a 1122 c 1252 g 1348 t ORIGIN 1 tgaattcggc acgaggtggg ctctctcttt tcccctcttg ctcctttctt ttcttttttt 61 ctgttttttt aaaccttcca aggcaagttc atggatacta agctgatgtg tttgttgttc 121 tttttctccc tgcctccgct cctagtgagt aaccacactg gccgcatcaa ggtggtcttt 181 actccgagca tctgtaaagt gacctgcacc aagggcagct gtcagaacag ctgtgagaag 241 gggaacacca ccactctcat tagtgagaat ggtcatgctg ccgacaccct gacggccacg 301 aacttccgag tggtaatttg ccatcttcca tgtatgaatg gtggccagtg cagttcaagg 361 gacaaatgtc agtgccctcc aaatttcaca ggaaaacttt gtcagatccc agtccatggt 421 gccagcgtgc ctaaacttta tcagcattcc cagcagccag gcaaggcgtt ggggacgcat 481 gtcatccatt caacacatac cttgcctctg accgtgacta gccagcaagg agtcaaagtg 541 aaatttcctc ctaacatagt caatatccat gtgaaacatc ctcctgaagc ttccgtccag 601 atacatcagg tttcaagaat tgatggccca acaggccaga agacaaaaga agctcaacca 661 ggccaatccc aagtctcgta ccaagggctt cctgtccaga agacccagac catacattcc 721 acatactccc accagcaggt cattcctcac gtctaccccg tggctgctaa gacacagctt 781 ggccggtgct tccaggaaac cattgggtca cagtgtggca aagcgctccc tggcctttca 841 aagcaagagg actgctgtgg aactgtgggt acctcctggg gctttaacaa atgccagaaa 901 tgccccaaga aaccatctta tcatggatac aaccaaatga tggaatgcct accgggttat 961 aagcgggtta acaacacctt ttgccaagat attaatgaat gtcagctaca aggtgtatgc 1021 cctaatggtg agtgtttgaa taccatgggc agctatcgat gtacctgcaa aataggattt 1081 gggccggatc ctaccttttc aagttgtgtt cctgatcccc ctgtgatctc ggaagagaaa 1141 gggccctgtt accgacttgt cagttctgga agacagtgta tgtaccctct gtctgttcac 1201 ctcaccaagc agctctgctg ttgtagtgtg ggcaaggctg ggccacactg tgagaaatgt 1261 ccccttccag gcacagctgc ttttaaggaa atctgtcctg gtggaatggg ttatacggtt 1321 tctggcgttc atagacgcag gccaatccat caccatgtag gtaaaggacc tgtatttgtc 1381 aagccaaaga acactcaacc tgttgctaaa agtactcatc ctccacctct cccagccaag 1441 gaagagccag tggaggccct gaccttctcc cgggaacacg gggccaggag tgcggagcca 1501 gaagtggcaa ctgcaccccc tgaaaaggaa ataccttcat tggatcaaga gaaaaccaaa 1561 cttgagcctg gtcaacccca gctgtctcca ggcatttccg ctattcatct gcatccacag 1621 tttccagtag tgattgaaaa aacatcacct cctgtgcctg ttgaagtagc tcctgaagct 1681 tctacgtcta gtgccagcca agtgattgct cctactcaag tgacagaaat caatgaatgt 1741 actgtgaacc ctgatatctg tggagcagga cactgcatta acctaccagt gagatatacc 1801 tgtatatgct acgagggcta caggttcagt gaacaacaga ggaaatgtgt ggatattgat 1861 gagtgtactc aggtccaaca cctctgctcc cagggccgct gtgaaaacac cgagggaagt 1921 ttcttgtgca tttgcccagc aggatttatg gccagtgagg agggtactaa ctgcatagat 1981 gttgacgaat gcctgaggcc ggacgtctgt ggggaggggc actgtgtcaa tactgtgggg 2041 gccttccggt gtgaatactg tgacagcggg taccgcatga ctcagagagg ccgttgtgag 2101 gatattgatg aatgtttgaa tccaagcact tgtccagatg agcagtgtgt gaattctcct 2161 ggatcttacc agtgcgttcc ctgcacagaa ggattccgag gctggaatgg acagtgcctt 2221 gatgtggacg agtgcctgga accaaacgtc tgcgcaaatg gtgattgttc caaccttgaa 2281 ggctcctaca tgtgttcatg ccacaaaggc tatacccgga ctccggacca caagcactgt 2341 agagatattg atgaatgtca gcaagggaat ctatgtgtaa acgggcagtg caaaaatacc 2401 gagggctcct tcaggtgcac ctgtggacag gggtaccagc tgtcggcagc taaagaccag 2461 tgtgaagaca ttgatgaatg ccagcaccgt catctctgtg ctcatgggca gtgcaggaac 2521 actgagggct cttttcaatg tgtgtgtgac cagggttaca gagcatctgg gcttggagac 2581 cactgtgaag atatcaatga atgcttggag gacaagagtg tttgccagag aggagactgc 2641 attaatactg cagggtccta tgattgtact tgtccggatg gatttcagct agatgacaat 2701 aaaacatgtc aagatattaa tgaatgtgaa catccagggc tctgtggtcc gcaaggggag 2761 tgcctaaaca cagagggttc tttccattgt gtctgccagc agggtttctc aatctctgca 2821 gatggccgta cgtgtgaaga tattgatgaa tgtgtaaaca acactgtttg tgacagtcac 2881 gggttttgtg acaatacagc tggctccttc cgctgcctct gttatcaggg ctttcaagcc 2941 ccacaggatg ggcaagggtg tgtggatgtg aatgaatgtg aactgctcag tggggtgtgt 3001 ggtgaagcct tctgtgaaaa cgtggaaggg tccttcctgt gcgtgtgtgc tgatgaaaac 3061 caagagtaca gccccatgac tgggcagtgc cgctcccgga cctccacaga tttagatgta 3121 gatgtagatc aacccaaaga agaaaagaaa gaatgctact ataatctcaa tgacgccagt 3181 ctctgtgata atgtgttggc ccccaatgtc acgaaacaag aatgctgctg tacatcaggc 3241 gcgggatggg gagataactg cgaaatcttc ccctgcccgg tcttgggaac tgctgagttc 3301 actgaaatgt gtcccaaagg gaaaggtttt gtgcctgctg gagaatcatc ttctgaagct 3361 ggtggtgaga actataaaga tgcagatgaa tgcctacttt ttggacaaga aatctgcaaa 3421 aatggtttct gtttgaacac tcggcctggg tatgaatgct actgtaagca agggacgtac 3481 tatgatcctg tgaaactgca gtgctttgat atggatgaat gtcaagaccc cagtagttgt 3541 attgatggcc agtgtgttaa tacagagggc tcttacaact gcttctgtac tcaccccatg 3601 gtcctggatg cgtcagaaaa aagatgtata cgaccggctg agtcaaacga acaaatagaa 3661 gaaactgatg tctaccaaga tttgtgctgg gaacatctga gtgatgaata cgtgtgtagc 3721 cggcctcttg tgggcaagca gacaacgtac actgagtgct gctgtctgta tggagaggcc 3781 tggggcatgc agtgtgccct ctgccccctg aaggattcag atgactatgc tcagctgtgt 3841 aacatccccg tgacgggacg ccggcagcca tatggacggg acgccttggt tgacttcagt 3901 gaacagtata ctccagaagc cgatccctac ttcatccaag accgttttct aaatagcttt 3961 gaggagttac aggctgagga atgcggcatc ctcaatggat gtgaaaatgg tcgctgtgtg 4021 agggtccagg aaggttacac ctgcgattgc ttggatgggt atcacttgga tactgccaag 4081 atgacctgtt tcgatgtaaa tgaatgcgat gagttgaaca accggatgtc tctctgcaag 4141 aatgccaagt gcattaacac cgatggttcc tacaagtgtt tgtgtctgcc aggctacgtg 4201 ccttctgaca agccaaacta ctgcactccg ttgaataccg ccttgaattt agagaaagac 4261 agtgacctgg agtgaaacag aatctacata acctaagccc atatactctg cactgtgtaa 4321 aggaaaaggg agaaatgtat tatacttgag acattgcacc taccccggaa ggctggaaat 4381 acagaaacag catggagttg caagtcctct gaagacaatg agaggattta ggatgagccc 4441 gataggtgtg gcagaccaaa tggacatttc tctaaaaaac cagtatatat agtctgttca 4501 tatgtaaaat tcaatggaag agaggtggaa cagtgctgtt attttaaaca gaaggttgta 4561 ttattatgtt gttttgtttt ttttactatt gcttgattaa atttggcatt taaatagtgg 4621 tggaaatatt tttatataat tttcattttt tggttgtgca gttccttggc tactgttttt 4681 cttttacttc agttttttaa aaatctcaaa tgaaaaagtc ttcgatacaa tattgttaag 4741 ctgtattata agtattgtta cacagggtta tgcaattccc ggcctggagc atttttgaaa 4801 ttcaaattgt ctgtcctgtg gagcaggcag tgattttgtt ccaaaacttt gtatacacat 4861 ttggagaaaa gtactttata ttttcagtgt tttgtctgat tttaatgtcc gttcttagcc 4921 aagctgctag caggtgttaa ttggatccct ttccttcact gaaatggaag agtttataag 4981 cttacgttag tattgtaata tgtaaagtaa gcccaacaaa aatttttaaa aatttgatga 5041 tccccaatat atctaccatt gtatgttaaa taaat // LOCUS MUSPLGN 2720 bp ss-mRNA ROD 07-JUL-1990 DEFINITION Mouse plasminogen mRNA, complete cds. ACCESSION J04766 KEYWORDS plasminogen. SOURCE Mouse cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2720) AUTHORS Degen,S.J. TITLE Characterization of the cDNA coding for mouse plasminogen and localization of the gene to mouse chromosome 17 JOURNAL Genomics (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.J.Degen, 08-MAY-1990. FEATURES from to/span description pept 25 2463 plasminogen precursor sigp 25 81 plasminogen signal peptide matp 82 2460 plasminogen signal 2699 2704 poly-A signal BASE COUNT 731 a 707 c 710 g 572 t ORIGIN Chromosome 17. 1 ccggtgctgt tggccagtcc caacatggac cataaggaag taatccttct gtttctcttg 61 cttctgaaac caggacaagg ggactcgctg gatggctaca taagcacaca aggggcttca 121 ctgttcagtc tcaccaagaa gcagctcgca gcaggaggtg tctcggactg tttggccaaa 181 tgtgaagggg aaacagactt tgtctgcagg tcattccagt accacagcaa agagcagcaa 241 tgcgtgatca tggcggagaa cagcaagact tcctccatca tccggatgag agacgtcatc 301 ttattcgaaa agagagtgta tctgtcagaa tgtaagaccg gcatcggcaa cggctacaga 361 ggaaccatgt ccaggacaaa gagtggtgtt gcctgtcaaa agtggggtgc cacgttcccc 421 cacgtaccca actactctcc cagtacacat cccaatgagg gactagaaga gaactactgt 481 aggaacccag acaatgatga acaagggcct tggtgctaca ctacagatcc ggacaagaga 541 tatgactact gcaacattcc tgaatgtgaa gaggaatgca tgtactgcag tggagaaaag 601 tatgagggca aaatctccaa gaccatgtct ggacttgact gccaggcctg ggattctcag 661 agcccacatg ctcatggata catccctgcc aaatttccaa gcaagaacct gaagatgaat 721 tattgccaca accctgacgg ggagccaagg ccctggtgct tcacaacaga ccccaccaaa 781 cgctgggaat actgtgacat cccccgctgc acaacacccc cgcccccacc cagcccaacc 841 taccaatgtc tgaaaggaag aggtgaaaat taccgaggga ccgtgtctgt caccgtgtct 901 gggaaaacct gtcagcgctg gagtgagcaa acccctcata ggcacaacag gacaccagaa 961 aatttcccct gcaaaaatct ggaagagaac tactgccgga acccagatgg agaaactgct 1021 ccctggtgct ataccactga cagccagctg aggtgggagt actgtgagat tccatcctgc 1081 gagtcctcag catcaccaga ccagtcagat tcctcagttc caccagagga gcaaacacct 1141 gtggtccagg aatgctacca gagcgatggg cagagctatc ggggtacatc gtccactacc 1201 atcacaggga agaagtgcca gtcctgggca gctatgtttc cacacaggca ttcgaagacc 1261 ccagagaact tcccagatgc tggcttggag atgaactact gcaggaaccc ggatggtgac 1321 aagggccctt ggtgctacac cactgacccg agcgtcaggt gggaatactg caacctgaag 1381 cggtgctcag agacaggagg gagtgttgtg gaattgccca cagtttccca ggaaccaagt 1441 gggccgagcg actctgagac agactgcatg tatgggaatg gcaaagacta tcggggcaaa 1501 acggccgtca ctgcagctgg caccccctgc cagggatggg ctgcccagga gccccacagg 1561 cacagcatct tcaccccaca gacaaaccca cgggcagatc tggaaaagaa ctactgccga 1621 aacccagatg gggatgtgaa tggtccttgg tgctatacaa caaaccccag aaaactttat 1681 gactattgtg acatccccct gtgtgcatca gcatcatcct ttgagtgcgg gaaacctcag 1741 gtggaaccga agaaatgccc tgggagggtg gtgggtggct gcgtggccaa ccctcactcc 1801 tggccctggc aaatcagcct tagaacaaga tttaccggac agcacttctg tggcggtact 1861 ttaatagccc cagagtgggt tctgactgct gcccactgtt tggagaaatc ttcaagacct 1921 gaattctaca aggttatcct gggtgcgcac gaagaatata tccgtgggtt ggatgttcag 1981 gaaatatcag tagccaaact gatcttggag cccaacaacc gtgacattgc cctgctgaaa 2041 ctaagccgcc cagccaccat cacggataaa gtcattccag cttgtctgcc atctccaaat 2101 tacatggttg ctgaccggac aatatgttac atcaccggct ggggagagac tcaagggact 2161 ttcggtgccg gtcgtctcaa ggaggctcag ctgcctgtga ttgagaacaa ggtgtgcaac 2221 cgcgtcgagt atctgaacaa cagagtcaaa tccacggagc tctgtgccgg gcaactggct 2281 ggtggcgtcg acagctgcca aggcgacagt ggaggacctc tggtttgctt cgagaaggac 2341 aagtacattt tacaaggagt cacttcttgg ggtcttggct gtgctcgccc caataagcct 2401 ggtgtctacg ttcgtgtctc acggtttgtt gattggattg aaagggagat gaggaataac 2461 tgactaggtg gaaggccgag caaaacctct gcttactaaa gcttactgaa tatggggaga 2521 gggcttaggg tgtttggaaa aactgacagt aatcaaactg ggacactaca ctgaaccaca 2581 gcttcctgtc gcccctcagc ccctcccctt tttttgtatt attgtgggta aaattttcct 2641 gtctgtggac ttctggattt tgtgacaata gaccatcact gctgtgacct ttgttgaaaa 2701 taaactcgat acttactttg // LOCUS HUMMHEA 4938 bp ds-DNA PRI 07-JUL-1990 DEFINITION Human MHC class I lymphocyte antigen (HLA-E) (HLA-6.2) gene, complete cds. ACCESSION M21533 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; lymphocyte antigen; major histocompatibility complex. SOURCE Human lymphoblastoid (cell line B-LCL) DNA, clone HLA-6.2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4938) AUTHORS Koller,B.H., Geraghty,D.E., Shimizu,Y., DeMars,R. and Orr,H.T. TITLE A novel HLA class I gene expressed in resting T lymphocytes JOURNAL J. Immunol. 141, 897-904 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by H.T.Orr, 23-NOV-1988. FEATURES from to/span description pept 705 768 MHC HLA-E precursor, exon 1 /nomgen="HLA-E" /map="6p21.3" /hgml_locus_uid="LT0100B" 899 1168 MHC HLA-E precursor, exon 2 1411 1686 MHC HLA-E precursor, exon 3 2308 2583 MHC HLA-E precursor, exon 4 2708 2824 MHC HLA-E precursor, exon 5 3575 3607 MHC HLA-E precursor, exon 6 3712 3752 MHC HLA-E precursor, exon 7 sigp 704 768 MHC HLA-E signal peptide matp 899 1168 MHC HLA-E 1411 1686 MHC HLA-E 2308 2583 MHC HLA-E 2708 2825 MHC HLA-E 3576 3607 MHC HLA-E 3712 3749 MHC HLA-E pre-msg < 705 4479 MHC HLA-E mRNA and introns IVS 769 898 MHC HLA-E, intron A IVS 1169 1410 MHC HLA-E, intron B IVS 1687 2307 MHC HLA-E, intron C IVS 2584 2707 MHC HLA-E, intron D IVS 2825 3574 MHC HLA-E, intron E IVS 3608 3711 MHC HLA-E, intron F IVS 3755 3919 MHC HLA-E, intron G BASE COUNT 1065 a 1332 c 1380 g 1161 t ORIGIN Chromosome 6p21.3. 1 ggcatgatgg acgcgattag ccggtaacgg atacatgaag catgaggtct ttgaagtcat 61 tggcggctct ctggtgtacg agtctcagct ggttgactac aaggcactct gaagcagatc 121 ctgttttttg ttttgttttg ttttttgttc tgtttttccc cctagacatc tcactctgct 181 gcccaggctg gagtgcagtg gtgtgatctc ggctcactgc aaccatccat cctcctgggt 241 tcaagtagat tctccatgcc tcagcctccc aagtagccag gattacagcc atgcgccacc 301 acgccggcta attttttgga cttttagtag agacagggtt tctccatatt ggtcgggctg 361 gtctcgaact cccaacctca ggtgatcagc ccgccttggc ctcccaaagt gctgagatta 421 caggcgtgag ccaccgcgcc cagccaggac taatttctaa gagtgtgcag agataccgaa 481 acctaaaagt ttaagaactg ctgattgctg ggaaactctg cagtttcccg ttcctctcgt 541 aacctggtca tgtgtccttc ttcctggata ctcatgacgc agactcagtt ctcattccca 601 atgggtgtcg ggtttctaga gaagccaatc agcgtcgcca cgactcccga ctataaagtc 661 cccatccgga ctcaagaagt tctcaggact cagaggctgg gatcatggta gatggaaccc 721 tccttttact cctctcggag gccctggccc ttacccagac ctgggcgggt gagtgcgggg 781 tcgggatgga aacggcctct accgggagta gagaggggcc ggcccggcgg gggcgaagga 841 ctcggggagc cgcgccggga ggagggtcgg gccgatctca gcccctcctc gcccccaggc 901 tcccactcct tgaagtattt ccacacttcc gtgtcccggc ccggccgcgg ggagccccgc 961 ttcatctctg tgggctacgt ggacgacacc cagttcgtgc gcttcgacaa cgacgccgcg 1021 agtccgagga tggtgccgcg ggcgccgtgg atggagcagg aggggtcaga gtattgggac 1081 cgggagacac ggagcgccag ggacaccgca cagattttcc gagtgaacct gcggacgctg 1141 cggcgctact acaatcagag cgaggccggt gagtgacccc ggccaggagc aggtcacgac 1201 ccctccccat cccccacgga cggcgcgggt cccctcgaat cttcgggtcc cagattcacc 1261 ccaaggctgc ggaacccgcc cagaccctag accggggaga gtctcaggcg cctttacccg 1321 gttctttttc agtttaggcc aaaatgccca cagggtggtg gcgacggggg cggggcttgg 1381 tgggcgggac tgactaaggg gcggggccag ggtctcacac cctgcagtgg atgcatggct 1441 gcgagctggg gcccgacagg cgcttcctcc gcgggtatga acagttcgcc tacgacggca 1501 aggattatct caccctgaat gaggacctgc gctcctggac cgcggtggac acggcggctc 1561 agatctccga gcaaaagtca aatgatgcct ctgaggcgga gcaccagaga gcctacctgg 1621 aagacacatg cgtggagtgg ctccacaaat acctggagaa ggggaaggag acgctgcttc 1681 acctgggtaa gagggtccac agggctactc tcccatctcc ttcttgggct aggactgtgc 1741 ccacagctga cagacctcaa acagtagaag aaacagggat ggaggccaga ataccactcc 1801 tcccttggat caggagaggg agctgtcacc tgaggtacag gagatcctat accacagagt 1861 gactctctta aagggccaga cctctctcag gggcaattaa ggaatctagt ctcgctggag 1921 attccatcct tcagatgaac tgatgagcag ttctctttga ctcccagtat taggaatcac 1981 gggggagttt ctctcgtgcc tgattctcag ccccacacca agagtttttg gaggtctgac 2041 tccagctttt ctcagtcact cagcatccac acaggccagg accagaaatc ccttttcacc 2101 ttctaccctg ggctagctca tcccgattct agaactttcc aaggaataag aggctatccc 2161 agatccctaa gtccaggctg gtgtcaaggt tttgtcctct tctcctacta taattgtcct 2221 cttccttctc aggatggtca catgggtgct gctggagtgt cccatgagag atacaaagtg 2281 cctgaatttt ctgactcttc ccctcagagc ccccaaagac acacgtgact caccacccca 2341 tctctgacca tgaggccacc ctgaggtgct gggccctggg cttctaccct gcggagatca 2401 cactgacctg gcagcaggat ggggagggcc atacccagga cacggagctc gtggagacca 2461 ggcctgcagg ggatggaacc ttccagaagt gggcagctgt ggtggtgcct tctggagagg 2521 agcagagata cacgtgccat gtgcagcatg aggggctacc cgagcccgtc accctgagat 2581 ggagtaagga gggggatggg aggtcatgtc tcttctcagg gaaagcggga gcccttctgg 2641 agcccttccg cagggtcagg gctgaggcct gggggtcagg gccccttacg ttcccctctt 2701 ttcccagagc cggcttccca gcccaccatc cccatcgtgg gcatcattgc tggcctggtt 2761 ctccttggat ctgtggtctc tggagctgtg gttgctgctg tgatatggag gaagaagagc 2821 tcaggtgggg aagggagaag ggtggggtct gagttttctt gtcccactgg gtgtttcaag 2881 ccctaggtaa aagtgtgtcc tgcctcgtta ctgggaagca ccatccacac acacgagcct 2941 acccagcctg gggccctgtg tgccagcacc tactcttttt ttttgagacg gagtcttggc 3001 tctgtcaccc aggctggagt gcaatggcgt ggtttcagct cactgcaacc tccgcctccc 3061 aggttcaagc aattctcctg cctcagcctc cctagtagct gggactacac atgcgtgcca 3121 ccacacctgg ctaatttttt tttttgtatt tttagtggag atggggtttc actatgttgg 3181 ccaggctggt ctcgaactcc tgactttgtg atctgcctgc ctcggcctcc caaagtgctg 3241 ggattacagt cgtgagccac cgcacccagc cgcacctact cttttgtaaa gcacctgtga 3301 caatgaagga cagatttatc accttgacga ttgtggtgat ggggacctga tcccagcagt 3361 cacaggtcac aggggaaggt ccctgctgaa gacagacctc agaagggcag ttgatccagg 3421 acccacacct gctttcttca cgtttcctga tcctgccctg ggtctgcagt cacagttcag 3481 gaaacttctc tgggatccaa aactaggagg ttcctctagg accttatggc cctgcctcct 3541 ccctggcccc tcacaggaca ttttcttcca acaggtggaa aaggagggag ctactctaag 3601 gctgagtgta agtgcggggc gggagcgtgg aggagctcgc ccaccctata attcctcctg 3661 caccacatct cctgtgggct ctgaccaggt cttgtttttg ttctacccca gggagcgaca 3721 gtgcccaggg gtctgagtct cacagcttgt aaaggtgaga ttctgggggt ctgaagtggg 3781 tggagggtgg ggcagagggg acaggactgg gttgtgggga ttttttgatt cagaattttt 3841 gagtgtgtgg tgggctgttc agagtgtcat cacttaccgt gactgacctg aatttgttca 3901 tgactatttt cttctgtagc ctgagacagc tgccttgtgt gcgactgaga tgcacagctg 3961 ccttgtgtgc gactgagatg caggatttcc tcacgcctcc cctatgtgtc ttaggggact 4021 ctggcttctc tttttgcaag ggcctctgaa tctgtctgtg tccctgttag cacaatgtga 4081 ggaggtagag aaacagtcca cctctgtgtc taccatgacc cccttcctca cactgacctg 4141 tgttccttcc ctgttctctt ttctattaaa aataagaacc tgggcagagt gcggcagctc 4201 atgcctgtaa tcccagcact tagggaggcc gaggagggca gatcacgagg tcaggagatc 4261 gaaaccatcc tggctaacac ggtgaaaccc cgtctctact aaaaaataca aaaaattagc 4321 tgggcgcaga ggcacgggcc tgtagtccca gctactcagg aggcggaggc aggagaatgg 4381 cgtcaacccg ggaggcggag gttgcagtga gccaggattg tgcgactgca ctccagcctg 4441 ggtgacaggg tgaaacgcca tctcaaaaaa taaaaattaa aaaataaaaa aagaacctgg 4501 atctcaattt aatttttcat attcttgcaa tgaaatggac ttgaggaagc taagatcata 4561 gctagaaata cagataattc cacagcacat ctctagcaaa tttagcctat tcctattctc 4621 tagcctattc cttaccacct gtaatcttga ccatatacct tggagttgaa tattgttttc 4681 atactgctgt ggtttgaatg ttccctccaa cactcatgtt gagacttaat ccctaatgtg 4741 gcaatactga aaggtggggc ctttgagatg tgattggatc gtaaggctgt gccttcattc 4801 atgggttaat ggattaatgg gttatcacag gaatgggact ggtggcttta taagaagagg 4861 aaaagagaac tgagctagca tgcccagccc acagagagcc tccactagag tgatgctaag 4921 tggaaatgtg aggtgcag // LOCUS HUMGSTPI1 1724 bp ds-DNA PRI 07-JUL-1990 DEFINITION Human glutathione S-transferase pi (GST-pi) gene, exons 1-5. ACCESSION X08094 M33082 KEYWORDS glutathione S-transferase pi. SEGMENT 1 of 3 SOURCE Human, cell line HPB-ALL, lymphoid DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1724) AUTHORS Cowell,I.G. JOURNAL Unpublished (1988) see COMMENT for author address STANDARD simple automatic REFERENCE 2 (bases 1 to 1724) AUTHORS Cowell,I.G., Dixon,K.H., Pemble,S.E., Ketterer,B. and Taylor,J.B. TITLE The structure of the human glutathione S-transferase pi gene JOURNAL Biochem. J. 255, 79-83 (1988) STANDARD simple staff_entry COMMENT See X08094 - X08096 for GST pi gene sequence; see X06547 for GST mRNA sequence. EMBL features not translated to GenBank features: key from to description SITE 116 123 TRE-box SITE 129 134 Sp1 binding site SITE 138 143 Sp1 binding site PRM 154 157 TATA-box MSG 182 214 Exon 1 MSG 499 534 Exon 2 MSG 819 925 Exon 3 MSG 1042 1129 Exon 4 MSG 1488 1591 Exon 5 [1] Author address Cowell I.G., University College and Middlesex, School of Medicine, Windeyer Building, Cleveland Street, London, W1P 6DB, United Kingdom. Submitted (13-JUL-1988) on tape to the EMBL data library by: FEATURES from to/span description pept 214 214 glutathione S-transferase pi, exon 1 /hgml_locus_uid="LQ0085R" /map="11q13-qter" /nomgen="GST3" 499 534 glutathione S-transferase pi, exon 2 819 925 glutathione S-transferase pi, exon 3 1042 1129 glutathione S-transferase pi, exon 4 1488 + 1591 glutathione S-transferase pi, exon 5 pre-msg 182 > 1724 GST3 mRNA and introns IVS 215 498 GST3 intron A IVS 535 818 GST3 intron B IVS 926 1041 GST3 intron C IVS 1130 1487 GST3 intron D IVS 1592 > 1724 GST3 intron E revision 1532 1541 atgaatgac in [1]; gtgaatgacg in [2] revision 1707 1707 n in [1]; g in [2] BASE COUNT 272 a 598 c 541 g 313 t ORIGIN 1 tcgctgcgac tccgggactc cagggcgccc ctctcggccg acgcccgggg tgcagcggcg 61 ccgggctggg gccggcggga gtccgcggga ccctccagaa gagcggccgg cgccgtgact 121 cagcactggg gcggagcggg gcgggaccac ccttataagg ctcggaggcc gcgaggcctt 181 cgctggagtt tcgccgccgc agtcttcgcc accagtgagt acgcgcggcc cgcgtccccg 241 gggatggggc tcagagctcc cagcatgggg ccaacccgca gcatcaggcc cgggctcccg 301 gcaggctcct cgcccacctc gagacccggg acggggccta ggggacccag gacgtcccca 361 gtggccgtta gcggctttca gggggcccgg agcgcctcgg ggagggatgg gaccccgggg 421 gcgggagggg ggggcagctc cactcaccgc gccttggcat cctccccggg ctccacaaat 481 tttctttgtt cgctgcagtg ccgccctaca ccgtggtcta tttcccagtt cgaggtagga 541 gcatgtgtct ggcaggaagg gaggcaggcc tggctgcagc ccacagcccc tcgcccaccc 601 ggagagatcc gaaccccctt atccctccgt cgtgtggctt ttaccccggg cctccttcct 661 gttccccgcc tctcccgcca tgcctgctcc ccgccccagt gttgtgtaaa atcttggagg 721 aacctgtttc ctgttccctc ctgcactcct gaccctcccg ggttgctgcg aggcggagtg 781 gcccggtccc cacatctcgt acttctccct ccccgcaggc cgctgcgcgg ccctgcgcat 841 gctgctggca gatcagggcc agagctggaa ggaggaggtg gtgaccgtgg agacgtggca 901 ggagggctca ctcaaagcct cctgcgtaag tgaccatgcc cgggcaaggg gagggggtgc 961 tgggccttag ggggctgtgc actaggatcg ggggacgccc aagctcagtg ccccttccct 1021 gagccatgcc tcccccaaca gctatacggg cagctcccca agttccagga cggagacctc 1081 accctgtacc agtccaatac catcctgcgt cacctgggcc gcacccttgg tgagtcttaa 1141 cctcaagtcc aggagtcagg caagcctctg cccccggagc ccttttgttt aaatcaggcc 1201 ccgcagccct ctggagggga ggaaactgag acccactgag gttacgtagt ttgccaagtc 1261 aagcctggtt gtttgcaatc cttgcctgtg ccaggctggc tcccaggtgt caggtgagct 1321 ctgagcacct ggtgtgtggc agtctctcat ccttccacgc acatcctctt cccctcctcc 1381 caggctgggg ctcatcagac agaaccccct tggcctggcc catccccaat gactgtgtgt 1441 gatcagcgcc cagtcaaccc ggcctgctcc cctccaccca accccagggc tctatgggaa 1501 ggaccagcag gaggcagccc tggtggacat ggtgaatgac ggcgtggagg acctccgctg 1561 caaatacatc tccctcatct acaccaacta tgtgtgagca tctgcaccag gttgggcact 1621 gggggctgaa caaagaaagg ggcttcttgt gccctcaccc cccttacccc tcaggtggct 1681 tgggctgacc ccttcttggg tcagggggca ggggctgggt cagg // LOCUS HUMGSTPI2 591 bp ds-DNA PRI 07-JUL-1990 DEFINITION Human glutathione S-transferase pi (GST-pi) gene, exon 6. ACCESSION X08095 M33083 KEYWORDS glutathione S-transferase pi. SEGMENT 2 of 3 SOURCE Human, cell line HPB-ALL, lymphoid DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 591) AUTHORS Cowell,I.G. JOURNAL Unpublished (1988) see COMMENT for author address STANDARD simple automatic REFERENCE 2 (bases 1 to 591) AUTHORS Cowell,I.G., Dixon,K.H., Pemble,S.E., Ketterer,B. and Taylor,J.B. TITLE The structure of the human glutathione S-transferase pi gene JOURNAL Biochem. J. 255, 79-83 (1988) STANDARD simple staff_entry COMMENT See X08094 - X08096 for GST pi gene sequence; see X06547 for GST mRNA sequence. EMBL features not translated to GenBank features: key from to description TRANSCR <1 >591 primary transcript MSG 470 577 Exon 6 [1] Author address Cowell I.G., University College and Middlesex, School of Medicine, Windeyer Building, Cleveland Street, London, W1P 6DB, United Kingdom. Submitted (13-JUL-1988) on tape to the EMBL data library by: FEATURES from to/span description pept + 470 + 577 glutathione S-transferase pi, exon 6 /hgml_locus_uid="LQ0085R" /map="11q13-qter" /nomgen="GST3" IVS < 1 469 GST3 intron E IVS 578 > 591 GST3 intron F BASE COUNT 131 a 111 c 225 g 124 t ORIGIN About 300 bp after segment 1. 1 tgggaccagt cagcagaggc agcgtgtgtg cgcgtgcgtg tgcgtgtgtg tgcgtgtgtg 61 tgtgtacgct tgcatttgtg tcgggtgggt aaggagatag agatgggcgg gcagtaggcc 121 caggtcccga aggccttgaa cccactggtt tggagtctcc taagggcaat gggggccatt 181 gagaagtctg aacagggctg tgtctgaatg tgaggtctag aaggatcctc cagagaagcc 241 agctctaaag cttttgcaat catctggtga gagaacccag caaggatgga caggcagaat 301 ggaatagaga tgagttggca gctgaagtgg acaggatttg gtactagcct ggttgtgggg 361 agcaagcaga ggagaatctg ggactctggt ggtctggcct ggggcagacg ggggtgtctc 421 aggggctggg agggatgaga gtaggatgat acatggtggt gtctggcagg aggcgggcaa 481 ggatgactat gtgaaggcac tgcccgggca actgaagcct tttgagaccc tgctgtccca 541 gaaccaggga ggcaagacct tcattgtggg agaccaggtg agcatctggc c // LOCUS HUMGSTPI3 499 bp ds-DNA PRI 07-JUL-1990 DEFINITION Human glutathione S-transferase pi (GST-pi) gene, exon 7. ACCESSION X08096 M33084 KEYWORDS glutathione S-transferase pi. SEGMENT 3 of 3 SOURCE Human, cell line HPB-ALL, lymphoid DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 499) AUTHORS Cowell,I.G. JOURNAL Unpublished (1988) see COMMENT for author address STANDARD simple automatic REFERENCE 2 (bases 1 to 499) AUTHORS Cowell,I.G., Dixon,K.H., Pemble,S.E., Ketterer,B. and Taylor,J.B. TITLE The structure of the human glutathione S-transferase pi gene JOURNAL Biochem. J. 255, 79-83 (1988) STANDARD simple staff_entry COMMENT See X08094 - X08096 for GST pi gene sequence; see X06547 for GST mRNA sequence. EMBL features not translated to GenBank features: key from to description TRANSCR <1 ? primary transcript MSG 17 ? Exon 7 [1] Author address Cowell I.G., University College and Middlesex, School of Medicine, Windeyer Building, Cleveland Street, London, W1P 6DB, United Kingdom. Submitted (13-JUL-1988) on tape to the EMBL data library by: FEATURES from to/span description pept + 17 205 glutathione S-transferase pi exon 7 /hgml_locus_uid="LQ0085R" /map="11q13-qter" /nomgen="GST3" IVS < 1 16 GST3 intron F signal 260 265 poly-A signal revision 127 128 tc in [1]; cg in [2] BASE COUNT 114 a 140 c 141 g 104 t ORIGIN About 200 bp after segment 2. 1 ctggcctgcc ctgcagatct ccttcgctga ctacaacctg ctggacttgc tgctgatcca 61 tgaggtccta gcccctggct gcctggatgc gttccccctg ctctcagcat atgtggggcg 121 cctcagcgcc cggcccaagc tcaaggcctt cctggcctcc cctgagtacg tgaacctccc 181 catcaatggc aacgggaaac agtgagggtt ggggggactc tgagcgggag gcagagtttg 241 ccttcctttc tccaggacca ataaatttct aagagaggct actatgagca ctgtgtttcc 301 tgggacgggc ttagggcgtt ctcagcctcg aggtcggtgg gagggcagag cagaggacta 361 gaaaacagct cctccagcca ccgtcagtgg cttctggagc cctcagcctg gctgtgttta 421 ctcaaccatc acaaactaga agaggaagaa aaaaaaagag agagagtaca agagaaataa 481 caatggtcct gtcctggag //