Path: utzoo!attcan!uunet!bionet!daemon
From: GenBank-Updates@genbank.bio.net
Newsgroups: bionet.molbio.genbank.updates
Subject: Database Update
Message-ID: <9004062123.AA28158@life.lanl.gov.LANL.GOV>
Date: 6 Apr 90 21:23:42 GMT
Sender: daemon@genbank.BIO.NET
Distribution: bionet
Lines: 660
Approved: lear@genbank.bio.net

Checksum: 65203 44

LOCUS       HUMBAT2A     6704 bp ss-mRNA             PRI       18-JAN-1990
DEFINITION  Human HLA-B-associated transcript 2 (BAT2) mRNA, complete cds.
ACCESSION   M33509 M31293
KEYWORDS    class III gene; major histocompatibility complex; proline-rich
            protein.
SOURCE      Human T-cell line HPB-All, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 6704)
  AUTHORS   Banerji,J., Sands,J., Strominger,J.L. and Spies,T.
  TITLE     A gene pair from the human major histocompatibility complex
            encodes large proline-rich proteins with multiple repeated
            motifs and a single ubiquitin-like domain
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2374-2378 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly
            submitted by J.Banerji, 11-JAN-1990, for release after
            publication.
FEATURES       from  to/span    description
    pept        102   6530      HLA-B-associated transcript 2 (BAT2)
    mRNA          1   6704      BAT2 mRNA
    signal     6692   6697      poly-A signal
BASE COUNT     1435 a   2224 c   1897 g   1148 t
ORIGIN      Chromosome 6p21.3.
1 cctaggcccg ggtcccggat ccccgcgcac ccggccaggc tctggcacgt tttgggggag 61 gtgcctgcag gacccaacat actcaatgag cttccagcgc aatgtccgat cgctcggggc 121 cgactgccaa gggaaaggat ggaaagaagt attcctcgct caacctgttt gatacgtata 181 agggcaagtc cttagagatc cagaaacccg cctgttgccc ctcgccatgg cctgcagagt 241 ctcgggaaag ttgccattgc ccggcgtatc gacctccagc caaccttcca agcctgaaag 301 ccgagaacaa aggcaatgac cccaatgtct cactagtgcc aaaagacgga acaggatggg 361 caagcaaaca ggagcagtcc gaccccaaga gttccgatgc ctcaaccgct cagccgccgg 421 aatcgcagcc actgccggct tcacagacgc ctgcctccaa ccagccgaaa cgacccccag 481 cagcccccga gaacactcct ttggttccaa gcggggtaaa gtcctgggca caagccagcg 541 tcacccatgg agcacatgga gatggtggaa gggcatcaag cctactgtca cgattctctc 601 gagaggaatt tccgaccctg caggcggctg gcgaccagga caaggctgcc aaggaaaggg 661 agtctgccga acagtcgtct gggcccggac caagcctccg cccccaaaat tctacaactt 721 ggagggacgg aggtgggcgt ggccctgatg agctggaggg cccggactcc aaacttcatc 781 atggtcatga tccccggggt gggctacagc cttcaggccc accccagttc cctccctacc 841 gcggaatgat gccgcctttc atgtatcccc catatctccc gttccctccg ccctatggac 901 cccaggggcc ttaccgatac cccactcctg atgggcccag ccgttttccc cgtgtggcgg 961 gcccccgagg ctcagggcca ccaatgcgct tagtagagcc tgtgggtcgt ccctctattc 1021 tcaaagagga taatctcaaa gagtttgatc agttggatca ggagaatgat gatggttggg 1081 caggggccca tgaagaggtt gactacactg aaaagctcaa gttcagcgat gaggaagatg 1141 ggcgagactc tgatgaggag ggagctgagg gccacaggga ttcccaatca gcttctggtg 1201 aggaacggcc ccctgaagca gatggcaaaa agggcaactc ccccaacagc gaaccgccca 1261 ctcctaagac ggcctgggca gaaacctctc ggcctccaga gacagagccg ggacctcctg 1321 ccccaaagcc tcccctaccc cctggggact acccagatcg tgggggtcct ccctgcaagc 1381 ccccagcacc tgaagatgag gatgaggcat ggcggcagcg acgaaagcag tcgtcatctg 1441 agatttccct ggcagtggag cgggcccggc gacggcgaga agaagaggag cggcgcatgc 1501 aagaagagcg ccgggcagcc tgtgctgaga agctcaagcg actcgatgaa aagtttgggg 1561 cacctgacaa gcggctcaaa gcagagcctg ctgccccacc tgctgcccct tctaccccag 1621 ccccaccacc tgcagtccct aaagaactcc ctgcacctcc agctccacct ccagcatcag 1681 ccccaacacc agagacagaa 
cctgaagagc cagcacaggc ccctcctgcc caatctactc 1741 ctactccagg tgtggctgcg gctcccactc tggtgagtgg tggtggcagt accagtagca 1801 ccagcagtgg cagcttcgaa gccagcccag tggaaccaca actgccctca aaagagggtc 1861 ctgaaccacc agaagaggtt cctcctccta ccacaccccc agttccaaag gtggaaccca 1921 agggtgatgg gattggtccc acccgccagc cccctagtca gggcttgggc taccccaaat 1981 atcagaagtc gttgcctcct cgtttccagc ggcagcagca ggagcagctc ctgaagcagc 2041 agcagcagca ccagtggcag cagcatcaac agggctctgc ccctcctacc ccagtgcccc 2101 catcaccacc acagcctgtg accctggggg ctgtgccagc tccacaggct ccacccccgc 2161 cccccaaggc cctgtaccca ggtgctctgg gccggccccc acccatgccc ccaatgaact 2221 ttgatccccg atggatgatg attcctcctt atgtggaccc ccggctcctc cagggtcgtc 2281 cccctctaga gttctaccct cctggtgtgc atccctctgg cctagttccc cgagagcgtt 2341 cagacagtct ggggctcagc tcagagccat ttgaccgtca tgcacctgct atgttacggg 2401 aacggggcac tccaccggtg gatccaaagt tggcctgggt aggagatgtc ttcaccgcca 2461 cacccgctga accccgccca cttacctcac ctctgcgcca ggctgcggat gaggatgaca 2521 aggggatgag gagcgagact cctccagtac ctcccccacc accctatctg gccagttatc 2581 caggctttcc tgagaatgga gcccctgggc ccccaatctc tcgctttcct ctggaggaac 2641 cagggccccg tccactcccc tggcccccag gcagtgatga agtggccaag atacaaactc 2701 caccacccaa gaaggagccc cctaaggagg agactgcaca gctgacgggg ccagaagcag 2761 gccgaaagct gcccgcgagt cggagtggag caggcccccc accaccacgc agagagagtc 2821 gcacagagac ccgctggggc cctcgtccag ggagcagtcg tcgtggaatc cctccagagg 2881 agccaggggc cccaccccgc cgggctgggc ctataaagaa acctccacca cctacaaaag 2941 tagaagagct gcctcccaag cccctcgaac agggggatga aacccccaaa cccccaaagc 3001 cagacccact caagataacc aaggggaagc tagggggccc caaggagacc ccacccaatg 3061 gaaatctttc ccctgcccca aggcttcgga gggactattc gtatgaaaga gtgggtccta 3121 cctcttgccg gggtcggggc cgaggcgagt attttgccag agggaggggt tttcggggga 3181 cctatggggg acgagggcgg ggaggccaag cgaattccgc agttaccgag agtttcgagg 3241 agatgatggg cgtggaggtg ggacaggggg accaaaccac cctcctgctc cccgaggccg 3301 ccatgccagc gagacacgga gcgagggttc agagtatgag gaaatcccca agcggtgccg 3361 gcagcggggc tcagaaacag gcagcgagac 
ccatgagagt gatctggctc cttcagacaa 3421 ggaggctccc acacccaagg agggaacact cacccaggtc ctctcgctcc cccaccacca 3481 ggagccccac ccttcaccga gcgccagccc gcttcacgtg cccgggggtc ggcgagtctt 3541 cactcccaga gggtgccatc tcgccggggc cgaggaggag ggaggcccct cctcaagttt 3601 gcccaggctg gagccctcca gccaagtctc tggctcccaa gaaacctccc acaggccctt 3661 tgccaccaag taaggagcct ttgaaagaga agttgatccc agggcctctg tcccctgtgg 3721 cgcgcggagg cagcaatgga ggtagcaatg tgggcatgga agatggggag cgaccccgaa 3781 ggaggcgaca tgggagggct cagcagcagg ataaaccgcc tcgtttccgg aggctgaagc 3841 aggaacggga gaatgccgca agggggtctg agggcaagcc ctccctaacc cttccagcct 3901 ccgctcctgg acctgaggag gccctcacaa cagtcacagt ggccccagca cctccgcggg 3961 cagctgccaa gtctcctgat ctgtcaaacc agaactcaga ccaagccaat gaggaatggg 4021 agactgcatc agagagcagt gacttcacca gtgagcgccg aggggacaaa gaggcacccc 4081 caccagtact gctgacaccc aaggctgtgg gaactcctgg gggaggtgga ggtggagccg 4141 taccaggtat ttcagccatg tcccgcggag atctgagcca gagagccaag gatttgagta 4201 aacggagctt ctcaagtcag cggccaggca tggaacggca gaatcggcgc cctggcccag 4261 ggggcaaggc tggcagcagt ggcagcagca gtggaggagg cggtgggggt cctggaggaa 4321 ggaccgggcc aggacgaggc gacaagagga gctggccctc tcccaagaac cgaagtcgtc 4381 ctccagagga gcgtcccccg gggcttcccc tgcctccccc acctcccagc agttctgctg 4441 tcttccgcct ggaccaagtt atccacagca accctgctgg catccaacag gctctggccc 4501 agcttagtag ccgtcaaggg agtgtaactg caccaggggg tcatccaagg cacaagcctg 4561 ggcctcccca agcccctcag ggcccctctc ctaggccccc aacccgatac gagccccaga 4621 gggtcaacag cggcctcagt tctgaccccc actttgagga gccggggcca atggtgagag 4681 gggtgggtgg gactcctcgg gactctgccg gggttagtcc ctttccccct aaacgtcggg 4741 agcggcctcc cagaaaacca gagctgctac aggaggaatc tttgccacct cctcatagct 4801 ctggattctt gggctctaag cctgagggcc caggccctca ggcagagtcc agagatacag 4861 gcacagaggc cctgacccct cacatctgga accgtttaca tactgccact agccgaaaga 4921 gttaccggcc cacgtccatg gagccttgga tggagcccct gagtcctttt gaggatgtgg 4981 ctggcacaga aatgagtcag tctgacagtg gggtggacct gagtggggat tctcaggtgt 5041 catcaggtcc ctgcagccag cgaagttccc ctgatggagg 
actcaagggg gcagcagagg 5101 gaccccccaa gaggcctgga ggctcctcac ccctgaatgc tgttccttgt gagggtccac 5161 ctggctctga acctcctagg agaccaccac ctgcccccca cgatggggac agaaaggagc 5221 tgccccggga gcagcctctg ccccctggcc ccattggcac agaacgatca cagcgtacag 5281 accgaggcac agagcctggc cccattcggc catcccatcg acctggtccc ccagtccagt 5341 ttggcactag tgacaaggac tcagacttac gcctagtggt aggagacagc ttgaaagcag 5401 agaaggagct aacagcatca gtcactgagg ccattcctgt atcacgagac tgggagctgc 5461 ttcccagtgc tgctgcctct gctgagccac aatccaagaa cctggattct gggcactgtg 5521 tcccggagcc cagctcctca ggccagcgcc tgtatcctga ggttttctat ggcagtgctg 5581 ggccttccag ttctcagatc tctgggggga gccatggact ctcaattaca tccaaacagt 5641 ggaggcttcg ccctgggaca ccctcactgc acccttacag atcacagccc ctatacctac 5701 ccccgggccc agcccctccc tcagcactgc tctctggggt agctctcaag ggccagtttc 5761 tggatttctc cacaatgcaa gctacagagc tggggaagtt gccggctgga ggagttctct 5821 accctccacc ttccttcctc tactctccgg ctttctgccc cagtcctttg cctgacacat 5881 cgttgcttca ggtacgccag gatctgccat ccccttcgga tttttattct actcctctgc 5941 agcctggtgg ccaaagtggc tttctccctt caggggctcc tgcccagcag atgcttctac 6001 ccatggtaga ctcacagctg cctgtggtga actttggctc cctgccgcca gcaccacctc 6061 ctgccccacc tcccctttct ctgttacctg tgggccctgc tctgcagccc cccagcctgg 6121 ctgtgcggcc cccacctgct cctgctactc gggtgctgcc ttcacctgcc aggcccttcc 6181 ccgctagctt ggggcgagca gagctgcatc cagtggaact aaagccgttc caggattatc 6241 aaaaactgag cagcaacctt gggggacctg gatcatcacg gactccccca actggaaggt 6301 ccttctctgg cctcaattcc cgtctcaagg ccacgccttc cacctacagt ggagtcttcc 6361 gcacccagcg cgtcgacctt taccagcagg cctccccacc agatgccctg cgctggatac 6421 ctaagccttg ggagcggaca gggccgccac ctcgagaagg gccctcccga cgggcagagg 6481 agcctgggtc ccgaggggac aaggagcctg ggttgccccc accccgctga gggagttcct 6541 cttgccccct acccccgggg cttgtatata gattataaat atataagggg gaaaggggtg 6601 ggcggggagg ggttgtgggg ctggggcctc acttcccctc ctcccccttc ccctggtccc 6661 ctgtccctgg ggctgtttgt taaaaaagag taataaaagg attt // LOCUS HUMBAT2B1 336 bp ds-DNA PRI 18-JAN-1990 DEFINITION Human 
HLA-B-associated transcript 2 (BAT2) gene, 5' flank.
ACCESSION   M33510 M31293
KEYWORDS    class III gene; major histocompatibility complex; proline-rich
            protein.
SEGMENT     1 of 4
SOURCE      Human T-cell line MANN DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (sites; for [2])
  AUTHORS   Banerji,J., Sands,J., Strominger,J.L. and Spies,T.
  TITLE     A gene pair from the human major histocompatibility complex
            encodes large proline-rich proteins with multiple repeated
            motifs and a single ubiquitin-like domain
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2374-2378 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 336)
  AUTHORS   Banerji,J.
  JOURNAL   Unpublished (1990) 7 Divinity Ave., Cambridge, MA 02138
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by J.Banerji, 11-JAN-1990, for release after
            publication.
BASE COUNT      108 a     60 c     65 g    101 t      2 others
ORIGIN      Chromosome 6p21.3.
        1 tctagaatcg ggtagtaaga gacaaaggag ggtaacagta ctgcatttca caaaatgaaa
       61 cccattgtta agaaattaca aattcccaat aatttcaaat ataaaaattt attcatgaaa
      121 attataggtt ataaaattaa atgtccgtct tagtcgatgg ttgcccatat tttgatgaac
      181 gagtcattcc tagcctatct ttgttcaaat gatttgcata cttatgcaaa taggtagaac
      241 tgcccgaaga atgcctacnt gcgtggtgcg gacgaaacgc ttgccgggsc ctttggattg
      301 gtctgtctag ccacctcatt tgcatgacgt aatata
//
LOCUS       HUMBAT2B2     188 bp ds-DNA             PRI       18-JAN-1990
DEFINITION  Human HLA-B-associated transcript 2 (BAT2) gene, 5' end.
ACCESSION   M33511 M31293
KEYWORDS    class III gene; major histocompatibility complex; proline-rich
            protein.
SEGMENT     2 of 4
SOURCE      Human T-cell line MANN DNA, and T-cell line HPB-All, cDNA to
            mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 154 to 188)
  AUTHORS   Banerji,J., Sands,J., Strominger,J.L. and Spies,T.
  TITLE     A gene pair from the human major histocompatibility complex
            encodes large proline-rich proteins with multiple repeated
            motifs and a single ubiquitin-like domain
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2374-2378 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 188)
  AUTHORS   Banerji,J.
  JOURNAL   Unpublished (1990) 7 Divinity Ave., Cambridge, MA 02138
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by J.Banerji, 11-JAN-1990, for release after
            publication.
FEATURES       from  to/span    description
    pre-msg     154    188      BAT2 mRNA
BASE COUNT       20 a     64 c     59 g     18 t     27 others
ORIGIN      About 500 bp after segment 1; chromosome 6p21.3.
        1 gtgcthhhng gggcggcggt tccgcggatg ggccgttagt cgggstcagc cgcggagtga
       61 gngagggaga cgnnaggasg aacccggcca tccgccgcca tcctcccccg ccccaccgcc
      121 atccgtcccg gggacnnnnn nnnnnnnnnn nnncctaggc ccgggtcccg gatccccgcg
      181 cacccggc
//
LOCUS       HUMBAT2B3    3090 bp ds-DNA             PRI       18-JAN-1990
DEFINITION  Human HLA-B-associated transcript 2 (BAT2) gene, exons 2 through
            4.
ACCESSION   M33512 M31293
KEYWORDS    class III gene; major histocompatibility complex; proline-rich
            protein.
SEGMENT     3 of 4
SOURCE      Human T-cell line MANN DNA (introns), and T-cell line HPB-All,
            cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1460 to 1572; 2396 to 2572; and 2919 to 3090)
  AUTHORS   Banerji,J., Sands,J., Strominger,J.L. and Spies,T.
JOURNAL Unpublished (1990) 7 Divinity Ave., Cambridge, MA 02138 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Banerji, 11-JAN-1990, for release after publication. FEATURES from to/span description pept 1460 1572 HLA-B-associated transcript 2 (BAT2), exon 2 (first expressed exon) 2396 2572 HLA-B-associated transcript 2, exon 3 2919 + 3090 HLA-B-associated transcript 2, exon 4 pre-msg < 1 > 3090 BAT2 mRNA and introns IVS < 1 1459 BAT2 intron A (no splice consensus) IVS 1573 2395 BAT2 intron B IVS 2573 2918 BAT2 intron C BASE COUNT 747 a 709 c 730 g 901 t 3 others ORIGIN About 500 bp after segment 2; chromosome 6p21.3. 1 tctagaatcc tgcttttatc ccagcatctt tgctttctat gttgctcagt cgccctatgt 61 ctgctttttc atttttcctg ttcctcgtct cctttctccc ccaaccccgt ttttcttctt 121 gggcctctgc cccttacttc gttgtctaca tccttttttt ttttgccatt cctgtttcca 181 tatattttcc acctgctttc gtattcatta ttttctgtta gttttggact attcgctaca 241 tgactcttgt attcgttttc ccttcatata tttatcttca cagattggcc tcctcaaaca 301 cctacgaagc aacatccatc ttatgtgtag cttgtcataa agttctttct ccccaatttt 361 agctttcatt ctgggcctgt ctggatttcc ctgctttctt ccccactatt tctcatctct 421 ttacactgtt cccgaccata aacgaatgcc tggtcactct ggaatggact gagagacctg 481 tcgtccggct tgcttaggga gctggaggta tcgagtaaag aaacactggt gatggacatt 541 tttaatcagg ataggaaaac gaagatggct ctgccttggc cctctgtttt ctggcccatg 601 gttacagggt gctaaggtgg ctccataatg ctttttctca gttcttcata tggtaaaaca 661 gtatttcatc tggaggcgat tttttccagg agccaataca ggagcaagtt taccaaaaga 721 tgggatattt caaatacttg aggttcctat agcctgggag tatgtacagc cctagttgtt 781 ctatgaggat ttctctggta ccaaccccca ttccngctga gcaagctcat aaaatcctta 841 aactcccagc ataccttnct gcaaaccttc ccagatggac acgaggctgc tgggctggga 901 gctggggtac agggccctgg gggcatgatt agggagcttg tgtccaataa acagggaatc 961 taaagtgttg tttcttcttc tctgatggaa ttgtatgctt cttttttagt tttctcttag 1021 cttgaatttg tcctgttgta agtctctgaa acgattttgg tggagagaga agagattatt 1081 acttgtaggg aattactctt tngtagacag gcacaaaggg 
cagagtgttt atactaggag 1141 gatgctggat ttttacttag atttccttgt aacaaaggtc gtctggggcc aaggagggaa 1201 catggcattt gagctatgag ggagctaagt agatcatggt tggactttaa gaagagtggg 1261 cagtttacat agactggagg aaaagacacc agagggactc atatctgagt ccctaatgat 1321 aatgcaatgg agtttttaag tttctgttat ggtctgtaca gggacagaga ctgagacact 1381 tgcgtctggc ccacaggctc tggcacgttt tgggggaggt gcctgcagga cccaacatac 1441 tcaatgagct tccagcgcaa tgtccgatcg ctcggggccg actgccaagg gaaaggatgg 1501 aaagaagtat tcctcgctca acctgtttga tacgtataag ggcaagtcct tagagatcca 1561 gaaacccgcc tggtgagagt cctgcaaaga tgcttctgat ggttgaaaag ctaggcatgc 1621 atggggcata cgttttagag ctctaaagga agtggctgta gtagaaatac caaaagacta 1681 gaggagattt cccaacttac actgggtcct ttaaaggggg tgtgggctct gggtgaacac 1741 cagttatcct cctacaaagg cgtgtctgtg gttccctgtc tttggacacg taagaattgg 1801 aggaaataaa tgtggatttg ggaaactttg aggccagctt gcttcttgca ggctcatgat 1861 caaccaatct cacataaaag tattgaatgt tacatatctc agccttcttg atagggattt 1921 actagatttt tttttttttt tttttttttt ttttttgaga ccaagtttag ctcctgttgc 1981 ccaggctgga gtgcaatggt gtgatcttga cttaccacaa cctccaccgc ctgggtttaa 2041 gcgattatcc tgcctcagcc tcctgagtag ctgggattac aggcatgcac cccggctaat 2101 tttgtgtttt tagtagagac agggtttctc cattttggtc aagctggtct tgaactcctg 2161 acctcaggtg atccgcctcc ctcggcctgc caaagtgctg ggattgcaaa gtgtgagcca 2221 ccacaatcag cgcgatttca gagattatta aggcagggga aggaatccct tctaagagaa 2281 gtttggagga agtaggtaat aaaatattca acatgtataa atgtgtccca ggataggagg 2341 ccatcagatc tcccacatga ggcattttcg accctctctc cgtcttgttc tccagttgcc 2401 cctcgccatg gcctgcagag tctcgggaaa gttgccattg cccggcgtat cgacctccag 2461 ccaaccttcc aagcctgaaa gccgagaaca aaggcaatga ccccaatgtc tcactagtgc 2521 caaaagacgg aacaggatgg gcaagcaaac aggagcagtc cgaccccaag aggtagacag 2581 aggcttgggg gacctagagt gatgggtatt ttaacttgaa cttcagggag cattggggct 2641 tggtttagtc cagccacgtc tgaagagacg aagaggtccc tttcttacct attgcaggtt 2701 ccttgttaaa tgactaagga atggtactaa actttagctt tttgtcttgg agagagagca 2761 tgaaaaaata gacaacaggt acaaggatga caaaattaat ttgtccttat 
atttgtaaat 2821 ggtagcaatg ggcatgattt cagtcctgag tctccaccag ttggagaagt cagggaggca 2881 tctcaggtgt gaataacctt cccattctgt cccctcagtt ccgatgcctc aaccgctcag 2941 ccgccggaat cgcagccact gccggcttca cagacgcctg cctccaacca gccgaaacga 3001 cccccagcag cccccgagaa cactcctttg gttccaagcg gggtaaagtc ctgggcacaa 3061 gccagcgtca cccatggagc acatggagat // LOCUS HUMBAT2B4 6349 bp ds-DNA PRI 18-JAN-1990 DEFINITION Human HLA-B-associated transcript 2 (BAT2) gene, 3' end. ACCESSION M33518 M31293 KEYWORDS class III gene; major histocompatibility complex; proline-rich protein. SEGMENT 4 of 4 SOURCE Human T-cell line MANN DNA, and T-cell line HPB-All, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 209 to 6349) AUTHORS Banerji,J., Sands,J., Strominger,J.L. and Spies,T. TITLE A gene pair from the human major histocompatibility complex encodes large proline-rich proteins with multiple repeated motifs and a single ubiquitin-like domain JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2374-2378 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 6349) AUTHORS Banerji,J. JOURNAL Unpublished (1990) 7 Divinity Ave., Cambridge, MA 02138 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Banerji, 11-JAN-1990, for release after publication. FEATURES from to/span description pept + 209 6175 HLA-B-associated transcript 2 (BAT2), exon 5 pre-msg < 1 6349 BAT2 mRNA and introns IVS < 1 208 BAT2 intron D (no splice consensus) signal 6337 6342 poly-A signal BASE COUNT 1334 a 2095 c 1798 g 1121 t 1 others ORIGIN About 370 bp after segment 3; chromosome 6p21.3. 
1 agctaatttg tgtgtgttta gtagagatgg gttcacatgt tggcagatgg tctcgatctc 61 ttgacctctg tgatccgccc gcctcagccg gtcccagagt gctgggatta caggcgtgag 121 ccaccgcgcc cagccagagt cttccacttt tatnagcatg tcctcaggaa atgtcttctg 181 tctcctgttc tgcatcccca tcctaatagg tggaagggca tcaagcctac tgtcacgatt 241 ctctcgagag gaatttccga ccctgcaggc ggctggcgac caggacaagg ctgccaagga 301 aagggagtct gccgaacagt cgtctgggcc cggaccaagc ctccgccccc aaaattctac 361 aacttggagg gacggaggtg ggcgtggccc tgatgagctg gagggcccgg actccaaact 421 tcatcatggt catgatcccc ggggtgggct acagccttca ggcccacccc agttccctcc 481 ctaccgcgga atgatgccgc ctttcatgta tcccccatat ctcccgttcc ctccgcccta 541 tggaccccag gggccttacc gataccccac tcctgatggg cccagccgtt ttccccgtgt 601 ggcgggcccc cgaggctcag ggccaccaat gcgcttagta gagcctgtgg gtcgtccctc 661 tattctcaaa gaggataatc tcaaagagtt tgatcagttg gatcaggaga atgatgatgg 721 ttgggcaggg gcccatgaag aggttgacta cactgaaaag ctcaagttca gcgatgagga 781 agatgggcga gactctgatg aggagggagc tgagggccac agggattccc aatcagcttc 841 tggtgaggaa cggccccctg aagcagatgg caaaaagggc aactccccca acagcgaacc 901 gcccactcct aagacggcct gggcagaaac ctctcggcct ccagagacag agccgggacc 961 tcctgcccca aagcctcccc taccccctgg ggactaccca gatcgtgggg gtcctccctg 1021 caagccccca gcacctgaag atgaggatga ggcatggcgg cagcgacgaa agcagtcgtc 1081 atctgagatt tccctggcag tggagcgggc ccggcgacgg cgagaagaag aggagcggcg 1141 catgcaagaa gagcgccggg cagcctgtgc tgagaagctc aagcgactcg atgaaaagtt 1201 tggggcacct gacaagcggc tcaaagcaga gcctgctgcc ccacctgctg ccccttctac 1261 cccagcccca ccacctgcag tccctaaaga actccctgca cctccagctc cacctccagc 1321 atcagcccca acaccagaga cagaacctga agagccagca caggcccctc ctgcccaatc 1381 tactcctact ccaggtgtgg ctgcggctcc cactctggtg agtggtggtg gcagtaccag 1441 tagcaccagc agtggcagct tcgaagccag cccagtggaa ccacaactgc cctcaaaaga 1501 gggtcctgaa ccaccagaag aggttcctcc tcctaccaca cccccagttc caaaggtgga 1561 acccaagggt gatgggattg gtcccacccg ccagccccct agtcagggct tgggctaccc 1621 caaatatcag aagtcgttgc ctcctcgttt ccagcggcag cagcaggagc agctcctgaa 1681 gcagcagcag cagcaccagt 
ggcagcagca tcaacagggc tctgcccctc ctaccccagt 1741 gcccccatca ccaccacagc ctgtgaccct gggggctgtg ccagctccac aggctccacc 1801 cccgcccccc aaggccctgt acccaggtgc tctgggccgg cccccaccca tgcccccaat 1861 gaactttgat ccccgatgga tgatgattcc tccttatgtg gacccccggc tcctccaggg 1921 tcgtccccct ctagagttct accctcctgg tgtgcatccc tctggcctag ttccccgaga 1981 gcgttcagac agtctggggc tcagctcaga gccatttgac cgtcatgcac ctgctatgtt 2041 acgggaacgg ggcactccac cggtggatcc aaagttggcc tgggtaggag atgtcttcac 2101 cgccacaccc gctgaacccc gcccacttac ctcacctctg cgccaggctg cggatgagga 2161 tgacaagggg atgaggagcg agactcctcc agtacctccc ccaccaccct atctggccag 2221 ttatccaggc tttcctgaga atggagcccc tgggccccca atctctcgct ttcctctgga 2281 ggaaccaggg ccccgtccac tcccctggcc cccaggcagt gatgaagtgg ccaagataca 2341 aactccacca cccaagaagg agccccctaa ggaggagact gcacagctga cggggccaga 2401 agcaggccga aagctgcccg cgagtcggag tggagcaggc cccccaccac cacgcagaga 2461 gagtcgcaca gagacccgct ggggccctcg tccagggagc agtcgtcgtg gaatccctcc 2521 agaggagcca ggggccccac cccgccgggc tgggcctata aagaaacctc caccacctac 2581 aaaagtagaa gagctgcctc ccaagcccct cgaacagggg gatgaaaccc ccaaaccccc 2641 aaagccagac ccactcaaga taaccaaggg gaagctaggg ggccccaagg agaccccacc 2701 caatggaaat ctttcccctg ccccaaggct tcggagggac tattcgtatg aaagagtggg 2761 tcctacctct tgccggggtc ggggccgagg cgagtatttt gccagaggga ggggttttcg 2821 ggggacctat gggggacgag ggcggggagg ccaagcgaat tccgcagtta ccgagagttt 2881 cgaggagatg atgggcgtgg aggtgggaca gggggaccaa accaccctcc tgctccccga 2941 ggccgccatg ccagcgagac acggagcgag ggttcagagt atgaggaaat ccccaagcgg 3001 tgccggcagc ggggctcaga aacaggcagc gagacccatg agagtgatct ggctccttca 3061 gacaaggagg ctcccacacc caaggaggga acactcaccc aggtcctctc gctcccccac 3121 caccaggagc cccacccttc accgagcgcc agcccgcttc acgtgcccgg gggtcggcga 3181 gtcttcactc ccagagggtg ccatctcgcc ggggccgagg aggagggagg cccctcctca 3241 agtttgccca ggctggagcc ctccagccaa gtctctggct cccaagaaac ctcccacagg 3301 ccctttgcca ccaagtaagg agcctttgaa agagaagttg atcccagggc ctctgtcccc 3361 tgtggcgcgc ggaggcagca atggaggtag 
caatgtgggc atggaagatg gggagcgacc 3421 ccgaaggagg cgacatggga gggctcagca gcaggataaa ccgcctcgtt tccggaggct 3481 gaagcaggaa cgggagaatg ccgcaagggg gtctgagggc aagccctccc taacccttcc 3541 agcctccgct cctggacctg aggaggccct cacaacagtc acagtggccc cagcacctcc 3601 gcgggcagct gccaagtctc ctgatctgtc aaaccagaac tcagaccaag ccaatgagga 3661 atgggagact gcatcagaga gcagtgactt caccagtgag cgccgagggg acaaagaggc 3721 acccccacca gtactgctga cacccaaggc tgtgggaact cctgggggag gtggaggtgg 3781 agccgtacca ggtatttcag ccatgtcccg cggagatctg agccagagag ccaaggattt 3841 gagtaaacgg agcttctcaa gtcagcggcc aggcatggaa cggcagaatc ggcgccctgg 3901 cccagggggc aaggctggca gcagtggcag cagcagtgga ggaggcggtg ggggtcctgg 3961 aggaaggacc gggccaggac gaggcgacaa gaggagctgg ccctctccca agaaccgaag 4021 tcgtcctcca gaggagcgtc ccccggggct tcccctgcct cccccacctc ccagcagttc 4081 tgctgtcttc cgcctggacc aagttatcca cagcaaccct gctggcatcc aacaggctct 4141 ggcccagctt agtagccgtc aagggagtgt aactgcacca gggggtcatc caaggcacaa 4201 gcctgggcct ccccaagccc ctcagggccc ctctcctagg cccccaaccc gatacgagcc 4261 ccagagggtc aacagcggcc tcagttctga cccccacttt gaggagccgg ggccaatggt 4321 gagaggggtg ggtgggactc ctcgggactc tgccggggtt agtccctttc cccctaaacg 4381 tcgggagcgg cctcccagaa aaccagagct gctacaggag gaatctttgc cacctcctca 4441 tagctctgga ttcttgggct ctaagcctga gggcccaggc cctcaggcag agtccagaga 4501 tacaggcaca gaggccctga cccctcacat ctggaaccgt ttacatactg ccactagccg 4561 aaagagttac cggcccacgt ccatggagcc ttggatggag cccctgagtc cttttgagga 4621 tgtggctggc acagaaatga gtcagtctga cagtggggtg gacctgagtg gggattctca 4681 ggtgtcatca ggtccctgca gccagcgaag ttcccctgat ggaggactca agggggcagc 4741 agagggaccc cccaagaggc ctggaggctc ctcacccctg aatgctgttc cttgtgaggg 4801 tccacctggc tctgaacctc ctaggagacc accacctgcc ccccacgatg gggacagaaa 4861 ggagctgccc cgggagcagc ctctgccccc tggccccatt ggcacagaac gatcacagcg 4921 tacagaccga ggcacagagc ctggccccat tcggccatcc catcgacctg gtcccccagt 4981 ccagtttggc actagtgaca aggactcaga cttacgccta gtggtaggag acagcttgaa 5041 agcagagaag gagctaacag catcagtcac tgaggccatt 
cctgtatcac gagactggga 5101 gctgcttccc agtgctgctg cctctgctga gccacaatcc aagaacctgg attctgggca 5161 ctgtgtcccg gagcccagct cctcaggcca gcgcctgtat cctgaggttt tctatggcag 5221 tgctgggcct tccagttctc agatctctgg ggggagccat ggactctcaa ttacatccaa 5281 acagtggagg cttcgccctg ggacaccctc actgcaccct tacagatcac agcccctata 5341 cctacccccg ggcccagccc ctccctcagc actgctctct ggggtagctc tcaagggcca 5401 gtttctggat ttctccacaa tgcaagctac agagctgggg aagttgccgg ctggaggagt 5461 tctctaccct ccaccttcct tcctctactc tccggctttc tgccccagtc ctttgcctga 5521 cacatcgttg cttcaggtac gccaggatct gccatcccct tcggattttt attctactcc 5581 tctgcagcct ggtggccaaa gtggctttct cccttcaggg gctcctgccc agcagatgct 5641 tctacccatg gtagactcac agctgcctgt ggtgaacttt ggctccctgc cgccagcacc 5701 acctcctgcc ccacctcccc tttctctgtt acctgtgggc cctgctctgc agccccccag 5761 cctggctgtg cggcccccac ctgctcctgc tactcgggtg ctgccttcac ctgccaggcc 5821 cttccccgct agcttggggc gagcagagct gcatccagtg gaactaaagc cgttccagga 5881 ttatcaaaaa ctgagcagca accttggggg acctggatca tcacggactc ccccaactgg 5941 aaggtccttc tctggcctca attcccgtct caaggccacg ccttccacct acagtggagt 6001 cttccgcacc cagcgcgtcg acctttacca gcaggcctcc ccaccagatg ccctgcgctg 6061 gatacctaag ccttgggagc ggacagggcc gccacctcga gaagggccct cccgacgggc 6121 agaggagcct gggtcccgag gggacaagga gcctgggttg cccccacccc gctgagggag 6181 ttcctcttgc cccctacccc cggggcttgt atatagatta taaatatata agggggaaag 6241 gggtgggcgg ggaggggttg tggggctggg gcctcacttc ccctcctccc ccttcccctg 6301 gtcccctgtc cctggggctg tttgttaaaa aagagtaata aaaggattt // LOCUS HUMBAT3A 3740 bp ss-mRNA PRI 18-JAN-1990 DEFINITION Human HLA-B-associated transcript 3 (BAT3) mRNA, complete cds. ACCESSION M33519 M31294 KEYWORDS class III gene; major histocompatibility complex; proline-rich protein. SOURCE Human T-cell line HPB-All, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. 
REFERENCE 1 (bases 1 to 3740) AUTHORS Banerji,J., Sands,J., Strominger,J.L. and Spies,T. TITLE A gene pair from the human major histocompatibility complex encodes large proline-rich proteins with multiple repeated motifs and a single ubiquitin-like domain JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2374-2378 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Banerji, 11-JAN-1990, for release after publication. FEATURES from to/span description pept 250 3648 HLA-B-associated transcript 3 (BAT3) mRNA 1 3740 BAT3 mRNA BASE COUNT 744 a 1182 c 1057 g 757 t ORIGIN Chromosome 6p21.3. 1 ggcgacagcg gtggcggctc ctcggggtgc tcggctccct cccacctagg ccggccccgg 61 cccgactcgc cctcagaaac tcactgtttg gggctgcgga ctttctcgtc gtgccccaca 121 aaagtaaagc ttggggacct ggggggagcc ggaagtatcg cttcgagatc cccaaatact 181 atcggggaaa cggaagtggc cgtcggtggc aggtttgggg gagaccggaa gtgacgagac 241 ctgtcggcca tggagcctaa tgatagtacc agtaccgctg tggaggagcc tgacagcttg 301 gaggtgttgg tgaagacctt ggactctcaa actcgtacct ttattgtggg ggcccagatg 361 aatgtaaaag agtttaagga gcacattcgt gcctctgtca gcatcccatc tgaaaaacaa 421 cggctcattt accagggacg agttctgcaa gatgataaga agcttcagga atacaatgtt 481 gggggaaagg ttatccacct ggtggaacgg gctcctcctc agactcacct cccttctggg 541 gcatcttctg ggacggggtc tgcctcagcc actcatggtg ggggatcccc ccctggtact 601 cgggggcctg gggcctctgt tcatgaccgg aatgccaaca gctatgtcat ggttggaacc 661 ttcaatcttc ctagtgacgg ctctgctgtg gatgttcaca tcaacatgga acaggccccg 721 attcagagtg agccccgggt acggctggtg atggctcagc acatgatcag ggatatacag 781 accttactat cccggatgga gactctcccc taccttcagt gtcgaggagg gccccaaccg 841 cagcacagtc agccgccccc gcagccaccg gctgtgaccc cggagccagt agccttgagc 901 tctcaaacat cagaaccagt tgaaagtgaa gcacctcccc gggagcccat ggaggcagaa 961 gaagtggagg agcgtgcccc agcccagaac ccggagctca ctcctggccc agccccagcg 1021 ggcccaacac ctgccccgga aacaaatgca cccaaccatc cttcccctgc ggagtatgtc 1081 gaggtgctcc aggagctaca gcggctggag agtcgcctcc agcccttctt gcagcgctac 1141 tacgaggttc 
tgggtgctgc tgccaccacg gactacaata acaatcacga gggccgggag 1201 gaggatcagc ggttgatcaa cttggtaggg gagagcctgc gactgctggg caacaccttt 1261 gttgcactgt ctgacctgcg ctgcaatctg gcctgcacgc ccccacgaca cctgcatgtg 1321 gtccggccta tgtctcacta caccaccccc atggtgctcc agcaggcagc cattcccata 1381 cagatcaatg tgggaaccac tgtgaccatg acaggaaatg ggactcggcc ccccccaact 1441 cccaatgcag aggcacctcc ccctggtcct gggcaggcct catccgtggc tccgtcttct 1501 accaatgtcg agtcctcagc tgagggggct cccccgccag gtccagctcc cccgccagcc 1561 accagccacc cgagggtcat ccggatttcc caccagagtg tggaacccgt ggtcatgatg 1621 cacatgaaca ttcaagattc tggcacacag cctggtggtg ttccgagtgc tcccactggc 1681 cccctgggac cccctggtca tggccaaacc ctgggacagc aggtgccagg cttcccaaca 1741 gctccaaccc gggtggtgat tgcccggccc actcctccac aggctcggcc ttcccatcct 1801 ggagggcccc cagtctctgg gacactgcag ggcgccggtc tgggtaccaa tgcctcgttg 1861 gcccagatgg tgagcggcct tgtggggcag cttcttatgc agccagtcct tgtggctcag 1921 gggaccccag gtatggctcc accgccagcc cctgccactg cttctgccag tgctggcacc 1981 accaacacag ctaccacagc tggccccgct cctggggggc ctgcccagcc tccacccacc 2041 cctcaaccct ccatggctga tcttcagttc tctcagcttc tggggaacct gctagggcct 2101 gcagggccag gggctggagg gcctggtgtg gcttctccca ccatcactgt ggcgatgcct 2161 ggtgtccctg cctttctcca aggcatgact gacttcttgc aggcaacaca gacagcccct 2221 ccaccacccc cacctcctcc acccccacca cctgccccag agcagcagac catgccccca 2281 ccaggctccc cttctggtgg cgcagggagt cctggaggcc tgggtcttga gagcctgtca 2341 ccggagtttt ttacctcagt ggtgcagggt gtgctcagct ccctgctggg ctccctgggg 2401 gctcgggctg gcagcagtga aagtattgct gccttcatac aacgcctcag tggatccagc 2461 aacatctttg agcctggagc tgatggggcc cttggattct ttggggcctt gctttctctt 2521 ctgtgccaga acttctctat ggtggacgta gtgatgcttc tccatgggca tttccagcca 2581 ctacaacggc tccagcccca gctgcgatcc ttcttccacc agcactacct gggtggtcag 2641 gagcccacac ccagtaacat ccggatggca acccacacat tgatcacggg gctagaagag 2701 tatgtgcggg agagtttttc cttggtgcag gttcagccag gtgtggacat catccggaca 2761 aacctggaat ttctccaaga gcagtttaat agcattgctg cgcatgtgct gcattgcaca 2821 gatagtggat ttggggcccg 
gttgctggag ttgtgtaacc aaggcctgtt tgaatgcctg 2881 gccctaaacc tgcactgctt ggggggacag cagatggagc ttgctgctgt tatcaatggc 2941 cgaattcgtc gtatgtctcg tggggtgaat ccctccttgg tgagctggct gaccactatg 3001 atgggactga ggcttcaggt ggtactggag cacatgcctg taggccctga tgccattctc 3061 agatacgttc gcagggttgg tgatcccccc cagccacttc ctgaggagcc aatggaagtt 3121 cagggagcag aaagagcttc ccctgagcct cagcgggaga atgcttcccc agcccctgga 3181 acaacagcag aagaggccat gtcccgaggt ccacctcctg ctcctgaggg gggctcccgg 3241 gatgaacagg atggagcttc agctgagaca gaaccttggg cagctgcagt ccccccagaa 3301 tgggtcccta ttatccagca ggacattcag agccagcgga aggtgaaacc gcagccccct 3361 ctgagtgatg cctacctcag tggtatgcct gccaagagac gcaagacgat gcagggtgag 3421 ggcccccagc tgcttctctc agaggctgtg agccgggcag ctaaggcagc cggagctcgg 3481 cccctgacga gccccgagag cctgagccgg gacctggagg caccagaggt tcaggagagc 3541 tacaggcagc agctccggtc tgatatacaa aaacgactgc aggaagaccc caactacagt 3601 ccccagcgct tccccaatgc ccagcgggcc tttgctgatg atccttagct ctttgctcta 3661 tggcccttcc tcatcagggg accgtttccc ccctcttcct tcacagtatt taagaaataa 3721 aagtcggatt ttttctggcc // LOCUS HUMBAT3B1 785 bp ds-DNA PRI 18-JAN-1990 DEFINITION Human HLA-B-associated transcript 3 (BAT3) gene, 5' end. ACCESSION M33520 M31294 KEYWORDS class III gene; major histocompatibility complex; proline-rich protein. SEGMENT 1 of 2 SOURCE Human T-cell line MANN DNA, and T-cell line HPB-All, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 333 to 689) AUTHORS Banerji,J., Sands,J., Strominger,J.L. and Spies,T. TITLE A gene pair from the human major histocompatibility complex encodes large proline-rich proteins with multiple repeated motifs and a single ubiquitin-like domain JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2374-2378 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 785) AUTHORS Banerji,J. 
JOURNAL Unpublished (1990) 7 Divinity Ave., Cambridge, MA 02138 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Banerji, 11-JAN-1990, for release after publication. FEATURES from to/span description pept 582 + 689 HLA-B-associated transcript 3 (BAT3), exon 1 pre-msg 333 > 785 BAT3 mRNA and introns IVS 690 > 785 BAT3 intron A BASE COUNT 170 a 210 c 239 g 165 t 1 others ORIGIN Chromosome 6p21.3. 1 aaggcgcagc gagggcaata gggtggagaa gagttttagc tgctagacag tgccgcctga 61 aattatcagc ctgccaagat ttaaacatag atgaatgtgg cataatcccc catctccaaa 121 gtccaagtcc atacgaccgt ccatagcctc tcgaggcagt ggtagagtcc cagctggtga 181 ctgtttttca ggcatttacg gtagccacct caatcttcta gcgctcaacg cgcgcacaga 241 cgtgaacgcc gccagagggg ggagggggtg gggcgatgct taagtgtcca cgcatcccgt 301 agtgcgacgg cacagcgtag taggtncccc cgggcgacag cggtggcggc tcctcggggt 361 gctcggctcc ctcccaccta ggccggcccc ggcccgactc gccctcagaa actcactgtt 421 tggggctgcg gactttctcg tcgtgcccca caaaagtaaa gcttggggac ctggggggag 481 ccggaagtat cgcttcgaga tccccaaata ctatcgggga aacggaagtg gccgtcggtg 541 gcaggtttgg gggagaccgg aagtgacgag acctgtcggc catggagcct aatgatagta 601 ccagtaccgc tgtggaggag cctgacagct tggaggtgtt ggtgaagacc ttggactctc 661 aaactcgtac ctttattgtg ggggcccagg tgagacacct cactagttct ggaagacacc 721 tttagctttt ccacgtttag gccccttagc ctgagagatg agcttgattt ttctaggtca 781 ccaga // LOCUS HUMBAT3B2 4227 bp ds-DNA PRI 18-JAN-1990 DEFINITION Human HLA-B-associated transcript 3 (BAT3) gene, 3' end. ACCESSION M33521 M31294 KEYWORDS class III gene; major histocompatibility complex; proline-rich protein. SEGMENT 2 of 2 SOURCE Human T-cell line MANN DNA, and T-cell line HPB-All, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 731 to 848; 963 to 4227) AUTHORS Banerji,J., Sands,J., Strominger,J.L. and Spies,T. 
TITLE A gene pair from the human major histocompatibility complex encodes large proline-rich proteins with multiple repeated motifs and a single ubiquitin-like domain JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2374-2378 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 4227) AUTHORS Banerji,J. JOURNAL Unpublished (1990) 7 Divinity Ave., Cambridge, MA 02138 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Banerji, 11-JAN-1990, for release after publication. FEATURES from to/span description pept + 731 848 HLA-B-associated transcript 3 (BAT3), exon 2 963 4135 HLA-B-associated transcript 3, exon 3 pre-msg < 1 4227 BAT3 mRNA and introns IVS < 1 730 BAT3 intron A IVS 849 962 BAT3 intron B BASE COUNT 842 a 1276 c 1116 g 987 t 6 others ORIGIN About 1.1 kb after segment 1; chromosome 6p21.3. 1 ttatcttntt agatcatttc cttccacctt aacctatacc agacccactc cttctttgcc 61 attttttaat cttggaaatc acaggagngt ctgtaaatna ctggatcatc ttgtgtttgg 121 aaggggtact gatgtctcta gacacatacn cccttggatg ccagacagat aatataattt 181 ccatgtgttt tttttttgtt tttcatccgt gttatttttc ctggatctat aacctgagct 241 tcattaagtt tatttattta attttttcga gatggagtcc cacnctttca cccaggctag 301 agtgtagtga tgcgatctcg gctcactgca acctccgcct cccgaattca agtgattctc 361 ttgcttcagc ctccctagta gctgggatta caggcgacca ccatgcctgg cttatttttt 421 gtatttttgg taaaaagggg ttttacatgt tggccaggct ggtctcgaac tctgacctaa 481 gtgatctgcc tgccttggcc tcccaagtgc tggattacag tgtgagacca ccgctccagc 541 caatatgtct gtatttttga cacgtgttac tttagttaag ggtttgcaca gtaatgatct 601 cacggtcaag acaaacgggt agtgattdct gtggtggttt ttacccctca cctccacaac 661 tcggttgtct gtctttgttc ttcctctttc ctccattctt tccattcctg tgcatgcctc 721 ttcttttcag atgaatgtaa aagagtttaa ggagcacatt cgtgcctctg tcagcatccc 781 atctgaaaaa caacggctca tttaccaggg acgagttctg caagatgata agaagcttca 841 ggaatacagt aagggggctg gggaggcagt tcagaggttg gggctactgt ctggagggat 901 gaactgaggc catgggttta cctgttcata ctatgttttg gtgtgtgtct atttttctgc 961 agatgttggg 
ggaaaggtta tccacctggt ggaacgggct cctcctcaga ctcacctccc 1021 ttctggggca tcttctggga cggggtctgc ctcagccact catggtgggg gatccccccc 1081 tggtactcgg gggcctgggg cctctgttca tgaccggaat gccaacagct atgtcatggt 1141 tggaaccttc aatcttccta gtgacggctc tgctgtggat gttcacatca acatggaaca 1201 ggccccgatt cagagtgagc cccgggtacg gctggtgatg gctcagcaca tgatcaggga 1261 tatacagacc ttactatccc ggatggagac tctcccctac cttcagtgtc gaggagggcc 1321 ccaaccgcag cacagtcagc cgcccccgca gccaccggct gtgaccccgg agccagtagc 1381 cttgagctct caaacatcag aaccagttga aagtgaagca cctccccggg agcccatgga 1441 ggcagaagaa gtggaggagc gtgccccagc ccagaacccg gagctcactc ctggcccagc 1501 cccagcgggc ccaacacctg ccccggaaac aaatgcaccc aaccatcctt cccctgcgga 1561 gtatgtcgag gtgctccagg agctacagcg gctggagagt cgcctccagc ccttcttgca 1621 gcgctactac gaggttctgg gtgctgctgc caccacggac tacaataaca atcacgaggg 1681 ccgggaggag gatcagcggt tgatcaactt ggtaggggag agcctgcgac tgctgggcaa 1741 cacctttgtt gcactgtctg acctgcgctg caatctggcc tgcacgcccc cacgacacct 1801 gcatgtggtc cggcctatgt ctcactacac cacccccatg gtgctccagc aggcagccat 1861 tcccatacag atcaatgtgg gaaccactgt gaccatgaca ggaaatggga ctcggccccc 1921 cccaactccc aatgcagagg cacctccccc tggtcctggg caggcctcat ccgtggctcc 1981 gtcttctacc aatgtcgagt cctcagctga gggggctccc ccgccaggtc cagctccccc 2041 gccagccacc agccacccga gggtcatccg gatttcccac cagagtgtgg aacccgtggt 2101 catgatgcac atgaacattc aagattctgg cacacagcct ggtggtgttc cgagtgctcc 2161 cactggcccc ctgggacccc ctggtcatgg ccaaaccctg ggacagcagg tgccaggctt 2221 cccaacagct ccaacccggg tggtgattgc ccggcccact cctccacagg ctcggccttc 2281 ccatcctgga gggcccccag tctctgggac actgcagggc gccggtctgg gtaccaatgc 2341 ctcgttggcc cagatggtga gcggccttgt ggggcagctt cttatgcagc cagtccttgt 2401 ggctcagggg accccaggta tggctccacc gccagcccct gccactgctt ctgccagtgc 2461 tggcaccacc aacacagcta ccacagctgg ccccgctcct ggggggcctg cccagcctcc 2521 acccacccct caaccctcca tggctgatct tcagttctct cagcttctgg ggaacctgct 2581 agggcctgca gggccagggg ctggagggcc tggtgtggct tctcccacca tcactgtggc 2641 gatgcctggt gtccctgcct 
ttctccaagg catgactgac ttcttgcagg caacacagac 2701 agcccctcca ccacccccac ctcctccacc cccaccacct gccccagagc agcagaccat 2761 gcccccacca ggctcccctt ctggtggcgc agggagtcct ggaggcctgg gtcttgagag 2821 cctgtcaccg gagtttttta cctcagtggt gcagggtgtg ctcagctccc tgctgggctc 2881 cctgggggct cgggctggca gcagtgaaag tattgctgcc ttcatacaac gcctcagtgg 2941 atccagcaac atctttgagc ctggagctga tggggccctt ggattctttg gggccttgct 3001 ttctcttctg tgccagaact tctctatggt ggacgtagtg atgcttctcc atgggcattt 3061 ccagccacta caacggctcc agccccagct gcgatccttc ttccaccagc actacctggg 3121 tggtcaggag cccacaccca gtaacatccg gatggcaacc cacacattga tcacggggct 3181 agaagagtat gtgcgggaga gtttttcctt ggtgcaggtt cagccaggtg tggacatcat 3241 ccggacaaac ctggaatttc tccaagagca gtttaatagc attgctgcgc atgtgctgca 3301 ttgcacagat agtggatttg gggcccggtt gctggagttg tgtaaccaag gcctgtttga 3361 atgcctggcc ctaaacctgc actgcttggg gggacagcag atggagcttg ctgctgttat 3421 caatggccga attcgtcgta tgtctcgtgg ggtgaatccc tccttggtga gctggctgac 3481 cactatgatg ggactgaggc ttcaggtggt actggagcac atgcctgtag gccctgatgc 3541 cattctcaga tacgttcgca gggttggtga tcccccccag ccacttcctg aggagccaat 3601 ggaagttcag ggagcagaaa gagcttcccc tgagcctcag cgggagaatg cttccccagc 3661 ccctggaaca acagcagaag aggccatgtc ccgaggtcca cctcctgctc ctgagggggg 3721 ctcccgggat gaacaggatg gagcttcagc tgagacagaa ccttgggcag ctgcagtccc 3781 cccagaatgg gtccctatta tccagcagga cattcagagc cagcggaagg tgaaaccgca 3841 gccccctctg agtgatgcct acctcagtgg tatgcctgcc aagagacgca agacgatgca 3901 gggtgagggc ccccagctgc ttctctcaga ggctgtgagc cgggcagcta aggcagccgg 3961 agctcggccc ctgacgagcc ccgagagcct gagccgggac ctggaggcac cagaggttca 4021 ggagagctac aggcagcagc tccggtctga tatacaaaaa cgactgcagg aagaccccaa 4081 ctacagtccc cagcgcttcc ccaatgccca gcgggccttt gctgatgatc cttagctctt 4141 tgctctatgg cccttcctca tcaggggacc gtttcccccc tcttccttca cagtatttaa 4201 gaaataaaag tcggattttt tctggcc //