LOCUS       BC009896                2776 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens HIV-1 Tat specific factor 1, mRNA (cDNA clone MGC:2033
            IMAGE:3504952), complete cds.
ACCESSION   BC009896
VERSION     BC009896.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2776)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2776)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (02-JUL-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Aug 19, 2003 this sequence version replaced BC009896.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 7 Row: i Column: 17
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 34147671.
FEATURES             Location/Qualifiers
     source          1..2776
                     /db_xref="H-InvDB:HIT000034651"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:2033 IMAGE:3504952"
                     /tissue_type="Placenta, choriocarcinoma"
                     /clone_lib="NIH_MGC_21"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..2776
                     /gene="HTATSF1"
                     /gene_synonym="dJ196E23.2"
                     /gene_synonym="TAT-SF1"
                     /db_xref="GeneID:27336"
                     /db_xref="HGNC:HGNC:5276"
                     /db_xref="MIM:300346"
     CDS             174..2441
                     /gene="HTATSF1"
                     /gene_synonym="dJ196E23.2"
                     /gene_synonym="TAT-SF1"
                     /codon_start=1
                     /product="HIV-1 Tat specific factor 1"
                     /protein_id="AAH09896.1"
                     /db_xref="GeneID:27336"
                     /db_xref="HGNC:HGNC:5276"
                     /db_xref="MIM:300346"
                     /translation="MSGTNLDGNDEFDEQLRMQELYGDGKDGDTQTDAGGEPDSLGQQ
                     PTDTPYEWDLDKKAWFPKITEDFIATYQANYGFSNDGASSSTANVEDVHARTAEEPPQ
                     EKAPEPTDARKKGEKRKAESGWFHVEEDRNTNVYVSGLPPDITVDEFIQLMSKFGIIM
                     RDPQTEEFKVKLYKDNQGNLKGDGLCCYLKRESVELALKLLDEDEIRGYKLHVEVAKF
                     QLKGEYDASKKKKKCKDYKKKLSMQQKQLDWRPERRAGPSRMRHERVVIIKNMFHPMD
                     FEDDPLVLNEIREDLRVECSKFGQIRKLLLFDRHPDGVASVSFRDPEEADYCIQTLDG
                     RWFGGRQITAQAWDGTTDYQVEETSREREERLRGWEAFLNAPEANRGLRRSDSVSASE
                     RAGPSRARHFSEHPSTSKMNAQETATGMAFEEPIDEKKFEKTEDGGEFEEGASENNAK
                     ESSPEKEAEEGCPEKESEEGCPKRGFEGSCSQKESEEGNPVRGSEEDSPKKESKKKTL
                     KNDCEENGLAKESEDDLNKESEEEVGPTKESEEDDSEKESDEDCSEKQSEDGSEREFE
                     ENGLEKDLDEEGSEKELHENVLDKELEENDSENSEFEDDGSEKVLDEEGSEREFDEDS
                     DEKEEEEDTYEKVFDDESDEKEDEEYADEKGLEAADKKAEEGDADEKLFEESDDKEDE
                     DADGKEVEDADEKLFEDDDSNEKLFDEEEDSSEKLFDDSDERGTLGGFGSVEEGPLST
                     GSSFILSSDDDDDDI"
BASE COUNT      900 a    458 c    762 g    656 t
ORIGIN
1 cagccgccga gcggccgcga tttcccgggg actgctgggg cgcagcgggg aggcgggccg 61 gggggcggcg gggcgcgagc agagcgcggt tgacctccct ttctctgctc agctccagcg 121 tcatttcggc ctcttagttc ttctgaaccc tgctcctgag ctaggtagga aacatgagcg 181 gcaccaactt ggatgggaac gatgagtttg atgagcagtt gcgaatgcaa gaattgtacg 241 gagacggcaa ggatggtgac acccagaccg atgccggcgg agaacccgat tctctcgggc 301 agcagccgac ggacactccc tacgagtggg acctggacaa aaaggcttgg ttccccaaga 361 ttactgaaga tttcattgct acatatcagg ccaattatgg cttctctaac gatggcgcat 421 ctagttctac 
cgcaaatgtt gaagatgtcc atgctaggac tgcagaggaa cctccacaag 481 aaaaagcccc ggaacccact gatgccagaa agaagggaga aaaaagaaag gctgagtcag 541 gatggtttca tgttgaagaa gacagaaata caaatgtata cgtgtctggt ttgcctccag 601 atattacagt ggatgaattt atacaactta tgtccaagtt tggcattatt atgagagatc 661 ctcagacaga agaatttaag gtcaaacttt acaaagataa tcaaggaaat cttaaaggag 721 acggtctttg ctgttatttg aaaagagaat ctgtggaact tgcattaaaa cttttggatg 781 aagatgaaat tagaggctac aaattacatg ttgaggtggc aaagtttcaa ctgaagggag 841 aatatgatgc ctcaaagaag aagaagaagt gcaaagacta taagaagaag ctgtctatgc 901 aacaaaagca gttggattgg agacctgaga ggcgagccgg accatcccgg atgcgccatg 961 agcgagttgt catcatcaag aatatgtttc atcctatgga ttttgaggat gatccgttgg 1021 tgctgaatga gatcagagaa gaccttcgag tagagtgttc gaagtttgga caaattagga 1081 aactccttct ctttgatagg cacccagatg gtgtggcctc tgtgtccttt cgggatccag 1141 aggaagctga ttattgtatt cagactctcg atggaagatg gtttggtggc cgtcaaatca 1201 ctgcccaggc atgggatggg actacagatt atcaggtgga ggaaacctca agagaaaggg 1261 aggaaaggct gagaggatgg gaggctttcc tcaatgctcc tgaggccaac agaggcctta 1321 ggcgttcaga ttctgtctct gcttccgaaa gggcagggcc ttctagagca aggcattttt 1381 cagagcaccc cagcacatct aaaatgaatg ctcaagaaac tgcaactgga atggcgtttg 1441 aagaacctat agatgagaag aagtttgaaa agacagaaga tgggggagaa tttgaagaag 1501 gtgcttctga aaacaatgct aaggaaagta gccccgaaaa agaggctgaa gaaggctgcc 1561 ctgaaaaaga atctgaagag ggctgcccca aaagagggtt tgaaggcagc tgctcccaaa 1621 aagagtctga agaaggcaat cccgtaagag gatctgaaga ggatagtcct aaaaaagagt 1681 ctaaaaagaa gacactcaaa aatgattgtg aagagaatgg ccttgcaaag gaatctgaag 1741 atgacctcaa caaggagtct gaagaggagg ttggccccac aaaagagtcc gaagaagatg 1801 actcagagaa agagtctgat gaagactgct ctgaaaaaca gtctgaagat ggctccgaaa 1861 gagaatttga agaaaatggt ctcgagaaag atttggacga ggaaggttct gaaaaggagc 1921 ttcatgaaaa tgttcttgac aaagagttag aagaaaatga ctctgaaaac tccgaatttg 1981 aagatgacgg ctctgaaaaa gtgttagatg aggaaggctc tgagagagag tttgacgaag 2041 attcagatga aaaggaagaa gaggaggata catatgaaaa agtatttgat gatgagtctg 2101 atgagaaaga ggatgaagaa 
tatgcagatg aaaaggggct tgaagctgct gataaaaagg 2161 cggaagaagg tgatgcagat gaaaagctgt ttgaagagtc agatgacaag gaagatgaag 2221 atgcagatgg aaaggaagtt gaagatgctg acgaaaagtt gttcgaagat gatgattcca 2281 atgagaagtt gtttgatgag gaggaagatt ccagtgagaa gttgtttgac gattctgatg 2341 agagggggac tttgggtggt tttgggagtg ttgaagaagg gcccctatcc actggcagca 2401 gctttattct cagtagcgat gatgatgacg atgatattta atcccttaaa cttgcttttt 2461 agggagagtc ctccatctac atttgcctgt gcttcagggt aattactagt agtgttacat 2521 gaacatgtgc atagtggtag gatgccatca gattaaagca ttgaagtgtt tcattgttac 2581 ctgtacctaa tggttttaaa tatatgttaa ttgattgttt agttaaaatg tcatagttac 2641 aatgcaagta aactggatac ttgttctttt gtcagatttg ttaaatgcat gcagaataat 2701 atttttaaga gtattgattg aagtttgtga tattcatcaa taaaaatgag ttgataataa 2761 aaaaaaaaaa aaaaaa //