LOCUS BC009896 2776 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens HIV-1 Tat specific factor 1, mRNA (cDNA clone MGC:2033
IMAGE:3504952), complete cds.
ACCESSION BC009896
VERSION BC009896.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2776)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2776)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (02-JUL-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Aug 19, 2003 this sequence version replaced BC009896.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Institute for Systems Biology
http://www.systemsbiology.org
contact: amadan@systemsbiology.org
Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 7 Row: i Column: 17
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 34147671.
FEATURES Location/Qualifiers
source 1..2776
/db_xref="H-InvDB:HIT000034651"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:2033 IMAGE:3504952"
/tissue_type="Placenta, choriocarcinoma"
/clone_lib="NIH_MGC_21"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..2776
/gene="HTATSF1"
/gene_synonym="dJ196E23.2"
/gene_synonym="TAT-SF1"
/db_xref="GeneID:27336"
/db_xref="HGNC:HGNC:5276"
/db_xref="MIM:300346"
CDS 174..2441
/gene="HTATSF1"
/gene_synonym="dJ196E23.2"
/gene_synonym="TAT-SF1"
/codon_start=1
/product="HIV-1 Tat specific factor 1"
/protein_id="AAH09896.1"
/db_xref="GeneID:27336"
/db_xref="HGNC:HGNC:5276"
/db_xref="MIM:300346"
/translation="MSGTNLDGNDEFDEQLRMQELYGDGKDGDTQTDAGGEPDSLGQQ
PTDTPYEWDLDKKAWFPKITEDFIATYQANYGFSNDGASSSTANVEDVHARTAEEPPQ
EKAPEPTDARKKGEKRKAESGWFHVEEDRNTNVYVSGLPPDITVDEFIQLMSKFGIIM
RDPQTEEFKVKLYKDNQGNLKGDGLCCYLKRESVELALKLLDEDEIRGYKLHVEVAKF
QLKGEYDASKKKKKCKDYKKKLSMQQKQLDWRPERRAGPSRMRHERVVIIKNMFHPMD
FEDDPLVLNEIREDLRVECSKFGQIRKLLLFDRHPDGVASVSFRDPEEADYCIQTLDG
RWFGGRQITAQAWDGTTDYQVEETSREREERLRGWEAFLNAPEANRGLRRSDSVSASE
RAGPSRARHFSEHPSTSKMNAQETATGMAFEEPIDEKKFEKTEDGGEFEEGASENNAK
ESSPEKEAEEGCPEKESEEGCPKRGFEGSCSQKESEEGNPVRGSEEDSPKKESKKKTL
KNDCEENGLAKESEDDLNKESEEEVGPTKESEEDDSEKESDEDCSEKQSEDGSEREFE
ENGLEKDLDEEGSEKELHENVLDKELEENDSENSEFEDDGSEKVLDEEGSEREFDEDS
DEKEEEEDTYEKVFDDESDEKEDEEYADEKGLEAADKKAEEGDADEKLFEESDDKEDE
DADGKEVEDADEKLFEDDDSNEKLFDEEEDSSEKLFDDSDERGTLGGFGSVEEGPLST
GSSFILSSDDDDDDI"
BASE COUNT 900 a 458 c 762 g 656 t
ORIGIN
1 cagccgccga gcggccgcga tttcccgggg actgctgggg cgcagcgggg aggcgggccg
61 gggggcggcg gggcgcgagc agagcgcggt tgacctccct ttctctgctc agctccagcg
121 tcatttcggc ctcttagttc ttctgaaccc tgctcctgag ctaggtagga aacatgagcg
181 gcaccaactt ggatgggaac gatgagtttg atgagcagtt gcgaatgcaa gaattgtacg
241 gagacggcaa ggatggtgac acccagaccg atgccggcgg agaacccgat tctctcgggc
301 agcagccgac ggacactccc tacgagtggg acctggacaa aaaggcttgg ttccccaaga
361 ttactgaaga tttcattgct acatatcagg ccaattatgg cttctctaac gatggcgcat
421 ctagttctac cgcaaatgtt gaagatgtcc atgctaggac tgcagaggaa cctccacaag
481 aaaaagcccc ggaacccact gatgccagaa agaagggaga aaaaagaaag gctgagtcag
541 gatggtttca tgttgaagaa gacagaaata caaatgtata cgtgtctggt ttgcctccag
601 atattacagt ggatgaattt atacaactta tgtccaagtt tggcattatt atgagagatc
661 ctcagacaga agaatttaag gtcaaacttt acaaagataa tcaaggaaat cttaaaggag
721 acggtctttg ctgttatttg aaaagagaat ctgtggaact tgcattaaaa cttttggatg
781 aagatgaaat tagaggctac aaattacatg ttgaggtggc aaagtttcaa ctgaagggag
841 aatatgatgc ctcaaagaag aagaagaagt gcaaagacta taagaagaag ctgtctatgc
901 aacaaaagca gttggattgg agacctgaga ggcgagccgg accatcccgg atgcgccatg
961 agcgagttgt catcatcaag aatatgtttc atcctatgga ttttgaggat gatccgttgg
1021 tgctgaatga gatcagagaa gaccttcgag tagagtgttc gaagtttgga caaattagga
1081 aactccttct ctttgatagg cacccagatg gtgtggcctc tgtgtccttt cgggatccag
1141 aggaagctga ttattgtatt cagactctcg atggaagatg gtttggtggc cgtcaaatca
1201 ctgcccaggc atgggatggg actacagatt atcaggtgga ggaaacctca agagaaaggg
1261 aggaaaggct gagaggatgg gaggctttcc tcaatgctcc tgaggccaac agaggcctta
1321 ggcgttcaga ttctgtctct gcttccgaaa gggcagggcc ttctagagca aggcattttt
1381 cagagcaccc cagcacatct aaaatgaatg ctcaagaaac tgcaactgga atggcgtttg
1441 aagaacctat agatgagaag aagtttgaaa agacagaaga tgggggagaa tttgaagaag
1501 gtgcttctga aaacaatgct aaggaaagta gccccgaaaa agaggctgaa gaaggctgcc
1561 ctgaaaaaga atctgaagag ggctgcccca aaagagggtt tgaaggcagc tgctcccaaa
1621 aagagtctga agaaggcaat cccgtaagag gatctgaaga ggatagtcct aaaaaagagt
1681 ctaaaaagaa gacactcaaa aatgattgtg aagagaatgg ccttgcaaag gaatctgaag
1741 atgacctcaa caaggagtct gaagaggagg ttggccccac aaaagagtcc gaagaagatg
1801 actcagagaa agagtctgat gaagactgct ctgaaaaaca gtctgaagat ggctccgaaa
1861 gagaatttga agaaaatggt ctcgagaaag atttggacga ggaaggttct gaaaaggagc
1921 ttcatgaaaa tgttcttgac aaagagttag aagaaaatga ctctgaaaac tccgaatttg
1981 aagatgacgg ctctgaaaaa gtgttagatg aggaaggctc tgagagagag tttgacgaag
2041 attcagatga aaaggaagaa gaggaggata catatgaaaa agtatttgat gatgagtctg
2101 atgagaaaga ggatgaagaa tatgcagatg aaaaggggct tgaagctgct gataaaaagg
2161 cggaagaagg tgatgcagat gaaaagctgt ttgaagagtc agatgacaag gaagatgaag
2221 atgcagatgg aaaggaagtt gaagatgctg acgaaaagtt gttcgaagat gatgattcca
2281 atgagaagtt gtttgatgag gaggaagatt ccagtgagaa gttgtttgac gattctgatg
2341 agagggggac tttgggtggt tttgggagtg ttgaagaagg gcccctatcc actggcagca
2401 gctttattct cagtagcgat gatgatgacg atgatattta atcccttaaa cttgcttttt
2461 agggagagtc ctccatctac atttgcctgt gcttcagggt aattactagt agtgttacat
2521 gaacatgtgc atagtggtag gatgccatca gattaaagca ttgaagtgtt tcattgttac
2581 ctgtacctaa tggttttaaa tatatgttaa ttgattgttt agttaaaatg tcatagttac
2641 aatgcaagta aactggatac ttgttctttt gtcagatttg ttaaatgcat gcagaataat
2701 atttttaaga gtattgattg aagtttgtga tattcatcaa taaaaatgag ttgataataa
2761 aaaaaaaaaa aaaaaa
//