LOCUS BC032859 3038 bp mRNA linear HUM 17-JUL-2006
DEFINITION Homo sapiens chromosome 1 open reading frame 101, mRNA (cDNA clone
MGC:33370 IMAGE:5269307), complete cds.
ACCESSION BC032859
VERSION BC032859.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 3038)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 3038)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (07-JUN-2002) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki
Toshiyuki and Piero Carninci (RIKEN)
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Institute for Systems Biology
http://www.systemsbiology.org
contact: amadan@systemsbiology.org
Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 47 Row: o Column: 17
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 31341128.
FEATURES Location/Qualifiers
source 1..3038
/db_xref="H-InvDB:HIT000051084"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:33370 IMAGE:5269307"
/tissue_type="Testis"
/clone_lib="NIH_MGC_97"
/lab_host="DH10B"
/note="Vector: pBluescriptR"
gene 1..3038
/gene="C1orf101"
/gene_synonym="MGC33370"
/gene_synonym="RP11-523K4.1"
/db_xref="GeneID:257044"
/db_xref="HGNC:HGNC:28491"
CDS 45..2543
/gene="C1orf101"
/gene_synonym="MGC33370"
/gene_synonym="RP11-523K4.1"
/codon_start=1
/product="chromosome 1 open reading frame 101"
/protein_id="AAH32859.1"
/db_xref="GeneID:257044"
/db_xref="HGNC:HGNC:28491"
/translation="MSAREVAVLLLWLSCYGSALWRYSTNSPNYRIFSTRSTIKLEYE
GTLFTEWSVPETCFVLNKSSPTTELRCSSPGVHAIKPIVTGPDEEERYLFVESSHTCF
LWYYRVRHFFNNFTQLITVWAYDPESADPDELLGNAEEPSINSIVLSTQMATLGQKPV
IHTVLKRKVYSSNEKMRRGTWRIVVPMTKDDALKEIRGNQVTFQDCFIADFLILLTFP
LLTIPEIPGYLPISSPRGSQLMASWDACVVASAVLVTDMETFHTTDSFKSWTRIRVPP
DILSDDERRSVAHVILSRDGIVFLINGVLYIKSFRGFIRLGGIVNLPDGGITGISSRK
WCWVNYLLKAKGRRSTFAVWTENEIYLGSILLKFARLVTTTELKNILSLSVTATLTID
RVEYTGHPLEIAVFLNYCTVCNVTKKIFLVIYNEDTKQWVSQDFTLDAPIDSVTMPHF
TFSALPGLLLWNKHSIYYCYHNFTFTGILQTPAGHGNLSMLSNDSIIHEVFIDYYGDI
LVKMENNVIFYSKINTRDAVKLHLWTNYTTRAFIFLSTSGQTYFLYALDDGTIQIQDY
PLHLEAQSIAFTTKDKCPYMAFHNNVAHVFYFLDKGEALTVWTQIVYPENTGLYVIVE
SYGPKILQESHEISFEAAFGYCTKTLTLTFYQNVDYERISDYFETQDKHTGLVLVQFR
PSEYSKACPIAQKVFQIAVGCDDKKFIAIKGFSKKGCHHHDFSYVIEKSYLRHQPSKN
LRVRYIWGEYGCPLRLDFTEKFQPVVQLFDDNGYVKDVEANFIVWEIHGRDDYSFNNT
MAQSGCLHEAQTWKSMIELNKHLPLEEVWGPEFL"
BASE COUNT 956 a 541 c 615 g 926 t
ORIGIN
1 agcgtgagtg gccgaggcgg ttgggcggag gcggagcagg cgccatgtca gcccgggaag
61 tggccgtgct gctgctgtgg ctgagctgct atggctccgc cctttggagg tattccacta
121 acagcccaaa ctatcgcatt tttagtacca gaagtactat taagttagag tatgaaggaa
181 cattatttac tgagtggagt gtgccagaaa cttgttttgt gctaaataaa agctcaccca
241 cgacagaatt gcgttgttcc tcacctggtg ttcacgctat aaaaccaatt gttactggcc
301 cagatgaaga agaacgctat ttatttgtgg aaagttctca tacttgcttt ctgtggtact
361 atagagttag acatttcttt aacaacttta cccagcttat cactgtgtgg gcatatgatc
421 cagaaagtgc agatcctgat gagttgctgg ggaatgcaga agaaccttca ataaattcca
481 tagtactcag cacacagatg gccacattgg gacagaagcc tgtcatacat acagttctga
541 agagaaaagt ttattcttca aatgagaaaa tgagaagggg tacctggcgt attgtagtac
601 caatgacaaa agatgatgca ctaaaggaga ttagaggaaa ccaagttact tttcaggatt
661 gctttattgc agattttctt attctgttga cttttccttt gttgaccata cctgaaattc
721 ctggttattt accaatctcc tcaccacgtg gtagtcaatt aatggcttcc tgggatgctt
781 gtgtagttgc atctgctgtt ttggtgacag atatggagac ctttcacaca actgattcat
841 tcaaatcttg gaccagaatc agagtgcctc cagacattct gagtgatgat gaaagacgga
901 gtgtggctca tgtgatctta tcgcgggatg gaatcgtttt tcttataaat ggtgttcttt
961 acataaagag ttttcgtgga tttataagac tgggaggaat tgtaaatctt cctgatggtg
1021 gaattactgg catttcatca agaaaatggt gttgggtcaa ttatttatta aaggctaaag
1081 gaagaagaag cacctttgca gtctggacag aaaatgaaat ttacctcgga tccattcttc
1141 ttaagtttgc cagattagta actaccacag aactgaaaaa catcctaagt ctatcggtga
1201 ctgctactct gaccatagac agggttgagt atacaggaca ccctctggag attgctgtgt
1261 ttttaaatta ttgcactgta tgtaacgtca ccaaaaagat tttcttagtg atatataatg
1321 aagatacaaa acagtgggtt tcccaagact ttacattaga tgcccctatt gacagtgtta
1381 ccatgccaca ttttacattt tcagcactgc caggattact gctatggaac aagcatagta
1441 tctactattg ttaccataat ttcaccttta ctgggatttt acagacacct gcaggacatg
1501 gaaatctatc aatgctatca aatgacagca ttattcatga agttttcata gattattatg
1561 gagatatttt ggtaaaaatg gaaaataatg taatatttta ttccaagatt aatactagag
1621 atgcagtaaa gctgcattta tggacaaatt acacaacaag agcattcatt ttcttaagta
1681 catctggtca aacatatttc ctgtatgctt tggatgatgg cacaatacaa atacaggact
1741 atcccttaca tctggaagca caaagtatag ctttcacaac aaaagacaaa tgcccataca
1801 tggcatttca taacaatgtt gctcatgttt tttacttttt ggacaaggga gaggctctga
1861 cagtttggac tcagatcgtc tatccagaaa acactggtct gtatgttatt gtggaatctt
1921 atggcccaaa aatattacaa gagagtcatg agatttcctt tgaagctgcc tttggatact
1981 gcaccaaaac tctgacacta acattttatc agaatgtaga ttatgagaga atatctgatt
2041 actttgagac acaagacaag cacacgggtc ttgtgctggt tcagtttcga cctagtgaat
2101 attcaaaagc atgtccaata gcccaaaagg tgttccaaat agctgttggc tgtgatgata
2161 aaaaattcat tgcaattaaa ggatttagta aaaaaggatg tcatcaccat gatttttcat
2221 acgtgattga aaagtcatat ctgaggcatc agccatcgaa aaacttgaga gtaaggtata
2281 tttggggaga atatggctgc cctctgaggc ttgacttcac agaaaagttt caacctgtgg
2341 ttcaactatt tgatgataat ggctatgtta aagacgttga agcaaatttc atagtgtggg
2401 aaatacacgg cagggatgac tatagcttta ataatactat ggcacagagt ggttgtttac
2461 atgaagcaca gacatggaag tcaatgattg aacttaacaa gcacctccca ctagaagaag
2521 tctggggacc tgagtttctg taacctaaca gctatgtttg caatagagac atttggactg
2581 attcccagtc caagtgtcta cctggtagct tctttcctct tcgtcctgat gctgctcttc
2641 ttcactattc ttgttttgag ctactttcgg tacatgagga tttatagacg atatatttat
2701 gaaccacttc acaaacctca aagaaaacgt aagaagaatt aggaaaactg aaagtttgtt
2761 tattacagat atatgcatat agagaaacag tgtattacat agtgatattg agagtgtgtg
2821 tttgaccaag aaatactaaa tataagctcg tagtagtagg catcaccaaa ttcaagatct
2881 gaaaaatatt cttgaactat ctccaaaata gaaatgtttt catatatatt gttattaaat
2941 taatcctttg tttgccttca ttttaaagat actctatgta ctctcacatg gcatgaaaaa
3001 ataaactaaa tttgactatt acaaaaaaaa aaaaaaaa
//