LOCUS BC040452 2129 bp mRNA linear HUM 28-SEP-2006 DEFINITION Homo sapiens family with sequence similarity 114, member A1, mRNA (cDNA clone MGC:17300 IMAGE:3846822), complete cds. ACCESSION BC040452 VERSION BC040452.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2129) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2129) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (29-NOV-2002) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Sequencing Group at the Stanford Human Genome Center, Stanford University School of Medicine, Stanford, CA 94305 Web site: http://www-shgc.stanford.edu Contact: (Dickson, Mark) mcd@paxil.stanford.edu Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers, R. M. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 20 Row: h Column: 15 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 29789372. FEATURES Location/Qualifiers source 1..2129 /db_xref="H-InvDB:HIT000052389" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:17300 IMAGE:3846822" /tissue_type="Colon, adenocarcinoma" /clone_lib="NIH_MGC_65" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene 1..2129 /gene="FAM114A1" /gene_synonym="Noxp20" /db_xref="GeneID:92689" /db_xref="HGNC:HGNC:25087" CDS 177..1868 /gene="FAM114A1" /gene_synonym="Noxp20" /codon_start=1 /product="family with sequence similarity 114, member A1" /protein_id="AAH40452.1" /db_xref="GeneID:92689" /db_xref="HGNC:HGNC:25087" /translation="MSDDAGDTLATGDKAEVTEMPNSDSLPEDAEVHCDSAAVSHEPT PADPRGEGHENAAVQGAGAAAIGPPVQPQDANALEPPLNRDVTEDTLAECIDSVSLEA EPRSEIPLQEQNYPAVDSPPSGGGWAGWGSWGKSLLSSASATVGHGLTAVKEKAGATL RIHGVNSGSSEGAQPNTENGVPEITDAATDQGPAESPPTSPSSASRGMLSAITNVVQN TGKSVLTGGLDALEFIGKKTMNVLAESDPGFKRTKTLMERTVSLSQMLREAKEKEKQR LAQQLTMERTAHYGMLFDEYQGLSHLEALEILSNESESKVQSFLASLDGEKLELLKND LISIKDIFAAKELENEENQEEQGLEEKGEEFARMLTELLFELHVAATPDKLNKAMKRA HDWVEEDQTVVSVDVAKVSEEETKKEEKEEKSQDPQEDKKEEKKTKTIEEVYMSSIES LAEVTARCIEQLHKVAELILHGQEEEKPAQDQAKVLIKLTTAMCNEVASLSKKFTNSL TTVGSNKKAEVLNPMISSVLLEGCNSTTYIQDAFQLLLPVLQVSHIQTSCLKAQP" BASE COUNT 646 a 501 c 517 g 465 t ORIGIN 1 gtactcggcc gcctgagcgc cgacccacgc cctgcctgct cttccagtcc agccaacact 61 ctaagcaggc accgcctcca ctcgcagccc ccgggatggg tcccactccc taccgcagat 121 ccccaggccc cctccaccca gtcggctagc cctcgcctct gccatccgat actaaaatgt 181 ctgatgatgc tggtgacacc ttagccactg gagacaaagc agaagttact gagatgccta 241 atagtgattc tttacctgag gatgcagaag tgcattgtga ttcagctgca gtttcacatg 301 agccaacacc agctgacccc agaggggagg ggcatgaaaa tgcagctgtg cagggtgcag 361 gggctgccgc cattgggccc cctgtgcagc ctcaggatgc caacgccctg gagccccctc 421 tcaatagaga cgtgactgag gatacacttg ctgaatgtat tgattccgtc agccttgagg 481 cagaacccag atccgaaata cccctgcaag aacagaatta tccggctgtg gattcccctc 541 caagtggagg aggatgggca ggctggggat cctggggcaa atctctgctg tcgtcagcat 601 ctgccacagt aggtcatgga ttgacggcag tcaaggaaaa agcaggagcc actctacgga 661 ttcatggtgt aaattctgga tcttctgaag gagcccaacc aaatactgaa aacggagtcc 721 ctgaaataac agatgcagcc acagatcagg gccctgcaga aagcccaccc acttcccctt 781 catcagcctc tcggggtatg ctgtctgcca tcaccaatgt ggttcaaaac acaggtaaaa 841 gtgtcttaac tggaggcctt gatgcgttgg aattcatcgg caagaaaacc atgaatgtcc 901 ttgcagaaag tgacccgggc tttaagcgga ccaagacgct catggagaga actgtttcct 961 tgtctcagat gttaagggaa gctaaggaga aggagaagca gagactggca cagcagctca 1021 cgatggagag aaccgcgcac tacgggatgc tgtttgatga atatcaaggc ttgtcacacc 1081 tggaagccct ggaaattctg tccaatgaaa gcgaaagcaa ggttcagtca tttttagcat 1141 cacttgatgg agagaagctg gaactcttaa aaaatgacct aatttccatt aaagacatct 1201 ttgcagccaa agaattagag aatgaagaaa atcaagaaga acaaggctta gaagaaaagg 1261 gagaagaatt tgctcgcatg cttacagagc ttctctttga attacatgtg gcggccacac 1321 ctgacaaact caataaggcc atgaagaggg ctcatgactg ggtggaagag gatcaaaccg 1381 tggtgtcagt agatgtggca aaagtgtccg aagaagaaac aaagaaggaa gaaaaggaag 1441 agaaatctca agaccctcaa gaagacaaaa aggaggaaaa gaaaactaag accatagagg 1501 aagtatacat gtcgtccatt gaaagtctgg cggaggtaac agcgcgctgt attgagcagc 1561 ttcataaagt agcagaatta attcttcatg gacaagaaga ggaaaaacca gctcaggacc 1621 aagcaaaagt tctaataaaa ttaactactg caatgtgcaa tgaagtggcc tctttatcaa 1681 agaagtttac gaattcttta accactgttg ggagcaacaa gaaggccgag gtccttaacc 1741 ccatgatcag tagtgtattg ttagagggct gcaacagtac aacgtacata caggatgcct 1801 tccagctgct gctgcctgtt ctgcaggtct cacatatcca gaccagttgt ttgaaagcac 1861 agccgtgacc tggccagact ccatctagtt aaaggagaca gctggccgcc ttgcctcaat 1921 atgtaccatt taaggggatg ttctctgtgc gcctggccac agacatccat ttgaggacac 1981 tacaagcaat tttgcacaga caatattgag aatgcaaatt tagagagagt tatcatttct 2041 ctcaatgtgt ataattgttt ttacaaacaa ttgtgttttc tttatgttaa tttaaactta 2101 cacagcttat attgaaaaaa aaaaaaaaa //