LOCUS BC001096 1968 bp mRNA linear HUM 28-SEP-2006 DEFINITION Homo sapiens family with sequence similarity 114, member A1, mRNA (cDNA clone MGC:3278 IMAGE:3507281), complete cds. ACCESSION BC001096 VERSION BC001096.2 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1968) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 1968) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (11-DEC-2000) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Oct 28, 2003 this sequence version replaced BC001096.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: DCTD/DTP cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Institute for Systems Biology http://www.systemsbiology.org contact: amadan@systemsbiology.org Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 7 Row: e Column: 16 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 29789372. FEATURES Location/Qualifiers source 1..1968 /db_xref="H-InvDB:HIT000085958" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:3278 IMAGE:3507281" /tissue_type="Kidney, renal cell adenocarcinoma" /clone_lib="NIH_MGC_14" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..1968 /gene="FAM114A1" /gene_synonym="Noxp20" /db_xref="GeneID:92689" /db_xref="HGNC:HGNC:25087" CDS 11..1702 /gene="FAM114A1" /gene_synonym="Noxp20" /codon_start=1 /product="family with sequence similarity 114, member A1" /protein_id="AAH01096.2" /db_xref="GeneID:92689" /db_xref="HGNC:HGNC:25087" /translation="MSDDAGDTLATGDKAEVTEMPNSDSLPEDAEVHCDSAAVSHEPT PADPRGEGHENAAVQGAGAAAIGPPVQPQDANALEPPLNGDVTEDTLAECIDSVSLEA EPRSEIPLQEQNYLAVDSPPSGGGWAGWGSWGKSLLSSASATVGHGLTAVKEKAGATL RIHGVNSGSSEGAQPNTENGVPEITDAATDQGPAESPPTSPSSASRGMLSAITNVVQN TGKSVLTGGLDALEFIGKKTMNVLAESDPGFKRTKTLMERTVSLSQMLREAKEKEKQR LAQQLTMERTAHYGMLFDEYQGLSHLEALEILSNESESKVQSFLASLDGEKLELLKND LISIKDIFAAKELENEENQEEQGLEEKGEEFARMLTELLFELHVAATPDKLNKAMKRA HDWVEEDQTVVSVDVAKVSEEETKKEEKEEKSQDPQEDKKEEKKTKTIEEVYMSSIES LAEVTARCIEQLHKVAELILHGQEEEKPAQDQAKVLIKLTTAMCNEVASLSKKFTNSL TTVGSNKKAEVLNPMISSVLLEGCNSTTYIQDAFQLLLPVLQVSHIQTSCLKAQP" BASE COUNT 625 a 420 c 483 g 440 t ORIGIN 1 cgatactaaa atgtctgatg atgctggtga caccttagcc actggagaca aagcagaagt 61 tactgagatg cctaatagtg attctttacc tgaggatgca gaagtgcatt gtgattcagc 121 tgcagtttca catgagccaa caccagctga ccccagaggg gaggggcatg aaaatgcagc 181 tgtgcagggt gcaggggctg ccgccattgg gccccctgtg cagcctcagg atgccaacgc 241 cctggagccc cctctcaatg gagacgtgac tgaggataca cttgctgaat gtattgattc 301 cgtcagcctt gaggcagaac ccagatccga aatacccctg caagaacaga attatctggc 361 tgtggattcc cctccaagtg gaggaggatg ggcaggctgg ggatcctggg gcaaatctct 421 gctgtcgtca gcatctgcca cagtaggtca tggattgacg gcagtcaagg aaaaagcagg 481 agccactcta cggattcatg gtgtaaattc tggatcttct gaaggagccc aaccaaatac 541 tgaaaacgga gtccctgaaa taacagatgc agccacagat cagggccctg cagaaagccc 601 acccacttcc ccttcatcag cctctcgggg tatgctgtct gccatcacca atgtggttca 661 aaacacaggt aaaagtgtct taactggagg ccttgatgcg ttggaattca tcggcaagaa 721 aaccatgaat gtccttgcag aaagtgaccc gggctttaag cggaccaaga cgctcatgga 781 gagaactgtt tccttgtctc agatgttaag ggaagctaag gagaaggaga agcagagact 841 ggcacagcag ctcacgatgg agagaaccgc gcactacggg atgctgtttg atgaatatca 901 aggcttgtca cacctggaag ccctggaaat tctgtccaat gaaagcgaaa gcaaggttca 961 gtcattttta gcatcacttg atggagagaa gctggaactc ttaaaaaatg acctaatttc 1021 cattaaagac atctttgcag ccaaagaatt agagaatgaa gaaaatcaag aagaacaagg 1081 cttagaagaa aagggagaag aatttgctcg catgcttaca gagcttctct ttgaattaca 1141 tgtggcggcc acacctgaca aactcaataa ggccatgaag agggctcatg actgggtgga 1201 agaggatcaa accgtggtgt cagtagatgt ggcaaaagtg tccgaagaag aaacaaagaa 1261 ggaagaaaag gaagagaaat ctcaagaccc tcaagaagac aaaaaggagg aaaagaaaac 1321 taagaccata gaggaagtat acatgtcgtc cattgaaagt ctggcggagg taacagcgcg 1381 ctgtattgag cagcttcata aagtagcaga attaattctt catggacaag aagaggaaaa 1441 accagctcag gaccaagcaa aagttctaat aaaattaact actgcaatgt gcaatgaagt 1501 ggcctcctta tcaaagaagt ttacgaattc tttaaccact gttgggagca acaagaaggc 1561 cgaggtcctt aaccccatga tcagtagtgt attgttagag ggctgcaaca gtacaacgta 1621 catacaggat gccttccagc tgctgctgcc tgttctgcag gtctcacata tccagaccag 1681 ttgtttgaaa gcacagccgt gacctggcca gactccatct agttaaagga gacagctggc 1741 cgccttgcct caatatgtac catttaaggg gatgttctct gtgcgcctgg ccacagacat 1801 ccatttgagg acactacaag caattttgca cagacaatat tgagaatgca aatttagaga 1861 gagttatcat ttctctcaat gtgtataatt gtttttacaa acaattgtgt tttctttatg 1921 ttaatttaaa cttacacagc ttatattgaa aaaaaaaaaa aaaaaaaa //