LOCUS       BC022021                1696 bp    mRNA    linear   HUM 17-JUL-2006
DEFINITION  Homo sapiens UDP-N-acetyl-alpha-D-galactosamine:polypeptide
            N-acetylgalactosaminyltransferase-like 5, mRNA (cDNA clone
            MGC:26636 IMAGE:4825619), complete cds.
ACCESSION   BC022021
VERSION     BC022021.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1696)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 1696)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (22-JAN-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Aug 25, 2003 this sequence version replaced BC022021.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
            cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki
            Toshiyuki and Piero Carninci (RIKEN)
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 33 Row: l Column: 20
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 34222191.
FEATURES             Location/Qualifiers
     source          1..1696
                     /db_xref="H-InvDB:HIT000039356"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:26636 IMAGE:4825619"
                     /tissue_type="Testis"
                     /clone_lib="NIH_MGC_97"
                     /lab_host="DH10B"
                     /note="Vector: pBluescriptR"
     gene            1..1696
                     /gene="GALNTL5"
                     /db_xref="GeneID:168391"
                     /db_xref="HGNC:HGNC:21725"
     CDS             224..1555
                     /gene="GALNTL5"
                     /codon_start=1
                     /product="UDP-N-acetyl-alpha-D-galactosamine:polypeptide
                     N-acetylgalactosaminyltransferase-like 5"
                     /protein_id="AAH22021.1"
                     /db_xref="GeneID:168391"
                     /db_xref="HGNC:HGNC:21725"
                     /translation="MRNAIIQGLFYGSLTFGIWTALLFIYLHHNHVSSWQKKSQEPLS
                     AWSPGKKVHQQIIYGSEQIPKPHVIVKRTDEDKAKSMLGTDFNHTNPELHKELLKYGF
                     NVIISRSLGIEREVPDTRSKMRLQKHYPARLPTASIVICFYNEECNALFQTMSSVTNL
                     TPHYFLEEIILVDDMSKVDDLKEKLDYHLETFRGKVKIIRNKKREGLIRARLIGASHA
                     SGDVLVFLDSHCEVNRVWLEPLLHAIAKDPKMVVCPLIDVIDDRTLEYKPSPLVRGTF
                     DWNLQFKWDNVFSYEMDGPEGSTKPIRSPAMSGGIFAIRRHYFNEIGQYDKDMDFWGR
                     ENLELSLRIWMCGGQLFIIPCSRVGHISKKQTGKPSTIISAMTHNYLRLVHVWLDEYK
                     EQFFLRKPGLKYVTYGNIRERVELRKRLGCKSFQWYLDNVFPELEASVNSL"
BASE COUNT      551 a    325 c    368 g    452 t
ORIGIN
        1 actgtgtggc cccgcaaccc agcacaacca ggtatctgct tggaacccag ccaccataaa
       61 gcctgctagc taaaaaaaat tttacatctc tcagttcatt cggcacagac ccctgcctca
      121 ttcagctgtg actctgcttg gaaaattcat cagttacaaa gcagccaatg caattatctc
      181 aagggaaatt gaaaaatgga cctttgaaaa tgctagattt acaatgagaa atgccataat
      241 tcaaggttta ttctatgggt ccttgacatt tgggatctgg acagctctgt tattcatata
      301 tttgcaccat aatcatgtga gcagctggca gaagaaaagc caggagcctc tgtcagcttg
      361 gtcccctgga aaaaaagtgc atcagcaaat tatctatggc tcagagcaaa taccaaaacc
      421 tcatgtaata gtcaaaagga ctgatgaaga taaagcaaag tctatgttag gtacagattt
      481 taaccataca aacccagaac ttcataaaga acttttaaaa tatggattta atgtgattat
      541 cagtagaagc ttgggcatcg aaagagaagt gccagatacc aggagtaaaa tgcgtcttca
      601 aaaacattac ccagcccgcc tcccgactgc cagcattgtc atttgcttct ataatgaaga
      661 atgtaatgcc ttgtttcaga ccatgtccag tgtcacgaac ctcacgccac actattttct
      721 tgaagaaatt attttggtag atgacatgag caaagttgat gatttgaaag aaaaactaga
      781 ctatcacctg gaaacttttc ggggaaaggt taaaataata agaaacaaaa agagagaggg
      841 gctgattcga gcaaggctga ttggagcttc tcatgcttca ggggatgttc tggtgttcct
      901 ggacagccac tgtgaggtga acagagtatg gctggagccc ctgctgcatg ccattgccaa
      961 ggaccccaaa atggtggtgt gccccctgat agatgtcatt gatgatagaa ctctggagta
     1021 taagccctct cctcttgtaa ggggaacttt tgattggaac ctacaattta aatgggataa
     1081 tgttttctct tatgagatgg atggaccaga aggatctact aaaccaatcc ggtcacctgc
     1141 aatgtctgga ggaatttttg ctatacgtcg gcattatttt aatgaaattg gacagtatga
     1201 caaggatatg gatttttggg gaagagaaaa tttggaactt tcactaagga tctggatgtg
     1261 tggaggccaa ctctttataa tcccctgctc tcgagtagga catatcagta agaaacaaac
     1321 tggaaaacct tctacaatca tcagtgctat gacacataac tacctaagac tggtgcacgt
     1381 ttggctggat gaatataagg agcagttttt tcttcgaaag cctggtctga aatatgtcac
     1441 ctacggaaat attcgcgagc gtgttgagtt aaggaaacga ctgggttgca agtcatttca
     1501 gtggtatttg gataatgtct tcccagagtt ggaggcatct gtgaacagcc tgtgaaagga
     1561 aaacaaatca ctttcattaa taaagggtta aaagtctcct agtcattcaa catagtgtca
     1621 caagagtgta agtttggaac atcgtggaat tacgtgaaat gcaattaaaa aaatatgacc
     1681 aaaaaaaaaa aaaaaa
//