LOCUS BC034998 2796 bp mRNA linear HUM 18-MAR-2009 DEFINITION Homo sapiens prolyl 4-hydroxylase, alpha polypeptide I, mRNA (cDNA clone MGC:33137 IMAGE:4797051), complete cds. ACCESSION BC034998 VERSION BC034998.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2796) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2796) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (31-JUL-2002) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Miklos Palkovits, M.D., Ph.D. cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki Toshiyuki and Piero Carninci (RIKEN) cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Institute for Systems Biology http://www.systemsbiology.org contact: amadan@systemsbiology.org Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 46 Row: a Column: 23 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 63252885. FEATURES Location/Qualifiers source 1..2796 /db_xref="H-InvDB:HIT000051291" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:33137 IMAGE:4797051" /tissue_type="Brain, hypothalamus" /clone_lib="NIH_MGC_96" /lab_host="DH10B" /note="Vector: pBluescriptR" gene 1..2796 /gene="P4HA1" /db_xref="GeneID:5033" /db_xref="HGNC:HGNC:8546" /db_xref="MIM:176710" CDS 135..1739 /gene="P4HA1" /codon_start=1 /product="prolyl 4-hydroxylase, alpha polypeptide I" /protein_id="AAH34998.1" /db_xref="GeneID:5033" /db_xref="HGNC:HGNC:8546" /db_xref="MIM:176710" /translation="MIWYILIIGILLPQSLAHPGFFTSIGQMTDLIHTEKDLVTSLKD YIKAEEDKLEQIKKWAEKLDRLTSTATKDPEGFVGHPVNAFKLMKRLNTEWSELENLV LKDMSDGFISNLTIQRQYFPNDEDQVGAAKALLRLQDTYNLDTDTISKGNLPGVKHKS FLTAEDCFELGKVAYTEADYYHTELWMEQALRQLDEGEISTIDKVSVLDYLSYAVYQQ GDLDKALLLTKKLLELDPEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTPKKK GVAVDYLPERQKYEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDK PRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENP VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRI ATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVG NKWVSNKWLHERGQEFRRPCTLSELE" BASE COUNT 870 a 494 c 596 g 836 t ORIGIN 1 agcggagtcg cgccgccagc gggctgaggg taggaagtag ccgctccgag tggaggcgac 61 tgggggctga agagcgcgcc gccctctcgt cccactttcc aggtgtgtga tcctgtaaaa 121 ttaaatcttc caagatgatc tggtatatat taattatagg aattctgctt ccccagtctt 181 tggctcatcc aggctttttt acttcaattg gtcagatgac tgatttgatc catactgaga 241 aagatctggt gacttctctg aaagattata ttaaggcaga agaggacaag ttagaacaaa 301 taaaaaaatg ggcagagaag ttagatcggc taactagtac agcgacaaaa gatccagaag 361 gatttgttgg gcatccagta aatgcattca aattaatgaa acgtctgaat actgagtgga 421 gtgagttgga gaatctggtc cttaaggata tgtcagatgg ctttatctct aacctaacca 481 ttcagagaca gtactttcct aatgatgaag atcaggttgg ggcagccaaa gctctgttac 541 gtctccagga tacctacaat ttggatacag ataccatctc aaagggtaat cttccaggag 601 tgaaacacaa atcttttcta acggctgagg actgctttga gttgggcaaa gtggcctata 661 cagaagcaga ttattaccat acggaactgt ggatggaaca agccctaagg caactggatg 721 aaggcgagat ttctaccata gataaagtct ctgttctaga ttatttgagc tatgcggtat 781 atcagcaggg agacctggat aaggcacttt tgctcacaaa gaagcttctt gaactagatc 841 ctgaacatca gagagctaat ggtaacttaa aatattttga gtatataatg gctaaagaaa 901 aagatgtcaa taagtctgct tcagatgacc aatctgatca gaaaactaca ccaaagaaaa 961 aaggggttgc tgtggattac ctgccagaga gacagaagta cgaaatgctg tgccgtgggg 1021 agggtatcaa aatgacccct cggagacaga aaaaactctt ttgccgctac catgatggaa 1081 accgtaatcc taaatttatt ctggctccag ctaaacagga ggatgaatgg gacaagcctc 1141 gtattattcg cttccatgat attatttctg atgcagaaat tgaaatcgtc aaagacctag 1201 caaaaccaag gctgaggcga gccaccattt caaacccaat aacaggagac ttggagacgg 1261 tacattacag aattagcaaa agtgcctggc tctctggcta tgaaaatcct gtggtgtctc 1321 gaattaatat gagaatacaa gatctaacag gactagatgt ttccacagca gaggaattac 1381 aggtagcaaa ttatggagtt ggaggacagt atgaacccca ttttgacttt gcacggaaag 1441 atgagccaga tgctttcaaa gagctgggga caggaaatag aattgctaca tggctgtttt 1501 atatgagtga tgtgtctgca ggaggagcca ctgtttttcc tgaagttgga gctagtgttt 1561 ggcccaaaaa aggaactgct gttttctggt ataatctgtt tgccagtgga gaaggagatt 1621 atagtacacg gcatgcagcc tgtccagtgc tagttggcaa caaatgggta tccaataaat 1681 ggctccatga acgtggacaa gaatttcgaa gaccttgtac gttgtcagaa ttggaatgac 1741 aaacaggctt ccctttttct cctattgttg tactcttatg tgtctgatat acacatttcc 1801 tagtcttaac tttcaggagt ttacaattga ctaacactcc atgattgatt cagtcatgaa 1861 cctcatccca tgtttcatct gtggacaatt gcttactttg tgggttcttt taaaagtaac 1921 acgaaatcat catattgcat aaaaccttaa agttctgttg gtatcacaga agacaaggca 1981 gagtttaaag tgaggaattt tatatttaaa gaactttttg gttggataaa aacataattt 2041 gagcatccag ttttagtatt tcactacatc tcagttggtg ggtgttaagc tagaatgggc 2101 tgtgtgatag gaaacaaatg ccttacagat gtgcctaggt gttctgttta cctagtgtct 2161 tactctgttt tctggatctg aagactagta ataaactagg acactaactg ggttccatgt 2221 gattgccctt tcatatgatc ttctaagttg atttttttcc tcccaagtct tttttaaaga 2281 aagtatactg tattttacca accccctctc ttttctttta gctcctctgt ggtgaattaa 2341 acgtacttga gttaaaatat ttcgattttt tttttttttt taatggaaag tcctgcataa 2401 caacactggg ccttcttaac taaaatgctc accacttagc ctgttttttt atcccttttt 2461 taaaatgaca gatgattttg ttcaggaatt ttgctgtttt tcttagtgct aataccttgc 2521 ctcttattcc tgttacagca gggtggtaat attggcattc tgattaaata ctgtgcctta 2581 ggagactgga agtttaaaaa tgtacaagtc ctttcagtga tgagggaatt gatttttttt 2641 aaaagtcttt ttcttagaaa gccaaaatgt ttgttttttt aagattctga aatgtgttgt 2701 gacaacaatg acctatttat gatcttaaat cttttttaaa aaaaaaaaaa aaaaaaaaaa 2761 aaaaaaaaaa aaaaaaaaag aaaaaaaaaa aaaaaa //