LOCUS       BC068585                3206 bp    mRNA    linear   HUM 17-JUL-2006
DEFINITION  Homo sapiens HERV-FRD provirus ancestral Env polyprotein, mRNA
            (cDNA clone MGC:87585 IMAGE:30346522), complete cds.
ACCESSION   BC068585
VERSION     BC068585.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3206)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3206)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (02-APR-2004) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Dr. Stefan Hansson
            cDNA Library Preparation: Michael Brownstein /  Ted Usdin
            Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA  94305
            Web site:       http://www-shgc.stanford.edu
            Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 168 Row: o Column: 15
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 46485772.
FEATURES             Location/Qualifiers
     source          1..3206
                     /db_xref="H-InvDB:HIT000263141"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:87585 IMAGE:30346522"
                     /tissue_type="Placenta, normal"
                     /clone_lib="NIH_MGC_147"
                     /lab_host="DH10B"
                     /note="Vector: pBluescriptR"
     gene            1..3206
                     /gene="HERV-FRD"
                     /gene_synonym="FLJ41944"
                     /gene_synonym="GLLL6191"
                     /gene_synonym="MGC87585"
                     /gene_synonym="UNQ6191"
                     /db_xref="GeneID:405754"
     CDS             371..1987
                     /gene="HERV-FRD"
                     /gene_synonym="FLJ41944"
                     /gene_synonym="GLLL6191"
                     /gene_synonym="MGC87585"
                     /gene_synonym="UNQ6191"
                     /codon_start=1
                     /product="HERV-FRD provirus ancestral Env polyprotein"
                     /protein_id="AAH68585.1"
                     /db_xref="GeneID:405754"
                     /translation="MGLLLLVLILTPSLAAYRHPDFPLLEKAQQLLQSTGSPYSTNCW
                     LCTSSSTETPGTAYPASPREWTSIEAELHISYRWDPNLKGLMRPANSLLSTVKQDFPD
                     IRQKPPIFGPIFTNINLMGIAPICVMAKRKNGTNVGTLPSTVCNVTFTVDSNQQTYQT
                     YTHNQFRHQPRFPKPPNITFPQGTLLDKSSRFCQGRPSSCSTRNFWFRPADYNQCLQI
                     SNLSSTAEWVLLDQTRNSLFWENKTKGANQSQTPCVQVLAGMTIATSYLGISAVSEFF
                     GTSLTPLFHFHISTCLKTQGAFYICGQSIHQCLPSNWTGTCTIGYVTPDIFIAPGNLS
                     LPIPIYGNSPLPRVRRAIHFIPLLAGLGILAGTGTGIAGITKASLTYSQLSKEIANNI
                     DTMAKALTTMQEQIDSLAAVVLQNRRGLDMLTAAQGGICLALDEKCCFWVNQSGKVQD
                     NIRQLLNQASSLRERATQGWLNWEGTWKWFSWVLPLTGPLVSLLLLLLFGPCLLNLIT
                     QFVSSRLQAIKLQTNLSAGRHPRNIQESPF"
BASE COUNT          901 a          877 c          632 g          796 t
ORIGIN      
        1 ggcacttgga tctctcaaat ggtgcagtga ctcggatacc ttccctagtg ccattacagt
       61 actggagact gccagctaga tccatcacac ccaagtgaag ctgtggaaaa gcccttaaac
      121 tccagagcca gaaccagcaa cctcagctcc ggaatacact tgcaaggcac tggaagatct
      181 aaaattcctc tttaaacaaa aagataagta atgccccacc aacatccttt cacctcaaag
      241 taaggtgatc ccaatactag aaattttact ggcaattgct ctgattgtta tcactatttt
      301 aaccctaact tgtacaccac caggagttcc attggcagct cgttttgtga ccagtttctc
      361 ttaggtcacc atgggcctgc tcctgctggt tctcattctc acgccttcac tagcagccta
      421 ccgccatcct gatttcccgt tattggaaaa agctcagcaa ctgctccaaa gtacaggatc
      481 cccttactcc accaattgct ggttatgtac tagctcttcc actgaaacac cagggacagc
      541 ttatccagcc tcgcccagag aatggacaag catagaggcg gaattacata tttcctatcg
      601 atgggaccct aatctgaaag gactgatgag gcctgcaaat agtcttcttt caacagtaaa
      661 gcaagatttc cctgatatcc gccagaaacc tcccattttc ggacccatct ttactaatat
      721 caacctaatg ggaatagccc ctatttgtgt tatggccaaa aggaaaaatg gaacaaatgt
      781 aggcactctt ccaagtacag tctgtaatgt tactttcact gtagattcta accaacagac
      841 ttaccaaaca tacacccaca accaattccg ccatcaacca agattcccca aacctccaaa
      901 tattactttt cctcagggaa ctttgctaga taaatccagc cggttttgcc agggacgccc
      961 aagctcatgc agtactcgaa acttctggtt ccggcctgct gattataacc aatgtctgca
     1021 aatttccaac ctcagctcta cagcggaatg ggttctattg gaccaaactc gaaattctct
     1081 tttttgggaa aataaaacca agggagctaa ccagagccaa acaccctgcg tccaagtctt
     1141 agcaggcatg actatagcca ccagctacct gggcatatca gcagtctcag aattttttgg
     1201 aacctccctc acccccttat ttcatttcca tatctctaca tgccttaaaa ctcaaggagc
     1261 cttttatatt tgtggccagt cgattcacca atgcctcccc agtaactgga ctggaacttg
     1321 taccataggc tatgtaaccc cagacatctt catagcccct ggcaatctct ctcttccaat
     1381 accaatctat gggaattccc cgttgcccag ggtgaggagg gcaatccatt tcattcccct
     1441 tctcgcggga ctcggcattc tagctggtac gggaaccgga attgctggaa tcacaaaagc
     1501 ttccctcacc tatagccagc tctcaaagga aatagccaac aacattgaca ccatggctaa
     1561 agccttaacg accatgcaag aacaaatcga ctctttagca gccgtagtcc ttcaaaatcg
     1621 tcgaggacta gacatgttaa cggcagcaca gggaggaatt tgtttggcct tagatgaaaa
     1681 atgttgcttt tgggtaaatc aatcaggaaa agtacaagac aacatcagac aactcctaaa
     1741 tcaagcctcc agtttacggg aacgagccac tcagggttgg ttaaattggg aaggaacttg
     1801 gaaatggttc tcttgggttc ttccccttac aggcccactt gttagtctcc tacttttgct
     1861 cctttttggt ccatgtctcc taaatctaat aacccaattt gtctcctctc gccttcaggc
     1921 cataaagctc cagacgaatc tcagtgcagg acgccatcct cgcaatattc aagagtcacc
     1981 cttctaagga ggacccctag actgctcgct agtggaacac gacagaggcg aaatcctgcc
     2041 ccgtctcccg tggacctggc tggatatggt ttttgccaat ccacagagcc atcctgccct
     2101 gacagctagc aagaggccaa gacccacaga acaaccacta cagcccctct gtcagcagga
     2161 agcagttaaa gaagactgac cttcgtccat tttcccagat aattgggtct tggactcttg
     2221 aggtggggaa atgttggagc aggtagctag tcagacatga gcagggcagg ggagggcccc
     2281 ctcaccagga atgtcaggca accatcaggt gatggtcagg cagttgttaa gctgtgtctc
     2341 taacataata atgagtggca gctggcgcca gggaactatg gcctcccaat agataggaaa
     2401 cacctgaagc tggtgatcag ccgcttcctg ataagatctc aggagttggg tgcgcaggct
     2461 caagcatgca ccctaagagg caaaatagtg gcatttaact catatatgac cttcctttag
     2521 gaaggcttga ctggtaaggg aaaaactcct ccagtgaaca cgtgcacaac ttcagtaaaa
     2581 acactgcaca tgcgtcccct cccaagtgct ggcaggccac tgtgcatgca gacagcccgc
     2641 cccaaagaaa aatcagagga ggagaaatgg aaaccccgga aaaatgccaa tgtataaaac
     2701 cccaagtcaa gggcctacca aggcaattgg atctctcaag tcacccgctt ggctctcttc
     2761 aagtgcactt tgcttccttt tgttcttgct ctaaaacttt tactcctgct ctaaaacttg
     2821 ccttggacta tcatgctacc ttacgcctcc cgggccaaat tccctcctct cctccggggg
     2881 gcaaggatgg agtctgctgc agacccattg gatttgctgc tggtaacagt tccaccattt
     2941 aggttccagc accaagcaaa ctaacacccg actcagtgta aacagccaaa caagcttaac
     3001 caattagaaa ccaccatcta acctctaact aggtcctttc aactttaacc aagtattttc
     3061 tttgtcttgc ttctgtggga accttataaa attttccccc ttgtacctct gtagtagaga
     3121 cccagttgct tgcagtttgg ccctgcctgt tcatgaatca cccttgctca aataaactct
     3181 ctaaaatgct aaaaaaaaaa aaaaaa
//