LOCUS BC001843 2799 bp mRNA linear HUM 02-DEC-2006
DEFINITION Homo sapiens YY1 associated protein 1, mRNA (cDNA clone MGC:4481
IMAGE:2961648), complete cds.
ACCESSION BC001843
VERSION BC001843.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2799)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2799)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (29-JAN-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Aug 20, 2003 this sequence version replaced BC001843.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Institute for Systems Biology
http://www.systemsbiology.org
contact: amadan@systemsbiology.org
Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 10 Row: l Column: 5
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 20986485.
FEATURES Location/Qualifiers
source 1..2799
/db_xref="H-InvDB:HIT000030690"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:4481 IMAGE:2961648"
/tissue_type="Muscle, rhabdomyosarcoma"
/clone_lib="NIH_MGC_17"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..2799
/gene="YY1AP1"
/gene_synonym="FLJ10875"
/gene_synonym="FLJ13914"
/gene_synonym="HCCA1"
/gene_synonym="HCCA2"
/gene_synonym="YAP"
/gene_synonym="YY1AP"
/db_xref="GeneID:55249"
/db_xref="HGNC:HGNC:30935"
/db_xref="MIM:607860"
CDS 315..2534
/gene="YY1AP1"
/gene_synonym="FLJ10875"
/gene_synonym="FLJ13914"
/gene_synonym="HCCA1"
/gene_synonym="HCCA2"
/gene_synonym="YAP"
/gene_synonym="YY1AP"
/codon_start=1
/product="YY1 associated protein 1"
/protein_id="AAH01843.2"
/db_xref="GeneID:55249"
/db_xref="HGNC:HGNC:30935"
/db_xref="MIM:607860"
/translation="MGFSNMEDDGPEEEERVAEPQANFNTPQALRFEELLANLLNEQH
QIAKELFEQLKMKKPSAKQQKEVEKVKPQCKEVHQTLILDPAQRKRLQQQMQQHVQLL
TQIHLLATCNPNLNPEASSTRICLKELGTFAQSSIALHHQYNPKFQTLFQPCNLMGAM
QLIEDFSTHVSIDCSPHKTVKKTANEFPCLPKQVAWILATSKVFMYPELLPVCSLKAK
NPQDKILFTKAEDNLLALGLKHFEGTEFLNPLISKYLLTCKTARQLTVRIKNLNMNRA
PDNIIKFYKKTKQLPVLGKCCEEIQPHQWKPPIEREEHRLPFWLKASLPSIQEELRHM
ADGAREVGNMTGTTEINSDQGLEKDNSELGSETRYPLLLPKGVVLKLKPVADRFPKKA
WRQKRSSVLKPLLIQPSPSLQPSFNPGKTPAQSTHSEAPPSKMVLRIPHPIQPATVLQ
TVPGVPPLGVSGGESFESPAALPAMPPEARTSFPLSESQTLLSSALVPKVMMPSPASS
MFRKPYVRRRPSKRRGARAFRCIKPAPVIHPASVIFTVPATTVKIVSLGGGCNMIQPV
NAAVAQSPQTIPIATLLVNPTSFPCPLNQPLVASSVSPLIVSGNSVNLPIPSTPEDKA
HMNVDIACAVADGENAFQGLEPKLEPQELSPLSATVFPKVEHSPGPPPVDKQCQEGLS
ENSAYRWTVVKTEEGRQALEPLPQGIQESLNNSSPGDLEEVVKMEPEDATEEISGFL"
BASE COUNT 790 a 750 c 639 g 620 t
ORIGIN
1 agagagagag aagaggaggt ggagaaggct tgggctcgcg ccgctgaagt cggcttaccc
61 gctggccgcc tcctgacaag cgggagggat ccgcggtgga cccagggaag cggaggagcc
121 tggcggccac cccctcttcc tcacttccct gtactctcat cgctctcggc ctccgacacg
181 aaaaggaagc aaatgagctg atggaagatc tgtttgaaac tagacagggt cttgccatgt
241 tctggaactc ataggctcaa gtaatcttcc tgcctcaacc tcccaaagtg ctggaattac
301 agttccaaga tgagatggga ttctccaaca tggaagatga tggcccagaa gaggaggagc
361 gtgtggctga gcctcaagct aactttaaca cccctcaagc tctacggttt gaggaactac
421 tggccaacct actaaatgaa caacatcaga tagcgaagga actatttgaa cagctgaaga
481 tgaagaaacc ttcagccaaa cagcagaagg aggtagagaa ggttaaaccc cagtgtaagg
541 aagttcatca gaccctgatt ctggacccag cacaaaggaa gagactccag cagcagatgc
601 agcagcatgt tcagctcttg acacaaatcc accttcttgc cacctgcaac cccaatctca
661 atccggaggc cagtagcacc aggatatgtc ttaaagagct gggaaccttt gctcaaagct
721 ccatcgccct tcaccatcag tacaacccca agtttcagac cctgttccaa ccctgtaact
781 tgatgggagc tatgcagctg attgaagact tcagcacaca tgtcagcatt gactgcagcc
841 ctcataaaac tgtcaagaag actgccaatg aatttccctg tttgccaaag caagtggctt
901 ggatcctggc cacaagcaag gttttcatgt atccagagtt acttccagtg tgttccctga
961 aggcaaagaa tccccaggat aagatcctct tcaccaaggc tgaggacaat ttgttagctt
1021 taggactgaa gcattttgaa gggactgagt ttcttaaccc tctaatcagc aagtaccttc
1081 taacctgcaa gactgcccgc caactgacag tgagaatcaa gaacctcaac atgaacagag
1141 ctcctgacaa catcattaaa ttttataaga agaccaaaca gctgccagtc ctaggaaaat
1201 gctgtgaaga gatccagcca catcagtgga agccacctat agagagagaa gaacaccggc
1261 tcccattctg gttaaaggcc agtctgccat ccatccagga agaactgcgg cacatggctg
1321 atggtgctag agaggtagga aatatgactg gaaccactga gatcaactca gatcaaggcc
1381 tagaaaaaga caactcagag ttggggagtg aaactcggta cccactgcta ttgcctaagg
1441 gtgtagtcct gaaactgaag ccagttgccg accgtttccc caagaaggct tggagacaga
1501 agcgttcatc agtcctgaaa cccctcctta tccaacccag cccctctctc cagcccagct
1561 tcaaccctgg gaaaacacca gcccaatcaa ctcattcaga agcccctccg agcaaaatgg
1621 tgctccggat tcctcaccca atacagccag ccactgtttt acagacagtt ccaggtgtcc
1681 ctccactggg ggtcagtgga ggtgagagtt ttgagtctcc tgcagcactg cctgctatgc
1741 cccctgaggc caggacaagc ttccctctgt ctgagtccca gactttgctc tcttctgccc
1801 ttgtgcccaa ggtaatgatg ccctcccctg cctcttccat gtttcgaaag ccatatgtga
1861 gacggagacc ctcaaaaaga aggggagcca gggcctttcg ctgtatcaaa cctgcccctg
1921 ttatccaccc tgcatctgtt atcttcactg ttcctgctac cactgtgaag attgtgagcc
1981 ttggcggtgg ctgtaacatg atccagcctg tcaatgcggc tgtggcccag agtccccaga
2041 ctattcccat cgccaccctc ttggttaacc ctacttcctt cccctgtcca ttgaaccagc
2101 cccttgtggc ctcctctgtc tcacccttaa ttgtttctgg caattctgtg aatcttccta
2161 taccatccac ccctgaagat aaggcccaca tgaatgtgga cattgcttgt gctgtggctg
2221 atggggaaaa tgcctttcag ggcctagaac ccaaattaga gccccaggaa ctatctcctc
2281 tctctgctac tgttttcccc aaagtggaac atagcccagg gcctccacca gtcgataaac
2341 agtgccaaga aggattgtca gagaacagtg cctatcgctg gaccgttgtg aaaacagagg
2401 agggaaggca agctctggag ccgctccctc agggcatcca ggagtctcta aacaactctt
2461 cccctgggga tttagaggaa gttgtcaaga tggaacctga agatgctaca gaggaaatca
2521 gtggatttct ttgagctagg agaataagag tctggagact gggagccttc acttcggcct
2581 ccgattggtg gcgcataggg tgtaaccaat aggaaacccc taaagggtac ttaaacccca
2641 gattttgcaa ctggggctct tgagcagctt gctttagcct gctcccactc tgtggaatat
2701 acttttgctt caataaatct gtgcttttat tgctaaaaaa aaaaaaaaaa aaaaaaaaaa
2761 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaa
//