LOCUS BC023531 2956 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens chondroitin polymerizing factor, mRNA (cDNA clone
MGC:14188 IMAGE:4123041), complete cds.
ACCESSION BC023531
VERSION BC023531.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2956)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2956)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (05-FEB-2002) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Aug 20, 2003 this sequence version replaced BC023531.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D., McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 20 Row: d Column: 7
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 34222219.
FEATURES Location/Qualifiers
source 1..2956
/db_xref="H-InvDB:HIT000050813"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:14188 IMAGE:4123041"
/tissue_type="Muscle, rhabdomyosarcoma"
/clone_lib="NIH_MGC_17"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..2956
/gene="CHPF"
/gene_synonym="CSS2"
/gene_synonym="FLJ22678"
/db_xref="GeneID:79586"
CDS 179..2506
/gene="CHPF"
/gene_synonym="CSS2"
/gene_synonym="FLJ22678"
/codon_start=1
/product="chondroitin polymerizing factor"
/protein_id="AAH23531.1"
/db_xref="GeneID:79586"
/translation="MRASLLLSVLRPAGPVAVGISLGFTLSLLSVTWVEEPCGPGPPQ
PGDSELPPRGNTNAARRPNSVQPGAEREKPGAGEGAGENWEPRVLPYHPAQPGQAAKK
AVRTRYISTELGIRQRLLVAVLTSQTTLPTLGVAVNRTLGHRLERVVFLTGARGRRAP
PGMAVVTLGEERPIGHLHLALRHLLEQHGDDFDWFFLVPDTTYTEAHGLARLTGHLSL
ASAAHLYLGRPQDFIGGEPTPGRYCHGGFGVLLSRMLLQQLRPHLEGCRNDIVSARPD
EWLGRCILDATGVGCTGDHEGVHYSHLELSPGEPVQEGDPHFRSALTAHPVRDPVHMY
QLHKAFARAELERTYQEIQELQWEIQNTSHLAVDGDRAAAWPVGIPAPSRPASRFEVL
RWDYFTEQHAFSCADGSPRCPLRGADRADVADVLGTALEELNRRYHPALRLQKQQLVN
GYRRFDPARGMEYTLDLQLEALTPQGGRRPLTRRVQLLRPLSRVEILPVPYVTEASRL
TVLLPLAAAERDLAPGFLEAFATAALEPGDAAAALTLLLLYEPRQAQRVAHADVFAPV
KAHVAELERRFPGARVPWLSVQTAAPSPLRLMDLLSKKHPLDTLFLLAGPDTVLTPDF
LNRCRMHAISGWQAFFPMHFQAFHPAVAPPQGPGPPELGRDTGRFDRQAASEACFYNS
DYVAARGRLAAASEQEEELLESLDVYELFLHFSSLHVLRAVEPALLQRYRAQTCSARL
SEDLYHRCLQSVLEGLGSRTQLAMLLFEQEQGNST"
BASE COUNT 470 a 1044 c 942 g 500 t
ORIGIN
1 cgaggcgcgg ctccggggat tcggctcggg ccgctggctc tgctctgcgg ggagggagcg
61 ggcccgcccg cggggcccga gccctccgga tccgccccct ccccggtccc gccccctcgg
121 agactcctct ggctgctctg ggggttcgcc ggggccgggg acccgcggtc cgggcgccat
181 gcgggcatcg ctgctgctgt cggtgctgcg gcccgcaggg cccgtggccg tgggcatctc
241 cctgggcttc accctgagcc tgctcagcgt cacctgggtg gaggagccgt gcggcccagg
301 cccgccccaa cctggagact ctgagctgcc gccgcgcggc aacaccaacg cggcgcgccg
361 gcccaactcg gtgcagcccg gagcggagcg cgagaagccc ggggccggcg aaggcgccgg
421 ggagaattgg gagccgcgcg tcttgcccta ccaccctgca cagcccggcc aggccgccaa
481 aaaggccgtc aggacccgct acatcagcac ggagctgggc atcaggcaga ggctgctggt
541 ggcggtgctg acctctcaga ccacgctgcc cacgctgggc gtggccgtga accgcacgct
601 ggggcaccgg ctggagcgtg tggtgttcct gacgggcgca cggggccgcc gggccccacc
661 tggcatggca gtggtgacgc tgggcgagga gcgacccatt ggacacctgc acctggcgct
721 gcgccacctg ctggagcagc acggcgacga ctttgactgg ttcttcctgg tgcctgacac
781 cacctacacc gaggcgcacg gcctggcacg cctaactggc cacctcagtc tggcctccgc
841 cgcccacctg tacctgggcc ggccccagga cttcatcggc ggagagccca cccccggccg
901 ctactgccac ggaggctttg gggtgctgct gtcgcgcatg ctgctgcaac aactgcgccc
961 ccacctggaa ggctgccgca acgacatcgt cagtgcgcgc cctgacgagt ggctgggtcg
1021 ctgcattctc gatgccaccg gggtgggctg cactggtgac cacgaggggg tgcactatag
1081 ccatctggag ctgagccctg gggagccagt gcaggagggg gaccctcatt tccgaagtgc
1141 cctgacagcc caccctgtgc gtgaccctgt gcacatgtac cagctgcaca aagctttcgc
1201 ccgagctgaa ctggaacgca cgtaccagga gatccaggag ttacagtggg agatccagaa
1261 taccagccat ctggccgttg atggggaccg ggcagctgct tggcccgtgg gtattccagc
1321 accatcccgc ccggcctccc gctttgaggt gctgcgctgg gactacttca cggagcagca
1381 cgctttctcc tgcgccgatg gctcaccccg ctgcccactg cgtggggctg accgggctga
1441 tgtggccgat gttctgggga cagctctaga ggagctgaac cgccgctacc acccggcctt
1501 gcggctccag aagcagcagc tggtgaatgg ctaccgacgc tttgatccgg cccggggtat
1561 ggaatacacg ctggacttgc agctggaggc actgaccccc cagggaggcc gccggcccct
1621 cactcgccga gtgcagctgc tccggccgct gagccgcgtg gagatcttgc ctgtgcccta
1681 tgtcactgag gcctcacgtc tcactgtgct gctgcctcta gctgcggctg agcgtgacct
1741 ggcccctggc ttcttggagg cctttgccac tgcagcactg gagcctggtg atgctgcggc
1801 agccctgacc ctgctgctac tgtatgagcc gcgccaggcc cagcgcgtgg cccatgcaga
1861 tgtcttcgca cctgtcaagg cccacgtggc agagctggag cggcgtttcc ccggtgcccg
1921 ggtgccatgg ctcagtgtgc agacagccgc accctcacca ctgcgcctca tggatctact
1981 ctccaagaag cacccgctgg acacactgtt cctgctggcc gggccagaca cggtgctcac
2041 gcctgacttc ctgaaccgct gccgcatgca tgccatctcc ggctggcagg ccttctttcc
2101 catgcatttc caagccttcc acccagctgt ggccccacca caagggcctg ggcccccaga
2161 gctgggccgt gacactggcc gctttgatcg ccaggcagcc agcgaggcct gcttctacaa
2221 ctccgactac gtggcagccc gtgggcgcct ggcggcagcc tcagaacaag aagaggagct
2281 gctggagagc ctggatgtgt acgagctgtt cctccacttc tccagtctgc atgtgctgcg
2341 ggcggtggag ccggcgctgc tgcagcgcta ccgggcccag acgtgcagcg cgaggctcag
2401 tgaggacctg taccaccgct gcctccagag cgtgcttgag ggcctcggct cccgaaccca
2461 gctggccatg ctactctttg aacaggagca gggcaacagc acctgacccc accctgtccc
2521 cgtgggccgt ggcatggcca caccccaccc cacttctccc ccaaaaccag agccacctgc
2581 cagcctcgct gggcagggct ggccgtagcc agaccccaag ctggcccact ggtcccctct
2641 ctggctctgt gggtccctgg gctctggaca agcactgggg gacgtgcccc cagagccacc
2701 cacttctcat cccaaaccca gtttccctgc cccctgacgc tgctgattcg ggctgtggcc
2761 tccacgtatt tatgcagtac agtctgcctg acgccagccc tgcctctggg ccctgggggc
2821 tgggctgtag aagagttgtt ggggaaggag ggagctgagg agggggcatc tcccaacttc
2881 tcccttttgg accctgccga agctccctgc ctttaataaa ctggccaagt gtgaaaaaaa
2941 aaaaaaaaaa aaaaaa
//