LOCUS BC056687 1387 bp mRNA linear HUM 05-NOV-2003 DEFINITION Homo sapiens serologically defined colon cancer antigen 1, mRNA (cDNA clone IMAGE:6737450), partial cds. ACCESSION BC056687 VERSION BC056687.1 KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1387) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 1387) AUTHORS Strausberg,R. TITLE Direct Submission JOURNAL Submitted (25-AUG-2003) National Institutes of Health, Mammalian Gene Collection (MGC), Cancer Genomics Office, National Cancer Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: NCI cDNA Library Preparation: Michael Brownstein / Ted Usdin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 48 Row: n Column: 5 This clone was selected for full length sequencing because it passed the following selection criteria: Hexamer frequency ORF analysis, GenomeScan gene prediction, Similarity but not identity to protein. FEATURES Location/Qualifiers source 1..1387 /db_xref="H-InvDB:HIT000259583" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="IMAGE:6737450" /tissue_type="Pooled, 40 cell lines" /clone_lib="NIH_MGC_127" /lab_host="DH10B" /note="Vector: pDNR-LIB" gene 1..>1387 /gene="SDCCAG1" /gene_synonym="FLJ10051" /gene_synonym="NY-CO-1" /db_xref="GeneID:9147" CDS 109..>1387 /gene="SDCCAG1" /gene_synonym="FLJ10051" /gene_synonym="NY-CO-1" /codon_start=1 /product="SDCCAG1 protein" /protein_id="AAH56687.1" /db_xref="GeneID:9147" /translation="MRVNNVYDVDNKTYLIRLQKPDFKATLLLESGIRIHTTEFEWPK NMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHLIIELYDRGNIVLT DYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAEPLLTLERLTEIVASAPKGELL KRVLNPLLPYGPALIEHCLLENGFSGNVKVDEKLETKDIEKVLVSLQKAEDYMKTTSN FSGKGYIIQKREIKPCLEADKPVEDILTYEEFHPFLFSQHSQCPYIEFESFDKAVDEF YSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQI VDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYL LSEEEDDDVDGDVNVEKNETEPSKKKKKKKKKNM" misc_feature 112..1119 /gene="SDCCAG1" /gene_synonym="FLJ10051" /gene_synonym="NY-CO-1" /note="FbpA; Region: Fibronectin-binding protein A N-terminus (FbpA). This family consists of the N-terminal region of the prokaryotic fibronectin-binding protein. Fibronectin binding is considered to be an important virulence factor in streptococcal infections. Fibronectin is a dimeric glycoprotein that is present in a soluble form in plasma and extracellular fluids" /db_xref="CDD:pfam05833" BASE COUNT 492 a 230 c 303 g 362 t ORIGIN 1 ggggagagga aattgcggta gtgaccctcg ggcctcgcca tgaagagccg ctttagcacc 61 attgacctcc gcgccgtact cgcggagctg aatgctagct tgctaggaat gagagtaaac 121 aatgtttatg atgtggataa taagacatac cttattcgtc ttcaaaaacc ggactttaaa 181 gctacacttt tacttgaatc tggcatacga attcatacaa cagaatttga gtggcctaag 241 aatatgatgc cgtctagttt tgccatgaag tgccgaaaac atttgaagag tcggagatta 301 gtcagtgcaa aacagcttgg tgtggataga attgtagatt ttcaatttgg aagtgatgaa 361 gctgcttacc atttaatcat tgagctctat gataggggga acattgttct tacagattat 421 gagtacgtaa ttttaaatat tctaaggttt cgaactgatg aggcagatga tgttaaattt 481 gctgttcgtg aacgctatcc acttgatcat gctagagctg ctgaaccttt gcttactttg 541 gaaaggttga ctgaaatagt agccagcgca cctaagggtg aactactgaa gagggtgctt 601 aacccattac ttccctatgg accagctctc attgaacact gtcttttaga aaatggattc 661 tcgggtaatg tcaaagtgga tgaaaaactt gaaactaaag atattgaaaa agtacttgtt 721 tctctgcaga aagcagaaga ctatatgaaa acaacatcca acttcagtgg gaagggatat 781 atcattcaga aaagagaaat aaaaccatgc ttggaagcag ataaaccagt tgaagacata 841 ctgacgtatg aggaatttca tcctttcttg ttttctcaac attcacaatg tccatatata 901 gaatttgaat catttgacaa ggcggtggat gaattttatt ccaagataga aggccagaaa 961 attgacttaa aagctttaca acaggaaaag caagcattga agaaattaga taatgttcga 1021 aaggatcacg aaaacagatt ggaagctctt cagcaggctc aggaaataga caaactgaaa 1081 ggagagctca tagaaatgaa cctacaaata gttgacagag ccattcaggt agttcgaagt 1141 gctttagcta accagataga ttggacagaa attgggttaa ttgtgaaaga agcccaggct 1201 caaggagacc ctgttgcaag tgcaatcaaa gaattaaaac tacaaacaaa ccatgttaca 1261 atgctgctaa gaaatccata cttgttatca gaggaggaag atgatgatgt tgatggtgac 1321 gtcaatgttg agaaaaatga aactgaacca tcaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1381 aacatgt //