LOCUS BC005159 2393 bp mRNA linear HUM 12-NOV-2003
DEFINITION Homo sapiens collagen, type VI, alpha 1, mRNA (cDNA clone
IMAGE:3506644), partial cds.
ACCESSION BC005159
VERSION BC005159.2
KEYWORDS .
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2393)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2393)
AUTHORS Strausberg,R.
TITLE Direct Submission
JOURNAL Submitted (26-MAR-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Cancer Genomics Office, National Cancer
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Nov 6, 2003 this sequence version replaced BC005159.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: DCTD/DTP
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Institute for Systems Biology
http://www.systemsbiology.org
contact: amadan@systemsbiology.org
Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 11 Row: g Column: 7.
FEATURES Location/Qualifiers
source 1..2393
/db_xref="H-InvDB:HIT000086471"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="IMAGE:3506644"
/tissue_type="Kidney, renal cell adenocarcinoma"
/clone_lib="NIH_MGC_14"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene <1..2393
/gene="COL6A1"
/db_xref="GeneID:1291"
/db_xref="MIM:120220"
CDS <1..1339
/gene="COL6A1"
/codon_start=2
/product="COL6A1 protein"
/protein_id="AAH05159.2"
/db_xref="GeneID:1291"
/db_xref="MIM:120220"
/translation="GHQGPPGPDECEILDIIMKMCSCCECKCGPIDLLFVLDSSESIG
LQNFEIAKDFVVKVIDRLSRDELVKFEPGQSYAGVVQYSHSQMQEHVSLRSPSIRNVQ
ELKEAIKSLQWMAGGTFTGEALQYTRDQLLPPSPNNRIALVITDGRSDTQRDTTPLNV
LCSPGIQVVSVGIKDVFDFIPGSDQLNVISCQGLAPSQGRPGLSLVKENYAELLEDAF
LKNVTAQICIDKKCPDYTCPITFSSPADITILLDGSASVGSHNFDTTKRFAKRLAERF
LTAGRTDPAHDVRVAVVQYSGTGQQRPERASLQFLQNYTALASAVDAMDFINDATDVN
DALGYVTRFYREASSGAAKKRLLLFSDGNSQGATPAAIEKAVQEAQRAGIEIFVVVVG
RQVNEPHIRVLVTGKTAEYDVAYGESHLFRVPSYQALLRGVFHQTVSRKVALG"
misc_feature 95..598
/gene="COL6A1"
/note="vwa; Region: von Willebrand factor type A domain"
/db_xref="CDD:pfam00092"
misc_feature 731..1252
/gene="COL6A1"
/note="VWA; Region: von Willebrand factor (vWF) type A
domain"
/db_xref="CDD:smart00327"
BASE COUNT 470 a 805 c 649 g 469 t
ORIGIN
1 aggacaccaa ggaccgcctg ggccggacga atgcgagatt ttggacatca tcatgaaaat
61 gtgctcttgc tgtgaatgca agtgcggccc catcgacctc ctgttcgtgc tggacagctc
121 agagagcatt ggcctgcaga acttcgagat tgccaaggac ttcgtcgtca aggtcatcga
181 ccggctgagc cgggacgagc tggtcaagtt cgagccaggg cagtcgtacg cgggtgtggt
241 gcagtacagc cacagccaga tgcaggagca cgtgagcctg cgcagcccca gcatccggaa
301 cgtgcaggag ctcaaggaag ccatcaagag cctgcagtgg atggcgggcg gcaccttcac
361 gggggaggcc ctgcagtaca cgcgggacca gctgctgccg cccagcccga acaaccgcat
421 cgccctggtc atcactgacg ggcgctcaga cactcagagg gacaccacac cgctcaacgt
481 gctctgcagc cccggcatcc aggtggtctc cgtgggcatc aaagacgtgt ttgacttcat
541 cccaggctca gaccagctca atgtcatttc ttgccaaggc ctggcaccat cccagggccg
601 gcccggcctc tcgctggtca aggagaacta tgcagagctg ctggaggatg ccttcctgaa
661 gaatgtcacc gcccagatct gcatagacaa gaagtgtcca gattacacct gccccatcac
721 gttctcctcc ccggctgaca tcaccatcct gctggacggc tccgccagcg tgggcagcca
781 caactttgac accaccaagc gcttcgccaa gcgcctggcc gagcgcttcc tcacagcggg
841 caggacggac cccgcccacg acgtgcgggt ggcggtggtg cagtacagcg gcacgggcca
901 gcagcgccca gagcgggcgt cgctgcagtt cctgcagaac tacacggccc tggccagtgc
961 cgtcgatgcc atggacttta tcaacgacgc caccgacgtc aacgatgccc tgggctatgt
1021 gacccgcttc taccgcgagg cctcgtctgg cgctgccaag aagaggctgc tgctcttctc
1081 agatggcaac tcgcagggcg ccacgcccgc tgccatcgag aaggccgtgc aggaagccca
1141 gcgggcaggc atcgagatct tcgtggtggt cgtgggccgc caggtgaatg agccccacat
1201 ccgcgtcctg gtcaccggca agacggccga gtacgacgtg gcctacggcg agagccacct
1261 gttccgtgtc cccagctacc aggccctgct ccgcggtgtc ttccaccaga cagtctccag
1321 gaaggtggcg ctgggctagc ccaccctgca cgccggcacc aaaccctgtc ctcccacccc
1381 tccccactca tcactaaaca gagtaaaatg tgatgcgaat tttcccgacc aacctgattc
1441 gctagatttt ttttaaggaa aagcttggaa agccaggaca caacgctgct gcctgctttg
1501 tgcagggtcc tccggggctc agccctgagt tggcatcacc tgcgcagggc cctctggggc
1561 tcagccctga gctagtgtca cctgcacagg gccctctggg gctcagccct gagctggcgt
1621 cacctgtgca gggccctctg gggctcagcc ctgagctggc ctcacctggg ttccccaccc
1681 cgggctctcc tgccctgccc tcctgcccgc cctccctcct gcctgcgcag ctccttccct
1741 aggcacctct gtgctgcgtc ccaccagcct gagcaagacg ccctctcggg gcctgtgccg
1801 cactagcctc cctctcctct gtccccatag ctggtttttc ccaccaatcc tcacctaaca
1861 gttactttac aattaaactc aaagcaagct cttctcctca gcttggggca gccattggcc
1921 tctgtctcgt tttgggaaac caaggtcagg aggccgttgc agacataaat ctcggcgact
1981 cggccccgtc tcctgagggt cctgctggtg accggcctgg accttggccc tacagccctg
2041 gaggccgctg ctgaccagca ctgaccccga cctcagagag tactcgcagg ggcgctggct
2101 gcactcaaga ccctcgagat taacggtgct aaccccgtct gctcctccct cccgcagaga
2161 ctggggcctg gactggacat gagagcccct tggtgccaca gagggctgtg tcttactaga
2221 aacaacgcaa acctctcctt cctcagaata gtgatgtgtt cgacgtttta tcaaaggccc
2281 cctttctatg ttcatgttag ttttgctcct tctgtgtttt tttctgaacc atatccatgt
2341 tgctgacttt tccaaataaa ggttttcact cctcaaaaaa aaaaaaaaaa aaa
//