LOCUS BC012452 1248 bp mRNA linear HUM 08-SEP-2006
DEFINITION Homo sapiens heparan-alpha-glucosaminide N-acetyltransferase, mRNA
(cDNA clone IMAGE:3880903), complete cds.
ACCESSION BC012452
VERSION BC012452.1
KEYWORDS .
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 1248)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 1248)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (15-AUG-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: DCTD/DTP/Gazdar
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Baylor College of Medicine Human Genome
Sequencing Center
Center code: BCM-HGSC
Web site: http://www.hgsc.bcm.tmc.edu/cdna/
Contact: amg@bcm.tmc.edu
Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
A.N., Gibbs, R.A.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 21 Row: n Column: 18
This clone was selected for full length sequencing because it
passed the following selection criteria: Hexamer frequency ORF
analysis, Similarity but not identity to protein
This clone has the following problem: The cds is short compared to
the longest cds in the locus.
FEATURES Location/Qualifiers
source 1..1248
/db_xref="H-InvDB:HIT000035821"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="IMAGE:3880903"
/tissue_type="Lung, large cell carcinoma"
/clone_lib="NIH_MGC_68"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..1248
/gene="HGSNAT"
/gene_synonym="FLJ22242"
/gene_synonym="FLJ32731"
/gene_synonym="HGNAT"
/db_xref="GeneID:138050"
/db_xref="HGNC:HGNC:26527"
CDS 201..821
/gene="HGSNAT"
/gene_synonym="FLJ22242"
/gene_synonym="FLJ32731"
/gene_synonym="HGNAT"
/codon_start=1
/product="HGSNAT protein"
/protein_id="AAH12452.1"
/db_xref="GeneID:138050"
/db_xref="HGNC:HGNC:26527"
/translation="MALGLCRCFHPRHSMAAFGLFPALPSALNSHPACTCLLDPSTWR
PAHVSGPALASSPQILSVFSLGFPGFVNGSCVSRYKPDIIFPPGLPPPDLPSSVSIFY
LQLLCSHGHCCITESGPLLSFSNWPPSLVPHFLKSPVHCHQIKLSPARSPLSEKPPLT
WKHHCLAHILTYSPSRLDPHTSFQPPLPLHSLLPPPPPHPLVSPPL"
BASE COUNT 251 a 401 c 220 g 376 t
ORIGIN
1 aatggaggca gcgttcctac ttgtcatcac acagctgaag acattgtttc ttaggtgtga
61 aatcggggac aaaggacaaa cagagacaca cggcattgtt catgggaggc atcgtcaccc
121 tcctgggtgt tctgtgggaa tttcctgtgt gaggaaaacg tggccacagg gttgtgctgt
181 acccaccctt ccccggcgag atggccctcg gcctgtgccg ctgcttccac cctcgccact
241 ccatggcagc ttttggtctg tttccggctc tgccctctgc cctgaactct catccggctt
301 gtacctgcct gctggacccc tccacctgga ggccagccca tgtctcaggc ccagccctag
361 cctcttctcc tcaaattcta agtgttttct ctttaggttt ccctggcttt gtgaatggat
421 catgtgtctc taggtataaa cctgacatca tctttccacc cggcttacct ccaccagatc
481 tccccagttc tgtctccatc ttctacctgc agctgctctg ttctcatggt cactgctgca
541 tcactgagtc tggacccttg ttatcatttt caaactggcc tccttccctc gttccccact
601 tcttaaagtc acctgtccat tgccaccaga ttaagctttc tccagccaga tcacctctct
661 ctgagaaacc tccattgaca tggaaacacc attgtctggc acacatactc acatactcac
721 cttcccgtct tgatccccac acatctttcc agcctcccct cccactccac tccctgctcc
781 ctcctccacc tccccatcct cttgtctccc ctcccctctg aatccagccc agcggggctt
841 ctcctgcctc catcacatca cagaagtacc tcctgcttct ggttttaatt agagccttcc
901 ccgattacat tttcctctga attttttcct atctacattt gatctgtcat gtttaaaccc
961 cctacttcta agggaacttc tctaatctct tatcctcatc cccaaatagt gttttcttcc
1021 tctgggttct tataatgttg gtatcaatct cacagcattt agtgcttcct gcctggtgtg
1081 acagttacct gtgtgcatgt gcaatttcta atttcccacg ctagactgtg agcttcctaa
1141 ggcaagaatc atgccttgtt ggtttctgta ttcctcatgg tgccaaacac agtgccttct
1201 acattgcagg cgctgaataa acatttttaa agcaaaaaaa aaaaaaaa
//