LOCUS AC137054 4001 bp DNA linear HUM 01-FEB-2003 DEFINITION Homo sapiens 12 BAC CTD-2024F21 (Cal Tech Human BAC Library D) complete sequence. ACCESSION AC137054 VERSION AC137054.2 KEYWORDS HTG. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4001) AUTHORS Muzny,D.M., Adams,C., Adio-Oduola,B., Ali-osman,F.R., Allen,C., Alsbrooks,S.L., Amaratunge,H.C., Are,J.R., Ayele,M., Banks,T., Barbaria,J., Benton,J., Bimage,K., Blankenburg,K., Bonnin,D., Bouck,J., Bowie,S., Brieva,M., Brown,E., Brown,M., Bryant,N.P., Buhay,C., Burch,P., Burkett,C., Burrell,K.L., Byrd,N.C., Carron,T.F., Carter,M., Cavazos,S.R., Chacko,J., Chavez,D., Chen,G., Chen,R., Chen,Z., Chiu,D., Chowdhry,I., Christopoulos,C., Cleveland,C.D., Cox,C., Coyle,M.D., Dathorne,S.R., David,R., Davila,M.L., Davis,C., Davy-Carroll,L., Dederich,D.A., Delaney,K.R., Delgado,O., Denn,A.L., Ding,Y., Dinh,H.H., Douthwaite,K.J., Draper,H., Dugan-Rocha,S., Durbin,K.J., Earnhart,C., Edgar,D., Edwards,C.C., Elhaj,C., Emerling,S., Escotto,M., Falls,T., Ferraguto,D., Flagg,N., Ford,J., Foster,P., Frantz,P., Gabisi,A., Gao,J., Garcia,A., Garner,T., Garza,N., Gill,R., Gorrell,J.H., Guevara,W., Gunaratne,P., Hale,S., Hamilton,K., Han,J., Harris,C., Harris,K., Hart,M., Havlak,P., Hawes,A., Hernandez,J., Hernandez,O., Hodgson,A., Hogues,M., Holloway,C., Hollins,B., Homsi,F., Howard,S., Huber,J., Hulyk,S., Hume,J., Ioshikhes,I., Jackson,L.E., Jacobson,B., Jia,Y., Johnson,R., Jolivet,S., Joudah,S., Karlsson,E., Kelly,S., Khan,U., King,L., Korvah,J., Kovar,C., Kratovic,J., Kureshi,A., Landry,N., Leal,B., Lee,E., Lewis,L.C., Lewis,L., Li,J., Li,Z., Lichtarge,O., Lieu,C., Liu,J., Liu,W., Loulseged,H., Lozado,R.J., Lu,X., Lucier,A., Lucier,R., Luna,R., Ma,J., Maheshwari,M., Mapua,P., Marondel,I., Martin,R., Martindale,A., Martinez,E., Massey,E., Mawhiney,E., McLeod,M.P., Meador,M., Mei,G., Merscher,S., Metzker,M., Miller,A., Miner,G., Miner,Z., Mitchell,T., Mohabbat,K., Montgomery,K.T., Morgan,M., Morris,S., Moser,M., Neal,D., Nelson,D., Newtson,J., Newtson,N., Nguyen,A., Nguyen,N., Nguyen,N., Nickerson,E., Nwokenkwo,S., Oguh,M., Okwuonu,G., Oragunye,N., Oviedo,R., Pace,A., Payton,B., Peery,J., Perez,L., Peters,L., Pickens,R., Primus,E., Pu,L.L., Quiles,M., Ren,Y., Rives,M., Rojas,A., Rojubokan,I., Rolfe,M., Ruiz,S., Savery,G., Scherer,S., Scott,G., Shen,H., Shim,C., Shooshtari,N., Sisson,I., Sodergren,E., Sonaike,T., Sparks,A., Stanley,H., Stone,H., Sutton,A., Svatek,A., Tabor,P., Tamerisa,A., Tamerisa,K., Tang,H., Tansey,J., Taylor,C., Taylor,T., Telfrod,B., Thomas,N., Thomas,S., Usmani,K., Vasquez,L., Vera,V., Villalon,D., Vinson,R., Wang,Q., Wang,S., Ward-Moore,S., Warren,R., Washington,C., Watlington,S., Williams,G., Williamson,A., Wleczyk,R., Wooden,S., Worley,K., Wu,C., Wu,Y., Wu,Y.F., Zhou,J., Zorrilla,S., Kucherlapati,R., Weinstock,G. and Gibbs,R. TITLE Direct Submission JOURNAL Unpublished REFERENCE 2 (bases 1 to 4001) AUTHORS Worley,K.C. TITLE Direct Submission JOURNAL Submitted (15-NOV-2002) Human Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA REFERENCE 3 (bases 1 to 4001) AUTHORS Worley,K.C. TITLE Direct Submission JOURNAL Submitted (01-FEB-2003) Human Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA COMMENT On Feb 1, 2003 this sequence version replaced AC137054.1. INFORMATION: http://www.hgsc.bcm.tmc.edu/ or email gc-help@bcm.tmc.edu CLONE LENGTH: This sequence does not necessarily represent the entire insert of this clone. Overlapping regions of clones are only sequenced and submitted once, so the sequence for the remainder of the insert may be found in the record for the adjacent clones. Overlapping clones are noted at the beginning and end of the Features listing. ANNOTATION OF FEATURES: STSs are identified using ePCR (Genome Res. 7:541-550) searches of a local database that includes entries from dbSTS, GDB, and local mapping efforts. Repeats are identified using RepeatMasker (A. Smit and P. Green, unpublished.) for Human and Mouse sequences. Genes and Region of sequence similarity are identified by BLAST (Nuc. Acids Res. 25:3389-3402) similarity (expect < 1e-34) to the EST and cDNA sequences. Genes demonstrate at least two exons flanked by consensus splice sites that maintained sequence continuity across the splice junctions. Sequences that are not identical matches are annotated as similar. SEQUENCING READ COVERAGE:Sequencing is completed to a minimum standard of double strand coverage with a minimum of 2 clones and 2 reads with no ambiguities or 2 chemistries with a minimum of 2 clones and 3 reads with no ambiguities. If the sequence quality for a region does not meet this standard, it will be indicated in the annotation as Low Coverage. QUALITY OF INDIVIDUAL BASES:This sequence meets stringent quality standards - estimated error rate less than 1 per 10,000 bases. Reports of lowest quality individual bases and measures of base quality are listed below. Description of the metrics can be found at URL: http://www.hgsc.bcm.tmc.edu:8088/quality.info/genbank.annotation.ht ml. FEATURES Location/Qualifiers source 1..4001 /organism="Homo sapiens" /mol_type="genomic DNA" /db_xref="taxon:9606" /chromosome="12" /clone="CTD-2024F21" misc_feature 1..2115 /note="overlaps bases 120072..122186 of clone AC004466" /function="clone overlap" repeat_region complement(116..307) /rpt_family="L2" repeat_region 429..566 /rpt_family="MIR" repeat_region 1022..1179 /rpt_family="MER3" repeat_region 1199..1277 /rpt_family="MER3" repeat_region 1294..1393 /rpt_family="MIR" repeat_region 1812..2131 /rpt_family="MLT1J1" misc_feature complement(1903..4001) /note="overlaps bases 134779..136877 of clone AC004241" /function="clone overlap" repeat_region 2515..2786 /rpt_family="L2" repeat_region 2787..3100 /rpt_family="AluSx" repeat_region 3101..3487 /rpt_family="L2" repeat_region complement(3488..3600) /rpt_family="MER81" repeat_region 3601..3689 /rpt_family="L2" BASE COUNT 1101 a 962 c 870 g 1068 t ORIGIN 1 gctgtgagct cagcagcagc cctcctttag gagctgccgg tggagagtga gtgctggtcc 61 tgtggaggag ggagcaagcc ccgtggtccg gagtgcattt ccatggatgg cttcagagct 121 ggccaggatg gacagtactc caggcagtgg gaaccgcacg tgtgatggcg cagagggaag 181 aaataaagcg gccgctttgg gaaactgtaa gtcgtttgtg gctggaatgt caagtttgaa 241 ggtagagtag cgggtttgga gattagaaag gttctgaaca taaagggcct tgcatgctct 301 gttaagaagg ctgtccccct ccccttgggg ctgggggaga aaactcacaa ttctgtgttt 361 tagaactatg gccagagtaa tagagtgaag gcgaactgcc tgtgaaacac tagaagcaga 421 gagatgagtt ggacaaattt caccccacag tgctttaatt accaggtctt taaaatggag 481 actgcagtaa cacctacttc aaagtgttgt gatgaggagt gcctggaaga gtgcctagca 541 catggtagat actcaataaa tgtcaggaag tagaattagt agcagcagaa ggctgccatg 601 gcaagagagg atgaggggct tcagagctgg tccaggcaga agcagagaga atggaagaga 661 cgaaactgct tcaagagcta tttcagctat ctgaacctaa aggtcagggg agaattcatt 721 agctgagcag acagaaggag gagagcaaaa atattatgag tggacatatt aggagtatgg 781 gagagcagtg agcaagcttg ctgtgctgga aagtgagatt gcgcaagaaa agaaagagaa 841 ataattatgt aacagtagta ctagaccagg ttgtacaagg ctaaggccag gcttaaacat 901 tttttaaatt gtggaaacaa tgaagagcta ttgcagagca ttagactcag gtggggtcag 961 aggcctagct tcaccatttg ctgtgaccct gggcaagtgc ccctaactca cagatgtcca 1021 atccaattga ctttctgcct ggaagaaaat attccatatc tgcaccctcc ataatggtgg 1081 ccactaatca caggtggcta ttgaatactt gatatgtgac tagtgtgact gaagaactga 1141 atttttaatt gtatttaatt taaattaatt taaatttaat gtatttaagt ttagtgtatt 1201 taatttaaat taatttaaat ttaatttaat taatttaagt agctgcacat gactagtggc 1261 tactgtgtta gcacagctag acccaggact cctgtccctc catctgtaca cagggaatga 1321 tgatgaaaca tcacaggctt gttacaaaga tcgagatata ttgagataat acactcaaag 1381 tgctcaacac agtaattcaa caaattattg ctgctgctgt tgaaattgtt attgttttta 1441 ttgaacaggg attgcatgac atacgccaag tcttaggaag attagttaga ctataatatc 1501 cagttagatt tgatggggga aaattgtaga ggataaagca ttcacaaggt tatttcagtg 1561 gtaaggtgtg agagaattaa gatcttatcc agtgaaagac cttgagaatg ggaaagaatg 1621 gaatgattgt tgagccataa agcacatggg tgtgcaccac tcatacacat cttctcatat 1681 cagcttcctt ccaaggtatt ctcagagagt acactcccaa cccagcccag gacagacact 1741 actacgaccc ctacaagatg cacagccatt ctccctgcct gcgccagaaa ctactagtgc 1801 tccacaacac acaccaacat ttgtgtgtct ctttctgggc acagtacctc ccaaatttga 1861 actacacttc ccagcttcct tgcagtcaaa cggatgccat gggatcaggt tctgaacaat 1921 ggaatgaagg cagaagcaat gtgcgccatt tctaggctgg gctcatttaa aaatcttcca 1981 tacaacctgc attccctctt cccattctgt gacaatttta gaggccatat gtaccacata 2041 atggaaagaa cctaggcttg aatgaatgga tggagcagag ctacccctgt cccctagacc 2101 ctcactggac tatagaatat gagtgagaag tgtcagtggc tttgttacag ctcagttgct 2161 tgttacagct aaccttatta tcctaataca tcatccctca ttccactgaa gttcatctaa 2221 acaggtccta tatagattac ccgtctaaac catcaatcct tctgaagtct cttttgaact 2281 gggtcccacc aacactaaga cagatttcta attccccatc tgcagacatc ctgactgctg 2341 acattccttt ctcactcccc gcagatatac acattccaca catatcttct cccacaatgc 2401 tgggtcaagg atacacctac tgagctatct tccagatgat tgagtttatt ggacatgtca 2461 gggcttcctc ctctcctgtt aatccccatg cagagagacc tctgatgtcc ctggggcatg 2521 tctttgttta ctgtggctca gacccaaacc taggtgttat ctctgagtcc tccctttcct 2581 ctgctcctac aacaaattca ttacaaaata cagttggatt tgccctgtcc tctctctcca 2641 ttccctctgc caccctagtc cagccaccat catcaccacc tacctacctg cagcagcctc 2701 ctaactggtc ctcctctgtt cgtctttccc cctctaatcc tctctccacc aagcagctgg 2761 agcaatcttt taaaaataat agccatggcc gggtgcagtg gttcacggct gtaatcccag 2821 cactttggga ggctgaggtg ggtggatcac ttgaggtcag gagttcgagg ccagcctggc 2881 caacatggca aaacccatct ctactaaaaa tacaaaaatt aaccaggcgt ggtggcacgt 2941 gcctgtaata ccagctactt gggaggctga ggcaggagaa tcacttgaac ctgggaggtg 3001 gaggttgcag tgagccaaga tcgcaccact caactccagc ctgggcaaca gagtgagact 3061 ccgtctcaag aaaaaaaaaa tttaaataaa taaataaaaa taacaaccag aacatggtac 3121 tcccctgctt caatcactcc agtagcttcc atagctctta gagtaaattg attggggtct 3181 tgccagctcc aaccagcctt actcaccctt gctcatttta cttcagacac attcgccttt 3241 ctgtttctca aacgtgccac acttgttcct gtcccaggtt ttttgtgctt tgtgcccgga 3301 aagctcttct ctgagtgttt ataagattga tccctcatca ttcaagtctc agctcaaatg 3361 ttacctcctc agaggattcc tcgctctgtt ttatttcctc cacagcactt atcatgacaa 3421 aaaattgtat ttttactttc cacccctact cctgctgtcc ctattggaat ataatctcca 3481 ggaaggcagg gtaactgact cgtcctgatt cattggggac tcttccagtt ttaccacaaa 3541 aagttccacg tcccagaaaa cccctcaata tagggcaaac tatgatagtt attcacttta 3601 tgtctgcctt gatcacttct gtattcccag tggctagtat aacgccaggt gcctaggtag 3661 gaactcaata aatattcaat acatatgaaa agggctgaat gtccataaat aattctttga 3721 ccctggctct agggggcagc cttccccaac cttgtgcctg aacagacagc tttgcaggtt 3781 ggactggtag cttgtgtacc taggtagtga gttccacagg attctcccta gaccttatgt 3841 ggtccccttg ccgcccctcc acccccccac cccagccctt gtggagttca cagctgagga 3901 ccctggctgt ggcaacccga ggctgtgcct gacacagagg tagattcctt ccctctcccc 3961 attctacccc acccaggccc ccctcctgca gcttcgcctt c //