LOCUS CCP44416.1 1011 aa PRT BCT 27-FEB-2015 DEFINITION Mycobacterium tuberculosis H37Rv PE-PGRS family protein PE_PGRS30 protein. ACCESSION AL123456-1694 PROTEIN_ID CCP44416.1 SOURCE Mycobacterium tuberculosis H37Rv ORGANISM Mycobacterium tuberculosis H37Rv Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex. REFERENCE 1 AUTHORS Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C., Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III., Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T., Connor R., Davies R., Devlin K., Feltwell T., Gentles S., Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J., Moule S., Murphy L., Oliver K., Osborne J., Quail M.A., Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J., Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S., Barrell B.G. TITLE Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence JOURNAL Nature 393(6685), 537-544(1998). PUBMED 9634230 REMARK Erratum:[Nature 1998 Nov 12;396(6707):190] REFERENCE 2 AUTHORS Camus J.C., Pryor M.J., Medigue C., Cole S.T. TITLE Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv JOURNAL Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002). PUBMED 12368430 REFERENCE 3 AUTHORS Lew J.M., Kapopoulou A., Jones L.M., Cole S.T. TITLE TubercuList--10 years after JOURNAL Tuberculosis (Edinb) 91(1), 1-7(2011). PUBMED 20980199 REFERENCE 4 (bases 1 to 4411529) AUTHORS Parkhill J. JOURNAL Submitted (11-JUN-1998) to the INSDC. Submitted on behalf of the Mycobacterium tuberculosis sequencing and mapping teams, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France E-mail: parkhill@sanger.ac.uk REFERENCE 5 (bases 1 to 4411532) AUTHORS Lew J.M. JOURNAL Submitted (18-DEC-2012) to the INSDC. Lew J., Ecole Polytechnique Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the Swiss Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva 4, SWITZERLAND COMMENT On or before Feb 1, 2013 this sequence version replaced gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250, gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756, gi:38490319, gi:41352785, gi:38490370, gi:41353971. Note: This annotation is from the TubercuList website, Release 26, Dec 2012 (URL: http://tuberculist.epfl.ch) (email: tuberculist@epfl.ch). FEATURES Qualifiers source /organism="Mycobacterium tuberculosis H37Rv" /strain="H37Rv" /mol_type="genomic DNA" /db_xref="taxon:83332" protein /transl_table=11 /gene="PE_PGRS30" /locus_tag="Rv1651c" /note="Rv1651c, (MTCY06H11.16c), len: 1011 aa. PE_PGRS30,Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citations below),similar to many e.g. Q10637|Y03A_MYCTU hypothetical glycine-rich 49.6 kd protein (603 aa), FASTA scores: opt: 1757, E(): 0, (50.8% identity in 714aa overlap). The transcription of this CDS seems to be activated in macrophages (see Ramakrishnan et al., 2000)." /db_xref="EnsemblGenomes-Gn:Rv1651c" /db_xref="EnsemblGenomes-Tr:CCP44416" /db_xref="GOA:Q79FL8" /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/Swiss-Prot:Q79FL8" /experiment="EXISTENCE: identified in proteomics study" BEGIN 1 MSFLLVEPDL VTAAAANLAG IRSALSEAAA AASTPTTALA SAGADEVSAA VSRLFGAYGQ 61 QFQALNARAA TFHAEFVSLL NGGAAAYTGA EAASVSSMQA LLDAVNAPTQ TLLGRPLIGN 121 GADGVAGTGS NAGGNGGPGG ILYGNGGNGG AGGNGGAAGL IGNGGAGGAG GAGGAGGAGG 181 AGGTGGLLYG NGGAGGNGGS AAAAGGAGGN ALLFGNGGNG GSGASGGAAG HAGTIFGNGG 241 NAGAGSGLAG ADGGLFGNGG DGGSSTSKAG GAGGNALFGN GGDGGSSTVA AGGAGGNTLV 301 GNGGAGGAGG TSGLTGSGVA GGAGGSVGLW GSGGAGGDGG AATSLLGVGM NAGAGGAGGN 361 AGLLYGNGGA GGAGGNGGDT TVPLFDSGVG GAGGAGGNAS LFGNGGTGGV GGKGGTSSDL 421 ASATSGAGGA GGAGGVGGLL YGNGGNGGAG GIGGAAINIL ANAGAGGAGG AAGSSFIGNG 481 GNGGAGGAGG AAALFSSGVG GAGGSGGTAL LLGSGGAGGN GGTGGANSGS LFASPGGTGG 541 AGGHGGAGGL IWGNGGAGGN GGNGGTTADG ALEGGTGGIG GTGGSAIAFG NGGQGGAGGT 601 GGDHSGGNGI GGKGGASGNG GNAGQVFGDG GTGGTGGAGG AGSGTKAGGT GSDGGHGGNA 661 TLIGNGGDGG AGGAGGAGSP AGAPGNGGTG GTGGVLFGQS GSSGPPGAAA LAFPSLSSSV 721 PILGPYEDLI ANTVANLASI GNTWLADPAP FLQQYLANQF GYGQLTLTAL TDATRDFAIG 781 LAGIPPSLQS ALQALAAGDV SGAVTDVLGA VVKVFVSGVD ASDLSNILLL GPVGDLFPIL 841 SIPGAMSQNF TNVVMTVTDT TIAFSIDTTN LTGVMTFGLP LAMTLNAVGS PITTAIAFAE 901 STTAFVSAVQ AGNLQAAAAA LVGAPANVAN GFLNGEARLP LALPTSATGG IPVTVEVPVG 961 GILAPLQPFQ ATAVIPVIGP VTVTLEGTPA GGIVPALVNY APTQLAQAIA P //