For questions or suggestions e-mail us at: ioerger@cs.tamu.edu

M. smegmatis MC2 155 MSMEG_5031 (-)

annotation: uracil-DNA glycosylase superfamily protein
coordinates: 5126753 - 5127625
length: 290

GPIFGHSGRVSPQLLPHPRTGQLFPSPVPPGTGWPGDPATPSTPVAKTPAQVRRLAASAKTLDDLDAMIS
VCRACPRLVAWREEVAVTKRKSFADQPYWGRPATGFGPEAPRILIAGLAPAAQGANRTGRVFTGDRSGDF
LFAALHRAGLANQAVCVDAADGMRLIDTRMAAAVRCAPPGNAPEPAERATCAPWLAAEWRLVGPSVAVIV
ALGGFAWRAALELIADRPKPAPKFGHGATATLTTAYGDVTLLGCYHPSQQNTFTGRLTPDMLDDIFELAK
QIGNERDGQ*
Operon Prediction Model: Genebank

Paralogs
speciesidgenee-valueidentity (len)annotation
M. smegmatis MC2 155MSMEG_5031--100% (290)uracil-DNA glycosylase superfamily protein

Closest Orthologs (e-value cutoff: 1e-4)
speciesidgenee-valueidentity (len)annotation
M. bovis AF2122 / 97Mb1289-1e-10466.30% (270) hypothetical protein Mb1289
M. gilvum PYR-GCKMflv_2236-1e-11270.96% (272) uracil-DNA glycosylase superfamily protein
M. tuberculosis H37RvRv1259-1e-10466.30% (270) hypothetical protein Rv1259
M. leprae Br4923MLBr_01105-2e-8364.91% (228) hypothetical protein MLBr_01105
M. abscessus ATCC 19977-----
M. marinum MMMAR_4181-1e-10467.53% (271) hypothetical protein MMAR_4181
M. avium 104MAV_1407-2e-9965.17% (267) uracil-DNA glycosylase superfamily protein
M. thermoresistible (build 8)TH_2407-1e-12275.38% (264) CONSERVED HYPOTHETICAL PROTEIN
M. ulcerans Agy99MUL_4483-1e-10467.53% (271) hypothetical protein MUL_4483
M. vanbaalenii PYR-1Mvan_4459-1e-11170.50% (278) uracil-DNA glycosylase superfamily protein

CLUSTAL 2.0.9 multiple sequence alignment


Mb1289|M.bovis_AF2122/97            MNIAAESSAKPVWGPPNFCAAAARMQDVRVLMHPKTGRAFRSPVEPGSGW
Rv1259|M.tuberculosis_H37Rv         MNIAAESSAKPVWGPPNFCAAAARMQDVRVLMHPKTGRAFRSPVEPGSGW
MLBr_01105|M.leprae_Br4923          --------------------------------------------------
MAV_1407|M.avium_104                ------------------------MQDVGVLEHPRTGQAFDSPVPAGSGW
MMAR_4181|M.marinum_M               ------------------------MQDDRVLTHPRTGQPFGSPVRPGTGW
MUL_4483|M.ulcerans_Agy99           ------------------------MQDDRVLTHPRTGQPFGSPVRPGTGW
Mflv_2236|M.gilvum_PYR-GCK          ----------------------MMTGPRARLPHPRTGDLFETPVRPGRGW
Mvan_4459|M.vanbaalenii_PYR-1       ----------------------MMRPSHRQLPHPRTGDLFTSPVRPGSGW
TH_2407|M.thermoresistible__bu      ----------------------------VTHPHPRTGAVFESPVPPGSGW
MSMEG_5031|M.smegmatis_MC2_155      ---------------MGPIFGHSGRVSPQLLPHPRTGQLFPSPVPPGTGW
                                                                                      

Mb1289|M.bovis_AF2122/97            PGDPATPQTPVAADAAQVSALAGGAGSICELNALISVCRACPRLVSWREE
Rv1259|M.tuberculosis_H37Rv         PGDPATPQTPVAADAAQVSALAGGAGSICELNALISVCRACPRLVSWREE
MLBr_01105|M.leprae_Br4923          -------------------MLSGGAGSIPELNAQISVCRACPRLVDWREE
MAV_1407|M.avium_104                PGDPATPQTPVAADADQVIALARHAGAIPELDALVSVCRACPRLVEWREE
MMAR_4181|M.marinum_M               PGDPASSQTPVAAEAARVAELAGRAGSIVELDAAISVCRACPRLVTWREE
MUL_4483|M.ulcerans_Agy99           PGDPASSQTPVAAEAARVAELAGRAGSIAELDAAISVCRACPRLVTWREE
Mflv_2236|M.gilvum_PYR-GCK          PGDPATSRTAVASNPAQVTALAAAAKNLRQLDAGVSVCRACPRLVEWREE
Mvan_4459|M.vanbaalenii_PYR-1       PGDPATRRTAVAATSADVVTMAAAARSLSQLDAEISVCRACPRLVQWRED
TH_2407|M.thermoresistible__bu      PGDPATPHTAVAANAAQVQALAGAVETIEELDALVSVCRACPRLVQWREE
MSMEG_5031|M.smegmatis_MC2_155      PGDPATPSTPVAKTPAQVRRLAASAKTLDDLDAMISVCRACPRLVAWREE
                                                        ::  .  : :*:* :********** ***:

Mb1289|M.bovis_AF2122/97            VAVVKRRAFADQPYWGRPVPGWGSKRPRLLILGLAPAAHGANRTGRMFTG
Rv1259|M.tuberculosis_H37Rv         VAVVKRRAFADQPYWGRPVPGWGSKRPRLLILGLAPAAHGANRTGRMFTG
MLBr_01105|M.leprae_Br4923          VAVVKRRAFADQPYWGRPVPGWGSEQPRLLIVGLAPAAHGANRTGRMFTG
MAV_1407|M.avium_104                VAVVKRRAFADQPYWGRPVPSWGSARPRLLIVGLAPAAHGANRTGRMFTG
MMAR_4181|M.marinum_M               VAVAKRRAFADQPYWGRPVPGWGSERPWLLIVGLAPAAHGANRTGRMFTG
MUL_4483|M.ulcerans_Agy99           VAVAKRRAFADQPYWGRPVPGWGSERPWLLIVGLAPAAHGANRTGRMFTG
Mflv_2236|M.gilvum_PYR-GCK          AASVKRKSYADQPYWGRPAPGFGSARPRILIVGLAPAAHGANRTGRVFTG
Mvan_4459|M.vanbaalenii_PYR-1       VAVEKRRSYADQPYWGRPAPGFGSPRPRIFVVGLAPAAHGANRTGRVFTG
TH_2407|M.thermoresistible__bu      AATVKRRSYADEPYWGRPIPGWGAVRPRILVVGLAPAAHGGNRTGRVFTG
MSMEG_5031|M.smegmatis_MC2_155      VAVTKRKSFADQPYWGRPATGFGPEAPRILIAGLAPAAQGANRTGRVFTG
                                    .*  **:::**:****** ..:*.  * ::: ******:*.*****:***

Mb1289|M.bovis_AF2122/97            DRSGDQLYAALHRAGLVNSPVSVDAADGLRANRIRITAPVRCAPPGNSPT
Rv1259|M.tuberculosis_H37Rv         DRSGDQLYAALHRAGLVNSPVSVDAADGLRANRIRITAPVRCAPPGNSPT
MLBr_01105|M.leprae_Br4923          DRSGDQLYAALHRAGLVNLPISMDAADGLQANQIRITAPVRCAPPGNAPT
MAV_1407|M.avium_104                DRSGDQLYAALYRAGLVNQPTSVDAADGLRTKHIRIVAPVHCAPPANAPT
MMAR_4181|M.marinum_M               DRSGDQLYAALHRAGLVSQPTSVDAADGLRAERIRIVAPVRCAPPANAPT
MUL_4483|M.ulcerans_Agy99           DRSGDQLYAALHRAGLVSQPTSVDAADGLRAEPIRIVAPVRCAPPANAPT
Mflv_2236|M.gilvum_PYR-GCK          DRSGDFLFASLHRSGLANQSTCTDSADGLELNDVRVAAAVRCAPPDNAPS
Mvan_4459|M.vanbaalenii_PYR-1       DRSGDFLFGSLYRTGLANQQTVTDSADGLVLNDIRVAAAVRCAPPGNAPT
TH_2407|M.thermoresistible__bu      DRSGDFLFASLHRVGLANQAECVDAADGLELYDTRLAAAVRCAPPGNAPS
MSMEG_5031|M.smegmatis_MC2_155      DRSGDFLFAALHRAGLANQAVCVDAADGMRLIDTRMAAAVRCAPPGNAPE
                                    ***** *:.:*:* **..     *:***:     *:.*.*:**** *:* 

Mb1289|M.bovis_AF2122/97            PAERLTCSPWLNAEWRLVSDHIRAIVALGGFAWQVALRLA----GASGTP
Rv1259|M.tuberculosis_H37Rv         PAERLTCSPWLNAEWRLVSDHIRAIVALGGFAWQVALRLA----GASGTP
MLBr_01105|M.leprae_Br4923          QAEWVTCSPWLEAEWRLVSEYVRAIVALGGFAWQIVLRLP----GVSAMR
MAV_1407|M.avium_104                PVERDTCWPWLQAEWRLISEHVRVVVALGGFGWQIALRLP----GVPAAR
MMAR_4181|M.marinum_M               PAERKTCAPWLDAEWRLVSHDVRVIVALGGFAWKVALGLP----GSLGAP
MUL_4483|M.ulcerans_Agy99           PAERKTCAPWLDAEWRLVSHDVRVIVALGGFAWKIALGLP----GSLGAP
Mflv_2236|M.gilvum_PYR-GCK          PAERTTCAPWLDAEYRLTGTDVRVIVALGGFAWQVVLAMVRRTGGTVPVP
Mvan_4459|M.vanbaalenii_PYR-1       PAERSTCAPWLDAEWRLTAGHVRVIVALGGFAWQVALAMIRRAGGAVGAP
TH_2407|M.thermoresistible__bu      PAERATCAPWLHAEWRLTGPYVRVVVALGGFAWRAALQMI----GDVPRP
MSMEG_5031|M.smegmatis_MC2_155      PAERATCAPWLAAEWRLVGPSVAVIVALGGFAWRAALELI----ADRPKP
                                     .*  ** *** **:** .  : .:******.*: .* :     .     

Mb1289|M.bovis_AF2122/97            KPRFGHGVVTELGAG---VRLLGCYHPSQQNMFTGRLTPTMLDDIFREAK
Rv1259|M.tuberculosis_H37Rv         KPRFGHGVVTELGAG---VRLLGCYHPSQQNMFTGRLTPTMLDDIFREAK
MLBr_01105|M.leprae_Br4923          KPRFSHGVVAQLYAG---VRLLGCYHPSQQNMFTGRLTPAMLDDIFRDAK
MAV_1407|M.avium_104                KPRFGHGVVAELAPG---VRLLGCYHPSQQNMFTGRLTPAMLDDVFRDAK
MMAR_4181|M.marinum_M               RPRFGHGVVAELGSG---VQLLGCYHPSQQNMFTGRLTPAMLDDVFRDAK
MUL_4483|M.ulcerans_Agy99           RPRFGHGVVAELGSG---VQLLGCYHPSQQNMFTGRLTPAMLDDVFRDAK
Mflv_2236|M.gilvum_PYR-GCK          APKFGHAATAELATPRGAVTVLGCFHPSQQNTFTGRLTPDMMDAVFATAR
Mvan_4459|M.vanbaalenii_PYR-1       APKFGHGVTATLQTPAGDVALLGCYHPSQQNTFTGRLTPAMMDDIFATAL
TH_2407|M.thermoresistible__bu      APKFGHGATAELTGPHGPVTLIGCYHPSQQNTFTGKLTPAMLDDVFTLAI
MSMEG_5031|M.smegmatis_MC2_155      APKFGHGATATLTTAYGDVTLLGCYHPSQQNTFTGRLTPDMLDDIFELAK
                                     *:*.*...: *      * ::**:****** ***:*** *:* :*  * 

Mb1289|M.bovis_AF2122/97            KLAGIE---------------------
Rv1259|M.tuberculosis_H37Rv         KLAGIE---------------------
MLBr_01105|M.leprae_Br4923          KLAGI----------------------
MAV_1407|M.avium_104                GLAGIK---------------------
MMAR_4181|M.marinum_M               KLGGIG---------------------
MUL_4483|M.ulcerans_Agy99           KLGGIG---------------------
Mflv_2236|M.gilvum_PYR-GCK          TLSK-----------------------
Mvan_4459|M.vanbaalenii_PYR-1       AISRAE---------------------
TH_2407|M.thermoresistible__bu      SACQRSLSDRDHAEIAGRREGDGDARR
MSMEG_5031|M.smegmatis_MC2_155      QIGNERDGQ------------------