For questions or suggestions e-mail us at: ioerger@cs.tamu.edu

M. smegmatis MC2 155 MSMEG_0643 (-)

annotation: extracellular solute-binding protein, family protein 5, putative
coordinates: 725067 - 726722
length: 551

IAGRKCLKIASVAAVAVLGVAACGGGGGSGGSSGAANEGGEVNVTMTSFPDYVDPQLSYTMEGWEVLYNT
YVPLLTYKHAKGEEGAEVAPGLAEDMPEVSPDGKTYKLKLRQNMKYADGTPIKASDFTYAIQRLFKADSG
GSVFFGVIVGANDYAEGKADTITGIQTNDDTGDITINLTEANGTFDNLLGLPFSAPVPPTTPLDTDATNN
PPPGSGPFTITSVDAPHTLVMERNPQFQTVKDAGADEVADAHVDKIIVTQNKSNSAQVTDIEQNKTDFMH
DPPDADRLPEVKARFGDRFRLEDSINTYYFWMNTQQAPFNDVRVRQAVNYAIDPEALNRVFGGRLHPTQQ
ILPPGMPGYEEFTLYPGPDMDKAKQLIAEANPADRDITVWTDDEPDRKRIGEYYHDVLTQLGFNATLKVI
AGDVYWTTIGNQTTPDLDTGFGDWFQDFPHPDDFFRPLINGKSILPTNGNNFSRVAIPELDAKMNELLTQ
QLTDETKKGYADLDRAYMEQAVWAPYGNEQLATFLSDRMDFDRSYHHVLFNQDFTSFALK*
Operon Prediction Model: Genebank

Paralogs
speciesidgenee-valueidentity (len)annotation
M. smegmatis MC2 155MSMEG_0643--100% (551)extracellular solute-binding protein, family protein 5, putative
M. smegmatis MC2 155MSMEG_4545-1e-2927.83% (460) extracellular solute-binding protein, family protein 5
M. smegmatis MC2 155MSMEG_4354-3e-2524.09% (494) dipeptide-binding protein of ABC transport system

Closest Orthologs (e-value cutoff: 1e-4)
speciesidgenee-valueidentity (len)annotation
M. bovis AF2122 / 97Mb3690cdppA2e-0921.76% (510) periplasmic dipeptide-binding lipoprotein DppA
M. gilvum PYR-GCKMflv_2761-5e-2624.75% (501) extracellular solute-binding protein
M. tuberculosis H37RvRv3666cdppA8e-1021.76% (510) periplasmic dipeptide-binding lipoprotein DppA
M. leprae Br4923-----
M. abscessus ATCC 19977MAB_0719-7e-1824.44% (491) putative oligopeptide ABC transporter, solute-binding protein
M. marinum MMMAR_5154dppA7e-1024.10% (502) periplasmic dipeptide-binding lipoprotein DppA
M. avium 104MAV_0464-2e-0826.73% (217) extracellular solute-binding protein, family protein 5
M. thermoresistible (build 8)TH_4152-2e-1922.58% (527) PUTATIVE extracellular solute-binding protein, family 5
M. ulcerans Agy99MUL_4240dppA1e-0924.10% (502) periplasmic dipeptide-binding lipoprotein DppA
M. vanbaalenii PYR-1Mvan_0438-0.077.37% (548) extracellular solute-binding protein

CLUSTAL 2.0.9 multiple sequence alignment


MSMEG_0643|M.smegmatis_MC2_155      --------------------------------------MIAGRKCLKIAS
Mvan_0438|M.vanbaalenii_PYR-1       --------------------------------------MHIFRRALIIAC
Mb3690c|M.bovis_AF2122/97           ---------------------------------------------MVRRM
Rv3666c|M.tuberculosis_H37Rv        ---------------------------------------------MVRQM
MMAR_5154|M.marinum_M               ----------------------------------------------MRRM
MUL_4240|M.ulcerans_Agy99           MPLMAWGERRRDIANAPIAIRTAPMTMSRVPGALLGGAAGGSMVAVMRRM
MAV_0464|M.avium_104                --------------------------------------------------
Mflv_2761|M.gilvum_PYR-GCK          -----------------------------------------MARTAVGAA
TH_4152|M.thermoresistible__bu      -------------------------------------------------V
MAB_0719|M.abscessus_ATCC_1997      ---------------------------------------------MVRLR
                                                                                      

MSMEG_0643|M.smegmatis_MC2_155      VAAVAVLGVAACGGGGGSGGSSGAANEGGEVNVTMTSFPDYVDPQLSYTM
Mvan_0438|M.vanbaalenii_PYR-1       VASLAAFGVAACGSDDSSGGGGGS---GGDITVNATSFPDYIDPQLSYTV
Mb3690c|M.bovis_AF2122/97           RAALAALATGLLVLAPVAGCGGGV-LSPDVVLVNGGEPPNPLIPTGTNDS
Rv3666c|M.tuberculosis_H37Rv        RAALAALATGLLVLAPVAGCGGGV-LSPDVVLVNGGEPPNPLIPTGTNDS
MMAR_5154|M.marinum_M               RATLAVVAV-LLAVSPVAACGGGV-LSPDLVLVNGGEPPNPLIPTGTNDS
MUL_4240|M.ulcerans_Agy99           RATLAVVAV-LLAVSPVAACGGGV-LSPDLVLVNGGEPPNPLIPTGTNDS
MAV_0464|M.avium_104                MAVLATAAAALLVVASLAGCGGGV-LSPDLVVVNGGEPPNPLVPTGTNDS
Mflv_2761|M.gilvum_PYR-GCK          VLAVCALVLSLTGCNTGERVDLGD-GAGGALIAAIAGEPDQLDPHKTTAY
TH_4152|M.thermoresistible__bu      SNLVAVLLIAATGCAPGQRVDLGD-LSG-NLIAAIAGEPDQLDPHKTSAY
MAB_0719|M.abscessus_ATCC_1997      KLAVAASAILVAGCGMGRPPGAID---GEYLTVGTTDRVSTLDPAGAYDN
                                       :..              .         : .      . : *  :   

MSMEG_0643|M.smegmatis_MC2_155      EGWEVLYNTYVPLLTYKHAKGEEGAEVAPGLAEDMPEVSPDGKTYKLKLR
Mvan_0438|M.vanbaalenii_PYR-1       EGWEVLWNVYTPLLTYRHARGKEGTEVVPALAEALPDISPDGKTYKLKLR
Mb3690c|M.bovis_AF2122/97           NGGRIIDRLFAGLMSY-----DAVGKPSLEVAQSIESA--DNVNYRITVK
Rv3666c|M.tuberculosis_H37Rv        NGGRIIDRLFAGLMSY-----DAVGKPSLEVAQSIESA--DNVNYRITVK
MMAR_5154|M.marinum_M               YGGRIIDRLFAGLVSY-----DAKGKPSLEVAQSIDTT--DNVNYRILLK
MUL_4240|M.ulcerans_Agy99           YGGRIIDRLFAGLVSY-----DAKGKPSLEVAQSIDTT--DNVNYRILLK
MAV_0464|M.avium_104                QGGRILDRLFAGLMSY-----DAAGNPAPEVAQSIESG--DNVNYRIVLK
Mflv_2761|M.gilvum_PYR-GCK          FSFEVLENVFDTLVEP-----DQDLQMKPALAESWEVSP-DQLTWTFRLR
TH_4152|M.thermoresistible__bu      FSFQVLENVFDTLVEP-----DEDLRMRPALAESWQVSA-DQRVWTFRLR
MAB_0719|M.abscessus_ATCC_1997      GSFQVENQVYPFLMNFT----PGTGDLKPDLAAGCGFEN--PTLYRCTLK
                                     . .:  . :  *:                :*            :   ::

MSMEG_0643|M.smegmatis_MC2_155      QNMKYADGTPIKASDFTYAIQR------LFKADSGGSVFFGVIVGANDYA
Mvan_0438|M.vanbaalenii_PYR-1       PNMKYSDGTPIKASDFTYAIQR------LFKTDSGGSVFYNVIAGATEYA
Mb3690c|M.bovis_AF2122/97           PGWKFTDGSPVTAHSFVDAWNYGALSTNAQLQQHFFSPIEGFDDVAGAPG
Rv3666c|M.tuberculosis_H37Rv        PGWKFTDGSPVTAHSFVDAWNYGALSTNAQLQQHFFSPIEGFDDVAGAPG
MMAR_5154|M.marinum_M               PGWRFTDGSPVTAHSFVDAWNYGALSTNAQLQQHFFSPIEGYEEVAGEPG
MUL_4240|M.ulcerans_Agy99           PGWGFTDGSPVTAHSFVDAWNYGALSTNAQLQQHFFSPIEGYEEVAGEPG
MAV_0464|M.avium_104                PGWRFTDGSPVTAHSFVDAWNYGALSTNAQLQQSFFSPIDGYDALAA--G
Mflv_2761|M.gilvum_PYR-GCK          PGVTFHDGTPLAAEDVVFSYRR--------IIDEQLANSD----------
TH_4152|M.thermoresistible__bu      PGVTFHDGSPLTADDVVFSYRR--------IIDEHLTNVD----------
MAB_0719|M.abscessus_ATCC_1997      PGSVFANGHELTSSDVKFSYDR------ERVINDPNGPQS----------
                                     .  : :*  : : ..  :             :                 

MSMEG_0643|M.smegmatis_MC2_155      EGKADTITGIQTNDDTGDITINLTEANGTFDNLLGLPFSAPVPPTTPLDT
Mvan_0438|M.vanbaalenii_PYR-1       DGAADTITGITTDDGTGDITIQLTEPNGTFDNLLGLMFAAPIPQSTPLDA
Mb3690c|M.bovis_AF2122/97           DKSRTTMSGLRVVNDL-EFTVRLKAPTIDFTLRLGHSSFYPLPDSAFRDM
Rv3666c|M.tuberculosis_H37Rv        DKSRTTMSGLRVVNDL-EFTVRLKAPTIDFTLRLGHSSFYPLPDSAFRDM
MMAR_5154|M.marinum_M               EGKPTTMSGLHVVNNR-EFTVRLRAPTIDFMLSLGHSSFYPLPEAAFKDM
MUL_4240|M.ulcerans_Agy99           EGKPTTMSGLHVVNNR-EFTVRLRAPTIDFMLSLGHSSFYPLPEAAFKDM
MAV_0464|M.avium_104                QPQQTTMTGLRVVNDL-EFTVRLKAPTVDFKLRLGHSAFYPLPQAAFRDM
Mflv_2761|M.gilvum_PYR-GCK          --KFSSVQAVEAPDPS-TVVIRVDRPTPNMLTNLGGFKGMAIVSRANVES
TH_4152|M.thermoresistible__bu      --RLSAVTEVTAVDPL-TVRITVARPTPNLLTNLGGFKGMAIVQRANVET
MAB_0719|M.abscessus_ATCC_1997      --LLANLDRVETPDDL-TVDFRLKLPNDQTFPQVLATNAGPVIDEEVFPP
                                          :  : . :    . . :  ..      :      .:        

MSMEG_0643|M.smegmatis_MC2_155      DATNNPPPGSGPFTITSVDAPHTLVMERNPQFQTVKDAGADEVA-DAHVD
Mvan_0438|M.vanbaalenii_PYR-1       DATNNPPPASGPFMFTTVDAPRTLTMERNPQFQTVKDAGADEVA-DAGVD
Mb3690c|M.bovis_AF2122/97           AAFGRNPIGNGPYKLA--DGPAGPAWEHNVRIDLVPNPDYHGNR-KPRNK
Rv3666c|M.tuberculosis_H37Rv        AAFGRNPIGNGPYKLA--DGPAGPAWEHNVRIDLVPNPDYHGNR-KPRNK
MMAR_5154|M.marinum_M               AAFGRNPIGNGPYRLD--SGGEEPAWEHNVKIDLVPNPDYRGNR-KPRNK
MUL_4240|M.ulcerans_Agy99           AAFGRNPIGNGPYRLD--SGGEEPAWEHNVKIDLVPNPDYRGNR-KPRNK
MAV_0464|M.avium_104                AAFGRHPIGNGPYQLA--GGPDGPAWEHNVRIDLRPNPDYHGNR-KPRNK
Mflv_2761|M.gilvum_PYR-GCK          GRIATHPVGTGPFSFL--GQKSG------DSISLRANPDYWAG--PPGVA
TH_4152|M.thermoresistible__bu      GRIATHPVGTGPFEFR--GARSG------DSITLVANDDHWAG--PPRLN
MAB_0719|M.abscessus_ATCC_1997      DRLLDDDAIARAEPFA--GPYTITSHTKNQLIGLRANPKYVGGLGKPQWD
                                               .  :   .            :    :         .   

MSMEG_0643|M.smegmatis_MC2_155      KIIVTQNKSNSAQVTDIEQNKTDFMHDPPDADRLPEVKARFGDRFRLEDS
Mvan_0438|M.vanbaalenii_PYR-1       KITLIENKNQSAQVTDIMQNKVDFMMDPVPSDRLQEVKSRYSDRFRMEDS
Mb3690c|M.bovis_AF2122/97           GLRFEFYANLDTAYADLLSGNLDVLDT-IPPSALTVYQRDLGDHATSGPA
Rv3666c|M.tuberculosis_H37Rv        GLRFEFYANLDTAYADLLSGNLDVLDT-IPPSALTVYQRDLGDHATSGPA
MMAR_5154|M.marinum_M               GLRFEFYANLETAYSDLLSGNLDVLDT-IPSSALTVYGRDLGDNATSGPA
MUL_4240|M.ulcerans_Agy99           GLRFEFYANLETAYSDLLSGNLDVLDT-IPSSALTVYGRDLGDNATSGPA
MAV_0464|M.avium_104                GLRFEFYANLDTAYADLLSGNLDVLDT-IPPSALPVYRRDLGERVTAGPA
Mflv_2761|M.gilvum_PYR-GCK          GVTFRFIPEPSTALSALQAGEVDWTDS-VPPQRVSQLRDDESLQLTVTPS
TH_4152|M.thermoresistible__bu      GVTFRFISEPATALSALQAGEIDWTDA-IPTQRVTQLRNDDSLTLAVTPS
MAB_0719|M.abscessus_ATCC_1997      LIGIKYYTGGENLKIDIENRAIDVAYRSLSPNDIETLRVNPRLSVHEGPG
                                     : .            :     *       .. :               .

MSMEG_0643|M.smegmatis_MC2_155      --INTYYFWMNTQQAPFNDVRVR--QAVNYAID-PEALNRVFGGRLHPTQ
Mvan_0438|M.vanbaalenii_PYR-1       --INTYYMFMNTERAPFNDVRVR--QAINYAID-PEALNRIFGGRLHPTQ
Mb3690c|M.bovis_AF2122/97           AINQTLDTPLRLPHFGGEEGRLRR-LALSAAINRPQICQQIFAGTRSPAR
Rv3666c|M.tuberculosis_H37Rv        AINQTLDTPLRLPHFGGEEGRLRR-LALSAAINRPQICQQIFAGTRSPAR
MMAR_5154|M.marinum_M               AINQTLDTPLRLAHFGGEEGRLRR-LALSAAIDRPQICQQIFNGTRSAAR
MUL_4240|M.ulcerans_Agy99           AINQTLDTPLRLAHFGGEEGRLRR-LALSAAIDRPQICQQIFNGTRSAAR
MAV_0464|M.avium_104                AINQSLDTPLRLPHFGGEEGRLRR-LALSAAINRAQICRQIFADTRSPAR
Mflv_2761|M.gilvum_PYR-GCK          --NDYWYLALNQARAPWNDVRVR--QAIAYAIDRDAIVQATSYGTAAANQ
TH_4152|M.thermoresistible__bu      --NDYWYLALNHARAPWHDLRVR--RAIAFGIDREAIVTATGYGTATANQ
MAB_0719|M.abscessus_ATCC_1997      GELRYIVFNLKTMPGVTDAQKLAIRRAVASLVDREALSRDVYKGVYTPVY
                                             :.      .  ::    *:   ::          .   .  

MSMEG_0643|M.smegmatis_MC2_155      QILPPGMPGYEEFTLYPGP---DMDKAKQLIAEANPADRDITVWTDDEPD
Mvan_0438|M.vanbaalenii_PYR-1       QVLPPGMPGYQEYKLYPGP---DMDKARALIAEANPADRDITVWTDDEPD
Mb3690c|M.bovis_AF2122/97           DFTARSLPGFDPNLPGNEVLDYDPQRARRLWAQADAISPWSGRYAIAYNA
Rv3666c|M.tuberculosis_H37Rv        DFTARSLPGFDPNLPGNEVLDYDPQRARRLWAQADAISPWSGRYAIAYNA
MMAR_5154|M.marinum_M               DFTARSLPGFDPHIPGNEALDFNPQRARQLWAQANAISPWSGSYAIAYNA
MUL_4240|M.ulcerans_Agy99           DFTARSLPGFDPHIPGNEALDFNPQRARQLWAQANAISPWSGSYAIAYNA
MAV_0464|M.avium_104                DFTARSLPGFDPNIAGSDALDFDPERARRLWAQADAISAWSGSYPIAYNA
Mflv_2761|M.gilvum_PYR-GCK          LAIPEGNPWFTPYDRYREDGEEGLETARDLLRQANAT-----PGDLDMLV
TH_4152|M.thermoresistible__bu      LAIPPGDFWYTHYDRYRYD----LDRARALLDEARAAGVDP-PRRLDMLV
MAB_0719|M.abscessus_ATCC_1997      SVVPESMAGSVESFKALYGVKPNVDLARKFLSEAQVAAPVLINLQYN-PD
                                       . .                  : *: :  :*                

MSMEG_0643|M.smegmatis_MC2_155      RKRIGEYYHDVLTQLGFNATLKVIAG---DVYWTTIGNQTTPDLDTGFGD
Mvan_0438|M.vanbaalenii_PYR-1       RKRIGEYYHDLLTQLGFNATLKVIAG---DVYWTTVGNQSTPDVDTGFAD
Mb3690c|M.bovis_AF2122/97           DAGHRDWVDAVANSIKNVLGIDAVAAPQPTFAGFRTQITNRAIDSAFRAG
Rv3666c|M.tuberculosis_H37Rv        DAGHRDWVDAVANSIKNVLGIDAVAAPQPTFAGFRTQITNRAIDSAFRAG
MMAR_5154|M.marinum_M               DSGHQEWVDTVANSIKNVLGIEAMGAPQPTFAGLRTQITNRTIDTAFRAG
MUL_4240|M.ulcerans_Agy99           DSGHQEWVDAVANSIKNVLGIEAMGAPQPTFAGLRTQITNRTIDTAFRAG
MAV_0464|M.avium_104                DAGHQDWVDAVANGIKNVLGIDALGAPQPTFAGFRTQITNRSIGSAFRSG
Mflv_2761|M.gilvum_PYR-GCK          TTEYPETVTAAQIVADNLAPLGITVNIRTVDFATWLDEQNSGNFDMLMMG
TH_4152|M.thermoresistible__bu      TGDYPQTVTAAQIIADNVAPLGITVDIRTVDFATWLDEQNNGTFDMLMMG
MAB_0719|M.abscessus_ATCC_1997      HYGGNSSEEYAAVKGQLEASGLFSVDLQSTEWVAYQERRSSDSYPVYQFG
                                         .                                 .         .

MSMEG_0643|M.smegmatis_MC2_155      WFQDFPHPDDFFRPLINGKSILPTNGNNFSRVAIPELDAKMNELLTQQLT
Mvan_0438|M.vanbaalenii_PYR-1       WFQDFPHPDDFFRPLLHGDSILPTNGNNLSRANIAENNAKMDELVTKQIT
Mb3690c|M.bovis_AF2122/97           WQGDYPSMIGFLAPLFT-----AGAGSNDVGYINPEFDAALAAAEAAPTL
Rv3666c|M.tuberculosis_H37Rv        WRGDYPSMIEFLAPLFT-----AGAGSNDVGYINPEFDAALAAAEAAPTL
MMAR_5154|M.marinum_M               WQGDFPSMIEFLAPLYG-----TGAGSNDVGYSNQEFDDALAAAAAAPTL
MUL_4240|M.ulcerans_Agy99           WQGDFPSMIEFLAPLYG-----TGAGSNDVGYSNQEFDDALAAAAAAPTL
MAV_0464|M.avium_104                WQGDYPSMIEFLAPLFA-----TGAGSNDVGYSSPRFDAALATAEAAPDL
Mflv_2761|M.gilvum_PYR-GCK          WLGNIDPDDFYYAQHHT-----DGT-SNAQKFSNPEVDRLLDAGRVETDE
TH_4152|M.thermoresistible__bu      WLGNIDPDDYYYAQHHT-----DGA-SNAQRFSDPEVDRLLDAARAEPDR
MAB_0719|M.abscessus_ATCC_1997      WFPDFPDPDNYLTPFFMP------DNMLVNHFQNDTITRLITAEVTEPDS
                                    *  :      :                             :    .    

MSMEG_0643|M.smegmatis_MC2_155      DE-TKKGYADLDRAYME-QAVWAPYGNEQLATFLSDRMDFDRSYHHVLFN
Mvan_0438|M.vanbaalenii_PYR-1       DEGVEQQYADLDRAYME-QAVWAPYGNEQFTTFLSERMDFDKSYHHLLFK
Mb3690c|M.bovis_AF2122/97           TESHELVNDAQRILFHD-MPVVPLWDYISVVGWSSQVSNVTVTWNGLPDY
Rv3666c|M.tuberculosis_H37Rv        TESHELVNDAQRILFHD-MPVVPLWDYISVVGWSSQVSNVTVTWNGLPDY
MMAR_5154|M.marinum_M               QEADRLANVAQRILFHD-MPTIPLWDYIAVVGWSTEVSNVQVTWNGLPDY
MUL_4240|M.ulcerans_Agy99           QEADRLANVAQRILFHD-MPTIPLWDYIAVVGWSTEVSNVQVTWNGLPDY
MAV_0464|M.avium_104                PQAAALANDAQRILFHD-MPVVPLWNYISVVGWSGEVSHVTVTWNGLPDY
Mflv_2761|M.gilvum_PYR-GCK          QTRKAGYARAATLIADE-VSYIYLYNPSVIQAWNPALSGYEARRDGAVRF
TH_4152|M.thermoresistible__bu      DRRKALYDRAATLIADR-VSYLFLYNPAVVQAWSPDLSGYRARRDAAVRF
MAB_0719|M.abscessus_ATCC_1997      AKRLRIIGQIQDLMARDYISTLPLLTGKQIAVSVKNVDGIKLGPSFKFQF
                                                       .         .                    

MSMEG_0643|M.smegmatis_MC2_155      QDFTSFALK-------
Mvan_0438|M.vanbaalenii_PYR-1       QDFTSFALK-------
Mb3690c|M.bovis_AF2122/97           ENIVKA----------
Rv3666c|M.tuberculosis_H37Rv        ENIVKA----------
MMAR_5154|M.marinum_M               ENLVKA----------
MUL_4240|M.ulcerans_Agy99           ENLVKA----------
MAV_0464|M.avium_104                ENIVKA----------
Mflv_2761|M.gilvum_PYR-GCK          RSASLDSDGNV-----
TH_4152|M.thermoresistible__bu      RDASLNRDGLNRDGPP
MAB_0719|M.abscessus_ATCC_1997      TPLKKTGGTA------