For questions or suggestions e-mail us at: ioerger@cs.tamu.edu
MRRSTLLAGGLAVTMAVLLVIAMLMGRTTEPAGKTVVTVRLWDPQVAAAYRESFDAFSAEHPGIEVRVNT
VAYASYFDSLRTDVAGGSADDIFWISNGYFAGYADNGHLLDIADLLGPDAATAWEPSVVEQFTRNGALWG
VPQLTDAGIAVYYNADLLEKAGVSPADLSTLRWSNGPDDTLRPLLARLTVEESGRTRQWGYNAANDLQGI
YLNFIGSAGGTFSEGDRFTFDNPQAVEAFEYLVRLINTDRVAPPASDTNDNGDFSRNAFLAGRMALFQSG
TYNLAAIADQAPFPWGVAMLPIGPKGRVSVTNGIAAAGNAATRHPEAVREVLAWMGSRRGNEFVGRRGAA
IPAVLAAQPVYHEYWASRGVDVSPFFRVLQGPRIAAPGGAGFPAGFEALTPYFAEMFLGRRDVAGTLAEA
QRAANAAASR
Operon Prediction Model: GenBank
Paralogs
| species | id | gene | e-value | identity (len) | annotation |
| M. smegmatis MC2 155 | MSMEG_4468 | - | - | 100% (430) | extracellular solute-binding protein UspC |
| M. smegmatis MC2 155 | MSMEG_0515 | - | 3e-15 | 22.35% (443) | sugar transporter sugar binding lipoprotein |
| M. smegmatis MC2 155 | MSMEG_0553 | - | 2e-09 | 23.09% (459) | sugar ABC transporter, substrate-binding protein, putative |
Closest Orthologs (e-value cutoff: 1e-4)
| species | id | gene | e-value | identity (len) | annotation |
| M. bovis AF2122 / 97 | Mb2345 | uspC | 1e-172 | 68.71% (441) | periplasmic sugar-binding lipoprotein UspC |
| M. gilvum PYR-GCK | Mflv_3775 | - | 1e-07 | 25.13% (390) | extracellular solute-binding protein |
| M. tuberculosis H37Rv | Rv2318 | uspC | 1e-172 | 68.48% (441) | periplasmic sugar-binding lipoprotein UspC |
| M. leprae Br4923 | MLBr_01770 | uspC | 1e-166 | 64.25% (442) | sugar transport periplasmic binding protein |
| M. abscessus ATCC 19977 | MAB_1715c | - | 1e-150 | 62.41% (431) | sugar ABC transporter, sugar-binding protein |
| M. marinum M | MMAR_3619 | uspC | 1e-164 | 66.21% (441) | periplasmic sugar-binding lipoprotein UspC |
| M. avium 104 | MAV_2088 | - | 1e-174 | 69.39% (441) | extracellular solute-binding protein |
| M. thermoresistibile (build 8) | TH_2011 | - | 3e-08 | 23.16% (354) | substrate binding protein |
| M. ulcerans Agy99 | MUL_1246 | uspC | 1e-164 | 66.21% (441) | periplasmic sugar-binding lipoprotein UspC |
| M. vanbaalenii PYR-1 | Mvan_2624 | - | 4e-08 | 25.13% (390) | extracellular solute-binding protein |
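The e-value cutoff stated in the heading above can be applied to these rows programmatically. The following is a minimal sketch that assumes the pipe-delimited layout shown in the tables. The names `Hit`, `parse_hit`, and `passes_cutoff` are hypothetical helpers introduced for illustration, and treating the cutoff as inclusive is an assumption, not something the page states.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Hit:
    species: str
    locus: str
    gene: Optional[str]      # None when the table shows "-"
    evalue: Optional[float]  # None when the table shows "-" (e.g. the self hit)
    identity_pct: float
    length: int              # the number in parentheses after the identity
    annotation: str


def parse_hit(row: str) -> Hit:
    """Parse one pipe-delimited data row of the paralog/ortholog tables."""
    cells = [c.strip() for c in row.strip().strip("|").split("|")]
    if len(cells) != 6:
        raise ValueError(f"expected 6 columns, got {len(cells)}")
    species, locus, gene, evalue, identity, annotation = cells
    pct_text, _, len_text = identity.partition("%")
    return Hit(
        species=species,
        locus=locus,
        gene=None if gene == "-" else gene,
        evalue=None if evalue == "-" else float(evalue),
        identity_pct=float(pct_text),
        length=int(len_text.strip(" ()")),
        annotation=annotation,
    )


def passes_cutoff(hit: Hit, cutoff: float = 1e-4) -> bool:
    """Assumption: a hit is kept when its e-value is at or below the cutoff."""
    return hit.evalue is not None and hit.evalue <= cutoff
```

For example, `passes_cutoff(parse_hit(row))` can be used as a filter over the table rows. Rows without an e-value, such as the self hit in the paralog table, are rejected by this sketch.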
CLUSTAL 2.0.9 multiple sequence alignment
Mb2345|M.bovis_AF2122/97 ----------MTRPRQSTLVATALVLVAILLGVTAVLLGLSA--------
Rv2318|M.tuberculosis_H37Rv ----------MTRPRQSTLVATALVLVAILLGVTAVLLGLSA--------
MMAR_3619|M.marinum_M ----------MSRPRFSTLVAVAVTLIAALLGVTAVALDRID--------
MUL_1246|M.ulcerans_Agy99 ----------MNRPRFSTLVAVAVTLIAALLGVTAVALDRID--------
MAV_2088|M.avium_104 ----------MTRPRFSTLVAGAVALVAALLAAAAVLLDYSG--------
MLBr_01770|M.leprae_Br4923 ----------MTRPRYSTLVAEALALATVLLTATAMLMGWSGG-------
MSMEG_4468|M.smegmatis_MC2_155 ----------MRR---STLLAGGLAVTMAVLLVIAMLMGRTT--------
MAB_1715c|M.abscessus_ATCC_199 -------------MKASTRAALTLALVALLLFGVAAWLGIPT--------
Mflv_3775|M.gilvum_PYR-GCK MRFAQIPT-ARRGGK--TARSAVLALLAVLALVLSACAGSGGPEQAESTG
Mvan_2624|M.vanbaalenii_PYR-1 MRITQWPTSGRRSGRPGTAFSAVMALVAVLALVLTGCAGSGGPEQAEATG
TH_2011|M.thermoresistible__bu --------------VRRTPIRAVLAAVAAVCLTLTGCAGAGT--------
* :. : : .
Mb2345|M.bovis_AF2122/97 ---EPRGGKIVVTVRLWDEPIAAAYRQSFAAFTRSHPDIEVRTNLVAYST
Rv2318|M.tuberculosis_H37Rv ---EPRGGKIVVTVRLWDEPIAAAYRQSFAAFTRSHPDIEVRTNLVAYST
MMAR_3619|M.marinum_M ---APPGGKIVVTVRLWAAPIAAAYQQSFAAFSRTHPNIEVHTNLVSFST
MUL_1246|M.ulcerans_Agy99 ---APPGGKIVVTVRLWAAPIAAAYQQSFAAFSRTHPNIEVHTNLVSFST
MAV_2088|M.avium_104 ---QPHGDKTIVTVRVWGDELAEAYRQSFAAFTRAHPDIEVHVNMVAYST
MLBr_01770|M.leprae_Br4923 ---QLRGGKVVVTMRLWADQISTAYSQSFQAFTRTHPDIEVHTNVVAYSK
MSMEG_4468|M.smegmatis_MC2_155 ----EPAGKTVVTVRLWDPQVAAAYRESFDAFSAEHPGIEVRVNTVAYAS
MAB_1715c|M.abscessus_ATCC_199 ----TPHGKTVVTVRVWDQQVAEAYRGSFDEFSRRNPDIQVAVTVTSYAS
Mflv_3775|M.gilvum_PYR-GCK SGEVSADTSGTVRILMENVPDTDIVKSMVADFNAEYPGVEINIESLTFDQ
Mvan_2624|M.vanbaalenii_PYR-1 TGEVSTDVSGTVRILMENVPDTDIVKSMVADFNKEYPGVEINIESLTFDQ
TH_2011|M.thermoresistible__bu ----LGSSGNTVTIALVSNSQMSDAQRLAPHFEAANPDIRLKFVTLSENQ
* : : * *.:.: :
Mb2345|M.bovis_AF2122/97 YFETLRTDVAGGS-ADDIFWLSNAYFAAYADSGRLMKIQT--------DA
Rv2318|M.tuberculosis_H37Rv YFETLRTDVAGGS-ADDIFWLSNAYFAAYADSGRLMKIQT--------DA
MMAR_3619|M.marinum_M YFDTLRTDVAGGS-ADDIFWLSNAYLAAYADSGRLMKIDTAV------DP
MUL_1246|M.ulcerans_Agy99 YFDTLRTDVAGGS-ADDIFWLSNAYLAAYADSGRLMKIDTAV------DP
MAV_2088|M.avium_104 YFNTLRTDVAGGS-ADDIFWLSNAYLAAYADSGRLLNILDTLGTN---AA
MLBr_01770|M.leprae_Br4923 YFNTLRTDVAGGS-ADDIFWLSSAYLAAYADNGRLINISNSLGQR---AT
MSMEG_4468|M.smegmatis_MC2_155 YFDSLRTDVAGGS-ADDIFWISNGYFAGYADNGHLLDIADLLGPD---AA
MAB_1715c|M.abscessus_ATCC_199 YFNSLRTDVAGHG-ADDIFWLSNAYLSDYADTGNLVPVEP---------R
Mflv_3775|M.gilvum_PYR-GCK MRDKLVSSFQSSSPAYDLIVVDNPWMVDFANAKFLQPLDARIDSTPDYDA
Mvan_2624|M.vanbaalenii_PYR-1 MRDKLVSSFQSSSPTYDLIVVDNPWMVDFANAKFLQPLDARIDSTPDYDA
TH_2011|M.thermoresistible__bu ARAKITASTAMGGGEFDVVMISNYETPQWAADGWLVNLSDYARQTPGYDE
.: :. . *:. :.. :* * :
Mb2345|M.bovis_AF2122/97 ADWEPAVVDQFTRSGVLWGVPQLTDAGIAVFYNADLLAAAGVDPTQVDNL
Rv2318|M.tuberculosis_H37Rv ADWEPAVVDQFTRSGVLWGVPQLTDAGIAVFYNADLLAAAGVDPTQVDNL
MMAR_3619|M.marinum_M GEWEPAVVDQFTRNGVLWGVPQLTDAGIAVFYNADLLAAAGVDPAELDGL
MUL_1246|M.ulcerans_Agy99 GEWEPAVVDQFTRNGVLWGVPQLTDAGIAVFYNADLLAAAGVDPAELDGL
MAV_2088|M.avium_104 ADWERPVVEQFTRHGQLWGVPQLTDAGIALYYNADLLGAAGIDPAQLNSL
MLBr_01770|M.leprae_Br4923 SDWEPAVVDQFTRAGALWGVPQLTDAGIAVFYNADLLTAAGIDPVQLNRM
MSMEG_4468|M.smegmatis_MC2_155 TAWEPSVVEQFTRNGALWGVPQLTDAGIAVYYNADLLEKAGVSPADLSTL
MAB_1715c|M.abscessus_ATCC_199 ADWDPSVVAQFTRDGKLWGVPQLSDAGIALYYNKNLLDAAQVDPAELAEL
Mflv_3775|M.gilvum_PYR-GCK GDFFTPLTDITTVDGTRYGVPFYNYALGYLYNADDLAAANQQVPTTLDEL
Mvan_2624|M.vanbaalenii_PYR-1 ADFFKPLTDITTVDGARYGVPFYNYALGYLYNADDLTAANQQVPTTLDEL
TH_2011|M.thermoresistible__bu DDFIPSIRESLSYEGDMYAVPFYGES-SFLMYRKDLFEKSGIDMPHNPTW
: .: : * :.** : : :*
Mb2345|M.bovis_AF2122/97 RWSRGDD-DTLRPMLARLTVDADGRTANTPGFDARRVRQWGYNAANDPQA
Rv2318|M.tuberculosis_H37Rv RWSRGDD-DTLRPMLARLTVDADGRTANTPGFDARRVRQWGYNAANDPQA
MMAR_3619|M.marinum_M RWSPGPD-DTLRALLARLTVDADGHVGGTPGFDPGRVRQWGYNAANDPQA
MUL_1246|M.ulcerans_Agy99 RWSPGPD-DTLRALLARLTVDADGHVGGTPGFDPGRVRQWGYNAANDPQA
MAV_2088|M.avium_104 RWNPAGG-DTLRPLLARLTVDADGNRGDTRGFDPGRVRQWGYNAANDPQG
MLBr_01770|M.leprae_Br4923 QWTSNDD-DTLRPLLTQLTLDTNGHVAKTPGFDSRRVRQWGYNAANDPQA
MSMEG_4468|M.smegmatis_MC2_155 RWSNGPD-DTLRPLLARLTVEESG-----------RTRQWGYNAANDLQG
MAB_1715c|M.abscessus_ATCC_199 RWDPDPEVDTLRPMLHRLTA----------------PGHWGYNAANDLQG
Mflv_3775|M.gilvum_PYR-GCK VSTSKALKSGDRAGIA-------------------MQPQRGYKIFEEWG-
Mvan_2624|M.vanbaalenii_PYR-1 VSTSKALKSGDRAGIA-------------------MQPQRGYKIFEEWG-
TH_2011|M.thermoresistible__bu QQVADAAAALDTPDMAG-------------------ICLRGKPGWGEVLA
. : * :
Mb2345|M.bovis_AF2122/97 IYLNYIGSAGG-VFQRDGKFAFDNPGAIEAFRYLVGLINDDHVAPPASDT
Rv2318|M.tuberculosis_H37Rv IYLNYIGSAGG-VFQRDGKFAFDNPGAIEAFRYLVGLINDDHVAPPASDT
MMAR_3619|M.marinum_M IYLNYIGSAGG-VFMRDNEFAFDNPPAIDAFRYLVGLINNDHVAPPASDT
MUL_1246|M.ulcerans_Agy99 IYLNYIGSAGG-VFMRDNEFAFDNPPAIDAFRYLVGLINNDHVAPPASDT
MAV_2088|M.avium_104 IYLNYIGSAGG-VFQRGDEFAFDNPAAVSAFRYLVDLINRDHVAPPAADT
MLBr_01770|M.leprae_Br4923 IYLNYIGSAGG-VFQRGDEFAFDNPSAVEAFRYLVGLINNDHVAPPASET
MSMEG_4468|M.smegmatis_MC2_155 IYLNFIGSAGG-TFSEGDRFTFDNPQAVEAFEYLVRLINTDRVAPPASDT
MAB_1715c|M.abscessus_ATCC_199 IYLNYLGSAGA-VFQADDKFAFAKPRAEMAFTYLVDLINVDRVAPSAADT
Mflv_3775|M.gilvum_PYR-GCK ---NWLFAAGGSIYDADGKITLNTPEAKRALEAYIDTYN---TAAPANSL
Mvan_2624|M.vanbaalenii_PYR-1 ---NWLFAAGGSIYDADGKITLNTPEAKRALEAYIDTYN---TAAPANSL
TH_2011|M.thermoresistible__bu PLDTVINTFGGRWYDMDWNAQLDSPEVEEAVSFYVNLVRDHGEPGPATS-
. : : *. : . . : .* . *. : . . .* .
Mb2345|M.bovis_AF2122/97 NDNGDFSRNQFLAGKMALFQSGTYSLAPVARDALF----HWGVAMLPAGP
Rv2318|M.tuberculosis_H37Rv NDNGDFSRNQFLAGKMALFQSGTYSLAPVARDALF----HWGVAMLPAGP
MMAR_3619|M.marinum_M NDNGDFSRNQFLAGRMALFQSGTYSLAPVARDATF----RWGVAMLPIGP
MUL_1246|M.ulcerans_Agy99 NDNGDFSRNQFLAGRMALFQSGTYSLAPVARDATF----RWGVAMLPIGP
MAV_2088|M.avium_104 NDNGDFSRNQFLAGRMALFQSGTYNLAPVARDARF----RWGVAMMPAGP
MLBr_01770|M.leprae_Br4923 NNNGDFSRNQFLSGRMALFQSGTYNLALIAREARF----HWGIAMMPTGP
MSMEG_4468|M.smegmatis_MC2_155 NDNGDFSRNAFLAGRMALFQSGTYNLAAIADQAPF----PWGVAMLPIGP
MAB_1715c|M.abscessus_ATCC_199 NDNNDFSRNQFLQGRMALFQSGTYNLAQIQANATF----PWDVAMMPAGP
Mflv_3775|M.gilvum_PYR-GCK SWGMDEAQRSVSANQSASMINYNWQLPALNEPGSG----PAAGKIKLATI
Mvan_2624|M.vanbaalenii_PYR-1 SWGMDEAQRSVSANQAASMINYNWQLPALNEPGSG----PAAGKIKLATI
TH_2011|M.thermoresistible__bu --GFGECATQFAQGRAAMWYDATSAVSVLEDPAQSNVVGKVGYALAPTVV
. . . . .: * . . :. : . :
Mb2345|M.bovis_AF2122/97 AGRVSVTNGIAAAGNSASKHPDAVRQVLAWMGST-----EGNSYVG-RHG
Rv2318|M.tuberculosis_H37Rv AGRVSVTNGIAAAGNSASKHPDAVRQVLAWMGST-----EGNSYLG-RHG
MMAR_3619|M.marinum_M AGRVSVTNGIAAAGNSASQHPDAVRQVLAWMGSS-----AGNEYLG-RDG
MUL_1246|M.ulcerans_Agy99 AGRVSVTNGIAAAGNSASQHPDAVRQVLAWMGSS-----AGNEYLG-RDG
MAV_2088|M.avium_104 VGRVSVTNGIAAAGNAATKHPGAVRQVLAWMGSR-----QGNEYLG-RYG
MLBr_01770|M.leprae_Br4923 QGRVSVTNGIAVAGNSATKHPDAVRQVLAWMGSR-----EGNAYLG-RHG
MSMEG_4468|M.smegmatis_MC2_155 KGRVSVTNGIAAAGNAATRHPEAVREVLAWMGSR-----RGNEFVG-RRG
MAB_1715c|M.abscessus_ATCC_199 QGRVSVTNGIVAAANSSSPHPDAVHKVLAWMGST-----DGNSFLG-RSG
Mflv_3775|M.gilvum_PYR-GCK PGGKQVLGSWSWAIPANSATPDAAWAFVSWITAK-----PNDVVRTEKGG
Mvan_2624|M.vanbaalenii_PYR-1 PGGKQVLGSWSWAIPANSATPDAAWAFVSWITAK-----PNDVVRTEKGG
TH_2011|M.thermoresistible__bu KADAGWLYTWALGIPASSDNKDSAWKFISWMTDKNYIRLVGEELGWARVP
. . : : :. .::*: .: :
Mb2345|M.bovis_AF2122/97 AAIPAVLSAQPVYFDYWSARGVDVTPFFAVLNGPRIAAP----------G
Rv2318|M.tuberculosis_H37Rv AAIPAVLSAQPVYFDYWSARGVDVTPFFAVLNGPRIAAP----------G
MMAR_3619|M.marinum_M VAIPAVRSAQPSYFAFWEAKGVNVEPFFAVLSGSRIPAP----------G
MUL_1246|M.ulcerans_Agy99 VAIPAVRSAQPSYFAFWEAKGVNVNPFFAVLSGSRIPAP----------G
MAV_2088|M.avium_104 AAIPAVTSAQPVYFGYWAARGVDVTPFFAVLNGPRIAAP----------G
MLBr_01770|M.leprae_Br4923 AAVPAVRSAQSVYFDYWAAKGIDVTPFFSVLDGPHIPAP----------G
MSMEG_4468|M.smegmatis_MC2_155 AAIPAVLAAQPVYHEYWASRGVDVSPFFRVLQGPRIAAP----------G
MAB_1715c|M.abscessus_ATCC_199 SAIPAVLSARAPYFQYWADKGVDVSPFFEVLRGQQIAAP----------G
Mflv_3775|M.gilvum_PYR-GCK AAIRKSTLQNPAVLQ--GQFGEEYYRTVEQLLADAAPLT----------Q
Mvan_2624|M.vanbaalenii_PYR-1 AAIRQSTLQDPAVLG--GQFGEEYYRTVEQLLANAAPLT----------Q
TH_2011|M.thermoresistible__bu PGNRLSTYQIPEYQQASEAFGQVTLESLENADTQHPTVQPVPYTGVQFLA
. . * . .
Mb2345|M.bovis_AF2122/97 GAGFAAGQQALEPYFDEMFLGRGDVTTTLRQAQAAANAATQR-----
Rv2318|M.tuberculosis_H37Rv GAGFAAGQQALEPYFDEMFLGRGDVTTTLRQAQAAANAATQR-----
MMAR_3619|M.marinum_M GAGFAAGNDALKPYFDEMFLGRGQVATILREAQAAANAAARR-----
MUL_1246|M.ulcerans_Agy99 GAGFAAGNDALKPYFDEMFLGRGQVATILREAQAAANAAARR-----
MAV_2088|M.avium_104 GAGFAAGNDALRPYFDEMFSGRGDVATTLRRAQAAANAAAARR----
MLBr_01770|M.leprae_Br4923 GAGFPAGDDALQSYFDEMFLGHGDVEKILCQAQAAANTAAHR-----
MSMEG_4468|M.smegmatis_MC2_155 GAGFPAGFEALTPYFAEMFLGRRDVAGTLAEAQRAANAAASR-----
MAB_1715c|M.abscessus_ATCC_199 GQGFGAGFAALKPYFAEMFLGRLDVREALQQAQRAANRALER-----
Mflv_3775|M.gilvum_PYR-GCK GPSGEEMIQAVGTALNEAVAGEKSVDDALATAQAEAEKIQG------
Mvan_2624|M.vanbaalenii_PYR-1 GPSGEEMIQAVGTELNEAVAGKKSVDDALAAAQAEAEKIQG------
TH_2011|M.thermoresistible__bu IPEFQDLGTRVSQQISAAIAGQKSVKDALAQAQKYAQVVGKTYQEKP
: : . *. .* * ** *:
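The alignment blocks above can be stitched back into one gapped sequence per label and compared pairwise. The following is a minimal sketch that assumes each sequence line has the form `label sequence`, with a `|` in the label as in this listing. `read_alignment` and `percent_identity` are hypothetical helper names, and ignoring every column where either sequence has a gap is one convention among several. Identities computed this way may differ from the values in the tables above, which were presumably produced by a pairwise search rather than from this multiple alignment.

```python
from typing import Dict, List


def read_alignment(text: str) -> Dict[str, str]:
    """Concatenate the per-block rows of a CLUSTAL-style listing by label."""
    rows: Dict[str, List[str]] = {}
    for line in text.splitlines():
        parts = line.split()
        # Sequence lines have exactly two tokens and a '|' in the label;
        # the header line and the consensus lines do not.
        if len(parts) == 2 and "|" in parts[0]:
            rows.setdefault(parts[0], []).append(parts[1])
    return {label: "".join(chunks) for label, chunks in rows.items()}


def percent_identity(a: str, b: str) -> float:
    """Percent of identical residues over columns where neither row has a gap."""
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(a, b):
        if x == "-" or y == "-":
            continue
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared
```

Usage: pass the full alignment text to `read_alignment`, then call `percent_identity` on any two of the returned sequences, for example the rows labelled `MSMEG_4468|M.smegmatis_MC2_155` and `Rv2318|M.tuberculosis_H37Rv`.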