For questions or suggestions e-mail us at: ioerger@cs.tamu.edu
TPPTGGSVKAVAISIAAAVGGFLFGFDSSVINGAVGAITAHFALTPLMAGLTVASALLGCAVGAWFAGGI ADRIGRVRVMGVAAVLFAVSSVGSGLAFSAFDLMAWRITAGVAIGIASVIAPAYIAEIAPARIRGALTAL QQLALVIGIFVSLLSDAALASVAGGAANTSWFGVEAWRWMLLVGLVPAVVYAIIARRIPESPRYLARRGE YESAAAVLSRVLDVSIDDARRKVDQIWLFAVEGVEVVGASVPG*
Operon Prediction Model: Genebank
Paralogs
| species | id | gene | e-value | identity (len) | annotation |
| M. smegmatis MC2 155 | MSMEG_2004 | - | - | 100% (254) | arabinose-proton symporter |
| M. smegmatis MC2 155 | MSMEG_4182 | - | 1e-54 | 53.05% (213) | arabinose-proton symporter |
| M. smegmatis MC2 155 | MSMEG_5559 | - | 1e-28 | 31.28% (211) | metabolite/sugar transport protein |
Closest Orthologs (e-value cutoff: 1e-4)
| species | id | gene | e-value | identity (len) | annotation |
| M. bovis AF2122 / 97 | Mb3364 | sugI | 2e-24 | 30.60% (232) | sugar-transport integral membrane protein SugI |
| M. gilvum PYR-GCK | Mflv_3055 | - | 5e-55 | 53.99% (213) | sugar transporter |
| M. tuberculosis H37Rv | Rv3331 | sugI | 2e-24 | 30.60% (232) | sugar-transport integral membrane protein SugI |
| M. leprae Br4923 | MLBr_01562 | - | 4e-07 | 25.00% (220) | putative transmembrane efflux protein |
| M. abscessus ATCC 19977 | MAB_0692c | - | 1e-57 | 51.95% (231) | sugar transporter |
| M. marinum M | MMAR_1191 | sugI | 9e-30 | 33.80% (213) | sugar-transport integral membrane protein SugI |
| M. avium 104 | MAV_4306 | - | 2e-28 | 33.80% (216) | transporter, major facilitator superfamily protein |
| M. thermoresistible (build 8) | TH_2640 | - | 9e-18 | 52.50% (80) | PUTATIVE sugar transporter |
| M. ulcerans Agy99 | MUL_1444 | sugI | 6e-30 | 33.80% (213) | sugar-transport integral membrane protein SugI |
| M. vanbaalenii PYR-1 | Mvan_3472 | - | 7e-56 | 53.05% (213) | sugar transporter |
CLUSTAL 2.0.9 multiple sequence alignment
Mb3364|M.bovis_AF2122/97 --MTTLWQPHRNDYSPIPGRGVHARRGARRPRPRGGRAERPGTGQLTRSG
Rv3331|M.tuberculosis_H37Rv --MTTLWQPHRNDYSPIPGRGVHARRGARRPRPRGGRAERPGTGQLTRSG
MMAR_1191|M.marinum_M -------------------------------------------------M
MUL_1444|M.ulcerans_Agy99 -------------------------------------------------M
MAV_4306|M.avium_104 ---------------------------------------------MARGS
Mflv_3055|M.gilvum_PYR-GCK --------------------------MAGQGGPAGESPELYEKGQDFSSG
Mvan_3472|M.vanbaalenii_PYR-1 --------------------------MAGQGGPVGDGPEIFEQGQEFSSG
TH_2640|M.thermoresistible__bu --------------------------------------------------
MAB_0692c|M.abscessus_ATCC_199 ----------------------------------------MSEEDEFSSG
MSMEG_2004|M.smegmatis_MC2_155 ------------------------------------------MTPPTGGS
MLBr_01562|M.leprae_Br4923 MVTLYDHERAVHNWTAARSDRPAPSRLTQQVEPASERTSKYPTLLPSGRF
Mb3364|M.bovis_AF2122/97 RRALLVGLTAASVG--VLYGYDLSAIAGALLSLSEEFELTTREQELLTTT
Rv3331|M.tuberculosis_H37Rv RRALLVGLTAASVG--VLYGYDLSAIAGALLSLSEEFELTTREQELLTTT
MMAR_1191|M.marinum_M QRGILVALTAASVG--LIYGYDLSIIAGAQLFITEEFGLSTHQQELLTTM
MUL_1444|M.ulcerans_Agy99 QRGILVALTAASVG--LIYGYDLSIIAGAQLFITEEFGLSTHQQELLTTM
MAV_4306|M.avium_104 RRGLLVGLTAASVG--VIYGYDLSIIAGAQLFVTEDFGLSTRQQELLTTM
Mflv_3055|M.gilvum_PYR-GCK KTAIRIASVAALGG--LLFGYDSAVINGAVASIQEDFGIGNYALGLAVAS
Mvan_3472|M.vanbaalenii_PYR-1 KTAVRIASVAALGG--LLFGYDSAVINGAVDSIQEDFGIGNAELGFAVAS
TH_2640|M.thermoresistible__bu --------------------------------------------------
MAB_0692c|M.abscessus_ATCC_199 HSALRIASVAALGG--LLFGYDSAVINGAVQAIQDAFAIRDAELGFAVAS
MSMEG_2004|M.smegmatis_MC2_155 VKAVAISIAAAVGG--FLFGFDSSVINGAVGAITAHFALTPLMAGLTVAS
MLBr_01562|M.leprae_Br4923 PSGRFIAAVIVIGGMQLLATMDSTVAIVALPKIQNELSLSDAGRSWGITA
Mb3364|M.bovis_AF2122/97 AVLGQIAGALGGGILANAIGRKKSVVLIVAGYAVFALLGATSVSVPMLVV
Rv3331|M.tuberculosis_H37Rv AVLGQIAGALGGGILANAIGRKKSVVLIVAGYAVFALLGATSVSVPMLVV
MMAR_1191|M.marinum_M VVIGQIVGALGAGVLANAIGRKKSVVMLLVAYTMFAVLGSLSVSLPMLLA
MUL_1444|M.ulcerans_Agy99 VVIGQIVGALGAGVLANAIGRKKSVVMLLVAYTMFAVLGALSVSLPMLLA
MAV_4306|M.avium_104 AVIGQIGGALFAGVLANAIGRRRSVLLILSGYAVFALLAAFSVGLPMLLT
Mflv_3055|M.gilvum_PYR-GCK ALLGAAVGALSAGRIADRIGRIAVMKIAAVLFLLSAFGTAFAPETITLVI
Mvan_3472|M.vanbaalenii_PYR-1 ALLGAAAGAMTAGRIADRIGRIAVMKIAAVLFLVSAFGTGFAHEVWAVVL
TH_2640|M.thermoresistible__bu --------------------------------------------------
MAB_0692c|M.abscessus_ATCC_199 ALLGAAVGAMTAGRVADRIGRVAVMKIAAALFLLSAVGAGLAPNIELLVL
MSMEG_2004|M.smegmatis_MC2_155 ALLGCAVGAWFAGGIADRIGRVRVMGVAAVLFAVSSVGSGLAFSAFDLMA
MLBr_01562|M.leprae_Br4923 YVLTFGGLMLLGGRLGDTIGRKRTFIVGVALFTISSVLCAVAWDEVTLVI
Mb3364|M.bovis_AF2122/97 ARLLLGVTIGLSVVVVPVYVAESAPAA-VRGSLVTAYQLATLSGIVVGYL
Rv3331|M.tuberculosis_H37Rv ARLLLGVTIGLSVVVVPVYVAESAPAA-VRGSLVTAYQLATLSGIVVGYL
MMAR_1191|M.marinum_M ARFLLGLAVGVSIVVVPVYVAESAPAA-VRGSLLTVYQLTTVSGLIVGYL
MUL_1444|M.ulcerans_Agy99 ARFLLGLAVGVSIVVVPVYVAESAPAA-VRGSLLTVYQLTTVSGLIVGYL
MAV_4306|M.avium_104 ARLLLGLTIGVTVVVVPVYVAESAPTA-VRGALLTAYQLAIVSGLIVGYL
Mflv_3055|M.gilvum_PYR-GCK GRIVGGVGVGVASVIAPAYIAETSPPG-IRGRLGSLQQLAIVSGIFASFA
Mvan_3472|M.vanbaalenii_PYR-1 FRIVGGIGVGVASVIAPAYIAETSPPG-IRGRLGSLQQLAIVSGIFASFA
TH_2640|M.thermoresistible__bu --------------------------------------LAIVSGIFLSLL
MAB_0692c|M.abscessus_ATCC_199 FRVIGGVGVGVASLIAPAYIAETSPSR-IRGRLGSLQQLAIVTGIFLSLT
MSMEG_2004|M.smegmatis_MC2_155 WRITAGVAIGIASVIAPAYIAEIAPAR-IRGALTALQQLALVIGIFVSLL
MLBr_01562|M.leprae_Br4923 ARLLQGIGSAIASPTGLALVATTFSKGSARNTATAVFAAMTAIGSVMGLV
* . .
Mb3364|M.bovis_AF2122/97 VGYLLAGSHG------------WRAMFGLAAAPATLLLPLLWRMP-----
Rv3331|M.tuberculosis_H37Rv VGYLLAGSHG------------WRAMFGLAAAPATLLLPLLWRMP-----
MMAR_1191|M.marinum_M TGYLLAGTHS------------WRWMLGLATVPAMLLLPLLIRMP-----
MUL_1444|M.ulcerans_Agy99 TGYLLAGTHS------------WRWMLGLATVPAMLLLPLLIRMP-----
MAV_4306|M.avium_104 SGYLLAGTHS------------WRWMLGLACVPAVLLLPLVFRMP-----
Mflv_3055|M.gilvum_PYR-GCK VNYLLQWLAGGPNEPLWLGMDAWRWMFLAMAVPAVVYGGLAFTIP-----
Mvan_3472|M.vanbaalenii_PYR-1 VNWLLQWAAGGPNEPLWFGLDAWRWMFLAMALPAVLYGVLAFTIP-----
TH_2640|M.thermoresistible__bu IDGILAALAGGSREELWLNMEAWRWMFLMMAVPAVLYGALTFTIP-----
MAB_0692c|M.abscessus_ATCC_199 IDWLLAHLAGSSRDELWLGLPAWRWMFLAMALPALVYGTLAFTIP-----
MSMEG_2004|M.smegmatis_MC2_155 SDAALASVAGGAANTSWFGVEAWRWMLLVGLVPAVVYAIIARRIP-----
MLBr_01562|M.leprae_Br4923 VGGALTEVSWRLAFLVNVPIGLVMIYLARTALHETHKERMKLDAAGAILA
. * : : .
Mb3364|M.bovis_AF2122/97 -DTARWYLLKGRIADARSALRRIQPEADIDAELADMAAAVDERGG----G
Rv3331|M.tuberculosis_H37Rv -DTARWYLLKGRIADARSALRRIQPEADIDAELADMAAAVDERGG----G
MMAR_1191|M.marinum_M -DTPRWYVMKGRIQEARAALLRVDPAADVEEELAEIGTALSEGSG----G
MUL_1444|M.ulcerans_Agy99 -DTPRWYVMKGRIQEARAALLRVDPAADVEEELAEIGTALSEGSG----G
MAV_4306|M.avium_104 -DTARWYLLKGRVDDARRALLRVEPAARVDDELAEIGRAVSEEAASLPAM
Mflv_3055|M.gilvum_PYR-GCK -ESPRYLVATHKIPEARRVLSMLLGQKNLEITITRIRETLEREDRP-SWR
Mvan_3472|M.vanbaalenii_PYR-1 -ESPRYLVASHKIPEARRVLSMLLGKKNLEITITRIRETLEREDKP-SWR
TH_2640|M.thermoresistible__bu -ESPRYLVATHRVPEARRVLSRLLGAKNLEITINRIERSLRAEKPP-SWS
MAB_0692c|M.abscessus_ATCC_199 -ESPRYLVATHRIPEARTVLATLLGEKNLDITIGRIQETLDQSTAP-SWR
MSMEG_2004|M.smegmatis_MC2_155 -ESPRYLARRGEYESAAAVLSRVL-----DVSIDDARRKVDQ--------
MLBr_01562|M.leprae_Br4923 TLACTAAVFAFSMGPEKGWISLTTIGSGLVALVAAIAFAVVERTAENPVV
: : : :
Mb3364|M.bovis_AF2122/97 IGEMVRRPYLRATLFVIALGFLVQITGINAIIYYSPRLFAAMGFAGYFAM
Rv3331|M.tuberculosis_H37Rv IGEMVRRPYLRATLFVIALGFLVQITGINAIIYYSPRLFAAMGFAGYFAM
MMAR_1191|M.marinum_M VSEMLRPPFLRATIFVITLGFLIQITGINAIIYYSPRIFEAMGFTGDFAL
MUL_1444|M.ulcerans_Agy99 VSEMLRPPFLRATIFVITLGFLIQITGINAIIYYSPRIFEAMGFTGDFAL
MAV_4306|M.avium_104 LAEMVRSPYRRATVFVVVLGFLIQITGINAIIYYSPRIFEAMGFTGNFAL
Mflv_3055|M.gilvum_PYR-GCK DLKKPTGGIYGIVWVGLGLSIFQQFVGINVIFYYSNVLWQAVGFS-ADES
Mvan_3472|M.vanbaalenii_PYR-1 DMRKPTGGLYGIVWVGLGLSIFQQFVGINVIFYYSNVLWQAVGFS-ADES
TH_2640|M.thermoresistible__bu DLRKPTGGMYGIVWVGLGLSIFQQFVGINVIFYYSNVLWEAVGFD-ESQS
MAB_0692c|M.abscessus_ATCC_199 DLRKPTGGLHAIVWIGVALAVFQQLVGINVIFYYSNVLWQAVGFG-ESSS
MSMEG_2004|M.smegmatis_MC2_155 ------------IWL-----------------------------------
MLBr_01562|M.leprae_Br4923 PFDLFRDRNRLVTFIAIFLAGGVIFSLTVSIGLYIQDILGYSALRAGIGF
.
Mb3364|M.bovis_AF2122/97 LALPAMVQVAGLAAVCASLFLVDRLGRRPILLSGIATMITADAVLITVFA
Rv3331|M.tuberculosis_H37Rv LALPAMVQVAGLAAVCASLFLVDRLGRRPILLSGIATMITADAVLITVFA
MMAR_1191|M.marinum_M LGLPALVQIAGVAAVITSLLLVDRVGRRPILLSGIAIMIVADVALMAVFA
MUL_1444|M.ulcerans_Agy99 LGLPALVQIAGVAAVITSLLLVDRVGRRPILLSGIAIMIVADVALMAVFA
MAV_4306|M.avium_104 LALPALVQVAGLVAVGTALLLVDRVGRRPILLCGTAMMIVADVVLVAVFG
Mflv_3055|M.gilvum_PYR-GCK AVYTVITSVVNVLTTLIAIALIDKIGRKPLLLIGSVGMAVTLSTMAFIFA
Mvan_3472|M.vanbaalenii_PYR-1 AVYTVITSVINVLTTLIAIALIDKIGRKPLLLIGSSGMAVTLITMAVIFA
TH_2640|M.thermoresistible__bu FLITVITSVTNIVTTLIAIALIDKIGRKPLLLIGSTGMAGTLGVMAVIFG
MAB_0692c|M.abscessus_ATCC_199 FTITVITSITNIATTLVAIALIDRVGRKPLLLIGSAGMAATLGTMAVIFG
MSMEG_2004|M.smegmatis_MC2_155 ----------------FAVEGVEVVG------------------------
MLBr_01562|M.leprae_Br4923 IPFVIAMGIG----LGVSSQLVSRFSPRVLTIGGGWLLMVAMLYGWAFMN
: :. ..
Mb3364|M.bovis_AF2122/97 NDSD--------------GGTGLVLGFAGVLLFIIGFN-FGFGSLVWVYA
Rv3331|M.tuberculosis_H37Rv NDSD--------------GGTGLVLGFAGVLLFIIGFN-FGFGSLVWVYA
MMAR_1191|M.marinum_M R-----------------GQGAAILGFAGILLFIIGYT-MGFGSLGWVYA
MUL_1444|M.ulcerans_Agy99 R-----------------GQGAAILGFAGILLFIIGYT-MGFGSLGWVYA
MAV_4306|M.avium_104 R-----------------GPGGVIAGFAGVLLFIFGYT-MGFGSLGWVYA
Mflv_3055|M.gilvum_PYR-GCK NAELIENEAGE--MVPSLPGASGVIALIAANLFVVAFG-MSWGPVVWVLL
Mvan_3472|M.vanbaalenii_PYR-1 NATVGADGN------PSLPGASGVIALIAANLFVVAFG-MSWGPVVWVLL
TH_2640|M.thermoresistible__bu TAPVIDGQP-------QLGDVAGPVALVAANLFVVSFG-MSWGPVVWVLL
MAB_0692c|M.abscessus_ATCC_199 SATMVDGKP-------HLGPVAGPVALVAANLFVVAFG-MSWGPVVWVLL
MSMEG_2004|M.smegmatis_MC2_155 ------------------ASVPG---------------------------
MLBr_01562|M.leprae_Br4923 RGVPYFPNLLAPIVVGGIGIGMAVVPLTLSAIAGVGFDRIGPVSAIALML
Mb3364|M.bovis_AF2122/97 AESFPSRLRSMGSSLMLTSTLTANAIVAAFSLTMLRVLGGAGVFAVFGTF
Rv3331|M.tuberculosis_H37Rv AESFPSRLRSMGSSPMLTSTLTANAIVAAFSLTMLRVLGGAGVFAVFGTF
MMAR_1191|M.marinum_M SESFPTRLRSIGSSTMLTANLIANAIVAGVFLTMLHSLGGSGAFAVFGVL
MUL_1444|M.ulcerans_Agy99 SESFPTRLRSIGSSTMLTANLIANAIVAAVFLTMLHSLGGSGAFAVFGVL
MAV_4306|M.avium_104 SESFPSRLRSIGSSTMLTSNLVANAIVAAVFLTLLHSLGGAGTFAVFAVL
Mflv_3055|M.gilvum_PYR-GCK GEMFPNRIRAAALGLAAAGQWTANWVITVSFPELRNFLG--AAYGFYALC
Mvan_3472|M.vanbaalenii_PYR-1 GEMFPNRIRAAALGLAAAGQWAANWLITVTFPGLREHLG--LAYGFYGLC
TH_2640|M.thermoresistible__bu GEMFPNRIRAAALGLAAAGQWTANWVITVTFPGLREHLG--PAYGFYALC
MAB_0692c|M.abscessus_ATCC_199 GEIFPNRIRAAAMGLATAGNWAANWAVTVTFPALRDALG--IAYGCYALC
MSMEG_2004|M.smegmatis_MC2_155 --------------------------------------------------
MLBr_01562|M.leprae_Br4923 QSLGGPLVLAVIQAVITSRTLYLGGTTGPVKFMNDAQLRALDHGYTYGLL
Mb3364|M.bovis_AF2122/97 AVVAFVVVYRFAPETKGRKLEEIRHFWENGGRWPAERSPAADEP
Rv3331|M.tuberculosis_H37Rv AVVAFVVVYRFAPETKGRKLEEIRHFWENGGRWPAERSPAADEP
MMAR_1191|M.marinum_M ALVAFGVVYRYAPETKGRQLEEIRHFWENGGRWPEKLTEAP---
MUL_1444|M.ulcerans_Agy99 ALVAFGVVYRYAPETKGRQLEEIRHFWENGGRWPEKLTEAP---
MAV_4306|M.avium_104 AVVAFAFVHRYAPETKGRQLEDIRHFWENGGRWD----------
Mflv_3055|M.gilvum_PYR-GCK AVLSGLFVWKWVRETKGVSLEDMHGEILHGDKS-ATG-------
Mvan_3472|M.vanbaalenii_PYR-1 AVLSGLFVWRWVMETKGVSLEDMHGEILHADKT-ATG-------
TH_2640|M.thermoresistible__bu AVLSLLFVWRWVAETKGRTLEDMQPDVPGHDRASAPG-------
MAB_0692c|M.abscessus_ATCC_199 AVLSLLFVARWVEETKGRALEDMDSAIS----------------
MSMEG_2004|M.smegmatis_MC2_155 --------------------------------------------
MLBr_01562|M.leprae_Br4923 WVAGAVVIVGVAALFIGYTPEQVAYAQEVKEAVDAGEL------