For questions or suggestions e-mail us at: ioerger@cs.tamu.edu
MFELTDIQNWDDAPGKHVSWGPSPSTVAKVAEAPVSDVPASYQQAQHLRAYREHTARGVPMARLTVPVWN MDSQCDMRAMSHVINAYLRRHDTYHSRFEFTVDDRIVRRKLRSPRDLRFVPTDHGVQTCDQWREHILDTP GPLQWDCFRFGIIQRTDHFTCYMSVDHVHVDATFLGLMLIEIHLMYAALVSGGAPITLPPAGSYDDYCVR QRKYTSGLTLDSPEIKEWVTFLEGNNGTMPKFPLPLGDLSVPCTGDLMTVQLLDEPQTQGFEKACVAAGS RFIGGVFAAAALAQYQLTDIDTYHVITPTTTRGTEAEVMATGWFTGTVPITVPVGSSFAETARTAQRSFD SGLYLAHVPFDRVLELGATERGLRAPDPGVPMVSYLDATAAPLSPAVVAEWNRINGRIFSEMGAANQVGM WVNQFGSGTWITVAFPNNPVARASVQEYVDAFRSVCVAVAEGRHDDVPTPRVNELDLRSA
Operon Prediction Model: GenBank
Paralogs
| species | id | gene | e-value | identity (len) | annotation |
| --- | --- | --- | --- | --- | --- |
| M. smegmatis MC2 155 | MSMEG_4728 | - | - | 100% (480) | condensation domain-containing protein |
| M. smegmatis MC2 155 | MSMEG_0409 | - | e-113 | 42.74% (468) | condensation domain-containing protein |
| M. smegmatis MC2 155 | MSMEG_0019 | - | 1e-05 | 23.42% (316) | amino acid adenylation |
Closest Orthologs (e-value cutoff: 1e-4)
| species | id | gene | e-value | identity (len) | annotation |
| --- | --- | --- | --- | --- | --- |
| M. bovis AF2122/97 | Mb3850c | papA2 | 1e-141 | 53.56% (463) | polyketide synthase associated protein PapA2 |
| M. gilvum PYR-GCK | Mflv_0404 | - | 1e-112 | 42.89% (471) | condensation domain-containing protein |
| M. tuberculosis H37Rv | Rv3820c | papA2 | 1e-141 | 53.56% (463) | polyketide synthase associated protein PapA2 |
| M. leprae Br4923 | MLBr_01230 | papA3 | 1e-131 | 47.84% (462) | PKS-associated protein, unknown function |
| M. abscessus ATCC 19977 | MAB_3147c | - | 1e-123 | 49.26% (471) | polyketide synthase associated protein |
| M. marinum M | MMAR_2343 | - | 1e-144 | 53.55% (465) | hypothetical protein MMAR_2343 |
| M. avium 104 | MAV_2723 | - | 1e-126 | 50.34% (435) | PapA2 protein |
| M. thermoresistibile (build 8) | TH_2204 | papA1 | 1e-150 | 54.53% (475) | PROBABLE CONSERVED POLYKETIDE SYNTHASE ASSOCIATED PROTEIN |
| M. ulcerans Agy99 | MUL_1531 | papA3 | 1e-135 | 51.49% (470) | polyketide synthase associated protein PapA3 |
| M. vanbaalenii PYR-1 | Mvan_0270 | - | 1e-111 | 42.95% (468) | condensation domain-containing protein |
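The e-value column mixes notations (`1e-141`, `e-113`, and `-` for the self-hit), so applying a cutoff such as the 1e-4 stated above needs a small normalisation step first. The following is a minimal sketch, not the site's own pipeline: the function names `parse_evalue` and `passes_cutoff` are hypothetical, and treating the cutoff as inclusive (`<=`) is an assumption.

```python
def parse_evalue(text):
    """Convert a BLAST-style e-value string to a float.

    Returns None for '-' or an empty cell (no e-value reported).
    """
    text = text.strip()
    if text in ("", "-"):
        return None
    # Some reports abbreviate 1e-113 as "e-113"; restore the leading 1.
    if text.startswith("e"):
        text = "1" + text
    return float(text)


def passes_cutoff(text, cutoff=1e-4):
    """True if the hit has an e-value at or below the cutoff (assumed inclusive)."""
    value = parse_evalue(text)
    return value is not None and value <= cutoff


# A few (id, e-value) pairs copied from the tables on this page.
hits = {
    "Mb3850c": "1e-141",
    "MSMEG_0409": "e-113",
    "MSMEG_0019": "1e-05",
    "MSMEG_4728": "-",  # self-hit, no e-value listed
}
kept = [gene for gene, evalue in hits.items() if passes_cutoff(evalue)]
print(kept)
```

Rows without an e-value are dropped rather than kept by default; whether the self-hit should count as passing is a design choice left to the caller.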
CLUSTAL 2.0.9 multiple sequence alignment
Mb3850c|M.bovis_AF2122/97 -------MFSITTLRDWTPDPGSIICWHASPTAKAKARQAPISEVPPSYQ
Rv3820c|M.tuberculosis_H37Rv -------MFSITTLRDWTPDPGSIICWHASPTAKAKARQAPISEVPPSYQ
MSMEG_4728|M.smegmatis_MC2_155 -------MFELTDIQNWDDAPGKHVSWGPSPSTVAKVAEAPVSDVPASYQ
MAV_2723|M.avium_104 --------------------------------------------MPASYQ
MMAR_2343|M.marinum_M MKLVVLGNVGLETVSNWVPAPGLVWRWQPSSATLEKVRQAPVSAVPPSHI
TH_2204|M.thermoresistible__bu --MVALGNVAVETVREWRPDPGRVVSWDPSPAALEKARQAPVSSVPVSYM
MUL_1531|M.ulcerans_Agy99 --MVRVGKVEAGTISDWHPEPGTLVSWQPSRGSLAKAAKAPISPVPPSYM
Mflv_0404|M.gilvum_PYR-GCK ---MRIGKITVGALEEWSLSPGKVISWHPTAASIEKARQAPVSSVPVSYM
Mvan_0270|M.vanbaalenii_PYR-1 ---MRIGKITVGALDEWSLNPGSVTSWHPTAAAVETARRAQVSSVPVSYM
MLBr_01230|M.leprae_Br4923 ---MQVGPLTLGTLLDWAPRAGKTISWQPTPATCEKVSQAPVSSVPVAYM
MAB_3147c|M.abscessus_ATCC_199 ---MRGGPVTVSLTDKWEPTAGSVITWQPSPASYAKALEAPVSDVPPSFM
:* :.
Mb3850c|M.bovis_AF2122/97 QAQHLRRYRDHVARGLDMSRLMIFTWDLPGRCNIRAMNYAINAHLRRHDT
Rv3820c|M.tuberculosis_H37Rv QAQHLRRYRDHVARGLDMSRLMIFTWDLPGRCNIRAMNYAINAHLRRHDT
MSMEG_4728|M.smegmatis_MC2_155 QAQHLRAYREHTARGVPMARLTVPVWNMDSQCDMRAMSHVINAYLRRHDT
MAV_2723|M.avium_104 QLHHLRRFSEHAARGLDMARLNIGVWDISGVCDVAAMTEAINAHLRRHDT
MMAR_2343|M.marinum_M QARHLRGFAEQTARGHDMSRLVVAAMDIPGECDIRAMTYVINSHLRRHDT
TH_2204|M.thermoresistible__bu QAQHLRGFVEHAARGQEMSRLCIAAFDIPGRCDLRAMTYVINAHMRRHDT
MUL_1531|M.ulcerans_Agy99 QAHHLRNFRSYRERGLEMSRLLISSWDIPGICDIRTMTHVVNAHMRRHDT
Mflv_0404|M.gilvum_PYR-GCK QGQHLRNYCDRTTEGLNFSRQIIASCDVAGVCDIEAMDHAVNAYLRRHDT
Mvan_0270|M.vanbaalenii_PYR-1 QGQHLRNYWERTTAGLNFSRQIIASCEVPGQCDIAAMDHAVNAYLRRHDT
MLBr_01230|M.leprae_Br4923 QAQHIRGYVEQKAKGLDYSRLMIVSCDQLGQCDIRAINYIVNAHLRRHDT
MAB_3147c|M.abscessus_ATCC_199 QVQHLRTYLRQAAKGLDFSRVLVFTLDMPGRCDKRAMGHVINAHLRRHDT
* :*:* : * :* : : . *: :: :*:::*****
Mb3850c|M.bovis_AF2122/97 YHSWFEFDNAEHIVRHTIADPADIEVVQAEHQNMTSA-ELRHHIA-TPQP
Rv3820c|M.tuberculosis_H37Rv YHSWFEFDNAEHIVRHTIADPADIEVVQAEHQNMTSA-ELRHHIA-TPQP
MSMEG_4728|M.smegmatis_MC2_155 YHSRFEFTVDDRIVRRKLRSPRDLRFVPTDHGVQTCD-QWREHILDTPGP
MAV_2723|M.avium_104 YHSWFEHRTDGRIVRHTFDDPADIEFAALQRGEMTPT-ELRAHILATPNP
MMAR_2343|M.marinum_M YRSWFEFTESNRIARHTLTDPNDIELTPIEHGEMSPK-QWQDYILATPGP
TH_2204|M.thermoresistible__bu YRSWFEYHDFHHIVRHTITDPADIELVPTRHGAMTPE-QWQQHLLSTPDP
MUL_1531|M.ulcerans_Agy99 FRSWFEHTEGDHFVRRTLERPNDIQFVAVEHGELTTQSQWRERLLATPSP
Mflv_0404|M.gilvum_PYR-GCK FRSWFEHSGDGEFVRRALVDPEDIEFVPVEHGHMGVDEIHAHVV-AIPSP
Mvan_0270|M.vanbaalenii_PYR-1 FRSWFERTEEGEFLRRAIADPADIEFVPIEHGDMTVDEIFAHVV-DIPSP
MLBr_01230|M.leprae_Br4923 YRSWFEYTDEGEIIRHTLCDPADIEFVPIEHGQLSLAQIRELAV-STPDP
MAB_3147c|M.abscessus_ATCC_199 YRSWFSLDDEQNIVRRTMADPADVEFVQVKLGEMTSDEVRELVVSETPDP
::* *. .: *: : * *:... : * *
Mb3850c|M.bovis_AF2122/97 LQWDCFLFGIIQSDDHFTFYASIAHLCVDPMIVGVLFIEIHMMYSALVGG
Rv3820c|M.tuberculosis_H37Rv LQWDCFLFGIIQSDDHFTFYASIAHLCVDPMIVGVLFIEIHMMYSALVGG
MSMEG_4728|M.smegmatis_MC2_155 LQWDCFRFGIIQRTDHFTCYMSVDHVHVDATFLGLMLIEIHLMYAALVSG
MAV_2723|M.avium_104 LRWDCFTFGLVQHPDHFTFYMSADHLVIDGMSVGVIFLEIHLTYAALVSG
MMAR_2343|M.marinum_M LEWDCFRFAIIQRDDHFTFCVSVDHLNVDAMFISAVFWEIEAMYNTLADG
TH_2204|M.thermoresistible__bu LQWNCFRFSIIQRSDHFTFCVCMDHVHIDAMFMGAVFMEIHMQYAALVGG
MUL_1531|M.ulcerans_Agy99 LEWGCVRFSIIQRADHFTFCVAIDHLHCDAMFVGVVFAEIHLMYLALVSG
Mflv_0404|M.gilvum_PYR-GCK LEWGCFTFGVIQNDDHFSFFASMDHVHGDATLIGTTMMEANGMYSALSSG
Mvan_0270|M.vanbaalenii_PYR-1 LEWGCFTFGVIQHDGHFTFFASMDHVHGDATLIGTTMMEANGMYSALSGG
MLBr_01230|M.leprae_Br4923 LEWGCFQFGVIQAEEHFTFYASIDHVHVDAMIVGVTLMEFHMMYSALVSG
MAB_3147c|M.abscessus_ATCC_199 FRWDCFRFGIVQSSGHFTFYFSVDHLHLDATFARLLIMEILMGYKALVQG
:.*.*. *.::* **: . *: * : * * :* *
Mb3850c|M.bovis_AF2122/97 DPPIELPPAGRYDDHCVRQYADTAALTLDSARVRRWVEFAANNDGTLPHF
Rv3820c|M.tuberculosis_H37Rv DPPIELPPAGRYDDHCVRQYADTAALTLDSARVRRWVEFAANNDGTLPHF
MSMEG_4728|M.smegmatis_MC2_155 GAPITLPPAGSYDDYCVRQRKYTSGLTLDSPEIKEWVTFLEGNNGTMPKF
MAV_2723|M.avium_104 GRPLPLPEPASYHDYCRRQHQHTEALTLQSPQVRAWIRFAEDNGGTLPSF
MMAR_2343|M.marinum_M GAPISLPEAGSYGEYCLRERAYTSALTLESPEVRQWIDFFESNDGALPSF
TH_2204|M.thermoresistible__bu GAPIPL-QAGSYDDFCVRQRQYTESLTADSPEVRAWVDFAGQMGGTLPNF
MUL_1531|M.ulcerans_Agy99 GAPLRLAEPGSYDNYCNRQREHISGLTLDSPVMSKWTEFFDNNDGSLPKF
Mflv_0404|M.gilvum_PYR-GCK GAALALPDAGSFDDFCVREREYTAELTEDSPGVRAWIEFAENNSDGFPEF
Mvan_0270|M.vanbaalenii_PYR-1 GAALTLPDAGSFDEFCVRERAHTSELTEDSPEVRAWIEFAENSGGGFPEF
MLBr_01230|M.leprae_Br4923 AGPLELPEAGSYDDFCRRQHRFTSMLTAESPQIRSWTQFAEPNEGSFPDF
MAB_3147c|M.abscessus_ATCC_199 GAPIELPPAGSYGDYCIRQHEFLSGLTPDSEPVREWTQFAENNRGSLPDF
.: * .. : :.* *: ** :* : * * . :* *
Mb3850c|M.bovis_AF2122/97 PLPLGDLSVPHTGKLLTETLMDEQQGERFEAACVAAGARFSGGVFACAAL
Rv3820c|M.tuberculosis_H37Rv PLPLGDLSVPHTGKLLTETLMDEQQGERFEAACVAAGARFSGGVFACAAL
MSMEG_4728|M.smegmatis_MC2_155 PLPLGDLSVPCTGDLMTVQLLDEPQTQGFEKACVAAGSRFIGGVFAAAAL
MAV_2723|M.avium_104 PLPLGDPSVPCGSGVVVAPLMDESQTERFDATCTKAGARFSGGVFACAAF
MMAR_2343|M.marinum_M PLPLGDLSLVDTGELLSLQLMDAGQTARFEAACTSAGARFSGGVFACAAL
TH_2204|M.thermoresistible__bu PLPLGDRTKPWPGELMVVKLLDGRQTDRFEAACVSAGARFVGGVFACAAL
MUL_1531|M.ulcerans_Agy99 PLPLGDTSAQC--EMMGVRLMDERQTLALEAACMSAGARFCGGVFAISAV
Mflv_0404|M.gilvum_PYR-GCK PLPLGNPKDSTRSVMTSAVLMDTAQTERFDAAATAAGARFVGGLFACLAQ
Mvan_0270|M.vanbaalenii_PYR-1 PLPLGNPAESTRSCMTSEILMDTAQTERFESACTAAGARFVGGLFACLAQ
MLBr_01230|M.leprae_Br4923 PLPLGDPLEPTQADIVTITMLDEQQTDRFEAACTVAGARFVGGVLACCGL
MAB_3147c|M.abscessus_ATCC_199 PLPLGDHGVQCGTAIVTEQLLDEQQALKFESLCVDAGARFIGGVMAALGF
*****: : ::* * :: . **:** **::* .
Mb3850c|M.bovis_AF2122/97 AERELTNCETFDVVTTTDTRRTPTELRTTGWFTGLVPITVPVASGLFDSA
Rv3820c|M.tuberculosis_H37Rv AERELTNCETFDVVTTTDTRRTPTELRTTGWFTGLVPITVPVASGLFDSA
MSMEG_4728|M.smegmatis_MC2_155 AQYQLTDIDTYHVITPTTTRGTEAEVMATGWFTGTVPITVPVGS-SFAET
MAV_2723|M.avium_104 AEYELTGAETYCAITPYDHRSTPAEFVTPGWFASFIPVTFPVAGASFGDA
MMAR_2343|M.marinum_M AEYELTGSRTYYAVTPTSTRSTPAEFMTTGWFVGHIPFTVAVA-SSFDET
TH_2204|M.thermoresistible__bu TENELAGVETFHAITPTDTRSTPADFMTTGWFTGMVPITVPAAGVSFGEA
MUL_1531|M.ulcerans_Agy99 VQHELTGADEYYAIVPIDIRRTEEDFMTTGWFTGFVPITVPTVGSSFGEI
Mflv_0404|M.gilvum_PYR-GCK VEHELTGALTYYGLTPRDSRSGSDNFMTQGWFTGLIPITVPIGAASFGEA
Mvan_0270|M.vanbaalenii_PYR-1 VEHELTGALTYYGLTPRDSRSASDNFMTQGWFTGLIPITVPIGATSFADA
MLBr_01230|M.leprae_Br4923 AEYELTGADTYYGLTPRDTRRIPTDVLTQGWFTGLVPITVPIAGSSFGDA
MAB_3147c|M.abscessus_ATCC_199 AERELTGTDTYYGITPSDAR-EEADMFTTGWFTGLVPITAPVDG-TFAAA
.: :*:. : :.. * :. : ***.. :*.* . *
Mb3850c|M.bovis_AF2122/97 ARVAQISFDSGKDLATVPFDRVLELARPETGLRPPRPGNFVMSFLDASIA
Rv3820c|M.tuberculosis_H37Rv ARVAQISFDSGKDLATVPFDRVLELARPETGLRPPRPGNFVMSFLDASIA
MSMEG_4728|M.smegmatis_MC2_155 ARTAQRSFDSGLYLAHVPFDRVLELGATERGLRAPDPGVPMVSYLDATAA
MAV_2723|M.avium_104 VIAAQASFDSAIGLADVPFDRVLELSSFGGRISKPTGDVHMLSFADARGI
MMAR_2343|M.marinum_M VRGAQANFDASAHLANVPFERVLELAPWLKKP-PPRGGFPMVSFLDGGVP
TH_2204|M.thermoresistible__bu ARAAQQSFDSGIGLGHVPYDRVLELVPSLRRA-ETCS--PMMSFLDAGVP
MUL_1531|M.ulcerans_Agy99 VKAAQGSFDSGRDLAEVPLDCVMELVLWLREG---QWGAPLLFYLDAGIP
Mflv_0404|M.gilvum_PYR-GCK AWAAQSSFDSGLNMAKVPYYRVLELAPWLGWP---RPNFPVSNFFHGGAA
Mvan_0270|M.vanbaalenii_PYR-1 AWAAQASFDSNLFMAKVPYYRVLELAPWLSWP---RPNFPVSNFFHGGAA
MLBr_01230|M.leprae_Br4923 ARAAQNCFDTDVHLAEVPYDRVVELAPTVHKP---RPNFPVINFLDAGTA
MAB_3147c|M.abscessus_ATCC_199 AVAAQESFDRGRQLVNVPFYRVLELVPQLSWP---RPYHPMINFFDGGAP
. ** ** : ** *:** : : ..
Mb3850c|M.bovis_AF2122/97 PLSTVANSDLN-----FRIYDEGRVSHQVSMWVNRYQHQTTVTVLFPDNP
Rv3820c|M.tuberculosis_H37Rv PLSTVANSDLN-----FRIYDEGRVSHQVSMWVNRYQHQTTVTVLFPDNP
MSMEG_4728|M.smegmatis_MC2_155 PLSPAVVAEWNRIN--GRIFSEMGAANQVGMWVNQFGSGTWITVAFPNNP
MAV_2723|M.avium_104 PFS----GQWDGLN--AGIYGDGRSSDQVLMWVNRFDTETTLTVAFPQNP
MMAR_2343|M.marinum_M PLSGVVAMQLDRIN--ARAFSDGRVAARVCIWVNKFQEETTVTASFPNNP
TH_2204|M.thermoresistible__bu PLSALVASQLDGLN--ARVYGDGKVPAQLCMWVNRMDNETSVTVFFPDNP
MUL_1531|M.ulcerans_Agy99 PLSAMANSHVEGLR--ARLCHDGGMMGQIDIRVNRLEKETQLTVLFPNNP
Mflv_0404|M.gilvum_PYR-GCK PLNAILAASEMGLADNIGIYPDGRFSYQLTIYIFRYGQGTEMAIMHPDNP
Mvan_0270|M.vanbaalenii_PYR-1 PLNAILAAADMGLANNIGIYPDGRFSYQLTIYIFRYGEGTVMAIMHPDNP
MLBr_01230|M.leprae_Br4923 PLSVLLTAGLDGLN--IGVYSDGRYSYQMSIYVIRVEQETAVAVMFPDNP
MAB_3147c|M.abscessus_ATCC_199 PLSQLFTN-PLLVSNPIGLYAESKSVYQLTIFISRFPTETTLMIAYPDNP
*:. : :: : : : * : .*:**
Mb3850c|M.bovis_AF2122/97 IASESVANYIAAMKSIYIRTADG---TLAILK-----PGT----------
Rv3820c|M.tuberculosis_H37Rv IASESVANYIAAMKSIYIRTADG---TLATLK-----PGT----------
MSMEG_4728|M.smegmatis_MC2_155 VARASVQEYVDAFRSVCVAVAEGRHDDVPTPR-----VNELDLRSA----
MAV_2723|M.avium_104 VARDSVERYIRAVRAMCLRVVEHGAAAVPNRRRVVAAVNASAARSTANAA
MMAR_2343|M.marinum_M IAYDSVARYLDTMKSVYLRIAEG------ARW-----DAIAQVL------
TH_2204|M.thermoresistible__bu VARASVEKYVATLRSIYVAVAEGRADRLGARR-----SGELQRQPA----
MUL_1531|M.ulcerans_Agy99 VARESVTRYVEALKSAFVRVAEGRDVMPAPSRTGSQLHLAYSRRTYEPPA
Mflv_0404|M.gilvum_PYR-GCK VAKRSVERYMQTMSSVASLVADTGHWGRVA--------------------
Mvan_0270|M.vanbaalenii_PYR-1 VAKKSVTRYMKVMKSVAGLVADSGTWGLVA--------------------
MLBr_01230|M.leprae_Br4923 EAQESVARYLETLKSVFECVAESGHWRNVA--------------------
MAB_3147c|M.abscessus_ATCC_199 VARESITRYVDLVKSTFARVCEQNDVVPAR--------------------
* *: .*: . : :
Mb3850c|M.bovis_AF2122/97 -------------------
Rv3820c|M.tuberculosis_H37Rv -------------------
MSMEG_4728|M.smegmatis_MC2_155 -------------------
MAV_2723|M.avium_104 DRTDRRLQGAGFGANPVLR
MMAR_2343|M.marinum_M -------------------
TH_2204|M.thermoresistible__bu -------------------
MUL_1531|M.ulcerans_Agy99 TVSPLTSWRTG--------
Mflv_0404|M.gilvum_PYR-GCK -------------------
Mvan_0270|M.vanbaalenii_PYR-1 -------------------
MLBr_01230|M.leprae_Br4923 -------------------
MAB_3147c|M.abscessus_ATCC_199 -------------------
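The alignment above is split into blocks, one row per sequence in each block. To compare two sequences, the blocks first have to be stitched back together per identifier. The sketch below shows one way to do that and to compute a pairwise identity over ungapped columns. The helper names `stitch_clustal` and `percent_identity` are hypothetical, and the identity definition used here (matches divided by columns where neither sequence has a gap) is an assumption that will generally not reproduce the BLAST identities in the tables.

```python
import re

# A sequence row is "<identifier> <residues>", where residues are letters or gaps.
RESIDUES = re.compile(r"^[A-Za-z\-]+$")


def stitch_clustal(lines):
    """Concatenate the per-block rows of a CLUSTAL alignment by identifier.

    The header line and the conservation lines (made of '*', ':', '.')
    are skipped because they do not match the two-token row shape.
    """
    seqs = {}
    for line in lines:
        parts = line.split()
        if len(parts) == 2 and RESIDUES.match(parts[1]):
            seqs[parts[0]] = seqs.get(parts[0], "") + parts[1]
    return seqs


def percent_identity(a, b):
    """Percent of identical residues over columns where neither row has a gap."""
    pairs = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not pairs:
        return 0.0
    matches = sum(1 for x, y in pairs if x == y)
    return 100.0 * matches / len(pairs)


# Usage on the first block of two rows from the alignment above.
block = [
    "CLUSTAL 2.0.9 multiple sequence alignment",
    "Mb3850c|M.bovis_AF2122/97 -------MFSITTLRDWTPDPGSIICWHASPTAKAKARQAPISEVPPSYQ",
    "MSMEG_4728|M.smegmatis_MC2_155 -------MFELTDIQNWDDAPGKHVSWGPSPSTVAKVAEAPVSDVPASYQ",
]
seqs = stitch_clustal(block)
identity = percent_identity(
    seqs["Mb3850c|M.bovis_AF2122/97"],
    seqs["MSMEG_4728|M.smegmatis_MC2_155"],
)
print(f"{identity:.2f}% identity over the first block")
```

Passing every line of the alignment instead of a single block gives the full-length sequences, since rows with the same identifier are appended in order.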