For questions or suggestions e-mail us at: ioerger@cs.tamu.edu

M. tuberculosis H37Rv Rv3820c (papA2)

annotation: POSSIBLE CONSERVED POLYKETIDE SYNTHASE ASSOCIATED PROTEIN PAPA2
coordinates: 4284419 - 4285825
length: 468
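
The coordinates and the protein length above can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the span includes the stop codon; neither is stated on this page, and the function name is hypothetical.

```python
def protein_length_from_coords(start: int, end: int) -> int:
    """Return the protein length in residues implied by gene coordinates.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the span includes the stop codon.
    """
    span_nt = end - start + 1          # inclusive nucleotide span
    if span_nt % 3 != 0:
        raise ValueError("span is not a whole number of codons")
    codons = span_nt // 3
    return codons - 1                  # drop the stop codon


# Coordinates from the record above: 1407 nt, 469 codons, 468 residues.
print(protein_length_from_coords(4284419, 4285825))
```

Under these assumptions the result agrees with the listed length of 468.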

MFSITTLRDWTPDPGSIICWHASPTAKAKARQAPISEVPPSYQQAQHLRRYRDHVARGLDMSRLMIFTWDL
PGRCNIRAMNYAINAHLRRHDTYHSWFEFDNAEHIVRHTIADPADIEVVQAEHQNMTSAELRHHIATPQP
LQWDCFLFGIIQSDDHFTFYASIAHLCVDPMIVGVLFIEIHMMYSALVGGDPPIELPPAGRYDDHCVRQY
ADTAALTLDSARVRRWVEFAANNDGTLPHFPLPLGDLSVPHTGKLLTETLMDEQQGERFEAACVAAGARF
SGGVFACAALAERELTNCETFDVVTTTDTRRTPTELRTTGWFTGLVPITVPVASGLFDSAARVAQISFDS
GKDLATVPFDRVLELARPETGLRPPRPGNFVMSFLDASIAPLSTVANSDLNFRIYDEGRVSHQVSMWVNR
YQHQTTVTVLFPDNPIASESVANYIAAMKSIYIRTADGTLATLKPGT*

Paralogs
species                        id          gene   e-value  identity (len)  annotation
M. tuberculosis H37Rv          Rv3820c     papA2  -        100% (468)      POSSIBLE CONSERVED POLYKETIDE SYNTHASE ASSOCIATED PROTEIN PAPA2
M. tuberculosis H37Rv          Rv1182      papA3  e-143    53.91% (460)    polyketide synthase associated protein PapA3
M. tuberculosis H37Rv          Rv3824c     papA1  e-138    53.68% (462)    polyketide synthase associated protein

Closest Orthologs (e-value cutoff: 1e-4)
species                        id          gene   e-value  identity (len)  annotation
M. bovis AF2122/97             Mb3850c     papA2  0.0      99.79% (468)    polyketide synthase associated protein PapA2
M. gilvum PYR-GCK              Mflv_0404   -      1e-115   46.32% (462)    condensation domain-containing protein
M. leprae Br4923               MLBr_01230  papA3  1e-140   53.63% (455)    PKS-associated protein, unknown function
M. abscessus ATCC 19977        MAB_3147c   -      1e-126   50.11% (455)    polyketide synthase associated protein
M. marinum M                   MMAR_2343   -      1e-141   54.57% (460)    hypothetical protein MMAR_2343
M. avium 104                   MAV_1762    -      1e-130   49.78% (462)    condensation domain-containing protein
M. smegmatis MC2 155           MSMEG_4728  -      1e-141   53.56% (463)    condensation domain-containing protein
M. thermoresistible (build 8)  TH_2204     papA1  1e-151   56.62% (461)    PROBABLE CONSERVED POLYKETIDE SYNTHASE ASSOCIATED PROTEIN
M. ulcerans Agy99              MUL_1531    papA3  1e-126   49.24% (459)    polyketide synthase associated protein PapA3
M. vanbaalenii PYR-1           Mvan_0270   -      1e-116   46.32% (462)    condensation domain-containing protein
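
The e-value cutoff named in the heading can be illustrated with a short filter over the hits listed above. This is a sketch, not the database's own procedure: the function names are hypothetical, the cutoff is treated as inclusive, and a bare exponent such as "e-143" (as used in the paralog table) is assumed to mean 1e-143.

```python
def parse_evalue(text: str) -> float:
    """Convert an e-value string to a float.

    Assumption: a bare exponent such as "e-143" stands for "1e-143".
    """
    text = text.strip()
    if text.startswith("e"):
        text = "1" + text
    return float(text)


# (id, e-value) pairs copied from the ortholog table above.
ORTHOLOGS = [
    ("Mb3850c", "0.0"),
    ("Mflv_0404", "1e-115"),
    ("MLBr_01230", "1e-140"),
    ("MAB_3147c", "1e-126"),
    ("MMAR_2343", "1e-141"),
    ("MAV_1762", "1e-130"),
    ("MSMEG_4728", "1e-141"),
    ("TH_2204", "1e-151"),
    ("MUL_1531", "1e-126"),
    ("Mvan_0270", "1e-116"),
]


def filter_hits(hits, cutoff=1e-4):
    """Keep hits with e-value <= cutoff, best (smallest) e-value first."""
    parsed = [(name, parse_evalue(evalue)) for name, evalue in hits]
    kept = [(name, evalue) for name, evalue in parsed if evalue <= cutoff]
    return sorted(kept, key=lambda hit: hit[1])


for name, evalue in filter_hits(ORTHOLOGS):
    print(f"{name}\t{evalue:g}")
```

Every listed hit falls well below 1e-4, so the filter keeps all ten rows and only changes their order.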

CLUSTAL 2.0.9 multiple sequence alignment


Rv3820c|M.tuberculosis_H37Rv        -------MFSITTLRDWTPDPGSIICWHASPTAKAKARQAPISEVPPSYQ
Mb3850c|M.bovis_AF2122/97           -------MFSITTLRDWTPDPGSIICWHASPTAKAKARQAPISEVPPSYQ
MSMEG_4728|M.smegmatis_MC2_155      -------MFELTDIQNWDDAPGKHVSWGPSPSTVAKVAEAPVSDVPASYQ
MMAR_2343|M.marinum_M               MKLVVLGNVGLETVSNWVPAPGLVWRWQPSSATLEKVRQAPVSAVPPSHI
TH_2204|M.thermoresistible__bu      --MVALGNVAVETVREWRPDPGRVVSWDPSPAALEKARQAPVSSVPVSYM
MUL_1531|M.ulcerans_Agy99           --MVRVGKVEAGTISDWHPEPGTLVSWQPSRGSLAKAAKAPISPVPPSYM
Mflv_0404|M.gilvum_PYR-GCK          ---MRIGKITVGALEEWSLSPGKVISWHPTAASIEKARQAPVSSVPVSYM
Mvan_0270|M.vanbaalenii_PYR-1       ---MRIGKITVGALDEWSLNPGSVTSWHPTAAAVETARRAQVSSVPVSYM
MAV_1762|M.avium_104                ---MRIGKITIGSLGDWTPTPGPVTSWHPTAAAAEKVRQAPASPVPVSYM
MLBr_01230|M.leprae_Br4923          ---MQVGPLTLGTLLDWAPRAGKTISWQPTPATCEKVSQAPVSSVPVAYM
MAB_3147c|M.abscessus_ATCC_199      ---MRGGPVTVSLTDKWEPTAGSVITWQPSPASYAKALEAPVSDVPPSFM
                                            .      .*   .*    * .:  :  .. .*  * ** :. 

Rv3820c|M.tuberculosis_H37Rv        QAQHLRRYRDHVARGLDMSRLMIFTWDLPGRCNIRAMNYAINAHLRRHDT
Mb3850c|M.bovis_AF2122/97           QAQHLRRYRDHVARGLDMSRLMIFTWDLPGRCNIRAMNYAINAHLRRHDT
MSMEG_4728|M.smegmatis_MC2_155      QAQHLRAYREHTARGVPMARLTVPVWNMDSQCDMRAMSHVINAYLRRHDT
MMAR_2343|M.marinum_M               QARHLRGFAEQTARGHDMSRLVVAAMDIPGECDIRAMTYVINSHLRRHDT
TH_2204|M.thermoresistible__bu      QAQHLRGFVEHAARGQEMSRLCIAAFDIPGRCDLRAMTYVINAHMRRHDT
MUL_1531|M.ulcerans_Agy99           QAHHLRNFRSYRERGLEMSRLLISSWDIPGICDIRTMTHVVNAHMRRHDT
Mflv_0404|M.gilvum_PYR-GCK          QGQHLRNYCDRTTEGLNFSRQIIASCDVAGVCDIEAMDHAVNAYLRRHDT
Mvan_0270|M.vanbaalenii_PYR-1       QGQHLRNYWERTTAGLNFSRQIIASCEVPGQCDIAAMDHAVNAYLRRHDT
MAV_1762|M.avium_104                QGQHLRNYHERTAAGLDFSRQIIATCDVPGRCDISAMNYAVNAYLRRHDT
MLBr_01230|M.leprae_Br4923          QAQHIRGYVEQKAKGLDYSRLMIVSCDQLGQCDIRAINYIVNAHLRRHDT
MAB_3147c|M.abscessus_ATCC_199      QVQHLRTYLRQAAKGLDFSRVLVFTLDMPGRCDKRAMGHVINAHLRRHDT
                                    * :*:* :      *   :*  :   :  . *:  :: : :*:::*****

Rv3820c|M.tuberculosis_H37Rv        YHSWFEFDNAEHIVRHTIADPADIEVVQAEHQNMTSA-ELRHHIA-TPQP
Mb3850c|M.bovis_AF2122/97           YHSWFEFDNAEHIVRHTIADPADIEVVQAEHQNMTSA-ELRHHIA-TPQP
MSMEG_4728|M.smegmatis_MC2_155      YHSRFEFTVDDRIVRRKLRSPRDLRFVPTDHGVQTCD-QWREHILDTPGP
MMAR_2343|M.marinum_M               YRSWFEFTESNRIARHTLTDPNDIELTPIEHGEMSPK-QWQDYILATPGP
TH_2204|M.thermoresistible__bu      YRSWFEYHDFHHIVRHTITDPADIELVPTRHGAMTPE-QWQQHLLSTPDP
MUL_1531|M.ulcerans_Agy99           FRSWFEHTEGDHFVRRTLERPNDIQFVAVEHGELTTQSQWRERLLATPSP
Mflv_0404|M.gilvum_PYR-GCK          FRSWFEHSGDGEFVRRALVDPEDIEFVPVEHGHMGVDEIHAHVV-AIPSP
Mvan_0270|M.vanbaalenii_PYR-1       FRSWFERTEEGEFLRRAIADPADIEFVPIEHGDMTVDEIFAHVV-DIPSP
MAV_1762|M.avium_104                FRSWFHHSGDGEFVRHTVSNPADIEFAPIHHGEMTAEEIRAHVV-AIPNP
MLBr_01230|M.leprae_Br4923          YRSWFEYTDEGEIIRHTLCDPADIEFVPIEHGQLSLAQIRELAV-STPDP
MAB_3147c|M.abscessus_ATCC_199      YRSWFSLDDEQNIVRRTMADPADVEFVQVKLGEMTSDEVRELVVSETPDP
                                    ::* *      .: *: :  * *:...                :   * *

Rv3820c|M.tuberculosis_H37Rv        LQWDCFLFGIIQSDDHFTFYASIAHLCVDPMIVGVLFIEIHMMYSALVGG
Mb3850c|M.bovis_AF2122/97           LQWDCFLFGIIQSDDHFTFYASIAHLCVDPMIVGVLFIEIHMMYSALVGG
MSMEG_4728|M.smegmatis_MC2_155      LQWDCFRFGIIQRTDHFTCYMSVDHVHVDATFLGLMLIEIHLMYAALVSG
MMAR_2343|M.marinum_M               LEWDCFRFAIIQRDDHFTFCVSVDHLNVDAMFISAVFWEIEAMYNTLADG
TH_2204|M.thermoresistible__bu      LQWNCFRFSIIQRSDHFTFCVCMDHVHIDAMFMGAVFMEIHMQYAALVGG
MUL_1531|M.ulcerans_Agy99           LEWGCVRFSIIQRADHFTFCVAIDHLHCDAMFVGVVFAEIHLMYLALVSG
Mflv_0404|M.gilvum_PYR-GCK          LEWGCFTFGVIQNDDHFSFFASMDHVHGDATLIGTTMMEANGMYSALSSG
Mvan_0270|M.vanbaalenii_PYR-1       LEWGCFTFGVIQHDGHFTFFASMDHVHGDATLIGTTMMEANGMYSALSGG
MAV_1762|M.avium_104                LEWGCFTFGVVQNEEYFTFFAAMDHVHGDATLIGTTMLEANGMYAAASAG
MLBr_01230|M.leprae_Br4923          LEWGCFQFGVIQAEEHFTFYASIDHVHVDAMIVGVTLMEFHMMYSALVSG
MAB_3147c|M.abscessus_ATCC_199      FRWDCFRFGIVQSSGHFTFYFSVDHLHLDATFARLLIMEILMGYKALVQG
                                    :.*.*. *.::*   :*:   .: *:  *. :    : *    * :   *

Rv3820c|M.tuberculosis_H37Rv        DPPIELPPAGRYDDHCVRQYADTAALTLDSARVRRWVEFAANNDGTLPHF
Mb3850c|M.bovis_AF2122/97           DPPIELPPAGRYDDHCVRQYADTAALTLDSARVRRWVEFAANNDGTLPHF
MSMEG_4728|M.smegmatis_MC2_155      GAPITLPPAGSYDDYCVRQRKYTSGLTLDSPEIKEWVTFLEGNNGTMPKF
MMAR_2343|M.marinum_M               GAPISLPEAGSYGEYCLRERAYTSALTLESPEVRQWIDFFESNDGALPSF
TH_2204|M.thermoresistible__bu      GAPIPL-QAGSYDDFCVRQRQYTESLTADSPEVRAWVDFAGQMGGTLPNF
MUL_1531|M.ulcerans_Agy99           GAPLRLAEPGSYDNYCNRQREHISGLTLDSPVMSKWTEFFDNNDGSLPKF
Mflv_0404|M.gilvum_PYR-GCK          GAALALPDAGSFDDFCVREREYTAELTEDSPGVRAWIEFAENNSDGFPEF
Mvan_0270|M.vanbaalenii_PYR-1       GAALTLPDAGSFDEFCVRERAHTSELTEDSPEVRAWIEFAENSGGGFPEF
MAV_1762|M.avium_104                SEPLELPDAGSFDDFCAREREYTSALTVDSPEVRAWIDFAENNNGSFPEF
MLBr_01230|M.leprae_Br4923          AGPLELPEAGSYDDFCRRQHRFTSMLTAESPQIRSWTQFAEPNEGSFPDF
MAB_3147c|M.abscessus_ATCC_199      GAPIELPPAGSYGDYCIRQHEFLSGLTPDSEPVREWTQFAENNRGSLPDF
                                      .: *  .* :.:.* *:      ** :*  :  *  *     . :* *

Rv3820c|M.tuberculosis_H37Rv        PLPLGDLSVPHTGKLLTETLMDEQQGERFEAACVAAGARFSGGVFACAAL
Mb3850c|M.bovis_AF2122/97           PLPLGDLSVPHTGKLLTETLMDEQQGERFEAACVAAGARFSGGVFACAAL
MSMEG_4728|M.smegmatis_MC2_155      PLPLGDLSVPCTGDLMTVQLLDEPQTQGFEKACVAAGSRFIGGVFAAAAL
MMAR_2343|M.marinum_M               PLPLGDLSLVDTGELLSLQLMDAGQTARFEAACTSAGARFSGGVFACAAL
TH_2204|M.thermoresistible__bu      PLPLGDRTKPWPGELMVVKLLDGRQTDRFEAACVSAGARFVGGVFACAAL
MUL_1531|M.ulcerans_Agy99           PLPLGDTSAQC--EMMGVRLMDERQTLALEAACMSAGARFCGGVFAISAV
Mflv_0404|M.gilvum_PYR-GCK          PLPLGNPKDSTRSVMTSAVLMDTAQTERFDAAATAAGARFVGGLFACLAQ
Mvan_0270|M.vanbaalenii_PYR-1       PLPLGNPAESTRSCMTSEILMDTAQTERFESACTAAGARFVGGLFACLAQ
MAV_1762|M.avium_104                PLPLGNPSEATASAMVSELVMDAEQTERFESACTAVGVRFIGGLFACIAL
MLBr_01230|M.leprae_Br4923          PLPLGDPLEPTQADIVTITMLDEQQTDRFEAACTVAGARFVGGVLACCGL
MAB_3147c|M.abscessus_ATCC_199      PLPLGDHGVQCGTAIVTEQLLDEQQALKFESLCVDAGARFIGGVMAALGF
                                    *****:        :    ::*  *   ::  .  .* ** **::*  . 

Rv3820c|M.tuberculosis_H37Rv        AERELTNCETFDVVTTTDTRRTPTELRTTGWFTGLVPITVPVASGLFDSA
Mb3850c|M.bovis_AF2122/97           AERELTNCETFDVVTTTDTRRTPTELRTTGWFTGLVPITVPVASGLFDSA
MSMEG_4728|M.smegmatis_MC2_155      AQYQLTDIDTYHVITPTTTRGTEAEVMATGWFTGTVPITVPVGS-SFAET
MMAR_2343|M.marinum_M               AEYELTGSRTYYAVTPTSTRSTPAEFMTTGWFVGHIPFTVAVA-SSFDET
TH_2204|M.thermoresistible__bu      TENELAGVETFHAITPTDTRSTPADFMTTGWFTGMVPITVPAAGVSFGEA
MUL_1531|M.ulcerans_Agy99           VQHELTGADEYYAIVPIDIRRTEEDFMTTGWFTGFVPITVPTVGSSFGEI
Mflv_0404|M.gilvum_PYR-GCK          VEHELTGALTYYGLTPRDSRSGSDNFMTQGWFTGLIPITVPIGAASFGEA
Mvan_0270|M.vanbaalenii_PYR-1       VEHELTGALTYYGLTPRDSRSASDNFMTQGWFTGLIPITVPIGATSFADA
MAV_1762|M.avium_104                VEHELTGALTYYGLTPRDTRRTTDNFMTQGWFTGLVPITVPIAAASFADA
MLBr_01230|M.leprae_Br4923          AEYELTGADTYYGLTPRDTRRIPTDVLTQGWFTGLVPITVPIAGSSFGDA
MAB_3147c|M.abscessus_ATCC_199      AERELTGTDTYYGITPSDAREEAD-MFTTGWFTGLVPITAPVDG-TFAAA
                                    .: :*:.   :  :..   *     . : ***.* :*:*..     *   

Rv3820c|M.tuberculosis_H37Rv        ARVAQISFDSGKDLATVPFDRVLELARPETGLRPPRPGNFVMSFLDASIA
Mb3850c|M.bovis_AF2122/97           ARVAQISFDSGKDLATVPFDRVLELARPETGLRPPRPGNFVMSFLDASIA
MSMEG_4728|M.smegmatis_MC2_155      ARTAQRSFDSGLYLAHVPFDRVLELGATERGLRAPDPGVPMVSYLDATAA
MMAR_2343|M.marinum_M               VRGAQANFDASAHLANVPFERVLELAPWLK-KPPPRGGFPMVSFLDGGVP
TH_2204|M.thermoresistible__bu      ARAAQQSFDSGIGLGHVPYDRVLELVPSLR-RAETCS--PMMSFLDAGVP
MUL_1531|M.ulcerans_Agy99           VKAAQGSFDSGRDLAEVPLDCVMELVLWLR---EGQWGAPLLFYLDAGIP
Mflv_0404|M.gilvum_PYR-GCK          AWAAQSSFDSGLNMAKVPYYRVLELAPWLG---WPRPNFPVSNFFHGGAA
Mvan_0270|M.vanbaalenii_PYR-1       AWAAQASFDSNLFMAKVPYYRVLELAPWLS---WPRPNFPVSNFFHGGAA
MAV_1762|M.avium_104                AWTAQTSFDSGQQLAKVPYYRVLELAPWLK---WPQPNFPVSNFFHAGAA
MLBr_01230|M.leprae_Br4923          ARAAQNCFDTDVHLAEVPYDRVVELAPTVH---KPRPNFPVINFLDAGTA
MAB_3147c|M.abscessus_ATCC_199      AVAAQESFDRGRQLVNVPFYRVLELVPQLS---WPRPYHPMINFFDGGAP
                                    .  **  ** .  :  **   *:**               :  ::..  .

Rv3820c|M.tuberculosis_H37Rv        PLSTVANSDLN-----FRIYDEGRVSHQVSMWVNRYQHQTTVTVLFPDNP
Mb3850c|M.bovis_AF2122/97           PLSTVANSDLN-----FRIYDEGRVSHQVSMWVNRYQHQTTVTVLFPDNP
MSMEG_4728|M.smegmatis_MC2_155      PLSPAVVAEWNRIN--GRIFSEMGAANQVGMWVNQFGSGTWITVAFPNNP
MMAR_2343|M.marinum_M               PLSGVVAMQLDRIN--ARAFSDGRVAARVCIWVNKFQEETTVTASFPNNP
TH_2204|M.thermoresistible__bu      PLSALVASQLDGLN--ARVYGDGKVPAQLCMWVNRMDNETSVTVFFPDNP
MUL_1531|M.ulcerans_Agy99           PLSAMANSHVEGLR--ARLCHDGGMMGQIDIRVNRLEKETQLTVLFPNNP
Mflv_0404|M.gilvum_PYR-GCK          PLNAILAASEMGLADNIGIYPDGRFSYQLTIYIFRYGQGTEMAIMHPDNP
Mvan_0270|M.vanbaalenii_PYR-1       PLNAILAAADMGLANNIGIYPDGRFSYQLTIYIFRYGEGTVMAIMHPDNP
MAV_1762|M.avium_104                PLNAVLAAADLGYANNIGIYSDGRYSYQLTIYVFRYGEGTAMAMMYPDNP
MLBr_01230|M.leprae_Br4923          PLSVLLTAGLDGLN--IGVYSDGRYSYQMSIYVIRVEQETAVAVMFPDNP
MAB_3147c|M.abscessus_ATCC_199      PLSQLFTN-PLLVSNPIGLYAESKSVYQLTIFISRFPTETTLMIAYPDNP
                                    **.                  :     :: : : :    * :   .*:**

Rv3820c|M.tuberculosis_H37Rv        IASESVANYIAAMKSIYIRTADG---TLATLKPGT---------------
Mb3850c|M.bovis_AF2122/97           IASESVANYIAAMKSIYIRTADG---TLAILKPGT---------------
MSMEG_4728|M.smegmatis_MC2_155      VARASVQEYVDAFRSVCVAVAEGRHDDVPTPRVNELDLRSA---------
MMAR_2343|M.marinum_M               IAYDSVARYLDTMKSVYLRIAEG------ARWDAIAQVL-----------
TH_2204|M.thermoresistible__bu      VARASVEKYVATLRSIYVAVAEGRADRLGARRSGELQRQPA---------
MUL_1531|M.ulcerans_Agy99           VARESVTRYVEALKSAFVRVAEGRDVMPAPSRTGSQLHLAYSRRTYEPPA
Mflv_0404|M.gilvum_PYR-GCK          VAKRSVERYMQTMSSVASLVADT------GHWGRVA--------------
Mvan_0270|M.vanbaalenii_PYR-1       VAKKSVTRYMKVMKSVAGLVADS------GTWGLVA--------------
MAV_1762|M.avium_104                VAHKSVARYTETMRSVCGRVADT------GHWGRVA--------------
MLBr_01230|M.leprae_Br4923          EAQESVARYLETLKSVFECVAES------GHWRNVA--------------
MAB_3147c|M.abscessus_ATCC_199      VARESITRYVDLVKSTFARVCEQ------NDVVPAR--------------
                                     *  *: .*   . *     .:                            

Rv3820c|M.tuberculosis_H37Rv        -----------
Mb3850c|M.bovis_AF2122/97           -----------
MSMEG_4728|M.smegmatis_MC2_155      -----------
MMAR_2343|M.marinum_M               -----------
TH_2204|M.thermoresistible__bu      -----------
MUL_1531|M.ulcerans_Agy99           TVSPLTSWRTG
Mflv_0404|M.gilvum_PYR-GCK          -----------
Mvan_0270|M.vanbaalenii_PYR-1       -----------
MAV_1762|M.avium_104                -----------
MLBr_01230|M.leprae_Br4923          -----------
MAB_3147c|M.abscessus_ATCC_199      -----------
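
An alignment in this layout can be read block by block and used to count identical positions between two sequences. The sketch below is a minimal hand-written parser, not a library API; the function names are hypothetical, and the sample contains only two blocks excerpted from the alignment above. Identity is counted over columns where neither sequence has a gap, which is one of several possible conventions and need not reproduce the percentages in the ortholog table.

```python
SAMPLE = """CLUSTAL 2.0.9 multiple sequence alignment


Rv3820c|M.tuberculosis_H37Rv        PLSTVANSDLN-----FRIYDEGRVSHQVSMWVNRYQHQTTVTVLFPDNP
Mb3850c|M.bovis_AF2122/97           PLSTVANSDLN-----FRIYDEGRVSHQVSMWVNRYQHQTTVTVLFPDNP

Rv3820c|M.tuberculosis_H37Rv        IASESVANYIAAMKSIYIRTADG---TLATLKPGT---------------
Mb3850c|M.bovis_AF2122/97           IASESVANYIAAMKSIYIRTADG---TLAILKPGT---------------
"""


def parse_clustal(text: str) -> dict:
    """Concatenate the per-block chunks of each sequence, keyed by its label."""
    seqs = {}
    for line in text.splitlines():
        # Skip blank lines, the header, and conservation lines (leading space).
        if not line.strip() or line.startswith("CLUSTAL") or line[0].isspace():
            continue
        name, chunk = line.split()[:2]
        seqs[name] = seqs.get(name, "") + chunk
    return seqs


def pairwise_identity(a: str, b: str):
    """Return (matches, compared) over columns where neither row has a gap."""
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    matches = compared = 0
    for x, y in zip(a, b):
        if x == "-" or y == "-":
            continue
        compared += 1
        matches += x == y
    return matches, compared


seqs = parse_clustal(SAMPLE)
matches, compared = pairwise_identity(
    seqs["Rv3820c|M.tuberculosis_H37Rv"], seqs["Mb3850c|M.bovis_AF2122/97"]
)
print(f"{matches}/{compared} identical positions in the excerpt")
```

In this excerpt the two sequences differ at a single position, near the C-terminus (T in Rv3820c, I in Mb3850c).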