Thioesterase domains often occur integrated in or associated with peptide synthetases which are involved in the non-ribosomal synthesis of peptide antibiotics. Thioesterases are required for the addition of the last amino acid to the peptide antibiotic, thereby forming a cyclic antibiotic. Only the last domain TE thioesterase of the acv synthetase (acvs) is included in ESTHER (Polyketide synthase Polyene PKS). Human fatty acid synthase (FAS) is uniquely expressed at high levels in many tumor types. Pharmacological inhibition of FAS therefore represents an important therapeutic opportunity. The drug Orlistat, which has been approved by the US Food and Drug Administration, inhibits FAS, induces tumor cell-specific apoptosis. Families TE-16 TE-17 TE-18 in ThYme database
Polyketides serve as rich source of therapeutically relevant drug leads. The manipulation of polyketide synthases (PKSs) for generating derivatives with improved activities usually results in substantially reduced yields. Growing evidence suggests that type I PKS thioesterase (TE) domains are key bottlenecks in the biosynthesis of polyene antibiotics, such as pimaricin and amphotericin, and their unnatural derivatives. Herein, we elucidate the structure of the 26-membered macrolide-complexed TE domain from the pimaricin pathway (Pim TE), which specifies a spacious bifunnel-shaped substrate channel with a highly hydrophobic cleft proximal to the catalytic triad and a hydrophilic loop I region specific for the cyclization of amphiphilic polyene macrolide. Notably, the natural intermediate with C12-COOH is stabilized by a hydrogen-bond network, as well as by interactions between the polyene moiety and the hydrophobic cleft. Moreover, the bottleneck in processing the unnatural intermediate with C12-CH3 is attributed to the unstable and mismatched docking of the curved substrate in the channel. Aided by an in vitro assay with a fully elongated linear polyene intermediate as the substrate, multiple strategies were adopted, herein, to engineer Pim TE, including introducing H-bond donors, enhancing hydrophobic interactions, and modifying the catalytic center. Efficient TE mutations with increased substrate conversion up to 39.2% in vitro were further conducted in vivo, with a titer increase as high as 37.1% for the less toxic decarboxylated pimaricin derivatives with C12-CH3. Our work uncovers the mechanism of TE-catalyzed polyene macrolide formation and highlights TE domains as targets for PKS manipulation for titer increases in engineered unnatural polyketide derivatives.
Many enzymes catalyse reactions that proceed through covalent acyl-enzyme (ester or thioester) intermediates(1). These enzymes include serine hydrolases(2,3) (encoded by one per cent of human genes, and including serine proteases and thioesterases), cysteine proteases (including caspases), and many components of the ubiquitination machinery(4,5). Their important acyl-enzyme intermediates are unstable, commonly having half-lives of minutes to hours(6). In some cases, acyl-enzyme complexes can be stabilized using substrate analogues or active-site mutations but, although these approaches can provide valuable insight(7-10), they often result in complexes that are substantially non-native. Here we develop a strategy for incorporating 2,3-diaminopropionic acid (DAP) into recombinant proteins, via expansion of the genetic code(11). We show that replacing catalytic cysteine or serine residues of enzymes with DAP permits their first-step reaction with native substrates, allowing the efficient capture of acyl-enzyme complexes that are linked through a stable amide bond. For one of these enzymes, the thioesterase domain of valinomycin synthetase(12), we elucidate the biosynthetic pathway by which it progressively oligomerizes tetradepsipeptidyl substrates to a dodecadepsipeptidyl intermediate, which it then cyclizes to produce valinomycin. By trapping the first and last acyl-thioesterase intermediates in the catalytic cycle as DAP conjugates, we provide structural insight into how conformational changes in thioesterase domains of such nonribosomal peptide synthetases control the oligomerization and cyclization of linear substrates. The encoding of DAP will facilitate the characterization of diverse acyl-enzyme complexes, and may be extended to capturing the native substrates of transiently acylated proteins of unknown function.
        
Title: Polyketide synthase and non-ribosomal peptide synthetase thioesterase selectivity: logic gate or a victim of fate? Horsman ME, Hari TP, Boddy CN Ref: Nat Prod Rep, 33:183, 2016 : PubMed
Covering: 1999 to 2014Type 1, alpha/beta hydrolase-like thioesterase (TE) domains are essential offloading enzymes, releasing covalently bound products from fatty acid, polyketide, and non-ribosomal peptide biosynthetic complexes. The release step can occur by attack of an exogenous nucleophile effecting hydrolysis or transesterification or by an intramolecular O-, N-, or C-nucleophile, effecting macrolactonization, macrolactamization or Claisen-like condensation of the product. Thus in addition to ensuring turnover of the pathway, TEs provide access to increased chemical diversity. We review the diversity, structure, and mechanism of PKS and NRPS TEs and discuss recent works that highlight the role of TEs as potential arbitrators in offloading. In particular, we examine cases where TEs act as logic gates that ask a particular question about the substrate and use this information to determine the substrate's fate. As the TE mechanism occurs via two steps, we analyze both the loading and release steps independently as logic gates. The use of logic gates provides an important perspective when evaluating the evolution of TEs within a pathway, as well as highlighting work towards the goal of predicting TE function in unknown and engineered pathways.
Polyketides serve as rich source of therapeutically relevant drug leads. The manipulation of polyketide synthases (PKSs) for generating derivatives with improved activities usually results in substantially reduced yields. Growing evidence suggests that type I PKS thioesterase (TE) domains are key bottlenecks in the biosynthesis of polyene antibiotics, such as pimaricin and amphotericin, and their unnatural derivatives. Herein, we elucidate the structure of the 26-membered macrolide-complexed TE domain from the pimaricin pathway (Pim TE), which specifies a spacious bifunnel-shaped substrate channel with a highly hydrophobic cleft proximal to the catalytic triad and a hydrophilic loop I region specific for the cyclization of amphiphilic polyene macrolide. Notably, the natural intermediate with C12-COOH is stabilized by a hydrogen-bond network, as well as by interactions between the polyene moiety and the hydrophobic cleft. Moreover, the bottleneck in processing the unnatural intermediate with C12-CH3 is attributed to the unstable and mismatched docking of the curved substrate in the channel. Aided by an in vitro assay with a fully elongated linear polyene intermediate as the substrate, multiple strategies were adopted, herein, to engineer Pim TE, including introducing H-bond donors, enhancing hydrophobic interactions, and modifying the catalytic center. Efficient TE mutations with increased substrate conversion up to 39.2% in vitro were further conducted in vivo, with a titer increase as high as 37.1% for the less toxic decarboxylated pimaricin derivatives with C12-CH3. Our work uncovers the mechanism of TE-catalyzed polyene macrolide formation and highlights TE domains as targets for PKS manipulation for titer increases in engineered unnatural polyketide derivatives.
Thioesterases are enzymes that hydrolyze thioester bonds in numerous biochemical pathways, for example in fatty acid synthesis. This work reports known functions, structures, and mechanisms of updated thioesterase enzyme families, which are classified into 35 families based on sequence similarity. Each thioesterase family is based on at least one experimentally characterized enzyme, and most families have enzymes that have been crystallized and their tertiary structure resolved. Classifying thioesterases into families allows to predict tertiary structures and infer catalytic residues and mechanisms of all sequences in a family, which is particularly useful because the majority of known protein sequence have no experimental characterization. Phylogenetic analysis of experimentally characterized thioesterases that have structures with the two main structural folds reveal convergent and divergent evolution. Based on tertiary structure superimposition, catalytic residues are predicted.
Many enzymes catalyse reactions that proceed through covalent acyl-enzyme (ester or thioester) intermediates(1). These enzymes include serine hydrolases(2,3) (encoded by one per cent of human genes, and including serine proteases and thioesterases), cysteine proteases (including caspases), and many components of the ubiquitination machinery(4,5). Their important acyl-enzyme intermediates are unstable, commonly having half-lives of minutes to hours(6). In some cases, acyl-enzyme complexes can be stabilized using substrate analogues or active-site mutations but, although these approaches can provide valuable insight(7-10), they often result in complexes that are substantially non-native. Here we develop a strategy for incorporating 2,3-diaminopropionic acid (DAP) into recombinant proteins, via expansion of the genetic code(11). We show that replacing catalytic cysteine or serine residues of enzymes with DAP permits their first-step reaction with native substrates, allowing the efficient capture of acyl-enzyme complexes that are linked through a stable amide bond. For one of these enzymes, the thioesterase domain of valinomycin synthetase(12), we elucidate the biosynthetic pathway by which it progressively oligomerizes tetradepsipeptidyl substrates to a dodecadepsipeptidyl intermediate, which it then cyclizes to produce valinomycin. By trapping the first and last acyl-thioesterase intermediates in the catalytic cycle as DAP conjugates, we provide structural insight into how conformational changes in thioesterase domains of such nonribosomal peptide synthetases control the oligomerization and cyclization of linear substrates. The encoding of DAP will facilitate the characterization of diverse acyl-enzyme complexes, and may be extended to capturing the native substrates of transiently acylated proteins of unknown function.
Type I polyketide synthases (PKSs) are giant multidomain proteins that synthesize many therapeutics and other natural products. The synthesis proceeds by a thiotemplate mechanism whereby intermediates are covalently attached to the PKS. The release of the final polyketide is catalyzed by the terminal thioesterase (TE) domain through hydrolysis, transesterification, or macrocyclization. The PKS 6-deoxyerythronolide B synthase (DEBS) produces the 14-membered macrolide core of the clinically important antibiotic erythromycin. The TE domain of DEBS (DEBS TE) has well-established, empirically-defined specificities for hydrolysis or macrocyclization of native and modified substrates. We present efforts towards understanding the structural basis for the specificity of the thioesterase reaction in DEBS TE using a set of novel diphenyl alkylphosphonates, which mimic substrates that are specifically cyclized or hydrolyzed by DEBS TE. We have determined structures of a new construct of DEBS TE alone at 1.7A, and DEBS TE bound with a simple allylphosphonate at 2.1A resolution. Other, more complex diphenyl alkylphosphonates inhibit DEBS TE, but we were unable to visualize these faithful cyclization analogs in complex with DEBS TE. This work represents a first step towards using DEBS TE complexed with sophisticated substrate analogs to decipher the specificity determinants in this important reaction.
Many important natural products are produced by multidomain non-ribosomal peptide synthetases (NRPSs). During synthesis, intermediates are covalently bound to integrated carrier domains and transported to neighbouring catalytic domains in an assembly line fashion. Understanding the structural basis for catalysis with non-ribosomal peptide synthetases will facilitate bioengineering to create novel products. Here we describe the structures of two different holo-non-ribosomal peptide synthetase modules, each revealing a distinct step in the catalytic cycle. One structure depicts the carrier domain cofactor bound to the peptide bond-forming condensation domain, whereas a second structure captures the installation of the amino acid onto the cofactor within the adenylation domain. These structures demonstrate that a conformational change within the adenylation domain guides transfer of intermediates between domains. Furthermore, one structure shows that the condensation and adenylation domains simultaneously adopt their catalytic conformations, increasing the overall efficiency in a revised structural cycle. These structures and the single-particle electron microscopy analysis demonstrate a highly dynamic domain architecture and provide the foundation for understanding the structural mechanisms that could enable engineering of novel non-ribosomal peptide synthetases.
        
Title: Polyketide synthase and non-ribosomal peptide synthetase thioesterase selectivity: logic gate or a victim of fate? Horsman ME, Hari TP, Boddy CN Ref: Nat Prod Rep, 33:183, 2016 : PubMed
Covering: 1999 to 2014Type 1, alpha/beta hydrolase-like thioesterase (TE) domains are essential offloading enzymes, releasing covalently bound products from fatty acid, polyketide, and non-ribosomal peptide biosynthetic complexes. The release step can occur by attack of an exogenous nucleophile effecting hydrolysis or transesterification or by an intramolecular O-, N-, or C-nucleophile, effecting macrolactonization, macrolactamization or Claisen-like condensation of the product. Thus in addition to ensuring turnover of the pathway, TEs provide access to increased chemical diversity. We review the diversity, structure, and mechanism of PKS and NRPS TEs and discuss recent works that highlight the role of TEs as potential arbitrators in offloading. In particular, we examine cases where TEs act as logic gates that ask a particular question about the substrate and use this information to determine the substrate's fate. As the TE mechanism occurs via two steps, we analyze both the loading and release steps independently as logic gates. The use of logic gates provides an important perspective when evaluating the evolution of TEs within a pathway, as well as highlighting work towards the goal of predicting TE function in unknown and engineered pathways.
        
Title: Structures of a Nonribosomal Peptide Synthetase Module Bound to MbtH-like Proteins Support a Highly Dynamic Domain Architecture Miller BR, Drake EJ, Shi C, Aldrich CC, Gulick AM Ref: Journal of Biological Chemistry, 291:22559, 2016 : PubMed
Nonribosomal peptide synthetases (NRPSs) produce a wide variety of peptide natural products. During synthesis, the multidomain NRPSs act as an assembly line, passing the growing product from one module to the next. Each module generally consists of an integrated peptidyl carrier protein, an amino acid-loading adenylation domain, and a condensation domain that catalyzes peptide bond formation. Some adenylation domains interact with small partner proteins called MbtH-like proteins (MLPs) that enhance solubility or activity. A structure of an MLP bound to an adenylation domain has been previously reported using a truncated adenylation domain, precluding any insight that might be derived from understanding the influence of the MLP on the intact adenylation domain or on the dynamics of the entire NRPS module. Here, we present the structures of the full-length NRPS EntF bound to the MLPs from Escherichia coli and Pseudomonas aeruginosa These new structures, along with biochemical and bioinformatics support, further elaborate the residues that define the MLP-adenylation domain interface. Additionally, the structures highlight the dynamic behavior of NRPS modules, including the module core formed by the adenylation and condensation domains as well as the orientation of the mobile thioesterase domain.
The type I fatty acid synthase (FASN) is responsible for the de novo synthesis of palmitate. Chain length selection and release is performed by the C-terminal thioesterase domain (TE1). FASN expression is up-regulated in cancer, and its activity levels are controlled by gene dosage and transcriptional and post-translational mechanisms. In addition, the chain length of fatty acids produced by FASN is controlled by a type II thioesterase called TE2 (E.C. 3.1.2.14). TE2 has been implicated in breast cancer and generates a broad lipid distribution within milk. The molecular basis for the ability of the TE2 to compete with TE1 for the acyl chain attached to the acyl carrier protein (ACP) domain of FASN is unknown. Herein, we show that human TE1 efficiently hydrolyzes acyl-CoA substrate mimetics. In contrast, TE2 prefers an engineered human acyl-ACP substrate and readily releases short chain fatty acids from full-length FASN during turnover. The 2.8 A crystal structure of TE2 reveals a novel capping domain insert within the alpha/beta hydrolase core. This domain is reminiscent of capping domains of type II thioesterases involved in polyketide synthesis. The structure also reveals that the capping domain had collapsed onto the active site containing the Ser-101-His-237-Asp-212 catalytic triad. This observation suggests that the capping domain opens to enable the ACP domain to dock and to place the acyl chain and 4'-phosphopantetheinyl-linker arm correctly for catalysis. Thus, the ability of TE2 to prematurely release fatty acids from FASN parallels the role of editing thioesterases involved in polyketide and non-ribosomal peptide synthase synthases.
Hydrogen exchange (HX) studies have provided critical insight into our understanding of protein folding, structure, and dynamics. More recently, hydrogen exchange mass spectrometry (HX-MS) has become a widely applicable tool for HX studies. The interpretation of the wealth of data generated by HX-MS experiments as well as other HX methods would greatly benefit from the availability of exchange predictions derived from structures or models for comparison with experiment. Most reported computational HX modeling studies have employed solvent-accessible-surface-area based metrics in attempts to interpret HX data on the basis of structures or models. In this study, a computational HX-MS prediction method based on classification of the amide hydrogen bonding modes mimicking the local unfolding model is demonstrated. Analysis of the NH bonding configurations from molecular dynamics (MD) simulation snapshots is used to determine partitioning over bonded and nonbonded NH states and is directly mapped into a protection factor (PF) using a logistics growth function. Predicted PFs are then used for calculating deuteration values of peptides and compared with experimental data. Hydrogen exchange MS data for fatty acid synthase thioesterase (FAS-TE) collected for a range of pHs and temperatures was used for detailed evaluation of the approach. High correlation between prediction and experiment for observable fragment peptides is observed in the FAS-TE and additional benchmarking systems that included various apo/holo proteins for which literature data were available. In addition, it is shown that HX modeling can improve experimental resolution through decomposition of in-exchange curves into rate classes, which correlate with prediction from MD. Successful rate class decompositions provide further evidence that the presented approach captures the underlying physical processes correctly at the single residue level. This assessment is further strengthened in a comparison of residue resolved protection factor predictions for staphylococcal nuclease with NMR data, which was also used to compare prediction performance with other algorithms described in the literature. The demonstrated transferable and scalable MD based HX prediction approach adds significantly to the available tools for HX-MS data interpretation based on available structures and models.
The ThYme (Thioester-active enzYme; http://www.enzyme.cbirc.iastate.edu) database has been constructed to bring together amino acid sequences and 3D (tertiary) structures of all the enzymes constituting the fatty acid synthesis and polyketide synthesis cycles. These enzymes are active on thioester-containing substrates, specifically those that are parts of the acyl-CoA synthase, acyl-CoA carboxylase, acyl transferase, ketoacyl synthase, ketoacyl reductase, hydroxyacyl dehydratase, enoyl reductase and thioesterase enzyme groups. These groups have been classified into families, members of which are similar in sequences, tertiary structures and catalytic mechanisms, implying common protein ancestry. ThYme is continually updated as sequences and tertiary structures become available.
        
Title: Structural basis for phosphopantetheinyl carrier domain interactions in the terminal module of nonribosomal peptide synthetases. Liu Y, Zheng T, Bruner SD Ref: Chemical Biology, 18:1482, 2011 : PubMed
Phosphopantetheine-modified carrier domains play a central role in the template-directed, biosynthesis of several classes of primary and secondary metabolites. Fatty acids, polyketides, and nonribosomal peptides are constructed on multidomain enzyme assemblies using phosphopantetheinyl thioester-linked carrier domains to traffic and activate building blocks. The carrier domain is a dynamic component of the process, shuttling pathway intermediates to sequential enzyme active sites. Here, we report an approach to structurally fix carrier domain/enzyme constructs suitable for X-ray crystallographic analysis. The structure of a two-domain construct of Escherichia coli EntF was determined with a conjugated phosphopantetheinyl-based inhibitor. The didomain structure is locked in an active orientation relevant to the chemistry of nonribosomal peptide biosynthesis. This structure provides details into the interaction of phosphopantetheine arm with the carrier domain and the active site of the thioesterase domain.
Prodiginines are a class of red-pigmented natural products with immunosuppressant, anticancer, and antimalarial activities. Recent studies on prodiginine biosynthesis in Streptomyces coelicolor have elucidated the function of many enzymes within the pathway. However, the function of RedJ, which was predicted to be an editing thioesterase based on sequence similarity, is unknown. We report here the genetic, biochemical, and structural characterization of the redJ gene product. Deletion of redJ in S. coelicolor leads to a 75% decrease in prodiginine production, demonstrating its importance for prodiginine biosynthesis. RedJ exhibits thioesterase activity with selectivity for substrates having long acyl chains and lacking a -carboxyl substituent. The thioesterase has 1000-fold greater catalytic efficiency with substrates linked to an acyl carrier protein (ACP) than with the corresponding CoA thioester substrates. Also, RedJ strongly discriminates against the streptomycete ACP of fatty acid biosynthesis in preference to RedQ, an ACP of the prodiginine pathway. The 2.12 resolution crystal structure of RedJ provides insights into the molecular basis for the observed substrate selectivity. A hydrophobic pocket in the active site chamber is positioned to bind long acyl chains, as suggested by a long-chain ligand from the crystallization solution bound in this pocket. The accessibility of the active site is controlled by the position of a highly flexible entrance flap. These data combined with previous studies of prodiginine biosynthesis in S. coelicolor support a novel role for RedJ in facilitating transfer of a dodecanoyl chain from one acyl carrier protein to another en route to the key biosynthetic intermediate 2-undecylpyrrole.
        
Title: Crystal structure of FAS thioesterase domain with polyunsaturated fatty acyl adduct and inhibition by dihomo-gamma-linolenic acid Zhang W, Chakravarty B, Zheng F, Gu Z, Wu H, Mao J, Wakil SJ, Quiocho FA Ref: Proc Natl Acad Sci U S A, 108:15757, 2011 : PubMed
Human fatty acid synthase (hFAS) is a homodimeric multidomain enzyme that catalyzes a series of reactions leading to the de novo biosynthesis of long-chain fatty acids, mainly palmitate. The carboxy-terminal thioesterase (TE) domain determines the length of the fatty acyl chain and its ultimate release by hydrolysis. Because of the upregulation of hFAS in a variety of cancers, it is a target for antiproliferative agent development. Dietary long-chain polyunsaturated fatty acids (PUFAs) have been known to confer beneficial effects on many diseases and health conditions, including cancers, inflammations, diabetes, and heart diseases, but the precise molecular mechanisms involved have not been elucidated. We report the 1.48 A crystal structure of the hFAS TE domain covalently modified and inactivated by methyl gamma-linolenylfluorophosphonate. Whereas the structure confirmed the phosphorylation by the phosphonate head group of the active site serine, it also unexpectedly revealed the binding of the 18-carbon polyunsaturated gamma-linolenyl tail in a long groove-tunnel site, which itself is formed mainly by the emergence of an alpha helix (the "helix flap"). We then found inhibition of the TE domain activity by the PUFA dihomo-gamma-linolenic acid; gamma- and alpha-linolenic acids, two popular dietary PUFAs, were less effective. Dihomo-gamma-linolenic acid also inhibited fatty acid biosynthesis in 3T3-L1 preadipocytes and selective human breast cancer cell lines, including SKBR3 and MDAMB231. In addition to revealing a novel mechanism for the molecular recognition of a polyunsaturated fatty acyl chain, our results offer a new framework for developing potent FAS inhibitors as therapeutics against cancers and other diseases.
        
Title: Thioesterases: a new perspective based on their primary and tertiary structures. Cantu DC, Chen Y, Reilly PJ Ref: Protein Science, 19:1281, 2010 : PubMed
Thioesterases (TEs) are classified into EC 3.1.2.1 through EC 3.1.2.27 based on their activities on different substrates, with many remaining unclassified (EC 3.1.2.-). Analysis of primary and tertiary structures of known TEs casts a new light on this enzyme group. We used strong primary sequence conservation based on experimentally proved proteins as the main criterion, followed by verification with tertiary structure superpositions, mechanisms, and catalytic residue positions, to accurately define TE families. At present, TEs fall into 23 families almost completely unrelated to each other by primary structure. It is assumed that all members of the same family have essentially the same tertiary structure; however, TEs in different families can have markedly different folds and mechanisms. Conversely, the latter sometimes have very similar tertiary structures and catalytic mechanisms despite being only slightly or not at all related by primary structure, indicating that they have common distant ancestors and can be grouped into clans. At present, four clans encompass 12 TE families. The new constantly updated ThYme (Thioester-active enzYmes) database contains TE primary and tertiary structures, classified into families and clans that are different from those currently found in the literature or in other databases. We review all types of TEs, including those cleaving CoA, ACP, glutathione, and other protein molecules, and we discuss their structures, functions, and mechanisms.
Polyketide natural products possess diverse architectures and biological functions and share a subset of biosynthetic steps with fatty acid synthesis. The final transformation catalyzed by both polyketide synthases (PKSs) and fatty acid synthases is most often carried out by a thioesterase (TE). The synthetic versatility of TE domains in fungal nonreducing, iterative PKSs (NR-PKSs) has been shown to extend to Claisen cyclase (CLC) chemistry by catalyzing C-C ring closure reactions as opposed to thioester hydrolysis or O-C/N-C macrocyclization observed in previously reported TE structures. Catalysis of C-C bond formation as a product release mechanism dramatically expands the synthetic potential of PKSs, but how this activity was acquired has remained a mystery. We report the biochemical and structural analyses of the TE/CLC domain in polyketide synthase A, the multidomain PKS central to the biosynthesis of aflatoxin B(1), a potent environmental carcinogen. Mutagenesis experiments confirm the predicted identity of the catalytic triad and its role in catalyzing the final Claisen-type cyclization to the aflatoxin precursor, norsolorinic acid anthrone. The 1.7 A crystal structure displays an alpha/beta-hydrolase fold in the catalytic closed form with a distinct hydrophobic substrate-binding chamber. We propose that a key rotation of the substrate side chain coupled to a protein conformational change from the open to closed form spatially governs substrate positioning and C-C cyclization. The biochemical studies, the 1.7 A crystal structure of the TE/CLC domain, and intermediate modeling afford the first mechanistic insights into this widely distributed C-C bond-forming class of TEs.
Title: Structure and functional analysis of RifR, the type II thioesterase from the rifamycin biosynthetic pathway Claxton HB, Akey DL, Silver MK, Admiraal SJ, Smith JL Ref: Journal of Biological Chemistry, 284:5021, 2009 : PubMed
Two thioesterases are commonly found in natural product biosynthetic clusters, a type I thioesterase that is responsible for removing the final product from the biosynthetic complex and a type II thioesterase that is believed to perform housekeeping functions such as removing aberrant units from carrier domains. We present the crystal structure and the kinetic analysis of RifR, a type II thioesterase from the hybrid nonribosomal peptide synthetases/polyketide synthase rifamycin biosynthetic cluster of Amycolatopsis mediterranei. Steady-state kinetics show that RifR has a preference for the hydrolysis of acyl units from the phosphopantetheinyl arm of the acyl carrier domain over the hydrolysis of acyl units from the phosphopantetheinyl arm of acyl-CoAs as well as a modest preference for the decarboxylated substrate mimics acetyl-CoA and propionyl-CoA over malonyl-CoA and methylmalonyl-CoA. Multiple RifR conformations and structural similarities to other thioesterases suggest that movement of a helical lid controls access of substrates to the active site of RifR.
PksA, which initiates biosynthesis of the environmental carcinogen aflatoxin B1, is one of the multidomain iterative polyketide synthases (IPKSs), a large, poorly understood family of biosynthetic enzymes. We found that dissection of PksA and its reconstitution from selected sets of domains allows the accumulation and characterization of advanced octaketide intermediates bound to the enzyme, permitting the reactions controlled by individual catalytic domains to be identified. A product template (PT) domain unites with the ketosynthase and thioesterase in this IPKS system to assemble precisely seven malonyl-derived building blocks to a hexanoyl starter unit and mediate a specific cyclization cascade. Because the PT domain is common among nonreducing IPKSs, these mechanistic features should prove to be general for IPKS-catalyzed production of aromatic polyketides.
Non-ribosomal peptide synthetases (NRPS) and polyketide synthases (PKS) produce numerous secondary metabolites with various therapeutic/antibiotic properties. Like fatty acid synthases (FAS), these enzymes are organized in modular assembly lines in which each module, made of conserved domains, incorporates a given monomer unit into the growing chain. Knowledge about domain or module interactions may enable reengineering of this assembly line enzymatic organization and open avenues for the design of new bioactive compounds with improved therapeutic properties. So far, little structural information has been available on how the domains interact and communicate. This may be because of inherent interdomain mobility hindering crystallization, or because crystallized molecules may not represent the active domain orientations. In solution, the large size and internal dynamics of multidomain fragments (>35 kilodaltons) make structure determination by nuclear magnetic resonance a challenge and require advanced technologies. Here we present the solution structure of the apo-thiolation-thioesterase (T-TE) di-domain fragment of the Escherichia coli enterobactin synthetase EntF NRPS subunit. In the holoenzyme, the T domain carries the growing chain tethered to a 4'-phosphopantetheine whereas the TE domain catalyses hydrolysis and cyclization of the iron chelator enterobactin. The T-TE di-domain forms a compact but dynamic structure with a well-defined domain interface; the two active sites are at a suitable distance for substrate transfer from T to TE. We observe extensive interdomain and intradomain motions for well-defined regions and show that these are modulated by interactions with proteins that participate in the biosynthesis. The T-TE interaction described here provides a model for NRPS, PKS and FAS function in general as T-TE-like di-domains typically catalyse the last step in numerous assembly-line chain-termination machineries.
Non-ribosomal peptide synthetases (NRPS) and polyketide synthases (PKS) found in bacteria, fungi and plants use two different types of thioesterases for the production of highly active biological compounds. Type I thioesterases (TEI) catalyse the release step from the assembly line of the final product where it is transported from one reaction centre to the next as a thioester linked to a 4'-phosphopantetheine (4'-PP) cofactor that is covalently attached to thiolation (T) domains. The second enzyme involved in the synthesis of these secondary metabolites, the type II thioesterase (TEII), is a crucial repair enzyme for the regeneration of functional 4'-PP cofactors of holo-T domains of NRPS and PKS systems. Mispriming of 4'-PP cofactors by acetyl- and short-chain acyl-residues interrupts the biosynthetic system. This repair reaction is very important, because roughly 80% of CoA, the precursor of the 4'-PP cofactor, is acetylated in bacteria. Here we report the three-dimensional structure of a type II thioesterase from Bacillus subtilis free and in complex with a T domain. Comparison with structures of TEI enzymes shows the basis for substrate selectivity and the different modes of interaction of TEII and TEI enzymes with T domains. Furthermore, we show that the TEII enzyme exists in several conformations of which only one is selected on interaction with its native substrate, a modified holo-T domain.
        
Title: The crystal structure of a mammalian fatty acid synthase Maier T, Leibundgut M, Ban N Ref: Science, 321:1315, 2008 : PubMed
Mammalian fatty acid synthase is a large multienzyme that catalyzes all steps of fatty acid synthesis. We have determined its crystal structure at 3.2 angstrom resolution covering five catalytic domains, whereas the flexibly tethered terminal acyl carrier protein and thioesterase domains remain unresolved. The structure reveals a complex architecture of alternating linkers and enzymatic domains. Substrate shuttling is facilitated by flexible tethering of the acyl carrier protein domain and by the limited contact between the condensing and modifying portions of the multienzyme, which are mainly connected by linkers rather than direct interaction. The structure identifies two additional nonenzymatic domains: (i) a pseudo-ketoreductase and (ii) a peripheral pseudo-methyltransferase that is probably a remnant of an ancestral methyltransferase domain maintained in some related polyketide synthases. The structural comparison of mammalian fatty acid synthase with modular polyketide synthases shows how their segmental construction allows the variation of domain composition to achieve diverse product synthesis.
Nonribosomal peptide synthetases (NRPSs) are modular multidomain enzymes that act as an assembly line to catalyze the biosynthesis of complex natural products. The crystal structure of the 144-kilodalton Bacillus subtilis termination module SrfA-C was solved at 2.6 angstrom resolution. The adenylation and condensation domains of SrfA-C associate closely to form a catalytic platform, with their active sites on the same side of the platform. The peptidyl carrier protein domain is flexibly tethered to this platform and thus can move with its substrate-loaded 4'-phosphopantetheine arm between the active site of the adenylation domain and the donor side of the condensation domain. The SrfA-C crystal structure has implications for the rational redesign of NRPSs as a means of producing novel bioactive peptides.
        
Title: Crystal structure of the thioesterase domain of human fatty acid synthase inhibited by Orlistat Pemble CWt, Johnson LC, Kridel SJ, Lowther WT Ref: Nat Struct Mol Biol, 14:704, 2007 : PubMed
Human fatty acid synthase (FAS) is uniquely expressed at high levels in many tumor types. Pharmacological inhibition of FAS therefore represents an important therapeutic opportunity. The drug Orlistat, which has been approved by the US Food and Drug Administration, inhibits FAS, induces tumor cell-specific apoptosis and inhibits the growth of prostate tumor xenografts. We determined the 2.3-A-resolution crystal structure of the thioesterase domain of FAS inhibited by Orlistat. Orlistat was captured in the active sites of two thioesterase molecules as a stable acyl-enzyme intermediate and as the hydrolyzed product. The details of these interactions reveal the molecular basis for inhibition and suggest a mechanism for acyl-chain length discrimination during the FAS catalytic cycle. Our findings provide a foundation for the development of new cancer drugs that target FAS.
Polyketides are a class of biologically active microbial and plant-derived metabolites that possess a high degree of structural and functional diversity and include many human therapeutics, among them anti-infective and anti-cancer drugs, growth promoters and anti-parasitic agents. The macrolide antibiotics, characterized by a glycoside-linked macrolactone, constitute an important class of polyketides, including erythromycin and the natural ketolide anti-infective agent pikromycin. Here we describe new mechanistic details of macrolactone ring formation catalyzed by the pikromycin polyketide synthase thioesterase domain from Streptomyces venezuelae. A pentaketide phosphonate mimic of the final pikromycin linear chain-elongation intermediate was synthesized and shown to be an active site affinity label. The crystal structures of the affinity-labeled enzyme and of a 12-membered-ring macrolactone product complex suggest a mechanism for cyclization in which a hydrophilic barrier in the enzyme and structural restraints of the substrate induce a curled conformation to direct macrolactone ring formation.
Polyketides are a diverse class of natural products having important clinical properties, including antibiotic, immunosuppressive and anticancer activities. They are biosynthesized by polyketide synthases (PKSs), which are modular, multienzyme complexes that sequentially condense simple carboxylic acid derivatives. The final reaction in many PKSs involves thioesterase-catalyzed cyclization of linear chain elongation intermediates. As the substrate in PKSs is presented by a tethered acyl carrier protein, introduction of substrate by diffusion is problematic, and no substrate-bound type I PKS domain structure has been reported so far. We describe the chemical synthesis of polyketide-based affinity labels that covalently modify the active site serine of excised pikromycin thioesterase from Streptomyces venezuelae. Crystal structures reported here of the affinity label-pikromycin thioesterase adducts provide important mechanistic insights. These results suggest that affinity labels can be valuable tools for understanding the mechanisms of individual steps within multifunctional PKSs and for directing rational engineering of PKS domains for combinatorial biosynthesis.
        
Title: The thioesterase domain of the fengycin biosynthesis cluster: a structural base for the macrocyclization of a non-ribosomal lipopeptide Samel SA, Wagner B, Marahiel MA, Essen LO Ref: Journal of Molecular Biology, 359:876, 2006 : PubMed
Many secondary metabolic peptides from bacteria and fungi are produced by non-ribosomal peptide synthetases (NRPS) where the final step of biosynthesis is often catalysed by designated thioesterase domains. Here, we report the 1.8A crystal structure of the fengycin thioesterase (FenTE) from Bacillus subtilis F29-3, which catalyses the regio- and stereoselective release and macrocyclization of the antibiotic fengycin from the NRPS template. A structure of the PMSF-inactivated FenTE domain suggests the location of the oxyanion hole and the binding site of the C-terminal residue l-Ile11 of the lipopeptide. Using a combination of docking, molecular dynamics simulations and in vitro activity assays, a model of the FenTE-fengycin complex was derived in which peptide cyclization requires strategic interactions with residues lining the active site canyon.
        
Title: Human fatty acid synthase: structure and substrate selectivity of the thioesterase domain Chakravarty B, Gu Z, Chirala SS, Wakil SJ, Quiocho FA Ref: Proc Natl Acad Sci U S A, 101:15567, 2004 : PubMed
Human fatty acid synthase is a large homodimeric multifunctional enzyme that synthesizes palmitic acid. The unique carboxyl terminal thioesterase domain of fatty acid synthase hydrolyzes the growing fatty acid chain and plays a critical role in regulating the chain length of fatty acid released. Also, the up-regulation of human fatty acid synthase in a variety of cancer makes the thioesterase a candidate target for therapeutic treatment. The 2.6-A resolution structure of human fatty acid synthase thioesterase domain reported here is comprised of two dissimilar subdomains, A and B. The smaller subdomain B is composed entirely of alpha-helices arranged in an atypical fold, whereas the A subdomain is a variation of the alpha/beta hydrolase fold. The structure revealed the presence of a hydrophobic groove with a distal pocket at the interface of the two subdomains, which constitutes the candidate substrate binding site. The length and largely hydrophobic nature of the groove and pocket are consistent with the high selectivity of the thioesterase for palmitoyl acyl substrate. The structure also set the identity of the Asp residue of the catalytic triad of Ser, His, and Asp located in subdomain A at the proximal end of the groove.
Many biologically active natural peptides are synthesized by nonribosomal peptide synthetases (NRPS). Product release is accomplished by dedicated thioesterase (TE) domains, some of which catalyze an intramolecular cyclization to form macrolactone or macrolactam cyclic peptides. The excised 28 kDa SrfTE domain, a member of the alpha/beta hydrolase enzyme family, exhibits a distinctive bowl-shaped hydrophobic cavity that hosts the acylpeptide substrate and tolerates its folding to form a cyclic structure. A substrate analog confirms the substrate binding site and suggests a mechanism for substrate acylation/deacylation. Docking of the peptidyl carrier protein domain immediately preceding SrfTE positions the 4'-phosphopantheinyl prosthetic group that transfers the nascent acyl-peptide chain to SrfTE. The structure provides a basis for understanding the mechanism of acyl-PCP substrate recognition and for the cyclization reaction that results in release of the macrolactone cyclic heptapeptide.
        
Title: Insights into channel architecture and substrate specificity from crystal structures of two macrocycle-forming thioesterases of modular polyketide synthases Tsai SC, Lu H, Cane DE, Khosla C, Stroud RM Ref: Biochemistry, 41:12598, 2002 : PubMed
Modular polyketide synthases (PKSs) synthesize the polyketide cores of pharmacologically important natural products such as erythromycin and picromycin. Understanding PKSs at high resolution could present new opportunities for chemoenzymatic synthesis of complex molecules. The crystal structures of macrocycle-forming thioesterase (TE) domains from the picromycin synthase (PICS) and 6-deoxyerythronolide B synthase (DEBS) were determined to 1.8-3.0 A with an R(crys) of 19.2-24.4%, including three structures of PICS TE (crystallized at pH 7.6, 8.0, and 8.4) and a second crystal form of DEBS TE. As predicted by the previous work on DEBS TE [Tsai, S. C., et al. (2001) Proc. Natl. Acad. Sci. U.S.A. 98, 14808-14813], PICS TE contains an open substrate channel and a hydrophobic dimer interface. Notwithstanding their similarity, the dimer interfaces and substrate channels of DEBS TE and PICS TE reveal key differences. The structural basis for the divergent substrate specificities of DEBS TE and PICS TE is analyzed. The size of the substrate channel increases with increasing pH, presumably due to electrostatic repulsion in the channel at elevated pH. Together, these structures support previous predictions that macrocycle-forming thioesterases from PKSs share the same protein fold, an open substrate channel, a similar catalytic mechanism, and a hydrophobic dimer interface. They also provide a basis for the design of enzymes capable of catalyzing regioselective macrocyclization of natural or synthetic substrates. A series of high-resolution snapshots of a protein channel at different pHs is presented alongside analysis of channel residues, which could help in the redesign of the protein channel architecture.
As the first structural elucidation of a modular polyketide synthase (PKS) domain, the crystal structure of the macrocycle-forming thioesterase (TE) domain from the 6-deoxyerythronolide B synthase (DEBS) was solved by a combination of multiple isomorphous replacement and multiwavelength anomalous dispersion and refined to an R factor of 24.1% to 2.8-A resolution. Its overall tertiary architecture belongs to the alpha/beta-hydrolase family, with two unusual features unprecedented in this family: a hydrophobic leucine-rich dimer interface and a substrate channel that passes through the entire protein. The active site triad, comprised of Asp-169, His-259, and Ser-142, is located in the middle of the substrate channel, suggesting the passage of the substrate through the protein. Modeling indicates that the active site can accommodate and orient the 6-deoxyerythronolide B precursor uniquely, while at the same time shielding the active site from external water and catalyzing cyclization by macrolactone formation. The geometry and organization of functional groups explain the observed substrate specificity of this TE and offer strategies for engineering macrocycle biosynthesis. Docking of a homology model of the upstream acyl carrier protein (ACP6) against the TE suggests that the 2-fold axis of the TE dimer may also be the axis of symmetry that determines the arrangement of domains in the entire DEBS. Sequence conservation suggests that all TEs from modular polyketide synthases have a similar fold, dimer 2-fold axis, and substrate channel geometry.