|
Advertisement | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
J. Biol. Chem., Vol. 281, Issue 45, 33825-33829, November 10, 2006
Minireview The ACT Domain: A Small Molecule Binding Domain and Its Role as a Common Regulatory Element*From the Department of Molecular Biology and Pharmacology and the Department of Medicine, Washington University School of Medicine, St. Louis, Missouri 63110
The ACT domain is a structural motif in proteins of 7080 amino acids that is one of a growing number of different intracellular small molecule binding domains that function in the control of metabolism, solute transport, and signal transduction (15). The first structure of an ACT domain was determined in 1995 with the crystal structure of Escherichia coli D-3-phosphoglycerate dehydrogenase (6), a tetrameric protein containing one ACT domain per subunit. This structure represents the archetypical ACT domain and is composed of four strands and two helices arranged in a ![]() ![]() ![]() ![]() ![]() fold as shown in Fig. 1. However, it was not recognized as a recurring motif until 1999 when Aravind and Koonin (2) proposed this based on a PSI-Blast (position-specific iterating-Blast) sequence data base search using the small subunit (IlvN) of acetolactate synthase. This search identified a diverse group of proteins that were noted to be involved in some way in amino acid and purine metabolism and were regulated by specific amino acids. They named this proposed domain the ACT domain after the first letters of three of the proteins, aspartate kinase-chorismate mutase-tyrA (prephenate dehydrogenase).
In 2001, the structure of the Lrp-like transcriptional regulator from Pyrococcus furiosis was published (7). This was the first structure of a transcription factor that contained an ACT domain. A PSI-Blast analysis of its sequence by Ettema et al. (3) revealed an additional group of proteins, including both enzymes and transcription regulators, that they proposed contained a novel type of ACT domain, which they named the RAM domain for regulator of amino acid metabolism. The Lrp-like transcriptional regulator contains the ACT domain fold ( A novel type of ACT domain-containing protein family whose members contain ACT domain repeats has also been identified by sequence analysis in Arabadopsis (8). These proteins were termed ACR proteins. There are at least 8 genes in the 5 chromosomes of Arabadopsis that belong to the "ACR" protein family. Proteins similar to the ACR family have also been identified in rice (Oryza sativa) (9). The majority of ACT domain-containing proteins appear to interact with amino acids and are involved in some aspect of regulation of amino acid metabolism (24) (Fig. 1). These include both metabolic enzymes and transcription regulators. In fact, the expression of some ACT-containing enzymes is under the control of ACT-containing transcription regulators. This has resulted in the ACT domain being referred to as "the regulatory domain in amino acid metabolism" in the SCOP (structural classification of proteins) data base. However, notable exceptions to this generality have been revealed in recent years. These include the NikR transcriptional regulator that binds nickel and functions in the regulation of intracellular nickel levels (10) and the YkoF protein (11) that binds thiamine and is thought to be involved in thiamine transport.
It is noteworthy that the 8090-amino acid long ribonucleoprotein motif of RNA binding proteins (12, 13) also possesses the same
A recent search of the "protein family data base" (Pfam) listed 3779 proteins as containing the ACT domain. These represent at least 57 different architectures (Pfam accession number PF01842), and ACT domain proteins are found in Bacteria, Archaea, and Eukaryota, including vertebrates, plants, fungi, and single cellular organisms. However, the ACT domain is not found in all organisms nor is it used consistently within species or in all proteins with like function (14).
The sequence homology among the ACT domains of proteins is not immediately obvious and went undetected until more sensitive sequence comparison methods, such as PSI-Blast, were developed. Because of the large number of alignments now available, they can only be presented here in a limited fashion. The reader is referred to the original articles (25) as well as the Pfam data base for additional information. Fig. 2 presents the sequence alignments of representative ACT domains including all of those whose structures have been determined. The structures provide the additional advantage of considering the secondary structure elements in the alignment. The amino acid residues are highlighted by residue type following closely the residue-grouping pattern originally presented by Aravind and Koonin (2) and later by Ettema et al. (3). When the pattern of secondary structure is also taken into consideration, it can be seen that sequences of the RAM domain group are not inconsistent with the original ACT domain alignment (2). To illustrate this, Fig. 2 presents four putative RAM domains highlighted first according to the ACT domain alignment of Aravind and Koonin (2) and then according to the RAM domain alignment of Ettema et al. (3). A common characteristic of the initial alignment of Aravind and Koonin (2) was the presence of a nearly invariant glycyl residue at the turn between the first
The RAM domain proteins are missing the glycyl residue in this location. This coincides with the presence of another highly conserved glycyl residue in the loop between the 2 and 3 strands. Although the RAM domain proteins share many features of the ACT domain alignment, the 2- 3 loop area does appear to be rather distinctive for the RAM grouping (Fig. 2).
To date, the structures of at least 10 ACT domain-containing proteins have been determined (Fig. 3), and five of these have been solved with bound ligand. Even with this limited number of structures, they display a remarkable diversity in how the domains associate.
The structures range from stand-alone ACT domains, as seen for the YbeD protein from E. coli (14) (Fig. 3K), to large multimeric proteins with ACT domains either isolated or associating into groups of 2, 3, or 4 (6, 7, 10, 11, 1419, 21, 22). The archetypical ACT domain association with two side-by-side domains forming an extended
E. coli threonine deaminase is particularly interesting because it contains an ACT domain followed closely, within the same chain, by a ![]() ![]() ![]() fold that together forms a seven-stranded sheet (19). Mutagenesis studies (20) indicate that the effector ligand (isoleucine) binding site is located at the interface between these two domains in a position analogous to L-serine in phosphoglycerate dehydrogenase. The rat phenylalanine hydroxylase (21) (Fig. 3G) structure is a dimer where the two ACT domains do not associate with each other. However, the protein in this case is lacking the C-terminal 24 residues that are responsible for tetramer formation. Thus, a tetrameric structure could eventually show a dimeric interaction of ACT domains similar to some of the other structures.
Proteins whose ACT domains associate with their Two of the proteins, the YkoF and the glycine cleavage regulator, possess both the face-to-face and the side-by-side domain arrangements. Both proteins consist of two tandem ACT domains in each of two chains. They form the side-by-side arrangement between ACT domains within the same polypeptide and a face-to-face arrangement across the subunit interface (Fig. 3, F and I). However, as can be seen in Fig. 3, they do not adopt the same type of overall structure.
The IlvH polypeptide also consists of two
The five structures that have been solved with ligands indicate that the ligands tend to bind at the interfaces between ACT domains. There appears to be a correlation of the ligand binding sites with specific glycine residues located in loops between the helices and sheets. L-Serine binds to the phosphoglycerate dehydrogenase ACT domains primarily at the loop between 1 and 1. This location is also where L-lysine binds in aspartate kinase and has been determined by mutagenesis to be the probable location of L-valine binding to IlvH (18). The Arabadopsis aspartate kinase ACT domain also binds S-adenosylmethionine at a second site in the 3- 2 loop that seems to act synergistically with the lysine. This is the only known case of an ACT domain binding more than one kind of ligand. These three proteins are also the only ones that exhibit the extended 8-stranded sheet structure and possess a conserved glycyl residue in the 1- 1 loop region. Threonine deaminase, which has an ACT-like side-by-side dimer, also possesses a glycine residue in this location. The sequences identified as representing RAM domains (3) contain a nearly invariant glycyl residue in the loop between the 2 and 3 strands. This is the region that mutagenesis data suggest is involved in ligand binding in the Lrp-like regulator.
ATP-phosphoribosyltransferase contains neither glycyl residue and is the only available structure where the ACT domains form a triad arrangement. In this protein, the substrate, L-histidine, appears to associate mainly with the loop between the The YkoF structure is somewhat different in that it contains two successive ACT domains on the same polypeptide chain. Two of these chains associate to form a dimer that binds four ligand molecules. The main determinants for binding each ligand appear to be contributed largely by only one of the chains because there is little apparent hydrogen bonding of the ligand to the adjacent chain as is seen in most other ACT domain proteins. However, the adjacent chain does have a glycyl residue (Gly-88) in the loop between the ACT domains that is next to where two of the ligands bind. The second two ligand binding sites are opposite the C terminus of the adjacent chains. There is no glycyl residue present here, but there is also less structural obstruction at the terminus of the chain.
The NikR transcription regulator appears to be an exception. The nickel ion binds at the loop between Thus, there are at least four different ligand binding motifs that can potentially be discerned from the five available ligand-bound structures and the PSI-Blast sequence alignments. All but one involve the accommodation of enough space for the ligand to occupy the pocket by the absence of a side chain (Gly). The other is the only instance so far of a metal ion binding to an ACT domain by direct chelation with amino acid side chains.
The mechanism by which ACT domains translate the ligand binding event to the active site of enzymes or to the protein-DNA binding site in transcriptional regulators is unclear. Some metabolic enzymes containing ACT domains are inhibited by the ligands that bind whereas others are activated. Some show K-type allosteric effects whereas others display V-type effects. The uneven distribution of ACT domain proteins among and within species suggests that the ACT domain modules may have been recruited independently by each protein. If this is the case, do the ACT domains also function differently from protein to protein or is there a common mechanism? Crystal structures of E. coli D-3-phosphoglycerate dehydrogenase (6, 23) illustrate that a major effect of serine binding is the retardation of a rotation of the substrate binding domain relative to the nucleotide binding domain. The binding of phenylalanine to phenylalanine hydroxylase, along with phosphorylation of Ser-16, appears to stabilize the structure so that the binding pocket remains open to accept the substrate (21). Histidine binding to ATP-phosphoribosyltransferase synergistically enhances the inhibition of the enzyme by AMP and ADP and is associated with the conversion of a dimeric form to a hexamer (16, 17). Stabilization of the hexamer by histidine results in the active sites being partially closed and inaccessible. Crystal structures of the NikR protein from E. coli (10) and Pyrococcus horikoshii (15) have led to the suggestion that nickel binding may serve to stabilize the structure, reducing its flexibility, and thus may promote DNA binding. If there is a common theme in the regulation of ACT domain proteins, it may be that ligand binding reduces the flexibility of the structures. However, ACT domain proteins are only beginning to be studied in detail, and there is not yet enough information available. An apparent third type of ACT domain proteins are those that consist of only the ACT domain, either as stand alone proteins or with multiple copies. Examples of these are the YbeD protein and the YkoF thiamine binding protein. One possible function, as has been postulated for YkoF, is that they may function as ligand transporters. Alternatively, they may function to sequester the ligands until their intracellular levels fall to a point that they are needed.
ACT domains are found in a wide variety of proteins and in a variety of different arrangements. The majority of these bind amino acids that function to regulate some aspect of amino acid metabolism. Most ACT domain-containing proteins have been discovered in Bacteria and Archaea, indicating that the domain appeared very early on. At least one functional example has been found in a mammalian protein, rat phenylalanine hydroxylase. Interestingly, ACT domains are not found in all organisms nor are they used consistently within species or found in all proteins possessing a common activity. In their original article, Aravind and Koonin (2) suggested that the ACT domain represents a "conserved, evolutionarily mobile module" that when fused to other proteins made them susceptible to regulation by small molecules. The large variety of ACT domain-containing proteins is consistent with this idea, but they do not appear to be used universally within a single species. The preponderance of ACT domains that bind amino acids suggests that this was its original function. However, there are obvious examples that it is evolving to bind other ligands and perhaps participate in the regulation of other processes. One should anticipate that the realm of the ACT domain as a regulatory element will continue to expand as more structures are determined and more is learned about their respective functions. What was once just a very short story is developing into a drama with many ACTs.
Note Added in ProofAfter this article appeared in "JBC Papers in Press," additional articles that have been published and that are pertinent to this review are: Schreiter et al. (Schreiter, E. R., Wang, S. C., Zamble, D. B., and Drennan, C. L. (2006) Proc. Natl. Acad. Sci. U. S. A. 103, 1367613681) that describes the structure of the NikR transcriptional regulator from E. coli complexed to its operator DNA; Dian et al. (Dian, C., Schauer, K., Kapp, U., McSweeney, S. M., Labigne, A., and Terradot, L. (2006) J. Mol. Biol. 361, 715730) that describes the structure of the NikR transcriptional regulator from H. pylori; and Kotaka et al. (Kotaka, M., Ren, J., Lockyer, M., Hawkins, A. R., and Stammers, D. K. (2006) J. Biol. Chem. 281, 3154431552) that describes the structure of E. coli aspartokinase III.
* This minireview will be reprinted in the 2006 Minireview Compendium, which will be available in January, 2007. This work was supported by National Institutes of Health Grant GM56676 (to G. A. G.). 1 To whom correspondence should be addressed: Dept. of Molecular Biology and Pharmacology, Box 8103, Washington University School of Medicine, 660 S. Euclid Ave., St. Louis, MO 63110. Tel.: 314-362-3367; Fax: 314-362-4698; E-mail: ggrant{at}wustl.edu.
I thank Dr. Peter Chivers for critical reading of the manuscript and for helpful comments.
This article has been cited by other articles:
|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Advertisement | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||