Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Identifying the Hotspots on the Top Faces of WD40-Repeat Proteins from Their Primary Sequences by β-Bulges and DHSW Tetrads

  • Xian-Hui Wu ,

    wuxh@pkusz.edu.cn (XHW); wuyd@pkusz.edu.cn (YDW)

    Affiliation Lab of Computational Chemistry and Drug Design, Laboratory of Chemical Genomics, Peking University Shenzhen Graduate School, Shenzhen, People’s Republic of China

  • Yang Wang,

    Affiliation Lab of Computational Chemistry and Drug Design, Laboratory of Chemical Genomics, Peking University Shenzhen Graduate School, Shenzhen, People’s Republic of China

  • Zhu Zhuo,

    Affiliation Lab of Computational Chemistry and Drug Design, Laboratory of Chemical Genomics, Peking University Shenzhen Graduate School, Shenzhen, People’s Republic of China

  • Fan Jiang,

    Affiliation Lab of Computational Chemistry and Drug Design, Laboratory of Chemical Genomics, Peking University Shenzhen Graduate School, Shenzhen, People’s Republic of China

  • Yun-Dong Wu

    wuxh@pkusz.edu.cn (XHW); wuyd@pkusz.edu.cn (YDW)

    Affiliations Lab of Computational Chemistry and Drug Design, Laboratory of Chemical Genomics, Peking University Shenzhen Graduate School, Shenzhen, People’s Republic of China, College of Chemistry, Peking University, Beijing, People’s Republic of China

Abstract

The analysis of 36 available crystal structures of WD40 repeat proteins reveals widespread existence of a beta-bulge formed at the beginning of strand a and the end of strand b, termed as WDb–a bulge: among a total of 259 WD40 blades, there are 243 such β-bulges. The R1 positions in these WDb–a bulges have fair distributions of Arg, His, Ile, Leu, Lys, Met, Phe, Trp, Tyr and Val residues. These residues protrude on the top face of the WD40 proteins and can serve as hotspots for protein-protein interactions. An analysis of 29 protein complexes formed by 17 WD proteins reveals that these R1 residues, along with two other residues (R1-2 and D-1), are indeed widely involved in protein-protein interactions. Interestingly, these WDb–a bulges can be easily identified by the 4-amino acid sequences of (V, L, I), R1, R2, (V, L, I), along with some other significant amino acids. Thus, the hotspots of WD40 proteins on the top face can be readily predicted based on the primary sequences of the proteins. The literature-reported mutagenesis studies for Met30, MDV1, Tup11, COP1 and SPA1, which crystal structures are not available, can be readily understood based on the feature-based method. Applying the method, the twelve potential hotspots on the top face of Tup11 from S. japonicas have been identified. Our ITC measurements confirm seven of them, Tyr382, Arg284, Tyr426, Tyr508, Leu559, Lys575 and Ile601, are essential for recognizing Fep1. The ITC measurements further convinced that the feature-based method provides accurate prediction of hotspots on the top face.

Introduction

WD40-repeat proteins belong to a subfamily of β-propeller proteins [1], [2]. It is one of the largest protein families from yeast to human, consisting about 1% total cell proteins [3]. These proteins show their biofunctions mainly by binding other proteins to form protein complexes [2], [3], [4], which have been found to play important roles in multiple cellular processes. For example, WDR5, EED, RbBP4, RbBP5 and RbBP7 are components of methyltransferases [5], [6]. A couple of WD40 proteins are core components of Cullin-Ring E3 ligases, which play important roles in protein degradation by catalyzing the polyubiqitination [7], [8], [9], [10]. Additionally, WD40 proteins are critical in forming the apoptosome [11], extracellular signal transduction [12], [13], [14], as well as components of the nuclear core [15], [16] and membrane vesicle [17].

WD40-repeat proteins typically contain 6–8 repeated units in the sequence order. These repeat units fold into 6–8 WD40 blades and form β-propeller structures. According to previous definition, every WD40 repeat folds into strand d–a–b–c [2]. However, strand d and strand a–b–c in a WD40 repeat belong to two different WD40 blades. To avoid the inconsistency between WD40 repeat and WD40 blade, we use WD40 blade as a repeated unit in following description. As shown in Figure 1a, the first strand d at the beginning and the last strand a, b and c at the end form a special WD40 blade, which encloses β-propeller structure. The other WD40 blades are formed by the sequential strand a, b, c and d. Figure 1b shows the folded β-propeller has a top, side and bottom faces. The top face is composed by the loop connecting strand b and c as well as several residues at the beginning of strand a. The bottom face contains two loops, which connects strand a and b, strand c and strand d. There are grooves between two neighbored WD40 blades, which is the side face. Three faces are able to address proteins to form protein complexes. Figure 1c gives some examples of the well-studied interactions involved by WD40 proteins, which predominantly occur on their top faces. WDR5 and EED bind H3R2 and tri-methylated H3K27, respectively, by cation-π interaction [18], [19]. FBW7 and β-TrCP, as an adaptor of SCF complex, recognize phosphorylated CycE and β-catenin, respectively, and facilitate their poly-ubiquitination [20], [21]. Gβ [22], [23] and TLE1 [24], [25] can bind multiple proteins on the top face by the hydrophobic and hydrophilic interactions. Besides forming complex with protein, DDB2 also binds DNA on its top face [26]. Gemin5 was proposed to interact with RNA on the top face as well [27]. In order to understand the interaction, the critical residues need to be identified on the top face. If these key residues could be predicted based on the primary sequence without structural information on the protein complexes, it will provide substantial help for addressing the protein interactions on the top faces of WD40 proteins.

thumbnail
Figure 1. The definition of WD40 blade and binding faces of WD40-repeat protein.

(a) In sequence order, every WD40 repeat protein is composed of strand d–a–b–c. Structurally, the first strand d and the last strand a–b–c form a four-strand anti-paralleled β-sheet, a WD40 blade, which encloses β-propeller structure. The other WD40 blades are formed by sequential strand a–b–c–d. (b) WD40 proteins have top, side and bottom faces. Top face: loops connecting strand b–c and strand d–a; Bottom face: loops connecting strand a–b and strand c–d; Side face: the grooves between two neighbored WD40 blades. (c) A majority of interactions are focused on the top faces.

https://doi.org/10.1371/journal.pone.0043005.g001

However, the currently available methods highly depend on the availability of crystal structures and accumulated experimental observations. Figure 2a shows the widely utilized method in determining the hotspots residues of WD40 proteins. This method requires two sequential steps. The first step is to determine loops connecting strand b–c and strand d–a. At the second step, hotspots are further identified from loops. There are three cases in practice. In every situation, both steps have a bit difference. In the first case, if there is a crystal structure of protein complex, the critical residues for interaction on the loop d–a and loop b–c are obvious [20], [28], [29], [30]. In the second case, if only the crystal structure of WD40 protein is available, the necessary residues for interaction at two loops are defined by comparing with the crystal structures of well-studied WD40 proteins, such as Gβ and TUP1 [31], [32], [33], [34], [35]. After that, conserved residues are chosen for experimental tests by aligning multiple homologs across different species. In the last case of no crystal structure, several well-studied WD40 proteins are acquired as templates, which crystal structures have already been determined. To obtain the loop d–a and loop b–c, the predicted WD40 blades are aligned with those in the templates. Then, the residues in two loops are selectively studied if the corresponding residues in the templates are already proved to be indispensable for protein-protein [36], [37], [38], [39].

thumbnail
Figure 2. Empirical methods for identifying hotspots on the top faces of WD40 proteins.

(a). Identifying hotspots by two sequential steps: 1. Determining loops composed of top faces; 2. Choosing the functional residues on two loops. (b). The 16th, 18th and 34th residues are repeatedly involved in the protein-protein interaction. Here, strand d–a–b–c was presented in the sequence order. (c). Three hotspots proposed by Stirnimann et al.3 (marked by red-stars) are incorrectly mapped on the common sequence pattern.

https://doi.org/10.1371/journal.pone.0043005.g002

As mentioned above, the empirical method highly depends on the crystal structures of protein complexes. In the case that WD40 proteins have no crystal structures, the empirical method can’t provide efficient resolution, such as Tup11 from S. pombe. Even though it has ∼53% sequence identity to a well-studied template TUP1, only two residues, Tyr362 and Leu542, are identified to be essential for Fep1 interaction [38]. Apparently, the interaction needs more residues. Even though WD40 protein itself has been crystallized, they still can’t provide much useful information for identifying key residues in protein-protein interaction. For example, although the resolution of CIA1 crystal structure is as high as 1.7Å, it indicates little information of interaction on the top face. By utilizing Gβ and Tup1 as templates, only one residue, Arg127, was found to be important for recognizing protein after screening six potential residues on the top face [32].

In 1996, Gaudet et al. reported the crystal structure of Gβ-phosducin complex [13]. As shown in Figure 2b, they noticed that the 16th, 18th and 34th residues of each WD40 blade in Gβ can be involved in binding Phosducin on the top face. They extrude on the top face of Gβ protein. The 16th and 18th residues are close to the beginning of strand a and the 34th residue locates right before the conserved Asp in Gβ. Later, in the case of β-TrCP1-β-catenin complex, the 16th and 18th residues are described to be at the beginning of strand a based on its crystal structure. Tyr271, Leu474 and Arg474 at the beginnings of two strand a are further proved to be indispensable for binding β-catenin by mutagenesis studies [21]. However, this rule has found few applications in the literature, probably due to the following reasons: (1) the precise positions of 16th, 18th, and 34th residues can’t be applied directly to other WD40 proteins because the lengths of WD40 blades are variable. (2) The beginning of strand a is irregular and hardly defined precisely. Recently, an analysis of peptide-WD protein complexes by Stirnimann et al. [3] revealed that such three residues are generally required for recognizing peptides on the top face by WD40 proteins. In addition, the 16th, 18th and 34th residues are pointed out on the structural elements proposed by Smith et al. [4] as shown in Figure 2c. However, the proposed 16th and 18th residues usually point inward as the 3D structure was taken into the consideration, which is unavailable in protein-protein interaction.

In this manuscript, we analyzed 36 WD40-repeat proteins, which crystal structures are available [40]. Based on the survey of the available crystallographic and mutagenesis studies, we will pursue the hypothesis that residues at specific positions are responsible for the protein-protein interactions on the top face of the WD40 domains partly based on the structural elements [4]. We will present a general prediction method for the identification of these key residues using only the sequence information. Finally, this method will be validated using the results of mutagenesis studies and ITC measurements.

Results and Discussion

β-bulges in WD40 Blades

An analysis of 36 available WD proteins with crystal structures has been carried out. Totally, there are 259 WD40 blades and 133 of them have DHSW hydrogen-bonded tetrads. As depicted in Figure 3, 243 WD40 blades have a WDb–a β-bulge, and 206 WD40 blades have a WDc–d bulge. Interestingly, all WD40 blades, which contain the DHSW tetrads, have WDb–a bulges.

thumbnail
Figure 3. The hydrogen bonds and structural features in the general WD40 blade by strand a–b–c–d.

Although the enclosed WD40 blade has the different order in sequence, their hydrogen bonds, β-bulges and DHSW tetrad are in common. the DHSW tetrad: Asp-His-Ser/Thr-Trp (in red). WDb–a β-bulge: formed in strand b and a; WDc–d β-bulges: formed in strand c and d. X, R1, R2 (in blue). Three residues, R1-2, R1 and D-1 colored in pink are highlighted here for the following discussion of interaction on the top face. R1 as a binding residue is “R1 residue” of WDb–a bulge. R1-2 locates on two residues before R1 of WDb–a. D-1 locates before Asp of DHSW tetrad.

https://doi.org/10.1371/journal.pone.0043005.g003

β-bulge is a special structure in β-sheet, which includes two residues (R1, R2) in one strand H-bonded to a single residue (X) in the opposite strand. Structurally they cause the change of direction for a β-sheet [41], [42]. They disrupt further extension of β-sheet and therefore, have been proposed to be naturally designed structure to prevent aggregation of β-sheet-rich proteins [43]. Currently available experiments suggest β-bulges play critical roles in the protein folding and maintaining the function in the ubiquitin [44], [45], E. coli dihydrofolate reductase [46], bacterial RNase [47] and mammalian defensing [48].

WDb–a Bulges have Distinct Residue Propensities

Table 1 summarizes the residue distributions (%) in WDb–a and WDc–d β-bulges. Several characters are apparent:

  1. As summarized in Figure 4a, the most significant feature of WDc–d is the dominance of bulky residues Val, Ile and Leu in the X and R1 positions, 84.6%, 53.7%, respectively. This situation is similar to that in the classic β-bulge of all proteins [41], [42].
  2. Contrast to the WDc–d, the X position of the WDb–a is featured with small residues, Gly and Ala in 38.2% and 25.2%, respectively. Together with Ser and Cys, these four types of residues compose of 82.3% of X residues.
  3. The most favorable R1 residues of the WDb–a are Asn, Ser and Thr, totalling to 37.8%. However, Arg, Glu, Gln, His, Ile, Leu, Lys, Met, Phe, Trp, Tyr and Val all have reasonable populations, totaling to about 57.6%. Thus, R1 residues are most diversified.
  4. As shown in Table 1, WDb–a has significantly increased populations of the hydrogen bond donor/acceptor residues, Asp, Asn, Cys, Ser and Thr with the comparison of WDc–d. In particular, Cys is not common in classic β-bulges, but has high populations in the X and R2 positions of the WDb–a.
thumbnail
Figure 4. The summation of WDb–a and WDc–d β–bulges.

(a) The schematic view and residue dominances of WDb–a and WDc–d bulges. (b) Two-side view of β-bulge. X, R1:α-conformation; R2: β-conformation.

https://doi.org/10.1371/journal.pone.0043005.g004

thumbnail
Table 1.The. percentages of the 20 natural amino acid residues in the X, R1 and R2 positions of WDb–a and WDc–d β-bulges.

https://doi.org/10.1371/journal.pone.0043005.t001

Figure 4 summarizes the residue preferences of WDb–a and WDc–d are dramatically different in the X, R1 and R2 positions. The side chains of X, R1 and R2 point to the same side and are capable of interaction by the hydrogen-bond or hydrophobic packing. Thus, the different residue preference of WDb–a bulge suggests it play dual roles in the function and structure. In further presentation, we will only focus on the structure and function of WDb–a bulge with its unusual residue propensity.

Understanding the Unusual Residue Propensity

Figure 5a shows the topology and WDb–a bulges of two neighboring WD40 blades. For clarity, the loops connecting strands are deleted. Figure 5b show the detailed hydrogen bond network formed between the two WD40 blades. The abundant Asp, Asn, Cys, Ser and Thr in the X (23.9%), R1 (41.2%) and R2 (63.9%) positions, of the WDb–a usually form the inter-blade hydrogen bonds. Besides forming the intro-blade hydrogen bonds in the β-bulges, two inter-blade hydrogen bonds, HB1 and HB2, are normally formed between the X+1 residue (Ser213) in the WDb–a-2 and the R1-1 residue (Val254) of WDb–a-1. However, if a hydrogen bond donor (Asn, Cys, Ser and Thr) is present at the X, R1 or R2 position of the WDb–a-2, another hydrogen bond is formed between these sidechains and the carbonyl group of the R1-1 position of WDb–a-1 (Figure S1). The rich Asp residues in the R2 positions are mainly form the salt-bridges with Arg/Lys from the strand a on the other WD40 blade (Figure S2). Therefore, all these unusual residues are able to stabilize entire protein structure by providing interblade interactions.

thumbnail
Figure 5. The interaction involved in WDb–a β-bulges between two neighbored WD40 blades.

(a) and (b) show the hydrogen-bonding interactions provided by two WDb–a of neighboring WD40 blades in 3D and 2D, respectively. Those residues composed of WDb–a bulges are shown in stick as numbered in BUB3 (1YFQ). The hydrogen bonds, HB1 and HB2, are formed inter-blade by Ser213 backbone (X+1 position) and Ser212 (X position) side chain with Val254 (R1-1) at the other WD40 blade. (c) χ2 dihedral angles of R1 residues. As χ2 is at gauche-conformation, Arg points upward. If χ2 is at anti-conformation, Arg point to WD40-1, which is unacceptable due to the crowdedness.

https://doi.org/10.1371/journal.pone.0043005.g005

Our analysis also indicates that in the WDb–a, almost half of R1 residues with large sidechains as shown in Figure 4 are in an unfavorable conformation. Figure 5c shows χ2 dihedral angle of Arg, which usually favors an anti-rotamer, is forced to adapt a gauche rotamer due to the steric crowdedness between the two adjacent WD40 blades. This unfavorable conformation ensures the sidechains to protrude upward and provide highly energetic interaction [49], [50]. Thus, R1 residues of WDb–a bulges are likely to be the hotspots if they belong to the aromatic (Phe, Try, Trp), bulky (Ile, Leu, Val) and polar (Arg, Lys, Asp, Glu, Asn and Gln) residues [49], [50], [51]. To confirm the assumption, we carried out a survey of mutagenesis studies of protein complexes, which structures are available.

R1 of WDb–a Along with R1-2 and D-1 are Three Residues Generally Required for Recognizing Proteins/peptides on the Top Face of WD40 Proteins

As shown in Figure 2b, we found that R1 in the WDb–a bulge is at the 18th residue in Gβ.15 The 16th residue locates at two residues before R1 (R1-2). Meanwhile, the 34th residue locates one residue before Asp (D-1) in the DHSW tetrad. Interestingly, their locations are highly relevant to the positions of structural features in Gβ. Besides Gβ, an analysis of reported WD40-peptides/protein complexes reveals that R1-2, R1 and D-1 are generally required for interaction. Totally, there are 29 protein complexes formed by 17 WD proteins on the top face besides five complexes formed by FBW7/CDC4/Doa1-ubiquitin [33], Ski8p-Ski3p [35] and Tup1-Metα [52], which only have the crystal structures for WD proteins. By the crystallographic studies, R1-2, R1 and D-1 are found to be extensively involved in binding proteins/peptides in these protein complexes. Furthermore, some residues are further convinced to be indispensable by the mutagenesis studies, which are highlighted by the underlines as shown in Figure 6. The detailed information of 29 complexes is listed in Table S1. Noticeably, few residues other than R1-2, R1 and D-1 are essential for the binding.

thumbnail
Figure 6. The summation of crystallographic and mutagenesis studies of hotspots on the top faces of 19 representative WD proteins.

As depicted on the top left corner, every WD40 blade is composed of four-strand anti-paralleled β-sheet implicitly. R1-2, R1 and D-1 residues are colored by blue, red and green, respectively. All listed residues are supposed to be important for protein-protein interaction on the top face. Some of them highlighted by the underlines were further convinced by mutagenesis studies.

https://doi.org/10.1371/journal.pone.0043005.g006

R1 Residue Might Play More Important Roles in Protein-protein Interaction through the Analysis of Complexes

In some protein complexes, several R1 residues are commonly required even in binding different proteins on the top face. Bub3 [28], [31], FBW7 [20], Gβ [12], [13], [14], [22], [23] and WDR5 [18], [53] have been all proved to share R1 residues to bind different sequences/proteins. On the other side, one R1 residue is enough to define its selectivity. LIS1 and TLE1 interact with different proteins with the use of different R1 residues. Both PAF-AFα2 and Nde1 have been found to bind LIS1 on the top face. Mutation of Arg238, a R1 residue, to Ala does not prevent the interaction between LIS1 and Nde1 in vitro, while it abolishes the interaction between LIS1 and PAF-AFα2 completely [30]. TLE1 can bind both WRPW motif in Hes and FxIxxIL motif in Eh1 on the top face [24]. Leu743, a R1 residue as replaced by Phe, selectively eliminates the interaction between TLE1 and WRPW motif and has little effect on TLE1-FxIxxIL interaction [54].

Expectedly, these residues form a cluster in the protein-protein interaction in a majority of cases although we can’t exclude that Arg, Ile, Leu, Lys, Met, Phe, Trp and Tyr may function as a binding hotspot individually. Charged residues presented at R1-2, R1 and D-1 of neighboring WD40 blades generate a positively-charged patch to accommodate negatively-charged groups, such as FBW7 and CDC4. Figure 7 shows FBW7 selectively interact with phosphorylated Thr80 of Cyclin E with the use of Arg465 and Arg505. The similar character can be found in CIAO and CIA as shown in Figure 6. We speculate that CIAO and CIA may bind phosphorylated proteins due to their tandem presented Arg and Lys in the R1 positions. Several residues including an aromatic sidechain, such as Phe, Tyr and Trp, can form a cluster to accommodate cationic group by the cation-π interactions. Figure 7 shows R1 residues Phe96 and Tyr147 in EED are both essential for the recognition of tri-methylated H3K27 through cation-π interaction. But, this kind of binding capability is uneasy to predict because it requires more factors than residue types. Hydrophobic residues located on R1-2, R1 and D-1 of sequential WD40 blades form a hydrophobic ring and aim to interact hydrophobic residues or patches. Figure 7 presents that Tyr532, Tyr578, Leu636, Phe661, Leu702 and Leu743 in TLE1 (PDB_ID: 1GXR) provide a hydrophobic interaction to bind WPRW. By this means, Figure 6 shows a few WD40 proteins are able to interact with hydrophobic patch, such as ARPC1, BUB3, SIF2 and Ski8p. Because R1-2, R1 and D-1 as well as organization of these residues can provide critical information for protein-protein interaction on the top face, it’s meaningful to predict these residues precisely from their primary sequences.

thumbnail
Figure 7. Three representative binding modes provided by WD40 proteins.

Binding negatively-charged group: R465 and R505 of two neighboring WD40 blades at FBW7 interact with phosphorylated Thr80 of Cyclin E. Binding positively-charged group: EED applies two R1, F97 and Y147, as well as Y365 to selectively interact with trimethyl-H3K27. Interacting hydrophobic group: Hydrophobic ring formed by a series of residues in TLE1 accommodates WRPW tetrapeptide. R1-2, R1 and D-1 residues are colored by blue, red and green, respectively.

https://doi.org/10.1371/journal.pone.0043005.g007

Identifying R1-2, R1 and D-1 for WD40 Proteins from the Primary Sequence

The structural elements of WD40 blade developed by Smith, et al. [4] are extremely helpful for understanding the topology of β-propeller and identifying the predicted surface residues. To provide a general concept, four strands in the WD40 blade share the constant starts and lengths by ignoring their variability. Therefore, the structural element is unsuitable for locating residues accurately on the primary sequence by unclear description of “at the beginning of strand a” and “the end of strand b” as shown in Figure 2c. Encouraged by the relevance of hotspots to the structural features, we developed a feature-based method to predict R1-2, R1 and D-1 from primary sequences based on β-bulges and DHSW tetrads.

Figure 8 shows the common sequence pattern of 259 WD40 blades in 36 WD proteins, which is derived from preferentially aligning the residues in the DHSW tetrad and β-bulges. It has the similar residue frequencies to that provided by Neer et al. [2] although the tiny difference can’t be excluded. Due to the structural and biofunctional requirement, the locations of residues in the DHSW [55], [56] and β-bulges are conserved on the primary sequence. The positions of R1-2, R1 and D-1 are clearly defined at the common sequence pattern on the basis of structural features.

thumbnail
Figure 8. R1-2, R1 and D-1 are predicted from WD40 blade primary sequence with the use of structural features.

The residues of WD40 blade are in the order of frequencies along the vertical line. The blue-colored residues are those in the DHSW tetrad. R1, R2 and X of β-bulges are highlighted by green color except for R1 of WDb–a. The red-colored residues, R1-2, R1 and D-1, are potential hotspots. Fa and Fb are two fingerprint sequences. η represents bulky residues: Ile, Leu and Val. χ represents Trp, Phe, Tyr, Ile, Leu and Val. ψ represents residues with small side chains: Gly, Ala, Ser, Cys, Thr. φ are potential hotspots.

https://doi.org/10.1371/journal.pone.0043005.g008

The feature-based method for identifying three residues is divided into three steps: 1. Annotation of WD40 blades. Typically, most of WD40 blades are annotated by SMART [57] or in UniProt [58]. So, we use the annotation in UniProt for consistency. 2. Determining the DHSW tetrad and two fingerprint sequences. 3. Define R1-2, R1 and D-1 by their dependence. If WD40 blade includes a DHSW tetrad, R1-2, R1 and D-1 are easily defined according to the relative position of Asp, His, Ser and Trp in the DHSW tetrad [55], [56]. If there is no DHSW tetrad, two fingerprint sequences, Fa and Fb, should be identified in advances within a WD40 blade as shown in Figure 8. In Fa, two η should be Ile, Leu and Val, which enclose R1 and R2. χ is the end of strand a, which dominants by aromatic or bulky residues. In Fb, three ψ are those residues with small side chains. Occasionally, Asp/Asn can be replaced by Ser, Thr, Glu or Gln. If WD40 blade contains both Fa and Fb sequences, R1-2 and R1 are two φ residues in Fa, while D-1 is φ residue in Fb. If R1-2, R1 and D-1 belong to the aromatic (Phe, Try, Trp), bulky (Ile, Leu, Val) and polar (Arg, Lys, Asp, Glu, Asn and Gln) residues, they could be potential hotspots on the top face.

The Predicted R1 Propensities from 629 Primary Sequences are in Agreement with those Derived from Crystal Structures

To test the applicability of these straightforward and simple rules, we carried out prediction for 3939 annotated WD40 blades in UniProt [58] which are composed of 629 WD40 proteins. All these proteins have less than 50% sequence identities. The feature-based method can identify R1-2, R1 and D-1 residues in 2,200 WD40 blades by DHSW tetrad or two fingerprint sequences. The coverage of prediction is about 55.9%. Figure 9 presents the correlated percentage of predicted R1 residue to those in 36 WD40 proteins. R2 value is about 0.84. The most dominant residues at R1 position have quite similar tendency, such as Ser, Thr and Asn. In addition, the other residues, Arg, Leu, Try, Tyr, Phe and Lys, also have the similar percentages, which are popular as hotspots. Statistically, the feature-based method can predict hotspots on the top face quite well. As we incorporated the residue propensity and structural features into prediction quantitatively, the coverage of prediction reaches around 95%. The responding R2 value is also much better (unpublished data).

thumbnail
Figure 9. The correlation of R1 propensity between x-ray crystal structure and WD40 blade primary sequences.

The most dominant residues, Ser, Thr, Asn, Arg, Leu, Trp, Phe, Tyr and Lys are qualitatively consistent.

https://doi.org/10.1371/journal.pone.0043005.g009

The Predicted R1-2, R1 and D-1 are Consistent with Mutagenesis Studies of WD40 Proteins

Besides statistical evaluation, we further picked out all WD proteins from the test database (629 proteins in UniProt), which structures have not been determined, but mutagenesis studies of predicted R1-2, R1 and D-1 residues have been reported. A total of six R1, seven R1-2, four D-1, two X and one R1-3 mutations were found in Met30(S. cerevisiae) [36], MDV1(S. cerevisiae) [37], Tup11(S. pombe) [38], COP1(Arabidopsis) [39] and SPA1(Arabidopsis) [59].

Figure 10 shows the Ala substitutions for Met30 and MDV1 for the predicted R1-2, R1 and D-1. As for Met30, eleven residues were chosen on the basis of consensus sequence pattern [2], [4] and the structural alignment with β-TrCP [21] as well as CDC4 [29]. Substituted by Ala, five residues are found to be important for interacting with Met4 [36]. Compared with our predicted result, three of them are R1 residues (Leu386, Asn425, Gln467) and two of them are D-1 (Tyr320, Leu529). With the use of Gβ [12] and Tup1 [60] as templates, Phe400, Arg461, Asp664 and Ser689 of MDV1 were replaced by Ala. Only Arg461 was detected to be essential for interacting with Dnm1 [37], which is D-1 according to our prediction. The mutagenesis studies of COP1 [39] and SPA1 [59] are also consistent with our prediction because those residues, which are importance for protein-protein interaction, are at the predicted R1-2, R1 and D-1 positions. The detailed interpretation of experimental observation is in the supplemental material (Figure S3). As a result, the available results support the feature-based method is more efficient at predicting key residues for protein-protein interaction on the top face from the primary sequence.

thumbnail
Figure 10. The consistency of predicted R1, R1-2 and D-1 in WD40 blades of Met30p and MDV1p with the mutagenesis studies.

The blue-colored residues are in the predicted DHSW tetrad. The green-colored residues are in the predicted WDb–a and WDc–d bulges. The red-colored residues were studied by the mutagenesis. The last WD40 blade is composed of strand d at the beginning of WD40 domain and the last strand a, b and c.

https://doi.org/10.1371/journal.pone.0043005.g010

The Feature-based Method is Able to Identify R1-2, R1 and D-1 Residues Precisely and Comprehensively on the Top Face

To demonstrate its ability, we took Tup11-Fep1 interaction for an example. By the prediction, R1-2, R1 and D-1 of Tup11 S. japonicus, a WD40-repeat protein, are identified for interacting with Fep1 from the primary sequence. Tup11 has been chosen as a demonstration by two reasons. 1. It was discovered to repress the transcription of iron-uptake genes fio1+, fip1+, frp1+ and str1+ by binding a GATA-factor Fep1. Because eukaryotes are short of the egress system for iron, the interaction between Tup11 and Fep1 is important for understanding the iron homeostasis [61]. 2. Only two residues were identified to be essential for binding Fep1 on the top face with the use of Tup1 as template, which has as high as 53% sequence identities [38].

The Predicted R1-2, R1 and D-1 are Substituted by Ala and their Binding Affinities to Fep1 are Measured by ITC

As shown in Figure 11, a total of 12 underlined residues might be responsible for binding Fep1. Except for Leu384Ser, eleven residues and Cys337 as well as Thr467 are singly substituted by Ala. Figure 12a shows Kd of the wild type Tup11-Fep1 interaction, measured by ITC, is about ∼10 µM, corresponding to ∼6.6 kcal/mol for the binding free energy. Figure 12b shows the variation of binding affinities by Ala substitutions. As for R1 positions, Cyc337 and Thr467 have little effect on binding Fep1 because both residues don’t belong to hotspots. The remained five residues are all critical for binding Fep1. The Ala substitutions cause around 1.0∼1.5 kcal/mol reduction. As expected, only parts of R1-2 and D-1 residues are essential for Fep1 interaction. Leu524Ala at D-1 and Phe557Ala at R1-2 only reduce the binding free energies for 0.5∼0.8 kcal/mol. However, Tyr382 at R1-2 and Lys575 at D-1 are found to be required for binding Fep1, which decrease the binding affinity for around 1.7 and 1.9 kcal/mol, respectively. The energetic contributions are larger than R1 residues. As shown in Figure 12c, Tyr382 and Arg384 as well as Tyr508, Leu524, Phe557, Leu559 and Lys575 may form two binding clusters. The residues composed of binding cluster can stabilize each other through the interaction. It seems residues at the outer ring play major roles in pre-organizing the cluster. The Ala substitutions of Tyr382 and Lys575 may destroy the clusters. Thus, the ITC measurements of Tyr382Ala and Lys575Ala may reflect the net contribution of cluster collapse. This part will be reported in due time.

thumbnail
Figure 11. The potential “hotspots” on the top face of Tup11.

The predicted R1-2, R1 and D-1 residues in WD40 blades marked by red-color. The underlined residues besides Cys337 and Thr467 are substituted by Ala or Ser. The last WD40 blade is composed of strand d at the beginning of WD40 domain and the last strand a, b and c.

https://doi.org/10.1371/journal.pone.0043005.g011

thumbnail
Figure 12. ITC measurements of Fep1 and Tup11 wild type as well as its Ala substitutions.

(a). The ITC profile of the wild type Tup11-Fep1 interaction. (b). The measured binding affinities of the interactions between Tup11 as well as its mutants and Fep1 by ITC. (c). The critical residues on top face of Tup11.

https://doi.org/10.1371/journal.pone.0043005.g012

Although all measured values are less than 2 kcal/mol as the general definition of “hotspots”, we still consider the reduction of ∼1.0 kcal/mol binding free energy is enough to eliminate Tup11-Fep1 interaction. Leu559 at R1 in Tup11 from S. japonicus are at the same position as Leu542 from S. pombe. The single mutation of Leu542Ser destroys the interaction with Fep1 as detected by yeast two-hybrid method [38]. As shown in Figure 12b, Leu559Ala decreases the binding affinities for around 1.0 kcal/mol, suggesting that 1.0 kcal/mol is strong enough to eliminate the interaction in this specific case. Except for R1 residues, only part of R1-2 and D-1 are necessary for binding Fep1. This is because Tup11 may interact with multiple proteins on the top face, such as Tup1. Those residues, which are unimportant for interacting with Fep1, may be indispensable for the other proteins.

Conclusively, seven residues make the dominant contribution to the interaction as shown in Figure 12c. Compared with the previous achievements, five more residues are detected to be essential for interaction of Fep1 [38]. Thus, the feature-based method is capable of predicting hotspots for WD40 proteins from the primary sequences with a much higher reliability and efficient. By Figure 12c, we expect that Tup11-Fep1 interaction is dominated by hydrophobic interactions, which is currently under investigation.

Tup11 and their mutants have the similar CD spectroscopy at the same temperature and buffer as those in ITC reaction (Figure S4), which excludes the destruction of entire structure by Ala substitution. Thus, the changes of binding affinities are major due to the substitution effect. Figure 12b shows Arg382Ala, Lys575Ala and Ile601Ala have much larger error bars because the exothermic effects are dramatically decreased (Figure S5).

Conclusion

An analysis of 36 crystal structures of WD40-repeat proteins reveals that WDb–a bulge widely formed in strand b and strand a on the top surface. WDb–a β-bulges have quite different residue propensities from those in the classic β-bulge as well as WDc–d. The X, R1 and R2 of the WDb–a are dominated by small residues. Typically, these residues form inter-blade hydrogen bonds or salt-bridges to stabilize the proteins.

The R1 position in WDb–a is also fairly populated with Arg, His, Ile, Leu, Lys, Met, Phe, Trp, Tyr and Val residues. All these residues adopt a protruded state on the top face due to the steric restraint, which have the abilities to provide highly energetic contribution in the protein-protein interaction. R1 as well as R1-2 are two residues close to the beginning of strand a, which are equal to the 18th and 16th residues proposed by Gaudet et al in Gβ-phosducin interaction [13]. Available crystallographic and mutagenesis studies of 29 protein complexes formed by 17 WD proteins support that R1-2, R1 and D-1 are generally required in protein-protein interactions.

With the use of the DHSW tetradsand β-bulges, the core sequence pattern has been obtained by aligning WD40 blades of 36 WD40 proteins. The resulting common sequence pattern provides clear information for the residue locations of the DHSW tetrad and WDb–a β-bulge. Applying the dependency of R1-2, R1 and D-1 on the structural features, a feature-based method was developed to predict R1-2, R1 and D-1 from the primary sequences.

The predicted R1-2, R1 and D-1 by the feature-based method are consistent with the available mutagenesis studies of Met30, MDV1, COP1, SPA1 and Tup11. Twelve residues on R1-2, R1 and D-1 of Tup11 are predicted to be potential hotspots for recognizing Fep1. Five R1 (Arg384, Tyr426, Tyr508, Leu559, Ile601), one R1-2 (Tyr382) and one D-1 (Lys575) are convinced by ITC measurement to be important for binding Fep1. Compared with the previous result, the feature-based method is able to identify R1-2, R1 and D-1 precisely on the top face independent of crystal structure.

Materials and Methods

Protein Expression, Purification and Mutagenesis

The cDNA of S. japonicus Tup11 (WD40 domain: amino acids 301–630) and C-terminal region of S. pombe Fep1 (amino acids 319–564) were cloned into pET-28a (+) vectors. The alignment indicates WD40 domains of Tup11 from S. japonicas and S. pombe have the identical top face. All mutated DNA were prepared by following protocol of QuickChange site-directed mutagenesis kit. The constructs and mutants were confirmed by DNA sequencing. The wild type Tup11 and its mutants as well as Fep1 were overexpressed at 25°C in E. coli BL21 (DE3) with 0.1 mM IPTG. The proteins were purified by nickel affinity chromatography and followed by gel filtration. After the gel filtration, all proteins are finally dissolved in the 1×PBS buffer containing 0.01 mM Na2HPO4, 0.01 mM NaH2PO4, pH 7.4 and 150 mM NaCl. The Fep1 protein, Tup11 WT and its mutants were concentrated to ∼1.4 mM and ∼1 mM, respectively. The protein concentration was determined by NanoDrop 2000 Spectrophotometer (Thermo Scientific) as described in our previous work [56].

ITC Measurement

The highly concentrated Fep1 was used directly, and Tup11 WT and its mutants were diluted to ∼0.2 mM by 1×PBS buffer in the titration. All ITC measurements were performed at 25°C on an iTC200 device (MicroCal). Total volume of 34 µl of Fep1 was injected by 2 µl each time at stirring speed of 1000 rpm, with 2 min injection spacing for equilibrium. The isotherm of Fep1 titrate into PBS buffer was set as background. Final data was obtained by subtracting background and fitting in Origin.

CD Measurement

All CD measurements are carried out on a Chirascan Circular Dichroism (CD) Spectrometer (Applied Photophysics). Tup11 WT and its mutants were diluted to ∼0.1 mg/ml by deionized water, and were placed in a Hellma 1 mm light path Micro-cell separately. Three CD spectra were collected for each sample at 25°C over the wavelength range from 200 nm to 260 nm, with a 1 nm step wise. Three CD spectra of deionized water were averaged and set as baseline, and subtracted from the average spectrum of sample. At last, all CD data were converted to mean residue ellipticity.

Supporting Information

Figure S1.

The intra−/inter-blade hydrogen bonds formed by two WDb–a.

https://doi.org/10.1371/journal.pone.0043005.s001

(TIF)

Figure S2.

The inter-blade salt bridges formed between Asp/Glu in the R2 of WDb–a and Arg/Lys in the strand a of the other WD40 blades.

https://doi.org/10.1371/journal.pone.0043005.s002

(TIF)

Figure S3.

The prediction of R1, R1-2 and D-1 in the Tup11 from S. pombe , COP1 and SPA1 from A. thaliana .

https://doi.org/10.1371/journal.pone.0043005.s003

(TIF)

Figure S4.

CD profiles for Tup11 wild type and fourteen Ala substituted mutants.

https://doi.org/10.1371/journal.pone.0043005.s004

(TIF)

Figure S5.

ITC measurements of interaction between Tup11 substitutions and Fep1.

https://doi.org/10.1371/journal.pone.0043005.s005

(TIF)

Table S1.

The residues involved in 29 protein complexes formed by 17 WD proteins.

https://doi.org/10.1371/journal.pone.0043005.s006

(DOC)

Author Contributions

Conceived and designed the experiments: XHW YDW. Performed the experiments: YW ZZ. Analyzed the data: XHW YDW. Contributed reagents/materials/analysis tools: XHW ZZ YW FJ. Wrote the paper: XHW YDW.

References

  1. 1. Fülöp V, Jones DT (1999) β-propellers: structural rigidity and functional diversity. Curr. Opin. Struct. Biol. 9: 715–721.
  2. 2. Neer EJ, Schmidt CJ, Nambudripad R, Smith TF (1994) The ancient regulatory protein family of WD-repeat proteins. Nature 371: 297–300.
  3. 3. Stirnimann CU, Petsalaki E, Russell RB, Müller CW (2010) WD40 proteins propel cellular networks. Trends Biochem. Sci. 35: 565–574.
  4. 4. Smith TF, Gaitatzes C, Saxena K, Neer E (1999) The WD repeat: a common architecture for diverse functions. Trends Biochem. Sci. 24: 181–185.
  5. 5. Yokoyama A, Wang Z, Wysocka J, Sanyal M, Aufiero DJ, et al. (2004) Leukemia proto-oncoprotein MLL forms a SET1-like histone methyltransferase complex with menin to regulate Hox gene expression. Mol. Cell Biol. 24: 5639–5649.
  6. 6. Rea S, Elsenhaber F, O’Carroll D, Strahl BD, Sun Z-W, et al. (2000) Regulation of chromatin structure by site-specific histone H3 methyltransferase. Nature 406: 593–599.
  7. 7. Winston JT, Koepp DM, Zhu C, Elledge SJ, Harper JW (1999) A family of mammalian F-box proteins. Curr. Biol. 9: 1180–1182.
  8. 8. Jin JP, Arias EE, Chen J, Harper JW, Walter JC (2006) A family of diverse Cul4-Ddb1-interacting proteins includes Cdt2, which Is required for S phase destruction of the replication factor Cdt1. Mol. Cell. 23: 709–721.
  9. 9. Higa LA, Wu M, Ye T, Kobayashi R, Sun H, et al. (2006) CUL4-DDB1 ubiquitin ligase interacts with multiple WD40-repeat proteins and regulates histone methylation. Nat. Cell Biol. 8: 1277–1283.
  10. 10. He YJ, McCall CM, Hu J, Zeng Y, Xiong Y (2006) DDB1 functions as a linker to recruit receptor WD40 proteins to CUL4–ROC1 ubiquitin ligases. Genes Dev. 20: 2949–2954.
  11. 11. Zou H, Henzel WJ, Liu X, Lutschg A, Wang X (1997) Apaf-1, a human protein homologous to C. elegans CED-4, participates in cytochrome c–dependent activation of caspase-3. Cell 90: 405–413.
  12. 12. Wall MA, Coleman DE, Lee E, Iñiguez-Lluhi JA, Posner BA, et al. (1995) The structure of the G protein heterotrimer Giα1β1γ2. Cell 83: 1047–1058.
  13. 13. Gaudet R, Bohm A, Sigler PB (1996) Crystal structure at 2.4 angstroms resolution of the complex of transducin betagamma and its regulator, phosducin. Cell 87: 577–588.
  14. 14. Lambright DG, Sondek J, Bohm A, Skiba NP, Hamm HE, et al. (1996) The 2.0 Å crystal structure of a heterotrimeric G protein. Nature 379: 311–319.
  15. 15. Hsia KC, Stavropoulos P, Blobel G, Hoelz A (2007) Architecture of a coat for the nuclear pore membrane. Cell 131: 1313–1326.
  16. 16. Brohawn SG, Leksa NC, Spear ED, Rajashankar KR, Schwartz TU (2008) Structural Evidence for Common Ancestry of the Nuclear Pore Complex and Vesicle Coats. Science 322: 1369–1373.
  17. 17. Fath S, Mancias JD, Bi X, Goldberg J (2007) Structure and organization of coat proteins in the COPII cage. Cell 129: 1325–1336.
  18. 18. Couture JF, Collazo E, Trievel RC (2006) Molecular recognition of histone H3 by the WD40 protein WDR5. Nat. Struct. Mol. Biol. 13: 698–703.
  19. 19. Margueron R, Justin N, Ohno K, Sharpe ML, et al. (2009) Role of the polycomb protein EED in the propagation of repressive histone marks. Nature 461: 762–767.
  20. 20. Hao B, Oehlmann S, Sowa ME, Harper JW, Pavletich NP (2007) Structure of a Fbw7-Skp1-cyclin E complex: multisite-phosphorylated substrate recognition by SCF ubiquitin ligases. Mol. Cell 26: 131–143.
  21. 21. Wu G, Xu G, Schulman BA, Jeffrey PD, Harper JW, et al. (2003) Structure of a beta-TrCP1-Skp1-beta-catenin complex: destruction motif binding and lysine specificity of the SCF(beta-TrCP1) ubiquitin ligase. Mol. Cell 11: 1445–1456.
  22. 22. Sondek J, Bohm A, Lambright DG, Hamm HE, Sigler PB (1996) Crystal structure of a G protein βγ dimer at 2.1 Å resolution. Nature 379: 369–374.
  23. 23. Lodowski DT, Pitcher JA, Capel WD, Lefkowitz RJ, Tesmer JJ (2003) Keeping G proteins at bay: a complex between G protein-coupled receptor kinase 2 and Gbetagamma. Science 300: 1256–1262.
  24. 24. Pickles LM, Roe SM, Hemingway EJ, Stifani S, Pearl LH (2002) Crystal structure of the C-terminal WD40 repeat domain of the human Groucho/TLE1 transcriptional corepressor. Structure 10: 751–761.
  25. 25. Jennings BH, Pickles LM, Wainwright SM, Roe SM, Pearl LH, et al. (2006) Molecular Recognition of Transcriptional Repressor Motifs by the WD Domain of the Groucho/TLE Corepressor. Mol. Cell 22: 645–655.
  26. 26. Fischer ES, Scrima A, Bohm K, Matsumoto S, Lingaraju GM, et al. (2011) The molecular basis of CRL4DDB2/CSA ubiquitin ligase architecture, targeting, and activation. Cell 147: 1024–1039.
  27. 27. Lau C, Bachorik JL, Dreyfuss G (2009) Gemin5-snRNA interaction reveals an RNA binding function for WD repeat domains. Nat. Struct. Mol. Biol. 16: 486–491.
  28. 28. Larsen NA, Al-Bassam J, Wei RR, Harrison SC (2007) Structural analysis of Bub3 interactions in the mitotic spindle checkpoint. Proc. Natl. Acad. Sci. 104: 1201–1206.
  29. 29. Orlicky S, Tang X, Willems A, Tyers M, Sicheri F (2003) Structural basis for phosphodependent substrate selection and orientation by the SCFCdc4 ubiquitin ligase. Cell 112: 243–256.
  30. 30. Tarricone C, Perrina F, Monzani S, Massimiliano L, Kim MH, et al. (2004) Coupling PAF signaling to dynein regulation: structure of LIS1 in complex with PAF-acetylhydrolase. Neuron 44: 809–821.
  31. 31. Larsen NA, Harrison SC (2004) Crystal structure of the spindle assembly checkpoint protein Bub3. J. Mol. Biol. 344: 885–892.
  32. 32. Srinivasan V, Netz DJ, Webert H, Mascarenhas J, Pierik AJ, et al. (2007) Structure of the yeast WD40 domain protein Cia1, a component acting late in iron-sulfur protein biogenesis. Structure 15: 1246–1257.
  33. 33. Pashkova N, Gakhar L, Winistorfer SC, Ramaswamy S, Piper RC (2010) WD40 Repeat Propellers Define a Ubiquitin-Binding Domain that Regulates Turnover of F Box Proteins. Mol. Cell 40: 433–443.
  34. 34. Cerna D, Wilson DK (2005) The structure of Sif2p, a WD repeat protein functioning in the SET3 corepressor complex. J. Mol. Biol. 351: 923–935.
  35. 35. Cheng Z, Liu Y, Wang C, Parker R, Song H (2004) Crystal structure of Ski8p, a WD-repeat protein with dual roles in mRNA metabolism and meiotic recombination. Protein Sci. 113: 2673–2684.
  36. 36. Brunson LE, Dixon C, LeFebvre A, Sun L, Mathias N (2005) Identification of residues in the WD-40 repeat motif of the F-box protein Met30p required for interaction with its substrate Met4p. Mol. Gen. Genomics 273: 361–370.
  37. 37. Cerveny KL, Jensen RE (2003) The WD-repeats of Net2p interact with Dnm1p and Fis1p to regulate division of mitochondria. Mol. Biol. Cell 14, 4126–4139.
  38. 38. Znaidi S, Pelletier B, Mukai Y, Labbé S (2004) The Schizosaccharomyces pombe corepressor Tup11 interacts with the iron-responsive transcription factor Fep1. J. Biol. Chem. 279: 9462–9474.
  39. 39. Holm M, Hardtke CS, Gaudet R, Deng XW (2001) Identification of a structural motif that confers specific interaction with the WD40 repeat domain of Arabidopsis COP1. EMBO J. 20, 118–127.
  40. 40. Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, et al. (2000) The Protein Data Bank. Nucleic Acids Res. 28: 235–242.
  41. 41. Richardson JS, Getzoff ED, Richardson DC (1978) The β bulge: a common small unit of nonrepetitive protein structure. Proc. Natl. Acad. Sci. USA 75: 2574–2578.
  42. 42. Chan AW, Hutchinson EG, Harris D, Thornton JM (1993) Identification, classification, and analysis of beta-bulges in proteins. Protein Sci. 2: 1574–1590.
  43. 43. Richardson JS, Richardson DC (2002) Natural β-sheet proteins use negative design to avoid edge-to-edge aggregation. Proc. Natl. Acad. Sci. USA 99: 2754–2759.
  44. 44. Chen PY, Gopalacushina BG, Yang CC, Chan SI, Evans PA (2001) The role of a β-bulge in the folding of the β-hairpin structure in ubiquitin. Protein Sci. 10: 2063–2074.
  45. 45. Mishra P, Volety S, Rao CM, Prabha CR (2009) Glutamate64 to Glycine Substitution in G1 β-bulge of Ubiquitin Impairs Function and Stabilizes Structure of the Protein. J. Biochem. 146: 563–569.
  46. 46. Dion-Schultz A, Howell EE (1997) Effects of insertions and deletions in a beta-bulge region of Escherichia coli dihydrofolate reductase. Protein Eng. 10: 263–272.
  47. 47. Axe DD, Foster NW, Fersht AR (1999) An irregular β-bulge common to a group of bacterial RNases is an important determinant of stability and function in Barnase. J. Mol. Biol. 286: 1471–1485.
  48. 48. Xie C, Prahl A, Ericksen B, Wu Z, Zeng P, et al. (2005) Reconstruction of the conserved β-bulge in mammalian defensins using D-amino acids. J. Biol. Chem. 280: 32921–32929.
  49. 49. Bogan AA, Thorn KS (1998) Anatomy of hot spots in protein interfaces. J. Mol. Biol. 280: 1–9.
  50. 50. Lo Conte L, Chothia C, Janin J (1999) The atomic structure of protein-protein recognition sites. J. Mol. Biol. 285: 2177–2198.
  51. 51. DeLano WL (2002) Unraveling hot spots in binding interfaces: progress and challenges. Curr. Opin. Struct. Biol. 12: 14–20.
  52. 52. Komachi K, Johnson AD (1997) Residues in the repeats of Tup1 required for interaction with α2. Mol. Cell. Biol. 17: 6023–6028.
  53. 53. Song JJ, Kingston RE (2008) WDR5 interacts with mixed lineage leukemia (MLL) protein via the histone H3-binding pocket. J. Biol. Chem. 283: 35258–35264.
  54. 54. Buscarlet M, Perin A, Laing A, Brickman JM, Stifani S (2008) Inhibition of cortical neuron differentiation by Groucho/TLE1 requires interaction with WRPW, but not Eh1, repressor peptides. J. Biol. Chem. 283: 24881–24888.
  55. 55. Wu XH, Zhang H, Wu YD (2010) Is Asp-His-Ser/Thr-Trp tetrad hydrogen-bond network important to WD40-repeat proteins: A statistical and theoretical study. Proteins: Struct. Funct. Genet. 78: 1186–1194.
  56. 56. Wu XH, Chen RC, Gao Y, Wu YD (2010) The effect of Asp-His-Ser/Thr-Trp tetrad on the thermostability of WD40-repeat Proteins. Biochemistry 49: 10237–10245.
  57. 57. Letunic I, Doerks T, Bork P (2009) SMART 6: recent updates and new developments. Nucleic Acids Res. 37: D229–D232.
  58. 58. The UniProt Consortium (2010) The universal protein resource (UniProt) in 2010. Nucleic Acids Res. 38: D142–D148.
  59. 59. Yang J, Wang H (2006) The central coiled-coil domain and carboxyl-terminal WD-repeat domain of Arabidopsis SPA1 are responsible for mediating repression of light signaling. Plant J. 47: 564–576.
  60. 60. Sprague ER, Redd MJ, Johnson AD, Wolberger C (2000) Structure of the C-terminal domain of Tup1, a corepressor of transcription in yeast. EMBO J. 19: 3016–3027.
  61. 61. Kaplan CD, Kaplan J (2009) Iron acquisition and transcriptional regulation. Chem. Rev. 109: 4536–4552.