Molecular BioSystems PAPER

Size: px
Start display at page:

Download "Molecular BioSystems PAPER"

Transcription

1 Molecular BioSystems Cite this: Mol. BioSyst., 2012, 8, Dynamic Article Links PAPER Functional annotation of the mesophilic-like character of mutants in a cold-adapted enzyme by self-organising map analysis of their molecular dynamicsw Domenico Fraccalvieri, a Matteo Tiberti, b Alessandro Pandini, cd Laura Bonati* a and Elena Papaleo* b Received 16th May 2012, Accepted 16th June 2012 DOI: /c2mb25192b Multiple comparison of the Molecular Dynamics (MD) trajectories of mutants in a cold-adapted a-amylase (AHA) could be used to elucidate functional features required to restore mesophilic-like activity. Unfortunately it is challenging to identify the different dynamic behaviors and correctly relate them to functional activity by routine analysis. We here employed a previously developed and robust two-stage approach that combines Self-Organising Maps (SOMs) and hierarchical clustering to compare conformational ensembles of proteins. Moreover, we designed a novel strategy to identify the specific mutations that more efficiently convert the dynamic signature of the psychrophilic enzyme (AHA) to that of the mesophilic counterpart (PPA). The SOM trained on AHA and its variants was used to classify a PPA MD ensemble and successfully highlighted the relationships between the flexibilities of the target enzyme and of the different mutants. Moreover the local features of the mutants that mostly influence their global flexibility in a mesophilic-like direction were detected. It turns out that mutations of the cold-adapted enzyme to hydrophobic and aromatic residues are the most effective in restoring the PPA dynamic features and could guide the design of more mesophilic-like mutants. In conclusion, our strategy can efficiently extract specific dynamic signatures related to function from multiple comparisons of MD conformational ensembles. Therefore, it can be a promising tool for protein engineering. Introduction Molecular Dynamics (MD) simulations provide links between protein structure and dynamics by allowing the exploration of the conformational energy landscape accessible to protein molecules. 1,2 The analysis of the large ensembles of conformations generated by a MD simulation provides information about the average physico-chemical and geometrical properties of a protein and enables the identification of recurring conformations and transitions between them. Moreover, more recently it has been demonstrated that the comparison of MD conformational ensembles of functionally related proteins allows us to highlight the role of flexibility in modulating protein function as well as to analyse the evolutionary a Department of Environmental Sciences, University of Milano-Bicocca, Milan, Italy. laura.bonati@unimib.it b Department of Biotechnology and Biosciences, University of Milano-Bicocca, Milan, Italy. elena.papaleo@unimib.it c Randall Division of Cell and Molecular Biophysics, King s College London, London, UK d Division of Mathematical Biology, MRC National Institute for Medical Research, London, UK w Electronic supplementary information (ESI) available. See DOI: /c2mb25192b conservation and specialization of protein dynamics across distant homologous proteins Identification of functionally relevant conformations within trajectories of a same system is generally achieved by clustering the raw ensemble of structures that are generated. Geometrical clustering algorithms have been introduced and extensively used in this context, based on the assumption that structurally similar conformations lie in the same basin of the free energy surface. However, these analyses are often affected by noise and are difficult to interpret, particularly if the goal is the comparison among trajectories of different systems to highlight subtle similarities and differences in the dynamics related to function. This emphasises the need for more advanced tools to compare conformational ensembles derived from extensive MD simulations of different systems. A suitable alternative to handle multiple comparisons among complex data is the employment of Self-Organising Maps (SOMs). A SOM is a specific architecture of artificial neural networks that projects high-dimensional input data on a low-dimensional grid (map) of so-called neurons, where similar elements are associated to neighbouring neurons. 19 Recently, SOMs were reported performing more accurately and providing more consistent results than traditional clustering 2680 Mol. BioSyst., 2012, 8, This journal is c The Royal Society of Chemistry 2012

2 algorithms in various data-mining problems. This approach was also applied to MD data analysis 17,20,21 and a comparison with traditional clustering algorithms identified SOMs among the best performing methods. 17 We have recently developed a new approach to efficiently compare conformational ensembles of proteins, which combines SOMs and hierarchical clustering. 7 In our two-level approach, the SOM captures the relevant features of the large input space in a smaller set of prototype vectors and afterwards clustering algorithms are used to divide the prototypes into groups. The major advantage over classical clustering approaches is the possibility of providing a topological mapping of the conformational space embedded in a simple 2D visualisation. This not only simplifies the identification of differences in the conformational dynamics and the definition of multiple relationships among these, but also helps in the functional annotation. To provide the users with optimal SOM parameters for this specific type of data, we developed a protocol by which a small number of important parameters was identified and their optimal values were derived. 7 Once the optimized parameters were obtained, the SOM approach was applied to the investigation of dynamic properties of a single domain, where ligand binding activity is known to be modulated by single mutations 22 that turned out to greatly affect the conformational dynamics of the protein. Our approach not only allowed a very efficient comparison of the mutants trajectories but also led to a functional interpretation of the observed differences among the mutants. 7 In this paper, a further assessment of the potential of our approach is performed and the extension of the previous proposal in the direction of a predictive tool is designed. In fact the SOM approach is employed to rationalize how single and multiple mutations are able to modify the dynamic properties of a protein to such an extent of restoring the properties of a homologous protein. It is known that specific dynamic signatures are a relevant hallmark of differently temperature adapted proteins, isolated from thermophilic, mesophilic and psychrophilic organisms We also recently demonstrated that a correlation exists between modification in the catalytic and thermodynamic parameters and the modification that the mutations exert on the protein dynamics in a battery of seven mesophilic-like mutations of a cold-adapted a-amylase (namely AHA, Pseudoalteromonas haloplanktis a-amylase). 9 In fact, these mutations are able to partially restore the kinetic and thermodynamic properties of its warm-adapted counterpart, the porcine a-amylase, namely PPA. 28,29 On the other hand, our MD simulation approach identified that the inserted mutations were able to reestablish in AHA a map of correlated motions and electrostatic interactions found in the mesophilic enzyme and, at the same time, to lose dynamic properties typical of the wild-type psychrophilic a-amylase. 9 Interestingly, although each of these mutations exerts long-range effects, the specific effects induced by each mutation and their contribution and efficiency in modulating the dynamic properties were not properly elucidated. In particular, it is not well-clarified the rationale behind the fact that the multiple AHA mutants, in which all the single mutations are combined together, did not feature kinetic and thermodynamic parameters which can account for real additive effects, as well as the mutations so-far applied to AHA are not able to completely convert the coldadapted enzyme in a mesophilic-like variant but exert intermediate effects. In light of the previous observations, the multiple comparison of the MD conformational ensembles of AHA and its mutants turns out to be a very interesting case of study to further assess the potential of our SOM approach. In particular, a first aim of this paper is to demonstrate the efficiency and reliability of this approach in elucidating the differences in flexibility related to function in a case where, due to the complex network of local and distal effects induced by mutations, the multiple comparison of the MD trajectories is particularly demanding. Moreover, a novel strategy is proposed to identify specific mutations able to efficiently convert the AHA dynamic properties to those of the homologous mesophilic system. To this aim the use of the SOM trained with the trajectories of AHA and its mutants to classify the MD conformational ensemble of PPA is proposed. The comparison between conformations sampled by mesophilic-like variants and by the reference PPA enzyme, based on the use of specific local descriptors mapped on the SOM, succeeded in both identifying and designing mutants that are more suitable to restore the PPA characteristics. This result confirmed the ability of our strategy to efficiently extract specific dynamic signatures related to function from multiple comparisons of protein MD ensembles and highlighted the potential of this tool as a support for protein engineering. Materials and methods Self-Organising Map (SOM) approach to compare MD trajectories A SOM is a widely used tool to handle multidimensional input data and visualize them in a grid (map), in which each node is called neuron. 30 During the learning process each neuron wins the data closest to it and modify itself iteratively becoming more similar to the data. The neurons obtained after the learning process are characterized by vectors with the same dimension of the input data (the so called prototype vectors ), representing a particular feature drawn from the original input space. The number of unique features that can be represented on the map increases with the number of employed neurons, along with the computational cost. A SOM can be interpreted as a semantic map where similar samples are mapped close together and dissimilar apart, as well as a discrete approximation of the distribution of training samples in the prototype vectors. The trained SOM can also be employed to assign data not used in the training process to the neurons. In this case, data are only read by and compared with the neurons of the map, and the winning neuron does not modify its prototype vector after the assignment. A two-level approach that combines SOM and hierarchical clustering of the obtained prototype vectors, previously developed to analyse MD data, 7 was employed. In this approach the SOM input data are ensembles of conformations extracted from the MD trajectories of one or more domains, each This journal is c The Royal Society of Chemistry 2012 Mol. BioSyst., 2012, 8,

3 conformation being described by the Cartesian coordinates of the Ca atoms. At the end of the SOM training process, the original conformational ensemble is projected on a bi-dimensional feature map, where a limited number of prototype vectors summarizes the entire sampled space, and similar conformations are associated to neighbour neurons. In the second step of the approach, the prototype vectors are further grouped in a small, but representative, number of clusters by using a hierarchical agglomerative clustering algorithm, the complete linkage. 31 The optimal sampling rate of the MD trajectory, previously set to use the minimum number of conformations while maintaining a reliable picture of the protein dynamics, 7 was employed. It consists of sampling one conformation every 100 ps. Both the learning and topological SOM parameters that were optimized for MD data by experimental design 7 were employed. The optimized SOM learning parameters are: learning algorithm = batch, neighbourhood function = Gaussian, alpha type = linear, radius = 3, training length = 5000 epochs, and initial value of alpha = The optimal topological parameters obtained are: map size = 100, lattice shape = hexagonal, and shape = sheet. A first inspection on the number of clusters that best summarize the information in the map was obtained by the Mojena s stopping rule. 32 Then, the number of clusters was refined by a deeper inspection of the meaning of the obtained clusters, in order to increase their interpretability. The quality of the obtained clusters was assessed using the Silhouette index (S) 33 that evaluates their compactness and separation. Its value is in the range [ 1, 1] and is generally interpreted as evidence in support of the cluster structure: 34 strong (S > 0.7), reasonable (0.5 o S r 0.7), weak (0.25 o S r 0.5) or no significant evidence (S r 0.25). The analysis was performed in the MATLAB environment using the SOM Toolbox, 35 the Statistics Toolbox 36 and the Mojena s rule implementation made available by courtesy of Prof. Fernandez ( Results and discussion Summary of structural and dynamical properties of AHA, AHA mutants and PPA The overall fold of AHA (PDB entry 1AQH 37 ) and PPA (PDB entry 1PIF 38 ) is conserved. They exhibit a multi-domain organization and the three characteristic domains of chloridedependent a-amylases: 39 the (b/a) 8 catalytic domain A, a small domain B which protrudes from domain A and includes the calcium binding site, as well as a C-terminal domain (domain C) characterized by b-strands organized in a Greek-key motif (Fig. 1A). In domain A the catalytic triad is located, including the nucleophile Asp174, the proton donor Glu200 and an acidic residue, Asp264, which favours the protonation state of the glutamic acid side chain 39 (AHA numbering). The cold-adapted a-amylase AHA and its mesophilic counterpart PPA share 43% of sequence identity (Fig. 1B), with gaps mostly located in the loop regions in the catalytic domain. Several differences have been highlighted between the two enzymes. In fact, AHA shows fewer intramolecular interactions, a reduced compactness of the hydrophobic core and of the inter-domain interfaces, increased hydrophobicity of the solvent accessible surface and a different architecture of the ion binding site with respect to PPA. 37,38,40 Furthermore, the loops surrounding the active site are generally shorter in AHA than PPA and feature distinct flexibility as judged by previous all-atom explicit solvent MD simulations. 9,41 Fig. 1 Main structural and sequence features of AHA, AHA mutants and PPA. (A) PPA and AHA 3D structures are superimposed (PDB IDs 1AQH and 1PIF). The mutation sites and the catalytic residues of AHA are shown as orange ball and sticks and yellow dots, respectively. The relevant loops for discussion L3, L5 and L7 are highlighted as light blue ribbons. Residues ignored in the SOM classification scheme (i.e. those of domain C and those corresponding to gaps in the structural alignment) are shown in black. (B) Structural alignment between AHA and PPA crystallographic structures. The alignment was calculated using the DaliLite server 52 and then manually adjusted. The intensity of the red background color for each residue is proportional to the RMSF values calculated from MD simulations. Secondary structure elements are shown as calculated by DSSP 53 on the crystal structures. Catalytic residues are highlighted using star symbols, mutation sites are highlighted by black boxes and the relevant loops are enclosed in brackets Mol. BioSyst., 2012, 8, This journal is c The Royal Society of Chemistry 2012

4 Recently, we also investigated by comparative MD simulations 9 a battery of seven previously experimentally characterized 29,42,43 mesophilic-like mutants of AHA. Particular attention was devoted in this study to the comparison of flexibility, networks of electrostatic interactions, as well as correlated motions in AHA, AHA mutants and PPA. In particular the following AHA mutants were investigated: AHANR (N12R), AHASS (Q58C, A99C), AHAVF (V196F), AHATV (T232V), AHAQI (Q164I), AHA5 (N150D, V196F, T232V, Q164I, K300R), and AHA5SS (Q58C, A99C, N150D, Q164I, V196F, T232V, K300R) (Fig. 1A). The restored weak interactions are able to modify AHA dynamics in a warmadapted-like direction, acting long-range on the loops surrounding the catalytic cleft. 9 In particular, these differences regard loops b 3 -a 3 (L3), b 5 -a 5 (L5), b 7 -a 7 (L7) and b 8 -a 8 (L8). L3, L5 and L7 project toward the catalytic site, forming a three-side cleft on one side of the substrate binding pocket, whereas interactions between L8, which is longer in PPA than AHA, and L7 regulate the conformational properties of L7 in PPA. 41 Furthermore, L3, L5 and L7 were shown to interact with co-crystallized substrate analogues, and the rearrangement of L7 was suggested to have a role in the binding and release of the substrates in a-amylases The differences between PPA and AHA in these loops cause a different distribution in the 3D structure of the most flexible regions, as judged by the analysis of Ca Root-Mean Square Fluctuation (RMSF) profiles (Fig. 1B). 41 More in detail, the L3 region was demonstrated to feature in AHA a longer C-terminal extension (L3 C ) that carries a short and highly mobile a-helical portion. In PPA, the helix is preceded by a highly flexible coil region (the N-terminal region of L3, L3 N ), which does not feature a correspondence in the structure of the cold-adapted enzymes, where gaps are located according to the structural alignment (Fig. 1A). The comparison of the intensity and localization of the L3 dynamics shows that in AHA high flexibility is localized nearer to the catalytic triad than in PPA. 41 L5 is immediately downstream the catalytic E200 AHA (E233 PPA ) residue and is conserved in both structure and fluctuation intensity in AHA and PPA. However, significant differences were identified in the map of correlated motions between the L5 and L3 regions, which are greater and more connected in AHA. 41 L7 is downstream a short a-helix carrying another catalytic residue, D264 AHA (D300 PPA ). It is the most flexible region of AHA and features the greatest RMSF differences between the two enzymes (Fig. 1B). Dominant motions of L7 involve oscillations to and from the active-site groove, with amplitudes enhanced in AHA. In PPA, above and directly interacting with L7, a high flexible insertion with respect to AHA is located (L8), which is the most flexible region of PPA (Fig. 1B). L3 C and L7 are the loops of the b-barrel that are closer to the catalytic site and their flexibility is optimized in AHA, whereas in PPA the higher protein fluctuations are shifted further away from the catalytic triad at the L3 N and L8 PPA insertions. The dominant motions of PPA are partially restored in the mesophilic-like AHA mutants, even if the mutation sites are not located in L3, L5, L7 or L8 regions or their immediate proximity (Fig. 1A). The long-range dynamic effects induced by single and multiple mutations suggest that our analysis should be directed not only toward the comparison of the global dynamics of these systems but also toward the detection of which local dynamic features are able to influence their global dynamics in a mesophilic-like direction. SOM analysis of the flexibilities of AHA and its mutants The approach combining SOMs and hierarchical clustering (complete linkage), which we previously proposed for the comparison of MD trajectories, 7 was used to analyse and compare the flexibilities of the wild-type AHA (AHAwt) and its seven mutants (AHA5, AHA5SS, AHAQI, AHASS, AHATV, AHAVF, and AHANR). The protocol and the SOM parameters employed, derived from the previous optimization process, 7 are reported in the Materials and methods section. To perform a multiple comparison, the ensemble of conformations extracted from the trajectories of AHAwt and its seven mutant variants was provided as input data to a single map. As reported in the Materials and methods section, each conformation was described by the Cartesian coordinates of its a-carbon (Ca atoms). To focus our attention on the flexibility around the catalytic site, only the domains A and B were considered for all the systems. Moreover, with the aim of a subsequent comparison with the PPA flexibility, only the matching residues extracted from the AHA PPA structural alignment 41 were retained, based on the assumption that they bear the differences in the dynamic features of the two systems (Fig. 1B). 4,5,9,41 This selection process produced a set of 351 Ca atoms considered for each system. After the learning process, the proposed dimension of the map, 7 i.e. 100 neurons, led to a set of 100 prototype vectors describing the most relevant conformational features from the input space and providing an intermediate topological reference ( proto-clusters 31 ). These vectors were then clustered using the complete linkage approach and all the original conformations won by a neuron were accordingly assigned to the same cluster. Therefore the output SOM allows us to extract information at different levels: neuron level, using only the characteristics of the prototype vectors, i.e. their variables and their hit population; centroid level, using the input conformation closest to the centroid of each cluster; cluster level, using all the input conformations assigned to a cluster. The SOM trained with the eight trajectories of AHA and its mutants is shown in Fig. 2A by using a pictorial representation where each neuron is represented by an hexagon, and the number of input data won by each neuron by black hexagons with a size proportional to the neuron population. Visual inspection of the map allows the analysis of the final distribution of the original data: neighbour neurons are associated to similar groups of conformations; map regions with highly populated neurons represent conformations frequently sampled in the trajectories. The seven clusters obtained by complete linkage (see the Materials and methods section) showed a good quality, as demonstrated by the Silhouette plot, in Fig. 2B, that reports satisfactory values of S for the neurons assigned to all the clusters. To analyse the conformational space described by each cluster, the cluster level and the centroid level information were used. First, the population of each This journal is c The Royal Society of Chemistry 2012 Mol. BioSyst., 2012, 8,

5 Fig. 2 SOM analysis of the dynamics of AHA and its mutants. (A) SOM trained with the eight trajectories of AHA and its mutants. Colors indicate the seven clusters obtained by hierarchical clustering of the neurons. The size of the black hexagons in each neuron is proportional to the number of hits in the neuron. (B) Silhouette plot: the profile of each cluster is composed by a bar for each neuron belonging to the cluster, whose height is the value of the S index; clusters are colored with the same color code used for the SOM. (C) Bar plot representing the percentage distribution of conformations of each mutant in the seven clusters. cluster was analysed to find groups of similar conformations sampled by different systems, as well as to identify conformations with peculiar characteristics sampled by a single system. Second, the centroid of each cluster was analysed to obtain a synthetic three-dimensional (3D) visualization of the conformations that characterize the cluster. The bar plot in Fig. 2C indicates that some clusters are specifically representative of one mutant flexibility, while others are composed by conformations sampled by different mutants. In particular, clusters 3 7 feature more than 40% of conformations relative to one of the systems; clusters 3 and 5 are mainly composed by conformations of AHAwt, while clusters 4, 6 and 7 are typical of AHATV, AHAQI and AHAVF, respectively. In contrast, clusters 1 and 2 do not present a dominant population from one system but it is evident the total absence of conformations relative to AHAwt in both of them. Thus, these two clusters can be interpreted as the ensemble of conformations belonging to common basins sampled in the simulations of the mesophilic-like AHA mutants. The centroid of a cluster in a SOM is defined as the average vector between all the prototype vectors of the neurons assigned to that cluster. Therefore the hit conformation closest to each cluster centroid, in terms of Euclidean distance, well represents the conformational characteristics of that cluster. Fig. 3 shows, at the centroid level, the seven clusters obtained for AHAwt and its mutants, by using a porcupine representation that describes structural differences between each centroid and the AHAwt reference structure. It can be observed that clusters 3 (red) and 5 (magenta) mainly describe the flexibility of AHAwt (Fig. 2C): they include either conformations of AHAwt close to the starting X-ray structure of AHA (cluster 3) or conformations associated to a specific motion involving loops L3, L5 and L7 (cluster 5). Cluster 4 (brown), which mainly describes the 2684 Mol. BioSyst., 2012, 8, This journal is c The Royal Society of Chemistry 2012

6 Fig. 3 Centroid level representation of the dynamics of AHA and its mutants. The porcupine representation describes direction and amplitude of the changes between a reference structure and the conformation closest to the centroid of each cluster. For each cluster cones are colored with the same color used for that clusters in the SOM (Fig. 2A). The reference structure is the X-ray structure of AHAwt (PDB code: 1AQH), represented as a cartoon trace of the Ca atoms. The yellow dots represent the catalytic triad, and the yellow spheres the position of the mutations studied. In gray the Ca atoms excluded from the analysis. AHATV flexibility, is characterized by changes in two loop regions: the loops surrounding the catalytic site (L3, L5 and L7), and the region including both the N-terminal part of L8 (L8 N, residues , in proximity of the insertion in L8 loop, L8 I, of PPA) and the C-terminal part of L8 (L8 C,residues of AHA). Cluster 6 (orange), mainly populated by AHAQI conformations, describes conformational changes located in the L3 region and in the N-terminal part of L7. Cluster 7 (cyan), with the majority of conformations from the AHAVF dynamics, shows changes associated to a conformational change involving both the loop L7 and the facing loop, including the L8 N, coupled with conformational changes of the L8 C region. Clusters 1 and 2 describe two common dynamic features of the seven mutants (see Fig. 2C), showing conformational changes very similar to those observed in cluster 7, but with a reduced displacement of the loops (cluster 1, green), and the combined displacement of L7 toward the L8 N residues and of L5 in the opposite direction (cluster 2, blue). Not surprisingly, the most relevant differences in the cluster centroids are localized in the regions characterized by the highest difference in flexibility among the different AHA variants. Notably, the L7 loop not only is the region that features the highest RMSF differences in AHAwt and its mutants, but also features significant displacements in each centroid except cluster 3, which is overall the most similar to the wild-type AHA X-ray structure. AHATV conformations are mainly ascribed to cluster 4, which is the cluster featuring remarkable structural changes in all the relevant aforementioned loops. Interestingly, AHATV is also the mutant characterized by the most striking RMSF differences when compared to AHA. 9 SOM analysis of PPA flexibility As mentioned in the Materials and methods section, once a SOM is trained using a given training set, the neurons of the output map can be used to read and assign new data. What is done is a simple comparison between each input vector presented to the map and all the prototype vectors, but the prototype vector that won the data does not modify itself. The conformations extracted from the MD trajectory of PPA were presented to the SOM trained using the MD data of AHAwt and its mutants. Since only the corresponding positions of AHA and PPA in the structural alignment were considered for all systems (Fig. 1B), both the prototype vectors of the map and the vectors with the Ca coordinates of PPA conformations have the same dimension. The results of the assignment of the PPA conformations to the AHA SOM are reported in Fig. 4A. The majority of the conformations of PPA were assigned either to cluster 1 or to cluster 7 and the highest populated neurons (the largest black hexagons) are located in neurons close to the border between these two clusters. More in detail, 71% of the PPA conformations were assigned to cluster 7 (cyan), 19% to cluster 1 (green) and the remaining 10% were distributed in the rest of the map. In Fig. 4B the PPA conformations that are closest to the centroids of clusters 1 and 7 are described by representing the conformational changes with respect to a reference structure (as shown in Fig. 3). Unlike in the analysis of the flexibility of the AHA mutants, where the compared systems shared the same starting structure, in this case the starting structure of PPA shows significant differences with respect to that of AHA, mainly located in the catalytic site and in the region of L8, as shown in Fig. 4C. While in the first case the comparison of the cluster centroids led to a clear definition of the different conformational basins whose sampling only depends on the mutations studied, in this second case the visualization has to take in account the initial structural differences between AHA and PPA plus the conformational sampling of PPA in the MD trajectory. However, by taking these differences into account, the analysis of Fig. 4B This journal is c The Royal Society of Chemistry 2012 Mol. BioSyst., 2012, 8,

7 Fig. 4 SOM analysis of the dynamics of PPA. (A) Assignment of the conformations of the PPA trajectory to the SOM trained with AHA and its mutants (Fig. 2A). The size of the black hexagons is proportional to the number of PPA conformations assigned to each neuron. (B) Porcupine representation of the conformational changes described by each cluster (see Fig. 3 for details). The reference structure used is the X-ray structure of AHAwt. (C) Structural differences among the starting structures of AHAwt (PDB code: 1AHQ) and PPA (PDB code: 1PIF). The length of the needles represents the distance between the two structures. allows us to highlight the similarities of the conformational space sampled by some AHA mutants and that of PPA. Cluster 7 is the cluster which best represents PPA specific features. Interestingly, the PPA conformations in cluster 7 are characterized by displacements in the loops of the catalytic site and in the regions L8 N and L8 C, as it was observed for conformations of AHA mutants in cluster 7 (Fig. 3). This analysis highlights the ability of AHAVF to sample the conformations closest to PPA (see cluster 7 in Fig. 2C). This suggests an important role of this mutation in producing mesophilic-like dynamic properties. Local descriptors of the clustered conformations The above analyses allowed us to identify and cluster differences in the global dynamic properties of AHA mutants, as well as to specifically identify which mutations determine the major shift from the AHA toward the PPA features. A deeper analysis of the obtained final conformational clusters was aimed at verifying if the proposed approach is also able to highlight some local features, related to flexibility of AHA mutants, that were proposed to determine the partial conversion of AHA in a mesophilic-like variant. 9 In particular, two groups of local descriptors of the clustered conformations were analysed: geometrical descriptors and chemical descriptors. The first group gives an account of the arrangement of the three loops surrounding the catalytic site (L3, L5 and L7), measured by the distances between pairs of selected Ca atoms in the loops. As also summarized above, in this region AHA and PPA show a different flexibility. 41 In fact, L7 is characterized by an enhanced conformational freedom in AHA, whereas in PPA the long insertion in L8 loop, which directly interacts with L7, constrains the flexibility of the L7 loop. On the opposite side of the catalytic triad, the flexibility of AHA is mainly gathered in the C-terminal region of L3 (L3 C ) in the immediate proximity of the catalytic residues, whereas in PPA the highest flexibility peaks are located in a more distant region with respect to the active site, such as L3 N and this is likely to be due to the different effects exerted by L5 on L3 conformation. These descriptors employ a sub-set of Ca atom coordinates, i.e. the same information used to train the map. Therefore they allow to analyse the results using all the levels of representation (neurons, centroid and cluster) made available by the SOM. The descriptors belonging to the second group were chosen among the ones previously defined as related to key intramolecular interactions significant to differentiate AHA and PPA: 9,41 salt bridges typical of AHAwt that are affected by the mutations and absent in PPA, and the value of the w 1 dihedral angle of the catalytic residues D264 AHA and D300 PPA. 41 These descriptors analyse information that was not directly used to train the map. Thus they can be represented only at the cluster level, using the original conformations assigned to each cluster. It is relevant to mention that both the groups of descriptors were employed to identify local differences on a map that was trained using as unique input information the global conformational changes described by the different MD ensemble. This is an important characteristic of the SOM approach that allowed us to highlight the relationships between local behaviours and the global domain flexibility and to detect the local features that more influence the dynamic properties in a mesophilic-like direction. Geometrical descriptors: inter-loop distances The centroids of the previously analysed clusters showed that the main conformational changes of the AHA mutants are located around the catalytic site and in L8. To deeply analyse the different flexibilities in these regions, three inter-loop distances were monitored in the simulated ensemble, by measuring the distances between the Ca atoms of the residue with the highest flexibility in each loop (see the RMSF profiles in Fig. 1B). The distances investigated are: L3 L7 (using residues 126 AHA and 271 AHA ), L3 L5 (residues 126 AHA and 206 AHA ), and L7 L8 (residues 273 AHA and 309 AHA ). A neuron level representation of these distances was plotted on Fig. 5A, using a SOM colored in a gray scale. Each prototype vector at the end of the SOM learning process can be interpreted as the average conformation among all the conformations assigned to the neuron. It is thus possible to compute the Euclidean distance between the selected Cas using the corresponding variables of the prototype vectors and to plot these distances directly on the SOM. In the map referred to the L3 L7 distance, the highest distance values are found in the neurons belonging to clusters 5 (magenta) and 7 (cyan). As discussed before (Fig. 3), these clusters describe conformational changes in both L3 and L7 in the opposite 2686 Mol. BioSyst., 2012, 8, This journal is c The Royal Society of Chemistry 2012

8 Fig. 5 Geometrical descriptors: inter-loop distances. (A) Neuron level representation of the distribution of L3 L7, L3 L5 and L7 L8 interloop distances in the SOM trained with AHAwt and its mutants (Fig. 2A). In each neuron, the distance values are calculated between the coordinates of selected Ca atoms belonging to the loops (see text) in the prototype vector. Neurons are colored in the gray scale according to the distance value (in the reference scale the distance values are in A ). The colored lines in each map remind the map clustering (Fig. 2A). (B and C) Cluster level representation of the inter-loop distances of AHAwt and its mutants (full dots) and PPA (empty dots) in the plane L3 L5 vs. L3 L7 (B) and L3 L5 vs. L7 L8 (C), colored according to the SOM clustering (Fig. 2A). direction, corresponding to the opening of the catalytic site. Cluster 3 (red) and 2 (blue) show the lowest values of this distance, defining a more closed catalytic site. In the map reporting the L3 L5 distance, the region of cluster 2 (blue) shows the highest values that, according to the analysis of the centroid conformation (Fig. 3), indicate a displacement of these two loops in the opposite direction; clusters 3 and 5, typical of AHAwt, are characterized by values around 10.0 A. For the L7 L8 distance, the latter clusters (3 and 5) have the highest values in the map, opposed to cluster 2 with the lowest ones. Once the values of each geometrical descriptor are plotted and analysed on the map, the same descriptor can be analysed at the cluster level, using all the conformations assigned to the clusters. In this case the Euclidean distances are computed using the coordinates of the selected Ca atoms in the original conformations. Both the above analysis of the map (Fig. 5A) and the populations of the clusters (Fig. 2C) suggest that some clusters are particularly suitable to highlight the different flexibility of AHAwt and its mutants with respect to PPA. For the most informative clusters, in Fig. 5B and C the interloop distances in the conformations of the AHA mutants and PPA are plotted in the plane L3 L5 vs. L3 L7 and L3 L5 vs. L7 L8, respectively. The plot in Fig. 5B, which summarizes the conformational changes of the catalytic site, shows a good separation of the conformations sampled by AHAwt and PPA that have different flexibility in this area. 41 In fact, as mentioned in the analysis of the cluster centroids, cluster 3 (red) contains the conformations close to the X-ray structure of AHA and cluster 5 (magenta) describes the main conformational changes during AHAwt MD simulation, involving primarily L7; the graph shows low L3 L5 distances for both the clusters and increasing values of the L3 L7 distance from cluster 3 to 5. The conformations of PPA, instead, feature the highest variance in the L3 L5 distance and a conserved value of the L3 L7 distance. This different flexibility of AHAwt and PPA results in a scarce overlap of the two regions in the graph. Interestingly, the conformations in cluster 2 (blue) create a connection between the two; since it contains conformations of all the mutants, without any conformations of AHAwt, it results that the mutations shift the flexibility of the catalytic site of AHA toward mesophilic-like characteristics. In the second plot (Fig. 5C), the displacement L3 L5 in the catalytic site is plotted versus that of L7 toward the corresponding L8 region in PPA. In this last region the structures of AHA and PPA are significantly different (Fig. 4C). This is reflected in the plot by the separation of clusters 5 (that describes AHAwt conformations) and 7 (typical of PPA): while AHAwt conformations have high values of L7 L8 and low values of L3 L5, the opposite behavior is observed for PPA conformations. The analysis of cluster centroids showed characteristic conformational changes in these regions especially in clusters 4 (brown) and 6 (orange), which were mainly populated by conformations of AHATV and AHAQI, respectively. Indeed, the localization of the conformations belonging to these two clusters in the plot indicates that they are intermediate between those of AHAwt and PPA in terms of the two monitored distances: cluster 4 better reproduces the approach of L7 toward L8 typical of PPA, while cluster 6 has values of both the distances intermediate between AHAwt and PPA. Chemical descriptors: salt-bridges Two previously identified salt-bridges 9 were chosen for the analysis, within those with the greatest variability in terms of persistence in the MD simulations among the studied systems. The K177-D203 salt bridge in AHA (K200-D306 in PPA) connects loops L3 and L5, and it is involved in a network of This journal is c The Royal Society of Chemistry 2012 Mol. BioSyst., 2012, 8,

9 electrostatic interactions that is partially disrupted both in some AHA mutants and in PPA. 9 The presence of this saltbridge thus well describes the structural relationships between these two loops and its persistence in the simulations may be correlated with local structural modifications. The E279-K334 salt-bridge connects the L7 and L8 loops at their C-terminal portions, and it is present in AHA only; its persistence will likely be influenced by the conformations of the L7 and L8 loops. In each cluster, the values of the distance between the charged groups of the two interacting residues were calculated and divided in 120 equally sized discrete bins in the range nm. The number of structures collected in each bin describes the relative frequency at which the charged groups are found at the given distance, thus obtaining the distribution of the conformations over the distance between the charged groups. While both for K177-D203 and E279-K334 the obtained distributions are complex and multimodal, remarkably the SOM clustering succeeded in separating the conformations in clusters that also differ for the peculiar behavior of these local features. This can be evidenced by comparing the distributions in the single clusters (Fig. 6). For the K177-D203 salt-bridge (Fig. 6A), clusters 1 (green), 3 (red), 4 (brown) and 7 (cyan) show a distinct peak around 0.2 nm, which means that the salt-bridge is actually present in these clusters. In contrast, in clusters 2 (blue), 5 (red) and 6 (orange) and in the PPA conformations (assigned to clusters 1 and 7) the distribution peak is at higher values (around 0.6 nm), implying that the salt-bridge is not present or at least much less persistent. As cluster 6 is mainly populated by conformations of the AHAQI mutant (Fig. 3) it can be inferred that this single mutation is able to reproduce this PPA local feature. In contrast, the conformations belonging to cluster 7 (cyan), which are the ones which best reproduce the global dynamic behavior of PPA, do not feature the breaking of the K177-D203 salt-bridge, which is otherwise not present in PPA. As the cyan cluster is highly composed by the structures of the AHAVF mutant, it results that this mutation alone is not sufficient to completely restore the PPA characteristics. From what discussed before, it is conceivable that the insertion of this mutation along with one able to disrupt the K177-D203 salt bridge, such as the QI mutation, could modify the flexibility of AHA even more towards that of PPA. The E279-K334 salt-bridge features a significant peak around 0.25 nm in clusters 3 (red) and 5 (magenta), in which the structures of the AHAwt are highly represented. The saltbridge is absent in all the other clusters, as it is in PPA, showing a similar behavior between all the AHA mutants and PPA to this extent. Chemical descriptors: v 1 dihedral of D264 AHA D300 PPA Fig. 6 Chemical descriptors: salt bridges. The bar plots represent the relative frequency of different ranges of the donor acceptor distance for the selected salt bridges in the conformations assigned to the clusters ( cluster level ): (A) salt bridge K177-D203 AHA (K200- D306 PPA ); (B) salt bridge E279-K334 AHA. In both graphs, the width of the bins is set to 0.15 nm. The bins of AHA mutants conformations are colored according to the SOM clustering (Fig. 2A) while the bins of the PPA conformations assigned to clusters 1 and 7 (Fig. 4A) are colored in gray. Both D264 in AHA and D300 in PPA belong to the catalytic triad in the binding site and the orientation of the side-chain of this residue, using the w 1 dihedral as descriptor, was investigated in AHAwt and PPA. 41 A detail of D264 AHA D300 PPA is shown in Fig. 7C, to highlight the difference in the two structures. As described in a previous work 41 the orientation of the side-chain of this residue toward the catalytic site is well correlated with the mobility of L7. The values of the w 1 dihedral of D264 AHA in each cluster are shown in Fig. 7A. Values were divided in discrete bins of 151. The number of structures collected in each bin describes the relative frequency at a given w 1 dihedral, obtaining the distribution of the conformations over the dihedral angle. The circular plots in each line are arranged accordingly to their neighbourhood in the map, i.e. cluster 3 (red) and cluster 5 (magenta), cluster 2 (blue) and cluster 7 (cyan), cluster 1 (green) and cluster 6 (orange), and cluster 4 (brown) by itself. This separation helps the interpretation since neighbourhood in the map means neighbourhood in conformational space, thus transition admitted between one state and the other. In previous work the population of the w 1 dihedral of D264 AHA was described defining three available orientations, centered around 601, 1601 and 3001, 41 while for the PPA population of w 1 dihedral of D300 PPA the orientation close to 3001 was not sampled. Considering the first line of the Fig. 7A, 2688 Mol. BioSyst., 2012, 8, This journal is c The Royal Society of Chemistry 2012

10 composed by clusters 3 and 5 typical of AHAwt dynamic properties, all of these three orientations are populated, but in cluster 3 the population is centered around the orientation in the AHA crystal structure (3001), while the orientation around 1601 is preferred in cluster 5. The orientation close to PPA crystal structure (601) is not the preferred in both the clusters. The second and third lines of Fig. 7A can be associated in terms of populations and shift from one orientation to the other. As described previously, both clusters 2 and 1 contain conformations of all the studied mutants while clusters 6 and 7 can be considered typical of AHAQI and AHAVF, respectively. In both the couples there is a shift from a population centered in 601 to a population centered in A peculiar behavior is observed in cluster 4, found as the typical cluster of AHATV, in which the population is completely centered in 601, assuming the same conformation as the PPA crystal structure. The PPA conformations read and assigned to the clusters are described using the same graphs in Fig. 7B. As mentioned above, the orientation around 3001 was not observed for PPA and in fact this orientation is not sampled. Also in this case the different assignment to different clusters in the map leads to the same shift observed for the AHA conformations. In fact while the population of the w1 dihedral of D300PPA is centered in 601 for cluster 1, as previously observed, in cluster 7 it is centered around As previously seen for salt-bridge persistence there is one cluster, cluster 7, that best reproduces the overall dynamics of PPA and there is one cluster, cluster 4, that best reproduces this local descriptor compared with PPA. The insertion of the mutations characterized by clusters 7 (AHAVF) and 4 (AHATV) in AHAwt could improve the overall behavior even more toward PPA. Conclusions Fig. 7 Chemical descriptors: w1 dihedral of D264AHA and D300PPA. (A) Distribution of the values of the w1 dihedral of D264AHA in the conformations of AHA mutants ( cluster level ) assigned to the SOM clusters (Fig. 2A). (B) Distribution of the values of the w1 dihedral of D300PPA in the PPA conformations assigned to clusters 1 and 7 (Fig. 4A). In all the plots: the bin size was set to 151, the cluster color code is the same adopted in Fig. 2A; the red and blue lines are the w1 values of D264AHA and D300PPA in the corresponding crystal structures, respectively. (C) Cartoon representation of the structural superimposition of the crystal structure of AHAwt (red) and PPA (blue), in the surrounding of residues D264AHA and D300PPA, represented as sticks. Loops L3, L5 and L7 are labeled and residues D174AHA and E200AHA are represented as yellow dots. This journal is c The Royal Society of Chemistry 2012 The set of seven AHA mutants here studied are functional intermediates between psychrophilic and mesophilic enzymes featuring a different degree of mesophilic-like character,9,28,29 and are characterized by a strong link between structural dynamics and functional activity, similarly to other coldadapted enzymes.23,24,47 Therefore, it is expected that a multiple comparison of their dynamics could help to elucidate relative functional effects, giving a crucial contribution to protein engineering studies. MD conformational ensembles could be used for this purpose, but, unfortunately, this incurs in a two-fold challenge. First, while the comparison of simulation replicas is routinely done for single systems, there is still a need of standard approaches for an effective comparison of trajectories of different mutants. Second, such a type of approaches should also facilitate the direct interpretation of similarities in relation to functional activity. We recently proposed a two-level approach that combines SOMs and hierarchical clustering. This is a suitable and general tool to detect and functionally annotate similarities and differences in MD ensembles.7 This approach allows the comparison of multiple trajectories of different systems at the same time, as well as the detection of subtle differences otherwise unrecoverable with analyses based on global flexibility Mol. BioSyst., 2012, 8,

Molecular design principles underlying β-strand swapping. in the adhesive dimerization of cadherins

Molecular design principles underlying β-strand swapping. in the adhesive dimerization of cadherins Supplementary information for: Molecular design principles underlying β-strand swapping in the adhesive dimerization of cadherins Jeremie Vendome 1,2,3,5, Shoshana Posy 1,2,3,5,6, Xiangshu Jin, 1,3 Fabiana

More information

BMB/Bi/Ch 170 Fall 2017 Problem Set 1: Proteins I

BMB/Bi/Ch 170 Fall 2017 Problem Set 1: Proteins I BMB/Bi/Ch 170 Fall 2017 Problem Set 1: Proteins I Please use ray-tracing feature for all the images you are submitting. Use either the Ray button on the right side of the command window in PyMOL or variations

More information

Bioinformatics & Protein Structural Analysis. Bioinformatics & Protein Structural Analysis. Learning Objective. Proteomics

Bioinformatics & Protein Structural Analysis. Bioinformatics & Protein Structural Analysis. Learning Objective. Proteomics The molecular structures of proteins are complex and can be defined at various levels. These structures can also be predicted from their amino-acid sequences. Protein structure prediction is one of the

More information

Comparative Modeling Part 1. Jaroslaw Pillardy Computational Biology Service Unit Cornell Theory Center

Comparative Modeling Part 1. Jaroslaw Pillardy Computational Biology Service Unit Cornell Theory Center Comparative Modeling Part 1 Jaroslaw Pillardy Computational Biology Service Unit Cornell Theory Center Function is the most important feature of a protein Function is related to structure Structure is

More information

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1 Supplementary Figure 1 Domain architecture and conformational states of the decapping complex, as revealed by structural studies. (a) Domain organization of Schizosaccharomyces pombe (Sp) and Saccharomyces

More information

Structural Bioinformatics (C3210) Conformational Analysis Protein Folding Protein Structure Prediction

Structural Bioinformatics (C3210) Conformational Analysis Protein Folding Protein Structure Prediction Structural Bioinformatics (C3210) Conformational Analysis Protein Folding Protein Structure Prediction Conformational Analysis 2 Conformational Analysis Properties of molecules depend on their three-dimensional

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION Supplementary Figure 1. Mechanism for signal-induced opening of the DNA box. a, An atomic model of the DNA box held closed by locks (orange and blue) that are double helices formed by two short strands

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION doi: 10.1038/nature08627 Supplementary Figure 1. DNA sequences used to construct nucleosomes in this work. a, DNA sequences containing the 601 positioning sequence (blue)24 with a PstI restriction site

More information

Protein Folding Problem I400: Introduction to Bioinformatics

Protein Folding Problem I400: Introduction to Bioinformatics Protein Folding Problem I400: Introduction to Bioinformatics November 29, 2004 Protein biomolecule, macromolecule more than 50% of the dry weight of cells is proteins polymer of amino acids connected into

More information

Ensemble refinement shows conformational flexibility in crystal structures of human complement factor D

Ensemble refinement shows conformational flexibility in crystal structures of human complement factor D Supplementary Information for Ensemble refinement shows conformational flexibility in crystal structures of human complement factor D Federico Forneris a,b, B. Tom Burnley a,b,c and Piet Gros a * a Crystal

More information

Nature Structural & Molecular Biology: doi: /nsmb.3428

Nature Structural & Molecular Biology: doi: /nsmb.3428 Supplementary Figure 1 Biochemical characterization of the monou and oligou activity switch of TUT4(7). (a) Mouse TUT4 and human TUT7 were assayed for monou and Lin28-dependent oligou addition activities

More information

Supplementary Fig. 1. Initial electron density maps for the NOX-D20:mC5a complex obtained after SAD-phasing. (a) Initial experimental electron

Supplementary Fig. 1. Initial electron density maps for the NOX-D20:mC5a complex obtained after SAD-phasing. (a) Initial experimental electron Supplementary Fig. 1. Initial electron density maps for the NOX-D20:mC5a complex obtained after SAD-phasing. (a) Initial experimental electron density map obtained after SAD-phasing and density modification

More information

Supplementary Information. Single-molecule analysis reveals multi-state folding of a guanine. riboswitch

Supplementary Information. Single-molecule analysis reveals multi-state folding of a guanine. riboswitch Supplementary Information Single-molecule analysis reveals multi-state folding of a guanine riboswitch Vishnu Chandra 1,4,#, Zain Hannan 1,5,#, Huizhong Xu 2,# and Maumita Mandal 1,2,3,6* Department of

More information

Assessing a novel approach for predicting local 3D protein structures from sequence

Assessing a novel approach for predicting local 3D protein structures from sequence Assessing a novel approach for predicting local 3D protein structures from sequence Cristina Benros*, Alexandre G. de Brevern, Catherine Etchebest and Serge Hazout Equipe de Bioinformatique Génomique et

More information

Site-specific time-resolved FRET reveals local variations in the unfolding mechanism in an apparently two-state protein unfolding transition

Site-specific time-resolved FRET reveals local variations in the unfolding mechanism in an apparently two-state protein unfolding transition Electronic Supplementary Material (ESI) for Physical Chemistry Chemical Physics. This journal is the Owner Societies 2017 Supplementary information for Site-specific time-resolved FRET reveals local variations

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION doi: 10.1038/nature06147 SUPPLEMENTARY INFORMATION Figure S1 The genomic and domain structure of Dscam. The Dscam gene comprises 24 exons, encoding a signal peptide (SP), 10 IgSF domains, 6 fibronectin

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION This supplementary information is an extension of the letter with the same title and includes further discussion on the comparison of our designed Fe B Mb (computer model and crystal structure) with the

More information

Suppl. Figure 1: RCC1 sequence and sequence alignments. (a) Amino acid

Suppl. Figure 1: RCC1 sequence and sequence alignments. (a) Amino acid Supplementary Figures Suppl. Figure 1: RCC1 sequence and sequence alignments. (a) Amino acid sequence of Drosophila RCC1. Same colors are for Figure 1 with sequence of β-wedge that interacts with Ran in

More information

Exploring Similarities of Conserved Domains/Motifs

Exploring Similarities of Conserved Domains/Motifs Exploring Similarities of Conserved Domains/Motifs Sotiria Palioura Abstract Traditionally, proteins are represented as amino acid sequences. There are, though, other (potentially more exciting) representations;

More information

Electronic Supplementary Information

Electronic Supplementary Information Electronic Supplementary Material (ESI) for Physical Chemistry Chemical Physics. This journal is the Owner Societies 2018 Electronic Supplementary Information Unraveling the effects of amino acid substitutions

More information

Tutorial. Visualize Variants on Protein Structure. Sample to Insight. November 21, 2017

Tutorial. Visualize Variants on Protein Structure. Sample to Insight. November 21, 2017 Visualize Variants on Protein Structure November 21, 2017 Sample to Insight QIAGEN Aarhus Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.qiagenbioinformatics.com AdvancedGenomicsSupport@qiagen.com

More information

Research in Structural Bioinformatics and Molecular Biophysics. OUTLINE: What is it and why is it useful? EXAMPLES: b. Improving enzyme s function.

Research in Structural Bioinformatics and Molecular Biophysics. OUTLINE: What is it and why is it useful? EXAMPLES: b. Improving enzyme s function. Research in Structural Bioinformatics and Molecular Biophysics OUTLINE: What is it and why is it useful? EXAMPLES: a. Biomolecular surface story. b. Improving enzyme s function. c. Folding proteins. Alexey

More information

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1 Supplementary Figure 1 Multiple sequence alignments of four Swi2/Snf2 subfamily proteins, ScChd1, SsoRad54 and the RNA helicase Vasa. The sequence alignments of the Swi2/Snf2 subfamily proteins, ScChd1

More information

Supplementary Information for. Structure of human tyrosylprotein sulfotransferase-2 reveals the mechanism of protein tyrosine sulfation reaction

Supplementary Information for. Structure of human tyrosylprotein sulfotransferase-2 reveals the mechanism of protein tyrosine sulfation reaction Supplementary Information for Structure of human tyrosylprotein sulfotransferase-2 reveals the mechanism of protein tyrosine sulfation reaction Takamasa Teramoto, Yukari Fujikawa, Yoshirou Kawaguchi, Katsuhisa

More information

Final exam. Please write your name on the exam and keep an ID card ready.

Final exam. Please write your name on the exam and keep an ID card ready. Biophysics of Macromolecules Prof. R. Jungmann and Prof. J. Lipfert SS 2017 Final exam Final exam First name: Last name: Student number ( Matrikelnummer ): Please write your name on the exam and keep an

More information

Hmwk # 8 : DNA-Binding Proteins : Part II

Hmwk # 8 : DNA-Binding Proteins : Part II The purpose of this exercise is : Hmwk # 8 : DNA-Binding Proteins : Part II 1). to examine the case of a tandem head-to-tail homodimer binding to DNA 2). to view a Zn finger motif 3). to consider the case

More information

Protein Structure Prediction

Protein Structure Prediction Homology Modeling Protein Structure Prediction Ingo Ruczinski M T S K G G G Y F F Y D E L Y G V V V V L I V L S D E S Department of Biostatistics, Johns Hopkins University Fold Recognition b Initio Structure

More information

X-ray structures of fructosyl peptide oxidases revealing residues responsible for gating oxygen access in the oxidative half reaction

X-ray structures of fructosyl peptide oxidases revealing residues responsible for gating oxygen access in the oxidative half reaction X-ray structures of fructosyl peptide oxidases revealing residues responsible for gating oxygen access in the oxidative half reaction Tomohisa Shimasaki 1, Hiromi Yoshida 2, Shigehiro Kamitori 2 & Koji

More information

Current Status and Future Prospects p. 46 Acknowledgements p. 46 References p. 46 Hammerhead Ribozyme Crystal Structures and Catalysis

Current Status and Future Prospects p. 46 Acknowledgements p. 46 References p. 46 Hammerhead Ribozyme Crystal Structures and Catalysis Ribozymes and RNA Catalysis: Introduction and Primer What are Ribozymes? p. 1 What is the Role of Ribozymes in Cells? p. 1 Ribozymes Bring about Significant Rate Enhancements p. 4 Why Study Ribozymes?

More information

4/10/2011. Rosetta software package. Rosetta.. Conformational sampling and scoring of models in Rosetta.

4/10/2011. Rosetta software package. Rosetta.. Conformational sampling and scoring of models in Rosetta. Rosetta.. Ph.D. Thomas M. Frimurer Novo Nordisk Foundation Center for Potein Reseach Center for Basic Metabilic Research Breif introduction to Rosetta Rosetta docking example Rosetta software package Breif

More information

PCA and SOM based Dimension Reduction Techniques for Quaternary Protein Structure Prediction

PCA and SOM based Dimension Reduction Techniques for Quaternary Protein Structure Prediction PCA and SOM based Dimension Reduction Techniques for Quaternary Protein Structure Prediction Sanyukta Chetia Department of Electronics and Communication Engineering, Gauhati University-781014, Guwahati,

More information

Supplementary Fig. S1. SAMHD1c has a more potent dntpase activity than. SAMHD1c. Purified recombinant SAMHD1c and SAMHD1c proteins (with

Supplementary Fig. S1. SAMHD1c has a more potent dntpase activity than. SAMHD1c. Purified recombinant SAMHD1c and SAMHD1c proteins (with Supplementary Fig. S1. SAMHD1c has a more potent dntpase activity than SAMHD1c. Purified recombinant SAMHD1c and SAMHD1c proteins (with concentration of 800nM) were incubated with 1mM dgtp for the indicated

More information

Clamping down on pathogenic bacteria how to shut down a key DNA polymerase complex

Clamping down on pathogenic bacteria how to shut down a key DNA polymerase complex Clamping down on pathogenic bacteria how to shut down a key DNA polymerase complex Bacterial DNA-replication machinery Pathogenic bacteria that are resistant to the current armoury of antibiotics are an

More information

Title: A topological and conformational stability alphabet for multi-pass membrane proteins

Title: A topological and conformational stability alphabet for multi-pass membrane proteins Supplementary Information Title: A topological and conformational stability alphabet for multi-pass membrane proteins Authors: Feng, X. 1 & Barth, P. 1,2,3 Correspondences should be addressed to: P.B.

More information

Conformational changes in IgE contribute to its. uniquely slow dissociation rate from receptor FcεRI

Conformational changes in IgE contribute to its. uniquely slow dissociation rate from receptor FcεRI Conformational changes in IgE contribute to its uniquely slow dissociation rate from receptor FcεRI M.D. Holdom, A.M. Davies, J.E. Nettleship, S.C. Bagby, B. Dhaliwal, E. Girardi, J. Hunt, H.J. Gould,

More information

HEK293T. Fig. 1 in the

HEK293T. Fig. 1 in the Supplementary Information Supplementary Figure 1 Zinc uptake assay of hzip4 and hzip4-δecd transiently expressed in HEK293T cells. The results of one representative e experiment are shown in Fig. 1 in

More information

COMP364. Introduction to RNA secondary structure prediction. Jérôme Waldispühl School of Computer Science, McGill

COMP364. Introduction to RNA secondary structure prediction. Jérôme Waldispühl School of Computer Science, McGill COMP364 Introduction to RNA secondary structure prediction Jérôme Waldispühl School of Computer Science, McGill RNA world In prebiotic world, RNA thought to have filled two distinct roles: 1. an information

More information

What s New in Discovery Studio 2.5.5

What s New in Discovery Studio 2.5.5 What s New in Discovery Studio 2.5.5 Discovery Studio takes modeling and simulations to the next level. It brings together the power of validated science on a customizable platform for drug discovery research.

More information

CSE : Computational Issues in Molecular Biology. Lecture 19. Spring 2004

CSE : Computational Issues in Molecular Biology. Lecture 19. Spring 2004 CSE 397-497: Computational Issues in Molecular Biology Lecture 19 Spring 2004-1- Protein structure Primary structure of protein is determined by number and order of amino acids within polypeptide chain.

More information

Sequence Analysis '17 -- lecture Secondary structure 3. Sequence similarity and homology 2. Secondary structure prediction

Sequence Analysis '17 -- lecture Secondary structure 3. Sequence similarity and homology 2. Secondary structure prediction Sequence Analysis '17 -- lecture 16 1. Secondary structure 3. Sequence similarity and homology 2. Secondary structure prediction Alpha helix Right-handed helix. H-bond is from the oxygen at i to the nitrogen

More information

Not The Real Exam Just for Fun! Chemistry 391 Fall 2018

Not The Real Exam Just for Fun! Chemistry 391 Fall 2018 Not The Real Exam Just for Fun! Chemistry 391 Fall 2018 Do not open the exam until ready to begin! Rules of the Game: This is a take-home Exam. The exam is due on Thursday, October 11 th at 9 AM. Otherwise,

More information

Learning to Use PyMOL (includes instructions for PS #2)

Learning to Use PyMOL (includes instructions for PS #2) Learning to Use PyMOL (includes instructions for PS #2) To begin, download the saved PyMOL session file, 4kyz.pse from the Chem 391 Assignments web page: http://people.reed.edu/~glasfeld/chem391/assign.html

More information

1.2 Geometry of a simplified two dimensional model molecule Discrete flow of empty space illustrated for two dimensional disks.

1.2 Geometry of a simplified two dimensional model molecule Discrete flow of empty space illustrated for two dimensional disks. 1.1 Geometric models of protein surfaces. 3 1.2 Geometry of a simplified two dimensional model molecule. 5 1.3 The family of alpha shapes or dual simplicial complexes for a two-dimensional toy molecule.

More information

Nagahama Institute of Bio-Science and Technology. National Institute of Genetics and SOKENDAI Nagahama Institute of Bio-Science and Technology

Nagahama Institute of Bio-Science and Technology. National Institute of Genetics and SOKENDAI Nagahama Institute of Bio-Science and Technology A Large-scale Batch-learning Self-organizing Map for Function Prediction of Poorly-characterized Proteins Progressively Accumulating in Sequence Databases Project Representative Toshimichi Ikemura Authors

More information

Structural bioinformatics

Structural bioinformatics Structural bioinformatics Why structures? The representation of the molecules in 3D is more informative New properties of the molecules are revealed, which can not be detected by sequences Eran Eyal Plant

More information

Structural Analysis of the EGR Family of Transcription Factors: Templates for Predicting Protein DNA Interactions

Structural Analysis of the EGR Family of Transcription Factors: Templates for Predicting Protein DNA Interactions Introduction Structural Analysis of the EGR Family of Transcription Factors: Templates for Predicting Protein DNA Interactions Jamie Duke, Rochester Institute of Technology Mentor: Carlos Camacho, University

More information

STRUCTURAL BIOLOGY. α/β structures Closed barrels Open twisted sheets Horseshoe folds

STRUCTURAL BIOLOGY. α/β structures Closed barrels Open twisted sheets Horseshoe folds STRUCTURAL BIOLOGY α/β structures Closed barrels Open twisted sheets Horseshoe folds The α/β domains Most frequent domain structures are α/β domains: A central parallel or mixed β sheet Surrounded by α

More information

CFSSP: Chou and Fasman Secondary Structure Prediction server

CFSSP: Chou and Fasman Secondary Structure Prediction server Wide Spectrum, Vol. 1, No. 9, (2013) pp 15-19 CFSSP: Chou and Fasman Secondary Structure Prediction server T. Ashok Kumar Department of Bioinformatics, Noorul Islam College of Arts and Science, Kumaracoil

More information

CS273: Algorithms for Structure Handout # 5 and Motion in Biology Stanford University Tuesday, 13 April 2004

CS273: Algorithms for Structure Handout # 5 and Motion in Biology Stanford University Tuesday, 13 April 2004 CS273: Algorithms for Structure Handout # 5 and Motion in Biology Stanford University Tuesday, 13 April 2004 Lecture #5: 13 April 2004 Topics: Sequence motif identification Scribe: Samantha Chui 1 Introduction

More information

Antibody-Antigen recognition. Structural Biology Weekend Seminar Annegret Kramer

Antibody-Antigen recognition. Structural Biology Weekend Seminar Annegret Kramer Antibody-Antigen recognition Structural Biology Weekend Seminar 10.07.2005 Annegret Kramer Contents Function and structure of antibodies Features of antibody-antigen interfaces Examples of antibody-antigen

More information

6-Foot Mini Toober Activity

6-Foot Mini Toober Activity Big Idea The interaction between the substrate and enzyme is highly specific. Even a slight change in shape of either the substrate or the enzyme may alter the efficient and selective ability of the enzyme

More information

This place covers: Methods or systems for genetic or protein-related data processing in computational molecular biology.

This place covers: Methods or systems for genetic or protein-related data processing in computational molecular biology. G16B BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY Methods or systems for genetic

More information

EPSRC Centre for Doctoral Training in Industrially Focused Mathematical Modelling

EPSRC Centre for Doctoral Training in Industrially Focused Mathematical Modelling EPSRC Centre for Doctoral Training in Industrially Focused Mathematical Modelling Swarm Robotics in Bearing-Only Formations Joseph Field Contents 1. Introduction... 2 2. Formulating the Problem... 3 Glossary

More information

Protein design. CS/CME/BioE/Biophys/BMI 279 Oct. 24, 2017 Ron Dror

Protein design. CS/CME/BioE/Biophys/BMI 279 Oct. 24, 2017 Ron Dror Protein design CS/CME/BioE/Biophys/BMI 279 Oct. 24, 2017 Ron Dror 1 Outline Why design proteins? Overall approach: Simplifying the protein design problem < this step is really key! Protein design methodology

More information

Supplementary Figure 1 Two distinct conformational states of the HNH domain in crystal structures. a

Supplementary Figure 1 Two distinct conformational states of the HNH domain in crystal structures. a Supplementary Figure 1 Two distinct conformational states of the HNH domain in crystal structures. a HNH-state 1 in PDB 4OO8, in which the distance from the C atom of the HNH catalytic residue 840 to the

More information

Hmwk 6. Nucleic Acids

Hmwk 6. Nucleic Acids The purpose of this homework exercise is Hmwk 6. Nucleic Acids 1). to recognize fundamental features of B-form DNA and A-form RNA 2). to view the folded structure of trna B-FORM DNA In aqueous solutions,

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION doi:10.1038/nature16191 SUPPLEMENTARY DISCUSSION We analyzed the current database of solved protein structures to assess the uniqueness of our designed toroids in terms of global similarity and bundle

More information

Sequence Databases and database scanning

Sequence Databases and database scanning Sequence Databases and database scanning Marjolein Thunnissen Lund, 2012 Types of databases: Primary sequence databases (proteins and nucleic acids). Composite protein sequence databases. Secondary databases.

More information

Supplementary Materials. for. array reveals biophysical and evolutionary landscapes

Supplementary Materials. for. array reveals biophysical and evolutionary landscapes Supplementary Materials for Quantitative analysis of RNA- protein interactions on a massively parallel array reveals biophysical and evolutionary landscapes Jason D. Buenrostro 1,2,4, Carlos L. Araya 1,4,

More information

Textbook Reading Guidelines

Textbook Reading Guidelines Understanding Bioinformatics by Marketa Zvelebil and Jeremy Baum Last updated: May 1, 2009 Textbook Reading Guidelines Preface: Read the whole preface, and especially: For the students with Life Science

More information

Nature Methods: doi: /nmeth Supplementary Figure 1. Construction of a sensitive TetR mediated auxotrophic off-switch.

Nature Methods: doi: /nmeth Supplementary Figure 1. Construction of a sensitive TetR mediated auxotrophic off-switch. Supplementary Figure 1 Construction of a sensitive TetR mediated auxotrophic off-switch. A Production of the Tet repressor in yeast when conjugated to either the LexA4 or LexA8 promoter DNA binding sequences.

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION doi:10.1038/nature10963 Supplementary Table 1 Data collection, phasing and refinement statistics. Crystal Native Derivative-1 (OsO 4 ) Derivative-2 (Orange-Pt) Data collection Space group C2 C2 C2 Cell

More information

SUPPLEMENTARY INFORMATION Figures. Supplementary Figure 1 a. Page 1 of 30. Nature Chemical Biology: doi: /nchembio.2528

SUPPLEMENTARY INFORMATION Figures. Supplementary Figure 1 a. Page 1 of 30. Nature Chemical Biology: doi: /nchembio.2528 SUPPLEMENTARY INFORMATION Figures Supplementary Figure 1 a b c Page 1 of 0 11 Supplementary Figure 1: Biochemical characterisation and binding validation of the reversible USP inhibitor 1. a, Biochemical

More information

Globins. The Backbone structure of Myoglobin 2. The Heme complex in myoglobin. Lecture 10/01/2009. Role of the Globin.

Globins. The Backbone structure of Myoglobin 2. The Heme complex in myoglobin. Lecture 10/01/2009. Role of the Globin. Globins Lecture 10/01/009 The Backbone structure of Myoglobin Myoglobin: 44 x 44 x 5 Å single subunit 153 amino acid residues 11 residues are in an a helix. Helices are named A, B, C, F. The heme pocket

More information

Conformational Equilibria and Free Energy Profiles for the Allosteric Transition of the Ribose-binding Protein

Conformational Equilibria and Free Energy Profiles for the Allosteric Transition of the Ribose-binding Protein doi:10.1016/j.jmb.2005.08.009 J. Mol. Biol. (2005) 353, 196 210 Conformational Equilibria and Free Energy Profiles for the Allosteric Transition of the Ribose-binding Protein Krishna Pratap Ravindranathan,

More information

Supplementary Information

Supplementary Information Supplementary Information From non-covalent binding to irreversible DNA lesions: nile blue and nile red as photosensitizing agents Hugo Gattuso, a,b Vanessa Besancenot, c,d Stéphanie Grandemange, c,d Marco

More information

Bi 8 Lecture 7. Ellen Rothenberg 26 January Reading: Ch. 3, pp ; panel 3-1

Bi 8 Lecture 7. Ellen Rothenberg 26 January Reading: Ch. 3, pp ; panel 3-1 Bi 8 Lecture 7 PROTEIN STRUCTURE, Functional analysis, and evolution Ellen Rothenberg 26 January 2016 Reading: Ch. 3, pp. 109-134; panel 3-1 (end with free amine) aromatic, hydrophobic small, hydrophilic

More information

CAVER Analyst 1.0: Graphic tool for interactive visualization and analysis of tunnels in protein structures

CAVER Analyst 1.0: Graphic tool for interactive visualization and analysis of tunnels in protein structures Supplementary data CAVER Analyst 1.0: Graphic tool for interactive visualization and analysis of tunnels in protein structures Barbora Kozlikova 1,#, Eva Sebestova 2,# Vilem Sustr 1, Jan Brezovsky 2, Ondrej

More information

BRITISH BIOMEDICAL BULLETIN

BRITISH BIOMEDICAL BULLETIN Journal Home Page www.bbbulletin.org BRITISH BIOMEDICAL BULLETIN Original Bioinformatics Analysis of α-amylase Three-Dimensional Structure in Aspergillus oryzae Javad Sharifi-Rad,2, Fatemeh Taktaz* 3 and

More information

Molecular Structures

Molecular Structures Molecular Structures 1 Molecular structures 2 Why is it important? Answers to scientific questions such as: What does the structure of protein X look like? Can we predict the binding of molecule X to Y?

More information

Nature Structural & Molecular Biology: doi: /nsmb.2548

Nature Structural & Molecular Biology: doi: /nsmb.2548 Supplementary Figure 1. Structure of GltPhout. (a) Stereo view of a slice through a single GltPhout protomer shown in stick representation along with 2Fo-Fc and anomalous difference electron maps. The

More information

Protein Structure/Function Relationships

Protein Structure/Function Relationships Protein Structure/Function Relationships W. M. Grogan, Ph.D. OBJECTIVES 1. Describe and cite examples of fibrous and globular proteins. 2. Describe typical tertiary structural motifs found in proteins.

More information

COMP598: Advanced Computational Biology Methods & Research

COMP598: Advanced Computational Biology Methods & Research COMP598: Advanced Computational Biology Methods & Research Introduction to RNA secondary structure prediction Jérôme Waldispühl School of Computer Science, McGill RNA world In prebiotic world, RNA thought

More information

Supplementary Figure 1. FRET probe labeling locations in the Cas9-RNA-DNA complex.

Supplementary Figure 1. FRET probe labeling locations in the Cas9-RNA-DNA complex. Supplementary Figure 1. FRET probe labeling locations in the Cas9-RNA-DNA complex. (a) Cy3 and Cy5 labeling locations shown in the crystal structure of Cas9-RNA bound to a cognate DNA target (PDB ID: 4UN3)

More information

systemsdock Operation Manual

systemsdock Operation Manual systemsdock Operation Manual Version 2.0 2016 April systemsdock is being developed by Okinawa Institute of Science and Technology http://www.oist.jp/ Integrated Open Systems Unit http://openbiology.unit.oist.jp/_new/

More information

TileSoft: Sequence Optimization Software for Designing DNA Secondary Structures

TileSoft: Sequence Optimization Software for Designing DNA Secondary Structures 1 TileSoft: Sequence Optimization Software for Designing DNA Secondary Structures P. Yin*, B. Guo*, C. Belmore*, W. Palmeri*, E. Winfree, T. H. LaBean* and J. H. Reif* * Department of Computer Science,

More information

Molecular Modeling Lecture 8. Local structure Database search Multiple alignment Automated homology modeling

Molecular Modeling Lecture 8. Local structure Database search Multiple alignment Automated homology modeling Molecular Modeling 2018 -- Lecture 8 Local structure Database search Multiple alignment Automated homology modeling An exception to the no-insertions-in-helix rule Actual structures (myosin)! prolines

More information

CS612 - Algorithms in Bioinformatics

CS612 - Algorithms in Bioinformatics Spring 2012 Class 6 February 9, 2012 Backbone and Sidechain Rotational Degrees of Freedom Protein 3-D Structure The 3-dimensional fold of a protein is called a tertiary structure. Many proteins consist

More information

Molecular Structures

Molecular Structures Molecular Structures 1 Molecular structures 2 Why is it important? Answers to scientific questions such as: What does the structure of protein X look like? Can we predict the binding of molecule X to Y?

More information

Supplementary Information. Structural basis for duplex RNA recognition and cleavage by A.

Supplementary Information. Structural basis for duplex RNA recognition and cleavage by A. Supplementary Information Structural asis for duplex RNA recognition and cleavage y A. fulgidus C3PO Eneida arizotto 1, Edward D Lowe 1 & James S Parker 1 1 Department of Biochemistry University of Oxford

More information

Worked Example of Humanized Fab D3h44 in Complex with Tissue Factor

Worked Example of Humanized Fab D3h44 in Complex with Tissue Factor Worked Example of Humanized Fab D3h44 in Complex with Tissue Factor Here we provide an example worked in detail from antibody sequence and unbound antigen structure to a docked model of the antibody antigen

More information

BIOINFORMATICS Introduction

BIOINFORMATICS Introduction BIOINFORMATICS Introduction Mark Gerstein, Yale University bioinfo.mbb.yale.edu/mbb452a 1 (c) Mark Gerstein, 1999, Yale, bioinfo.mbb.yale.edu What is Bioinformatics? (Molecular) Bio -informatics One idea

More information

Visualization of codon-dependent conformational rearrangements during translation termination

Visualization of codon-dependent conformational rearrangements during translation termination Supplementary information for: Visualization of codon-dependent conformational rearrangements during translation termination Shan L. He 1 and Rachel Green 1 1 Howard Hughes Medical Institute, Department

More information

Molecular Dynamics of Proteins

Molecular Dynamics of Proteins Molecular Dynamics of Proteins Ubiquitin Myoglobin Equilibrium Properties of Proteins Ubiquitin Root Mean Squared Deviation: measure for equilibration and protein flexibility RMSD constant protein equilibrated

More information

Relationship between nucleotide sequence and 3D protein structure of six genes in Escherichia coli, by analysis of DNA sequence using a Markov model

Relationship between nucleotide sequence and 3D protein structure of six genes in Escherichia coli, by analysis of DNA sequence using a Markov model Relationship between nucleotide sequence and 3D protein structure of six genes in Escherichia coli, by analysis of DNA sequence using a Markov model Yuko Ohfuku 1,, 3*, Hideo Tanaka and Masami Uebayasi

More information

Supporting Information

Supporting Information Supporting Information Shuffling Active Site Substate Populations Affects Catalytic Activity: The Case of Glucose Oxidase Dušan Petrović, 1 David Frank, 2, Shina Caroline Lynn Kamerlin, 3 Kurt Hoffmann,

More information

Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions. Supplementary Material

Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions. Supplementary Material Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions Joshua N. Burton 1, Andrew Adey 1, Rupali P. Patwardhan 1, Ruolan Qiu 1, Jacob O. Kitzman 1, Jay Shendure 1 1 Department

More information

7/14/2015. Single-cell-level measurements of transcription heterogeneity of highly mobile identical genes. Enrico Gratton and Paolo Annibale

7/14/2015. Single-cell-level measurements of transcription heterogeneity of highly mobile identical genes. Enrico Gratton and Paolo Annibale 7//5 Single-cell-level measurements of transcription heterogeneity of highly mobile identical genes Enrico Gratton and Paolo Annibale Laboratory for Florescence Dynamics University of California, Irvine

More information

Nature Structural & Molecular Biology: doi: /nsmb.2307

Nature Structural & Molecular Biology: doi: /nsmb.2307 A novel locally-closed conformation of a bacterial pentameric proton-gated ion channel Marie S. Prevost*, Ludovic Sauguet*, Hugues Nury, Catherine Van Renterghem, Christèle Huon, Frederic Poitevin, Marc

More information

Giri Narasimhan. CAP 5510: Introduction to Bioinformatics. ECS 254; Phone: x3748

Giri Narasimhan. CAP 5510: Introduction to Bioinformatics. ECS 254; Phone: x3748 CAP 5510: Introduction to Bioinformatics Giri Narasimhan ECS 254; Phone: x3748 giri@cis.fiu.edu www.cis.fiu.edu/~giri/teach/bioinfs07.html 2/19/07 CAP5510 1 HMM for Sequence Alignment 2/19/07 CAP5510 2

More information

SUPPLEMENTARY INFORMATION. Supplementary Figures 1-8

SUPPLEMENTARY INFORMATION. Supplementary Figures 1-8 SUPPLEMENTARY INFORMATION Supplementary Figures 1-8 Supplementary Figure 1. TFAM residues contacting the DNA minor groove (A) TFAM contacts on nonspecific DNA. Leu58, Ile81, Asn163, Pro178, and Leu182

More information

has only one nucleotide, U20, between G19 and A21, while trna Glu CUC has two

has only one nucleotide, U20, between G19 and A21, while trna Glu CUC has two SPPLEMENTRY INFORMTION doi:1.138/nature9411 Supplementary Discussion The structural characteristics of trn Gln G in comparison to trn Glu 16 in trn Gln G is directed towards the G19 56 pair, or the outer

More information

A Decision Support System for Market Segmentation - A Neural Networks Approach

A Decision Support System for Market Segmentation - A Neural Networks Approach Association for Information Systems AIS Electronic Library (AISeL) AMCIS 1995 Proceedings Americas Conference on Information Systems (AMCIS) 8-25-1995 A Decision Support System for Market Segmentation

More information

Prediction of Protein-Protein Binding Sites and Epitope Mapping. Copyright 2017 Chemical Computing Group ULC All Rights Reserved.

Prediction of Protein-Protein Binding Sites and Epitope Mapping. Copyright 2017 Chemical Computing Group ULC All Rights Reserved. Prediction of Protein-Protein Binding Sites and Epitope Mapping Epitope Mapping Antibodies interact with antigens at epitopes Epitope is collection residues on antigen Continuous (sequence) or non-continuous

More information

BIRKBECK COLLEGE (University of London)

BIRKBECK COLLEGE (University of London) BIRKBECK COLLEGE (University of London) SCHOOL OF BIOLOGICAL SCIENCES M.Sc. EXAMINATION FOR INTERNAL STUDENTS ON: Postgraduate Certificate in Principles of Protein Structure MSc Structural Molecular Biology

More information

BIOINFORMATICS THE MACHINE LEARNING APPROACH

BIOINFORMATICS THE MACHINE LEARNING APPROACH 88 Proceedings of the 4 th International Conference on Informatics and Information Technology BIOINFORMATICS THE MACHINE LEARNING APPROACH A. Madevska-Bogdanova Inst, Informatics, Fac. Natural Sc. and

More information

VALLIAMMAI ENGINEERING COLLEGE

VALLIAMMAI ENGINEERING COLLEGE VALLIAMMAI ENGINEERING COLLEGE SRM Nagar, Kattankulathur 603 203 DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING QUESTION BANK VII SEMESTER BM6005 BIO INFORMATICS Regulation 2013 Academic Year 2018-19 Prepared

More information

Positional Preference of Rho-Independent Transcriptional Terminators in E. Coli

Positional Preference of Rho-Independent Transcriptional Terminators in E. Coli Positional Preference of Rho-Independent Transcriptional Terminators in E. Coli Annie Vo Introduction Gene expression can be regulated at the transcriptional level through the activities of terminators.

More information

NNvPDB: Neural Network based Protein Secondary Structure Prediction with PDB Validation

NNvPDB: Neural Network based Protein Secondary Structure Prediction with PDB Validation www.bioinformation.net Web server Volume 11(8) NNvPDB: Neural Network based Protein Secondary Structure Prediction with PDB Validation Seethalakshmi Sakthivel, Habeeb S.K.M* Department of Bioinformatics,

More information

Specificity: Induced Fit

Specificity: Induced Fit Specificity: Induced Fit Conformational changes may occur upon ligand binding (Daniel Koshland in 1958) This adaptation is called the induced fit Induced fit allows for tighter binding of the ligand Induced

More information