Codes

MiMiC

Multiscale Quantum Mechanics/Molecular Mechanics (QM/MM) simulations have become increasingly important for studying many biochemical processes that involve bond breaking and/or charge transfers among highly charged particles. Simulating these processes require algorithmic advances for calculations in both the MM and QM domains, together with an efficient and flexible interface between the two domains, which are typically treated using different software packages. An optimal strategy of algorithm design is that such interface must not hinder individual-domain calculations but help to speed up the overall simulation. The recently developed Multiscale Modeling in Computational Chemistry (MiMiC) software achieves this by providing a very flexible and computationally efficient framework for multiscale simulations. Using this multi-layered parallelization scheme, MiMiC has displayed efficient scalability over more than ten thousand cores in a single QM/MM simulation while maintaining an overall parallel efficiency above 75%, enabling nanoscale QM/MM molecular dynamics of complex biological systems. The code has been officially released. More information about it can be found on the official webpage.

References

J. M. H. Olsen, V. Bolnykh, S. Meloni, E. Ippoliti, M. P. Bircher, P. Carloni, U. Rothlisberger, MiMiC: A Novel Framework for Multiscale Modeling in Computational Chemistry, J. Chem. Theory Comput. 15, 3810–3823 (2019).
V. Bolnykh, J. M. H. Olsen, S. Meloni, M. P. Bircher, E. Ippoliti, P. Carloni, U. Rothlisberger, Extreme Scalability of DFT-Based QM/MM MD Simulations Using MiMiC, J. Chem. Theory Comput. 15, 5601–5613 (2019).

MoNvIso

MoNvIso, the Modeling eNvironment for Isoforms, is a homology modeling software developed using Python. Its main purpose is to identify the isoform of the protein most likely related to a specific disease, based on the mutations provided by the user.

It performs an evaluation on which isoform can map the highest number of mutations, then evaluates the “modellability” of all the isoforms to decide which one has the highest amount of protein surface covered by templates. It automatically searches for homologues (using BLAST API), aligns them (with COBALT), builds the Hidden Markov Model and uses it to search for templates (with HMMER API), aligns them and builds the model of the wild type and mutants (using MODELLER).

MoNvIso can be downloaded from this link.

Volume-based metadynamics

Determining the complete set of ligands' binding-unbinding pathways is important for drug discovery and for rational interpretation of mutation data. Here we have developed a metadynamics-based technique that addresses this issue and allows estimating affinities in the presence of multiple escape pathways. The calculations require a relatively small computational cost, making this approach valuable for practical applications, such as screening of small compound libraries. This approach has been tested and applied by using the PLUMED package. More information about its implementation cab be found in the PLUMED NEST repository at this link.

References

R. Capelli, P. Carloni, M. Parrinello, Exhaustive Search of Ligand Binding Pathways via Volume-Based Metadynamics. J Phys Chem Lett 10(12), 3495-3499 (2019).
The PLUMED consortium. Promoting transparency and reproducibility in enhanced molecular simulations. Nat Methods 16, 670–673 (2019).

Localized Volume-based metadynamics

Enhanced sampling methods can predict free-energy landscapes associated with protein/ligand binding, characterizing the involved intermolecular interactions in a precise way. However, these in silico approaches can be challenged by induced-fit effects. We have developed a variant of volume-based metadynamics tailored to tackle this problem in a general and efficient way.

References

Q. Zhao et al. Enhanced Sampling Approach to the Induced-Fit Docking Problem in Protein–Ligand Binding: The Case of Mono-ADP-Ribosylation Hydrolase Inhibitors. J Chem Theory Comput 17(12), 7899-7911 (2021).

T-pad

The intrinsic plasticity of protein residues, along with the occurrence of transitions between distinct residue conformations, plays a pivotal role in a variety of molecular recognition events in the cell. Analysis aimed at identifying both of these features has been limited so far to protein-complex structures. We have developed a computationally efficient tool (T-pad), which quantitatively analyzes protein residues' flexibility and detects backbone conformational transitions. The code can be obtained under request to the authors.

References

R. Caliandro, G. Rossetti, and P. Carloni. Local Fluctuations and Conformational Transitions in Proteins. J Chem Theory Comput 8(11), 4775–4785 (2012).

Numbering scheme and similarity index to automate comparisons of hydrogen-bond and hydrophobic interaction networks.

Recent developments in structural biology have led to numerous new structures of membrane proteins to be solved. When structures of distinct membrane proteins of the same family are solved at high resolution, the opportunity arises to evaluate sequence-structure-function relationships for that family of membrane proteins. However, this also brings about the challenge for how to automatize the comparisons of the relationships between sequence and structure, and how these relationships reflect on the biological function of the protein. This aspect is well illustrated by the family of microbial pump rhodopsins, which are seven-helical membrane proteins that pump protons, chloride, or sodium ions, across the cell membrane. There are more than 30 structures of unique microbial rhodopsins solved until now, and more than 8500 sequences. The internal hydrogen-bond networks are thought to be essential for the transfer of ions. To enable direct comparisons between the hydrogen-bond networks of microbial pump rhodopsins with distinct functions, we developed NS-mrho, numbering scheme for microbial rhodopsins, which uses the philosophy behind the well-known Ballesteros-Weinstein scheme for G Protein Coupled Receptors, GPCRs. Furthermore, we developed a pair-wise similarity matrix computation that can be used to automate the comparisons of pairs of hydrogen-bond or hydrophobic interaction networks, and to evaluate the relationship between the similarity index and the phylogenetic distance between the proteins. Although we applied these numbering scheme and similarity index to microbial pump rhodopsins, the methodologies we developed can in principle be extended to other membrane protein families.

The protocol for the numbering scheme and the codes are released with the publication. See the Supporting Information of the paper, and its associated content.

References

E. Bertalan, M. Konno, M. del Carmin Marín, R. Bagherzadeh, T. Nagata, L.S. Brown, K. Inoue, A-N. Bondar. Hydrogen-bonding and hydrophobic interaction networks as structural determinants of microbial rhodopsin function. J. Phys. Chem. B 128, 30, 7407–7426 (2024).

Bridge and the graphical interface Bridge2

Bridge is a graph-based algorithm coded in Python. Bridge computes graphs that consist of nodes, which for a protein are protein H-bonding groups, and edges, which are H-bonds between these groups. The edges, or H-bonds between two graph nodes, can be direct H-bonds or water-mediated bridges, or water wires. Both the H-bond criteria and the length of the water wire can be chosen by the user. Once computed with Bridge, the H-bond graph can be queried to identify, for example, H-bonds that are sampled persistently, shortest-distance paths between protein groups of interest, or the H-bond cluster of a group of interest.

Bridge2 is the graphical user interface of Bridge. A major development contributed by Bridge2 is that nodes of the graph can be arranged interactively for optimal view of the H-bond network. Bridge2 contains tools for the analysis of simulation trajectories, including the identification of H-bond motifs commonly found at proton-binding sites or sites otherwise important for protein function.

Bridge2 can also be used to compute graphs of hydrophobic interaction networks.

Bridge2 can be downloaded here.

References

M. Siemers, A-N. Bondar. Interactive interface for graph-based analyses of dynamic hydrogen-bond networks: application to spike protein S. J Chem Inf Model 61, 2998–3014 (2021).
M. Siemers, M. Lazaratos, K. Karathanou, F. Guerra, L.S. Brown, A-N Bondar. Bridge: a graph-based algorithm to analyze dynamic H-bond networks in membrane proteins. J Chem Theory Comput 15, 6781-6798 (2019).

USP: Unique Shortest Path computations for H-bond networks of large proteins

The betweeness centrality (BC) of a graph node x gives the number of shortest distance paths between nodes y and z that pass via node x. This BC is typically normalized by the total number of shortest-distance paths that inter-connect nodes y and z. In the H-bond graph of a protein, nodes with larger BC values could be of direct interest as putative ‘communication hubs’ for the protein. When the protein or protein complex are large, and thus have large numbers of nodes and edges in their H-bond graphs, the interpretation of the BC values can become challenging. To tackle this issue, we implemented unique shortest path (USP) computations. The USB of node x gives a more intuitive representation of its participation in the overall H-bond network, because it excludes over-counting the intermediate nodes and reports only the unique shortest paths between nodes y and z. We used Bridge and USP computations to identify the conserved H-bond motifs in a large dataset of membrane transporter and receptor structures.

References

M. Lazaratos, M.Siemers, L.S. Brown, A-N. Bondar. Conserved hydrogen-bond motifs of membrane transporters and receptors. BBA-Biomembranes 1864, 183896 (2022).
K. Karathanou, M. Lazaratos, E. Bertalan, M. Siemers, K. Buzar, G.F.X. Schertler, C. del Val, Bondar A-N. A graph-based approach identifies dynamic H-bond communication networks in spike S of SARS-CoV-2. J. Struct. Biol. 212, 107617 (2020).

C-Graphs (Conserved graphs)

C-graphs and its graphical user interface enable the computation of conserved and comparison H-bond graphs. Given a set of at two static protein structures or two MD simulation trajectories, the conserved H-bond graph is defined as the graph composed of nodes and edges present in both structures/MD trajectories. The set of static protein structures or MD simulation trajectories may contain more than two structures/trajectories. For two protein structures, a comparison H-bond graph color codes the nodes and edges according to their presence in either of the structures. By projecting H-bond graphs onto the z-axis, which for a membrane protein is typically the membrane normal, C-Graphs allows the user to estimate the linear length of the H-bond networks. Given a set of static protein structures, C-Graphs uses a clustering algorithm to identify sites in which water molecules are conserved, and maps these waters onto the protein-water H-bond graph.

Static protein structures subjected to analyses with C-Graphs must not necessarily belong to proteins with the same amino acid residue sequence. C-Graphs contains a protocol that can be used to compute conserved H-bond graphs for static structures of distinct G Protein Coupled Receptors (GPCRs).

C-Graphs and the User’s Manual for C-Graphs can be downloaded here.

References

É. Bertalan, E. Lesca, G.F.X. Schertler, A-N. Bondar A-N. C-Graphs tool with graphical user interface to dissect conserved hydrogen-bond networks: applications to visual rhodopsins. J Chem Inf Model 61, 3692-5707 (2021).

DFS algorithm

The Depth-First-Search (DFS) algorithm for topologies of dynamic lipid H-bond clusters visits the nodes (lipid headgroups) of the H-bond graph to identify four types of topologies: linear clusters, star and linear, circular, and combined star, circular and linear. The size of a lipid H-bond cluster is given by the number of nodes in the cluster.

You can download the DFS algorithm here.

Moreover, the DFS algorithm was extended to evaluate the H-bond clusters in a cholesterol-containing lipid bilayer. Cholesterol is a key constituent of eukaryotic membranes, and of direct interest as putative regulator of a number of G Protein Coupled Receptors. We have extended our DFS algorithm to include cholesterol in the computations of dynamic lipid H-bond clusters. In a first study on a POPS:cholesterol membrane, we found that the presence of cholesterol makes the sampling of complex lipid clusters somewhat less frequent as compared to the pure POPS lipid membrane.

References

K. Karathanou, A-N. Bondar. Algorithm to catalogue topologies of dynamic lipid hydrogen-bond networks. BBA-Biomembranes 1864, 183859 (2022).
H. Jain, K. Karathanou, A-N. Bondar. Graph-based analyses of dynamic hydrogen-bond networks in phosphatidylserine:cholesterol membranes. Biomolecules 13(8), 1238 (2023).

Last Modified: 05.06.2025

Institute of Neurosciences and Medicine (INM)

Codes

MiMiC

References

MoNvIso

Volume-based metadynamics

References

Localized Volume-based metadynamics

References

T-pad

References

R. Caliandro, G. Rossetti, and P. Carloni. Local Fluctuations and Conformational Transitions in Proteins. J Chem Theory Comput 8(11), 4775–4785 (2012).

Numbering scheme and similarity index to automate comparisons of hydrogen-bond and hydrophobic interaction networks.

References

Bridge and the graphical interface Bridge2

References

M. Siemers, A-N. Bondar. Interactive interface for graph-based analyses of dynamic hydrogen-bond networks: application to spike protein S. J Chem Inf Model 61, 2998–3014 (2021).

M. Siemers, M. Lazaratos, K. Karathanou, F. Guerra, L.S. Brown, A-N Bondar. Bridge: a graph-based algorithm to analyze dynamic H-bond networks in membrane proteins. J Chem Theory Comput 15, 6781-6798 (2019).

USP: Unique Shortest Path computations for H-bond networks of large proteins

References

C-Graphs (Conserved graphs)

References

DFS algorithm

References

Forschungszentrum Jülich GmbH