Molecular simulations with MOSAIC

Preparing and running simulations

Mosaic has the potential to change in a profound way how we perform molecular simulations. By making it possible to store in a standardized way not only the input and output of a simulation, but also many intermediate data items that are usually handled opaquely inside a simulation program, Mosaic enables the transition from black-box simulation packages to ecosystems of small programs that communicate through Mosaic data. This has a number of advantages:

Visualization

Molecular visualization programs typically load files containing a molecular configuration associated with some supporting information, whose details depend on the format being used. Various other characteristics are often inferred from the explicitly supplied information, with more or less satisfactory results.

Protein visualization, for example, is based on PDB files, which provide an atom name and a residue name for each atom, in addition to its position. The chemical element is inferred from the atom and residue names, and the bonds are inferred from the same information plus the atom positions. This inference works reasonably well for standard residue types. The more a molecular system differs from a "typical" PDB entry, the more problems appear. Visualisation of coarse-grained models, for example, is rather frustrating.

Mosaic improves the situation by storing a much more detailed description of the molecular system that is visualized. Inference of additional required information is no longer necessary.

Mosaic also makes it possible to visualize additional quantities related to a molecular system. Quantities like partial charges or atomic fluctuation amplitudes can be visualized by color scales or sphere radii. Vector quantities, such as velocities or normal mode displacements, can be shown as arrows or through animations.

Analyzing simulations

Programs that analyze the output of molecular simulations (e.g. Molecular Dynamics trajectories) are currently either written to work with a specific simulation package, or have to deal with multiple formats and data conversion issues. In other words, authors have to choose between restricted applicability and accidental complexity. With Mosaic, it becomes possible to write simple and small analysis programs that nevertheless work with everyone's data.