Running YANK

Hardware

Running on GPUs

YANK uses OpenMM as its simulation engine, which runs fastest on modern GPUs using either the CUDA or OpenCL platforms. Modern GTX-class hardware, such as the GTX-1080 or GTX-TITAN-X (Maxwell), should work very well. See Supported hardware for more information about supported and recommended hardware.

Running on the CPU

If you don’t have a GPU, the OpenMM CPU platform will run in multithreaded mode by default (see Selecting a platform). While not as fast as gromacs, running on the CPU can still let you explore the features of YANK without needing a GPU. You can also use CPUs acting as OpenCL devices utilizing the AMD OpenCL APP SDK or the Intel OpenCL SDK, though this has not been extensively tested.

Parallelization

YANK can operate in two different modes:

Serial mode

In serial mode, a single simulation is run at a time, either on the GPU or CPU. For replica-exchange calculations, this means that each replica is run serially, and then exchanges between replicas are allowed to occur. This is useful if you are testing YANK, or running a large number of free energy calculations where speed is not a major concern.

MPI mode

In MPI mode, multiple simulations can be run at once, either on multiple GPUs or multiple CPUs, though running simulations on a mixture of platforms is not supported at this time. We utilize the widely-supported Message Passing Interface (MPI) standard for parallelization. All simulations are run using the same OpenMM Platform choice (CUDA, OpenCL, CPU, or Reference)

No GPU management is needed on single GPU/node systems.

Multi-GPU and multi-node systems require masking the GPUs so YANK only sees the one its suppose to. You will need to set the CUDA_VISIBLE_DEVICES environment variables on each process to mask all but the card you intend to use. We cannot provide a universal solution as systems will differ, but we can provide some general rules of thumb.

Note

All our documentation assumes MPICH 3 formatting which has the package name mpich and version formatting 3.X. This is distinct from MPICH2 which has the package name mpich2 and version formatting 1.X. Please see the official MPICH site for more information about the version naming and numbering.

  • torque or PBS cluster - Install the clusterutils module (automatic when YANK is installed from conda). Run the build_mpirun_configfile "yank --script=yank.yaml" targeting your YAML file. The command works by targeting the cluster’s $PBS_GPUFILE to determine device IDs as the cluster sees them.
  • SLURM cluster - SLURM handles some of the GPU masking for you by setting CUDA_VISIBLE_DEVICES, but we have found breaks down over multiple nodes. The clusterutils will also help you configure the MPI jobs to run over multiple nodes. The command in this case operates by reading environment variables to figure out host and card names.
  • LSF cluster - Rely on the clusterutils module which comes with YANK when conda installed. This configures hosts and GPU masking based on the batch job environment variables which are set at job submission.

We recommend running YANK with an MPI configfile. You can specify all of these sets to CUDA_VISIBLE_DEVICES by hand, but it will make for a long command line. Creating a hostfile and configfile to feed into MPI is the preferred option for this reason. Although clusterutils will handle file creation for you, below we show an example of a hostfile and configfile in case you need to make the files by hand. This is a 2 node, 4 GPU/node torque cluster (8 processes distributed over 2 nodes) setup:

Hostfile:

gpu-1-4
gpu-1-4
gpu-1-4
gpu-1-4
gpu-2-17
gpu-2-17
gpu-2-17
gpu-2-17

Configfile:

-np 1 -env CUDA_VISIBLE_DEVICES 3 yank script --yaml=freesolv.yaml
-np 1 -env CUDA_VISIBLE_DEVICES 2 yank script --yaml=freesolv.yaml
-np 1 -env CUDA_VISIBLE_DEVICES 1 yank script --yaml=freesolv.yaml
-np 1 -env CUDA_VISIBLE_DEVICES 0 yank script --yaml=freesolv.yaml
-np 1 -env CUDA_VISIBLE_DEVICES 3 yank script --yaml=freesolv.yaml
-np 1 -env CUDA_VISIBLE_DEVICES 2 yank script --yaml=freesolv.yaml
-np 1 -env CUDA_VISIBLE_DEVICES 1 yank script --yaml=freesolv.yaml
-np 1 -env CUDA_VISIBLE_DEVICES 0 yank script —-yaml=freesolv.yaml

Hopefully this example helps you construct your own configfile if the build_mpirun_configfile is unavailable or you are on a custom system.

Simulations may be started in one mode and then can be resumed using another parallelization mode or OpenMM Platform. The NetCDF files generated by YANK are platform-portable and hardware agnostic, so they can be moved from system to system if you want to start a simulation on one system and resume it elsewhere.

Parallelization is “pleasantly parallel”, where information is exchanged only every few seconds or more. This does not require high-bandwidth interconnects such as Infiniband; 10Gbit/s ethernet connections should work very well.

We show a more formal “use case” of setting up an MPI run below.

Getting help

To get a list of all command-like options, simply use the --help flag:

$ yank --help

Running in serial mode

To run the simulation in serial mode, simply use yank run, specifying a store directory by --store=dirname:

$ yank script --yaml=yank.yaml

The optional --verbose flag will show additional output during execution.

Running in MPI mode

Alternatively, to run the simulation in MPI mode:

$ mpirun yank script --yaml=yank.yaml

Keep in mind your system may have a different MPI executable name and/or syntax.

On systems with multiple NVIDIA GPUs per node, it is necessary to perform masking using CUDA_VISIBLE_DEVICES.

On systems using the conda-installed mpi4py package, the MPICH hydra mpirun will be automatically installed for you. You can use the cluster utility command build-mpirun-configfile available in our clusterutils tools to generate an appropriate configfile:

$ build-mpirun-configfile "yank script --yaml=yank.yaml"
$ mpirun -f hostfile -configfile configfile

build-mpirun-configfile is automatically installed with YANK when you use the conda installation route. Please see our notes from above about this script’s applicability on torque, PBS, SLURM and LSF clusters.


Selecting a platform

OpenMM supports running simulations on a number of platforms, though not all platforms are available on all hardware. To see which platforms your current installation supports, you can query the list of available platforms with

$ yank platforms
Available OpenMM platforms:
    0 Reference
    1 CUDA
    2 CPU
    3 OpenCL

You can either leave the choice of platform up to YANK—in which case it will choose the fastest available platform—or specify the desired platform via the platform argument in a YAML file.

You can also (although not recommended) override the platform selection through the yank script -o flag. For example, to force YANK to use the OpenCL platform:

$ yank script --yaml=yank.yaml -o options:platform:OpenCL

See the yank script command line docs for more information on the -o flag.

Note

The CPU platform will automatically use all available cores/hyperthreads in serial mode, but in MPI mode, will use a single thread to avoid causing problems in queue-regulated parallel systems. To control the number of threads yourself, set the OPENMM_NUM_THREADS environment variable to the desired number of threads.

Breaking up Simulations

One limitation simulations run to compute free energy is that you need sufficient data to estimate all legs of the thermodynamic cycle before you can estimate many properties. YANK by default simulates each phase of the thermodynamic cycle sequentially, meaning you have to wait for simulation completion before you can do any kind of analysis.

YANK allows its simulations to switch between phases of the thermodynamic cycle mid-simulation. Setting the switch_phase_interval option in the YAML file allows you to control how many iterations the simulation will run of one phase before switching to the other to allow collection between each phase on-the-fly, and therefore allowing analysis at any point. This feature is helpful for those who want to run for an arbitrarily long length of time, then wish to end the simulation after some convergence criteria is met, e.g. error in free energy estimate.

YANK also provides a way flip between different experiments when setting the !Combinatorial flag. The switch_experiment_interval allows simulations to cycle through all the experiments created when users invoke the !Combinatorial flag. This option can be set in tandem with the switch_phase_interval. Please see the respective documentation on these options for more information.