ISOKANN.jl

Documentation for ISOKANN.jl.

Start with the Introduction for an overview of the data model and training loop. The Installation page walks through getting Julia and OpenMM set up, and Tips covers practical choices like optimizer and regularization.

Main entry points

ISOKANN.Iso — Type

Iso(data; opt=NesterovRegularized(), model=defaultmodel(data), gpu=false, kwargs...)

source

Iso(sim::IsoSimulation; nx=100, nk=10, nd=1, kwargs...)

Convenience constructor which generates the SimulationData from the simulation sim and constructs the Iso object. See also Iso(data; kwargs...)

Arguments

sim::IsoSimulation: The IsoSimulation object.
nx::Int: The number of starting points.
nk::Int: The number of koopman samples.
nout::Int: Dimension of the χ function.

source

ISOKANN.SimulationData — Type

struct SimulationData{S,D,C,F}

A struct combining a simulation with the simulated coordinates and corresponding ISOKANN trainingsdata

Fields

sim::S: The simulation object.
data::D: The ISOKANN trainings data.
coords::C: The orginal coordinates of the simulations.
featurizer::F: A function mapping coordinates to ISOKANN features.

source

ISOKANN.OpenMM.OpenMMSimulation — Type

OpenMMSimulation(; pdb, steps, ...)
OpenMMSimulation(; py, steps)

Constructs an OpenMM simulation object. Either use OpenMMSimulation(;py, steps) where pyis the location of a .py python script creating a OpenMM simulation object or supply a .pdb file viapdb` and the following parameters (see also defaultsystem):

Arguments

pdb::String: Path to the PDB file.
ligand::String: Path to ligand file.
forcefields::Vector{String}: List of force field XML files.
temp::Float64: Temperature in Kelvin.
friction::Float64: Friction coefficient in 1/picosecond.
step::Float64: Integration step size in picoseconds.
steps::Int: Number of simulation steps.
features: Which features to use for learning the chi function. - A vector of Int denotes the indices of all atoms to compute the pairwise distances from. - A vector of CartesianIndex{2} computes the specific distances between the atom pairs. - A number denotes the radius below which all pairs of atoms will be used (computed only on the starting configuration) - If nothing all pairwise distances are used.
minimize::Bool: Whether to perform energy minimization on first state.
nthreads: The number of threads to use for parallelization of multiple simulations.
mmthreads: The number of threads to use for each OpenMM simulation. Set to "gpu" to use the GPU platform.

Returns

OpenMMSimulation: An OpenMMSimulation object.

source

ISOKANN.propagate — Function

propagate(sim::OpenMMSimulation, x0::AbstractMatrix, nk)

Propagates nk replicas of the OpenMMSimulation sim from the inintial states x0.

Arguments

sim: An OpenMMSimulation object.
x0: Matrix containing the initial states as columns
nk: The number of replicas to create.

source

ISOKANN.run! — Function

run!(iso::Iso, n=1, epochs=1)

Run the training process for the Iso model.

Arguments

a iso::Iso: The Iso model to train.

n::Int: The number of (outer) Koopman iterations.
epochs::Int: The number of (inner) epochs to train the model for each Koopman evaluation.

source

ISOKANN.run_kde! — Function

run_kde!(iso; generations=1, iter=100, cutoff=Inf, kde=1, unique=true)

Train iso with adaptive sampling. Sample kde new data points followed by iter isokann iterations and repeat this generations times. cutoff specifies the maximal data size, after which new data overwrites the oldest data. unique enforces resampling from yet unsampled ys only.

source

ISOKANN.chis — Function

chis(iso::Iso, data=iso.data)

Evaluate the learned χ function on data (a SimulationData or a raw (xs, ys) tuple). Returns a (nout, n) matrix whose columns are the χ values at the starting points. Defaults to iso's own training data.

source

ISOKANN.rates — Function

rates(iso::Iso)

Return the coarse grained rate matrix Q satisfying Kχ = exp(τQ)χ

In the 1D ISOKANN case return the rates for χ and 1-χ.

source

ISOKANN.plot_training — Function

plot_training(iso; maxpoints=0)

Summary dashboard of an Iso training run: loss history, learned χ values, and a scatter of χ vs. its training-target fix point. maxpoints caps the number of points drawn in the scatter/χ plots (0 = all).

source

ISOKANN.scatter_ramachandran — Function

scatter_ramachandran(iso::Iso; kwargs...)
scatter_ramachandran(x, z; kwargs...)

Ramachandran scatter: plots each configuration's backbone dihedrals (φ, ψ) coloured by χ. Accepts an Iso, a (coords, model) pair, or a coordinate matrix with explicit χ values. Extra kwargs are forwarded to Plots.scatter.

source

ISOKANN.save_reactive_path — Function

save_reactive_path(iso::Iso,
    coords::AbstractMatrix=coords(iso.data) |> cpu;
    sigma=1,
    maxjump=1,
    out="out/reactive_path.pdb",
    source=pdbfile(iso.data),
    kwargs...)

Extract and save the reactive path of a given iso.

Computes the maximum likelihood path with parameter sigma along the given data points, aligns it and saves it to the out path.

Data construction and manipulation

ISOKANN.data_from_trajectory — Function

data_from_trajectory(xs::AbstractMatrix; reverse=true, stride=1, lag=1)

Generate the (x,y) data pairs for ISOKANN from the trajectory xs.

stride controls the stride of the starting positions x and lag the lag for the end positions y in terms of trajectory frames. If reverse is true, construct also the time-reversed pairs (recomended for stable ISOKANN training).

source

ISOKANN.data_from_trajectories — Function

data_from_trajectories(xss::AbstractVector{<:AbstractMatrix}; kwargs...)
data_from_trajectories(xs::AbstractArray{<:Any,3}; kwargs...)

Generate training data (x, y) pairs for ISOKANN from trajectories by calling data_from_trajectory on each and concatenating the results.

See data_from_trajectory for details on the keyword arguments (reverse, stride, lag).

source

ISOKANN.mergedata — Function

mergedata(d1::SimulationData, d2::SimulationData)

Merge the data and features of d1 and d2, keeping the simulation and features of d1. Note that there is no check if simulation features agree.

source

ISOKANN.addcoords! — Function

addcoords!(iso::Iso, coords::AbstractMatrix)
addcoords!(iso::Iso, n::Integer)

Extend iso's training data with new starting points and their freshly propagated Koopman samples. If coords is a matrix, use its columns as new starting points. If an integer n is given, continue a length-n lagged trajectory from the current last frame and split it into (xs, ys) pairs.

source

ISOKANN.resample_kde! — Function

resample_kde!(iso::Iso, ny; kwargs...)

Replace/augment iso's data by drawing ny new starting points via KDE-based subsampling along the current χ, then propagating them. Used by run_kde! but also callable directly for custom training loops.

source

ISOKANN.laggedtrajectory — Function

laggedtrajectory(sim::OpenMMSimulation, lags; steps=steps(sim), resample_velocities=true, kwargs...)

Generate a lagged trajectory for a given OpenMMSimulation. E.g. x0–x–x–x for lags=3 and steps=2

Arguments

sim::OpenMMSimulation: The simulation object.
lags: The number of steps.
steps: The lagtime, i.e. number of steps to take in the simulation.
resample_velocities: Whether to resample velocities according to Maxwell-Boltzman for each lag.
kwargs...: Additional keyword arguments to pass to the trajectory function.

Returns

A matrix of lags samples which each have steps simulation-steps inbetween them.

source

laggedtrajectory(data::SimulationData, n) = laggedtrajectory(data.sim, n, x0=coords(data)[:, end])

Simulate a trajectory comprising of n simulations from the last point in data

source

Models and Optimizers

ISOKANN.pairnet — Function

Fully connected neural network with layers layers from n to nout dimensions. activation determines the activation function for each but the last layer lastactivation can be used to modify the last layers activation function

source

ISOKANN.densenet — Function

densenet(; layers::Vector{Int}, activation=Flux.sigmoid, lastactivation=identity, layernorm=true) -> Flux.Chain

Construct a fully connected neural network (Flux.Chain) with customizable layer sizes, activations, and optional input layer normalization.

Arguments

layers::Vector{Int}: List of layer dimensions. For example, [10, 32, 16, 1] creates a network with input size 10, two hidden layers of size 32 and 16, and an output layer of size 1.
activation: Activation function applied to all layers except the last. Defaults to Flux.sigmoid.
lastactivation: Activation function for the final layer. Defaults to identity.
layernorm::Bool: Whether to prepend a Flux.LayerNorm layer to normalize the input. Defaults to true.

Returns

A Flux.Chain composed of dense layers (and optionally a leading LayerNorm).

source

ISOKANN.AdamRegularized — Function

AdamRegularized(adam=1e-3, reg=1e-4)

Constructs an optimizer that combines weight decay regularization with ADAM. Uses reg for the weight decay parameter and lr as the learning rate for ADAM. Note that this is different from AdamW (Adam+WeightDecay) (c.f. Decay vs L2 Reg.).

source

ISOKANN.NesterovRegularized — Function

NesterovRegularized(; lr=1e-3, reg=1e-4)

Constructs an optimizer that combines weight decay regularization with Nesterov momentum. Uses reg for the weight decay parameter and lr as the learning rate for Nesterov acceleration. This worked well as alternative where ADAM had problems.

source

Public API

ISOKANN.ExternalSimulation — Type

ExternalSimulation(; pdbfile=nothing, masses=nothing, lagtime=1, kwargs...)

Placeholder IsoSimulation for data that was sampled outside of ISOKANN. It stores metadata (topology, lagtime, masses, …) without the ability to propagate new samples — use it together with data_from_trajectory / data_from_trajectories and SimulationData to train on precomputed trajectories.

source

ISOKANN.Iso — Method

Iso(data; opt=NesterovRegularized(), model=defaultmodel(data), gpu=false, kwargs...)

source

ISOKANN.Iso — Method

Iso(sim::IsoSimulation; nx=100, nk=10, nd=1, kwargs...)

Convenience constructor which generates the SimulationData from the simulation sim and constructs the Iso object. See also Iso(data; kwargs...)

Arguments

sim::IsoSimulation: The IsoSimulation object.
nx::Int: The number of starting points.
nk::Int: The number of koopman samples.
nout::Int: Dimension of the χ function.

source

ISOKANN.MetadynamicsSimulation — Type

MetadynamicsSimulation(sim, rc, mdstate, dt, height, sigma)
MetadynamicsSimulation(iso; height=1f0, sigma=0.1f0, dt=600f0)

Well-tempered metadynamics bias that can be used as a force in a Langevin simulation.

The bias potential is a sum of Gaussians deposited at visited reaction coordinate (RC) values. When called as md(x), it returns the negative gradient of the (well-tempered) bias with respect to the configuration x, suitable for use as an additive force.

Arguments

sim: underlying IsoSimulation (provides temperature and propagation)
rc: reaction coordinate function x -> z, mapping configuration to RC space
mdstate: accumulated Gaussian centers — one of MetadynamicsState, MetadynamicsStateMatrix (GPU-optimized), or MetadynamicsStateGridded
height: Gaussian height
sigma: Gaussian width in RC space
dt: well-tempered offset temperature (ΔT); Inf for classic (untempered) metadynamics

The convenience constructor builds rc from chicoords(iso) and initializes mdstate from the current chi values of the Iso.

See also

deposit!, trajectory, wt_free_energy, plot_profile

source

ISOKANN.SimulationData — Method

SimulationData(sim::IsoSimulation, nx::Int, nk::Int; ...)
SimulationData(sim::IsoSimulation, xs::AbstractMatrix, nk::Int; ...)
SimulationData(sim::IsoSimulation, (xs,ys); ...)
SimulationData(xs, ys; pdb="", ...)  # for external simulation data

Generates SimulationData from a simulation with either

nx initial points and nk Koopman samples
xs as initial points and nk Koopman sample
xs as inintial points and ys as Koopman samples
xs and ys from external simulations
xs a trajectory of an external simulation, implicitly computing ys via data_from_trajectory of succesive samples

source

ISOKANN.AdamRegularized — Function

AdamRegularized(adam=1e-3, reg=1e-4)

source

ISOKANN.Doublewell — Method

Doublewell(; kwargs...)

1-D overdamped Langevin dynamics in the double-well potential V(x) = (x² − 1)². Returns a Diffusion; kwargs are forwarded to it (e.g. sigma, dt, lagtime).

source

ISOKANN.MuellerBrown — Method

MuellerBrown(; kwargs...)

2-D overdamped Langevin dynamics in the Müller–Brown potential — a standard test system with three metastable basins. Returns a Diffusion; kwargs are forwarded (e.g. sigma, dt, lagtime).

source

ISOKANN.NesterovRegularized — Function

NesterovRegularized(; lr=1e-3, reg=1e-4)

source

ISOKANN.Triplewell — Method

Triplewell(; kwargs...)

2-D overdamped Langevin dynamics in the triple-well potential of Metzner, Schütte, Vanden-Eijnden (2006). Returns a Diffusion; kwargs are forwarded (e.g. sigma, dt, lagtime).

source

ISOKANN.addcoords! — Method

addcoords!(iso::Iso, coords::AbstractMatrix)
addcoords!(iso::Iso, n::Integer)

source

ISOKANN.addcoords — Method

addcoords(d::SimulationData, coords::AbstractMatrix) -> SimulationData

Propagate coords under d.sim (reusing the existing nk and featurizer) and return a new SimulationData that concatenates the new (xs, ys) pairs onto d. Non-mutating counterpart to addcoords!.

source

ISOKANN.atom_indices — Method

atom_indices(filename::String, selector::String) -> Vector{Int}

Return the 1-based atom indices matching an MDTraj selector expression (e.g. "name CA", "backbone", "not element H") applied to the topology loaded from filename. Useful to restrict features/alignment to a subset of atoms.

source

ISOKANN.ca_rmsd — Function

ca_rmsd(cainds, pdb="data/villin nowater.pdb", pdbref="data/villin/1yrf.pdb")

Returns a ReactionCoordsRMSD object which is used to calculate the Root Mean Square Deviation (RMSD) of the provided C-alpha atoms.

Inputs: - cainds: Indices of the C-alpha atoms to consider for the RMSD - target: PDB File containing the target structure to which the RMSD is computed - source: Alternative PDB File for the source coordinates in the case that the indices differ (i.e. when matching different topologies)

Example: rsmd = ca_rmsd(3:10, "data/villin/1yrf.pdb", "data/villin nowater.pdb") rmsd(rand(300,10))

source

ISOKANN.chicoords — Method

chicoords(iso::Iso, xs)

Evaluate χ at raw coordinates xs (a (d, n) matrix), running the simulation's featurizer first. Handles CPU/GPU placement automatically.

source

ISOKANN.chis — Function

chis(iso::Iso, data=iso.data)

source

ISOKANN.coords — Method

coords(iso::Iso)       -> (xs, ys)
features(iso::Iso)     -> (features(xs), features(ys))
propcoords(iso::Iso)   -> ys
propfeatures(iso::Iso) -> features(ys)

Accessors for the training data attached to iso. coords returns the raw simulation coordinates, features the featurized inputs actually passed to the network; the prop* variants return only the propagated (Koopman) samples.

source

ISOKANN.data_from_trajectories — Method

data_from_trajectories(xss::AbstractVector{<:AbstractMatrix}; kwargs...)
data_from_trajectories(xs::AbstractArray{<:Any,3}; kwargs...)

Generate training data (x, y) pairs for ISOKANN from trajectories by calling data_from_trajectory on each and concatenating the results.

See data_from_trajectory for details on the keyword arguments (reverse, stride, lag).

source

ISOKANN.data_from_trajectory — Method

data_from_trajectory(xs::AbstractMatrix; reverse=true, stride=1, lag=1)

Generate the (x,y) data pairs for ISOKANN from the trajectory xs.

source

ISOKANN.densenet — Method

densenet(; layers::Vector{Int}, activation=Flux.sigmoid, lastactivation=identity, layernorm=true) -> Flux.Chain

Construct a fully connected neural network (Flux.Chain) with customizable layer sizes, activations, and optional input layer normalization.

Arguments

layers::Vector{Int}: List of layer dimensions. For example, [10, 32, 16, 1] creates a network with input size 10, two hidden layers of size 32 and 16, and an output layer of size 1.
activation: Activation function applied to all layers except the last. Defaults to Flux.sigmoid.
lastactivation: Activation function for the final layer. Defaults to identity.
layernorm::Bool: Whether to prepend a Flux.LayerNorm layer to normalize the input. Defaults to true.

Returns

A Flux.Chain composed of dense layers (and optionally a leading LayerNorm).

source

ISOKANN.flattenlast — Method

flattenlast(x)

Concatenate all but the first dimension of x. Usefull to convert a tensor of samples into a matrix

source

ISOKANN.laggedtrajectory — Method

laggedtrajectory(data::SimulationData, n) = laggedtrajectory(data.sim, n, x0=coords(data)[:, end])

Simulate a trajectory comprising of n simulations from the last point in data

source

ISOKANN.load_trajectory — Method

load_trajectory(filename; top=nothing, kwargs...)

wrapper around Python's mdtraj.load(). Returns a (3 * natom, nframes) shaped array.

source

ISOKANN.localpdistinds — Method

localpdistinds(coords::AbstractMatrix, radius)

Given coords of shape ( 3n x frames ) return the pairs of indices whose minimal distance along all frames is at least once lower then radius

source

ISOKANN.mergedata — Method

mergedata(d1::SimulationData, d2::SimulationData)

Merge the data and features of d1 and d2, keeping the simulation and features of d1. Note that there is no check if simulation features agree.

source

ISOKANN.pairnet — Method

source

ISOKANN.pdists — Method

pdists(coords::AbstractArray, inds::Vector{<:Tuple})

Compute the pairwise distances between the particles specified by the tuples inds over all frames in traj. Assumes a column contains all 3n coordinates.

source

ISOKANN.picking — Method

picking(X, n; dists = pairwise_one_to_many)

The picking algorithm, i.e. greedy farthest point sampling, for n points on the columns of X. A custom distance function (::Vector, ::Matrix)->(::Vector) may be passed through dists.

Returns X[:,qs], i.e. the picked samples, their former indices qs and their distances d to all other points.

source

ISOKANN.plot_training — Method

plot_training(iso; maxpoints=0)

source

ISOKANN.rates — Method

rates(iso::Iso)

Return the coarse grained rate matrix Q satisfying Kχ = exp(τQ)χ

In the 1D ISOKANN case return the rates for χ and 1-χ.

source

ISOKANN.reactionpath_minimum — Function

reactionpath_minimum(iso::Iso, x0; steps=100)

Compute the reaction path by integrating ∇χ with orthogonal energy minimization.

Arguments

iso::Iso: The isomer for which the reaction path minimum is to be computed.
x0: The starting point for the reaction path computation.
steps=100: The number of steps to take along the reaction path.

source

ISOKANN.reactionpath_ode — Method

reactionpath_ode(iso, x0; steps=101, extrapolate=0, orth=0.01, solver=OrdinaryDiffEq.Tsit5(), dt=1e-3, kwargs...)

Compute the reaction path by integrating ∇χ as well as orth * F orthogonal to ∇χ where F is the original force field.

Arguments

iso::Iso: The isomer for which the reaction path minimum is to be computed.
x0: The starting point for the reaction path computation.
steps=100: The number of steps to take along the reaction path.
minimize=false: Whether to minimize the orthogonal to ∇χ before integration.
extrapolate=0: How fast to extrapolate beyond χ 0 and 1.
orth=0.01: The weight of the orthogonal force field.
solver=OrdinaryDiffEq.Tsit5(): The solver to use for the ODE integration.
dt=1e-3: The initial time step for the ODE integration.

source

ISOKANN.reactive_path — Method

reactive_path(xi::AbstractVector, coords::AbstractMatrix; sigma, minjump=0, maxjump=1, method=QuantilePath(0.05), normalize=false, sortincreasing=true)

Find the maximum likelihood path (under the model of brownion motion with noise sigma) through coords with times xi. Supports either CPU or GPU arrays.

Arguments

coords: (ndim x npoints) matrix of coordinates.
xi: time coordinate of the npoints points
sigma: spatial noise strength of the model.
minjump, maxjump: lower and upper bound to the jump in time xi along the path. Tighter bounds reduce the computational cost.
method: either FromToPath, QuantilePath, FullPath or MaxPath, specifying the end points of the path
normalize: whether to normalize all coords first
sortincreasing: return the path from lower to higher xi values

source

ISOKANN.readchemfile — Function

readchemfile(source::String, steps=:) -> Matrix{Float32}
readchemfile(traj::Chemfiles.Trajectory, frames) -> Matrix{Float32}

Load trajectory coordinates via the Chemfiles library and return them as a (3*natoms, nframes) matrix in nanometers (converted from Å). steps / frames selects a subset of frames; passing an Int returns a single flattened frame. Useful to pipe external trajectories into data_from_trajectory.

source

ISOKANN.resample_kde! — Method

resample_kde!(iso::Iso, ny; kwargs...)

source

ISOKANN.resample_kde — Method

resample_kde(xs, ys, n; kwargs...)

Return n indices of ys such that the corresponding points "fill the gaps" in the KDE of xs. For possible kwargs see kde_needles.

source

ISOKANN.resample_kde — Method

resample_kde(data::SimulationData, model, n; bandwith, unique)

add new samples to data by running new simulations starting at some ys (i.e. the propagated points) of data where these points are iteratively selected to be closest to the minimum of a KDE of the current chi values from xs. If unique is true, start simulations from point only where there were no simulations before. bandwith controls the bandwidth of the KDE.

source

ISOKANN.restricted_localpdistinds — Method

restricted_localpdistinds(coords, radius, atoms)

Like localdists, but consider only the atoms with index in atoms

source

ISOKANN.run! — Function

run!(iso::Iso, n=1, epochs=1)

Run the training process for the Iso model.

Arguments

a iso::Iso: The Iso model to train.

n::Int: The number of (outer) Koopman iterations.
epochs::Int: The number of (inner) epochs to train the model for each Koopman evaluation.

source

ISOKANN.run_kde! — Method

run_kde!(iso; generations=1, iter=100, cutoff=Inf, kde=1, unique=true)

source

ISOKANN.save_reactive_path — Function

save_reactive_path(iso::Iso,
    coords::AbstractMatrix=coords(iso.data) |> cpu;
    sigma=1,
    maxjump=1,
    out="out/reactive_path.pdb",
    source=pdbfile(iso.data),
    kwargs...)

Extract and save the reactive path of a given iso.

Computes the maximum likelihood path with parameter sigma along the given data points, aligns it and saves it to the out path.

Internal API

ISOKANN.DataTuple — Type

DataTuple = Tuple{Matrix{T},Array{T,3}} where {T<:Number}

We represent data as a tuple of xs and ys.

xs is a matrix of size (d, n) where d is the dimension of the system and n the number of samples. ys is a tensor of size (d, k, n) where k is the number of koopman samples.

source

ISOKANN.IsoSimulation — Type

abstract type IsoSimulation

Abstract type representing an IsoSimulation. Should implement the methods coords, propagate, dim

source

ISOKANN.LazyTrajectory — Method

LazyTrajectory(path::String)

Represents the trajectory path as matrix whose columns are lazily loaded from disk.

source

ISOKANN.MetadynamicsState — Type

MetadynamicsState{T, V}

Vector-of-vectors storage for Gaussian centers.

Performance: CPU: OK | GPU: slow (~1000x) | Add center: O(1) push Best for: CPU-only, frequent center additions Dynamics: Exact

See also: MetadynamicsStateMatrix, MetadynamicsStateGridded

source

ISOKANN.MetadynamicsStateGridded — Type

MetadynamicsStateGridded{ITP}

Grid-based approximation of bias potential with cubic spline interpolation.

Performance: CPU: very fast | GPU: N/A | Add center: unsupported Best for: 1D–2D rapid exploration (low-dim only) Dynamics: Approximate (spline interpolation)

See also: MetadynamicsState, MetadynamicsStateMatrix

source

ISOKANN.MetadynamicsStateMatrix — Type

MetadynamicsStateMatrix{T}

Matrix storage (nrc × n_centers) for Gaussian centers — GPU-optimized.

Performance: CPU: fast | GPU: very fast | Add center: O(n) hcat Best for: GPU production runs Dynamics: Exact

See also: MetadynamicsState, MetadynamicsStateGridded

source

ISOKANN.ReactionCoordsRMSD — Type

struct ReactionCoordsRMSD

Instances of this object allow to compute the Root Mean Square Deviation (RMSD) to a part of a reference molecule. See also ca_rmsd.

source

ISOKANN.Stabilize — Type

TransformStabilize(target, last=nothing)

Wraps another target and permutes its target to match the previous target

Currently we also have the stablilization (wrt to the model though) inside most Transforms. TODO: Decide which to keep

source

ISOKANN.TransformGramSchmidt2 — Type

TransformGramSchmidt()

Compute the target through a Gram-Schmidt orthonormalisation.

source

ISOKANN.TransformISA — Type

TransformISA(permute)

Compute the target via the inner simplex algorithm (without feasiblization routine). permute specifies whether to apply the stabilizing permutation

source

ISOKANN.TransformPseudoInv — Type

TransformPseudoInv(normalize, direct, eigenvecs, permute)

Compute the target by approximately inverting the action of K with the Moore-Penrose pseudoinverse.

If direct==true solve chi * pinv(K(chi)), otherwise inv(K(chi) * pinv(chi))). eigenvecs specifies whether to use the eigenvectors of the schur matrix. normalize specifies whether to renormalize the resulting target vectors. permute specifies whether to permute the target for stability.

source

ISOKANN.TransformShiftscale — Type

TransformShiftscale()

Classical 1D shift-scale (ISOKANN 1)

source

ISOKANN._pickclosestloop — Method

scales with n=length(hs)

source

ISOKANN.bootstrap — Method

bootstrap(sim, nx, ny)

compute initial data by propagating the molecules initial state to obtain the xs and propagating them further for the ys

source

ISOKANN.centercoords — Method

centercoords any given states by shifting their individual 3d mean to the origin

source

ISOKANN.chi_exit_rate — Method

compute the chi exit rate as per Ernst, Weber (2017), chap. 3.3

source

ISOKANN.constrained_free_energy — Method

constrained_free_energy(iso, xs; sim, steps)

Compute the free energy using Thermodynamic Integration. Starting from the levelset samples xs orhtogonal simulations estimate the mean force along χ, which is integrated to yield the PMF.

Arguments

iso the Iso object. xs the starting points (which should be well distributed in state space). sim the simulation used for the orthongal sampling. steps the number of steps in each orthogonal simulation.

Returns

F the free energy energy surface of χ in kJ/mol up to an additive constant.

source

ISOKANN.delta_G — Method

delta_G(PMF,chi_vals)

Convenience function to compute free energy differences in a double well free energy surface.

source

ISOKANN.energyminimization_chilevel — Method

energyminimization_chilevel(iso, x0; f_reltol=1e-3, alphaguess=1e-5, iterations=20, show_trace=false, skipwater=false, algorithm=Optim.GradientDescent, xtol=nothing)

Local energy minimization on the current levelset of the chi function

source

ISOKANN.expectation — Method

expectation(f, xs)

Computes the expectation value of f over xs. Supports WeightedSamples through extra dispatch

source

ISOKANN.exportdata — Function

exportdata(data::AbstractArray, model, sys, path="out/data.pdb")

Export data to a PDB file.

This function takes an AbstractArray data, sorts it according to the model evaluation, removes duplicates, transforms it to standard form and saves it as a PDB file to path.

source

ISOKANN.fixperm — Method

fixperm(new, old)

Permutes the rows of new such as to minimize L1 distance to old.

Arguments

new: The data to match to the reference data.
old: The reference data.

source

ISOKANN.flatpairdists — Function

flatpairdists(x)

Assumes each col of x to be a flattened representation of multiple 3d coords. Returns the flattened pairwise distances as columns.

source

ISOKANN.flattenfirst — Method

collapse the first and second dimension of the array A into the first dimension

source

ISOKANN.growmodel — Method

Given a model and return a copy with its last layer replaced with given output dimension n

source

ISOKANN.inputdim — Method

obtain the input dimension of a Flux model

source

ISOKANN.integrate_chi — Method

integratechi(f, chivals)

Cumulative integral of the mean force with respect to χ using the trapezoid rule.

Arguments

f The mean force. chi_vals The levelset χ values.

Returns

F the (rigid) free energy surface of χ.

source

ISOKANN.load — Method

load(path::String, iso::Iso)

Load the Iso object from a JLD2 file Note that it will be loaded to the CPU, even if it was saved on the GPU. An OpenMMSimulation will be reconstructed anew from the saved pdb file.

source

ISOKANN.local_mean_force — Method

localmeanforce(iso, xs; sim, steps)

Compute the free energy using Thermodynamic Integration. Bins the samples into levelsets, computes the mean force along χ locally in every levelset. (Extremely extensive sampling necessary.)

Arguments

iso the Iso object. xs the starting points (which should be well distributed in state space). nbins The number of bins/levelsets.

Returns

F the free energy surface of χ in kJ/mol up to an additive constant.

source

ISOKANN.marginal_free_energy — Method

 marginal_free_energy(iso::Iso;nbins)

Compute the free energy from the density of chi values.

Arguments

iso the Iso object. nbins the number of bins of the histogram used for estimation.

Returns

F the free energy energy surface of χ in kJ/mol up to an additive constant.

source

ISOKANN.model_with_opt — Function

convenience wrapper returning the provided model with the default AdamW optimiser

source

ISOKANN.outputdim — Method

Obtain the output dimension of a Flux model

source

ISOKANN.pairdistfeatures — Method

pairdistfeatures(inds::AbstractVector)

Returns a featurizer function which computes the pairwise distances between the particles specified by inds

source

ISOKANN.pickclosest_sort — Method

pickclosest(haystack, needles)

Return the indices into haystack which lie closest to needles without duplicates by removing haystack candidates after a match. Note that this is not invariant under pertubations of needles

scales with n log(n) m where n=length(haystack), m=length(needles)

source

ISOKANN.picking_aligned — Method

picking_aligned(x::AbstractMatrix, m::Integer)

The picking algorithm using pairwise aligned distances, e.g. for molecular coordinates. Assumes the columnes of x to be vectors of size 3xN holding the cartesian coordinates.