Simulation Algorithms

There are five algorithms that can be run.

Non-spatial algorithms:
- "Branching". A branching process based on the single progenitor model from Clayton, et al. "A single type of progenitor cell maintains normal epidermis." Nature 446.7132 (2007): 185-189.
- "Moran". A Moran-style model. At each simulation step, one cell dies and another cell divides, maintaining the overall population.
- "WF". A Wright-Fisher style model. At each simulation step an entire generation of cells is produced from the previous generation.

2D algorithms:
- "Moran2D". A Moran-style model constrained to a 2D hexagonal grid. At each simulation step, one cell dies and a cell from an adjacent location in the grid divides.
- "WF2D". A Wright-Fisher style model constrained to a 2D hexagonal grid. At each simulation step an entire generation of cells is produced from the previous generation, where cell parents must be from the local neighbourhood in the grid.

In all of these models, there is a fixed division/generation rate. Fitness changes alter the cell fate (i.e. fitter cells produce more cells that will go on to divide).

This guide gives a quick explanation of each algorithm. For a brief discussion of how the models compare in terms of the rate of drift and the patterns of non-neutral clone growth, see the introduction chapter of this thesis: https://www.repository.cam.ac.uk/handle/1810/333970

This guide does not go into much detail on how to run the simulations. See the other tutorial guides for the parameter settings and functions used here.

Branching

This is based on the single progenitor model.

By default, only proliferative cells are simulated. Differentiated cells in the basal layer can be accounted for by scaling the clone size distribution of proliferative cells only or by simulating the differentiated cells too (see the guide on simulating differentiated cells).

Cell fitness alters the cell fate bias. A fitness of 1 means an equal proportion of cell divisions and cell deaths, so the total size of the cell population remains constant on average.
A lower fitness reduces the proportion of cell divisions and increases cell death, so the population size will decline.
A higher fitness increases the proportion of cell divisions and decreases cell death, so the population size will grow.

This is the only algorithm in which the total cell population can vary.

import numpy as np
import matplotlib.pyplot as plt
from clone_competition_simulation import Parameters, PopulationParameters, TimeParameters


# Branching simulation starting from 1000 single-cell clones, all with fitness 1. 
p = Parameters(
    algorithm="Branching", 
    population=PopulationParameters(initial_size_array=np.ones(1000)), 
    times=TimeParameters(max_time=10, division_rate=1)
)
s = p.get_simulator()
s.run_sim()
s.muller_plot(figsize=(5, 5), allow_y_extension=True)
plt.show()

The change in the total population over the course of the simulation can be plotted:

s.plot_overall_population()
plt.show()

There is no limit on population size. Fitness values above 1 will result in exponential growth. Here, each of the initial single-cell clones has a fitness of 1.1.

from clone_competition_simulation import FitnessParameters

p = Parameters(
    algorithm="Branching", 
    population=PopulationParameters(initial_size_array=np.ones(100)),
    times=TimeParameters(max_time=10, division_rate=1),
    fitness=FitnessParameters(fitness_array=np.full(100, 1.1)), 
)
s = p.get_simulator()
s.run_sim()
s.muller_plot(figsize=(5, 5), allow_y_extension=True)
plt.show()

The fitness values range between 0 (all cells die, none divide to form new cells) and 2 (all cells divide to form two new cells and no cells die). Any fitness values above 2 have the same effect as a fitness of 2.
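
As a rough illustration of how fitness could translate into cell fate, the sketch below uses plain NumPy rather than the package itself. It assumes a linear mapping from fitness to division probability (fitness / 2, capped at 1) and synchronous generations. Both are simplifications chosen to match the limits described above, and are not necessarily how the Branching algorithm is implemented. The names division_probability and branching_generation are made up for this example.

```python
import numpy as np


def division_probability(fitness):
    """Chance that a cell divides rather than dies.

    Assumed linear mapping: 0 -> 0.0, 1 -> 0.5, 2 or more -> 1.0.
    """
    return np.clip(np.asarray(fitness, dtype=float), 0.0, 2.0) / 2.0


def branching_generation(n_cells, fitness, rng):
    """Advance one synchronous generation of a simple branching process.

    Each cell either divides (leaving two cells) or is lost (leaving none).
    """
    p_divide = float(division_probability(fitness))
    n_dividing = rng.binomial(n_cells, p_divide)
    return int(2 * n_dividing)


rng = np.random.default_rng(0)
for fitness in (0.9, 1.0, 1.1, 2.0, 100.0):
    n = 1000
    for _ in range(5):
        n = branching_generation(n, fitness, rng)
    print(f"fitness={fitness}: {n} cells after 5 generations")
```

Under this assumed mapping, fitness values of 2 and 100 give identical growth because both are capped at a division probability of 1.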

# A fitness of 2 leads to rapid growth
np.random.seed(0)
p = Parameters(
    algorithm="Branching",
    population=PopulationParameters(initial_cells=100),
    times=TimeParameters(max_time=3, division_rate=1),
    fitness=FitnessParameters(fitness_array=[2]),
)
s = p.get_simulator()
s.run_sim()
s.muller_plot(figsize=(5, 5), allow_y_extension=True)
plt.show()

# Increasing the fitness further has no additional effect 
np.random.seed(0)
p = Parameters(
    algorithm="Branching",
    population=PopulationParameters(initial_cells=100),
    times=TimeParameters(max_time=3, division_rate=1),
    fitness=FitnessParameters(fitness_array=[100]),
)
s = p.get_simulator()
s.run_sim()
s.muller_plot(figsize=(5, 5), allow_y_extension=True)
plt.show()

Because the population size is unlimited, simulations can grow very large, take a long time to finish, and use too much memory, especially if additional mutations are added during the simulation.

There is an option to stop the simulation if the population ever exceeds a set limit.
This does not limit the population in the sense of a carrying capacity; it just raises an error and stops the simulation early.

p = Parameters(
    algorithm="Branching",
    population=PopulationParameters(
        initial_size_array=np.ones(100), 
        population_limit=30000  # Set the population limit here 
    ),
    times=TimeParameters(max_time=3, division_rate=1),
    fitness=FitnessParameters(fitness_array=np.full(100, 2))
)
s = p.get_simulator()
s.run_sim()

OverPopulationError: Ending early as population limit exceeded
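
The difference between this kind of limit and a carrying capacity can be pictured with a small standalone sketch. It does not use the package: PopulationLimitExceeded and grow_with_limit are hypothetical names, and the population simply doubles each generation, as it would if every cell divided.

```python
class PopulationLimitExceeded(Exception):
    """Hypothetical stand-in for the simulator's own error type."""


def grow_with_limit(initial_cells, generations, population_limit):
    """Double the population each generation, aborting if it passes the limit."""
    n = initial_cells
    history = [n]
    for _ in range(generations):
        n *= 2  # every cell divides and none die
        if n > population_limit:
            # Growth is not slowed or capped; the run is simply abandoned.
            raise PopulationLimitExceeded(
                f"population {n} exceeded limit {population_limit}"
            )
        history.append(n)
    return history


try:
    grow_with_limit(initial_cells=100, generations=20, population_limit=30000)
except PopulationLimitExceeded as err:
    print(f"Stopped early: {err}")
```

Growth below the limit is unaffected, so the limit only acts as a safeguard against runaway simulations.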

Moran

At each step in this model one cell dies and another cell divides to replace it. The overall cell population remains constant.
The cells to die and divide are randomly selected at each step. One cell could divide many times before another has done anything.

The higher a cell's fitness compared to the rest of the cell population, the more likely it is to divide.
There is no upper limit on cell fitness.
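
A single Moran step can be sketched in plain NumPy, tracking clone sizes rather than individual cells. This is only an illustration under stated assumptions: the dying cell is chosen uniformly first and the dividing cell is then chosen from the survivors with probability proportional to fitness. The package may order or weight these choices differently. The function name moran_step is made up for this example, and it expects at least two cells and positive fitness values.

```python
import numpy as np


def moran_step(clone_sizes, fitness, rng):
    """Apply one death and one fitness-weighted division to clone counts."""
    sizes = clone_sizes.copy()
    # The dying cell is picked uniformly from all cells,
    # so a clone is hit in proportion to its size.
    dying = rng.choice(len(sizes), p=sizes / sizes.sum())
    sizes[dying] -= 1
    # The dividing cell is picked from the surviving cells, weighted by fitness.
    weights = sizes * fitness
    dividing = rng.choice(len(sizes), p=weights / weights.sum())
    sizes[dividing] += 1
    return sizes


rng = np.random.default_rng(0)
sizes = np.array([50, 50])
fitness = np.array([1.0, 1.5])
for _ in range(2000):
    sizes = moran_step(sizes, fitness, rng)
print("clone sizes:", sizes, "total:", sizes.sum())
```

Each step removes one cell and adds one cell, so the total stays at 100 however the clone sizes drift.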

# Moran simulation starting from 1000 single-cell clones, all with fitness 1. 
# The total population size is fixed. 
p = Parameters(
    algorithm="Moran",
    population=PopulationParameters(initial_size_array=np.ones(1000)),
    times=TimeParameters(max_time=10, division_rate=1),
)
s = p.get_simulator()
s.run_sim()
s.muller_plot(figsize=(5, 5), allow_y_extension=True)
plt.show()

Wright-Fisher

At each step in this model, the entire population of cells is replaced.
Each cell "picks a parent" from the previous generation at random.

The higher a cell's fitness compared to the rest of the cell population, the more likely it is to be picked to be a parent.
There is no upper limit on cell fitness, and the only limit on the number of offspring a single cell can have in one generation is the total population size.
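
In terms of clone sizes, "picking parents" in proportion to fitness amounts to a multinomial draw. The sketch below shows this in plain NumPy. It is an illustration of the general Wright-Fisher idea rather than the package's own code, and the function name wright_fisher_generation is made up for this example.

```python
import numpy as np


def wright_fisher_generation(clone_sizes, fitness, rng):
    """Draw the next generation's clone sizes from the current ones."""
    # A clone's chance of parenting each new cell is proportional to
    # its current size multiplied by its fitness.
    weights = clone_sizes * fitness
    return rng.multinomial(clone_sizes.sum(), weights / weights.sum())


rng = np.random.default_rng(0)
sizes = np.array([900, 100])
fitness = np.array([1.0, 1.2])
for generation in range(10):
    sizes = wright_fisher_generation(sizes, fitness, rng)
print("clone sizes:", sizes, "total:", sizes.sum())
```

The draw always returns the same total number of cells, and a clone that reaches size zero has zero weight and so cannot reappear.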

# Wright-Fisher simulation starting from 1000 single-cell clones, all with fitness 1. 
# The total population size is fixed. 
p = Parameters(
    algorithm="WF", 
    population=PopulationParameters(initial_size_array=np.ones(1000)),
    times=TimeParameters(max_time=10, division_rate=1)
)
s = p.get_simulator()
s.run_sim()
s.muller_plot(figsize=(5, 5), allow_y_extension=True)
plt.show()

This Muller plot looks less smooth than the Branching and Moran plots. This is because the WF populations are only calculated once per generation, whereas the other algorithms can update the populations as often as after every individual cell division.

2D simulations: Moran2D and WF2D

The Moran2D and WF2D models are the Moran and WF models constrained to a 2D hexagonal grid with periodic boundary conditions.

In Moran2D, at each step a cell dies and one of the neighbouring cells divides to fill the vacated space in the grid.
In WF2D, the parent of each cell is drawn from the immediate neighbourhood of that cell's grid position in the previous generation.

# A simulation on a small 6x6 grid. 
p = Parameters(
    algorithm='Moran2D', 
    population=PopulationParameters(initial_cells=36, cell_in_own_neighbourhood=False),
    times=TimeParameters(max_time=1, division_rate=1)
)
s = p.get_simulator()


# Will colour a cell and its neighbourhood to show the hexagonal neighbours. 
from clone_competition_simulation.simulation_algorithms.general_2D_class import get_neighbour_coords_2D
row, col = 2, 3
s.grid[row, col] = 1
neighbours = get_neighbour_coords_2D(s, row, col)
s.grid[neighbours[:, 0], neighbours[:, 1]] = 2
s.plot_grid(grid=s.grid, figsize=(3, 3))
plt.show()

For the Moran process, it makes sense that the replacement cell comes from one of the 6 surrounding cells. To use this six-cell neighbourhood, use the option cell_in_own_neighbourhood=False.
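
To make the two neighbourhood sizes concrete, here is a standalone sketch of hexagonal neighbours on a rectangular array with periodic boundaries. It assumes an "odd-r" offset layout (odd rows shifted half a cell to the right) and an even number of rows so that the wrap-around is consistent. The package's own grid layout may differ. The function hex_neighbours and its include_self flag are hypothetical; the flag only mirrors the idea behind cell_in_own_neighbourhood.

```python
import numpy as np

# (row, column) offsets to the six neighbours in an "odd-r" offset layout.
EVEN_ROW_OFFSETS = [(0, -1), (0, 1), (-1, -1), (-1, 0), (1, -1), (1, 0)]
ODD_ROW_OFFSETS = [(0, -1), (0, 1), (-1, 0), (-1, 1), (1, 0), (1, 1)]


def hex_neighbours(row, col, shape, include_self=False):
    """Return neighbour coordinates as an (n, 2) array, wrapping at the edges."""
    n_rows, n_cols = shape
    offsets = EVEN_ROW_OFFSETS if row % 2 == 0 else ODD_ROW_OFFSETS
    coords = [((row + dr) % n_rows, (col + dc) % n_cols) for dr, dc in offsets]
    if include_self:
        coords.append((row, col))  # seven-cell neighbourhood
    return np.array(coords)


grid_shape = (6, 6)
print(hex_neighbours(2, 3, grid_shape))                            # six cells
print(hex_neighbours(0, 0, grid_shape, include_self=True))         # wraps round
```

With include_self=False a cell can only be replaced by one of its six neighbours, while include_self=True also allows the cell at the same position to be chosen.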

p = Parameters(
    algorithm="Moran2D", 
    population=PopulationParameters(initial_grid=np.arange(30**2).reshape(30, 30), 
                                    cell_in_own_neighbourhood=False),
    times=TimeParameters(max_time=10, division_rate=1)
)
s = p.get_simulator()
s.run_sim()
s.muller_plot(figsize=(5, 5), allow_y_extension=True)
plt.show()

In the Wright-Fisher model, where the new cell comes from the previous generation, it is also reasonable to assume that the cell in the same grid location in the previous generation can divide to fill the space.
To use this seven-cell neighbourhood, use the option cell_in_own_neighbourhood=True.

p = Parameters(
    algorithm="WF2D", 
    population=PopulationParameters(initial_grid=np.arange(30**2).reshape(30, 30), 
                                    cell_in_own_neighbourhood=True),
    times=TimeParameters(max_time=10, division_rate=1)
)
s = p.get_simulator()
s.run_sim()
s.muller_plot(figsize=(5, 5), allow_y_extension=True)
plt.show()

You can, however, use either neighbourhood option with either algorithm.