Glossary

Analyte

Sample being analysed. [Wikipedia]

Bernoulli trial

A random experiment with two possible outcomes (success, fail), parameterised by a probability p of success. In the case of qPCR, success is a molecule being amplified during a given cycle, and the probability of success is the efficiency r. [Wikipedia]

Binomial distribution

The binomial distribution Binom(N, p) is the distribution of the number of successes from N Bernoulli trials with probability p. In the case of qPCR, Binom(Xi, r) represents the number of molecules amplified after cycle i when the efficiency is r. [Wikipedia]

Binomial branching process

A stochastic process used to model population as a set of discrete random variables Xi over time units i, where the transition between consecutive states is dictated by a binomial distribution. [Wikipedia]

Dilution series

A set of standards obtained by sequentially diluting the solution by a fixed factor (e.g. 10). For example, a dilution series may have initial copy numbers [100,000; 10,000; 1,000; 100]. Only the initial copy number of the first solution needs to be determined experimentally, the value of others will follow from the dilution factor (smaller by 10 each time).

Empirical covariance

The empirical covariance is a statistic computed from a set of (possibly multi-dimensional) samples. The empirical covariance of accepted samples is used in the Adaptive Metropolis Hastings algorithm to take into account the history of samples when proposing a new sample. [Wikipedia]

Heteroplasmy

Heteroplasmy represents the presence of more than one type of mtDNA within a cell. [Wikipedia]

Jeffreys prior

A Jeffreys prior is a non-informative prior distribution proportional to the square root of the determinant of the Fisher information matrix. The two choices for priors on efficiency r and noise variance σ2 used in this project are examples of Jeffreys priors (see Priors). [Wikipedia]

Mutant mtDNA

A DNA segment showing a modification (mutation) in several base pairs. These mutations may cause functional defects.

Non-linear regression

A form of regression analysis where the function used to model the data is non-linear (in this case, a sigmoidal curve). The regression attempts to minimise the sum of squared residuals between points on the model and data samples, by performing successive approximations. [Python documentation]

Posterior distribution

Probability distribution of a variable given observation data. The posterior distribution is the quantity of interest in inference on experimental data, as it provides estimates with error bars that capture our belief in experimental interpretation.

Primers

A single-stranded complementary DNA sequence that takes advantage of DNA complementarity to bind to a specific segment in the solution. For heteroplasmy measurements, primers are used to distinguish between mutant and wildtype mtDNA molecules by taking advantage of sequence particularities. Priors). [Wikipedia]

Prior distribution

Probability distribution of a model parameter that incorporates prior knowledge about the parameter. For example a prior on the efficiency of a qPCR reaction may incorporate experimental knowledge that efficiency has a lower bound of 0.6.

Proposal distribution

A probability distribution used to propose a new sample during inference. For the inference used in this project, the proposal distribution is a multivariate Gaussian distribution N(θ, Σ) centered at the current sample θ and having a covariance Σ. The choice of proposal distribution has a large impact on the mixing of the Markov Chain (see more in Analysing the Markov chain).

Standard

Quantitative analysis term used to described a sample with known properties, used for calibration. In the case of qPCR, a standard is a sample with known initial copy number X0. It is used in this project to obtain the fluorescence coefficient α that relates fluorescence intensity to molecule number.

Target molecule

Molecule of interest. In the case of heteroplasmy measurements, it may be either a mutant mtDNA (when measuring mutant load) or any mtDNA molecule (when measuring total load).

Unbiased estimate

An unbiased estimate of a parameter has the mean of the sampling distribution of the estimate equal to the true value of the parameter.

Wildtype mtDNA

Non-mutant, common, normally-functioning DNA molecule.

Abbreviations and notations

HMM

Hidden Markov Model

PMMH

Pseudo-marginal Metropolis Hastings

mtDNA

Mitochondrial DNA

qPCR

Quantitative Polymerase Chain Reaction

Binom(N , p)

Binomial distribution with N trials and success rate p

N(μ, σ2)

Normal distribution with mean μ and variance σ2