Bioprocess Engineering

Fed-Batch Bioreactor Parameter Estimation

Mechanistic modeling and inverse-problem calibration for mammalian cell culture

Introduction

Fed-batch bioreactors are the workhorse of biopharmaceutical manufacturing, producing monoclonal antibodies, therapeutic proteins, and vaccines in mammalian cell cultures such as CHO (Chinese Hamster Ovary) cells. Process development and optimization require accurate mathematical models to predict cell growth, substrate consumption, metabolite accumulation, and product formation under varying operating conditions.

The core challenge is parameter estimation: given sparse, noisy measurements from online sensors (pH, dissolved oxygen) and offline assays (cell counts, glucose, lactate, titer), how do we calibrate the unknown kinetic and stoichiometric parameters of a mechanistic ODE model?

Why this matters: Accurate parameter estimation enables digital twin modeling for scale-up prediction, feed strategy optimization, and real-time process monitoring—reducing experimental burden and accelerating drug development timelines in compliance with FDA PAT (Process Analytical Technology) guidance.

Science Background: Monod Kinetics and Mass Balance

Mammalian cell growth in fed-batch culture follows Monod kinetics with dual substrate limitation (glucose and glutamine) and product inhibition (lactate and ammonia). The governing processes are:

Mathematical Model: ODE System

The mechanistic model consists of 8 ordinary differential equations describing the temporal evolution of state variables:

States: $$\begin{aligned} X_v &= \text{viable biomass [g/L]} \\ \text{Glc} &= \text{glucose [g/L]} \\ \text{Gln} &= \text{glutamine [mM]} \\ \text{Lac} &= \text{lactate [g/L]} \\ \text{Amm} &= \text{ammonia [mM]} \\ \text{mAb} &= \text{product titer [g/L]} \\ V &= \text{reactor volume [L]} \end{aligned}$$ ODEs: $$\begin{aligned} \frac{dX_v}{dt} &= \mu \cdot X_v - k_d \cdot X_v - D \cdot X_v \\ \frac{d\text{Glc}}{dt} &= -q_{\text{Glc}} \cdot X_v - D \cdot (\text{Glc} - \text{Glc}_{\text{feed}}) \\ \frac{d\text{Gln}}{dt} &= -q_{\text{Gln}} \cdot X_v - D \cdot (\text{Gln} - \text{Gln}_{\text{feed}}) \\ \frac{d\text{Lac}}{dt} &= q_{\text{Lac}} \cdot X_v - D \cdot \text{Lac} \\ \frac{d\text{Amm}}{dt} &= q_{\text{Amm}} \cdot X_v - D \cdot \text{Amm} \\ \frac{d\text{mAb}}{dt} &= q_{\text{mAb}} \cdot X_v - D \cdot \text{mAb} \\ \frac{dV}{dt} &= F(t) \end{aligned}$$ Specific rates: $$\begin{aligned} \mu &= \mu_{\max} \cdot \frac{\text{Glc}}{K_{s,\text{Glc}}+\text{Glc}} \cdot \frac{\text{Gln}}{K_{s,\text{Gln}}+\text{Gln}} \cdot \frac{K_{i,\text{Lac}}}{K_{i,\text{Lac}}+\text{Lac}} \cdot \frac{K_{i,\text{Amm}}}{K_{i,\text{Amm}}+\text{Amm}} \\ q_{\text{Glc}} &= \frac{\mu}{Y_{x/\text{Glc}}} + m_{\text{Glc}} \\ q_{\text{Gln}} &= \frac{\mu}{Y_{x/\text{Gln}}} + m_{\text{Gln}} \\ q_{\text{Lac}} &= \alpha_{\text{Lac}} \cdot \mu + \beta_{\text{Lac}} \\ q_{\text{Amm}} &= \alpha_{\text{Amm}} \cdot q_{\text{Gln}} \\ q_{\text{mAb}} &= \alpha_{\text{mAb}} \cdot \mu + \beta_{\text{mAb}} \\ D &= \frac{F(t)}{V} \end{aligned}$$

Unknown Parameters (14 total)

Parameter Description Units Typical Range
μmaxMaximum specific growth rateh⁻¹0.02 – 0.05
KsGlc, KsGlnMonod saturation constantsg/L, mM0.01 – 0.5
KiLac, KiAmmInhibition constantsg/L, mM10 – 50
YxGlc, YxGlnYield coefficientsg/g, g/mmol0.1 – 0.5
mGlc, mGlnMaintenance coefficientsg/g/h, mmol/g/h0.001 – 0.01
αLac, βLacLactate formation (Luedeking-Piret)g/g, g/g/h0.5 – 2.0
αmAb, βmAbProduct formationmg/g, mg/g/h1 – 10
kdDeath rate constanth⁻¹0.001 – 0.01

Parameter Estimation Problem

Given experimental measurements from a fed-batch run:

The inverse problem is formulated as nonlinear least squares:

Minimize: $$J(\theta) = \sum_{i} \left[ \frac{(y_{\text{obs},i} - y_{\text{model},i}(\theta))^2}{\sigma^2_i} \right]$$ Where: $$\begin{aligned} \theta &= [\mu_{\max}, K_{s,\text{Glc}}, K_{s,\text{Gln}}, K_{i,\text{Lac}}, K_{i,\text{Amm}}, Y_{x/\text{Glc}}, Y_{x/\text{Gln}}, \ldots, k_d] \\ y_{\text{obs}} &= \text{experimental measurements } [X_v, \text{Glc}, \text{Gln}, \text{Lac}, \text{Amm}, \text{mAb}] \\ y_{\text{model}}(\theta) &= \text{model predictions from ODE integration} \\ \sigma^2_i &= \text{measurement variance (assay noise)} \end{aligned}$$
Maximum likelihood interpretation: Under Gaussian measurement noise, minimizing weighted least squares is equivalent to maximizing the likelihood function, yielding parameter estimates θ̂ and uncertainty quantification via the Fisher Information Matrix.

Simulation Framework

To test estimation algorithms, we generate synthetic experimental data:

  1. Define nominal parameters: Use literature values or pilot-scale data as ground truth
  2. Specify feed profile: Exponential or constant glucose/glutamine feed starting at t = 48–72 h
  3. Integrate ODEs: Forward simulation with ODE solver (LSODA, Radau) to generate "true" trajectories
  4. Add measurement noise:
    • Xv: ±5% coefficient of variation (CV)
    • Glc, Lac: ±3% CV (enzymatic assays)
    • Gln, Amm: ±5% CV (HPLC variability)
    • mAb: ±2% CV (Protein A accuracy)
  5. Sparse sampling: Extract data at realistic time points (every 12–24 h, plus t=0)

Estimation Workflow

1. Nonlinear Least Squares (NLS)

Use Levenberg-Marquardt or Trust-Region-Reflective optimization with:

2. Extended Kalman Filter (EKF)

For online state and parameter estimation:

$$\begin{aligned} \text{Augment state vector: } & x_{\text{aug}} = [X_v, \text{Glc}, \text{Gln}, \text{Lac}, \text{Amm}, \text{mAb}, V, \theta] \\[0.5em] \text{Prediction: } & \hat{x}_{k|k-1} = f(\hat{x}_{k-1|k-1}, u_k) \\[0.5em] \text{Update: } & \hat{x}_{k|k} = \hat{x}_{k|k-1} + K_k (y_k - h(\hat{x}_{k|k-1})) \\[0.5em] \text{Kalman gain: } & K_k = P_{k|k-1} H^T (H P_{k|k-1} H^T + R)^{-1} \\[1em] \text{Process noise } Q&: \text{ accounts for model mismatch} \\ \text{Measurement noise } R&: \text{ assay covariance matrix} \end{aligned}$$

3. Unscented Kalman Filter (UKF)

Superior for highly nonlinear systems—uses sigma-point propagation instead of linearization. Recommended for fed-batch models with strong substrate-growth coupling.

Identifiability Analysis

Not all 14 parameters can be uniquely estimated from the available data. Identifiability is assessed via:

Practical identifiability criteria:

Recommendation: Fix maintenance coefficients (mGlc, mGln) and death rate (kd) from independent experiments. Estimate the 11 remaining parameters (growth, uptake yields, inhibition, product formation) from fed-batch data.

Validation Strategy

Validation Test Method Acceptance Criterion
Parameter recovery Synthetic data with known θ_true |θ̂ - θ_true|/θ_true < 10%
Cross-validation Fit on runs 1–3, validate on run 4 RMSE(Xv) < 0.5 g/L, RMSE(mAb) < 0.3 g/L
Prediction accuracy Forecast final titer from t=72h data Prediction error < 15%
Residual diagnostics Durbin-Watson, Shapiro-Wilk tests No autocorrelation, residuals normal

Industrial Value

Mechanistic parameter estimation delivers quantifiable ROI:

Case study: A biopharmaceutical company reduced process development time by 6 months by replacing empirical DoE with mechanistic modeling and parameter estimation, accelerating a biosimilar program from IND to Phase I.

Try ProcessLM

ProcessLM automates the entire workflow: you describe your fed-batch system in natural language, and it generates the ODE model, runs parameter estimation with cross-validation, and produces calibration reports—no MATLAB scripting required.

Request Early Access