Nonlinear Regression in Chemical Kinetics Analysis
Nonlinear regression plays a crucial role in analyzing chemical kinetics, as seen in the application of the Michaelis-Menten model to Puromycin kinetics. The process involves modeling enzyme reaction velocities, linearizing the model for regression analysis, and deriving insights for further analysis and experimentation. Sarah Duclos Ivetich from ETH Zurich provides valuable insights and examples in this area of study.
Download Presentation

Please find below an Image/Link to download the presentation.
The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.
You are allowed to download the files provided on this website for personal or commercial use, subject to the condition that they are used lawfully. All files are the property of their respective owners.
The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.
E N D
Presentation Transcript
Nonlinear Regression ( ) = + , Y f x i i i Sarah Duclos Ivetich ETH Zurich, Institut f r Chemie- und Bioingenieurwissenschaften ETH H nggerberg / HCI F106 Z rich E-Mail: sarah.duclos@chem.ethz.ch https://shihlab.ethz.ch/education/Snm/statistics.html Sarah Duclos Ivetich/ Numerical Methods for Chemical Engineers / Nonlinear Regression 1
Example: Puromycin Puromycin is an antibiotic used in bio-research to select cells modified by genetic engineering The Michaelis-Menten model for enzyme kinetics relates the initial velocity of an enzymatic reaction to the substrate concentration x by the equation ( ) , f x x = 1 + x 2 At high concentrations, the velocity is essentially constant ( 1), while at low concentrations it is a linear function of the concentration Sarah Duclos Ivetich/ Numerical Methods for Chemical Engineers / Nonlinear Regression 2
Puromycin Kinetics Raw Data 220 200 180 Reaction Velocity [counts/min2] 160 140 120 The model: 100 80 x ( ) = + 1 , f x 60 x 2 40 20 0 0 0.2 0.4 0.6 0.8 1 1.2 Substrate Concentration [ppm] Sarah Duclos Ivetich/ Numerical Methods for Chemical Engineers / Nonlinear Regression 3
Model Linearization The initial model reads x ( ) = , f x 1 + x 2 This can be rearranged to 1 , f x + 1 1 x x = = + 2 2 ( ) x 1 1 1 Renaming the variables yields a linear model = + ( , ) f y y 1 2 Sarah Duclos Ivetich/ Numerical Methods for Chemical Engineers / Nonlinear Regression 4
Linearized Model: Fit Linearized Raw Data 0.022 0.02 0.018 0.016 0.014 1/f 0.012 0.01 Regression Line 1 = 0.0051072 2 = 0.00024722 0.008 0.006 0.004 0 5 10 15 20 25 30 35 40 45 50 55 1/x Sarah Duclos Ivetich/ Numerical Methods for Chemical Engineers / Nonlinear Regression 5
Regression from Linearized Model 220 200 180 Reaction Velocity [counts/min2] 160 140 Regression from linearized model 1 = 195.8 2 = 0.048407 120 100 80 60 40 20 0 0 0.2 0.4 0.6 0.8 1 1.2 Substrate Concentration [ppm] Sarah Duclos Ivetich/ Numerical Methods for Chemical Engineers / Nonlinear Regression 6
Nonlinear Regression The objective function of non-linear regression reads N obs ( ) ( ) 2 = ( , ) i f x S y i = 1 i where y is the vector of responses, x is the vector of observations, is the vector of parameters and f is the nonlinear model function In the 2-parameter case, we can plot S( ) as a function of the parameter values to look for a minimum Sarah Duclos Ivetich/ Numerical Methods for Chemical Engineers / Nonlinear Regression 7
Objective Function S() 0.1 2500 3000 5000 4000 7500 12500 0.09 2000 2500 1800 10000 2000 2500 1600 0.08 2000 3000 3000 1800 5000 1400 4000 1350 1300 0.07 7500 1800 1350 4000 1250 2500 1600 1600 1400 1250 2 5000 1200 2000 0.06 3000 2500 2000 1300 3000 1800 1350 1400 5000 Minimum 4000 7500 4000 1600 0.05 1800 5000 10000 2000 2500 2500 3000 0.04 15000 Estimated value of from linearization 7500 4000 3000 5000 12500 17500 10000 20000 4000 0.03 180 190 200 210 220 230 240 1 Sarah Duclos Ivetich/ Numerical Methods for Chemical Engineers / Nonlinear Regression 8
Minimization of S() If we linearize the model with respect to the parameters, we get ( ) ( ) 0 , , ( ) i i i i f x f x + J 0 where J( 0) is the Jacobian matrix of f with respect to , evaluated at 0 In this case, the residuals are ( 0 ( ) ( ) ( ) f y f = Analogy: = T T X X X Y ) + ( ) J y 0 If we now define the objective function in terms of the residuals, we can apply the Gauss-Newton method ( ) S = = J J ( ) T T T J 0 Sarah Duclos Ivetich/ Numerical Methods for Chemical Engineers / Nonlinear Regression 9
Gauss-Newton Method applied to S() 0.1 2500 3000 5000 4000 7500 12500 0.09 2000 2500 1800 10000 2000 2500 1600 0.08 2000 3000 3000 1800 5000 1400 4000 1350 1300 0.07 7500 1800 1350 4000 1250 2500 1600 1600 1400 1250 2 5000 1200 2000 0.06 3000 2500 2000 1300 3000 1800 1350 1400 5000 Convergence path of Gauss- 4000 7500 4000 1600 0.05 5000 Newton Method ( 1)opt = 212.66 ( 2)opt = 0.064091 1800 10000 2000 2500 2500 3000 0.04 15000 7500 4000 3000 5000 12500 17500 10000 20000 4000 0.03 180 190 200 210 220 230 240 1 Sarah Duclos Ivetich/ Numerical Methods for Chemical Engineers / Nonlinear Regression 10
Nonlinear Regression 220 200 180 Reaction Velocity [counts/min2] 160 Nonlinear Regression 140 Regression from linearized model 120 100 80 60 40 20 0 0 0.2 0.4 0.6 0.8 1 1.2 Substrate Concentration [ppm] Sarah Duclos Ivetich/ Numerical Methods for Chemical Engineers / Nonlinear Regression 11
Matlab Nonlinear Regression Routines There are several options for doing nonlinear regression: lsqcurvefit: Best for quickly fitting models to data; Not much statistical output; Provides more numerical information and options; Can use bounded parameter ranges nlinfit: General nonlinear regression routine; Provides some statistical output NonLinearModel: Most useful for nonlinear regression; All statistical and numerical tools in one place; Works analogously to LinearModel fitnlm: new version of NonLinearModel (since 2013b) Sarah Duclos Ivetich/ Numerical Methods for Chemical Engineers / Nonlinear Regression 12
Using the Nonlinear Regression Routines First we need to create a function that takes as inputs a vector b, the model parameters, and a vector (or matrix) x, the data points of the observed variables, and returns as output a vector (or matrix) y, the model responses Then we use one of the nonlinear regression routines: Sarah Duclos Ivetich/ Numerical Methods for Chemical Engineers / Nonlinear Regression 13
Plotting the Fitted Curves Plotting is as simple as calling the cost function or the predict routine of the NonLinearModel 220 200 180 160 140 120 100 80 60 40 0 0.2 0.4 0.6 0.8 1 1.2 1.4 Sarah Duclos Ivetich/ Numerical Methods for Chemical Engineers / Nonlinear Regression 14
Tukey-Anscombe Plots Plot of residuals vs. fitted values Plot of residuals vs. fitted values 30 25 20 15 Residuals 10 5 0 -5 -10 -15 40 60 80 100 120 Fitted values 140 160 180 200 220 Sarah Duclos Ivetich/ Numerical Methods for Chemical Engineers / Nonlinear Regression 15
Normal Probability Plot Normal Probability Plot 0.95 0.90 0.75 Probability 0.50 0.25 0.10 0.05 -10 -5 0 5 Data 10 15 20 25 Sarah Duclos Ivetich/ Numerical Methods for Chemical Engineers / Nonlinear Regression 16
Estimation of the Confidence Intervals Sarah Duclos Ivetich/ Numerical Methods for Chemical Engineers / Nonlinear Regression 17
Assignment 1 1. Import the data relative to the COVID19 pandemics in Switzerland (covid_19_data_switzerland.csv source: https://corona-data.ch/). https://corona-data.ch/ 2. Plot the data and select only those relative to the first wave (day 2 to 25). Consider first the following exponential model: ? = ???? Sarah Duclos Ivetich/ Numerical Methods for Chemical Engineers / Nonlinear Regression 18
Assignment 1 3. Linearize the model and perform a linear regression using fitlm. Transform back the obtained regression coefficients to have an initial estimate for ? and ?. 4. Use the previously regressed coefficients to fit the non linear model in Matlab by using fitnlm. 5. For each model plot the fitted curves against the data and analyze the TA and QQ plots. How are the two model compared to each other? Consider now the following model: ? = ????+ ? Sarah Duclos Ivetich/ Numerical Methods for Chemical Engineers / Nonlinear Regression 19
Assignment 1 (Continued) 6. Perform a non linear regression using the updated model. What coefficients are significant? Did the model improve the TA and QQ plots? Sarah Duclos Ivetich/ Numerical Methods for Chemical Engineers / Nonlinear Regression 20
Assignment 2 An experiment was conducted to evaluate the effect of two different methionin nutrition supplements (sourceA and sourceB) on the growth of newborn turkeys. The turkeys were separated into 10 experimental units with 15 birds each. For each supplement, the amount fed was varied between 0.04% and 0.44% of the total food. The target variable, the mean body weight of the units, is expressed as a function of the dosage of the two supplements, according to the following non-linear model ( 1 2 1 exp ) ( ) = + + A i B i Y x x 3 4 i where Y is the mean weight, and xAand xBare the dosages of the two supplements, respectively. Sarah Duclos Ivetich/ Numerical Methods for Chemical Engineers / Nonlinear Regression 21
Exercise 2 (Continued) ( ) ( ) 1 exp = + + A i B i Y x x 1 2 3 4 i The model assumes that the intercept 1and the horizontal asymptote 1+ 2are the same for both supplements, and that only the growth rates 3* 4and 3for supplements A and B, respectively, are different. The question is whether one of the supplements is superior, i.e. whether 4is significantly different from unity. Sarah Duclos Ivetich/ Numerical Methods for Chemical Engineers / Nonlinear Regression 22
Assignment 2 1. Find online the data file turkeyweight.dat and some code to load the data into Matlab. 2. Find suitable initial values for the parameters by linearizing the model. If you cannot solve this use 1= 638, 2= 175, 3= 5 and 4= 1. 3. Write a model function of the form function y = turkeymodel(theta, x) 4. Perform the nonlinear regression, using nlinfit or NonLinearModel. 5. Are the growth rates for the two supplements significantly different, i.e. is 4significantly different from unity? Use either nlparci or coefCI to estimate the confidence intervals. Sarah Duclos Ivetich/ Numerical Methods for Chemical Engineers / Nonlinear Regression 23