Understanding Regression Lines for Predicting English Scores
Learn how to utilize regression lines to predict English scores based on math scores, recognize the dangers of extrapolation, calculate and interpret residuals, and understand the significance of slope and y-intercept in regression analysis. Explore the process of making predictions using regression lines and how to address missing data points effectively.
Download Presentation
Please find below an Image/Link to download the presentation.
The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. Download presentation by click this link. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.
E N D
Presentation Transcript
Do Now What is the predicted English score when the math score is 10?
LESSON 2.5 REGRESSION LINES
Objectives Make predictions using regression lines, keeping in mind the dangers of extrapolation. Calculate and interpret a residual. Interpret the slope and y intercept of a regression line.
Make a prediction We have a hole in our data because someone in the group forgot to record the value for 5 rubber bands .so we are just going to make our best prediction. What do you think the lowest point could be for 5 rubber bands? Discuss with your groups and come up with a prediction
Lets Calculate it! We re going to graph a line to fit this data and use that to make a prediction next
1. Put your data into L1 and L2 (Stat-Edit) 2. Go to Stat-Calc- #8 LinReg (a+bx), enter 3. Do NOT hit enter again yet! 4. After LinReg(a+bx) comes on your screen we must enter L1,L2,Y1 after it and then hit enter To get to L1- 2nd1 To get to L2 2nd2 To get to Y1 VARS- Y-VARS-Function Y1 5. Zoom- #9
Predict using our graph On your graph 1. Hit 2ndTrace 2. Hit value 3. Enter in the x value you would like to find and hit enter 4. The y shows in the bottom right corner
When the relationship between two quantitative variables is linear, we can use a regression line to model the relationship and make predictions
Definition A regression line is a line that describes how a response variable y changes as an explanatory variable x changes. Regression lines are expressed in the form ? = a + bx, where ? (pronounced y hat ) is the predicted value of y for a given value of x.
Definition The y intercept, a is the predicted value of y when x = 0. The slope b of a regression line describes the predicted change in the y variable for each 1-unit increase in the x variable. It is very important to include the word predicted (or its equivalent) in the interpretation of the slope and y intercept.
Definition Extrapolation is the use of a regression line for prediction far outside the interval of x values used to obtain the line. Such predictions are often not accurate.
The prediction we make using the regression line is called an extrapolation
Even when we are not extrapolating, our predictions are seldom perfect. For a specific point, the difference between the actual value of y and the predicted value of y is called a residual.
Definition A residual is the difference between an actual value of y and the value of y predicted by the regression line. That is,
Roller coasters with larger maximum heights usually go faster than shorter ones. Here is a scatterplot of x = height (in feet) versus y = maximum speed (in miles per hour) for nine roller coasters that opened in 2012. The equation of the regression line for this relationship is ? = 28.17 + 0.2143?. Calculate, and interpret the residual for the Iron Shark, which has maximum height of 100 feet and a top speed of 52 miles per hour. Interpret the slope of the regression line. Does the value of the y intercept have meaning in this context? If so, interpret the y intercept. If not, explain why.
Do Now Data on x = size of a house (in square feet) and y= amount of natural use (therms) during a specified period were used to fit the least squares regression line. The slope was 0.017 and the y-intercept was -5. Houses in this data set ranged in size from 1000 to 3000 square feet. a)What is the equation of the least squares regression line? b)What would you predict for gas usage for a 2100 sq. ft. house? c)What is the approximate change in gas usage associated with a 1 sq. ft. increase is size? d)Would you use the least squares regression line to predict gas usage for a 500 sq. ft. house? Why or why not?
Do Now Complete #1-3 It s exactly what we ve been doing in class. Use that note sheet to follow directions
Lesson 2.6 Least Squares Regression Lines Objectives: Calculate the equation of the least squares regression line using technology Calculate the equation of the least squares regression line using summary statistics Describe how outliers affect the least-squares regression line
Discuss How did outliers affect our least squares regression line?
Definition The least squares regression line is the line that makes the sum of the squared residuals as small as possible
What is a residual? The magnitude of a typical residual can give us a sense of generally how close our estimates are.
What could the standard deviation of a residual tell us about our regression line? The smaller the residual standard deviation, the closer is the fit to the data. The smaller the residual standard deviation is compared to the sample standard deviation, the more predictive the model is. The standard deviation of your residuals, S, can also be thought of as the "typical" residual, so most points should be within S points from the line of best fit.
Important Ideas from the Text 1. Least squares means smallest squares of residuals 2.Outliers pull lines towards themselves 3.Outliers often change slope and y-intercept
Researchers have investigated if pomegranates antioxidant properties are useful in the treatment of cancer. One study investigated whether pomegranate fruit extract (PPE) was affective in slowing the growth of prostate cancer tumors. In this study, 24 mice were injected with cancer cells. The mice were then randomly assigned to one of three treatment groups. One group of eight mice received normal drinking water, the second group received drinking water supplemented with 0.1% PPE, and the third received drinking water supplemented with 0.2% PPE. The average tumor volume for the mice in each group was recorded at several points in time.
x y 11 150 15 270 19 450 23 580 27 740 x = number of days after injection of cancer cells y = average tumor volume (in ??3) The summary quantities necessary to calculate the equation of the least squares regression line are: ? = 95 ? = 2190 (? ?)(? ?) = 5960 (? ?)2= 160
If the goal is to learn about how birth weight is related to mothers age, which of these two variables response variable and the explanatory variable? Construct a scatterplot of these data. Would it be reasonable to use a line to summarize the relationship between birth weight and mother s age? Find the equation of the least squares regression line.