Data Analysis and Regression Quiz Overview

 
UNGRADED QUIZ
 
Unit 5
 
Show me your fingers
 
Do not shout out the answer, or your classmates will
follow what you said.
Use your fingers
One finger (the right finger) = A
Two fingers = B
Three fingers = C
Four fingers = D
No finger = I don’t know. I didn’t study
 
Which of the following is NOT a problem
of traditional OLS regression?
 
A.
Too many assumptions about the residuals and the
predictors
B.
It tends to overfit to the sample.
C.
The model is unstable when some predictors are strongly
correlated (collinearity)
D.
There is only one unique solution when the sample size is
large.
E.
It must be a linear model.
 
Which of the following statements is NOT a
characteristic of generalized regression?
 
A.
Similar to abduction or IBE: don't fix on one single answer,
consider a few.
B.
It imposes penalty on the model to reduce complexity
C.
Start with a full model and then scale back the
model.
D.
It is also known as regularized regression.
 
Which of the following is NOT an option
in JMP’s generalized regression?
 
A.
Elastic Net
B.
Lasso
C.
Double Lasso
D.
Ridge
E.
Dante Selector
 
Elastic net combines the penalty
methods of ______.
 
A.
Double lasso and Ridge
B.
Lasso and Ridge
C.
Double Lasso and Dantzig selector
D.
Ridge and Dantzig selector
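
For reference, here is a minimal sketch of an elastic net penalty written as a weighted mix of an L1 term (sum of absolute coefficients) and an L2 term (sum of squared coefficients). The names `lam` and `alpha` and the 1/2 factor on the L2 term follow one common convention and are assumptions here; JMP may scale these terms differently.

```python
def l1_penalty(coefs):
    """Sum of absolute coefficient values (the L1 term)."""
    return sum(abs(b) for b in coefs)


def l2_penalty(coefs):
    """Sum of squared coefficient values (the L2 term)."""
    return sum(b * b for b in coefs)


def elastic_net_penalty(coefs, lam, alpha):
    """Weighted mix of the L1 and L2 terms.

    alpha = 1 leaves only the L1 term; alpha = 0 leaves only the L2 term.
    The 0.5 factor on the L2 term is an assumed convention, not taken from the slides.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be between 0 and 1")
    return lam * (alpha * l1_penalty(coefs) + (1.0 - alpha) * 0.5 * l2_penalty(coefs))


# Hypothetical coefficients, for illustration only.
betas = [1.0, -2.0, 0.5]
print(elastic_net_penalty(betas, lam=2.0, alpha=0.5))
```

Moving `alpha` between 0 and 1 shifts the penalty from one term to the other, which is why a single function can cover both extremes.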
 
AIC or AICc is better than BIC because
 
A.
AIC and AICc are based on the principle of
information loss.
B.
The Bayesian approach requires a prior input but
usually it is debatable.
C.
AIC is asymptotically optimal in model selection in
terms of the root mean square error, but BIC is not
asymptotically optimal.
D.
All of the above.
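
For reference, the three criteria named in this question can be computed from a fitted model's log-likelihood. The sketch below uses the standard textbook definitions; the log-likelihood, parameter count, and sample size are hypothetical values chosen only for illustration.

```python
import math


def aic(log_likelihood, k):
    """Akaike information criterion: 2k - 2 ln(L)."""
    return 2 * k - 2 * log_likelihood


def aicc(log_likelihood, k, n):
    """AIC with the small-sample correction term 2k(k+1) / (n - k - 1)."""
    if n - k - 1 <= 0:
        raise ValueError("AICc needs n > k + 1")
    return aic(log_likelihood, k) + (2 * k * (k + 1)) / (n - k - 1)


def bic(log_likelihood, k, n):
    """Bayesian information criterion: k ln(n) - 2 ln(L)."""
    return k * math.log(n) - 2 * log_likelihood


# Hypothetical model: log-likelihood -100, 3 parameters, 50 observations.
ll, k, n = -100.0, 3, 50
print(aic(ll, k), aicc(ll, k, n), bic(ll, k, n))
```

For all three, lower values indicate a preferred model. The criteria differ only in how heavily they penalize the number of parameters `k`.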
 
In one of the leaves (partitioned group) of a
tree, there are 9 males and 1 female.
According to GINI, this group is
 
A.
Highly impure
B.
Highly pure
C.
Neither pure nor impure; insufficient information
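
To check the arithmetic behind this question, here is a minimal sketch that assumes "GINI" means the standard Gini impurity, 1 minus the sum of squared class proportions. The function name is hypothetical.

```python
def gini_impurity(counts):
    """Gini impurity 1 - sum(p_i^2) for a list of class counts."""
    total = sum(counts)
    if total == 0:
        raise ValueError("counts must contain at least one observation")
    return 1.0 - sum((c / total) ** 2 for c in counts)


leaf = gini_impurity([9, 1])   # the leaf described in the question
even = gini_impurity([5, 5])   # an evenly mixed two-class leaf, for comparison
print(f"9 vs 1: {leaf:.2f}   5 vs 5: {even:.2f}")
```

With two classes the impurity ranges from 0 (one class only) to 0.5 (an even split), so the value for the 9-to-1 leaf can be read against that scale.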
 
Which of the following statements
about the leaf report is UNTRUE?
 
A.
It is based upon the nested-if logic.
B.
It shows interactions of variables
C.
The probability of each scenario is the same as the
p value in hypothesis testing.
 
Which of the following about the ROC
curve is UNTRUE?
 
A.
It originated from Signal Detection Theory during
WWII.
B.
The area under the curve indicates the predictive
accuracy.
C.
When the AUC value is .6 or above, it is considered
acceptable.
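
One way to read the AUC is as the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative case. The sketch below computes it by comparing every positive-negative pair. The labels and scores are made-up illustration data, and the function name is hypothetical.

```python
def auc_by_pairs(labels, scores):
    """AUC as the share of positive-negative pairs ranked correctly (ties count half)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Made-up data: 1 = event, 0 = non-event, with predicted scores.
labels = [1, 1, 0, 0]
scores = [0.9, 0.4, 0.6, 0.1]
print(auc_by_pairs(labels, scores))
```

A value of 0.5 corresponds to random ranking and 1.0 to perfect separation, which gives context for any threshold used to call a model acceptable.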
 
Which of the following statements is
TRUE?
 
A.
If the outcome variable is categorical, LogWorth (the
likelihood ratio of chi-square) is reported.
B.
LogWorth is the inverse of the p value.
C.
Like the p value, a lower LogWorth is better.
D.
LogWorth is a partitioned criterion based on
impurity.
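
For reference, JMP defines LogWorth as the negative base-10 logarithm of the p value. The sketch below applies that definition to a few hypothetical p values; the function name is an assumption for illustration.

```python
import math


def logworth(p_value):
    """LogWorth = -log10(p value)."""
    if not 0.0 < p_value <= 1.0:
        raise ValueError("p_value must be in (0, 1]")
    return -math.log10(p_value)


# Hypothetical p values, for illustration only.
for p in (0.05, 0.01, 0.001):
    print(f"p = {p}: LogWorth = {logworth(p):.2f}")
```

Because of the negative logarithm, smaller p values map to larger LogWorth values.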
 
In JMP if you uncheck informative missing
before generating a partition tree, what
would happen?
 
A.
The missing data will be deleted in a listwise way.
B.
The missing data will be deleted in a casewise way.
C.
The missing data will be imputed.
 
Which of the following is NOT an option
in SPSS’s classification trees?
 
A.
C4.5
B.
CHAID
C.
QUEST
D.
CRT
 
Which of the following is NOT a drawback of
SPSS’s classification trees?
 
A.
The graphs of JMP are dynamic and interactive whereas
the counterparts of SPSS are static.
B.
The interactive model outline in SPSS shows much less
detail than the tree in JMP.
C.
The hierarchy of the SPSS tree might not correspond to
the rank of predictor importance.
D.
Neither viewer nor model outline allows tree-pruning.

This quiz covers topics related to traditional OLS regression problems, generalized regression characteristics, JMP options, penalty methods in Elastic Net, AIC vs. BIC, GINI impurity in decision trees, and more. Test your knowledge and understanding of key concepts in data analysis and regression techniques.

  • Data Analysis
  • Regression
  • Quiz
  • OLS
  • Generalized Regression

Uploaded on Oct 04, 2024 | 0 Views


