Graduate Level Categorical Data Analysis Course

 
INTRODUCTION
 
Dear instructor of graduate level Categorical Data Analysis,
 
This powerpoint (ppt) presentation was created by me in order to teach a first-semester course in
Categorical Data Analysis (CDA) based on Alan Agresti’s textbook, 
Categorical Data Analysis
, 
Third
Edition, John Wiley & Sons, Inc., 2013 (ISBN:  978-0-470-46363-5, hardback).
 
This ppt presentation covers most of the sections of chapters 1 – 6, 9, and 10.  There is also a
separate ”chapter”, Ch. M (Multigraph), independent of Agresti’s textbook which covers the
multigraph representation of hierarchical loglinear models
.  Finally, Ch. S (Statistical Software)
briefly describes the use of SAS, SPSS, and R in analyzing categorical data.
 
There is some material in Agresti’s book which I have left out of my ppt slides, and there is some
material that I have included in the slides that does not appear in Agresti’s book.  Of course, one
may edit the ppt slides to suit one’s own preferences for the course.
 
Categorical Data Analysis
 
Categorical Data Analysis
 
INTRODUCTION, continued
 
What has worked well in my classes is to provide the student with the ppt slides electronically
ahead of the lecture.  Those who wish can print the slides before the lecture so that they can make
notes on the paper copy during the lecture.  One can print up to 8 slides on one side of a sheet of
paper.  Alternatively, students can follow the slides on their computer during lecture.
 
For some of the real-data examples I have provided personal stories about my experience with the
given data or example.  Students have found this very interesting.  I think it helps to give them the
sense that statistics is ”real” or ”palpable” in real world applications.
 
SAS is the primary statistical software that is presented and used throughout the ppts to carry out
computations.  An introduction to SAS is given in chapter 2.
 
Solutions to the chapter 
SUMMARY EXERCISES 
are provided in a WORD document.
 
There is a great deal of information about the book at:
 
www.stat.ufl.edu/~aa/cda/cda.html
 
At this website one can find, in addition to this ppt presentation, discussion of software, solutions
to selected exercises, SAS code, etc.
 
I hope that you will find this ppt presentation useful.  I welcome
feedback.  Best wishes for a high 
probability of success
.
 
 
H. J. Khamis, Ph.D., P.Stat., FASA
Director and Professor Emeritus, Statistical Consulting Center
Professor Emeritus, Boonshoft School of Medicine
Wright State Univeristy
Dayton, OH.
 
harry.khamis@wright.edu
 
3
 
Most of the references given in these ppt slides can be found in the
”References” list at the back of Agresti’s book.
 
When a reference is given in these slides that is not contained in
Agresti’s reference list, the full citation will be provided at the location
of the reference.
 
The references for chapter M
(Multigraph) are given at the
end of chapter M.
 
Now, let’s begin our journey.
 
4
 
CONTENTS
 
5
 
Chapter 0.5
 
INTRODUCTION, CONTENTS
 
Chapter 1
 
DISTRIBUTIONS
   1.1
 
Levels of measurement
   1.2
 
Distribution theory
   1.3
 
Statistical inference for categorical data
   1.4
 
Inference – binomial parameter
   1.5
 
Inference – multinomial parameters
 
CONCLUSION
  
 
SUMMARY EXERCISES
 
Chapter 2
 
DESCRIBING CONTINGENCY TABLES
   2.1
 
Two-way contingency table
   2.2
 
Two-way contingency table -- association
   2.3
 
Partial and marginal contingency tables
   2.4
 
I x J contingency tables
 
Introduction to SAS
 
CONCLUSION
  
 
SUMMARY EXERCISES
 
Chapter 3
 
INFERENCE FOR TWO-WAY TABLES
   3.1
 
Confidence interval estimation
   3.2
 
Testing independence in two-way contingency tables
   3.3
 
Following up chi-squared tests
   3.4
 
Ordinal variables
   3.5
 
Small-sample inference for contingency tables
 
CONCLUSION
  
 
SUMMARY EXERCISES
 
Chapter 4
 
GLM
   4.1
 
GLM introduction
   4.2
 
GLM for binary data
   4.3
 
GLM for counts and rates
 
CONCLUSION
  
 
SUMMARY EXERCISES
 
Chapter 5
 
LOGISTIC REGRESSION
   5.1
 
Logistic regression parameters
   5.2
 
Inference for logistic regression
 
CONCLUSION
  
 
SUMMARY EXERCISES
 
Chapter 6
 
LOGISTIC REGRESSION -- APPLICATION
   6.1
 
Model selection
   6.2
 
Model performance and diagnostics
   6.3
 
Predictive power
   6.4
 
Cochran-Mantel-Haenszel (CMH) test
 
CONCLUSION
  
 
SUMMARY EXERCISES
 
Chapter 9
 
LOGLINEAR MODEL
   9.0
 
Motivating example
   9.1
 
Loglinear model for two-way tables
   9.2
 
Loglinear model for three-way tables
   9.3
 
Inference for loglinear models
   9.4
 
Loglinear model – higher dimensions
   9.5
 
Loglinear model – logistic regression model connection
 
Motivating example -- revisit
 
CONCLUSION
  
 
SUMMARY EXERCISES
 
Chapter 10
 
LOGLINEAR MODEL -- APPLICATION
  10.0
 
Fixed margins
  10.1
 
Collapsibility
  10.2
 
Model selection
 
Motivating example -- revisit
 
CONCLUSION
  
 
SUMMARY EXERCISES
 
Chapter M
 
MULTIGRAPH
 
CIG review
 
Graphical models
 
 
Generator multigraph
 
Maximum spanning tree
 
Decomposability
 
Factorization
 
Fundamental conditional independencies
 
Collapsibility conditions
 
Real data examples
 
CONCLUSION
 
SUMMARY EXERCISES
 
CHAPTER M REFERENCES
 
Chapter S 
 
Statistical Software
 
Introduction
 
Two-way tables—SAS
 
 
Two-way tables – SPSS
 
 
Two-way tables – R
  
 
Multi-way tables – SAS
  
 
Multi-way tables – SPSS
 
Multi-way tables – R
Slide Note
Embed
Share

This PowerPoint presentation offers a comprehensive guide to teaching a first-semester course in Categorical Data Analysis at the graduate level. Covering key sections from Alan Agresti's textbook, the presentation includes real-data examples, software guidance, and solutions to exercises. The instructor shares personal insights to make statistics palpable, with SAS as the primary tool for computations. Additional resources and references enhance the learning experience.

  • Categorical Data Analysis
  • Graduate Level
  • Statistics
  • SAS
  • Alan Agresti

Uploaded on Jul 22, 2024 | 1 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. Download presentation by click this link. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

E N D

Presentation Transcript


  1. Categorical Data Analysis Categorical Data Analysis INTRODUCTION Dear instructor of graduate level Categorical Data Analysis, This powerpoint (ppt) presentation was created by me in order to teach a first-semester course in Categorical Data Analysis (CDA) based on Alan Agresti s textbook, Categorical Data Analysis, Third Edition, John Wiley & Sons, Inc., 2013 (ISBN: 978-0-470-46363-5, hardback). This ppt presentation covers most of the sections of chapters 1 6, 9, and 10. There is also a separate chapter , Ch. M (Multigraph), independent of Agresti s textbook which covers the multigraph representation of hierarchical loglinear models. Finally, Ch. S (Statistical Software) briefly describes the use of SAS, SPSS, and R in analyzing categorical data. There is some material in Agresti s book which I have left out of my ppt slides, and there is some material that I have included in the slides that does not appear in Agresti s book. Of course, one may edit the ppt slides to suit one s own preferences for the course.

  2. INTRODUCTION, continued What has worked well in my classes is to provide the student with the ppt slides electronically ahead of the lecture. Those who wish can print the slides before the lecture so that they can make notes on the paper copy during the lecture. One can print up to 8 slides on one side of a sheet of paper. Alternatively, students can follow the slides on their computer during lecture. For some of the real-data examples I have provided personal stories about my experience with the given data or example. Students have found this very interesting. I think it helps to give them the sense that statistics is real or palpable in real world applications. SAS is the primary statistical software that is presented and used throughout the ppts to carry out computations. An introduction to SAS is given in chapter 2. Solutions to the chapter SUMMARY EXERCISES are provided in a WORD document.

  3. There is a great deal of information about the book at: www.stat.ufl.edu/~aa/cda/cda.html At this website one can find, in addition to this ppt presentation, discussion of software, solutions to selected exercises, SAS code, etc. I hope that you will find this ppt presentation useful. I welcome feedback. Best wishes for a high probability of success. H. J. Khamis, Ph.D., P.Stat., FASA Director and Professor Emeritus, Statistical Consulting Center Professor Emeritus, Boonshoft School of Medicine Wright State Univeristy Dayton, OH. harry.khamis@wright.edu 3

  4. Most of the references given in these ppt slides can be found in the References list at the back of Agresti s book. When a reference is given in these slides that is not contained in Agresti s reference list, the full citation will be provided at the location of the reference. The references for chapter M (Multigraph) are given at the end of chapter M. Now, let s begin our journey. 4

  5. CONTENTS Chapter 0.5 INTRODUCTION, CONTENTS Chapter 5 5.1 5.2 LOGISTIC REGRESSION Logistic regression parameters Inference for logistic regression CONCLUSION SUMMARY EXERCISES Chapter M MULTIGRAPH CIG review Graphical models Generator multigraph Maximum spanning tree Decomposability Factorization Fundamental conditional independencies Collapsibility conditions Real data examples CONCLUSION SUMMARY EXERCISES CHAPTER M REFERENCES Chapter 1 1.1 1.2 1.3 1.4 1.5 DISTRIBUTIONS Levels of measurement Distribution theory Statistical inference for categorical data Inference binomial parameter Inference multinomial parameters CONCLUSION SUMMARY EXERCISES Chapter 6 6.1 6.2 6.3 6.4 LOGISTIC REGRESSION -- APPLICATION Model selection Model performance and diagnostics Predictive power Cochran-Mantel-Haenszel (CMH) test CONCLUSION SUMMARY EXERCISES Chapter 2 2.1 2.2 2.3 2.4 DESCRIBING CONTINGENCY TABLES Two-way contingency table Two-way contingency table -- association Partial and marginal contingency tables I x J contingency tables Introduction to SAS CONCLUSION SUMMARY EXERCISES Chapter 9 9.0 9.1 9.2 9.3 9.4 9.5 LOGLINEAR MODEL Motivating example Loglinear model for two-way tables Loglinear model for three-way tables Inference for loglinear models Loglinear model higher dimensions Loglinear model logistic regression model connection Motivating example -- revisit CONCLUSION SUMMARY EXERCISES Chapter S Statistical Software Introduction Two-way tables SAS Two-way tables SPSS Two-way tables R Multi-way tables SAS Multi-way tables SPSS Multi-way tables R Chapter 3 3.1 3.2 3.3 3.4 3.5 INFERENCE FOR TWO-WAY TABLES Confidence interval estimation Testing independence in two-way contingency tables Following up chi-squared tests Ordinal variables Small-sample inference for contingency tables CONCLUSION SUMMARY EXERCISES Chapter 10 10.0 10.1 10.2 LOGLINEAR MODEL -- APPLICATION Fixed margins Collapsibility Model selection Motivating example -- revisit CONCLUSION SUMMARY EXERCISES Chapter 4 4.1 4.2 4.3 GLM GLM introduction GLM for binary data GLM for counts and rates CONCLUSION SUMMARY EXERCISES 5

Related


More Related Content

giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#