Chi-Square Tests in Statistics

C
h
i
-
S
q
u
a
r
e
 
T
e
s
t
s
 
Categorical data
 
1-sample, compared to theoretical distribution
Goodness-of-Fit Test
 
2+ samples, 2+ levels of response variable
Chi-square Test
Chi-Square Tests
Slide #1
Chi-Square Tests
Slide #2
G
o
o
d
n
e
s
s
-
o
f
-
F
i
t
 
T
e
s
t
 
Compare observed to theoretical frequencies of
individuals in categories.
 
Examples –
Test whether responses are “random” (e.g., preference)
Test Mendelian genetics (e.g., 3:1 and 9:3:3:1 theories).
Test use of available resources (e.g., compare habitat
usage to availability).
Chi-Square Tests
Slide #3
A
n
 
I
l
l
u
s
t
r
a
t
i
v
e
 
E
x
a
m
p
l
e
 
Determine, at the 10% level, if Northland students
prefer the Chris Duarte Group (CDG), Ronnie
Baker Brooks (RBB), or Bernard Allison (BA).
 
Hypotheses?
H
a
: “different # of students prefer each artist”
H
o
: “same # of students prefer each artist”
Chi-Square Tests
Slide #4
 
Under H
o
, what proportion prefer each artist?
 
If n=78, how many students prefer each artist
if H
o 
is true?
 
1/3
 
26
A
n
 
I
l
l
u
s
t
r
a
t
i
v
e
 
E
x
a
m
p
l
e
Chi-Square Tests
Slide #5
Suppose these results were obtained:
 
Is there a preference – i.e., are these observations
significantly different from what was expected
when assuming no preference?
A
n
 
I
l
l
u
s
t
r
a
t
i
v
e
 
E
x
a
m
p
l
e
Chi-Square Tests
Slide #6
A
 
N
e
w
 
T
e
s
t
 
S
t
a
t
i
s
t
i
c
 
df = cells - 1
Chi-Square Tests
Slide #7
 
2
 =
 
2
 =  0.15 + 5.54  + 3.85  = 9.54
 
df
 = (3-1) = 
2
 
p-value 
= 0.00848
 
Conclusion?
A
n
 
I
l
l
u
s
t
r
a
t
i
v
e
 
E
x
a
m
p
l
e
Chi-Square Tests
Slide #8
G
o
o
d
n
e
s
s
-
o
f
-
F
i
t
 
T
e
s
t
 
H
o
: 
distribution of individuals into levels follows
the theoretical distribution
H
A
: 
distribution of individuals into levels does
NOT follow the theoretical distribution
Sample:
 randomized, single variable of size n
Assume: 
at least 5 in each cell of 
expected table
Statistic: 
Observed frequency table
Chi-Square Tests
Slide #9
G
o
o
d
n
e
s
s
-
o
f
-
F
i
t
 
T
e
s
t
 
Test Statistic:
 
df: 
cells-1
Confidence Region:
 
 
E
x
a
m
i
n
e
 
H
O
 
 
P
a
g
e
 
5
 
Chi-Square Tests
 
Slide #10
 
R - Chi-Square
 
Slide #11
 
A particular type of corn is known to have one of four
types of kernels: purple-smooth, purple-wrinkled, yellow-
smooth, and yellow-wrinkled. The cross between
heterozygous individuals
1
 should produce a 9:3:3:1 ratio
(in same order of types).  Of the kernels on a random cob
  
32 were purple-smooth
  
14 were purple-wrinkled
  
  8 were yellow-smooth
  
  4 were yellow-wrinkled
 
Use the results to determine, at the 5% level, if the
theoretical 9:3:3:1 ratio is upheld with these data.
 
E
x
a
m
p
l
e
 
D
a
t
a
 
 
C
o
r
n
 
G
e
n
e
t
i
c
s
 
1
 i.e., PpSs where t
he purple (P) and smooth (S) alleles are dominant.
 
Chi-Square Tests
 
Slide #12
 
A
 
F
u
l
l
 
E
x
a
m
p
l
e
 
The leader of a local lakes association conducted a
survey of all members of the association. One question
on the survey was “What is your preferred method of
receiving notices from the lakes association: by regular
mail, by e-mail, by phone, by poster (at the local boat
landing), or other?”  Of the surveys returned, 47
respondents preferred regular mail, 63 e-mail, 17
phone, 73 by poster, and 8 some other method. OF THE
RESPONDENTS THAT DID NOT PREFER SOME
OTHER METHOD, is there evidence, at the 5% level,
of a difference in the preferred method of contact?
 
Chi-Square Tests
 
Slide #13
 
A
 
F
u
l
l
 
E
x
a
m
p
l
e
 
In a randomly selected national sample of 1,007
adults, aged 18 and older, conducted Aug. 22-25,
2005, Gallup polls found that that 403
respondents approved of the way that George W.
Bush was handling his presidency. In a previous
sample (Aug. 8-11, 2005), 45% of the
respondents approved of George W. Bush’s
handling of the presidency.  Assuming that this
earlier value was true for the entire population,
determine, at the 5% level, if the approval rating
has changed by the Aug. 22-25, 2005 sample.
Slide Note
Embed
Share

Explore the concept of Chi-Square tests through an illustrative example of testing preferences among artists. Learn about Goodness-of-Fit tests, hypotheses, expected versus observed frequencies, new test statistics, and interpreting results through p-values. Discover how these tests compare observed data to theoretical distributions in categorical data analysis.

  • Chi-Square Tests
  • Goodness-of-Fit
  • Hypotheses
  • Expected Frequencies
  • P-Values

Uploaded on Sep 12, 2024 | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. Download presentation by click this link. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

E N D

Presentation Transcript


  1. Chi-Square Tests Categorical data 1-sample, compared to theoretical distribution Goodness-of-Fit Test 2+ samples, 2+ levels of response variable Chi-square Test Slide #1 Chi-Square Tests

  2. Goodness-of-Fit Test Compare observed to theoretical frequencies of individuals in categories. Examples Test whether responses are random (e.g., preference) Test Mendelian genetics (e.g., 3:1 and 9:3:3:1 theories). Test use of available resources (e.g., compare habitat usage to availability). Slide #2 Chi-Square Tests

  3. An Illustrative Example Determine, at the 10% level, if Northland students prefer the Chris Duarte Group (CDG), Ronnie Baker Brooks (RBB), or Bernard Allison (BA). Hypotheses? Ha: different # of students prefer each artist Ho: same # of students prefer each artist Slide #3 Chi-Square Tests

  4. An Illustrative Example 1/3 Under Ho, what proportion prefer each artist? If n=78, how many students prefer each artist if Ho is true? 26 Artist CDG RBB BA Expected Table Freq 26 26 26 Slide #4 Chi-Square Tests

  5. An Illustrative Example Suppose these results were obtained: Artist CDG RBB BA Observed Table Freq 24 38 16 Is there a preference i.e., are these observations significantly different from what was expected when assuming no preference? Slide #5 Chi-Square Tests

  6. A New Test Statistic ( ) 2 ected observed exp ected table = 2 exp df = cells - 1 Slide #6 Chi-Square Tests

  7. An Illustrative Example Artist CDG RBB BA Observed Table # 24 38 16 Artist CDG RBB BA Expected Table # 26 26 26 ( ) ( )+ ( )+ 2 2 2 26 26 16 26 24 26 38 26 2 = 26 2 = 0.15 + 5.54 + 3.85 = 9.54 df = (3-1) = 2 p-value = 0.00848 Conclusion? Slide #7 Chi-Square Tests

  8. Goodness-of-Fit Test Ho: distribution of individuals into levels follows the theoretical distribution HA: distribution of individuals into levels does NOT follow the theoretical distribution Sample: randomized, single variable of size n Assume: at least 5 in each cell of expected table Statistic: Observed frequency table Slide #8 Chi-Square Tests

  9. Goodness-of-Fit Test ( ) 2 ected observed exp ected table Test Statistic: = 2 exp df: cells-1 ( ) p p 1 *z p Confidence Region: n where is sample proportion in level of interest p Slide #9 Chi-Square Tests

  10. Examine HO Page 5 Slide #10 Chi-Square Tests

More Related Content

giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#