Gaining Power in Statistics for HCI

Understanding Statistics for
HCI and Related Disciplines
Part 3 – 
Gaining Power
the dreaded ‘too few participants’
Alan Dix
http://alandix.com/statistics/
 
don
t you hate it when …
 
statistical power – what is it?
 
 
if there is a real effect
how likely are you to be able to detect it?
 
avoiding false negatives
 
increasing power
 
standard approach …
     
add more participants
 
but not the only way!
 
can get more power …
  
but often sacrifice a little generality
  
need to understand and explain
 
with great power comes great responsibility ;-)
 
noise–effect–number
triangle
 
three routes to gain power
 
recall Stats 101   
(for simple data)
 
σ  –  standard deviation of data
often ‘noise’ – things you can
t control or measure
e.g. individual variability
 
s.e.  – standard error of mean (s.e.)
the accuracy of your estimate (error bars)
 
s.e.  =  σ / √n                   
(if σ is an estimate  
n-1 )
 
to half standard error you must  quadruple number.
 
effect size
 
how big a difference do you want to detect?
  
call it δ
 
the accuracy (s.e.) needs to be better than δ
 
                  
δ   >>  σ / √n
 
effect size
 
noise
 
number of
participants
 
noise–effect–number triangle
 
to gain power
address any of these
 
not just more subjects!
 
 
general strategies
 
increase number
the standard approach … but …
square root often means very large increases
reduce noise
control conditions     
(physics approach)
measure other factors and fit  
(e.g. age, experience)
increase effect size
manipulate sensitivity  
(e.g. photo back of crowd!)
 
subjects
 
control or manipulate who
 
more subjects or trials
 
more subjects
average out 
between
 subject differences
more trials
average out 
within
 subject variation
  
e.g. Fitts’ Law experiments
 
… but both
 both may need lots
e.g.  to reduce noise by 10, need 100 times more
 
 
within subject/group
 
cancels out between subject variation
 
helpful if effect reasonably consistent
but between subject variability high
 
may cause problems with order effects, learning
 
 
condition A
 
condition B
 
Subjcet #     1      2      3      4      5      6      7     8      9    10
 
narrow/matched users
 
aims to reduce between subject variation
 
choose subjects who are very similar to each
other
 or in some way have matched
  
e.g. balance gender, skills
 
allows between subject experiments
how do you know what is important to match?
 
targeted user group
 
aims to increase the effect size
 
choose group of users who are likely to be
especially affected by the intervention
   
e.g. novices or older users
 
but … generalisation to other users will be
by theoretical argument not empirical data
 
tasks
 
control or manipulate what
 
distractor tasks
 
aim to saturate user’s cognitive resources
so make them more sensitive to intervention
  
e.g. count backwards while performing task
 
helpful when coping mechanisms mask effects
overload
leads to
errors
add
distractor
 
t
argeted tasks
 
design a task that will expose effect of
intervention
e.g. trouble with buttons paper (expert slip)
 
… but … care again with generalisation!
 
example: trouble with buttons
 
error-free
behaviour
 
release mouse
over button
 
press mouse
over button
 
expert
slip
 
move off
button first
 
… and then
release
 
trouble with buttons (2)
 
novices:
work more slowly – less likely to make slip
notice lack of semantic feedback – so they recover
experts:
act quickly – so make more slips
focused on next action, so miss feedback
 
problem:
experts slips don
t happen often … never in experiments
needed to craft task to engineer expert slips
 
demonic interventions!
 
extreme version create deliberately nasty task!
e.g. 'natural inverse' steering task
  
Fitts’ Law-ish experiment
  
added artificial errors to cause overshoots
 
… but … again generalisation
… and … subjects may hate you!
 
reduced vs. wild
 
in the wild has lots of extraneous effects
  
= noise!
control environment  => lab or semi-wild
reduced task
e.g. scripted use in wild environment
reduced system
e.g. mobile tourist app with less options
Slide Note
Embed
Share

In this insightful read, Alan Dix dives into the crucial aspect of gaining statistical power in the field of Human-Computer Interaction (HCI). Explore ways to enhance research outcomes by understanding statistical power, avoiding false negatives, and the impact of participant numbers. Discover strategies to increase power without compromising generality and learn about the significance of effect size in detecting differences. Navigate through statistical nuances to optimize research effectiveness.

  • Statistics
  • HCI
  • Research
  • Power
  • Effect Size

Uploaded on Feb 25, 2025 | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. Download presentation by click this link. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

E N D

Presentation Transcript


  1. Understanding Statistics for HCI and Related Disciplines Part 3 Gaining Power the dreaded too few participants Alan Dix http://alandix.com/statistics/ Understanding Statistics for HCI and Related Disciplines Alan Dix

  2. dont you hate it when Understanding Statistics for HCI and Related Disciplines Alan Dix

  3. statistical power what is it? if there is a real effect how likely are you to be able to detect it? avoiding false negatives Understanding Statistics for HCI and Related Disciplines Alan Dix

  4. increasing power standard approach add more participants but not the only way! can get more power but often sacrifice a little generality need to understand and explain with great power comes great responsibility ;-) Understanding Statistics for HCI and Related Disciplines Alan Dix

  5. Understanding Statistics for HCI and Related Disciplines Alan Dix

  6. noiseeffectnumber triangle three routes to gain power Understanding Statistics for HCI and Related Disciplines Alan Dix

  7. recall Stats 101 (for simple data) standard deviation of data often noise things you can t control or measure e.g. individual variability s.e. standard error of mean (s.e.) the accuracy of your estimate (error bars) s.e. = / n (if is an estimate n-1 ) to half standard error you must quadruple number. Understanding Statistics for HCI and Related Disciplines Alan Dix

  8. effect size how big a difference do you want to detect? call it the accuracy (s.e.) needs to be better than >> / n Understanding Statistics for HCI and Related Disciplines Alan Dix

  9. number of participants >> / n effect size noise Understanding Statistics for HCI and Related Disciplines Alan Dix

  10. noiseeffectnumber triangle to gain power address any of these not just more subjects! Understanding Statistics for HCI and Related Disciplines Alan Dix

  11. general strategies increase number the standard approach but square root often means very large increases reduce noise control conditions (physics approach) measure other factors and fit (e.g. age, experience) increase effect size manipulate sensitivity (e.g. photo back of crowd!) Understanding Statistics for HCI and Related Disciplines Alan Dix

  12. Understanding Statistics for HCI and Related Disciplines Alan Dix

  13. subjects control or manipulate who Understanding Statistics for HCI and Related Disciplines Alan Dix

  14. more subjects or trials more subjects average out between subject differences more trials average out within subject variation e.g. Fitts Law experiments but both both may need lots e.g. to reduce noise by 10, need 100 times more Understanding Statistics for HCI and Related Disciplines Alan Dix

  15. within subject/group cancels out between subject variation helpful if effect reasonably consistent but between subject variability high may cause problems with order effects, learning condition A condition B Subjcet # 1 2 3 4 5 6 7 8 9 10 Understanding Statistics for HCI and Related Disciplines Alan Dix

  16. narrow/matched users aims to reduce between subject variation choose subjects who are very similar to each other or in some way have matched e.g. balance gender, skills allows between subject experiments how do you know what is important to match? Understanding Statistics for HCI and Related Disciplines Alan Dix

  17. targeted user group aims to increase the effect size choose group of users who are likely to be especially affected by the intervention e.g. novices or older users but generalisation to other users will be by theoretical argument not empirical data Understanding Statistics for HCI and Related Disciplines Alan Dix

  18. Understanding Statistics for HCI and Related Disciplines Alan Dix

  19. tasks control or manipulate what Understanding Statistics for HCI and Related Disciplines Alan Dix

  20. distractor tasks aim to saturate user s cognitive resources so make them more sensitive to intervention e.g. count backwards while performing task helpful when coping mechanisms mask effects overload leads to errors mental load add distractor capacity condition A B condition A B Understanding Statistics for HCI and Related Disciplines Alan Dix

  21. targeted tasks design a task that will expose effect of intervention e.g. trouble with buttons paper (expert slip) but care again with generalisation! Understanding Statistics for HCI and Related Disciplines Alan Dix

  22. example: trouble with buttons error-free behaviour release mouse over button press mouse over button expert slip move off button first and then release Understanding Statistics for HCI and Related Disciplines Alan Dix

  23. trouble with buttons (2) novices: work more slowly less likely to make slip notice lack of semantic feedback so they recover experts: act quickly so make more slips focused on next action, so miss feedback problem: experts slips don t happen often never in experiments needed to craft task to engineer expert slips Understanding Statistics for HCI and Related Disciplines Alan Dix

  24. demonic interventions! extreme version create deliberately nasty task! e.g. 'natural inverse' steering task Fitts Law-ish experiment added artificial errors to cause overshoots but again generalisation and subjects may hate you! Understanding Statistics for HCI and Related Disciplines Alan Dix

  25. reduced vs. wild in the wild has lots of extraneous effects = noise! control environment => lab or semi-wild reduced task e.g. scripted use in wild environment reduced system e.g. mobile tourist app with less options Understanding Statistics for HCI and Related Disciplines Alan Dix

  26. Understanding Statistics for HCI and Related Disciplines Alan Dix

Related


More Related Content

giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#