Stratification
Concept of stratified sampling and its importance in ensuring representative samples in population studies. Learn how stratification helps control sample composition and reduce sampling errors. Discover the reasons for implementing stratification and its application in nationwide household surveys.
Download Presentation

Please find below an Image/Link to download the presentation.
The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.
You are allowed to download the files provided on this website for personal or commercial use, subject to the condition that they are used lawfully. All files are the property of their respective owners.
The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.
E N D
Presentation Transcript
Stratification February 24, 2020 Nadi, Fiji
Stratified sampling Technique of organizing sample frame into sub-groupings strata and to select separate samples in the strata to ensure sample selection is spread across important population sub-groups
We select a simple random sample from our population. Let s say that we happen to get 55 people from island 1 and 60 and 5 from island 2 and 3. (True P = 0,56) From our sample of 120 we estimate that 76 people will vote for Mr Green n P (%) n*P 0,5 0,8 0,1 0,5 55 60 5 120 28 48 Estimated P=76/120= 0.63 76 The situation is simplified for pedagogical reasons.
Lets say that instead of 55,60 and 5 people from the three islands in our SRS sample we got 75, 25 and 20 people from the islands From our sample of 120 we estimate that 60 people will vote for Mr Green n P (%) n*P 0,5 0,8 0,1 75 25 20 120 38 20 2 60 Estimated P=60/120= 0.50
Now we have stratified the population into three strata (islands) and decided to allocate the sample of 12 households in proportion to the number of people in each stratum. The largest island will get the largest sample. From our sample of 120 we estimate that 67 people will vote for Mr Green n P (%) n*P 0,5 0,8 0,1 1,3 67 40 13 120 34 32 Estimated P=67/120= 0.55 67 We get an estimated P that is closer to the true P in the population.
Reasons for stratification Reasons for stratification To potentially reduce sampling error by gaining greater control over the composition of the sample. To ensure that particular groups within a population are adequately represented in the sample. Reduce the chance of being unlucky and getting too few sample units selected from a sub-group that is important for the analysis.
In nationwide household surveys the primary strata are always geographical or administrative areas (province, region, district) The primary strata are often divided into urban and rural areas
Stratification Establishment survey Stratification of establishments by economic activity and employment size National household survey Geographic domains regions, provinces Urban/rural Socio-economic groups Agricultural survey Agro-ecological zones Land use Farm size
Stratification in the Pacific Example 1 : Samoa has 4 levels of stratification Apia Urban Area Rest of Upolu North West Upolu Savaii
Stratification in the Pacific Example 2 : Vanuatu has 8 levels of stratification 1. Torba 2. Sanma-urban (Luganville) 3. Sanma-rural 4. Penama 5. Malampa 6. Shefa-urban (Port Vila) 7. Shefa-rural 8. Tafea
Stratification Common examples of sample allocation among the strata: Proportional allocation Equal allocation Square root allocation Optimum allocation Practical allocation
Proportional allocation The sample allocated to each stratum is proportionally to the number of units in the frame for the stratum: N N h = n n h Simplest form of sample allocation. Provides self-weighting sample. Efficient sample design for national-level results when variability is similar for the different strata.
Equal allocation Each stratum is allocated an equal number of sample units: ? ? ? = H = Number of Strata Used when same level of precision is required for each stratum. Example: estimates of similar quality required for each region.
Neyman allocation Provides minimum total error and minimum cost for a fixed sample size: ? ? ? ? = ? =1 ? ? sh = estimated standard deviation in stratum h
Neyman allocation for P ? = ? ? ? (1 ? ) ? ? (1 ? )
Stratification var ( ) var ( ) var ( ) var ( ) y y y y eq srs prop opt usually always
Practical allocation criteria For national household surveys, sometimes allocation is a compromise between proportional, equal and Neyman allocation; e.g. we start with a proportional allocation and then we increase the sample size in the smaller regions. In countries with high proportion of rural population, sometimes a higher sampling rate is used for the urban stratum, to increase the urban sample size and because of the lower cost of data collection in urban areas.
Implicit Stratification Sort the sampling frame within each explicit stratum by lower-level geographic areas, such as districts, municipalities, wards and EA codes, then select a systematic sample within each stratum. Ensures a representative sample for each subgroup Automatically provides proportional allocation by size of subgroup
Second Stage Stratification Sometimes it is desirable to stratify the sample in the last stage (household or individual level). Examples: male/female headed households, program beneficiaries, households with orphans and vulnerable children (OVCs). Beware of the dangers. Second stage stratification increases the need for close supervision of field teams.
Weighting under stratified sample designs A proportionally allocated sample is self-weighted. In non-proportionally allocated samples, we must use weights to account for different sampling fractions by stratum.
Exercise #2 Given the information below, what stratification strategy would you recommend to the government for a total sample size of 800? Calculate the sample size for each strata under proportional, equal, and optimal allocation to inform your answer. Stratum 1 Capital City ?1= 7500 ?1= 1000 ?1= 200 Stratum 2 Mountain ?2= 2500 ?2= 200 ?2= 25 ? = 10,000 ?1= ?2
Proportional Allocation ? = ? ? ? ?1= ? ?1 ? = 800 7500 1000= 600 ?2= ? ?2 ? = 800 2500 1000= 200
Equal Allocation ? =? ? ? ?=800 2 = 400 ?1= ?2=
Neyman Allocation ? ? ? = ? ? =1 ? ? ?1?1 (7500)(200) ?1= ? = 800 (7500)(200) + (2500)(25)= 768 ?1?1+ ?2?2 ?2?2 (2500)(25) ?2= ? = 800 (7500)(200) + (2500)(25)= 32 ?1?1+ ?2?2
What do you recommend? It depends Proportional allocation Simplest form of sample allocation. Provides self-weighting sample. Efficient sample design for national- level results when variability is similar for the different strata. Equal allocation Easy to explain. Generates same level of precision is required for each stratum. Optimal allocation Generates the most efficient. allocation at the national level for one variable. People really like the name. Practical allocation Most common sample design for complex, multi-indicator sample surveys.