Between February and March 2016, the World Bank, in collaboration with Somali statistical authorities conducted the first wave of the Somali High Frequency Survey to monitor welfare and perceptions of citizens in all accessible areas of 9 regions within Somalia’s pre-war borders including Somaliland which self-declared independence in 1991. The survey interviewed 2,882 urban households, 822 rural and 413 households in Internally Displaced People (IDP) settlements. The sample was drawn randomly based on a multi-level clustered design. This dataset contains information on economic conditions, education, employment, access to services, security and perceptions. It also includes comprehensive information on assets and consumption, to allow estimation of poverty based on the Rapid Consumption methodology as detailed in Pape and Mistiaen (2014).
Kind of data
Sample survey data [ssd]
v03 - This version includes revised datasets Input_hh and Output_hh.
The following pre-war regions: Awdal, Banadir, Bari, Mudug, Nugaal, Sanaag, Sool, Togdheer and Woqooyi Galbeed (Somaliland self-declared independence in 1991).
Unit of analysis
Producers and sponsors
Utz J. Pape
The sample employs a stratified two-staged clustered design with the Primary Sampling Unit (PSU) being the enumeration area. Within each enumeration area, 12 households were selected for interviews.
Two different listing approaches were used. In 2 strata with more volatile security as well as for IDP camps, a multi-stage cluster design was employed (micro-listing). Each selected enumeration area was divided into multiple segments and each segment was further divided into blocks. Within each enumeration area, one segment was randomly selected and within the segment 12 blocks were chosen. In each block, all structures were listed before selecting randomly one structure. Within the selected structure, all households were listed and one household randomly selected for interview. In strata less volatile (14 strata), the complete enumeration area was listed before 12 households were randomly selected for interviews (full-listing).
Deviations from sample design
EAs were replaced if security rendered field work unfeasible. Replacements were approved by the project manager. Replacement of households were approved by the supervisor after a total of three unsuccessful visits of the household.
The sampling weight is the inverse probability of selection.
For strata with full-listing, the selection probability for a household can be decomposed into the selection probability of the EA and the selection probability of the household within the EA. For strata with a micro-listing, the selection probability for a household can be decomposed into the selection probability of the EA, the selection probability of the block and selection probability of the household within the block
Sampling weights were then scaled to equal the number of households per analytical strata using the data from the Population Estimation Survey of Somalia (PESS) 2014. More information can be found in the Technical Appendix.
Before being granted access to the dataset, all users have to formally agree:
1. To make no copies of any files or portions of files to which s/he is granted access except those authorized by the data depositor.
2. Not to use any technique in an attempt to learn the identity of any person, establishment, or sampling unit not identified on public use data files.
3. To hold in strictest confidence the identification of any establishment or individual that may be inadvertently revealed in any documents or discussion, or analysis. Such inadvertent identification revealed in her/his analysis will be immediately brought to the attention of the data depositor.
The dataset has been anonymized and is available as a Public Use Dataset. It is accessible to all for statistical and research purposes only, under the following terms and conditions:
1. The data and other materials will not be redistributed or sold to other individuals, institutions, or organizations without the written agreement of the World Bank Microdata Library.
2. The data will be used for statistical and scientific research purposes only. They will be used solely for reporting of aggregated information, and not for investigation of specific individuals or organizations.
3. No attempt will be made to re-identify respondents, and no use will be made of the identity of any person or establishment discovered inadvertently. Any such discovery would immediately be reported to the World Bank Microdata Library.
4. No attempt will be made to produce links among datasets provided by the World Bank Microdata Library, or among data from the World Bank Microdata Library and other datasets that could identify individuals or organizations.
5. Any books, articles, conference papers, theses, dissertations, reports, or other publications that employ data obtained from the World Bank Microdata Library will cite the source of data in accordance with the Citation Requirement provided with each dataset.
Use of the dataset must be acknowledged using a citation which would include:
- the Identification of the Primary Investigator
- the title of the survey (including country, acronym and year of implementation)
- the survey reference number
- the source and date of download
Utz J. Pape, World Bank. Somali High Frequency Survey, Wave 1 (SHFS-W1) 2016, Ref. SOM_2016_SHFS-W1_v03_M. Dataset downloaded from [url] on [date].
Use of the dataset must be acknowledged using a citation which would include: - the Identification of the Primary Investigator - the title of the survey (including country, acronym and year of implementation) - the survey reference number - the source and date of download
Disclaimer and copyrights
The user of the data acknowledges that the original collector of the data, the authorized distributor of the data, and the relevant funding agency bear no responsibility for use of the data or for interpretations or inferences based upon such uses.
Utz J. Pape
Development Data Group
The World Bank
Documentation of the DDI
Version 03 (March 2019)
This version includes revised datasets Input_hh and Output_hh.The rest of the survey metadata remains the same