In December 2017, the World Bank, in collaboration with Somali statistical authorities conducted the second wave of the Somali High Frequency Survey to monitor welfare and perceptions of citizens in all accessible areas of 17 regions within Somalia’s pre-war borders including Somaliland which self-declared independence in 1991. The survey interviewed 4,011 urban households, 1,106 rural households, 468 households in Internally Displaced People (IDP) settlements and 507 nomadic households. The sample was drawn randomly based on a multi-level clustered design. This dataset contains information on economic conditions, education, employment, access to services, security, perceptions and details before displacement for displaced households. It also includes comprehensive information on assets and consumption, to allow estimation of poverty based on the Rapid Consumption methodology as detailed in Pape and Mistiaen (2014).
Kind of data
Sample survey data [ssd]
The following pre-war regions: Awdal, Bakool, Banadir, Bari, Bay, Galgaduug, Gedo, Hiran, Lower Juba, Mudug, Nugaal, Sanaag, Middle and lower Shabelle, Sool, Togdheer and Woqooyi Galbeed (Somaliland self-declared independence in 1991).
Unit of analysis
Producers and sponsors
Utz J. Pape
IBRD - World Bank
Abdirahman Omar Dahir
Director General; Directorate of National Statistics; Ministry of Planning, Investment and Economic Development; Federal Government of Somalia
Somalia Knowledge for Results from the Multi-Partner Fund for Somalia
Environment and Natural Resources Global Practice
Wave 2 of the SHFS employed a multi-stage stratified random sample, ensuring a sample representative of all subpopulations of interest. Strata were defined along two dimensions - administrative location (pre-war regions and emerging states) and population type (urban areas, rural settlements, IDP settlements, and nomadic population). Households were clustered into enumeration areas (EAs), with 12 interviews was expected for each selected EA. Primary sampling units (PSUs) were generated using a variety of techniques depending on the population type. The primary sampling unit (PSU) in urban as well as rural strata was the enumeration area (EA). For IDP strata, primary sampling units were IDP settlements as defined by UNCHR’s Shelter Cluster. Across all strata, PSUs were selected using a systematic random sampling approach with selection probability proportional to size (PPS). In IDP strata, PPS sampling is applied at the IDP settlement level. In second- and final-stage sample selection, a microlisting approach was used, such that EAs were divided into 12 smaller enumeration blocks, which were selected with equal probability. Every block was selected as 12 interviews per EA were required. A similar second-stage sampling strategy was employed for IDP strata. Each IDP settlement was segmented manually into enumeration blocks. Finally, one household per block was interviewed in all selected blocks within the enumeration area.The household was selected randomly with equal probability in two stages, following the micro-listing protocol. The strategy for sampling nomadic households relied on lists of water points. The list of water points was divided up by stratum at the federated member state level and they served as primary sampling units. Water points were selected in the first stage with equal probability, with 12 interviews to be conducted at each selected water point. The selection of nomadic households to interview relied on a listing process at each water point whose aim was to compile an exhaustive list of all nomadic households at the water point. For more details, see accompanying documents, available under the related materials tab.
Deviations from sample design
EAs were replaced if security rendered field work unfeasible. Replacements were approved by the project manager. Replacement of households were approved by the supervisor after a total of three unsuccessful visits of the household.
Based on the sampling design, sampling weights were calculated after the completion of data collection. The sampling weight is the inverse probability of selection. In general, the selection probability for a household can be decomposed into the selection probability of the EA, the selection probability of the block and selection probability of the household within the block. Sampling weights were then scaled to equal the number of households per analytical strata using the data from the Population Estimation Survey of Somalia (PESS) 2014. For more details see accompanying documents, available under the related materials tab.
Dates of collection
Mode of data collection
Computer Assisted Personal Interview [capi]
The household questionnaire is in English. It includes the following modules:
- Module A: Administrative Information
- Module B: Interview Information and Filters
- Module C: Household Roster
- Module D: Household Characteristics
- Module E: Food Consumption
- Module F: Non-Food Consumption
- Module G: Livestock
- Module H: Durable Goods
- Module I: Perceptions and Social Services
- Module J: Displacement
- Module K: Fishing
- Module L: Catastrophic Events and Disasters
- Module M: Enumerator Conclusions
- Appendix A - Enabling Conditions
- Appendix B - Validation Conditions and Messages
- Appendix C - Instructions
- Appendix D - Options
- Appendix E - Variables
- Appendix F - Option Filters
The household questionnaire is provided under the Related Materials tab.
Before being granted access to the dataset, all users have to formally agree:
1. To make no copies of any files or portions of files to which s/he is granted access except those authorized by the data depositor.
2. Not to use any technique in an attempt to learn the identity of any person, establishment, or sampling unit not identified on public use data files.
3. To hold in strictest confidence the identification of any establishment or individual that may be inadvertently revealed in any documents or discussion, or analysis. Such inadvertent identification revealed in her/his analysis will be immediately brought to the attention of the data depositor.
The dataset has been anonymized and is available as a Public Use Dataset. It is accessible to all for statistical and research purposes only, under the following terms and conditions:
1. The data and other materials will not be redistributed or sold to other individuals, institutions, or organizations without the written agreement of the World Bank Microdata Library.
2. The data will be used for statistical and scientific research purposes only. They will be used solely for reporting of aggregated information, and not for investigation of specific individuals or organizations.
3. No attempt will be made to re-identify respondents, and no use will be made of the identity of any person or establishment discovered inadvertently. Any such discovery would immediately be reported to the World Bank Microdata Library.
4. No attempt will be made to produce links among datasets provided by the World Bank Microdata Library, or among data from the World Bank Microdata Library and other datasets that could identify individuals or organizations.
5. Any books, articles, conference papers, theses, dissertations, reports, or other publications that employ data obtained from the World Bank Microdata Library will cite the source of data in accordance with the Citation Requirement provided with each dataset.
Use of the dataset must be acknowledged using a citation which would include:
- the Identification of the Primary Investigator
- the title of the survey (including country, acronym and year of implementation)
- the survey reference number
- the source and date of download
Utz J. Pape, World Bank. Somali High Frequency Survey, Wave 2 (SHFS-W2) December 2017, Ref. SOM_2017_SHFS-W2_v01_M. Dataset downloaded from [url] on [date].
Disclaimer and copyrights
The user of the data acknowledges that the original collector of the data, the authorized distributor of the data, and the relevant funding agency bear no responsibility for use of the data or for interpretations or inferences based upon such uses.