Prior to the High Frequency South Sudan Survey Wave 1 (2015), a High Frequency Panel Survey was conducted between August 2012 and September 2014, which scaled up an existing pilot project to a nationwide survey. The High Frequency South Sudan Survey Wave 1 (2015) is followed by Wave 2 conducted in 2016.
Between February and September 2015, World Bank in collaboration with South Sudan’s National Bureau of Statistics, funded by DfID, conducted the first wave of the High Frequency South Sudan Survey to monitor welfare and perceptions of citizens in all accessible areas of South Sudan.
This dataset contains information on security, economic conditions, education, employment, access to services, and perceptions. It also includes comprehensive information on assets and consumption, to allow estimation of poverty based on the Rapid Consumption methodology as detailed in Pape and Mistiaen (2014).
Kind of Data
Sample survey data [ssd]
Unit of Analysis
v01, edited, anonymous datasets for public distribution
3/20/2020 - Input data files and replication programs removed form dissemination package.
The scope of the survey includes:
- Household : Household listing, household characteristics: household member information (relation to head, migration, education, occupation, employment status, earnings) household head module (marital status, migration,tribe, religion, housing type and tenure, amenities, land ownership and use, proximity to school and market, hunger, livelihood)
- Food consumption - quantity, price and source
- Non-food consumption - quantity, price and source
- Durable goods
- Physical, Psychological and Social Well-Being
Six of South Sudan’s ten former states: Western Equatoria, Central Equatoria, Eastern Equatoria, Northern Bahr El Ghazal, Western Bahr El Ghazal, and Lakes state.
The survey covered urban and rural households
Producers and sponsors
Utz J. Pape
The World Bank
South Sudan's National Bureau of Statistics
UK Department for International Development
The sample design employs a stratified two-stage clustered design. Within each of the 12 strata (6 states, urban and rural), the primary sampling units are enumeration areas (EAs) that were drawn randomly proportional to size. Within EAs, a listing was conducted and 12 household were drawn randomly as unit of observation.
Deviations from the Sample Design
EAs were replaced if inaccessibility rendered field work unfeasible. Replacements were approved by the project manager. Replacement of households were approved by the supervisor after a total of three unsuccessful visits of the household.
The sampling weight is the inverse probability of selection. The selection probability for a household can be decomposed into the selection probability of the EA and the selection probability of the household within the EA. The selection probability of an EA is calculated as the number of households within the EA divided by the number of households within the stratum multiplied by the number of selected EAs in the stratum estimated using the 2008 Census. The selection probability for a household within an EA is constant across households and is calculated as the number of households selected in the EA over the number of listed households in the EA. Sampling weights were then scaled to equal the number of households per strata using the Census 2008 data. More information can be found in the technical appendix of the South Sudan Poverty Profile 2015.
Dates of Data Collection
Data Collection Mode
Computer Assisted Personal Interview [capi]
Data Collection Notes
Data collection was implemented in 2 phases by randomly splitting each stratum into two equal-sized parts.
The survey was conducted using a CAPI mode (SurveyCTO sofware). Some questions refered to responses of earlier questions in their wordings. To allow that, the questions might have special characters in their wordings.
Question"Please confirm that your ID number is [qr_id]," where qr_id is another variable. Here the square brackets  are used so that instead of "qr_id," the value of qr_id as selected by the respondent, will appear within the wordings of this question in the tablet when the survey is administered.
The data tables were downloaded directly from the SurveyCTO platform. Each data table has variables "State", "ea", and "hh". These are the identifying variables for the households and are used to match the observations in the given datasets.
The questionnaire comprises the following modules
Module B: Household Roster
Module C: Household Head
Module D: Household Characterisitcs
Module E: Food consumption
Module F: Non-food consumption
Module G: Livestock
Module H: Durable goods
Module I: Physical, Psychological and Social Well-Being
Module J: Enumerator Conclusions
The questionnaire is provided under the Related Materials tab.
See accompanying STATA do-files, available under the related materials tab
Utz J. Pape
The World Bank
Before being granted access to the dataset, all users have to formally agree:
1. To make no copies of any files or portions of files to which s/he is granted access except those authorized by the data depositor.
2. Not to use any technique in an attempt to learn the identity of any person, establishment, or sampling unit not identified on public use data files.
3. To hold in strictest confidence the identification of any establishment or individual that may be inadvertently revealed in any documents or discussion, or analysis. Such inadvertent identification revealed in her/his analysis will be immediately brought to the attention of the data depositor.
The dataset has been anonymized and is available as a Public Use Dataset. It is accessible to all for statistical and research purposes only, under the following terms and conditions:
1. The data and other materials will not be redistributed or sold to other individuals, institutions, or organizations without the written agreement of the World Bank Microdata Library.
2. The data will be used for statistical and scientific research purposes only. They will be used solely for reporting of aggregated information, and not for investigation of specific individuals or organizations.
3. No attempt will be made to re-identify respondents, and no use will be made of the identity of any person or establishment discovered inadvertently. Any such discovery would immediately be reported to the World Bank Microdata Library.
4. No attempt will be made to produce links among datasets provided by the World Bank Microdata Library, or among data from the World Bank Microdata Library and other datasets that could identify individuals or organizations.
5. Any books, articles, conference papers, theses, dissertations, reports, or other publications that employ data obtained from the World Bank Microdata Library will cite the source of data in accordance with the Citation Requirement provided with each dataset.
Use of the dataset must be acknowledged using a citation which would include:
- the Identification of the Primary Investigator
- the title of the survey (including country, acronym and year of implementation)
- the survey reference number
- the source and date of download
Utz.J. Pape, The World Bank. South Sudan National Bureau of Statistics. High Frequency South Sudan Survey 2015, Wave 1 (HFS-W1). Ref. SSD_2015_HFS-W1_v02_M. The World Bank Group. Dataset downloaded from [URL] on [date]
Archive where study is originally stored
World Bank Microdata Library
Disclaimer and copyrights
The user of the data acknowledges that the original collector of the data, the authorized distributor of the data, and the relevant funding agency bear no responsibility for use of the data or for interpretations or inferences based upon such uses.
DDI Document ID
Development Economics Data Group
The World Bank
Documentation of the DDI
Date of Metadata Production
DDI Document version
Version 02 (March 2020)
Input data files and replication programs removed form dissemination package.