KEN_2022_K-LSRH_v01_M
Longitudinal Socioeconomic Study of Refugees and Host Communities 2022-2023
Wave 1
K-LSRH 2022-2023
Name | Country code |
---|---|
Kenya | KEN |
1-2-3 Survey, phase 1 [hh/123-1]
This cross-sectional survey was conducted with refugee and host community households in Dadaab, Kakuma, Kalobeyei, Nairobi, Nakuru and Mombasa and was conducted between 12 May 2022 to 13 June 2023.
Sample survey data [ssd]
Household, individual.
v01 - Edited, anonymized dataset for public distribution.
This survey covers four major modules named after their respondents: the ‘Household respondent (HR)’, ‘Representative respondent (RR)’, ‘Women empowerment respondent (WER)’ and ‘Child respondent (CR)’. The HR module covers the following topics: demographics, dwelling characteristics, consumption and expenditures, and household members’ labor. The RR module includes a more in-depth module on labor in addition to modules on food insecurity and assistance, transfers and remittances, household shocks, savings and credit, migration, social cohesion and social capital, health and wellbeing, marriage, fertility, ethnicity and religion and weather variability. The WER module covers women empowerment for female respondents aged 15 and above. Finally, CR module covers the education and aspirations of 5th grade children and their caregivers.
The CR module is available upon request only, as are the submodules on child labor (A8.3), HR agriculture (A8.1-E2) and RR agriculture (B3E1).
Garissa, Turkana, Nairobi, Nakuru and Mombasa counties, Kenya
Name | Affiliation |
---|---|
Precious Zikhali | World Bank |
Nistha Sinha | World Bank |
Utz Pape | World Bank |
Theresa Beltramo | UNHCR |
Edward Miguel | University of California- Berkeley |
Name | Role |
---|---|
PROSPECTS | Funding |
The survey has two primary samples contained in the ‘status’ variable: the refugee sample and the host community sample. Refugees were sampled from two frames, the UNHCR registration database proGres and the Socioeconomic Surveys (SES) conducted in Kakuma, Kalobeyei and Nairobi. ProGres holds information on all refugee and asylum seekers families in Kenya registered with UNHCR including their phone numbers and the approximate location, data on nationality. The second frame consists of the samples of the Kalobeyei SES (2018), Kakuma Refugee Camp SES (2019) and Urban SES in Nairobi (2020-21). It holds rich socioeconomic baseline data and is a subset of the first. In the first stage, a subsample of proGres families is drawn from the entire proGres frame using implicit stratification by sub-county and nationality. Notably, proGres families are not always the same as households defined in standard surveys. For K-LSRH, a household is defined as a “person or group of people living in the same compound (fenced or unfenced); answerable to the same head and sharing a common source of food/share cooking arrangement.” Meanwhile, proGres families refer to the group of people in which refugees are registered to UNHCR. A household can comprise of multiple proGres families and the selection probability has been adjusted accordingly.
In the second stage, an independent subsample was drawn from the SES frame for each of the strata with the existing SES. Households who had arrived in Kenya before the SES and who had not left the country, represented an overlap group which has a positive probability of being selected into the sample through both the SES frame and through proGres directly. For this subset of households, weights were adjusted using an overlapping frames correction.
Hosts were sampled using a two-step cluster design. The sampling frame of host households consists of communities living in close proximity to refugees who are potentially impacted by their presence. The host community of camp refugees in Turkana and Dadaab are defined as those living within a 15 km radius of the camp borders, while host communities of urban refugees are defined as neighborhoods hosting at least 10 percent of proGres families living in each city of the stratum including; Kayole, Eastleigh North, and Kasarani in Nairobi, Old Town and Majengo in Mombasa, and Lanet, Shabaab, and Langalanga in Nakuru. Sampling follows a two-step clustered design, whereby a set of Enumeration Areas (EAs) and replacement EAs is drawn with probability proportional to size (PPS) from the universe of eligible EAs. Subsequently, a listing exercise was carried out to eliminate ineligible households including households not living within the boundary of selected EAs and households not providing consent to the listing interview. Households where one member was registered with UNHCR as a refugee or asylum seeker were also not interviewed, to rule out overlap with the refugee sampling frame. In the second stage, for each selected EA, 10 households and a list of 5 replacements for the case of non-response was drawn using simple random sampling. This resulted in a sample of 2,000 households for the host community of camp refugees and of 1,500 households for the host community of urban refugees.
A representative respondent (RR) was randomly selected from among the household members aged 18 and above and administered a set of in-depth modules on employment and individual-level characteristics. In addition, the survey included a module on women’s empowerment, administered by trained female enumerators to randomly selected female respondents aged 15 and above. If the RR was a woman, the same respondent also completed the woman empowerment module; otherwise, a woman aged 15 years and above was selected among the women in the household based on simple random sampling. Finally, a module on child education and aspirations was administered to a cohort of upper primary school children and their main caregivers. One Child Respondent (CR) was randomly selected among 5th-grade students currently attending school of any age up to and including 17 years (to also capture over-age students) in selected households. The child’s main caregiver (CG) was also interviewed to capture parental aspirations and main barriers to education. The Caregiver refers to the household member responsible for making the educational decisions for the child. Unlike other modules, the Education Module was only implemented in the Kakuma, Kalobeyei, and Dadaab refugee strata and the Turkana and Dadaab host community strata.
Weights are calculated separately for refugees drawn from the proGres frame and the SES frame. Refugees drawn from proGres were selected using simple random sampling (SRS) and their base weights equal the inverse of the selection probability. Weights are then adjusted to account for the possibility of multiple proGres families forming one surveyed household and for the possibility of nonresponse. Finally, weights are post-stratified to population totals.
For refugees drawn from the SES frame, the socioeconomic surveys serve as a baseline survey, and weights can be calculated following the approach for panel survey weights:
Finally, after obtaining nonresponse adjusted weights for both subsamples, weights are adjusted for the possibility of some households having been drawn from both frames. This is achieved using a single frame estimator method.
Final household weight variable: weight.
For hosts, the weight incorporates two selection probabilities: The probability of the enumeration area being selected for listing and the probability of the household being subsequently selected for interview. The host weight equals the inverse of the multiple of both selection probabilities, adjusted for nonresponse and post-stratified to population totals obtained from the 2019 Kenya Population and Housing Census.
Final household weight variable: weight.
The nonresponse adjustment step in all three cases follows the same procedure:
The representative respondent (RR) is randomly drawn from the list of adult household members. If the household had both male and female members aged 18+, the gender of the RR was first selected at random. If all adults have the same gender, that gender is selected automatically. The RR is then drawn from the list of members of the selected gender using simple random sampling. RR weights are calculated in four steps:
For the Education Module, one child respondent (CR) is randomly drawn among the 6th-grade children in selected households using a simple random sampling. The probability of a CR being selected from an interviewed household (p_cr) is therefore equal to the inverse of the number of 5th-grade students in the household, and CR weights are calculated by scaling the household weight by the inverse of p_cr.
Variable: weight_cr.
If the RR is a woman, she is automatically selected as the Women Empowerment respondent (WER) and assigned the RR weight. If the RR is a man, the WER was selected at random among the women/girls aged 15 and above and the WER weight is scaled by the selection probability.
Variable: weight_wer.
Targeted modules on women empowerment and child education were administered to women aged 15 and above and 5th grade children, respectively. A part of the education module was also answered to the child’s caregiver (adult 18 years and above). Unlike other modules, the education module was only implemented in the Kakuma, Kalobeyei, and Dadaab refugee strata and the Turkana and Dadaab host community strata. Targeted modules were also administered to a ‘representative respondent’, a randomly drawn adult from the household, covering employment, migration, savings and mental health.
The Questionnaire is provided as external resources in pdf format. Questionnaires were produced through the Ipsos developed iField software.
When variable names consist of a letter and a number, the letter indicates the questionnaire section and the number the order in which the question was administered. An explanation of the variable contents is contained in the variable labels.
Extended missing values are used to indicate why a value is missing for all variables. The following extended missing values are used in the dataset:
• .a for ‘Don’t know’
• .b for ‘Refused to respond’
• .c for ‘Outliers set to missing’
• .d for ‘Variables dropped during anonymization due to PII’
• .e for ‘Field Skipped’ (where an error in the survey tool caused the question to be missed)
• .z for ‘Not applicable’ (as the variable was not relevant to the observation)
Start | End | Cycle |
---|---|---|
2022-05-13 | 2023-06-12 | Wave 1 |
PRE-LOADED INFORMATION: Basic household information from the host community household listing (hosts), UNHCR’s proGres database and SES database (refugees) was pre-loaded in iField for each enumerator. The information, for example the household's location, household head name, phone numbers etc, was used to help enumerators identify the target households.
RESPONDENTS: The survey had four modules responded by various respondents per households. The Household module was responded by Household Respondent (HR) who was any knowledgeable adult (18 years or older) who was willing to provide information about the household. The Representative Respondent module was responded to by a Representative respondent (RR) who was a randomly selected adult (18 years or older). The RR could be the same person as the HR or was randomly selected from the household roster if the HR was not available. The women empowerment module was responded by a Women empowerment respondent (WER) who could be the same person as the RR if RR was a woman, otherwise, a girl/woman aged 15 years and above was randomly selected as WER. The education module was responded by two respondents namely Child respondent (CR) and caregiver (CG). The CR was a randomly selected child among 5th-grade students currently attending school of any age up to and including 17 years (to also capture over-age students). The child’s main caregiver who is the household member responsible for making educational decisions for the CR was also interviewed.
Name |
---|
World Bank |
Confidentiality declaration text |
---|
Before being granted access to the dataset, all users have to formally agree: 1. To make no copies of any files or portions of files to which s/he is granted access except those authorized by the data depositor. 2. Not to use any technique in an attempt to learn the identity of any person, establishment, or sampling unit not identified on public use data files. 3. To hold in strictest confidence the identification of any establishment or individual that may be inadvertently revealed in any documents or discussion, or analysis. Such inadvertent identification revealed in her/his analysis will be immediately brought to the attention of the data depositor. |
Use of the dataset must be acknowledged using a citation which would include:
Example:
Precious Zikhali (World Bank), Nistha Sinha (World Bank), Utz Pape (World Bank), Theresa Beltramo (UNHCR), Edward Miguel (University of California- Berkeley). Kenya - Longitudinal Socioeconomic Study of Refugees and Host Communities 2022-2023, Wave 1 (K-LSRH 2022-2023). Ref: KEN_2022_K-LSRH_v01_M. Downloaded from [uri] on [date].
The user of the data acknowledges that the original collector of the data, the authorized distributor of the data, and the relevant funding agency bear no responsibility for use of the data or for interpretations or inferences based upon such uses.
DDI_KEN_2022_K-LSRH_v01_M_WB
Name | Abbreviation | Affiliation | Role |
---|---|---|---|
Development Data Group | DECDG | World Bank | Documentation of the study |
2024-11-18
Version 01 (2024-11-18)
This site uses cookies to optimize functionality and give you the best possible experience. If you continue to navigate this website beyond this page, cookies will be placed on your browser. To learn more about cookies, click here.