State of the Cities Baseline Survey 2012-2013

Kenya, 2012 - 2013

Get Microdata

Reference ID

KEN_2012_SCBL_v01_M

DOI

https://doi.org/10.48529/1gb8-s885

Producer(s)

Sumila Gulyani, Wendy Ayres, Ray Struyk, Clifford Zinnes

Metadata

Documentation in PDF DDI/XML JSON

Created on

Mar 24, 2017

Last modified

Mar 24, 2017

Page views

38383

Downloads

75904

Identification

Survey ID number

KEN_2012_SCBL_v01_M

Title

State of the Cities Baseline Survey 2012-2013

Country/Economy

Name	Country code
Kenya	KE

Study type

Other Household Survey [hh/oth]

Abstract

The objective of the survey was to produce baselines for 15 large urban centers in Kenya. The urban centers covered Nairobi, Mombasa, Naivasha, Nakuru, Malindi, Eldoret, Garissa, Embu, Kitui, Kericho, Thika, Kakamega, Kisumu, Machakos, and Nyeri. The survey covered the following issues: (a) household characteristics; (b) household economic profile; (c) housing, tenure, and rents; and (d) infrastructure services. The survey was undertaken to deepen understanding of the cities’ growth dynamics, and to identify specific challenges to quality of life for residents. The survey pays special attention to living conditions for residents of formal versus informal settlements, poor versus non-poor, and male and female headed households.

Kind of Data

Sample survey data [ssd]

Unit of Analysis

Household
Urban center

Version

Version Description

v2.1: Edited, anonymous dataset for public distribution.

Scope

Notes

Household characteristics, including (a) population description, (b) household size and composition, (c) household education, and (d) household health.
Household economic profile, including (a) household poverty, income, and expenditure; (b) employment; (c) household wealth; (d) household finance: banking, credit, and remittances; and (e) household-owned enterprises.
Housing, tenure, and rents, including (a) housing size and quality; (b) tenure and security; (c) housing values and rents; and (d) social capital, civic participation, and crime.
Infrastructure services, including (a) water access and quality; (b) electricity; (c) refuse and sanitation; (d) transportation; and (e) communications.

Producers and sponsors

Primary investigators

Name	Affiliation
Sumila Gulyani	World Bank
Wendy Ayres	World Bank
Ray Struyk	NORC
Clifford Zinnes	NORC

Producers

Name	Role
Kenya Ministry of Local Government	Advice
Kenya National Bureau of Statistics	Advice

Funding Agency/Sponsor

Name	Role
Cities Alliance	Donor
Bill and Melinda Gates Foundation	Donor
Swedish Internation Development Cooperation Agency	Donor

Sampling

Sampling Procedure

The Kenya State of the Cities Baseline Survey is aimed to produce reliable estimates of key indicators related to demographic profile, infrastructure access and economic profile for each of the 15 towns and cities based on representative samples, including representative samples of households (HHs) residing in slum and non-slum areas. For this baseline household survey, NORC used a two- or three-stage stratified cluster sampling design within each of the 15 urban centers. Our first-stage sampling frame was based on the 2009 census frame of enumeration areas. For each of the 15 towns and cities, NORC received the sampling frame of EAs from the Kenya National Bureau of Statistics (KNBS). In the first stage, NORC selected a sample of enumeration areas (PSUs). The second stage involved a random selection of households (SSUs) from each selected EA. In order to manage the field interviewing efficiently, we drew a fixed number of HHs from each selected EA, irrespective of EA size. The third stage arose in instances of very large EAs (EAs containing more than 200 households) in which EAs were divided into 2, 3 or 4 segments, from which one segment was selected randomly for household selection.

Stratification of Enumeration Areas: A few stratification factors were available for stratifying the EAs to help to achieve the survey objectives. As mentioned earlier, for this baseline survey we wanted to draw representative samples from slum and non-slum areas and also to include poor/non-poor households (HHs). For the 2009 census, depending on the location, KNBS divided the EAs into three categories: rural, urban, and peri-urban.

Although there is a clear distinction of EAs into slum and non-slum areas, it is hard to classify EAs into poor and non-poor categories. To guarantee enough representation of HHs living in slum and non-slum areas (also referred to as formal and informal areas) as well as HHs living below and above the poverty line, NORC stratified the first-stage sampling units (EAs) into strata, based on EA type (3 types) and settlement type (2 types). Given the resources available, we believe this stratification would serve our purpose as HHs living in slum and in rural areas tend to be poor. Table 1 in Appendix C of final Overview Report (provided under the Related Materials tab) presents the allocation of sampled EAs across the strata for each of the 15 cities in the baseline survey.

Sampling households is not as straightforward as the first-stage sampling of EAs, since the 2009 census frame of HHs does not exist. In the absence of a household sampling frame, NORC carried out a listing of HHs within each EA selected in the first stage. Trained listers, accompanied by local cluster guides (local residents with some form of authority in the EA), systematically listed all households in each selected EA, gathering the address, names of head of household and spouse, household description, latitude and longitude. To ensure completeness of listing data, avoid duplication and improve ease of locating households that were eventually selected for interview, listers enumerated households by chalking household identification number above the household doorway (an accepted practice for national surveys). The sampling frame of HHs produced from the listing activity was, therefore, up-to-date and included new formal and informal settlements that appeared after the 2009 census.

For adequate representativeness and to manage the interviewing task efficiently, NORC planned seven completed household interviews per EA. The final recommended sample size for the Kenya State of the Cities baseline survey is found in Table 2 in Appendix C of the final Overview Report.

Because the expected response rate was unknown prior to the start of the field period, the sampling team randomly selected ten households per enumeration area and distributed them to the interviewers working within the EA. Interviewing teams were instructed to complete at least seven interviews per EA from among the ten selected households. Interviewers were instructed to attempt at least three contacts with each selected household, approaching potential respondents on different days of the week and different times of day. Table 2 presents the final number of EAs listed per city and the final number of completed interviews per city. The
table also presents the percent of planned EAs and interviews that were completed vs. planned. Please note that in several cities more interviews were completed than planned. As part of NORC's data quality plan, data collection teams were instructed to overshoot slightly the target of seven interviews per EA, if feasible, to
mitigate any potential loss of cases due to poor quality or uncooperative respondents. Few cases were lost due to poor quality, therefore the target number of interviews remains over 100 percent in ten of the fifteen cities.

Response Rate

The completion rate is reported as the number of households that successfully completed an interview over the total number of households selected for the EA. These are shown by city in Table 5 in Appendix C of the final Overview Report, and have an average rate of 68.66 percent, with variation from 66 to 74 percent (aside from Nairobi at 61.47 percent and Machakos at 56 percent). As described earlier, ten households were selected per EA if the EA contained more than 10 households. For EAs where fewer than ten households were selected for interviews, all households were selected. In some EAs, more than ten households were selected due to a central office error.

Survey instrument

Questionnaires

The questionnaire was developed by World Bank staff with input from stakeholders in the Kenya Municipal Program and NORC researchers and survey methodologists. The base questionnaire for the project was a 2004 World Bank survey of Nairobi slums. However, an extended iterative review process led to many changes in the questionnaire. The final version that was used for programming provided under the Related Materials tab, and in Volume II of the Overview.

The questionnaire’s topical coverage is indicated by the titles of its nine modules:

Demographics and household composition
Security of housing, land and tenure
Housing and settlement profile
Economic profile
Infrastructure services
Health
Household enterprises7
Civil participation and respondent tracking

Data collection

Dates of Data Collection

Start	End
2012-06-15	2013-02-15

Supervision

Staffing the large scale data collection was a crucial factor in establishing high quality data. Supervisors and interviewers were recruited by IRC using guidelines developed by NORC, which emphasized CAPI experience, face-to-face interviewing experience, the ability to gain cooperation and a commitment to data quality. Interviewers were grouped into eight teams of 6-8 interviewers, each of which was led by an IRC supervisor with experience managing complex face-to-face social scientific surveys Training for the data collection team took place in three phases. In the first phase, supervisors were recruited,with particular care taken to include supervisors from ethnic and linguistic groups represented among the15 cities. Supervisors participated in a five-day pretesting activity that included 2.5 days of classroom and small group training to become familiar with the tablet computers and programmed questionnaire, followed by two days of pretesting among a convenience sample of respondents in informal settlements in Nairobi.

The second phase of training included a one-day Training of Trainers (ToT) and two days of Supervisor training, including detailed instruction on carrying out listing and sampling, gaining cooperation among respondents, coaching interviewers, reporting and ensuring quality control, confidentiality and security. Eight supervisors attended the ToT and Supervisor training.

The third phase of training included five days of classroom and small group activities for the 58 interviewers brought to training followed by two days of piloting among a convenience sample in informal areas of Nairobi. All interviewers were required to pass a practical exam using the tablet questionnaire and to successfully demonstrate all listing and interviewing tasks during the two day pilot. After training, three interviewers were dismissed from the data collection.

Data Collection Notes

The project goals include a comparison of households with expenditures above and below the poverty line. In the course of questionnaire development, NORC recommended using a relative poverty measure, whose formula is given at the end of the Introduction of this report. Poverty rate data from the KIHBS published report shows a 27 percent poverty rate for all urban areas. There is,however, a wide variation among poverty rates among the four urban centers for which they are separately reported, ranging from 19.6 percent in Nairobi to 41.4 percent in Nakuru. For these four towns, the place specific rates could be used to divide households into poor and non-poor groups. For the balance, the national rate for urban areas excluding these four municipalities, 33.1 percent could be employed.

The Kenya State of the Cities questionnaire underwent extensive testing prior to the main data collection. NORC and its data collection subcontractor, Infotrak Research and Consulting (IRC), carried out focus groups in Nairobi and Thika, incorporating suggested wording and flow changes. The questionnaire was translated by two independent translators and then pretested again amongst interviewers and supervisors for additional input to both the English and Kiswahili versions. Finally, the data collection team pre-tested the questionnaire, including protocols for gaining cooperation, among a convenience sample in two neighborhoods in Nairobi. Changes to the questionnaire were tracked, with explanations for changes, deletions and additions. All changes were reviewed by the WB research team and programmed into the survey application only after approval by the World Bank.

NORC contracted Manobi, S.A., a telecommunications and data company based in Senegal to program the questionnaire for use as a computer-assisted-personal-interview (CAPI). The program was loaded onto tablet computers and field-tested prior to data collection.

Duration: Pretesting of the paper questionnaire prior to programming suggested a mean duration of approximately 50 minutes. Pretesting of the programmed questionnaire during supervisor pretest in-office and in the field showed an approximate length of 45 minutes. Fielded duration showed a median of 21 minutes, with some variation among cities, as shown in Table 4 in Appendix C. While duration values are captured automatically within the questionnaire in the form of timestamps at each question, the total duration of interviews may have been compromised when some supervisors, in keeping with common practice for paper and pencil surveys, reviewed enumerators’ completed electronic questionnaires after completion and before transmitting the surveys to the server. This activity of scrolling through the questionnaire may have reset timestamps, causing completed surveys to appear very short in duration.

Challenges and Adjustments: The Kenya State of the Cities baseline survey comprised a complex and very large-scale set of data collection
activities. The listing task required in-person door-to-door enumeration of over 140,000 households in 15 cities
across Kenya. The interviewing task required locating, gaining cooperation and interviewing approximately 14,600
respondents. The NORC/IRC team experienced several challenges over the course of the project, specifically
• Missing or inaccessible enumeration areas;
• Extended field period overlapping election season; and,
• Enumerator errors.
These are described in more detail in section A.7 of the final Overview Report (provided under the Related Materials tab).

Data Access

Confidentiality

Is signing of a confidentiality declaration required?	Confidentiality declaration text
yes	Before being granted access to the dataset, all users have to formally agree: 1. To make no copies of any files or portions of files to which s/he is granted access except those authorized by the data depositor. 2. Not to use any technique in an attempt to learn the identity of any person, establishment, or sampling unit not identified on public use data files. 3. To hold in strictest confidence the identification of any establishment or individual that may be inadvertently revealed in any documents or discussion, or analysis. Such inadvertent identification revealed in her/his analysis will be immediately brought to the attention of the data depositor.

Access conditions

Public use files, accessible to all

Citation requirements

Use of the dataset must be acknowledged using a citation which would include:

the Identification of the Primary Investigator
the title of the survey (including country, acronym and year of implementation)
the survey reference number
the source and date of download

Example: Gulyani Sumila, Wendy Ayres, Ray Struyk and Clifford Zinnes.2012. Kenya State of the Cities Baseline Survey 2012-2013. Ref: KEN_2012_SOCBL_v01_M. World Bank & NORC. Downloaded from [URL] on [Date]

Disclaimer and copyrights

Disclaimer

The user of the data acknowledges that the original collector of the data, the authorized distributor of the data, and the relevant funding agency bear no responsibility for use of the data or for interpretations or inferences based upon such uses.

2014, The World Bank

Contacts

Name	Affiliation	Email
Sumila Gulyani	World Bank	sgulyani@worldbank.org
Wendy Ayres	World Bank	wayres@worldbank.org

Metadata production

DDI Document ID

DDI_KEN_2012_SCBL_v01_M_WB

Producers

Name	Affiliation	Role
Development Economics Data Group	The World Bank	Documentation of the DDI

Date of Metadata Production

2017-03-15

Metadata version

DDI Document version

Version 01 (March 2017)

Citation

loading, please wait...

Export citation: RIS | BibTeX | Plain text

Back to Catalog