Household Income, Consumption and Expenditure Survey 1999-2000 - World Bank SHIP Harmonized Dataset
Survey based Harmonized Indicators (SHIP) files are harmonized data files from household surveys that are conducted by countries in Africa. To ensure the quality and transparency of the data, it is critical to document the procedures of compiling consumption aggregation and other indicators so that the results can be duplicated with ease. This process enables consistency and continuity that make temporal and cross-country comparisons consistent and more reliable.
Four harmonized data files are prepared for each survey to generate a set of harmonized variables that have the same variable names. Invariably, in each survey, questions are asked in a slightly different way, which poses challenges on consistent definition of harmonized variables. The harmonized household survey data present the best available variables with harmonized definitions, but not identical variables. The four harmonized data files are
a) Individual level file (Labor force indicators in a separate file): This file has information on basic characteristics of individuals such as age and sex, literacy, education, health, anthropometry and child survival.
b) Labor force file: This file has information on labor force including employment/unemployment, earnings, sectors of employment, etc.
c) Household level file: This file has information on household expenditure, household head characteristics (age and sex, level of education, employment), housing amenities, assets, and access to infrastructure and services.
d) Household Expenditure file: This file has consumption/expenditure aggregates by consumption groups according to Purpose (COICOP) of Household Consumption of the UN.
Kind of data
Sample survey data [ssd]
The SHIP datasets contains harmonized variables, produced by Africa Region, Office of the Chief Economist (AFRCE) based on the raw data from houshold surveys conducted in African countries.
The data provided by the National Statistical Office is harmonized across countries and across time using a standard SHIP methodology.
Unit of analysis
- Individual level for datasets with suffix _I and _L
- Household level for datasets with suffix _H and _E
The survey covered all de jure household members (usual residents).
Producers and sponsors
Central Statistical Authority (CSA)
Ministry of Finance and Economic Development
Office of the Chief Economist - Africa Region
The 1999/2000 Household Income, Consurnption, and Expendi.ture Survey covered both the urban and the sedentary rural parts of the country. The survey has not covered six zones in Somalia Region and two zones in Afar Region that are inhabited mainly by nomadic population. For the purpose of the survey, the country was divided into three categories . That is, the rural parts of the country and the urban areas that were divided into two broad categories taking into account sizes of their population. Category I: Rural parts of nine Regional States and two administrative regions were grouped in this category each of which were the survey dornains (reporting levels). These regions are Tigrai,Afar, Amhara, Oromia, Sornalia, Eenishangul-Gunuz, SNNP,Gambela, Flarari, Addis Ababa and Dire Dawa.
Category II: All Regional capitals and five major urban centers of the country were grouped in this category. Each of the urban centers in this category was the survey domain (reporting level) for which separate survey results for rnajor survey characteristics were reported.
Category III: Urban centers in the country other than the urban centers in category II were grouped in this category and formed a single reporting level. Other than the reporting levels defined in category II and category III one additional domain, namely total urban (country level) can be constructed by eombining the basic domains defined in the two categories. All in all 35'basie rural and urban domains (reporting levels) were defined for the survey. In addition to the above urban and rural domains, survey results are to be reported at regional and eountry levels by aggregating the survey results for the conesponding urban and rural areas.
Definition of the survey dornains was based on both technical and resource considerations. More specifically, sample size for the domains were determined to enable provision of major indicators with reasonable precision subject to the resources that were available for the survey.
Selection Scheme and Sample Size in Each Category
CategoryI : A stratified two-stage sample design was used to select the sample in which the primary sampling units (PSUs) were EAs. Sample enumeration areas( EAs) from each
domain were selected using systematic sampling that is probability proportional to the size being number of households obtained from the 1994 population and housing census.A total of 722 EAs were selected from the rural parts of the country. Within each sample EA a fresh list of households was prepared at the beginning of the survey's field work and for the administration of the survey questionnaire 12 households per sample EA for rural areas were systematically selected.
Category II: In this category also,a stratified two-stage sample design was used to select the sample. Here a strata constitutes all the "Regional State Capitals" and the five "Major Urban Centers" in the country and are grouped as a strata in this category. The primary sampling units (PSUs) are the EA's in the Regional State Capitals and the five Major Urban Centers and excludes the special EAs (non-conventional households). Sample enumeration areas( EAs) from each strata were selected using systematic sampling probability proportional to size, size being number of households obtained from the 1994 population and housing census. A total of 373 EAs were selected from this domain of study. Within each sample EAs a fresh list of households was prepared at the beginning of the survey's field work and for the administration of the questionnaire 16 household per sample EA were systematically selected-
Category III: Three-stage stratified sample design was adopted to select the sample from domains in category III. The PSUs were other urban centers selected using systematic sampling that is probability proportional to size; size being number of households obtained from the 1994 population and housing census. The secondary sampling units (SSUs) were EAs which were selected using systematic sampling that is probability proportional to size; size being number of households obtained from the 1994 population and housing census. A total of 169 sample EAs were selected from the sample of other urban centers and was determined by proportional allocation to their size of households from the 1994 census. Ultimately, 16 households within each of the sample EAs were selected systematically from a fresh list of households prepared at the beginning of the survey's fieldwork for the administration of the survey questionnaire.
Dates of collection
First round data collection (G.C calender)
Second round data collection (G.C calender)
Mode of data collection
The Household Income, Consumption and Expenditure Survey questionnaire contains the following forms:
- Form 1: Area Identification and Household Characteristics
- Form 2A: Quantity and value of weekly consumption of food and drinks consumed at home and tobacco/including quantity purchased, own produced, obtained, etc for first and second week.
- Form 2B: Quantity and value of weekly consumption of food and drinks consumed at home and tobacco/including quantity purchased, own produced, obtained, etc for third and fourth week .
- Form 3A: All transaction (income, expenditure and consumption) for the first and second weeks except what is collected in Forms 2A and 2B
- Form 3B: All transaction (income, expenditure and consumption) for the third and fourth weeks except what is collected in Forms 2A and 2B
- Form 4: All transaction (expenditure and consumption) for last 6 months for Household expenditure on some selected item groups
- Form 5: Cash income and receipts received by household and type of tenure. The survey questionnaire is provided as external resource.
Data collected by the survey was harmonized in accordance with the guidelines in the Africa SHIP manual. Program with suffix _I extracts individual level information, _L extracts labor and time use information, _H extracts household information and _E extracts expenditure information.
Office of the Chief Economist, Africa Region
Use of the dataset must be acknowledged using a citation which would include:
- the Identification of the Primary Investigator
- the Identification of the SHIP Harmonized Dataset producer
- the title of the survey (including country, acronym and year of implementation)
- the survey reference number
- the source and date of download
Disclaimer and copyrights
The user of the data acknowledges that the original collector of the data, the producer of the SHIP Harmonized Dataset, the authorized distributor of the data, and the relevant funding agency bear no responsibility for use of the data or for interpretations or inferences based upon such uses.