BIH_2001_LSMS_v01_M
Living Standards Measurement Survey 2001 (Wave 1 Panel)
Name | Country code |
---|---|
Bosnia and Herzegovina | BiH |
Living Standards Measurement Study [hh/lsms]
This is the first Living Standards Measurement Survey conducted in Bosnia and Herzegovina. It provides Wave 1 of a panel survey which also includes LSMS surveys 2002, 2003, and 2004.
Sample survey data [ssd]
Roster
Basic demographic information on the household. The module was used to list all the members of the household, their relationship to the head of household and other household members, their age, sex, and marital status. Information was also collected about individuals absent from the household.
Housing
Information on the housing in which the household lives as well as utilities used. The module has four sections:
A. Description of Primary Residence: Type and condition of dwelling, number of rooms, living area, and presence of utilities such as electricity, water, sewerage, and telephone.
B. Legal status of ownership of dwelling unit: Legal status as well as expenditures on housing and related services, Ownership and Purpose of Secondary Residence
C. Durable goods: Ownership, date of purchase and present value of such goods
Education
Data on levels of schooling, attendance and characteristics of schooling, including:
A. Child Care and Kindergarten: Attendance and monthly expenditures for child care or pre-school. The section was administered for children from 0 through 6 years of age.
B. General Education: Literacy status, educational qualifications and specialization, type of schools attended, formal and informal education expenditures, source of financial assistance during the academic year 2000-2001, distance of the school from home etc. The section was administered to all persons 7 years and older and for children less than 7 years who attended school.
Health
Data on health status and use of health services including:
A. Utilization of Health Care Services: Use of different levels/types of health services , self medication and all health expenditures. Questions were also included on the prevalence of chronic ailments and the availability of health insurance. The section was administered for all household members, regardless of age.
B. Health Status: This section elicited information on individual’s selfreported health status as well as the screening questions for clinical depression. The section was only administered to adults 17 years and older.
Labor
This module elicited information on labor market activity status during the reference week preceding the survey. For employed persons, information on their occupation, sector of employment, type of employment, place of work, previous employment, number of hours worked in the week and monthly earnings were asked. For unemployed persons, questions were asked on the duration of unemployment, previous employment (sector, occupation), method of seeking work, and whether or not they were registered as unemployed with the Employment Bureau. For inactive persons, questions on present status, previous employment as well as registration at the Employment Bureau were asked. The entire module was administered to persons 15 years and older.
Credit
Information was gathered on the number of times the person had borrowed from different sources, amount borrowed during the last 12 months, and the amount presently owed, as well as the month and year of the last borrowing, reasons for borrowing and refusals of loans. The entire module was administered to persons 15 years and older.
Privatization Vouchers and Certificates
This module included questions on a person’s eligibility for a voucher or certificate, the value of the vouchers or certificates received, transactions made with them, sale value of vouchers or certificates sold, and the nominal value of the vouchers or certificates in their possession. The module was administered for all household members even though the certificates in the Federation were not given to children. But the RS vouchers were and, given that people with rights in one entity can live in another, information was needed for all household members.
Migration
Information collected on the person’s (i) current residence, (ii) municipality of birth, (iii) residence prior to the war (April 1992), (iv) reason for migration and (v) current residential status (categories based on migration history not simply present place of residence). The module was administered to persons 15 years and older.
Social Assistance
This module included questions on (i) the individual’s eligibility for old age pension, disability pensions, survivors pensions, and/or war veteran’s pension, (ii) monthly pension received, and (iii) the allowances and services received during the preceding 12 months.
End of First Visit
This module is intended to identify households to be covered by Module 12 (non-agricultural activities ) and Module 13 (agricultural activities). It also includes questions on efforts to start a household business (whether this effort was or was not successful) and key problems faced.
Household Consumption
Each of the following sections elicited information on the quantity and value of purchased items, own production and the value of items received as gifts.
A. Daily Expenses: Purchases in the last 7 days of frequently purchased items such as tobacco, cigarettes and meals/snacks eaten outside the home.
B. Food Consumption: Average monthly expenditures on items of food consumption such as bread and cereals, meat, fish, edible oil and fat, sugar, and confectionary, other commonly consumed items like salt, vinegar etc, soft drinks, and alcoholic drinks, and, seasonal products such as fruits and vegetables.
C. Non-Food: Monthly Expenditures on such non-food products as transport, cosmetics, fuel , and cleaning products. Annual Expenditures on such non-food products such as clothing and footwear, furniture and fixtures, personal transport, recreation equipments and services, personal care services, financial services, other miscellaneous expenses such as gifts, losses from lottery, thefts etc and expenditures on weddings and other ceremonies
Nonagricultural Household Businesses
This module elicited information from households engaged in nonagricultural business activities:
A. Identification of enterprises or household businesses: nature of the activity pursued, persons engaged in such activities and the number of such activities.
B. General Information on enterprise or household business: Length of time the enterprise has been in operation, location, ownership, number of days in a week operated, number of persons engaged.
C. Labor in Enterprise or Household Business: The number of persons engaged in the business, both household member and non-member, the number receiving wages in cash or in kind.
D. Revenues and Inputs: The number of months the business operated, gross earnings in an average month, expenses on inputs in an average month
E. Capital Assets: The value of fixed capital such as land, buildings, equipment and machines, furniture, small and large tools, big vehicles, small vehicles, other fixed assets, value additions to total assets during the past 12 months and main problems faced by the establishment
Agricultural Activities
This module collected information on farming operations with special focus on farm management, inputs and earnings.
A1. Land Used: Area of land used by type of use, irrigation, present value of the land, ownership, lease value during 2000-01.
A2. Unused Land Owned by Household: The type of land, how obtained, present value, time since last used, type of use contract, lease amount received during 2000-01 etc.
B1. Use of Forest Land: Age of forest, whether the forest was harvested, value of products sold, value of products used by household
B2. Crop Production and Use: Area of land used by crop, amount harvested, sold, lost to pests, used as wages, used as animal feed, processed, consumed by the household and given away as gifts.
C1. Inputs and Investments: The quantity of seeds or seedlings used, amount purchased, cost, used from own production, whether obtained as gift and from whom.
C2. Inputs and Investments—Fertilizers: Quantity used, purchased, cost, obtained as gift and from whom.
C3. Inputs and Investments--Fuel and Energy: Amount used, purchased, cost, obtained as gift and from whom.
C4. Inputs and Investments—Labor: Then number of paid workers by job type (soil preparation, sowing and planting, input preparation, weeding, spraying, watering, harvesting, mowing and other), number of paid work days, average daily wage, whether payment was made in-kind.
C5. Inputs and Investments—Machinery: Whether machinery was hired for ploughing, harrowing, other cultivation, sowing and planting, harvesting, mowing, transport or other activities. The source of hire, number of machine hours hired, amount paid per hour and whether payments were made in-kind.
D1. Livestock: Quantity of livestock and their value. Number sold, consumed, lost, given away, or bought during the past 12 months. Number of new born, number received as gifts, whether any livestock product was sold and its value.
D2. Animal Feed: Quantity of animal feed used during past 12 months, quantity and value of purchases, own-produced and received as gifts, and source.
E. Farm Capital Assets: Type of capital assets, their market value, age of the assets, whether the asset is rented out, earnings during 2000-01 from renting out the capital assets.
National coverage.
Domains: Urban/rural/mixed; Federation; Republic
Name |
---|
State Agency for Statistics (BHAS) |
Republika Srpska Institute of Statistics (RSIS) |
Federation of BiH Institute of Statistics (FIS) |
Name | Role |
---|---|
The World Bank | Technical assistance |
Name |
---|
Department for International Development of the British Government |
Japanese Government |
United Nations Development Program |
World Bank |
A total sample of 5,400 households was determined to be adequate for the needs of the survey: with 2,400 in the Republika Srpska and 3,000 in the Federation of BiH. The difficulty was in selecting a probability sample that would be representative of the country's population. The sample design for any survey depends upon the availability of information on the universe of households and individuals in the country. Usually this comes from a census or administrative records. In the case of BiH the most recent census was done in 1991. The data from this census were rendered obsolete due to both the simple passage of time but, more importantly, due to the massive population displacements that occurred during the war.
At the initial stages of this project it was decided that a master sample should be constructed. Experts from Statistics Sweden developed the plan for the master sample and provided the procedures for its construction. From this master sample, the households for the LSMS were selected.
Master Sample
[This section is based on Peter Lynn's note "LSMS Sample Design and Weighting - Summary". April, 2002. Essex University, commissioned by DfID.]
The master sample is based on a selection of municipalities and a full enumeration of the selected municipalities. Optimally, one would prefer smaller units (geographic or administrative) than municipalities. However, while it was considered that the population estimates of municipalities were reasonably accurate, this was not the case for smaller geographic or administrative areas. To avoid the error involved in sampling smaller areas with very uncertain population estimates, municipalities were used as the base unit for the master sample.
The Statistics Sweden team proposed two options based on this same method, with the only difference being in the number of municipalities included and enumerated. For reasons of funding, the smaller option proposed by the team was used, or Option B.
Stratification of Municipalities
The first step in creating the Master Sample was to group the 146 municipalities in the country into three strata- Urban, Rural and Mixed - within each of the two entities. Urban municipalities are those where 65 percent or more of the households are considered to be urban, and rural municipalities are those where the proportion of urban households is below 35 percent. The remaining municipalities were classified as Mixed (Urban and Rural) Municipalities. Brcko was excluded from the sampling frame.
Urban, Rural and Mixed Municipalities: It is worth noting that the urban-rural definitions used in BiH are unusual with such large administrative units as municipalities classified as if they were completely homogeneous. Their classification into urban, rural, mixed comes from the 1991 Census which used the predominant type of income of households in the municipality to define the municipality. This definition is imperfect in two ways. First, the distribution of income sources may have changed dramatically from the pre-war times: populations have shifted, large industries have closed and much agricultural land remains unusable due to the presence of land mines. Second, the definition is not comparable to other countries' where villages, towns and cities are classified by population size into rural or urban or by types of services and infrastructure available. Clearly, the types of communities within a municipality vary substantially in terms of both population and infrastructure.
However, these imperfections are not detrimental to the sample design (the urban/rural definition may not be very useful for analysis purposes, but that is a separate issue). [Note: It may be noted that the percent of LSMS households in each stratum reporting using agricultural land or having livestock is highest in the "rural" municipalities and lowest in the "urban" municipalities. However, the concentration of agricultural households is higher in RS, so the municipality types are not comparable across entities. The percent reporting no land or livestock in RS was 74.7% in "urban" municipalities, 43.4% in "mixed" municipalities and 31.2% in "rural" municipalities. Respective figures for FbiH were 88.7%, 60.4% and 40.0%.]
The classification is used simply for stratification. The stratification is likely to have some small impact on the variance of survey estimates, but it does not introduce any bias.
Selection of Municipalities
Option B of the Master Sample involved sampling municipalities independently from each of the six strata described in the previous section. Municipalities were selected with probability proportional to estimated population size (PPES) within each stratum, so as to select approximately 50% of the mostly urban municipalities, 20% of the mixed and 10% of the mostly rural ones. Overall, 25 municipalities were selected (out of 146) with 14 in the FbiH and 11 in the RS. The distribution of selected municipalities over the sampling strata is shown below.
Stratum / Total municipalities Mi / Sampled municipalities mi
Note: Mi is the total number of municipalities in stratum i (i=1, … , 6); mi is the number of municipalities selected from stratum i;
As the selection of the specific municipalities in the Master Sample was made PPES within strata, for each municipality, the probability of selection was:
Pj = mi X (Nij / Ni*)
Where:
Mi is the total number of municipalities in stratum i (i=1, … , 6);
mi is the number of municipalities selected from stratum i;
Nij is the estimated number of households in municipality j in stratum i (j = 1, …, M i
Ni* is the estimated total number of households in stratum i.
(See the resulting porobabilities in document "BASIC INFORMATION DOCUMENT", Table 3)
Listing Operation
In each of the selected municipalities a full listing of households ("microcensus") was carried out. The work was carried out in a decentralized approach, wherein the FIS and the RSIS were responsible for carrying out the fieldwork under the general guidance of the BHAS. The municipalities cooperated by providing temporary office and storage space and recruitment of enumerators and controllers for the survey. The fieldwork was supervised by the staff of the two entity institutes, and these were trained in their respective institutes. This involved three phases:
Preparatory Phase: The tasks carried out during this phase included updating of maps with respect to street names, street numbers and buildings, defining the boundaries of the municipalities, and the enumeration areas within them. This was done by the geodesic institutes of the two entities. The next step was identifying enumerators, controllers and supervisors, training them and assigning them to specific areas. The other tasks during this phase were the printing of questionnaires and instructions, defining the codes to be used and informing the municipalities about their specific responsibilities. While the controllers were selected by municipalities, the supervisors were provided by the entity institutes.
Listing Phase: Enumerators were provided maps of their areas and the questionnaires and instruction manuals They collected information on the households in their assigned areas using a short questionnaire which gathered information on the identify of the head of household, address, and the number of members in the household by sex and age. If no one was home, the household was visited again to record the information. If, after three such visits, no one was home, the information was obtained from the neighbors. The controllers supervised the fieldwork, checked the filled-in schedules and completed a report form on the fieldwork. They also assisted the interviewers whenever there were difficulties. The supervisors of the entity institutes conducted spot checks and ensured completeness and accuracy of data collection and the transfer of all the filled schedules to the entity institutes.
Data-entry Phase: The data entry was performed at the entity institutes using a custom data entry system based on ACCESS software. Forty data entry operators (18 in the RS and 22 in the Federation) were selected and trained by the institute staff. The data entry was performed in two shifts and was supervised by two programmers of the entity institutes. The data were checked for logic and coding errors and tabulated to provide the essential information such as number of enumeration areas covered, number of households covered, number of members in the households by sex, number of refusals, number of households whose members were absent even after three visits etc. These tabulations were made by municipality and enumeration areas and formed the basis for the second stage sampling.
LSMS SAMPLE
Selection of EAs
The municipalities are divided into geographic areas called enumeration areas (EAs). In theory, each enumeration area consists of the number of households that can be interviewed in a census by an enumerator in one day. The EAs in BiH are based on the 1991 Census. But, at the time the Master Sample listing operation was carried out, many of the enumeration areas actually contained many fewer households (in some cases, zero). As enumeration areas were to be the primary sampling unit for the LSMS survey, the first step was to combine contiguous EAs until a new enumeration area with a minimum of 50 households was formed. These newly constructed EAs were called groups of enumeration areas (GNDs) and replaced the original small EAs. Thus the primary sampling units (PSUs) were actually a mix of the original EAs of sufficient size and the new constructed GNDs. For simplicity, the remaining discussion will use the term EA to refer to both. Based on the population figures from the Master Sample microcensus, 250 EAs were selected with PPS from the municipalities in the FBiH and, and 200 EAs were selected with PPS in the municipalities of the RS.
Detailed information on the calculation of the number of EAs to select in each municipality is available in document "BASIC INFORMATION DOCUMENT")
Selection of Households
Within each of the 450 selected EAs, 12 households were selected systematically. Detailed information on household selection and overall selection probabilities is available in document "BASIC INFORMATION DOCUMENT")
Overall, the response rate in the survey was 82 percent. For each enumeration area, four replacement households were selected prior to the field work. Using these replacement households as needed (a total of 938 households), the final sample size was 5,402 households interviewed.
To produce unbiased estimates for LSMS, each sample household should be weighted by the inverse of its selection probability. Detailed information is available in document "BASIC INFORMATION DOCUMENT (2001)".
An important point about the LSMS weights is that they have considerable variability.
The LSMS in Bosnia-Herzegovina is a multi-topic household survey covering a wide range of topics that affect welfare: housing, education, health, labor, migration, credit, vouchers, social assistance, consumption, agricultural and non-agricultural activities. The LSMS was designed to collect the information required for an assessment of living standards and to provide the key indicators required for social and economic planning. Inter alia, the LSMS in BiH was designed to measure welfare in both monetary and non-monetary terms. Detailed information was collected on household consumption (expenditures, home production, use value of housing and durables), on social assistance such as old age pensions, war veterans pensions, assistance received by orphans, widows, and on sources of income. Non-monetary measures include detailed information on housing, and access to, and the use of, public services such as education and health.
In addition to the household questionnaire, a price questionnaire was also administered to identify the variations in price levels of key food products in the different municipalities covered by the survey.
Household Questionnaire
The overall content of the household questionnaire and the individual questions included in it were designed to address the specific situation of the country and the data needs of policymakers. In addition, several sections of the questionnaire were based on draft questionnaires for future surveys (the HBS and the LFS) and/or older surveys and thus will be helpful in allowing some tracking of indicators over time. The process of designing the questionnaire was lengthy and involved an inter-institutional team from the three statistical organizations of the country—the Survey Management Team. Although efforts to create a formal data users’ group of line ministries and other users were not successful, several ministries did provide detailed comments and suggestions on the modules relevant to their ministries.
It is worth noting the importance of several of these in the BiH context. First, the migration module collected information on present status: given the dislocation of the population by the war and the legal ramifications of present status this module was considered to be of great importance. Second, a module on non-agricultural household businesses was used as the existing administrative data in the country cannot provide any information for assessing the prevalence or size of this sector. Third, in the health module questions pertaining to depression were added to determine how prevalent this ailment was given the post-conflict situation. Fourth, a module on anthropometric measurement of children was not included: a recent Multiple Indicators Cluster Survey (MICs) done by UNICEF had shown that malnutrition was negligible in the country.
The Price Questionnaire
A price questionnaire was administered in each group of enumeration areas covered by the survey. Three locations where food is sold (market, shop, etc.) were visited in each area and prices were collected for 39 commonly consumed food items. Limited information on the point of sale was also collected. It should be noted that a community questionnaire, usually standard in an LSMS survey to collect data on the presence of services and social infrastructure in the areas in which households selected for the survey are situated, was not done in the BiH LSMS.
Start | End |
---|---|
2001-09 | 2001-11 |
Pilot Survey
A draft questionnaire was prepared comprised of the following 11 modules: Roster, Housing, Education, Health, Labor, Credit, Voucher, Migration, Consumption, Non-Agricultural activities and Agricultural activities. This was piloted (tested) during the period June 25-July 20, 2001 in the two entities. For the Pilot survey 9 interviewers were selected in each entity and were trained in the concepts and methodology of the survey. Each interviewer was required to interview 12 households in specified areas. Both the areas and the field staff were selected by the entity institutes. Training for the Pilot Survey was carried out from June 18 to 22, 2001 by the Survey Management Team with participation of experts from UNDP, World Bank and DfID. The training covered the concepts and approaches used in the survey modules, question and answer sessions and practice sessions. Two data entry operators from each entity institute were also trained in the use of a specialized data entry software: CS-Pro.
The actual pilot survey was carried out over a four week period. In the first week, the interviewers visited their 12 households and administered the first 9 modules of the questionnaire: essentially the basic household data and the individual data sections. The Survey Management Team served as supervisors for the Pilot survey.
A workshop was then held in Laktasi (July 3-6, 2001) with all interviewers and members of the survey management team to review the experience and discuss any issues that had arisen. In parallel, data collected from the first week of interviews was entered into the data entry program so that this was also tested during the pilot survey. During the third week the interviewers returned to their 12 selected households and finalized the interview by completing modules 10-13. The final week was used for a second workshop (in Zenica, July 17-20, 2001) to discuss the final modules and field experiences.
Again, data from the third week interviews were entered and the resulting problems identified with the data entry phase discussed.
The main conclusions from the two workshops are summarized below:
The household questionnaires were revised incorporating the suggestions received in the Laktasi and Zenica workshops. Two additional modules were added- the End of First Round Module and Social Assistance Module. The End of First Round Module was intended to identify households where the agricultural and non-agricultural business modules needed to be administered in the second visit to the household. This section was designed to minimize any loss of information on household businesses and agricultural activities. The Social Assistance module was included to obtain information on the various social welfare benefits
received by individuals such as old age pension, family pension, disability pensions, etc. The health and labor modules were cut back substantially. The credit module, given the concern about responses was also cut back. The non-agricultural enterprise module was also reduced substantially. The refusal to provide information on non-registered businesses lowered the value of the module. It was felt that a reduced, less invasive module could elicit better responses, although it would not provide the detailed data required to analyze the sector.
The concern about the need for more training was taken into account and a three-week training course for the survey was developed. Finally, it was decided not to pay households for participating in the survey.
Fieldwork
Organization of Data Collection
The field work for the LSMS survey was carried out in the following manner. Mobile teams of interviewers were formed with three interviewers each plus one supervisor. A data entry operation with a computer was assigned to each pair of teams. The team was provided with a car and driver to ensure that time was not wasted in transportation.
Each interviewer was assigned, per month, two clusters of households. (Each cluster was 12 households in an enumeration area or group of enumeration areas). In week 1, the interviewer carried out the first half of the interview (modules 1-10) with the 12 households in Cluster A. In week 2, the interviewer carried out the first half of the interview with the 12 households in Cluster B. While the interviewer was working in Cluster B, all of the questionnaires from Cluster A were entered electronically by the data entry operator and lists of errors, inconsistencies and missing data were produced. In the third week, the interviewer returned to Cluster A to finish the interview (modules 11-13) with the 12 households and clarify with the households any problems found from the first visit and fill in any missing information.
While the interviewer was in Cluster A for the second time, the data from Cluster B were entered, and lists of errors created. In week four, the interviewer returned to Cluster B to finalize the interview and to make any necessary corrections.
Often, the interviewers visited each household more than two times. All information was collected from direct informants, except in the case of children under 15 whose parents were asked to provide the information. Otherwise, the interviewer carried out a series of interviews in the household, one for each member. In order to find and interview each member of the household, it was often necessary to return to the household multiple times. For this reason, the work load of 12 households in a two-week period was considered sufficient.
Recruitment and Training of Field Staff
Interviewers and supervisors were recruited through the entity Employment Bureaus. The responsibility for recruiting the field staff was vested with the entity institutes. The Institutes contacted the Employment Bureaus and obtained lists of unemployed persons who were on their roster and selected those with at least high school certificates and some prior work experience. The Survey Management Team was responsible for conducting the training for the field staff. Four training sessions running parallel were held in Zenica (FBiH) and Teslic (RS). Each training session had a mixture of interviewers from both entities to ensure that the implementation of the survey did not vary between entities. In each training session, the trainers also represented a mixture of staff from the three statistical organizations.
Training Course
Each training course was three weeks long and had a practical orientation. The morning sessions were usually devoted to discussing the individual modules, and in the afternoons the interviewers and supervisors completed the different modules by interviewing each otherone playing the role of interviewer and the other playing the role of respondent by turn. The completed questionnaires were then discussed and mistakes were pointed out and corrected. These completed questionnaires were later used for training data entry operators. In the Zenica courses, the interviewers also carried out 1-2 actual interviews with households. In Teslic, this was not feasible: instead interviewers carried out a full interview on another member of the training session. Two days of training were devoted to learning about the control procedures--four control forms were provided to monitor the flow of questionnaires from the time when they are given to the interviewers until they are received finally after data entry-as well as map reading and other administrative and control details. Following the training a test was conducted to determine each person's level of knowledge of the questionnaire and instructions. The candidates who performed best were selected as supervisors. Note that most of the supervisors were those people who had been interviewers during the pilot test.
Fieldwork
Each survey team was comprised of 3 interviewers, one supervisor and one driver with a car. Since the interviewers were recruited from the same municipalities where they were to work, they knew the area well. In addition they were provided with maps of the area assigned to them. The supervisors provided logistic support, and helped solve difficulties. The fieldwork started on 26 September 2001 and ended on 23 November 2001. The timing of the fieldwork was limited by the need to finalize all interviews before the start of Ramadan since household consumption patterns were expected to change during the fasting month. On average, interviewers took 1.5 hours per household to collect the data. Only in the case of households with over 5 members did the interview take longer. The interval between the two rounds benefited the survey in the following ways. First, it shortened the time spent in a household for a given visit, thus reducing the risk of respondent fatigue And, second, this structure allowed sufficient time for entering the data and listing the errors for field verification.
One data entry operator was provided for every two teams. The data were entered soon after the questionnaire was completed, and the customized data entry programme was used to produce a list of errors (missing data, inconsistencies and the like) in the data. This enabled the interviewers and supervisors to review each questionnaire, resolve any small difficulties and/or decide that the questionnaire needed to be sent back to the household for clarification.
The interviewing was conducted at the convenience of the respondents which meant interviews were conducted both during the day and during the evenings and throughout the week, including weekends. The supervisors were responsible for planning each day's work for their teams. He or she also planned the activity of the driver to ensure that the questionnaires are collected each day and delivered to the data entry operators, and, once entered and an error list produced, returned to the interviewers for correction. In many municipalities, temporary office accommodation was provided where the interviewers could meet and store the questionnaires. Where such accommodation was not available, the car served as a temporary office and in some cases the supervisor's home served as office.
Finally, a member of the Survey Management Team visited the different municipalities and conducted spot checks of the fieldwork throughout the interviewing period. Each entity provided badges and letters of introduction to the interviewers and supervisors. Communication with field staff was improved by recruiting interviewers and drivers who had cell-phones.
Data Entry
An integrated approach to data entry and fieldwork was adopted in Bosnia and Herzegovina. Data entry proceeded side by side with data gathering to ensure verification and correction in the field. Data entry stations were located in the regional offices of the entity institutes and were equipped with computers, modem and a dedicated telephone line. The completed questionnaires were delivered to these stations each day for data entry.
Twenty data entry operators (10 from Federation and 10 from RS) were trained in two training sessions held for a week each in Sarajevo and Banja Luka. The trainers were the staff of the two entity institutes who had undergone training in the CSPro software earlier and had participated in the workshops of the Pilot survey. Prior to the training, laptop computers were provided to the entity institutes, and the CSPro software was installed in them. The training for the data entry operators covered the following elements:
The data entry programme was fine-tuned during the training. Some unexpected responses during the interviews had to be accommodated and a few skip patterns fixed. The training emphasized the role of the data entry operator as a member of the survey team, and how the outputs of the programme (error lists) were to be provided to the supervisors and interviewers for necessary correction.
The goal was to produce high quality data. Several of the key features of this were:
The following checks were incorporated in the data entry software:
After the data entry was completed in the field, the data were transferred through email to the central offices in Sarajevo and Banja Luka with the help of PCAnywhere Software. The data entry programme was designed to detect many of the errors even at the stage of data entry, thereby minimizing the need for ex-post facto data editing. Once all data was compiled in the entity offices, a check was made to ensure the structural consistency of data files, i.e. that no records were duplicated or omitted.
When the RS and FBiH data files were merged it became apparent that a last minute decision on the treatment of decimal places in several modules had been different in the two entities. Thus the two data bases were not compatible. A correction was made and data from these modules were re-entered. Once this was done, the data sets were compatible and a countrywide data set was created. During this process some additional double entry was carried out to correct any data entry operator errors that had occurred.
Data Cleaning
It is important to note what is meant by ‘data cleaning’ in terms of the BiH-LSMS data set. In the sense that the data set is a faithful reflection of the responses of all interviewees the data set can be considered ‘cleaned’. Every effort was made to ensure that the information provided during the interviews was correctly entered in electronic format. As in any survey, this does not mean that the data set is perfect. As participation in the survey is voluntary, informants had the option to refuse to answer specific questions, and may have provided information that is not always consistent. The interviewers resolved as many inconsistencies as possible with the informants but there are, of course, limits.
However, given the widely differing needs of the range of analysts who will use the BiHLSMS data, nothing further has been done to the original data. While some data sets are processed so that all missing values are imputed, all outliers revalued and all inconsistencies fixed based on some set of assumptions, this has not been done here. The reason being that there is no correct way to resolve the problems of missing data, outliers and inconsistencies.
Each person will need to make his or her own decision on how to treat such data problems based on the type of analysis being carried out. For some analyses, the information in outlier values is key while for others, such outliers would distort findings and would need to be dropped or provided an imputed value. The same for missing values. Some analysts will chose to drop cases with missing values for the variables of interest to them while others will impute such values, using medians, mean or complex multi-variate techniques. In order to ensure the usefulness of the data set for all users, no attempt has been made to impute missing values, reconcile inconsistencies, re-value outliers, or in any way alter the responses provided by the respondents.
n receiving these data it is recognized that the data are supplied for use within your organization, and you agree to the following stipulations as conditions for the use of the data:
The data are supplied solely for the use described in this form and will not be made available to other organizations or individuals. Other organizations or individuals may request the data directly.
Three copies of all publications, conference papers, or other research reports based entirely or in part upon the requested data will be supplied to:
State Agency for Statistics of BiH
TRG Bosne i Hercegovine 1
71000 Sarajevo
Bosnia-Herzegovina
Email: bhas@bih.net.ba
http://www.bhas.ba
Statistical Institute of the Federation of BiH
Zelenih Beretki,26,
71000, Sarajevo
Bosnia-Herzegovina
Email: bhstat@bih.net.ba
http://www.fzs.ba
Republika Srpska Statistical Institute
Veljka Mlaðenovica bb
Banja Luka
78000 Banja Luka
Bosnia and Herzegovina
Email: rs_stat@inecco.net
http://www.rzs.rs.ba The World Bank
Development Economics Research Group
LSMS Database Administrator
MSN MC3-306
1818 H Street, NW
Washington, DC 20433, USA
tel: (202) 473-9041
fax: (202) 522-1153
e-mail: lsms@worldbank.org
The researcher will refer to the 2001 Bosnia and Herzegovina Living Standards Measurement Study Survey as the source of the information in all publications, conference papers, and manuscripts. At the same time the statistical institutions of Bosnia and Herzegovina are not responsable for the estimations reported by the analyst(s).
Users who download the data may not pass the data to third parties.
The database cannot be used for commercial ends, nor can it be sold.
The user of the data acknowledges that the original collector of the data, the authorized distributor of the data, and the relevant funding agency bear no responsibility for use of the data or for interpretations or inferences based upon such uses.
Name | Affiliation | URL | |
---|---|---|---|
LSMS Data Manager | The World Bank | lsms@worldbank.org | surveys.worldbank.org/lsms |
DDI_BIH_2001_LSMS_v01_M
2010-06-15
Version 0.1 (June 2010).
This site uses cookies to optimize functionality and give you the best possible experience. If you continue to navigate this website beyond this page, cookies will be placed on your browser. To learn more about cookies, click here.