The Tanzania Demographic and Health Survey (TDHS) is a national sample survey of women of reproductive ages (15-49) and men aged 15 to 60. The survey was designed to collect data on socioeconomic characteristics, marriage patterns, birth history, breastfeeding, use of contraception, immunisation of children, accessibility to health and family planning services, treatment of children during times of illness, and the nutritional status of children and their mothers.
The primary objectives of the TDHS were to:
- Collect data for the evaluation of family planning and health programmes,
- Determine the contraceptive prevalence rate, which will help in the design of future national family planning programmes, and
- Assess the demographic situation of the country.
Kind of data
Sample survey data
The Tanzania Demographic and Health Survey (TDHS) is a national sample survey. This sample should allow for separate analyses in urban and rural areas, and for estimation of contraceptive use in each of the 20 regions located on the mainland and in Zanzibar.
Unit of analysis
- Children under five years
- Women age 15-49
- Men age 15-60
Producers and sponsors
Bureau of Statistics
Macro International Inc.
Ministry of Health
Technical advice and logistical support
United Nations Population Fund -Tanzania
Advisory and logistic support
United Nations Children's Fund - Tanzania
Advisory and logistic support
United States Agency for Intemational Development
The principal objective of the Tanzania Demographic and Health Survey (TDHS) was to collect data on fertility, family planning, and health of the people. This survey involved randomly selected women aged 15-49 and men aged 15-60 in selected households.
Before the sampling frame was developed, two possibilities for the TDHS sample design were considered:
- The 1988 Population census list of Enumeration Areas (EAs)
- The National Master Sample for Tanzania created in 1986 (NMS).
The NMS was intended mainly for agricultural purposes and, at that time, only for rural areas. The NMS was based on the 1978 Census information while the urban frame was still being worked upon. Therefore, it was decided that the TDHS sample design would use the 1988 Census information as the basic sampling frame. Since the TDHS sample was to be clustered, it was necessary to have sampling units of manageable and fairly uniform size and with very well defined boundaries. The 1988 Census frame provided the list of enumeration area units (EAs) that had well defined boundaries and manageable uniform size. Therefore, EAs were used as primary sampling units (PSUs).
The target of the TDHS sample was about 7850 women age 15-49 with completed interviews. This sample should allow for separate analyses in urban and rural areas, and for estimation of contraceptive use in each of the 20 regions located on the mainland and in Zanzibar. Estimates for large domains (by combination of a group of regions) were also taken into consideration.
The TDHS used a three-stage sample. The frame was stratified by urban and rural areas. The primary sampling units in the TDHS survey were the wards/branches. The design involved the target of 350 completed interviews for each of 19 regions on the mainland and 500 in each of Dar es Salaam and Zanzibar.
In the first stage, the wards/branches were systematically selected with probability proportional to size (according to 1988 census information). In a second sampling stage, two EAs per selected rural ward/branch and one EA per selected urban ward/branch were chosen with probability proportional to size (also according to 1988 census information). In total, 357 EAs were selected for the TDHS, 95 in the urban area and 262 in the rural. A new listing of households was made shortly before the TDHS fieldwork by special teams including a total of 14 field workers. These teams visited the selected EAs all over the country to list the names of the heads of the households and obtain the population composition of each household (total number of persons in the household). In urban areas, the address of the dwelling was also recorded in order to make it easy to identify the household during the main survey. A fixed number of 30 households in each rural EA and 20 in each urban EA were selected.
About 9560 households were needed to achieve the required sample size, assuming 80 percent overall household completion rate.
See detailed sampling information in the APPENDIX B of the final 1991-1992 Tanzania Demographic and Health Survey report.
Out of the 9282 households selected for interview, 8561 households could be located and 8327 were actually interviewed. The shortfall between selected and interviewed households was largely due to the fact that many dwellings were either vacant or destroyed or no competent respondents were present at the time of the interview. A total of 9647 eligible women (i.e., women age 15-49 who spent the night before the interview in a sampled household) were identified for interview, and 9238 women were actually interviewed (96 percent response rate). The main reason for non-interview was absence from the home or incapacitation.
The Tanzania DHS male survey covered men aged between 15 and 60 years who were living in selected households (every fourth household of the female survey). The results of the survey show that 2392 eligible men were identified and 2114 men were interviewed (88 percent response rate). Men were generally not interviewed because they were either incapacitated or not at home during the time of the survey.
Dates of collection
Mode of data collection
The household, female, and male questionnaires were designed by following the Model Questionnaire "B" which is for low contraceptive prevalence countries. Some adaptations were made to suit the Tanzania situation, but the core questions were not changed. The original questionnaire was prepared in English and later translated into Kiswahili, the language that is widely spoken in the country. There are parts in the country where people are not very conversant with Kiswahili and would find it difficult to respond in Kiswahili but would understand when they are asked anything. The translated document was given to another translator to translate it back into English and comparisons were made to determine the differences.
A pretest to assess the viability of the survey instruments, particularly the questionnaires and the field organization, was carried out in Iringa Rural District, Iringa Region. It covered 16 enumeration areas with a total of 320 households. The pretest, which took a month to complete, was carded out in November/December, 1990, and covered both rural and urban EAs.
The pretest training took two weeks and consisted of classroom training and field practice in neighborhood areas. In all, 14 newly recruited interviewers and the Census staff were involved. The Census staffs who were to be transformed into the TDHS team handled the training for both the fieldwork management and the questionnaire. During the later fieldwork, they supervised the field exercise.
During the fieldwork, the administrative structure of the CCM Party, which involved the Party Branch Offices and the ten-cell leadership, were utilized in an effort to secure the maximum confidence and cooperation of the people in the areas where the team was working. At the end of the fieldwork, the interviewers and the supervisory team returned to the head office in Dares Salaam for debriefing and discussion of their field experiences, particularly those related to the questionnaires and the logistic problems that were encountered. All these experiences were used to improve upon the final version of the questionnaires and the overall logistic arrangements.
The data processing staff for the survey work consisted of four clerks and one supervisor who were staff of the Bureau of Statistics. They were given initial data processing training by the DHS staff. Four microcomputers were installed in the Census Office and were used exclusively to process the Tanzania DHS data.
Before the questionnaires were passed for data entry, they had to undergo office editing to check for internal consistency of responses recorded in the questionnaire and to see that the skip instructions had been followed, that there were no omissions, and that all entries were legible. Data entry began in November 1991 after the Census Office received questionnaires from the Dares Salaam/Coast team and was completed during the last week of May 1992. When all the data had been entered, a final edit was done. This was completed in July 1992 and the preliminary report was published in September 1992.
The results from sample surveys are affected by two types of errors, non-sampling error and sampling error. Non-sampling error is due to mistakes made in carrying out field activities, such as failure to locate and interview the correct household, errors in the way the questions are asked, misunderstanding on the part of either the interviewer or the respondent, and data entry errors. Although efforts were made to minimize this type of error during the design and implementation of the TDHS, non-sampling errors are impossible to avoid and difficult to evaluate statistically.
Sampling errors, on the other hand, can be measured statistically. The sample of women selected in the TDHS is only one of many samples that could have been selected from the same population, using the same design and expected size. Each one would have yielded results that differed somewhat from the actual sample selected. The sampling error is a measure of the variability between all possible samples; although it is not known exactly, it can be estimated from the survey results.
Sampling error is usually measured in terms of standard error of a particular statistic (mean, percentage, etc.), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which one can be reasonably assured that, apart from non-sampling errors, the true value of the variable for the whole population falls. For example, for any given statistic calculated from a sample survey, the value of that same statistic as measured in 95 percent of all possible samples with the same design (and expected size) will fall within a range of plus or minus two times the standard error of that statistic.
If the sample of women had been selected as a simple random sample, it would have been possible to use straightforward formulas for calculating sampling errors. However, the Tanzania DHS sample designs depended on stratification, stages, and clusters. Consequently, it was necessary to utilize more complex formulas. The computer package CLUSTERS, developed by the International Statistical Institute for the World Fertility Survey, was used to assist in computing the sampling errors with the proper statistical methodology.
Note: See detailed sampling error calculation in the APPENDIX C of the final 1991-1992 Tanzania Demographic and Health Survey report.
Other forms of data appraisal
The following Data Quality Tables are provided in the Final Report:
- Household age distribution
- Age distribution of eligible and interviewed women
- Completeness of reporting
- Birth by calendar year since birth
- Reporting of age at death in days
- Reporting of age at death in months
Use of the dataset must be acknowledged using a citation which would include:
- the Identification of the Primary Investigator
- the title of the survey (including country, acronym and year of implementation)
- the survey reference number
- the source and date of download
Disclaimer and copyrights
The user of the data acknowledges that the original collector of the data, the authorized distributor of the data, and the relevant funding agency bear no responsibility for use of the data or for interpretations or inferences based upon such uses.
Data and Data Related Resources
World Bank, Development Economics Data Group
Documentation of the study
Version 1.0: (June 2011) Variable level metadata is not provided.