The World Bank Working for a World Free of Poverty Microdata Library
  • Data Catalog
  • Collections
  • Citations
  • Terms of use
  • About
  • Login
    Login
    Home / Central Data Catalog / LSMS / TZA_2019_NPS-SDD_V06_M
lsms

National Panel Survey 2019-2020 - Extended Panel with Sex Disaggregated Data

Tanzania, 2019 - 2020
Get Microdata
Reference ID
TZA_2019_NPS-SDD_v06_M
DOI
https://doi.org/10.48529/0y7d-1v78
Producer(s)
World Bank
Collection(s)
Living Standards Measurement Study (LSMS)
Metadata
DDI/XML JSON
Created on
Apr 13, 2021
Last modified
Dec 14, 2022
Page views
259805
Downloads
9723
  • Study Description
  • Data Description
  • Documentation
  • Get Microdata
  • Identification
  • Version
  • Coverage
  • Producers and sponsors
  • Sampling
  • Survey instrument
  • Data collection
  • Data processing
  • Data appraisal
  • Depositor information
  • Data Access
  • Disclaimer and copyrights
  • Contacts
  • Metadata production
  • Citation
  • Identification

    Survey ID number

    TZA_2019_NPS-SDD_v06_M

    Title

    National Panel Survey 2019-2020 - Extended Panel with Sex Disaggregated Data

    Country/Economy
    Name Country code
    Tanzania TZA
    Study type

    Living Standards Measurement Study [hh/lsms]

    Series Information

    The National Panel Survey (NPS) was originally launched in Tanzania in 2008, with support from the Living Standards Measurement Survey – Integrated Surveys on Agriculture [LSMS-ISA ] program at the World Bank and other donors. Four rounds of the NPS have been implemented by the Tanzania National Bureau of Statistics (NBS). The first round of the survey was conducted in 2008/09, the second round in 2010/11, the third round in 2012/13 and the fourth round in 2014/15.

    Abstract
    The NPS 2019/20 with sex-disaggregated data (NPS-SDD 2019/20) is an off-shoot survey undertaken by following the entire NPS 2014/15 “Extended Panel” sample. The NPS-SDD 2019/20 is the first Extended Panel with sex-disaggregated data survey, collecting information on a wide range of topics including agricultural production, non-farm income generating activities, individual rights to plots, consumption expenditures, and a wealth of other socioeconomic characteristics.
    Kind of Data

    Sample survey data [ssd]

    Unit of Analysis

    Households; Individuals;

    Version

    Version Description

    Version 06

    Version Date

    2022-12-14

    Version Notes

    This version includes the addition of AG_FILTERS.dta

    Coverage

    Geographic Coverage

    Designed for analysis of key indicators at the national level.

    Universe

    The universe includes all households and individuals in Tanzania with the exception of those residing in military barracks or other institutions.

    Producers and sponsors

    Primary investigators
    Name
    World Bank
    Producers
    Name Affiliation Role
    Talip Kilic World Bank Primary Investigator
    Living Standards Measurement Study Team World Bank

    Sampling

    Sampling Procedure

    The sample design for the NPS-SDD 2019/20 targeted the sub-sample of households from the initial NPS cohort originating in 2008/09 and subsequently surveyed in all four consecutive rounds, considered the “Extended Panel”. This consisted of 989 households from the NPS 2014/15 sample to be tracked and interviewed in the NPS-SDD 2019/20.

    It is worth mentioning that the sample design included complete households that could not be interviewed in NPS 2014/15, excluding those households that had refused to be interviewed in NPS 2014/15. This constituted an additional 8 households. Individuals meeting the eligibility requirement that were interviewed as part of the NPS 2012/13, but were not located and interviewed during the NPS 2014/15, were also included in this round if located. Additionally, individuals from NPS 2014/15 who moved into another This constituted an additional 158 individuals assigned to their last known associated household.

    The eligibility requirement for inclusion in the NPS is defined as any household member aged 15 years and above, excluding live-in servants. Households with at least one eligible member were completely interviewed, including any non-eligible members present in the household. Any household or eligible members that had either moved or split away from a primary household were tracked and interviewed in their new location.

    Additionally, the final sample for NPS-SDD 2019/20 included any resulting split-off households identified during data collection (i.e. a previous NPS member who had moved or started another household). Ultimately, the final sample size for NPS-SDD 2019/20 was 5,587 individuals in 1,184 households.

    Response Rate

    As with most panel surveys a certain portion of panel respondents are not able to be re-interviewed over time. This attrition of panel respondents can lead to attrition bias where respondents drop out of the survey non-randomly and where the attrition is correlated with variables of interest. The Tanzania NPS has fortunately maintained low attrition over the rounds, thus minimizing the potential for attrition bias within the datasets.

    By the end of data collection, 974 of the 989 households had been located and 908 households were successfully re-interviewed for a total household attrition rate of 9.2 percent. At the individual level, 2,621 of the 3,188 eligible household members (over the age of 15 years and not a household servant) were successfully re-interviewed during the NPS-SDD 2019/20, equating to an individual attrition rate of roughly 17.7 percent between the NPS 2014/15 and the NPS-SDD 2019/20 (for extended panel households).

    Weighting

    In order to produce nationally representative statistics with the NPS data, it is necessary to apply weighting or expansion factors. The panel survey weights adjust for differences in the probability of selection into the NPS 2008/09 sample for observations in various strata, 2008/09 households splitting into multiple households in NPS 2010/11 and NPS 2012/13, splitting even further in NPS 2014/15, and attrition between rounds of the survey.

    The first round of the NPS sample was a multi-stage clustered sample design. First stage sampling involved the selection of survey clusters with the probability of selection proportional to cluster size within a stratum. The sampling of these clusters was stratified along two dimensions: (i) eight administrative zones (seven on Mainland Tanzania plus Zanzibar as an eighth zone), and (ii) rural versus urban clusters within each administrative zone. The combination of these two dimensions yields 16 strata. In rural areas a cluster is defined as an entire village. In urban areas, a cluster is defined as a census enumeration area. As a general rule, the probability of selection was higher for clusters within strata where existing data sources showed that the variance of key variables of interest for the NPS (e.g., household consumption and maize production) were likely to be very high – implying the need for more observations to produce reliable estimates.

    The methodology used to calculate the panel weights for the extended panel households in NPS 2019/20 was developed as part of the LSMS-ISA work program. Details on the methodology can be found in the paper: Himelein, Kristen. 2013. “Weight Calculations for Panel Surveys with Subsampling and Split-off Tracking.” Statistics and Public Policy, vol (1), pp40-45

    Survey instrument

    Questionnaires

    The NPS-SDD 2019/20 consists of four survey instruments: a Household Questionnaire, Agriculture Questionnaire, Livestock Questionnaire, and a Community Questionnaire.

    The Household Questionnaire is comprised of thematic sections. This questionnaire allows for the construction of a full consumption-based welfare measure, permitting distributional and incidence analysis. Data within the household instrument is structured around a household panel survey, and will add additional living standards measure in the form of sex-disaggregated data, this additional level of information will add value in the analysis of intra-household dynamics and revealing a more refined picture of welfare of Tanzania. To protect the confidentiality of respondents, sensitive information has been masked in or removed from the public household data files.

    The NPS Extended Panel also includes a robust instrument on household agriculture activities. It offers an essential data source to understand the dynamic role of agriculture to household welfare. Agriculture information is collected at both the plot and crop level on inputs, production and sales, consistent with key phases in the agricultural value chain.
    The NPS Extended Panel likewise recognizes the importance of livestock activities to many households. As with the integrated instrument on agriculture, the NPS contains a robust instrument to capture details on these activities. The Livestock Questionnaire is administered to all households participating in these activities and asks about the inputs, outputs, labour, and sales related to these activities. Table 3 provides a more comprehensive list of the sections found within the Livestock Questionnaire.

    The Community Questionnaire collects information on physical and economic infrastructure and events in surveyed communities . Responses to the community questionnaire are provided through a group discussion among key informants within the community.

    Each of the NPS questionnaires were developed in collaboration with line ministries and donor partners, including the Technical Committee, over a period of several months. The NBS solicited feedback from various stakeholders in regards to survey content and design paying due consideration to comparability with previous panel rounds.

    Data collection

    Dates of Data Collection
    Start End
    2019-01-01 2020-01-01
    Mode of data collection
    • Computer Assisted Personal Interview [capi]
    Data Collectors
    Name
    Tanzania National Bureau of Statistics
    Office of the Chief Government Statistician Zanzibar

    Data processing

    Data Editing

    Additional data cleaning was conducted as the final stage of the data processing. Further adjustment of the data post-entry was conducted under the principle of absolute certainty where adjustments must be evidence-based and correction values true beyond a reasonable doubt. As such, the resulting final data files may still contain some inconsistencies and outliers. Handling of these values is thus left entirely to the data user. Throughout the data processing system, versions of the data are archived at all key steps and all checking and cleaning syntax documented and archived.

    Data appraisal

    Estimates of Sampling Error

    The sample of households selected in the NPS-SDD 2019/2020 is only one of many samples that could have been selected from the same population. Each alternative sample would yield slightly different from the results of the selected sample. Sampling errors are a measure of the variability between all possible samples and although the degree of variability cannot be directly observed, it can be estimated from the survey results and statistically evaluated. A sampling error can be measured in terms of the standard error for a particular statistic. The computer software program STATA used estat effects to calculate sampling errors for the NPS-SDD 2019/2020. In addition to the standard error, STATA computed the design effect (DEFF) for each estimate, which is defined as the ratio between the standard error using the given sample design and the standard error that would result if a simple random sample had been used. A DEFF value of 1.0 indicates that the sample design is as efficient as a simple random sample, while a value greater than 1.0 indicates the increase in the sampling error is due to the use of a more complex and less statistically efficient (but perhaps more logistically efficient) design. STATA also computed the relative error and confidence limits for the estimates. Sampling errors for the NPS-SDD 2019/2020 are calculated for selected variables considered to be of primary interest at the household and individual levels. For each variable of interest, the value of the statistic (R), its standard error (SE), the number of cases, the design effect (DEFF), the relative standard error (SE/R), and the 95 percent confidence limits (R±2SE) are provided in Tables 1-10 in the BID. The DEFF is considered undefined when the standard error in a simple random sample is zero (when the estimate is close to 0 or 1).

    Depositor information

    Depositor
    Name Affiliation
    Data Production and Methods World Bank

    Data Access

    Access authority
    Name Affiliation URL
    Microdata Library World Bank microdata.worldbank.org
    Confidentiality
    Is signing of a confidentiality declaration required? Confidentiality declaration text
    yes The Primary Data Investigator undertakes that no attempt will be made to identify any individual person, family, business, enterprise or organization. If such a unique disclosure is made inadvertently, no use will be made of the identity of any person or establishment discovered and full details will be reported to the NBS. The identification will not be revealed to any other person not included in the Data Access Agreement.
    Access conditions

    The dataset has been anonymized and is available as a Public Use Dataset. It is accessible to all for statistical and research purposes only, under the following terms and conditions:

    1. The data and other materials will not be redistributed or sold to other individuals, institutions, or organizations without the written agreement of the National Bureau of Statistics, Tanzania.
    2. The data will be used for statistical and scientific research purposes only. They will be used solely for reporting of aggregated information, and not for investigation of specific individuals or organizations.
    3. No attempt will be made to re-identify respondents, and no use will be made of the identity of any person or establishment discovered inadvertently. Any such discovery would immediately be reported to the National Bureau of Statistics.
    4. No attempt will be made to produce links among datasets provided by the NBS, or among data from the National Bureau of Statistics and other datasets that could identify individuals or organizations.
    5. Any books, articles, conference papers, theses, dissertations, reports, or other publications that employ data obtained from the National Bureau of Statistics will cite the source of data in accordance with the Citation Requirement provided with each dataset.
    6. An electronic copy of all reports and publications based on the requested data will be sent to the National Bureau of Statistics The original collector of the data, the National Bureau of Statistics, and the relevant funding agencies bear no responsibility for use of the data or for interpretations or inferences based upon such uses.
    Citation requirements

    Use of the dataset must be acknowledged using a citation which would include:

    • the Identification of the Primary Investigator
    • the title of the survey (including country, acronym and year of implementation)
    • the survey reference number
    • the source and date of download

    Example:
    World Bank. Tanzania National Panel Survey 2019-2020 - Extended Panel with Sex Disaggregated Data (NPS 2019-2020). Ref: TZA_2019_NPS-SDD_v06_M. Downloaded from [uri] on [date]

    Disclaimer and copyrights

    Disclaimer

    The user of the data acknowledges that the original collector of the data, the authorized distributor of the data, and the relevant funding agency bear no responsibility for use of the data or for interpretations or inferences based upon such uses.

    Copyright

    World Bank 2021

    Contacts

    Contacts
    Name Affiliation Email
    Jonathan G Kastelic World Bank jkastelic@worldbank.org
    Talip Kilic World Bank tkilic@worldbank.org
    Heather Moylan World Bank hmoylan@worldbank.org
    LSMS Data Manager World Bank lsms@worldbank.org

    Metadata production

    DDI Document ID

    DDI_TZA_2019_NPS-SDD_v06_M_WB

    Producers
    Name Affiliation Role
    Development Data Group World Bank Documentation of the study
    Date of Metadata Production

    2022-12-14

    Metadata version

    DDI Document version

    Version 02 (July 2021): Identical to version 01, with an additional panel key data.
    Version 03 (September 2021): Consumption aggregates data added.
    Version 04 (May 2022): AG_SEC_3B_time and AG_SEC_3A_time data added.
    Version 05 (July 2022): nps_sdd.child.anthro.dta data and updated version of the BID added.
    Version 06 (Dec 2022): Addition of AG_FILTERS.dta, updates to the Basic Information Document ( BID) - revised to include description of added dataset, and addition of the Agriculture questionnaire

    Citation

    Citation
    loading, please wait...
    Citation format
    Export citation: RIS | BibTeX | Plain text
    Back to Catalog
    The World Bank Working for a World Free of Poverty
    • IBRD IDA IFC MIGA ICSID

    © The World Bank Group, All Rights Reserved.

    This site uses cookies to optimize functionality and give you the best possible experience. If you continue to navigate this website beyond this page, cookies will be placed on your browser. To learn more about cookies, click here.