As part of its strategic goal of building a climate for investment, job creation, and sustainable growth, the World Bank has promoted improving business environments as a key strategy for development, which has led to a systematic effort in collecting enterprise data across countries. The Enterprise Surveys are an ongoing World Bank project in collecting both objective data based on firms' experiences and enterprises' perception of the environment in which they operate.
The Enterprise Survey is a firm-level survey of a representative sample of an economy's private sector. Firm-level surveys have been conducted since 1998 by different units within the World Bank. Since 2005-2006, most data collection efforts have been centralized within the Enterprise Analysis Unit. The Enterprise Surveys are conducted across all geographic regions and cover small, medium, and large companies. The surveys are administered to a representative sample of firms in the non-agricultural formal private economy. Data are used to create indicators that benchmark the quality of the business and investment climate across countries.
Tanzania Enterprise Skills Survey was conducted between April and August 2015 by the Enterprise Analysis Unit (DECEA) and the Education Global Practice (GEDDR) of the World Bank Group.
The objective of the survey is to develop and test a methodological approach for a diagnostic of the composition and demand for skills and the relationship between skills (and/or skills constraints) and firm performance of selected economic sectors in Tanzania. A detailed skills module was developed as part of a larger firm-level Enterprise Survey, collecting information, among others, on the characteristics of firms and their owners, innovation and export activities, and firm performance.
The sample for the survey was selected using stratified random sampling, following a broadly similar methodology used in the World Bank's Enterprise Surveys.
Kind of Data
Sample survey data [ssd]
Unit of Analysis
The primary sampling unit of the study is an establishment. The establishment is a physical location where business is carried out and where industrial operations take place or services are provided. A firm may be composed of one or more establishments. For example, a brewery may have several bottling plants and several establishments for distribution. For the purposes of this survey an establishment must make its own financial decisions and have its own financial statements separate from those of the firm. An establishment must also have its own management and control over its payroll.
v01, edited, anonymous dataset for public distribution
All variables are named using, first, the letter of each section and, second, the number of the variable within the section, i.e. a1 denotes section A, question 1 (some exceptions apply). All variables are numeric with the exception of those variables with an "x" at the end of their names. The suffix "x" denotes that the variable is alpha-numeric.
There is a unique establishment identifiers, variable name id. The variables a2 (sampling region), a6a (sampling establishment's size), and a4a (sampling sector) contain the establishment's classification into the strata chosen for each country using information from the sample frame.
Note that the fiscal years vary by firm as there is no standard for all firms in Tanzania. The start and end dates for the fiscal year for each firm can be found in the fymonb, fyyearb, fymone and fyyeare variables in the dataset.
The scope of the study includes:
- characteristics of establishment;
- sales and markets;
- skills and labor;
- vacancies and hiring;
- skills and training;
- outsourcing services;
- innovation and technology use;
Regions covered are selected based on the number of establishments, contribution to employment, and value added. In most cases these regions are metropolitan areas and reflect the largest centers of economic activity in a country.
Firms in eight selected economic activities: food processing (ISIC15), textile and garments (ISIC 17 & 18), fabricated metal products (ISIC 28), furniture (ISIC 36), construction (ISIC 45), hotel and restaurant (ISIC 55), transport (ISIC 60 & 61) and Information technology (ISIC 72)).
Producers and sponsors
The sample was selected using stratified random sampling.
Three levels of stratification were used for this survey: economic activity, establishment size, and region:
1) Eight economic activities - food processing (ISIC15), textile and garments (ISIC 17 & 18), fabricated metal products (ISIC 28), furniture (ISIC 36), construction (ISIC 45), hotel and restaurant (ISIC 55), transport (ISIC 60 & 61) and information technology (ISIC 72);
2) Three sizes - small (5 to 19 employees), medium (20 to 99 employees), and large (more than 99 employees);
3) Five regions (city and the surrounding business area): Arusha, Dar es Salaam, Mbeya, Mwanza, and Zanzibar.
Two sample frames were used:
1) The first frame was the 2011/2012 Central Registry of Establishment (CRE) of the National Bureau of Statistics (NBS);
2) The second frame was 2012 Central Registry of Establishment (CRE) of the Office of Chief Government Statistician (OCGS). The sample frame was used for the establishments in Zanzibar.
The enumerated establishments with 5 employees or more were then used as the sample frame for the 2015 Tanzania Skills Survey with the aim of obtaining interviews of 390 establishments.
Survey non-response must be differentiated from item non-response. The former refers to refusals to participate in the survey altogether whereas the latter refers to the refusals to answer some specific questions.
Item non-response was addressed by re-contacting firms. That is, establishments with incomplete information were re-contacted in order to complete this information, whenever necessary. However, there were clear cases of low response. The response rates are particularly low for questions about the names and locations of the main universities and schools attended by the establishment's recent hires (questions l13, l14, l15 and l25 in the questionnaire). Despite repeated callbacks, respondents note that they just do not know the names and locations of schools attended by their employees.
Survey non-response was addressed by maximizing efforts to contact establishments that were initially selected for interview. Attempts were made to contact the establishment for interview at different times/days of the week before a replacement establishment (with similar strata characteristics) was suggested for interview. Survey non-response did occur but substitutions were made in order to potentially achieve strata-specific goals.
The number of realized interviews per contact contacted establishments was 0.28. This number is the result of two factors: explicit refusals to participate in the survey, as reflected by the rate of rejection (which includes rejections of the screener and the main survey) and the quality of the sample frame, as represented by the presence of ineligible units. The number of rejections per contact was 0.07.
For some units it was impossible to determine eligibility because the contact was not successfully completed. Consequently, different assumptions as to their eligibility result in different universe cells' adjustments and in different sampling weights. Three sets of assumptions were considered:
a- Strict assumption: eligible establishments are only those for which it was possible to directly determine eligibility.
b- Median assumption: eligible establishments are those for which it was possible to directly determine eligibility and those that rejected the screener questionnaire or an answering machine or fax was the only response. Median weights are used for computing indicators on the www.enterprisesurveys.org website.
c- Weak assumption: in addition to the establishments included in points a and b, all establishments for which it was not possible to finalize a contact are assumed eligible. This includes establishments with dead or out of service phone lines, establishments that never answered the phone, and establishments with incorrect addresses for which it was impossible to find a new address. Note that under the weak assumption only observed non-eligible units are excluded from universe projections.
Dates of Data Collection
Data Collection Mode
Computer Assisted Personal Interview [capi]
Data Collection Notes
The Enterprise Surveys are usually implemented following a two-stage procedure. In the first stage, a screener questionnaire is applied over the phone to determine eligibility and to make appointments; in the second stage, a face-to-face interview takes place with the manager, owner or director of each establishment. In some cases, when the phone numbers were unavailable in the sample frame, the enumerators applied the screeners in person.
Computer-assisted personal interviewing (CAPI) method with TiKiTi mobile research app was used to collect the data.
DataVision International Ltd.
The data was collected using a standardized questionnaire administered to all firms. The questionnaire has eight sections: six main sections and two sections on control information.
Data entry and quality controls are implemented by the contractor and data is delivered to the World Bank in batches (typically 10%, 50% and 100%). These data deliveries are checked for logical consistency, out of range values, skip patterns, and duplicate entries. Problems are flagged by the World Bank and corrected by the implementing contractor through data checks, callbacks, and revisiting establishments.
Enterprise Analysis Unit
Confidentiality of the survey respondents and the sensitive information they provide is necessary to ensure the greatest degree of survey participation, integrity and confidence in the quality of the data. Surveys are usually carried out in cooperation with business organizations and government agencies promoting job creation and economic growth, but confidentiality is never compromised.
The use of this dataset must be acknowledged using a citation which would include:
- the identification of the Primary Investigator (including country name)
- the full title of the survey and its acronym (when available), and the year(s) of implementation
- the survey reference number
- the source and date of download (for datasets disseminated online)
The World Bank. Tanzania Enterprise Skills Survey (ESS) 2015, Ref. TZA_2015_ESS_v01_M. Dataset downloaded from [URL] on [date].
The user of the data acknowledges that the original collector of the data, the authorized distributor of the data, and the relevant funding agency bear no responsibility for use of the data or for interpretations or inferences based upon such uses.
DDI Document ID
Development Data Group
Documentation of the study
Date of Metadata Production
DDI Document version
v01 (April 2016)