Quarterly Labour Force Survey 2009, Second Quarter
Labor Force Survey [hh/lfs]
The Quarterly Labour Force Survey (QLFS) is a household-based sample survey conducted by Statistics South Africa (Stats SA). It collects data on the labour market activities of individuals aged 15 years or older who live in South Africa.
Kind of Data
Sample survey data [ssd]
Unit of Analysis
v3.0: Edited, anonymised dataset for public distribution
The current version of the QLFS data was downloaded from the Statistics South Africa (Stats SA) website in April 2014. Stats SA updated the QLFS results (2008-2013) to reflect the new population benchmarks from Census 2011. Although the weighting changes are not clearly documented by Stats SA, users are advised to remain aware of these slight calibration differences when employing weights. These updates are in addition to the following changes to the previous versions:
The previous version of the QLFS 2009 Q2 was downloaded from the Statistics South Africa (Stats SA) website by DataFirst in January 2012. This version differs in a number of ways from the version that was obtained by DataFirst (from Stats SA) at some undeteremined time prior. The first of these differences is the way in which observations that fit into "unspecified", "not applicable" or "missing" type categories are coded for certain variables. For example, in the older version of the QLFS 2009 Q2 the "Occup" variable is coded 888, with the associated label "Not applicable", for 67,653 observations. In the newer version this category of responses is assigned the code 0 and is not labelled (as it was in the previous version) for the same 67,653 observations. This recoding process has been applied to a large number of categorical variables in the datafile. A few other categorical variables have been recoded in a similar vein but as different (non-zero) values. For example, values of 888 for "Q4212TOTALHRS" have been redefined as having the value 88.
Second, a number of extra variables were introduced in the later version. It is unclear why these are not present in the older version of the datafile as they are detailed in metadata that was released at the same time as the original data:
1) "Geo_type" - Geography type (e.g. urban formal, rural informal, etc.)
2) "Hrswrk" - Hourse worked. A derived variable that was probably aimed at getting around problems created by the recoding of the hours worked variables used in the derivation of the underemployment variable
3) "Metro_code" - Metropolitan area code (e.g. Cape Town, eThekwini, Johannesburg, etc.)
4) "Status_Exp" - Expanded unemployment status.
5) "Stratum" - 6 digit number representing stratum formed during master sample 2006 where digit 1 represents province, based on 2005 provincial boundaries, digits 2-3 represent the metro/non-metro area and digit 4 confers geography type.
Finally, the two versions have different weights. To DataFirst's knowledge, the weighting changes are not clearly documented by Stats SA. The most likely explanation for the difference between the two sets of weights is that the newer version is calibrated to an updated set of mid-year population estimates. Users are advised to remain aware of these slight calibration differences when employing weights.
INDIVIDUALS: labour market activity, labour preferences, labour market history, demographic characteristics, marital status, employment status, education, grants, tax.
The QLFS sample covers the non-institutional population except for those in workers' hostels. However, persons living in private dwelling units within institutions are enumerated. For example, within a school compound, one would enumerate the schoolmaster's house and teachers' accommodation because these are private dwellings. Students living in a dormitory on the school compound would, however, be excluded.
Producers and sponsors
Statistics South Africa
The QLFS frame has been developed as a general purpose household survey frame that can be used by all other household surveys irrespective of the sample size requirement of the survey. The sample size for the QLFS is roughly 30 000 dwellings per quarter.
The sample is based on information collected during the 2001 Population Census conducted by Stats SA. In preparation for the 2001 Census, the country was divided into 80 787 enumeration areas (EAs). Stats SA's household-based surveys use a Master Sample of Primary Sampling Units (PSUs) which comprises of EAs that are drawn from across the country.
The sample is designed to be representative at the provincial level and within provinces at the metro/non-metro level. Within the metros, the sample is further distributed by geography type. The four geography types are: urban formal, urban informal, farms and tribal. This implies, for example, that within a metropolitan area the sample is representative at the different geography types that may exist within that metro.
The current sample size is 3 080 PSUs. It is divided equally into four sub-groups or panels called rotation groups. The rotation groups are designed in such a way that each of these groups has the same distribution pattern as that which is observed in the whole sample. They are numbered from one to four and these numbers also correspond to the quarters of the year in which the sample will be rotated for the particular group.
The sample for the QLFS is based on a stratified two-stage design with probability proportional to size (PPS) sampling of primary sampling units (PSUs) in the first stage, and sampling of dwelling units (DUs) with systematic sampling in the second stage.
The user of the data acknowledges that the original collector of the data, the authorized distributor of the data, and the relevant funding agency bear no responsibility for use of the data or for interpretations or inferences based upon such uses.
Copyright, Statistics South Africa
DDI Document ID
University of Cape Town
Date of Metadata Production
DDI Document version
Version 02 (April 2014) - Adapted version of the DDI "DDI-ZAF-DATAFIRST-QLFS-2009-Q2-V2" received from Data First.