This section provides detailed information on the unit-record data generated by this study. This includes a detailed description of the contents of each data file and all the variables within each file. Variables may contain the recorded results of a direct question asked, or be derived in some way. The number of variables and the number of data points (cases) are summarized for each variable.
It collects information from the household control form (hhroster) at individual level. Attention should be paid in using the w3_roster_all (and w3_ind_all) file, as it contains 168 duplicated records of individuals. Out of 168, 167 have moved away and 1 is "not present". The reason for this apparent anomaly is that migrants have been followed, whenever possible, in their new family, and they appear twice in the dataset, once in their old (classified as "not member", because moved) and again in their new household (considered as valid members, instead).
[Note: They have an equal m1_q01 personal identification code (pid), but a different m0_q00 household identification code (and, hence, chid). Depending on the users' objective, these duplicated records can be dropped so that national representativeness is guaranteed. A solution to leave them aside in the APS 2004 is to select individuals at their final location residence, through the variable cfinloc. The APS 2003 shares the same feature in w2_ind_all and w2_roster_all files: in that case the relevant filter variable is bfinloc.]
The file contains a sub-set of variables present in w3_ind_all and includes all enumerated household members
It is an individual-level file for all survey members including children under 15 years old, which collect information from sections: control form (hhroster), cover page (metadata), panelmembers, educationorig, educationnew, health, fertility, labor and migration. The file has been organized to have an individual record for each row; hence sometimes a reshaping of the original individual files was needed. It occurs because for some sections data were collected by item instead of by individual.
Wherever possible, variables have retained same name and label of wave 1 and 2. A detailed description of them is available in the documentation file “Codebook_w3_ind_all”.
Wave 1 N=7,475 longitudinal members including children aged under 15.
Wave 2 N= 7,438 panel members including children under 15 years, of which 7,160 OSM and 278 new members. 5,433 are valid members aged 15 or over and have compiled individual sections of questionnaire; 2,005 are children under 15 years old.
Wave 3 N=7,476 longitudinal individuals including children under 15 years, of which 7,212 already sampled in wave 1 or 2, and 264 new members. 5,499 are valid household members aged 15 or over and have provided individual information; 1,977 are under 15 years.
Note that in wave 2 and 3 the individual questionnaire did not collect data on children under 15 years. The questionnaire was filled in by all present household members aged 15 or over.
It collects information at household level, summing up data drawn from section cover page (metadata), dwelling, numdurables, agriculturea1, agriculturea2, agricultureb1, agricultureb2, filters, remittancesa, remittancesb, socialassistance, subjectivea, subjectiveb, household interview outcome where information has been asked the household head or the most knowledgeable person. As in the individual level case, sometimes a reshaping has been applied to the original files to insure that each record has been assigned to a single household.
Whenever possible, variables are assigned original files names (shown below). Their content is summarized in the documentation file “Codebook_w3_hh_all”.
Wave 1 dataset (w1_hh_all) includes interviewed and non-interviewed households. For longitudinal analysis, interviewed panel households only should be selected.
Wave 1 N=1,741 panel households (891 urban, 850 rural)
Wave 2 N=1,780 interviewed households. 900 of them are urban and 880 are rural families. The majority of the non-interviewed households were due to split-off moves out of the country (N=348). A further 23 households had moved out of scope within Albania, e.g. had moved into an institution (hospital, prison) or were non-contacts or refusals. Only 4 households had moved and could not be traced. 83 households had moved and were traced to their new address.
Wave 3 N=1,797 households. It was possible to classify 1,767 of them through the Enumeration Area (EA) they had in the 2002 LSMS, while 30 missing domain codes are due to the lack of information on urban/rural and EA residence. [Note: According to panel construction methodology, the sampling design does not select a predetermined number of households to be sampled by PSU (as in the LSMS 2002), rather original households are traced wherever they move within Albania: some are living in the same PSU as in 2003, other have moved to other PSUs.]
It contains data referred to sections: cover page (metadata), household interview outcome, dwelling, filters, social assistance, numdurables, subjectivea, subjectiveb. It is a sub-set of the w3_hh_all.
It collects information drawn from the following modules: cover page (metadata), household interview outcome, agriculturea1, agriculturea2, agricultureb1, agricultureb2. As w3_hh_basic, it is a sub-set of w3_hh_all.