Random Route and Quota Sampling : Do They Offer Any Advantage over Probably Sampling Methods ?

The aim of this paper is to compare sample quality across two probability samples and one that uses probabilistic cluster sampling combined with random route and quota sampling within the selected clusters in order to define the ultimate survey units. All of them use the face-to-face interview as the survey procedure. The hypothesis to be tested is that it is possible to achieve the same degree of representativeness using a combination of random route sampling and quota sampling (with substitution) as it can be achieved by means of household sampling (without substitution) based on the municipal register of inhabitants. We have found such marked differences in the age and gender distribution of the probability sampling, where the deviations exceed 6%. A different picture emerges when it comes to comparing the employment variables, where the quota sampling overestimates the economic activity rate (2.5%) and the unemployment rate (8%) and underestimates the employment rate (3.46%).


Introduction
Since the earliest applications of quota sampling in the early twentieth century, there has been a wealth of references to its unsuitability for some purposes, such as to obtain population-representative samples [1]- [6].Most of the criticism focuses on its non-probability nature (which precludes the possibility of calculating sampling error), and the heavy influence of the interviewer in the choice of ultimate respondent.
Nevertheless, non-probability sampling methods remain among those commonly used by the majority of private opinion poll and market research companies [7] [8]; among others, whose predominant approach is quota sampling within households previously selected by the random methods [9].Without denying the issues raised, the research sector has responded to the above mentioned criticism, which emanates mainly from academics and statisticians [3] [10], by claiming that these samples "work", and, in fact, sometimes deliver better results than those obtained via strictly random sampling methods [5] [8].
The aim of this paper is to compare sample quality and response to the survey across two probability samples based on the municipal register of inhabitants and one that uses probabilistic cluster sampling combined with random route and quota sampling within the selected clusters in order to define the ultimate survey units.All of them use the face-to-face interview as the survey procedure.The hypothesis to be tested is that it is possible to achieve the same degree of representativeness using a combination of random route sampling and quota sampling (with substitution) as it can be achieved by means of household sampling (without substitution) based on the municipal register of inhabitants.According to Bauer [9] "there are few studies focus on to assess the quality of random route samples", and this one adds the use of selection with quota methods inside home.
The paper is organized into 5 sections.The introduction of the research topic is followed by the presentation of the data sources, with a brief indication of their differences and similarities.The three surveys are then compared, taking into account the distribution by age, sex, education and four employment variables (economic activity ratio, rate of unemployment, rate of employment and employment status).The discussion and conclusion sections are followed by the references used.

Data Sources
The two probability samples considered correspond to the European Social Survey (ESS) round 5, conducted in 2011, and the ISSP II Environmental Survey, which was carried out in 2010 by the Centro de Investigaciones Sociológicas-CIS, that is, the Spanish Sociological Research Centre, under an agreement signed with the ISSP (CIS survey number 2837).These two are compared with the sample used for the 2010 Health Barometer (henceforth HB) (CIS survey number 2850) where households are randomly drawn from census districts and respondents are selected within households using gender and age quotas.The next section presents the methodological details of these three samples.

European Social Survey (ESS)
The target universe is the population aged 15 years and over and resident in the main home in the whole of Spain (including Ceuta and Melilla), irrespective of nationality [11] [12].The sampling frame is the municipal register.The universe is stratified by 17 autonomous regions and four municipal size categories: fewer than 10.001 inhabitants, 10.001 to 50.000 inhabitants, 50.001 to 100.00 inhabitants, and over 100.000 inhabitants [13].
Since 2004 (the year of the second round) this survey has used two-stage cluster sampling from census districts with probability proportional to the size of the universe (resident population over the age of 15), taking 6 or 7 individuals from each district; 6 from smaller municipalities (<50,001 residents) and 7 from larger municipalities (>50,000 residents) [11].Respondents are drawn from lists taken from the May 2010 Municipal Register of inhabitants and supplied by the National Institute of Statistics (INE).The fieldwork was undertaken between April 11 and July 31, 2011 by 66 interviewers and 22 provincial coordinators.
The non-substitutability of respondents [11] who refused to participate required up to seven household visits following non-contact (at least two of these to be made during evening or weekend hours), refusal-conversion techniques, and other resources in order to ensure the participation of more reluctant sample members (Cuxart & Riba, 2009).Selected respondents were sent two letters of presentation prior to the interviewer's visit (three after a refusal), promised payment of 12 Euros, and provided with a leaflet showing the results of previous rounds of the survey.With this range of resources, 1885 respondents were successfully recruited from an original sample of 2865 interviews, which represents a participation rate of 68.5% after the removal of 114 non-eligible respondents [15].

ISSP (II) Environment Survey, CIS, Survey Number 2837
The universe for these surveys is all Spanish residents, of either sex, aged 18 and over, based on the 2009 municipal register sampling framework which has the same stratification as the European Social Survey, except that it has 7 municipal size categories: <2001 inhabitants; 2001 -10,000; 10,001 -50,000; 50,001 -100,000; 100,001 -400,000; 400,001 -1,000,000, and >1,000,000 inhabitants.As in the ESS, it uses two-stage cluster sampling drawn from municipal registers with probability proportional to size, followed by a systematic selection of individuals from households in each section.
This survey is based on face-to-face interviews conducted in participants' homes over the period May 13 to July 24, 2010.Of 4000 sample persons, 2560 were achieved, giving a participation rate of 64.5% after excluding 31 non-eligible subjects [16].For a confidence level of 95.5% (two sigmas), and P = Q, the sampling error is ±1.98% for the sample as a whole.As in the ESS, refusals that persist after four visits to the household are not replaced.
Despite the similarity of the participation rates (66% for the ESS and 64% the Environment ISSP), the use of fewer techniques to ensure the participation of the more reluctant sample members in the second of these surveys explains the marked differential in number of non-contacts: 84 and 811 (3% and 20% of all attempted contacts), respectively.

Health Barometer 2010, CIS Survey 2850
The universe for the Health Barometer is the resident population aged 18 and over, stratified by autonomous regions, and 7 municipal size categories (<2001 inhabitants; 2001 -10,000; 10,001 -50,000; 50,001 -100,000; 100,001 -400,000; 400,001 -1,000,000, and >1,000,000 inhabitants).Three nationally-representative subsamples of 2600 respondents were taken and interviewed in March, June, and the period October to November, using multi-staged cluster sampling, where the primary units (municipalities), and secondary units (census districts) were selected with probability proportional to size (numbers of residents aged 18 and over) [17].The aggregate sample of 7750 questionnaires has a sampling error of ±1.13% for the sample as a whole, assuming simple random sampling, for a confidence level of 95.5% (two sigmas), and P = Q [18].
Households were randomly drawn from every other apartment building in each census district, using gender and age quotas for the selection of the ultimate respondent (household member) [17].When the household contained more than one eligible interviewee, the youngest of these was selected for interview [19].Whenever the interviewer was unable to conduct the interview in the appointed place (non-contact, refusal, etc.), the sample unit is replaced according to the standards laid out in the document General standards for correct sampling, which states that, "when an interview is not possible at the first contact, the interviewer can try next door" [19], that is, in the first household of the following segment (group of six households).In the case of apartment buildings, where no interview can be achieved, this address is replaced with the one next to it [19].These procedural methods explain why it was necessary to contact 161,395 households in order to achieve 7750 interviews, which represents an average of 20.8 contacts per interview.
Summing up, let us note that all three surveys stratify by autonomous region and municipal size, the only difference being that the ESS uses fewer strata (see Table 1).The data are collected by means of face to face interviews, using multi-stage (two-stage in the probability samplings, and multi-stage in the case of the HB) cluster samples drawn from municipal register data stratified by municipal size.It is from this point onwards that the differences between the three surveys begin.The HB uses a combined route/quota sampling procedure, whereas the ESS and the ISSP take random samples from lists of individuals.In addition, the ESS uses a wide range of resources to encourage participation (letters of presentation, re-contacts, refusal conversion techniques, productivity-linked remuneration of interviewers, etc.), and the ISSP uses a letter of presentation, and four re-contacts.
Another key difference is that the HB uses non-contact replacement, while the other two do not (see Table 1).

Distribution Comparison
Representativeness will be examined by comparing the distribution obtained by each survey against the reference population [17].This comparison can only be conducted for those variables about which there is available information for the population as a whole; namely, the separate gender and age distributions, and the joint age and gender distribution.The data in question are those available from the annual municipal register.Source: [13] [16] [18].
It would be interesting to extend the comparison to other highly relevant variables, such as educational attainment, and the economic activity ratio, about which there is no current data for the survey universe.There is, however, a source of data that is highly representative of the Spanish population.This is the Economically Active Population Survey (hence forth EAPS), which gathers and publishes statistics that are essential for an understanding of the human side of the nation's economic activity, with the primary objective of assessing the degree of economic activity and other related issues among the population [20].It is the largest household survey in terms of sample size, workforce size, and cost.
As a result of the large size of the sample used for the EAPS, and in view of its particular design characteristics, it can be considered a source of data that are very similar to real Spanish population data.In addition, the EAPS also provides data on the economic activity ratio, the majority gender differentiated.For a discussion of the comparability of the two surveys, see Díaz de Rada and Núñez Villuendas [21].This enables the comparison to include four of the most relevant socio-political research variables: age by gender, educational attainment by gender, and employment rate by gender.
Our analysis looks at the gender distributions rather than the marginal distributions of the variables of interest.The use of jointed distributions will enable us to detect differences in some variable frequencies that would go undetected if we were to use marginal distributions, where there is a possibility of cross-segment compensation.(In the first part of Table 2, for example, if we were working with the marginal age distribution, the 55 to 64 age bracket would show a 0.45% negative deviation from the municipal register data; while the gender breakdown shows males to be under-represented by 0.67% and females to be over-represented by 0.21%.This enables us to attribute the observed deviation in that age bracket to males).These variables are examined separately in the following sections.

Age and Gender Comparisons
Table 2 shows a comparison between the age distribution data from each survey and the sampling frame.Since the ESS used the 2010 municipal register and the fieldwork was conducted in 2011, the displayed age distribution is for the population in 2011.The discrepancy between the number of cases shown in Table 2 and the number mentioned in the sample design described above is due partly to the removal of under-eighteen year olds and Ceuta and Melilla residents and partly to the weighting of the data.
These points aside, the difference with the universe is just over 8 points, due mainly to the under-estimation by 2.33% of the population over the age of 64 years.It is also possible to observe over-estimation of the youngest population group and those in the 45 -54 age bracket; as has been observed in every round of the survey up to the present [11].The gender-differentiated analysis reveals slightly higher male representation.Males between the ages of 45 and 54 are over-represented by 1.58%, while those in the 55 -64 and 25 -44 age brackets are under-represented.Females in the youngest age bracket are over-represented by 1.6%, and those in the over 65 age bracket are under-represented by more than 2%.
By dividing each magnitude in Table 2 by the total sum of absolute differences-SAV (8.11 points), we are able to detect which subgroup (or subgroups) contributes most to the observed deviation.In this case, close to 30% (29% to be exact) of the deviation appears in the subgroup of women over the age of 64, and 18.89% of it appears in the youngest female age bracket.The next highest-19.21%-appears in the male 45 -54 age bracket.These three subgroups account for 68.4% of the age-and gender-related deviation.
The "ISSP II Environment Survey" uses the Municipal Register for 2009 (CIS, 2010) as its sample frame and carries out the fieldwork in July 2010, which produces a similar situation to that of the preceding case.That is, the age distribution shown in Table 2 is for the 2010 population.The differences with respect to the Municipal Register decrease to 6.97 points, and the under-representation of over 65-year-old (especially females) accounts for most of this deviation.The next most feature is the over-estimation of the 45 -54 age bracket.
The distribution by gender shows that the sample does not adequately represent males.The 45 -54 male age bracket is overestimated by 1.15%, and younger men are underestimated overall and to a slightly higher degree in the 25 -34 age bracket (0.50%, 0.97% and 0.42%).The main difference in the case of the females lies in the under-estimation of the over 65 age group, and the over-estimation of the 45 -54 age group.
The subgroups that contribute most to the observed deviations are older women and men between the ages of 45 and 54.These two subgroups account for 34.7% of the observed differences, with the percentage deviation explained increasing to 48.6% when the male 25 -34 age bracket is included.
When compared, the HB and the January 1, 2010 revision of the Municipal Register show strong similarity in all age brackets, except the 25 to 34 age bracket, which is over-represented by 0.61%.From that point onwards, we find slight but increasing under-representation, which reaches a level of 0.38% in the oldest age bracket.The 35 to 44 age bracket is the best represented.Differentiating by gender, we find the most poorly represented group to be that of males between the ages of 25 and 34, followed by older females.In both cases, the difference is less than 1%, and is therefore attributable to sampling error (±1.13%).Taken together, these two groups explain half of the total deviation, which-it should be noted-is 1.60%.
It appears logical that the quota-based sample should display fewer differences in the age and gender distributions than samples obtained by other procedures, which might make this comparison appear spurious.However, we consider it justifiable by the fact that interviewers are authorized to replace a respondent from one quota with another (of the same sex) from the adjacent quota when contact proves difficult.Despite the fact that detailed analysis of the fieldwork records shows that contact difficulties involved mainly young people and were slightly more frequent in the case of females, they have no significant impact on the representativeness of the age and gender distributions.

Differences in the Educational Attainment Distribution
The responses to the educational attainment questions were recoded to align them with the EAPS categorization.As can be seen in the first part of Table 3, the deviation in the ESS arises mainly from under-estimation in the secondary education category (−9.05%), strong over-representation in the semi-higher qualification category (8.27%), and somewhat less marked over-representation in the incomplete primary and less category (2.15%).The absolute differences of the male and female secondary education categories, divided by total standard deviation (21.48%), show that they account for 42.12% of all the deviation in the table.
The ISSP II Environment Survey over-estimates the semi-higher qualification category (10.32%) and underestimates the incomplete primary and less category (−6.36%), the deviation being greater in the female subcategory (−3.97%).The subcategories that contribute most to the total deviation are the semi-higher qualification category and the female incomplete primary and less category, which, together, account for 60.7% (and 70.9% when the male incomplete primary and less category is included).Note that this survey provides a better representation of men but a poorer representation of women, in complete contrast to the ESS.In sum, the marked differences in educational attainment found in the two probability samplings showed a sum of absolute differences of 21.48 and 23.54 points, respectively.
The data compiled by the HB (using random route and quotas) are somewhat better, since none of the differences across the table enter double figures.There is 7.61% over-estimation in the semi-higher qualification category, and an under-estimation of 6.44% in the highest educational attainment category (university degree).The deviation is notably lower in the primary education category, where we find an over-estimation of 2.43%, which is slightly greater than the under-estimation found in the incomplete primary and less category.No major differences emerge when these results are broken down by gender, since they continue to show the same over-representation in the semi-higher qualification category and under-representation in the university education category, as found in the overall results.The differences in these two subcategories account for 69.7% of all the differences in the table.Source: [13] [16] [18].Data for the universe taken from the NSI [23] [24].

Sample vs. Universe Comparison of Employment Variables
The data collected from the employment status question were use to estimate economic activity, employment and unemployment rates, which we consider more useful than the raw responses.It should be noted that, in the questions about economic activity in all three surveys, interviewees are assigned to the employment or unemployment category according to their own responses.This contrasts with the EAPS questionnaire design, which distinguishes the economically active from the unemployed by purposely wording the questions.In other words, in the surveys considered in this paper, "unemployed" is an ascription that covers respondents who may, in fact, be non-active, that is, not part of the economically active population.This point will be discussed further at a later stage.Table 4 shows that the ESS over-estimates both the economic activity rate (particularly the male estimate) and the employment rate.The ESS unemployment rate estimate is, nevertheless, 0.49% lower than reported by the EAPS, which estimated female unemployment less accurately.It should be noted, however, that this is the lowest deviation found in any of the surveys considered, and that there is no observable change in the results when the population aged 16 to 17 and Ceuta and Melilla residents are included.
Wider deviation can be observed in all three estimates given by the ISSP II Environment Survey, whose economic activity and employment rates deviate from the EAPS by more than 5%.Differentiation by gender reveals the widest deviations to be the over-estimation of the female economic activity rate (7.41%) and the over-estimation of the male employment rate (3.73%).It is also possible to observe an under-estimation of the overall unemployment rate by 1.49%, and by double that amount in the male unemployment subgroup.The sum of absolute values (SAV) reveals deviations of more than 10 points in the economic activity and employment rates.
The HB over-estimates the economic activity and unemployment rates, the second of these by 8.64%, and the female unemployment subgroup by even more (9.32%).The sum of absolute values (SAV) of differences in the unemployment rate is 17.5 points, which is the widest observed deviation in this study, which also finds the overall employment rate to be under-estimated by 3.46%, and male employment by 3.65%.

Discussion
It comes as a surprise to find such marked differences in the age and gender distribution of the probability sampling, where the joint deviation exceed 6 points, and can, therefore, in no way be explained by sampling error.
The most noteworthy discrepancies are those that appear between the two probability samplings-which, as already noted, use a similar sampling design-and can, therefore, only be attributed to the techniques by the ESS to increase survey participation.Re-contacts and refusal conversion techniques enabled the ESS-in the third round (2006)-to retrieve 18.3% and 6.3% of the sample, respectively.Differences in the successful representation of the universe can be explained by the specific features of these two respondent groups [27].
The quality of the HB data is superior to that achieved by the two probability sampling-based surveys, due to the use of age and gender quotas.These require the interviewers to seek out individuals with given characteristics, with the result that the sample matches the universe because of the nature of the quota selection policy.In light of these considerations, quota replacements, by which interviewers are permitted to substitute a non-contact with an individual from an adjacent quota in either direction, are offset by replacements made by other interviewers in the opposite direction, and can, therefore, be classed as errors that cancel each other out.Given that the sample is defined by the age and gender distribution in the universe, age and gender quotas serve little purpose, except to confirm that age and gender are not strongly associated with other variables, such as the economic activity rate.This casts some doubt on the usefulness of age and gender data for quota sampling.
The range of differences that appear in the probability samplings is situated between 10.32 points in the estimation of semi-higher qualification category figures in the ISSP II Environment, and 0.12 points in the primary education figures obtained by the ESS.These two surveys present deviations of 23.54 and 21.48 points, respectively.The HB, in contrast, reflects educational attainment in the population slightly better (20.18 points).This could be due to lower age dispersion, since there is a very close link between age and educational attainment.
A completely different picture emerges when it comes to comparing the employment variables, particularly with respect to the employment, economic activity, and unemployment rates.The ESS over-estimates the economic activity and employment rates by more than 3 points, and more than 4 points in the male subcategory, while the Environment Survey reports even higher rates, particularly in the female activity (7.41%) and in the male employment where the deviations reach the 5.81%.The HB overestimates the economic activity rate by 2.5% and the unemployment rate by 8.64% (9.3% in the case of women), and under-estimates the employment rate by 3.46%.
Two important issues underlie these differences.The first has to do with the different universes under consideration, the population over the age of 15 in the EAPS and the population over the age of 18 in the HB.More significantly, from our point of view, the survey designed for the HB-as noted previously-recommends interviewers to substitute addresses where there is no reply with the one next door.Calling at one household after another (up to 20.8 contacts) results in the replacement both of households where there is no reply-because the occupants are at work or absent for other reasons-and those that are empty-because they are second homes or (to a lesser degree) false entries in the municipal register-with households whose occupants are at home when the interviewer calls.Given that the employed spend less time at home than the unemployed, the probability of not being at home when the interviewer calls is higher among the employed than among the unemployed.In our opinion, this is the reason for the higher rate of unemployed found by the HB (and similar surveys).
This could account for the marked over-estimation of the unemployment rate.The large number of unemployed also explains the higher economic activity ratio, which is calculated as the active population (employed and unemployed) over the population as a whole.
Another possible interpretation of these findings could be that different methods were used to measure the economic activity rate.Some use the number of self-declared unemployed, making no distinction between the unemployed (those who have previously been employed) and the non-active population (those who do not form part of the labour market).Depending on the wording of the question, individuals who were employed in the past but have ceased working for a period in order to care for a relative might select the "unemployed" response in the questionnaire.The EAPS questionnaire, however, is worded in such as way as to enable a distinction between the true unemployed and members of the non-active population.Similar confusion arises with people who have worked in the underground economy and subsequently ceased to do so.The predominance of females in the social care workforce [28] and in the underground economy accounts for the over-estimation of the unemployment rate.

Conclusion
Thus, in summary, random route and quota samples provide a better representation of the age and educational attainment distributions, but over-estimate the figures for the unemployment variable, which the probability samplings estimate more accurately.The CIS, however, which uses random route and quota sampling with substitution methods, obtains more representative samples than other surveys of its type conducted in Spain [29].

Table 1 .
Differences between those three sampling methods.

Table 2 .
Sample vs. universe comparison of age and gender distributions.Vertical percentages and differences between magnitudes (sample estimate minus universe).

Table 3 .
Sample vs. universe comparison of the education distribution by gender.Vertical percentages and differences between magnitudes (sample estimate minus universe).

Table 4 .
Sample vs. universe comparison of economic activity, unemployment and employments rates by gender.Vertical percentages and differences between magnitudes (sample estimate minus universe).