Literature DB >> 35994461

What is the best proxy for political knowledge in surveys?

Abstract

Online surveys are becoming the dominant form for survey data collection. This presents a problem for the measurement of political knowledge, because, according to recent scholarship, unsupervised measurement of political knowledge in web-based surveys suffers from respondent dishonesty. This study examines the validity of five possible survey proxies for political knowledge: self-assessed sophistication, political interest, internal political efficacy, accuracy of party placements on a left-right dimension and political participation. The analysis draws on a 2020 survey data (n = 1,097) and partial replications with identical measures from a 2008 survey data (n = 1,021) from Finland. Through several tests, the five proxies are assessed in terms of convergent validity, criterion validity and predictive validity. Across all tests, political interest performs best on all dimensions of validity and demonstrates largely identical relationships with political knowledge. Although the survey measurement of political interest and political knowledge may partly tap into slightly different constructs, the analysis supports the conclusion that political interest is the most suitable survey proxy for political knowledge from among the five proxy candidates included in the analysis.

Entities: Chemical

Mesh：

Year: 2022 PMID： 35994461 PMCID： PMC9394832 DOI： 10.1371/journal.pone.0272530

Source DB: PubMed Journal: PLoS One ISSN： 1932-6203 Impact factor: 3.752

Introduction

Political knowledge, often defined as ‘the range of factual information about politics that is stored in long-term memory’ [1], belongs to the most commonly deployed variables in the study of political behavior. Political knowledge is the prime empirical indicator of political sophistication, i.e., expertise in the political domain [2]. Used both as a dependent and an independent variable, political knowledge is widely considered as an important facilitator of meaningful political participation. Its analytical significance is based on a widespread scholarly consensus, according to which an enlightened citizenry is a crucial condition for the functioning of representative democracy [1]. Scholars (almost exclusively) rely on survey data for the measurement of political knowledge. The preferred method is asking factual knowledge questions, i.e., questions with clearly defined right and wrong answers. However, asking such questions requires a controlled setting to ensure that no cheating occurs. This has become increasingly difficult due to the widespread use of technological devices and the omnipresence of internet search engines. Recent scholarship has demonstrated that due to respondent dishonesty, responses to factual knowledge items in self-administered surveys cannot fully be trusted [3-6]. However, conducting face-to-face survey interviews is much more costly and time-consuming than, for example, web-based surveys. In panel designs, which may stretch across several decades and include numerous waves, face-to-face interviews are seldom a realistic alternative. In many occasions, this makes proper measurement of political knowledge impossible for many survey projects. For scholars, this is frustrating, because panel studies offer analytical possibilities that are far beyond cross-sectional, or even experimental, survey designs. Secondly, with declining response rates causing much headache [e.g. 7], survey researchers are struggling to find ways to keep the hurdles for survey participation low. Asking factual knowledge questions risks respondent frustration. Political knowledge levels among the public are often low and no one likes to reveal their own ignorance by failing a knowledge quiz in a survey interview. As online surveys are irresistibly becoming the dominant form for survey data collection, scholars have sought ways to deal with the problem of cheating. For example, Motta et al. [5] explore the impact of item wordings on the reliability of unsupervised political knowledge questions. Clifford and Jerit demonstrate that asking respondents to commit to honesty alleviates, but does not eliminate, the distortion caused by cheating [4] and Höhne et al. [6] and Marquis [8] develop techniques for identifying cheating behavior in web surveys (see also [9]). However, as Clifford and Jerit in particular demonstrate, cheating propensity varies across individuals and there is no waterproof way of identifying cheating on the individual-level [4], they suggest using (imperfect) aggregate-level measures to control for the effects of cheating. So while tackling respondent dishonesty can make online measurement of political knowledge possible, it is an uncertain solution. This study explores another possibility–the use of a proxy measure. Political sophistication scholars have previously used a range of different proxies, such as self-assessed sophistication, education or interviewer assessments [10]. However, scholars have not properly analyzed the suitability of such proxies. One reason for this research lacuna is the lack of appropriate survey data with population-based samples that would include factual knowledge items, but also different potential proxies. Using a survey from Finland from 2020 with a sample representative of the voting age population, this study overcomes this problem. Additionally, another Finnish survey from 2008 data with an identical sample and identical measures for some of the proxies, allows replications of the key analyses. This allows the study to go beyond tests of validity and even assess whether the findings regarding the key proxies are consistent across two different measurements that are temporally quite far apart. The main analysis of the 2020 data includes five potential proxies for political knowledge: self-assessed sophistication, political interest, internal political efficacy, accuracy of party placements on a left-right dimension and political participation. The study examines these potential proxies in terms of 1) convergent validity, that is, the extent to which they are measuring the same construct; 2) criterion validity, the extent to which the various proxies are associated with other variables that we would expect them to be associated with, based on what we know about political knowledge; and 3) predictive validity, the extent to which the proxies accurately predict political knowledge. Additionally, face validity, that is, the extent to which a variable on the surface appears to adequately measure what it is supposed to measure, is discussed but not tested. The study finds that, overall, political interest is the best proxy for political knowledge. Unlike the other proxy candidates, political interest has high convergent validity, it demonstrates the expected associations with age, gender and education and is a stable predictor of political knowledge across the two genders and different levels of age and educational attainment.

Survey measurement of political knowledge

The significance of being politically informed was widely recognized by political theorists already before the survey method was developed. Already during the early days of survey research in the US in the 1940s and the 1950s, questions of public competence in selecting leaders and forming meaningful political opinions became an active area of study [10]. However, it was not until a wave of studies beginning in the 1980s [e.g. 1, 2, 11–13] that scholars started to give the measurement of political knowledge proper attention. Since then, researchers have made several advances regarding the conceptualization and empirical measurement of political knowledge and its neighboring concepts. In the empirical literature on political sophistication, many scholars have consistently found low levels of political knowledge among democratic publics. Consequently, also much of the methodological work regarding the measurement of knowledge has examined whether public ignorance about politics is a genuine finding or attributable to various shortcomings in measurement strategies. Scholars have, for example, looked at the impact of excessively inflexible coding methods [14] and the lack of incentives for respondents to try and provide correct answers [15]. Furthermore, traditional survey items may give a false impression of the range of knowledge ordinary citizens hold about politics. As Prior [16] has demonstrated, while people may not always be able to express verbally what they know about politics, they can, for example, recognize key political actors from pictures. Also question format has a significant impact on individual-level findings regarding who is knowledgeable and who is not [17] and there is a particularly persistent and vibrant scholarly exchange regarding the meaning of ‘don’t know’ responses and guessing behavior. While some researchers suggest that there are systematic differences between respondents’ tendency to say they ‘don’t know’ [18], other analyses do not corroborate this finding [19]. Tsai and Lin [20] advance the idea that the measurement of knowledge is affected by guessing propensity and that the success of guessing depends on a person’s level of sophistication. Taken together, research suggests that it is difficult to draw definite lines between knowledgeability and ignorance using simple, additive survey measures of political knowledge, which do not account for the nuances that lie underneath the surface of the actual responses. Thanks to these and many other contributions, there is a good understanding among survey scholars about the problems, restrictions and best practices in the survey measurement of political knowledge. However, this methodological literature is relevant only in a context where political knowledge can be measured under controlled circumstances. Inspired by the possibilities offered by online surveys, a growing literature has started to examine unsupervised survey measurement of political knowledge. Although web-based surveys are cost-effective and convenient, the lack of surveillance by an interviewer is potentially a lethal problem for the reliable measurement of political knowledge, which can be distorted by cheating. With no interviewer or researcher present, checking correct answers online, from books or by asking someone else is not only possible but also probable. Research has confirmed these concerns. When the opportunity presents itself, survey respondents routinely check the correct answers when they respond to political knowledge questions in web-based surveys. Cheating occurs even when respondents are explicitly asked to commit themselves to honesty [4]. Although mobile device use is associated with less cheating than computer use, looking up answers is a problem both in samples that consist of semi-professional respondents (such as those accessible through MTurk) and non-professional respondents in a probability sample [5]. Open-ended questions are usually more difficult than multiple-choice questions, and they seem to elicit even more outside searches for correct answers [6, 21]. Overall, the findings consistently show that cheating enhances political knowledge levels and that although the effects can be alleviated through the choice of question wording and appeals to the respondents’ honesty, the basic dilemma remains. Moreover, existing research emphasizes that cheating in online surveys results in substantively different results regarding political knowledge compared to results from traditional surveys. Checking for correct answers muddles the observed effects; the resulting answers are no longer a measure of how well a person can recall political facts, but a measure of how well a person is able to perform the information search. As Smith, Clifford, and Jerit (2020) demonstrate, fact recall is a product of political interest, whereas ability to find information is related to the motivation to do well in the test. Consequently, they conclude that ‘search engine use reduces the validity of political knowledge measures and undermines the ability to replicate canonical findings in the public opinion literature’ [22]. Given these discouraging findings, scholars have suggested various remedies. Discouragement to use external information sources has some, albeit not enough, effect (e.g. [5]). Using visual instead of verbal knowledge items could offer a way forward [23], but research has yet to establish whether this would work as a replacement for traditional knowledge measurement. Proxy measures offer another alternative. The use of proxies is in itself nothing new, as political knowledge has in previous research been measured through various indirect indicators. However, a comprehensive analysis involving several different candidates for a knowledge proxy has not been done. With online surveys becoming increasingly common and cheating is undeniably a major concern for the measurement of political knowledge, it is arguably timely to return to some of the most commonly used survey items and see if they might provide a feasible solution for survey scholars. The forthcoming analysis examines self-assessment of one’s personal level of factual knowledge, political interest, internal political efficacy, party placements on the left-right dimension and political participation level as potential proxies for political knowledge. While the list is not exhaustive, it includes the most obvious proxies, which are also available in most standard surveys of political attitudes and behavior. All of these proxy candidates, along with a commonly used measure of political knowledge, are available in a cross-sectional 2020 survey from Finland, which was conducted in face-to-face interviews. Additionally, identical measures for knowledge, self-assessment and political interest are available from a comparable survey from 2008, allowing a partial replication of the analysis through a repeated cross-sectional design.

The proxy measures

Self-assessment

Perhaps the most obvious candidate is self-assessment of one’s own level of political sophistication. Various forms of self-assessments are widely used in surveys to measure, for example, ideological self-placement, political interest and personal health status. A typical survey measure of self-assessed political sophistication is a simple question such as ‘how well informed would you say you are of political matters?’ Face validity is arguably very high for self-assessments of political sophistication, because the items explicitly refer to knowledgeability or use some other expression of expertise related to politics. Although a self-assessment would be a highly convenient way to measure actual political knowledge, cognitive biases could weaken its accuracy. According to the Dunning-Kruger-effect, which has received empirical support in many studies from different domains, low-performers are particularly poor at self-evaluating their own abilities [24]. This suggests that not only is there a risk that self-assessments are inaccurate, but their inaccuracy is likely to be different for those who truly are knowledgeable compared to those who are not. Moreover, it is simply tempting for anyone to portray oneself as more politically sophisticated than what might actually be the case, because it is socially desirable to be politically informed. Hence, alongside, or in addition to, a possible Dunning-Kruger-effect, people might exaggerate their sophistication in surveys because they want to give a good impression. If, however, self-assessment turns out to be a suitable proxy, it offers a simple, one-item alternative that can be used conveniently in self-administered surveys.

Political interest

Another apparent candidate is the self-declared interest in politics. It is available in practically all surveys about political attitudes and behaviors, which would also make political interest a very convenient proxy for political knowledge. The expression of political interest is widely considered to indicate motivation to engage with politics and which can be linked to behaviors such as following politics in the media and discussing it [e.g. 25]. If interest entails motivation, it seems plausible that interest would have a strong relationship with factual knowledge as well. People, who are interested in politics, or anything else, typically expose themselves to information about it. While it is logically possible to be very interested in something but not be well informed about it (or vice versa), in real life the two are typically closely linked. In democracies, following politics is voluntary and therefore expressions of political interest are likely to reflect genuine motivation to become involved with political matters. However, self-expressed political interest could be particularly vulnerable to social desirability effects, because normative expectations regarding democratic citizenship tend to emphasize paying attention to politics as a citizen virtue. Although face validity seems high, it could be that respondents do not consider interest, i.e. an expression of motivation, as a matter of sophistication.

Internal political efficacy

The two-dimensional concept of political efficacy includes an external and an internal component. External political efficacy refers to the feeling of system responsiveness to one’s needs and wants, and its face validity as a proxy for knowledge seems poor. Internal political efficacy (IPE), on the other hand, involves a subjective evaluation of how well a person understands politics and feels capable of participating in it [see e.g. 26]. IPE is a widely used concept and it is routinely included in political surveys, in slightly varying formats. IPE resembles self-assessment (see above), but it nevertheless taps into a slightly different aspect of self-evaluation. While a self-evaluation of sophistication requires the respondent to assess knowledgeability, IPE assesses ability to understand and take part in politics. IPE is measured using various combinations of (semi)standardized items, such as ‘I consider myself to be well qualified to participate in politics’ or ‘Sometimes politics and government seem so complicated that a person like me can’t really understand what’s going on’ [see e.g. 27]. Metaphorically, one could say that while a self-evaluation of sophistication asks whether the respondent is familiar with the parts of a car, IPE asks whether the respondent feels confident in driving a car. Despite this difference, face validity is likely to be high for IPE, as it includes a direct reference to personal ability to understand politics. However, given the variation in the specific items used to measure IPE, also face validity is likely to vary depending on these choices.

Party placements

Another widely available and potential measurement is the placing of parties or their policy platforms on an ideological dimension, typically the left-right dimension. Used e.g. by [28] as a proxy for sophistication, comparing the respondents’ party placements with those by experts, could be a close proxy for factual knowledge. The closer a person is to experts’ party placements, the more likely it is that the person is highly informed about politics. The underlying assumption is that more political knowledge leads to more accurate perceptions of party ideology, assuming that expert perceptions are accurate. A potential caveat is that respondents may be prone to evaluating preferred parties closer to their own ideological positions, rather than positioning parties according to an objective assessment. This pertains also to the face validity of party placements. Although many respondents are likely to think of party placements as a matter of factual knowledge in the sense that some placements are more ‘correct’ than others, the question itself is nevertheless framed as a matter of opinion, not as a matter of knowledge, as political knowledge items are. However, placing parties on any ideological continuum also depends on the country context. In Finland, where there are several parties with somewhat fluid ideological boundaries, there is always room for disagreement about their correct placement. In similar cases of high fragmentation in the party system, party placements are likely to be good proxies for knowledge. However, in two-party systems, where there is little discussion about which of the parties is, e.g., more conservative versus liberal, party placement is more likely a direct measure of basic knowledge about the system, rather than a proxy for it Consequently, usage of party placement as a knowledge measure is always context-dependent. Although projections of own ideological positioning may cause some confusion in measurement, party placements show much promise as a potential proxy for factual knowledge items. However, non-responses, especially in contexts with a high number of parties, pose an analytical challenge. It is difficult to place certain niche parties on a left-right (or liberal-conservative) continuum. In those cases, choosing not to place a party at all could be a sign of sophistication, although in survey analysis such missing values are often interpreted as indicating ignorance. Moreover, a related problem is that it may not be sensible to ask people to place parties on a specific ideological dimension if that dimension lacks a connection to the party’s political profile. Obviously, statistical imputation offers several alternatives for dealing with the resulting missing values, but all techniques also have limitations. In the forthcoming analysis, the missing values have been replaced by mean imputation, that is, by assigning the mean absolute difference between the respondents in the sample and the experts. The measure used in the analysis is the grand total of the absolute differences in evaluations of the eight parliamentary parties in Finland. Additionally, one MP in the Finnish parliament formed his own group alone. This one-person group was not included in the Chapel Hill survey and is therefore omitted. The values have been weighted for the number of parties that the respondent was able to place: the more parties a respondent placed on the ideological scale, the higher the (potential) value. For example: Respondent 1 places 8 parties on the scale, whereas Respondent 2 only places 6 parties. Assume that the grand total of the absolute differences between respondent and expert placement, is -10 for both respondents. For respondent 1, -10 is the total difference after 8 party placements whereas for respondent 2, the same score is the total difference after only 6 party placements. The weighted score for respondent 1 is calculated as follows: -10/8 = -1.25; and for respondent 2: -10/6 = -1.67. Thus, respondent 1 gets a score closer to 0, which indicates a better score than the score for respondent 2 who only placed six parties. For the analysis, the scores have been converted and rescaled into values between 0 and 1. By applying this weight, it is assumed that willingness to provide party placements in a multiparty context is in itself a sign of sophistication, whereas providing missing values is the result of inability to place parties.

Political participation

Perhaps the most significant insight that has emerged from the mainstream empirical literature on political sophistication is that people who know more about politics also participate more actively [1, 29, 30]. This finding is firmly anchored in the normative debate regarding the importance of sophistication for the functioning of democracy. It is widely accepted among the engaged scholars that political knowledge is a resource, which lowers the threshold for active participation and the realization of democratic citizenship [e.g. 1, 31]. The connection between participation and knowledge is therefore close both theoretically and empirically and e.g. Krosnick and Millburn [32] have used participation as an indicator for knowledge. Questions measuring (self-reported) political participation are also commonplace in surveys, which makes it another attractive option for a knowledge proxy. However, in comparison with the other proxy candidates, face validity seems low for participation. It is unlikely that questions asking about a person’s political participation could be seen as questions intended to measure political sophistication. Consequently, the appropriateness of participation as a proxy for sophistication relies primarily on the empirical linkage between the two.

Materials and methods

The analysis is primarily based on a survey conducted in Finland in 2020. The survey data used in the study has been gathered in accordance with the GDPR regulations. As the data does not include any sensitive personal data, an ethics committee approval was not required in Finland. Informed consent was acquired from each respondent verbally, as per the standard procedures of the survey company. Before asked to state if they would agree to being interviewed, the professional interviewers showed every respondent a written description of the survey, including the data protection statement. After that, respondents were asked whether they would consent to being interviewed. The consent was documented by the interviewer. The 2020 survey replicated several political knowledge items and standard formulations of political interest and self-assessed sophistication from a 2008 survey. For these variables, the question wordings were identical in 2008 and 2020, and the same analyses will be repeated with both data for robustness. For the other variables (IPE, party placements and participation) only data from 2020 is available. The two surveys are also based on identical sampling of the voting age population. Both were conducted through face-to-face interviews by the same survey company and the same post-survey weighting method is used for both data. Both surveys were also conducted in election off-years, so that possible effects of a close surveillance of political campaigning among the public would not contaminate the findings. The expert data, used to calculate the deviations from expert party ratings, comes from the Finnish data in the Chapel Hill 2019 survey, available through https://www.chesdata.eu/2019-chapel-hill-expert-survey. Table A1 in S1 Appendix reports all variable information, including original wordings and response categories. For the analyses, all variables have been standardized by using z scores.

Results

Table 1 begins testing convergent validity through correlational analysis. The table shows the correlation coefficient and the 95% confidence intervals in brackets. All correlations are statistically significant (Spearman rank-order correlation, two-tailed significance at < .001-level).

Table 1

Correlational analysis with the 2020 data (n = 1,097).

	Knowledge	Self-assessment	Political interest	IPE	Party placement	Participation
Knowledge	-	.431 [.382 .478]	.444 [.395 .490]	.307 [.252 .360]	.372 [.319 .422]	.212 [.154 .268]
Self-assessment	.431 [.382 .478]	-	.638 [.601 .672]	.432 [.383 .479]	.267 [.210 .321]	.343 [.289 .395]
Political interest	.444 [.395 .490]	.638 [.601 .672]	-	.386 [.335 .436]	.259 [.203 .314]	.417 [.366 .465]
IPE	.307 [.252 .360]	.432 [.383 .479]	.386 [.335 .436]	-	.247 [.190 .303]	.273 [.217 .328]
Party placement	.372 [.319 .422]	.267 [.210 .321]	.259 [.203 .314]	.247 [.190 .303]	-	.242 [.185 .299]
Participation	.212 [.154 .268]	.343 [.289 .395]	.417 [.366 .465]	.273 [.217 .328]	.242 [.185 .299]	-

All proxy candidates correlate at least moderately with knowledge and each other. This suggests that all the variables are inter-related, as expected. Self-assessment and political interest have the strongest correlation (.638). Knowledge correlates most strongly–but modestly–with political interest (.444) and almost as strongly with self-assessment (.431), and least with participation (.212). Consistent with these findings, in the 2008 data knowledge correlates with self-assessment and political interest at .450 and .408, respectively (Table A2 in S1 Appendix). This reinforces the initial finding that self-assessment and political interest have highest convergent validity. The relationship between knowledge and interest is stable across measurement during off-election years and in the context of a general election. While the data used in this study comes from an off-election year, the latest Finnish parliamentary election survey data from 2019 (FNES 2019) was collected immediately after the elections. It uses a similar sample, a comparable 5-item political knowledge measure and an identical political interest measure. The Spearman correlation between knowledge and interest in FNES 2019 is .417, which is nearly identical to the .444 correlation reported in this analysis (Table 1). Principal component analysis (PCA) provides a more durable test of convergent validity. The variables load on a single dimension (Table 2), suggesting that they measure a single construct. Horn’s parallel analysis, which is the preferred method for ascertaining the appropriate number of factors retained by PCA [see e.g. 33], confirms the single-factor interpretation (Fig A1 in S1 Appendix). In other words, all proxies show potential as they all tap into the same underlying dimension.

Table 2

Knowledge and its proxies: Principal component analysis with 2020 data (n = 1,097).

	Loadings	Uniqueness
Knowledge	.562	.652
Self-assessment	.777	.380
Political interest	.817	.322
IPE	.567	.678
Party placement	.359	.818
Participation	.512	.705

Note: Eigenvalue: 2.298, Chi2 test <***

Note: Eigenvalue: 2.298, Chi2 test <*** Factor loadings and uniqueness reflect the strength with which each component is connected to the common dimension. As with the correlational analysis, party placement shows the weakest connection with the rest (weak loading, high uniqueness), while particularly self-assessment and political interest are the components, which are most closely connected to the common dimension. Using data from 2008, Table 3 confirms the same pattern. For the three variables that are available in both data, the results are strikingly similar for both surveys. The variables load on a single dimension, and again self-assessment and political interest are more closely connected to the underlying dimension than knowledge (stronger loadings, lower uniqueness). This suggests that the underlying construct, which could be termed ‘self-expressed cognitive engagement with politics’, is more about cognitive engagement expressed through self-assessed sophistication, interest and a sense of efficacy, and somewhat less about objectively measured knowledge. The common dimension is also less connected to participation and party placement. Hence, in terms of convergent validity, self-assessment and political interest stand out as the primary contenders for best proxy for political knowledge, based on both a correlational and a principal component analysis.

Table 3

Knowledge, self-assessment and political interest: Principal component analysis with 2008 and 2020 data (n = 1,020/n = 1,097).

	Loadings		Uniqueness
	2008	2020	2008	2020
Knowledge	.589	.574	.653	.671
Self-assessment	.746	.788	.444	.378
Political interest	.715	.808	.489	.347

Note: 2008: Eigenvalue: 1.4141, Chi2 test <***. 2020: Eigenvalue: 1.6047, Chi2 test <***

Note: 2008: Eigenvalue: 1.4141, Chi2 test <***. 2020: Eigenvalue: 1.6047, Chi2 test <*** Criterion validity evaluates the extent to which the various proxies are associated with other variables as expected. As a first step, multivariate linear regressions were run, where the knowledge measure and the proxy candidates from the 2020 data take turns as the dependent variable, while gender, age and education are entered as independent variables in each analysis. The reasons for choosing (only) these variables are twofold. Firstly, canonical findings from previous research show that these variables are the most significant individual-level sociodemographic predictors of political knowledge. That men tend to score higher in survey knowledge questions is a persistent finding [e.g. 34]. Age typically shows a positive association with knowledge [35], while high education is a particularly strong predictor of high political knowledge [1]. In many studies, the association is so strong that education could also have been considered as another potential proxy, rather than a control variable. The bivariate correlations in the data used in this analysis were, however, lower for education than those reported above in Table 1, suggesting that education does not quite reach the same proximity to political knowledge as the other proxy candidates. In the 2020 data, Pearson correlations for education and knowledge and the proxies range between .226 and .361 and in the 2008 data between .227 and .304. Secondly, gender, age and education are available in practically all surveys that measure political attitudes and behavior. To maximize the generalizability of the findings across contexts, age, gender and education form the most appropriate set of independent variables for the current analysis. Fig 1 below summarizes the results from a series of analyses (see Supporting information for full results and Fig A2 in S1 Appendix for the partial replication with 2008 data). It shows the coefficients from linear regressions with the 2020 data, where gender, education and age are used as predictors for political knowledge and the proxy candidates. Each analysis has been run separately, with gender, education and age entered simultaneously as independent variables, and knowledge and its proxies as the dependent variable, one at a time. All variables have been standardized using z-scores for comparability.

Fig 1

Gender, age and education as predictors of knowledge and its proxies (n = 1,097, 95% CIs).

For knowledge, political interest and self-assessment, the observed patterns roughly follow expectations. Most significantly, male gender, education and age are strongly and positively associated with knowledge, which aligns with previous research. Additionally, interest and self-assessment show similar associations, although regarding age, the large CIs imply more individual variation than in the case of knowledge. For the others, the findings deviate substantially from knowledge. Age is negatively associated with IPE, party placement and participation, which conflicts with how they associate with knowledge. This suggests that using these variables as a proxy for knowledge might lead to substantially different findings. Lastly, let us assess predictive validity, which, in psychometric testing, refers to the ability of a score on a scale to predict the values of a criterion measure. Here, the proxies represent the scores and political knowledge is the criterion measure. Fig 2 compares the coefficients from five separate analyses, where one proxy candidate at a time and the three control variables have been entered as independents and where political knowledge is entered as the dependent variable. The figure only displays the coefficients for the proxies, and excludes the controls, for convenient reading (see Supporting information for full results and Fig A3 in S1 Appendix for partial replication with 2008 data). All variables have been standardized using z scores for comparability across the different measures.

Fig 2

The proxies as predictors of knowledge (coefficients and 95% CIs).

All proxy candidates are statistically significant predictors of knowledge, when controlling for age, gender and education. However, self-assessment barely crosses the threshold of significance (p = .046), while all others are significant at p < .001. Comparing the magnitude of the coefficients, the ability to provide party placements that are similar to placements by experts stands out as the prime candidate in this comparison. Political interest is not far behind and the two have partly overlapping confidence intervals. Moreover, the range for CIs is significantly greater for party placements than for interest, suggesting that the latter might be less affected by individual-level differences. Instead of using one of the proxies, they could all be combined into one factor, as suggested by the one-dimensional solution of the PCA reported in Table 2. All of the proxy candidates might not be available in all surveys, which makes this solution less likely to be practicable, but nevertheless worth exploring. As reported in S4 Table in S1 File, the factor score for the proxy candidates (PCA, Bartlett method for estimating factor scores) is almost as strong a predictor of political knowledge as political interest. It is therefore also a viable method for using a proxy for knowledge, but in terms of predictive validity, using only political interest is at least as good a solution. Another way to compare the magnitude of the coefficients is to enter all of them and the control variables in the same model and as a post-test compare the standardized coefficients. Table 4 reports the pairwise comparisons across all the proxy candidates based on such a regression model (detailed results and replication with the 2008 data in Tables A3 and A4 in S1 Appendix) to test whether the differences between the proxy coefficients are statistically significant.

Table 4

Comparisons of coefficient magnitudes with the 2020 data.

Variable [coefficient]	Comparison (F)
Self-assessment [.013]–political interest [.058]	18.78***
Self-assessment [.013]–IPE [.016]	.13
Self-assessment [.013]–party placement [.023]	1.54
Self-assessment [.013]–participation [.007]	.50
Political interest [.058]–IPE [.016]	18.33***
Political interest [.058]–party placement [.023]	11.83***
Political interest [.058]–participation [.007]	18.66***
IPE [.016]–party placement [.023]	.64
IPE [.016]–participation [.007]	.97
Party placement [.023]–participation [.007]	2.80

The coefficients in Table 4 are based on z-scores, making the proxy candidates comparable in terms of how strongly they predict political knowledge. By comparing the F-values, it is possible to assess the probability of statistically significant differences between the predictive strength of the proxies. Again, political interest stands out as the prime candidate. It has a larger coefficient than the other candidates, suggesting a stronger association with political knowledge, and in comparison with the other candidates, the difference is statistically significant. As a final test, let us examine whether the predictive ability of the proxy candidates varies across different categories of the control variables. In other words, do the proxy candidates have equal predictive validity for men and women, people in different ages and levels of educational attainment? Figs 3–7 below display the adjusted predictions for all combinations of the proxy candidates and the sociodemographic controls, in altogether 15 separate analyses with the 2020 data (see Supporting information for full results). In these analyses, statistically significant interaction coefficients are interpreted as suggestive evidence of poor predictive validity, as these suggest differences in how the proxies predict political knowledge across the sociodemographic controls. These differences are, however, sensitive to the choice of reference group. In the reported analyses, the group with supposedly lowest level of political knowledge was selected as the reference group (18-30-year-olds, female, comprehensive or less education).

Fig 3

Interactions between self-assessment and the sociodemographic variables.

Fig 7

Interactions between political participation and the sociodemographic variables.

Generally, the proxies predict knowledge similarly across the various respondent categories, with some significant exceptions. For political interest, there are no statistically significant interaction effects for age groups or education, but there is a gender difference. Predictive validity is higher for men, indicated by the steeper slope of the coefficient, which suggests that interest is more accurate in predicting political knowledge among men than women. A similar effect for gender is found for party placements. For party placements, predictive validity also varies across age groups although the differences are not substantial. For self-assessment, there are statistically significant interactions for age and education, suggesting that predictive validity for self-assessment varies across these groups. In the case of party placements, predictive validity is highest for the youngest age group, 18 to 30-year-olds, who also differ statistically significantly from all the other groups. For education and gender, predictive validity is stable for party placements. When it comes to IPE, the only significant interaction is with age, suggesting that in this regard, IPE is a reasonably good predictor of knowledge. Table 5 below offers a simple summary of all the preceding tests. The plus-signs indicate an assessment that a proxy candidate ‘passed the test’, whereas the brackets indicate borderline findings.

Table 5

Summary of findings.

	Convergent validity	Criterion validity	Predictive validity
Self-assessment	+	(+)	-
Political interest	+	+	+
Internal political efficacy	(+)	-	(+)
Party placement	(+)	(+)	+
Participation	(+)	-	-

In terms of convergent validity, all proxy candidates showed some potential. They tap into the same underlying construct and show at least a moderately strong correlation with political knowledge. However, self-assessment and political interest are more closely correlated with knowledge and they also load more strongly to the common factor, suggesting they converge more intensely with knowledge than the rest. For criterion validity, the observations are more straightforward. Strictly speaking, only political interest shows similar associations with age, gender and education, as political knowledge. Self-assessment comes close, but the large confidence intervals in the coefficient for age suggest that there is plenty of individual variation in terms of how accurately self-assessment predicts political knowledge. Age is a statistically insignificant predictor of IPE and party placements and a strongly negative predictor of participation. These observations clearly deviate from the expected pattern. Political interest is the most consistent performer even in terms of predictive validity. Although the ability to correctly place parties on the left-right dimension has an even stronger predictive capability (controlling for age, gender and education) than political interest, the latter predicts political knowledge more evenly across age groups. IPE has also high predictive validity, but does not quite reach the same level as interest. In a summarizing assessment, political interest seems slightly better than party placements when it comes to predictive validity. Consequently, political interest is, among the candidates covered by the analysis, the most suitable proxy for political knowledge in terms of all three types of validity.

Discussion

Survey research is struggling with low response rates. Online surveys continue to gain ground as a cost-efficient and respondent-friendly way to attract respondents. For reliable and valid measurement of political knowledge, this development presents a problem because of widespread respondent dishonesty in self-administered web surveys. This study has explored the possibility of circumventing the problem through survey proxies. The analysis evaluated self-assessed political sophistication, political interest, internal political efficacy, placement of parties on an ideological dimension and political participation as potential proxies. While all the proxy candidates show some promise, the recommendation that emerges from the analysis is that political interest is the best survey proxy for political knowledge. For survey researchers, self-reported political interest is an attractive option for indirect measurement of political knowledge, because it is simple and convenient. Particularly self-assessment, IPE and party placements are also likely to produce somewhat similar results as a typical index measure of political knowledge. However, in this analysis, none of them demonstrated the expected associations with the most widely used predictors of political knowledge, suggesting that each of them are partly driven by other factors than those that affect factual knowledge. In the typical situation, where researchers wish to make conclusions about the individual-level precursors of political knowledge, using these variables as proxies therefore runs the risk of misjudgment. If, on the other hand, only a rougher population-level estimate of political knowledge levels is needed, self-assessment, IPE and party placements are likely to be adequate. So what would substituting knowledge for interest entail? The two are evidently related empirically, but do they tap into the same phenomena in the minds of survey respondents? Previous research gives reason to optimism. It is widely thought that an expression of political interest reflects a person’s level of motivation to engage with politics. This understanding originates from psychology, where interest as a general concept is connected to feelings of motivation [36, 37]. Similarly, motivation has been considered also as a precursor of political knowledge [1]. It seems plausible that motivation is in some sense a shared ‘root cause’ for the empirical similarities between interest and knowledge, as demonstrated in this study. As Prior and Lupia [15], for example, have shown, the number of correct answers to knowledge questions increases with respondent motivation, which can be manipulated through rewards. Therefore, getting knowledge questions right in a survey interview is to some extent a product of motivational factors, such as interest. However, responding to factual knowledge questions in a survey setting inevitably involves also a dimension of performance. Getting factual questions right requires an ability to perform well in a test situation, and some are better at it than others. Providing a self-assessment of political interest does not require a similar ability to perform in a test situation, but it could be vulnerable to social desirability bias, because declaring interest in politics is usually considered a citizen virtue. Therefore, using political interest as a proxy for political knowledge could mean substituting a problem with disentangling the measurement of knowledge from performance ability with a social desirability problem. Although self-assessment and IPE fell short of interest in the overall empirical evaluation, they might nevertheless be acceptable proxies for knowledge as well, with some reservations. Both have arguably very high face validity, which could help in circumventing the problem of motivation versus test performance measurement, as discussed in regard to political interest. It seems plausible that self-assessment and IPE elicit responses reflect an informational, rather than a motivational component in the minds of survey respondents. Hence, the choice of appropriate proxy for political knowledge could to some extent depend on analytical context and design. Moreover, it is important to note that using political interest (or any of the others) as a proxy for political sophistication does not allow distinguishing between different dimensions of political sophistication [see e.g. 38, 39] or examining in detail how sophistication relates to other similar constructs. Based on this study, it is only possible to suggest that political interest is slightly better than other closely related survey items as a substitute for direct measurement of political knowledge. It is best employed in situations where a simple measure of political knowledge is used as an explanatory variable or even as the dependent variable, but not in situations that involve a more complex conceptual design around the broader notion of political sophistication. The conclusions in this study are based on a survey of a sociodemographically representative sample of the Finnish voting-age population from 2020, and partial replications using identical variables and sample from 2008. These data include an unusually rich set of political knowledge items and potential proxies. Although the analyses could only be replicated with self-assessment and political interest, the results were remarkably consistent in the two datasets across all the analyses. While this consistency gives confidence to the presented observations and suggests high test-retest reliability, the findings come with certain limitations and gaps remain for subsequent research. Some potential proxies for political knowledge were unavailable in the data, most notably a suitable measure of media consumption. The measure that was included in the survey asked how important the various forms of media are for the respondent for keeping informed about political matters. Although media preference and political news consumption are undoubtedly associated with political knowledge, the item was excluded from the analysis, because it is formulated in a way that is a bit unusual for most surveys. In election studies, for example, media consumption is typically measured by asking how much time a person has spent following the on-going or recent election campaign. The item that was available in the data focused on the preferred type of media, rather than intensity. To maximize generalizability of the results to typical survey designs across national contexts, only those proxies were included, which were measured with commonly used question wordings. For the same reason, only age, gender and education were included for testing criterion validity. However, subsequent research should include other variables, such as income. Further research is also needed to unlock regional differences across national contexts in terms of political knowledge, which could not be accounted for in this single-country analysis. Moreover, future scholarship should design online surveys that make it possible to compare whether knowledge proxies and items asking to commit to honesty produce similar findings. This would help bridge the gap between the previous literature looking at honesty commitment items and the current study, which examines the use of proxies. In this way, we might reach a better understanding of whether one method of indirect measurement of political sophistication is somehow superior compared with the other. A related, but much broader, question is whether recruiting people to online versus offline surveys produce different kinds of samples. If so, this could even affect the validity of any sophistication measure, depending on whether it is employed in an online or offline survey. Although recent scholarship suggests different modes of recruitment lead to similar samples [40], more research into the matter is still needed. (DOCX) Click here for additional data file. (DOCX) Click here for additional data file. 9 May 2022

PONE-D-22-05474

What is the best proxy for political knowledge in surveys?

PLOS ONE Dear Dr. Rapeli, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript by Jun 23 2022 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:

A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'. An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'. If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols. We look forward to receiving your revised manuscript. Kind regards, Sean Richey Academic Editor PLOS ONE Journal Requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf 2. Please amend your current ethics statement to address the following concerns: a) Did participants provide their written or verbal informed consent to participate in this study? b) If consent was verbal, please explain i) why written consent was not obtained, ii) how you documented participant consent, and iii) whether the ethics committees/IRB approved this consent procedure [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes Reviewer #2: Partly ********** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes Reviewer #2: Yes ********** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes ********** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes ********** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: This manuscript is on an important topic, is clearly structured and written, uses appropriate data and methods, and produces interpretable results. My major hesitation is whether the findings are of sufficient interest and move forward our understanding enough to warrant publication. The problem of measuring political knowledge (either because there are no direct knowledge questions on a survey or because of concerns about reliability/validity of knowledge questions in online surveys) is a real one but not a new one. Researchers regularly use the kinds of proxies included in this manuscript. The analyses here provide evidence that political interest does a marginally better job than the other proxies (though none of them, to my mind, stand out as particularly equivalent to directly measuring knowledge. In addition, using proxies necessarily means one can’t study the various causal and interactional relationships among the various components of political sophistication if some of these components are used as stand-ins for the others. So what have we learned? If one wants to measure knowledge but does not have (or fears the reliability and validity of) direct knowledge questions, and one is not interested in the relationships among the components of political sophistication and/or the causes or effects of these distinct qualities, then using political interest as a substitute for knowledge is marginally the best choice. Is this enough of a contribution to be published in Plos One? I’m torn on this, though lean slightly to saying yes. Specific Comments: 1. Abstract: Clear and concise on the manuscripts purpose, relevance to existing research, data, methods, and conclusions. 2. Introduction (pp. 2-5): Generally clear and informative, setting the stage for the research to follow. To my knowledge there is little evidence of respondent frustration or survey incompletion due to asking properly asked knowledge questions (p. 3, lines 51-54). The Clifford and Jerit research cited (p. 4, lines 58-59) would seem to address the problem being addressed in this manuscript – say more about why “another possible solution” is still needed? 3. “Survey Measurement of Political Knowledge” section (pp. 5-8): Summary of research on measurement and its potential shortcomings (pp. 5-6, lines 91-119) are generally accurate, though might specifically note the relationship between guessing and gender, which has been a major area of study/debate, as well as the substantive topics queried about. Stating that “Cheating occurs even when respondents are explicitly asked to commit themselves to honesty” (p. 7, lines 132-133) seems to contradict what is said in the prior section (P. 4, lines 58-59) and below (p. 7, lines 140-141. More clarity is needed. Nonetheless, the general point – that cheating occurs and that it reduces the validity of the measure – is an important one that is generally supported. The concluding argument regarding the use of proxies and the absence of research on how well they do is clear and convincing. 4. “Proxy Measures” section (pp. 8-13): Generally good overview of the proxies to be tested and the logic underlying the choices. I would have added “political attention” (i.e., self-reports on how often one follows politics) to the list. Also, party placement” is actually a (simple and limited) knowledge question and not really a proxy, despite the authors attempt to deny this (P. 11, lines 245-247. The approach to measuring party placement (pp. 12-13. Lines 248-275) is interesting, but not necessarily the obvious or only choice. I like the speculation on non-response as evidence of sophistication as opposed to ignorance, but this is just speculation. And the issue of what dimensions to place parties on seems an issue of question design and wording – we ae not locked into a single question on a single dimension. Nonetheless, the authors approach (imputation and weighting) is not an unreasonable one. In the end I remain concerned that the use of proxies conflate different components of citizenship that ideally we would want to keep distinct for purposes of identifying causal relationships among them, but this does not diminish the value of the research presented here. 5. “Materials and Methods” section (pp. 13-25): Could use more info about the surveys here (p.14, lines 295-307) -- e,g., who did the survey, sampling method and size, response rates, demographics relative to population, etc.- here or in an appendix. Would also be useful to know how the knowledge scale was constructed and its reliability. That the knowledge and proxy variables correlate is not surprising, To put the size of these correlations in perspective, the strongest correlations with knowledge (self-assessment and interest) “explain” roughly 20% of the variance in knowledge. The factor analysis does suggest a single dimension (something like “political sophistication”), though again this is not surprising. Also I’m not clear on how the factor analyses support the conclusion that self-assessment and interest are “the best proxies” for political knowledge (as opposed to best proxies for political sophistication). The convergent and predictive validity tests are interesting and provide some new/useful information regarding the use of specific proxies. I would have included partisanship and income in these analyses. The central conclusion drawn from these analyses – that relatively speaking, political interest appears to be the strongest proxy – is generally supported. I wonder if the authors thought of creating a measure combining the various proxies (perhaps based on the factor analyses, but excluding knowledge) to see how it performs relative to any single measure? Also, any reason to think the reliability of the various proxies (relative to each other and the knowledge measures) vary in important ways? Also any reason to think that responses to the proxies might also be affected/different in online surveys? 6. “Conclusion” section (pp. 25-27): Generally fair summary of findings, limitations, and implications. Reviewer #2: The manuscript offers an examination of predictors of political knowledge based on data from two face-to-face surveys of voting-age Finnish residents. Self-reported knowledge, political interest, internal efficacy, party ideology assessment, and political participation are examined as potential proxies of factual political knowledge. The paper is clearly written and presents solid arguments for the problematic nature of political knowledge metrics in online surveys. Two aspects of the data used in the study make it more valuable and interesting in my assessment. One is the national context – Finland is an interesting case since much of the political knowledge literature is US-centric. The second aspect is the face-to-face interviews used for data collection, a fairly rare modality these days. The main challenges I see with the study are a certain lack of novelty, as well as lack of evidence that the examined variables are useful as proxy measures in practice. In this work a series of variables are framed in the context of finding proxies for political knowledge. Those variables are not new -- there is already a vast body of literature informing our understanding of the relationships they have with political knowledge. The manuscript does not discuss very much of that existing literature when presenting the proxy variables, treating this as an exploratory study. Perhaps a case can be made that this is so in the Finnish context – overall though, the connections between all of those variables are well established. The second challenge I see is that the study does not (and cannot, given its design) show that using these variables as proxies is really beneficial. To do that, we would have to confirm that those variables do a better job of representing the political knowledge construct compared to direct measurements of factual knowledge gathered through online surveys. Previous studies (e.g. Burnett, cited by the paper) give us some information by comparing the measurement of knowledge across survey modalities (online and offline). The present work confirms the known link between political interest and factual political knowledge. It also summarizes challenges that make online measurements of political knowledge somewhat problematic. Yet those two claims are not enough to show that the somewhat flawed online measurement of knowledge is a worse or more biased way of approximating political knowledge compared to using proxy variables that seem likely to be weaker as predictors. A few other notes on the paper: -- It would be helpful to give some standard information about the sample in the methods section – e.g. recruitment, incentives, demographics and their match with the general population demographics. One thing to consider/discuss more generally is if a sample of people that we can recruit for face-to-face interviews today skews in ways relevant to political knowledge compared to people who can be recruited to do online surveys. -- One thing to note is that political interest and other variables may be more highly linked to knowledge in non-election years (when the surveys were done) when the media environment is less saturated with political information. -- The study examines the connections of variables with political knowledge across gender, age, and education groups. One other variable that may be useful to include is income, unless it’s too highly correlated with education. -- Some of the variables in the study may have a non-linear relationship with education, hence some of the patterns captured in the figures. -- Ideally internal political efficacy would be measured through more than a single item. -- The study suggests that using two datasets from different years can be used to examine test-retest reliability. That is likely not the case given that the two surveys had different participants. Moreover, political variables would not be expected to remain stable over a period of 12 years. ********** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. 29 Jun 2022 I thank the reviewers for a detailed reading of my manuscript and for all the insightful comments. I found the critical remarks to be fair and I have done my best to respond to them. My responses below roughly follow the order of the decision letter. I have provided responses to the reviewers’ comments and documented the actual changes I have made to the manuscript, for more convenient reading. I have also tried to format the manuscript according to the journal requirements. I look forward to hearing how you feel about the revised version. 1. Significance of the paper and its findings Both reviewers challenge whether the analysis is interesting and relevant enough to warrant publication. Although the reviewers, thankfully, slightly lean towards an affirmative answer, it is still necessary to give the issue proper attention. Firstly, as reviewer 1 points out, researchers regularly use those variables, which I examine in the paper, as proxies for knowledge. However, since there is almost no research into how those proxies actually relate to knowledge, this lacuna is arguably in itself a strong reason for why papers such as the one at hand are needed. My paper, if accepted, will not provide all answers, but it would serve as a useful guiding post for further research and it could, I hope, offer some concrete information to those survey researchers who are forced to resort to using a proxy for knowledge. Until now, they have had very little to go on when choosing between possible proxies. In my opinion, an important argument that supports publishing the results is (as I have tried to argue in the paper) that a further expansion of web-based surveys seems inevitable and the validity problem of knowledge questions will only grow in importance. We still do not have optimal solutions to deal with that problem. Any step towards a feasible solution is, in my view, valuable. So far, research has portrayed the use of honesty appeals in web surveys as to some degree useful but imperfect, but the use of proxies has not been properly explored. This is how I have sought to frame the paper, both in the Introduction and in the concluding Discussion. Additionally, thanks to reviewer comment #2 below, I now include a better explanation for why existing literature on alleviating online cheating is not enough. However, I also understand the reviewer’s hesitation regarding the primary finding(s) of the paper. As the reviewer correctly concludes, it seems that political interest is slightly better than the other proxy candidates, but using any of them is problematic, if one wishes to investigate how knowledge associates with its close conceptual relatives or be able to distinguish between different dimensions of sophistication. I agree and admit that in the initial submission I had not adequately discussed under what circumstances and in what kinds of research designs the investigated proxies could actually be used – and when using them instead of direct measurement of knowledge is not a good option. To address this, I suggest adding the following paragraph to the Discussion: Moreover, it is important to note that using political interest (or any of the others) as a proxy for political sophistication does not allow distinguishing between different dimensions of political sophistication (see e.g. (35,36)) or examining in detail how sophistication relates to other similar constructs. Based on this study, it is only possible to suggest that political interest is slightly better than other closely related survey items as a substitute for direct measurement of political knowledge. It is best employed in situations where a simple measure of political knowledge is used as an explanatory variable or even as the dependent variable, but not in situations that involve a more complex conceptual design around the broader notion of political sophistication. Reviewer #2 also wonders about the novelty of the analysis, although from a slightly different perspective. What is new about this analysis, asks the reviewer, when there is already plenty of research that looks into the relationships between political knowledge and the proxies? It is true that there is research where, for example, some of the proxies are among the key independent variables used to explain political knowledge / sophistication. There is also some research, which focuses specifically on, for example, the relationship between (internal) efficacy and knowledge or on education and knowledge. These studies do not, however, examine the possibility of using variables included in this study as proxies for political knowledge. Consequently, previous literature does not include research designs that would 1) include all five of these variables in a comparative setting, 2) in order to assess them in terms of various types of validity 3) with the explicit aim to determine which of them could be used as a survey proxy for knowledge. Whereas previous literature approaches the same relationships from the perspective of how they relate to patterns of political behavior or political cognition, the current manuscript has a methodological approach, which also utilizes the rare possibility to measure several closely related concepts at the same time. To underline the main point here - the current study offers the first examination where a number of plausible proxies are included in the same analysis and evaluated explicitly as proxies, rather than included in regression models as predictors, control variables or closely related concepts. To me, as a (survey) scholar of political sophistication, the matter is significant enough to warrant publication (obviously), but I fully realize that this is debatable. I hope that the addition documented above and my response to the next comment have made the paper more convincing regarding this matter. 2. Connection to previous literature Moreover, two additional comments by Reviewer #1 also touch upon the same issue of contribution and the positioning of this paper in existing literature. Firstly, the reviewer points out that since Clifford and Jerit already seem to suggest a solution to online cheating, some more explaining is needed as to why this analysis is needed. Secondly, the reviewer correctly notes that the reference to Clifford and Jerit on lines 58-59 is not entirely compatible with what is said later on p. 7. The questions are fair and I thank the reviewer for locating these spots where I have been too imprecise in my recaps of Clifford and Jerit (whose study is an important predecessor for this submission). When it comes to the first point, Clifford and Jerit conclude that asking respondents to commit to not cheating is the *best* way to tackle cheating *among the ones they studied*. However, they find that the tendency to cheat varies between different types of individuals and samples so they end up recommending using aggregate-level assessments of cheating rather than individual-level. What is maybe most important, they concede that “At present, researchers do not have a definitive way to identify cheating.” (p. 874) Consequently, I would argue that in addition to using verbal commitments to honesty in online surveys, we need to also look at other possibilities, because 1) verbal commitments are not bulletproof and their effect varies across contexts and 2) we can never be absolutely sure how widespread cheating really is. To better explain this in the manuscript, I suggest the following addition (see lines 61-64): However, as Clifford and Jerit in particular demonstrate, cheating propensity varies across individuals and there is no waterproof way of identifying cheating on the individual-level (4), they suggest using (imperfect) aggregate-level measures to control for the effects of cheating. And the following paragraph (lines 65-67) now continues by pointing out what the contribution of the current study is, given the uncertainties revealed by previous research: So while tackling respondent dishonesty can make online measurement of political knowledge possible, it is an uncertain solution. This study explores another possibility– the use of a proxy measure. As for the second point, this was just careless on my part. The original lines 58-59 oversimplified what Clifford and Jerit actually find. This could be easily fixed by rewording the original lines 58-59 as follows: Clifford and Jerit demonstrate that asking respondents to commit to honesty alleviates, but does not eliminate, the distortion caused by cheating Additionally, the text added to fix point #1 (see above) further clarifies what the study by Clifford and Jerit really concludes. 3. The proxies included / excluded Reviewer #1 makes valid comments about the list of proxies analyzed in the paper. I fully understand the reviewer’s comment that there is no variable measuring (media) attention to politics. Admittedly frustrating, the measure that is available only measures the choice of media for consumption of political news, not intensity. As I have explained in the concluding discussion, there is no proper and commonly used measure of political attention available in the data. While it could have been possible to do something with the sub-optimal item that is available, I chose not to, because it was not in line with the overall design of the study. I believe that much of the strength of the current study lies in the fact that the proxy items are essentially identical with those commonly used in surveys everywhere. This makes the analysis useful for a broad audience of survey researchers and my concern is that using one measure, which deviates from all others in this crucial aspect, would be harmful to the study as a whole. To acknowledge this, a paragraph in the discussion comments on the absence of a media attention measure and agrees with the reviewer that political attention would have been on the list had a proper measure been available. The reviewer also makes valuable comments regarding party placement. I am fully aware that it is nowadays commonly accepted that there are other ideological dimensions in the (typical) political space in Western democracies besides left/right. In many countries, there is at least one other ideological dimension, a cultural dimension, which crosscuts the left/right and is therefore distinguishable from it. In this sense, only relying on left/right is, of course, insufficient. However, given the continued prominence of left/right as a dimension that structures democratic politics, it is handy for survey researchers, because it is familiar to most respondents and available in most surveys. Measure of the cultural dimension (or similar construct) are becoming more widely available, but unfortunately such a measure was not present in the data used here. The reviewer, on the other hand, contests whether we should not in fact consider placing parties on an ideological left/right spectrum as a direct indicator of knowledge, rather than a proxy. This is a fair question. Given that it is possible to think that parties have a ‘correct’ placement on an ideological dimension, it is also possible to see the question as a direct test of knowledge. While I readily admit that the party placement is very close to a direct knowledge measure, I would nevertheless defend considering it as a proxy for knowledge on two grounds. Firstly, in the case of multiparty contexts with some fluidity between parties’ ideological boundaries, not even experts agree to a 100 percent on the exact placement of all parties. While they will agree, also in the Finnish case, which parties are considered leftist and rightist, the evaluations are not identical. Secondly, in the survey(s), the question is not framed as a knowledge item, while the direct measurements of knowledge are. Party placement is framed as an expression of one’s view or understanding, not as a test of whether one knows where a party should be placed. Such a framing leads the respondent to reflect on the issue rather than look for an objectively correct response. Hence, respondents might express an opinion about where they think a party currently should be placed, because e.g. of a particular stand on a big, topical policy question, as opposed to where that same party historically, and in a more general assessment, should be placed. In other words, it seems likely that most respondents will approach this question as a matter of expressing a personal view or understanding, rather than as a matter of correct/incorrect. Additionally, as mentioned in the manuscript, respondents may even project their ideological self-identification on these responses. That said, I understand that party placements could, especially in majoritarian two-party contexts, be considered as a question of knowledge and nothing else. In fact, this was mentioned in the original submission where party placement is introduced as a proxy candidate. However, to further stress the point, I have now expanded this discussion, which now reads as follows: Although many respondents are likely to think of party placements as a matter of factual knowledge in the sense that some placements are more ‘correct’ than others, the question itself is nevertheless framed as a matter of opinion, not as a matter of knowledge, as political knowledge items are. However, placing parties on any ideological continuum also depends on the country context. In Finland, where there are several parties with somewhat fluid ideological boundaries, there is always room for disagreement about their correct placement. In similar cases of high fragmentation in the party system, party placements are likely to be good proxies for knowledge. However, in two-party systems, where there is little discussion about which of the parties is, e.g., more conservative versus liberal, party placement is more likely a direct measure of basic knowledge about the system, rather than a proxy for it. Consequently, usage of party placement as a knowledge measure is always context-dependent. 4. Information about the surveys Both reviewers were hoping for more basic information about the surveys. This is a reasonable request and I have added the following information to the Appendix: Data description 2008 data The following data description is from the Finnish Social Science Data Archive, where the data is deposited and available. The data identification code is FSD2499: Target population: Finnish citizens aged 18 or over living in Finland, excluding the Åland Islands Data collector(s): Taloustutkimus Mode of data collection: Face-to-face interview Sampling procedure: Probability; Stratified Quota sampling based on age, gender and municipality of residence. 2020 data The data is deposited in the Finnish Social Science Data Archive, from where it will be accessible for research. Target population: Finnish citizens aged 18 or over living in Finland, excluding the Åland Islands Data collector(s): Taloustutkimus Mode of data collection: Face-to-face interview Sampling procedure: Probability; Stratified Quota sampling based on age, gender and municipality of residence. There is no official documentation about response rates, but based on in-person exchanges with the survey company, there were typically 10 contacts made for each f2f-interview. This is a typical rate for such surveys in Finland. Moreover, as explained in the manuscript, the slight demographic skews in both data were corrected using post-survey weights. The weights were constructed using official population statistics from Statistics Finland. 5. Comments about the variables and scales Reviewer #1 wonders how the knowledge scale was constructed. The origins of the items date back to the planning of the 2008 data, which included approximately 40 political knowledge items. Those items were chosen roughly based on the same procedure used by Delli Carpini and Keeter (1996), who surveyed a large number of political scientist about what people should know about politics. The goal of the 2008 survey was to produce a comprehensive picture of what the Finnish electorate knows (for a description, unfortunately only in Finnish: Elo & Rapeli, 2008; see also Elo & Rapeli, 2010). Based on the reasoning about the dimensions of political knowledge (see Elo & Rapeli, 2010; Barabas et al. 2014), the 2020 data repeats parts of the 2008 knowledge items. As the figures below demonstrate, the distributions across the samples are similar and the sample means are almost identical. The knowledge scales, which have been calculated by simply adding up the correct answers, distinguish between individuals in terms of what they know about politics and the individual-level predictors of (high) knowledge are in both cases fully in line with canonical findings: male gender, age and education are all positively associated. To avoid harmful guessing behavior, ‘don’t know’ responses were not encouraged (Mondak, 2001). The reviewer also commented that partisanship and income could have been useful to include. This is true and something I also considered very carefully when planning the analyses. Both are indeed likely to be related to knowledge and its proxies. My reasons for choosing not to initially include them, and for suggesting they also now remain outside analyses, are twofold. Firstly, I wanted to only include control variables that are more or less universal across surveys that are conducted across democratic publics. There is a good deal of variation in how income is operationalized, e.g. as personal income, annual household income or subjective feeling about how well one can cope with current (household) income. Although I nevertheless admit that including an indicator of income, even if full concordance with other surveys might be lacking, could have been reasonable, my second reason made me lean towards excluding it. As it now stands, the analysis is already quite extensive with a high number of figures etc. My main concern has been not to cram too much into one paper, but instead conduct proper analyses with the most essential variables. In my reading of the literature, and with reference to the point made above, I chose to go with age, gender and education as the most important and suitable control variables. However, I fully understand that the optimal solution here is debatable. Additionally, the reviewer also asks whether combining the proxies through a factor score, instead of using them individually, would be feasible. Yes, this possibility occurred to me too. I originally left it out, because I was (again) worried about the scope of the analysis becoming too large and because all of the proxies are unlikely to be available in many surveys anyway. Hence, I thought, such an analysis would probably not be helpful to many readers. However, as the reviewer also suggested it, I thought it would be a good idea to include the analysis. I ran a similar analysis of predictive validity as the one presented in Figure 2. It shows that a factor score of the proxies is almost identical in predictive strength as political interest, political interest being marginally stronger. I now refer to this analysis in the body text and include the table in the Supporting information in order not to expand the main text too much. Here is the text added to the manuscript: Instead of using one of the proxies, they could all be combined into one factor, as suggested by the one-dimensional solution of the PCA reported in Table 2. All of the proxy candidates might not be available in all surveys, which makes this solution less likely to be practicable, but nevertheless worth exploring. As reported in Table S4 in the Supporting information, the factor score for the proxy candidates (PCA, Bartlett method for estimating factor scores) is almost as strong a predictor of political knowledge as political interest. It is therefore also a viable method for using a proxy for knowledge, but in terms of predictive validity, using only political interest is at least as good a solution. The table added to Supporting information: Table S4: Factor score1 of PCA with all proxies as the predictor of political knowledge Coef. SE t P>t 95% conf. interval Factor score .0801825 .0076128 10.53 0.000 .0652442 .0951208 Gender (male = 1) .0687076 .0120261 5.71 0.000 .0451094 .0923058 Education .063963 .0192175 3.33 0.001 .0262533 .1016726 Age .1948574 .0346664 5.62 0.000 .1268332 .2628815 Constant .4384795 .0265519 16.51 0.000 .3863779 .490581 1 Factor score for single factor derived from PCA including all the potential proxy measures. The coefficient is the z-standardized factor score, to ensure comparability with the other analyses. Observations = 1,043 F(4, 1038) = 72.71 Prob > F = 0.0000 Adj. R-squared = 0.2854 6. Lack of online/offline comparison Reviewer #2 makes a good point about the fact that the study cannot assess whether the use of proxies instead of direct measurement in online surveys is somehow beneficial, because the two cannot be directly compared. This is, of course, true. While it is regrettable that such a comparison is not possible given the research design, I would nevertheless argue that the current design has other, unusual benefits. The two surveys provide, according to my knowledge, the only existing survey data that allows testing the relationships between a sound knowledge measure and a number of potential proxies. Hence, the focus of the paper has been on this particular advantage. The reviewer is, however, correct in pointing out that we still do not know whether a proxy would work better in online settings than some type of an encouragement to refrain from cheating. I have added a paragraph discussing this and it reads as follows: Moreover, future scholarship should design online surveys that make it possible to compare whether knowledge proxies and items asking to commit to honesty produce similar findings. This would help bridge the gap between the previous literature looking at honesty commitment items and the current study, which examines the use of proxies. In this way, we might reach a better understanding of whether one method of indirect measurement of political sophistication is somehow superior compared with the other. Additionally, Reviewer #2 had some important, minor comments, which I have responded to as follows: 7. Differences in samples in online/offline surveys The reviewer suggests discussing the more general issue of how potential differences in respondent recruitment to f2f versus online surveys may affect the measurement of political knowledge. Although this might perhaps be slightly outside the core message of the article, it is certainly an important aspect that deserves to be mentioned. A recent study sheds some light into the matter. Cornesse et al. (2021) analyze whether different modes of recruiting to an online panel result in different samples in terms of key sociodemographic characteristics. The modes they study are online-only, concurrent mode, online-first, and paper-first. They conclude that all modes lead to samples that are similar when it comes to sociodemographic representation. Although this cannot be considered conclusive evidence, the findings by Cornesse et al. are cause for optimism that sociodemographic differences, which are the main drivers of sophistication, could in fact be equally accounted for in both online and offline survey settings. I see this issue as an issue that should be noted as a possible limitation to the reported findings and therefore suggest adding the following to the Discussion: A related, but much broader, question is whether recruiting people to online versus offline surveys produce different kinds of samples. If so, this could even affect the validity of any sophistication measure, depending on whether it is employed in an online or offline survey. Although recent scholarship suggests different modes of recruitment lead to similar samples (37), more research into the matter is still needed. 8. Political interest in non-election years and election years An excellent point by reviewer 2. It is indeed possible that the connection between political interest and knowledge might look different depending on whether it is measured in a survey that is conducted in conjunction with an election (e.g. national election studies) or in a non-election year (such as in this case). Although this is speculation, I also find it plausible that people could, for example, report higher levels of political interest during an election or immediately after it than during an off-year. Moreover, as the reviewer suggests, when political information is widely available during election time through the media, many people are likely more attuned to politics and perhaps even be better informed. To address this, I checked the Spearman correlation between a comparable 5-item political knowledge measure and an identical political interest measure in the latest Finnish parliamentary election survey data from 2019 (FNES 2019). It is a post-election survey, conducted immediately after the parliamentary elections, by the same survey company as the data used in this paper, using a similar sample and a similar weight to correct for sociodemographic imbalances. The FNES 2019 data is downloadable in English through https://services.fsd.tuni.fi/catalogue/FSD3467?lang=en&study_language=en. The correlation between knowledge and interest in the FNES 2019 data is .417, whereas it is .444 in this analysis (Table 1). So the correlations are almost identical, when using similar variables + samples, regardless of whether the measurement is from an off-election year or from a study conducted in conjunction with an election. This gives me confidence in the key finding in the paper regarding the suitability of interest as a proxy for knowledge. To explain this in the paper, I have added the following footnote: The relationship between knowledge and interest is stable across measurement during off-election years and in the context of a general election. While the data used in this study comes from an off-election year, the latest Finnish parliamentary election survey data from 2019 (FNES 2019) was collected immediately after the elections. It uses a similar sample, a comparable 5-item political knowledge measure and an identical political interest measure. The Spearman correlation between knowledge and interest in FNES 2019 is .417, which is nearly identical to the .444 correlation reported in this analysis (Table 1). 9. Non-linear relationship between education and the proxies The reviewer notes that some of the patterns between education and the proxies are likely due to non-linear relationships. I can only agree – it is often the case that people with a university degree are particularly distinguishable from other educational groups. 10. Measurement of internal political efficacy Yes, I fully agree that it would have been better to have more items that tap into the sense of internal political efficacy. Although nothing can be done at this stage, it may be worth noting that, on the positive side, the one item that is included is arguably the most relevant among the 2-4 standard items used to measure internal political efficacy and that the two surveys used identical wordings. 11. Test-retest reliability The reviewer points out that since the respondents were not the same in the two surveys, test-retest reliability cannot be examined. The reviewer is right, this was a sloppy formulation. I suggest changing the original sentence to the following, which I believe more accurately states what I think is relevant here: This allows the study to go beyond tests of validity and even assess whether the findings regarding the key proxies are consistent across two different measurements that are temporally quite far apart. References Barabas J, Jerit J, Pollock W, Rainey C. 2014. The Question(s) of Political Knowledge. American Political Science Review 108(4),840–855. Delli Carpini M X, Keeter S. 1996. What Americans Know About Politics and Why It Matters. Yale University Press: New Haven, CT. Elo, K, Rapeli, L. 2008. Suomalaisten politiikkatietämys [What Finns Know about Politics]. Edita Prima: Helsinki. Elo, K, Rapeli, L. 2010. Determinants of Political Knowledge: The Effects of Media on Knowledge and Information. Journal of Elections, Public Opinion and Parties 20(1), 133 – 146. Mondak J. 2001. Developing Valid Knowledge Scales. American Journal of Political Science 45(1), 224-238. Cornesse C, Felderer B, Fikel M, Krieger U, Blom AG. 2021. Recruiting a Probability-Based Online Panel via Postal Mail: Experimental Evidence. Social Science Computer Review (Online First). doi:10.1177/08944393211006059 Submitted filename: Response to Reviewers.docx Click here for additional data file. 21 Jul 2022 What is the best proxy for political knowledge in surveys? PONE-D-22-05474R1 Dear Dr. Rapeli, We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements. Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication. An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org. If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. Kind regards, Sean Richey Academic Editor PLOS ONE Additional Editor Comments (optional): Reviewers' comments: 11 Aug 2022 PONE-D-22-05474R1 What is the best proxy for political knowledge in surveys? Dear Dr. Rapeli: I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department. If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org. If we can help with anything else, please email us at plosone@plos.org. Thank you for submitting your work to PLOS ONE and supporting open access. Kind regards, PLOS ONE Editorial Office Staff on behalf of Dr. Sean Eric Richey Academic Editor PLOS ONE

2 in total

1. PageFocus: Using paradata to detect and prevent cheating on online achievement tests.

Authors: Birk Diedenhofen; Jochen Musch
Journal: Behav Res Methods Date: 2017-08

2. Exploring the Sensitivity of Horn's Parallel Analysis to the Distributional Form of Random Data.

Authors: Alexis Dinno
Journal: Multivariate Behav Res Date: 2009-05 Impact factor: 5.923

2 in total