Literature DB >> 31557209

Analyzing a networked social algorithm for collective selection of representative committees.

Alexis R Hernández¹, Carlos Gracia-Lázaro², Edgardo Brigatti¹, Yamir Moreno^2,3,4.

Abstract

A recent work (Hernández, et al., 2018) introduced a networked voting rule supported by a trust-based social network, where indications of possible representatives were based on individuals opinions. Individual contributions went beyond a simple vote-counting and were based on proxy voting. This mechanism selects committees with high levels of representativeness, weakening the possibility of patronage relations. By incorporating the integrity of individuals and its perception, we here address the question of the resulting committee's trustability. Our results show that this voting rule provides sufficiently small committees with high levels of representativeness and integrity. Furthermore, the voting system displays robustness to strategic and untruthful application of the voting algorithm.

Entities: Chemical Disease Gene Species

Year: 2019 PMID： 31557209 PMCID： PMC6763197 DOI： 10.1371/journal.pone.0222945

Source DB: PubMed Journal: PLoS One ISSN： 1932-6203 Impact factor: 3.240

Introduction

The form of citizen participation in contemporary and complex democracies is a central issue in social debate. Many transformations and possible innovations have been recently discussed [2-4], which redesign our interactions in politics and society, often forced by the widespread use of digital technologies. A general problem, which ranges from national to neighborhood scales, is the problem of selecting an exemplary group of representatives to make decisions on behalf of the community [5-7]. Despite the prolific theoretical and philosophical debate over these issues [8, 9], examples of empirical construction of new algorithms have been relatively limited [1, 10–12]. Recently, Hernandez et al. introduced a new social algorithm for collective selection of a committee of representatives [1]. The algorithm is developed starting from a standard situation where each voter is allowed to vote for only one candidate. However, the elected representatives are the ones who obtain a better rank among their counterparts, in a way that individual contributions go far beyond a simple vote-counting. The introduced formal algorithm presents new specific features which could improve governance legitimation and fairness. The lists of candidates are not fixed in advance, but they emerge as a self-organized process controlled by the voting rules. This fact introduces an effective participation and engagement of the whole community, in contrast to top-down candidate rigid lists. The voters express not preferences, but opinions, which determine their indications about whom they would like to see as their representatives. Finally, the new proposed mechanism improves the committee representativeness, weakening the possibility of patronage and clientelism relations. Additionally, the vote aggregation mechanism is supported by a self-declared confidence circle, which defines a network of trusted individuals. This trust-based social network, which can be implemented on an online platform, is a fundamental ingredient that allows for direct accountability of the elected committee. Even if based on a local network, it can naturally scale to national sizes, translating to those larger scales an effective accountability typical of small-sized communities. In this work, we analyze a new aspect that can be introduced in the original algorithm. Specifically, we incorporate the possibility of a form of direct choice of individuals over the possible elected representatives. Hence, we mitigate the aspect that voters determine their indications about whom they would like to see as their representative through opinions, valuing the principle that individuals directly select candidates. This new ingredient is implemented by introducing a declared preference among the contact network of individuals. Preferences act as a weight on the original opinion-based ranking algorithm in such a way that higher rates for these preferences are assigned to individuals considered more apt to participate in the committee. The described mechanism improves the legitimation, fairness, and effectiveness of the committee. In fact, overlaps, which are not controlled by voters, are weighted by a term subjectively assigned by the individuals. This weight should encourage a check on incompetence and corruption: Incompetence because an equal say for every individual is not necessarily always desired; Corruption as the preference should be proportional to the person who demonstrates and promises true integrity (sound ethical principles and trust). As each voter knows their representatives and each committee member knows whom he is accountable to, this fact allows for a strong control over representatives’ actions. The purpose of this work is to present and characterize in depth the new social algorithm throughout computational analyses. In Sec. II we describe the details of the algorithm. Sec. III is devoted to test the new voting rule, modeling the behavior of the selected committee. The quality of the elected committee is assessed looking at how much their final decisions are consistent with the community’s personal opinions and estimating the general integrity of the elected committee. Finally, Sec. IV presents some discussions of our results and concluding remarks.

The model

Let us assume a system composed by N electors interacting on an internet-based platform. The platform allows the voters to declare who belongs to their interaction circle, rendering a network of well-known individuals. Voters also declare their perception of integrity for each individual k belonging to their interaction circle. This perception is condensed in a scalar value I ∈ [0, 1], which represents the perception that individual j has about the integrity of individual k. In a following step, voters manifest their opinions on N issues. Issues are organized in questions which can be defined by a committee or by means of a self-organized process internal to the community. The answers of each individual j are organized in a vector v, composed by N cells. Each cell assumes the value 1 for a positive answer, −1 for a negative one or 0 for a question left unanswered. Given the previous steps, the representative of a given individual j is selected by means of the following algorithm. The vector’s overlap of each individual j with all his neighbors k is computed through the following expression [1]: where the numerator counts the number of questions answered in the same way (only yes or not) and the denominator counts the number of questions answered simultaneously by both individuals; δ stands for the Kronecker delta which is 1 if and 0 otherwise. Then, we calculate the product of the previously defined overlap with the variable I (i.e., the integrity of k as perceived by j), obtaining the ranking function: The introduction of the term I establishes a form of direct choice of the individual j over the possible elected representative. Overlaps, which are not controlled by voters, are weighted by a term subjectively assigned by the individuals. Notice that we are simply considering the term I associated to each agent k. Instead, we can consider a statistical measure over the social circle of k to obtain a more precise evaluation of k’s integrity. However, this will introduce an external interference in j’s choice which can be undesirable in a democratic process. Finally, each individual j will indicate as his representative the individual k′ for which R is maximum. In the case where the same maximum value is shared by more than one individual, the one with a higher connectivity is selected. For the exceptional case of equal connectivity, the representative is randomly chosen between the equivalent ones. The introduction of the perception of integrity and its use in the evaluation of the ranking function is the principal novelty of this work in relation to the original algorithm of the voting rule introduced in [1]. After the selection of the representative k′ for every voter j, the final step consists of choosing the aggregate of representatives for the entire community. To this end, we construct a directed graph, which we call the representative graph, where a node represents each individual and a directed link connects the individual with his personal representative. In this graph, which in general is composed by different disconnected clusters, cycles are present. These cycles represent individuals that have been mutually indicated by themselves. Technically the representative graph is a directed graph with out-degree 1. It is composed of disconnected components, each one formed by a cycle with trees attached to the cycle nodes (see Fig 1). Considering a transitivity process, votes flow through the trees until they get to the cycles. Hence, the cycles’ individuals are proper potential representatives for the community.

Fig 1

Schematic representation of the vote process.

Schematic representation of the vote process.

Nodes stand for the individuals; the red ones belong to a cycle and will be confirmed as representatives if they collect more votes than the established threshold. The big numbers associated to the nodes represent the received cumulated votes. Arrows stand for the indication of each individual and the small numbers associated to them represent the number of transferred votes. Dotted arrows belong to a cycle where there is no cumulative vote transfer. Figure extracted from Ref. [1]. As a final step, among the individuals belonging to a cycle, only the ones with a number of votes larger than a threshold Θ are indicated as representatives. Votes are counted considering the cumulative flow defined by the directed graph: If the individual j is pointing to z, z receives all the votes previously received by j plus one. This flow of votes is computed only following links outside the cycles. Inside the cycles, only the single vote of an individual is counted. To sum up, the votes v received by an individual i inside a cycle are equal to: where G(i) is the set of all the trees ending at node i and l is the number of links of the tree t. Based on this score, the number of representatives is reduced and results in a fraction of the total number of individuals that belongs to the cycles.

Results and discussion

In our simulations, each individual is assigned an intrinsic integrity i, which is a number uniformly distributed in the interval [0, 1]. The perceived integrity I corresponds to i shifted by the error in the perception that j has on the integrity of individual k, which is modeled by a scalar δi drawn from a Gaussian distribution N(0, σ). In order to keep I ∈ [0, 1], I values greater than 1 are set to 1 and negative values are set to 0: I = max[min(i + δi, 1), 0]. On the other hand, the individuals’ opinions in relation to the selected issues are randomly generated with the following rule: given an issue i, an individual does not have an opinion (v = 0) with probability 1/3. The probability to have an opinion v = +1(−1), is 1/3 + ϵ (1/3 − ϵ), where ϵ is a random variable following a normal distribution with mean value equal to zero and σ2 = 0.05. The interaction circles are modeled by generating a network where nodes represent individuals and links the social relationships present in the community. The interaction circle of an individual is obtained selecting a node and considering its first neighbors. Note that an important simplification of this approach is the fact that it generates individuals with symmetric social relationships. In the following analysis three types of networks are considered. Homogeneous random networks, implementing the Erdös-Rényi model [13], where the degree distribution is peaked around a typical value 〈k〉; heterogeneous networks, using the Barabási-Albert model [14], with a power-law degree distribution P(k) ∝ k−3; and the so-called small-world Watts and Strogatz network model [15]. Our aim is not to model specific aspects of a real social network, but to use simple examples just to discuss the possible influence of some relevant network properties on the behavior of our model (such as the heterogeneity in the degree distribution, the average degree and the small-world property). The system can be characterized by three observables: The normalized committee size, which is the ratio between the number of elected individuals (E) and the total number of individuals of the community: F = E/N. The representativeness R, which is measured by calculating the fraction of decisions expressed by the elected committee (e) which matches with the community decisions (c) over all the considered N issues: . The committee’s decisions are attained by means of a majority vote where each representative’s vote is weighted by the number of collected votes during the election procedure. The community decision corresponds to the result of a plebiscite, where every individual votes follow the opinion expressed in his vector v (no opinion corresponds to abstention). For R = 1 a perfect representativeness is obtained: a committee makes all the decisions in line with the popular will. On the opposite, for binary decisions, R = 1/2 corresponds to a non-representative committee, whose decisions are completely uncorrelated to the popular will. A useful observable is 1 − R, which measures how far the system is from the perfect representativeness. This quantity is particularly interesting because, for the original model without integrity [1], it presents a simple and robust relation with F: The integrity I which is the mean value of the intrinsic integrity i of the individuals selected for the committee. We perform our analysis varying the value of the threshold Θ, such as to obtain committees of relatively small size but with a high representativeness level—close to 0.9—(see [1] for details). In order to explore the relation between committee size and representativeness we plot the representativeness versus the normalized committee size. As can be seen the logarithmic plot of 1 − R versus the normalized committee size, F (Fig 2), the introduction of the integrity parameter has a marginal impact on relation 4. Only for higher values of F, which are unpractical, a slightly worse representativeness in relation to the classical algorithm is perceived. As for the classical algorithm, for fixed R, the normalized committee size increases with the number of issues. The integrity behavior as a function of F has a quite simple response: it shows very high values and a final abrupt drop for large committee size. This is due to the probability for lower integrity individuals to obtain the necessary amount of votes to be elected becoming relevant. The dependence on N is weak and establishes a trade-off between Representativity and Integrity. More issues make the overlap less relevant in the computation of R improving the integrity at the expenses of the representativity.

Fig 2

Top: On the left, logarithmic plot of 1 − R versus normalized committee size for the NVR proposed in [1] (dark blue) and the one proposed here (NVR, light blue) with Ni = 40. On the right, 1 − R versus normalized committee size for different N values. Bottom: Representativity (left) and Mean Committee Integrity (right) as a function of normalized committee size. We consider a Erdös-Rényi network with N = 10000, 〈k〉 = 40 and . Results are averaged over 100 different realizations. The dependence of the above observables with the system size N (Fig 3) shows that the latter has an impact on the representativeness but not on the integrity behavior. In fact, as it was the case in the original model, when fixing R the committee size decreases for larger system sizes. For example, for the parameters used in Fig 3, a representativity of 0.9 corresponds to a committee of 78 members for a community of 2500 individuals, and to 36 representatives for N = 40000. Furthermore, as can be seen in Fig 4, the error in integrity perception, which is controlled by the parameter σ, has no effect on the representativeness. In contrast, it obviously affects the committees’ integrity. The plateaux values of I decrease with σ, following a simple linear dependence on this parameter. Higher values of errors in the integrity perception correspond linearly to worse values in the integrity selection (see inset in Fig 4).

Fig 3

Fig 4

Logarithmic plot of 1 − R versus normalized committee size (left). Mean Committee Integrity as a function of normalized committee size (right). In the inset we show the linear behavior of I with σ (I = −0.34σ + 0.98). We consider a Erdös-Rényi network with N = 10000, N = 40, σ2 = 0.05 and 〈k〉 = 40. Results are averaged over 100 different realizations.

Logarithmic plot of 1 − R versus normalized committee size (left). Mean Committee integrity as a function of normalized committee size (right), for different numbers of electors N. We consider a Erdös-Rényi network with N = 10000, N = 40 and . Results are averaged over 100 different realizations. Logarithmic plot of 1 − R versus normalized committee size (left). Mean Committee Integrity as a function of normalized committee size (right). In the inset we show the linear behavior of I with σ (I = −0.34σ + 0.98). We consider a Erdös-Rényi network with N = 10000, N = 40, σ2 = 0.05 and 〈k〉 = 40. Results are averaged over 100 different realizations. In Fig 5, we can see that the representativeness is not strongly dependent on network connectivity. For sufficiently high 〈k〉, the curves show the same behavior. The heterogeneity in the degree distribution of the network marginally impacts the results. For high values of F, the Barabási-Albert network performs moderately worse than Erdös-Rényi’s. As in the case of the original algorithm, higher connectivity generates a small bias in the selection of the more representative individuals. In contrast, the small world property of the Watts-Strogatz network positively influences the algorithm, allowing for slightly better results in terms of representativeness. This last behavior is more pronounced than in the case of the original algorithm. We have also analyzed the impact that the presence of community structure can have on the outcome of the committee selection algorithm. We implemented the stochastic block model [16], varying the intra- and inter-community link densities and the number of communities, and we were not able to evidence relevant trends on the general results of our algorithm (see the Supplementary Material for more details).

Fig 5

Top: Logarithmic plot of 1 − R versus normalized committee size (left) and Mean Committee Integrity as a function of normalized committee size (right) for different values of 〈k〉. Bottom: Logarithmic plot of 1 − R versus normalized committee size (left) and Mean Committee Integrity as a function of normalized committee size (right) for different network topologies. We consider a Erdös-Rényi network with N = 40, 〈k〉 = 40 and . Results are averaged over 100 different realizations. In addition to the so far discussed synthetic networks, we have tested our voting rule via some data-driven simulations where real social networks are taken as an underlying structure and the committee selection process is implemented on the top of these real networks. We used collected data from the music streaming service Deezer, collected at November 2017 [17, 18]. This dataset represents the friendships networks of users from three European countries. Nodes represent the users and edges are the mutual friendships. We used the data relative to Croatia. They contain 54573 users and 498202 friendship relations, which correspond to 〈k〉 ≈ 9, a reasonable mean connectivity. We also consider data from the free on-line social network Orkut. Orkut allows users to form groups which external members can join. We used data collected by Alan Mislove et al. [17, 19]; they contain 3072626 nodes and 11718083 edges (〈k〉 ≈ 38). In this two scenarios, we interpret the friendship relations as a social tie, which renders a network of well-known individuals based on these real data. The integrity and the opinion vector of each individual are synthetically generated following the previously implemented rules. Finally we simulated the voting process on these real social networks, in which the community elects a committee. Results, displayed in Fig 6, appear to be similar to the ones obtained with synthetic networks.

Fig 6

Top: On the left, logarithmic plot of 1 − R versus committee size. On the right, R versus committee size. Bottom: Commitee Integrity as a function of R (left) and committee size (right). We consider the two social networks, Deezer and Orkut. Results are averaged over 100 different realizations; . In order to compare our model’s behavior with other traditional methods of representatives selection, we analyze the representativeness and the integrity of equally sized committees. A widespread method is a traditional majority voting rule (TMV) for electing representatives in a closed list of previously determined candidates. In our implementation, a list of Nc candidates in the community is randomly selected and each individual j votes for the candidate who presents the higher R value (k* belongs to the list of Nc candidates). Note that in this case the integrity evaluation is influenced by errors in perception. Decisions are taken with the same weighted voting rule. This modeling approach mimics a voter who has a perfect knowledge of the candidates, assuming he makes a rational decision to maximize his representation. For this voting rule, representativeness is also computed by comparing the decisions taken by the committee, obtained with a weighted majority voting process, with the results of direct popular vote. As can be appreciated in Fig 7, our model is by far more efficient, halving the committees’ size and showing a better selection of representatives integrity.

Fig 7

Top: Representativity versus normalized committee size (left), logarithmic plot of 1 − R versus normalized committee size (right). Bottom: Mean Committee integrity as a function of normalized committee size (left) and Representativity (right). The results correspond to the Networked Voting Rule (NVR), the Traditional Majority Voting (TMV) and a Perfect Voting Rule (PVR) with parameters N = 10000, N = 40 and . For the NVR we use a Erdös-Rényi network with 〈k〉 = 40. For the TVR we set the initial number of candidates N = 100. Results are averaged over 100 different realizations. See the main text for a detailed explanation of the voting rules. Finally, we compare our method to an idealized perfect voting rule (PVR). This rule represents a situation of rational individuals that have a perfect knowledge of all others and their opinions. Moreover, they are globally networked, having direct access to all others and allowing their acts to be checked. In this situation, a voter indicates the individual with the highest overlap with his opinion vector and the best integrity (the higher R value). Note that in this case the evaluation of the integrity is not influenced by errors in perception. The selected committee is formed by the first F ⋅ N individuals which poll more. In this case, the committee decisions are also taken as a weighted majority vote. This voting rule, although unrealistic, is useful in at least two respects: First, very small communities can exhibit similar characteristics; Second, the model is a useful yardstick for evaluating the levels of representativeness of more realistic models. The relation between representativeness and committee sizes can be compared also in the case of the PVR rule (see Fig 7). It is quite impressive that the representativeness of our networked voting rule is comparable with the perfect one. The PVR rule is able to select a committee with a higher integrity score, but this is only possible because in this situation the integrity of every member is tested, and not only the integrity of a small subset, as it happens for our networked rule. We conclude our analysis testing the resilience of our networked voting rule to possible attacks. We consider the situation in which a group of voters decides to assign high I scores to some individuals who, in contrast, are characterized by a low personal integrity. This behavior can model patronage and clientelism relations, where individuals with low integrity organize a network of social relationships for obtaining political support. In our model this behavior can be modeled introducing a percentage of individuals p for whom I = 1 − I. As can be seen in Fig 8, representativeness is not seriously affected by this action. In contrast, the integrity of the elected committee is strongly influenced by this ill behavior. By fixing the normalized committee size F, the integrity undergoes an abrupt transition from high values towards very low values as p increases, see Fig 9. Finally, we analyzed if the presence of individuals who refuse to join the elected committee can impact our results. Even if 80% of the population do not accept to be elected, results are substantially unaltered. Similarly, allowing individuals to vote for themselves does not have a significant impact on our voting rule (see the Supplementary Material for more details on these points).

Fig 8

Representativity versus normalized committee size (left), Mean Committee integrity as a function of normalized committee size (right) for different values of p (the percentage of individuals with a distorted integrity perception).

We consider a Erdös-Rényi network with N = 10000, 〈k〉 = 40 and . Results are averaged over 100 different realizations.

Fig 9

Mean Committee Integrity for a normalized committee size F = 0.005 (I) as a function of the percentage of individuals with a distorted perception of the integrity (p).

We consider a Erdös-Rényi network with N = 10000, 〈k〉 = 40 and . Results are averaged over 100 different realizations.

Representativity versus normalized committee size (left), Mean Committee integrity as a function of normalized committee size (right) for different values of p (the percentage of individuals with a distorted integrity perception).

We consider a Erdös-Rényi network with N = 10000, 〈k〉 = 40 and . Results are averaged over 100 different realizations.

Mean Committee Integrity for a normalized committee size F = 0.005 (I) as a function of the percentage of individuals with a distorted perception of the integrity (p).

We consider a Erdös-Rényi network with N = 10000, 〈k〉 = 40 and . Results are averaged over 100 different realizations.

Conclusion

We analyzed a new voting algorithm, particularly well suited for online social networks, for selecting a committee of representatives with the aim of enhancing the participation of a community both as electors and as representatives. This voting system is based on the idea of transferring votes through a path over the social network (proxy-voting systems). Votes are determined by an algorithm which weights the similarity of individuals opinions and the trust between individuals directly connected in a specific social network. Our computational analyses suggests that this voting algorithm can generate high representativeness for relatively small committees characterized by a high level of integrity. Results of representativeness and integrity are comparable with a theoretically defined perfect voting rule and, in general, perform better than a traditional voting rule with a closed list of candidates. The introduction of a term which expresses the trust on the candidate’s integrity does not significantly impact the representativeness of the committee, in particular for committees of small and medium sizes. The rule shows a robust dependence on community size. Besides, the perception of individual integrity directly influences the committee’s quality: higher error values in the integrity perception linearly correspond to poorer values in the committees’ integrity. On the other hand, representativeness is not strongly influenced by integrity perception. Interestingly enough, these findings are not strongly dependent on the general properties of the network used to describe the community of voters, as shown by the analysis of networks characterized by different topologies. Finally, the voting system seems robust to strategic and untruthful application of the voting algorithm. In fact, even with a 20% of the votes produced by individuals which vote for candidates with a low personal integrity, the integrity of the committee is substantially unaltered, and only if unfair votes are around 40% an abrupt change is observed. In conclusion, we believe that the proposed voting rule, which fixes a particular way for the voters to express their preferences and defines a clear algorithm for determining the final identification of the committee, could be implemented in practice. If our results are confirmed in such hypothetical scenario, the algorithm discussed here will define an efficient form of democracy by delegation based on proxy voting [12], which robustly shows a high level of representativeness and integrity of the selected committee. These results are important improvements over the original voting rule introduced in [1], as by incorporating the integrity of individuals and its perception, we can address the important problem of the committee’s trustability without compromising the high level of representativeness already shown by the original algorithm.

Supplementary Material: Analyzing a networked social algorithm for collective selection of representative committees.

(PDF) Click here for additional data file. 18 Jul 2019 PONE-D-19-14447 Analyzing a networked social algorithm for collective selection of representative committees PLOS ONE Dear Dr. Hernández, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. ============================== ACADEMIC EDITOR: Please insert comments here and delete this placeholder text when finished. Be sure to: Indicate which changes are required versus recommended for acceptance Address any conflicts between the reviews Provide specific feedback from your evaluation of the manuscript ============================== We would appreciate receiving your revised manuscript by Aug 25 2019 11:59PM. When you are ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols Please include the following items when submitting your revised manuscript: A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). This letter should be uploaded as separate file and labeled 'Response to Reviewers'. A marked-up copy of your manuscript that highlights changes made to the original version. This file should be uploaded as separate file and labeled 'Revised Manuscript with Track Changes'. An unmarked version of your revised paper without tracked changes. This file should be uploaded as separate file and labeled 'Manuscript'. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. We look forward to receiving your revised manuscript. Kind regards, Ginestra Bianconi Academic Editor PLOS ONE Journal Requirements: Additional Editor Comments (if provided): [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes ********** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes ********** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes ********** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes ********** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: I have read carefully the paper entitled as “Analyzing a networked social algorithm for collective selection of representative committees“ by Hernández et al., submitted to PLoS ONE for publication. The paper builds on an earlier work published by the same set of authors recently [1], which proposes an algorithm to construct representative committees using personal and collective preferences in networked populations. In this paper they extend their earlier findings by considering committee’s trustability. I have some comments what I suggest to the authors to consider: - Figure 1 has been already published in [1] and this fact is not acknowledged in the actual manuscript. - It is not clear what is novel algorithmically and in terms of results as compared to the pervious paper of the authors [1]. Please highlight. - Their method assumes global network knowledge about the underlying social network what is commonly unknown. The authors (implicitly) argue that this is not a problem as their method is meant for online social networks where social ties are mapped with high precision. However, it is usually not the case as (a) detailed online social network data is not available but only for the provider, (b) it may contain several non-real social ties and non-human actors (e.g. bots), and (c) it may not capture all social ties (e.g. offline relationships) which at the same time might be important for opinion formation. I would suggest to the authors to address these questions and show how the outcome of their process is changing by assuming incomplete knowledge about the network structure. - In page 4 the authors explain that a representative committee can be selected in two steps: first identifying cycles in the representative graphs and then by thresholding to select people by the number of votes they gained in their downstream tree structure. In my opinion, it is a possible scenario that a group of people agree in advance to bias the first step of this process to vote such that they form a cycle. This way they would increase the probability that some of them will be selected from the cycle in the committee. The authors address resilience issues in the end of the manuscript (starting from line 218) but miss to address the problem when fraud is not individual but organised between a larger group of people. - In the paragraph starting from line 119 the authors discuss that they tested their algorithm on three conventional network models while they were concentrating on the dependencies of the selection outcome on generic network properties like degree heterogeneity, average connectivity, or shortest paths. One important characteristic missed here is community structure, which can largely influence the outcome of the committee selection algorithm. I would suggest to the authors to use one of the many (Planted L-partition model, NG benchmark, LFR benchmark) community network model to test the effect of intra/inter community link density on the outcomes. - For validation purposes it would be necessary that the authors explore their model via data-driven simulations where they take a real social network as an underlying structure and model the committee selection process on the top. Simulating the process only on synthetic overly simplified network models is important for exploration but may provide results far from reality. Typos: Erdos -> Erd\\H{o}s Barabasi -> Barab\\'asi ********** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files to be viewed.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please note that Supporting Information files do not need this step. 29 Aug 2019 Dear Editor, Please, find enclosed a revised version for our manuscript "Analyzing a networked social algorithm for collective selection of representative committees", co-authored by Alexis R. Hernández, Carlos Gracia-Lazaro, Edgardo Brigatti and Yamir Moreno, which we are resubmitting for publication in Plos One. We would like to thank the reviewer for his/her favorable opinion on our work as well as for providing very useful feedback that has led to an improved version of our contribution. Aside from the necessary modifications to the manuscript, which we detail below, we are enclosing here a detailed response to the reviewer's comments. We have gone through all his/her points and suggestions and we believe he/she will be satisfied with our answers and with the corresponding improvements to the manuscript. We also provide below a list of changes, summarizing the modifications to the manuscript, which are described in more detail in the answers to the reviewer. This manuscript describes original work and is not under consideration by any other journal. All authors approved the new version of the manuscript and this submission. Yours sincerely, Alexis R Hernández, on behalf of all authors. Instituto de Física, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil. List of Changes (summary): • In line 85, we clarify the algorithmic contribution of the paper. • In Fig. 1 caption we indicate that figure 1 was extracted from Ref. 1. • In line 189 we mention the results obtained for modular networks. • In line 196 we describe the results considering real networks. These results are reported in the new Fig. 6. • In line 253 we slightly modified the text and figures citation. • In line 258 we point out the results related to unavailable representatives. • In line 261 we point out the results related to self-declared candidates. • In line 295 we clarify the main contribution of our work. • In line 301 we introduce the supplementary information section. • We add a reference on block models (Ref. 16). • Suggested and other typos were corrected. Answer to Reviewer #1 We thank very much the reviewer for his/her very positive assessment of our manuscript and the feedback provided. Please, find below a detailed response to all comments received as well as the changes we have done to implement your suggestions whenever needed. Reviewer #1: I have read carefully the paper entitled as “Analyzing a networked social algorithm for collective selection of representative committees“ by Hernández et al., submitted to PLoS ONE for publication. The paper builds on an earlier work published by the same set of authors recently [1], which proposes an algorithm to construct representative committees using personal and collective preferences in networked populations. In this paper they extend their earlier findings by considering committee’s trustability. I have some comments what I suggest to the authors to consider: - Figure 1 has been already published in [1] and this fact is not acknowledged in the actual manuscript. R: Thanks for pointing out this issue, we corrected that in the revised version. - It is not clear what is novel algorithmically and in terms of results as compared to the previous paper of the authors [1]. Please highlight. R: The new ingredient in the algorithm appears in equation (2) where the overlap is multiplied by the perceived integrity. In the previous paper, the transfer of votes depends only on opinion’s overlap. The introduction of this term allows us to study the performance of the algorithm with respect to the committee’s integrity. We write it explicitly on the revised version in order to make this clearer. - Their method assumes global network knowledge about the underlying social network what is commonly unknown. The authors (implicitly) argue that this is not a problem as their method is meant for online social networks where social ties are mapped with high precision. However, it is usually not the case as (a) detailed online social network data is not available but only for the provider, (b) it may contain several non-real social ties and non-human actors (e.g. bots), and (c) it may not capture all social ties (e.g. offline relationships) which at the same time might be important for opinion formation. I would suggest to the authors to address these questions and show how the outcome of their process is changing by assuming incomplete knowledge about the network structure. R: Thanks for pointing out this very important issue. In order to illustrate the robustness of the algorithm, we ran new simulations where we considered a fraction of users as unavailable (in the sense of participating as representatives). In practice, it washes out the participation of a fraction of the committee representatives with their respective voting trees. Our results show that even for very high values of this fraction (0.8) the algorithm performs pretty well. These results makes it possible to reasonably think that incomplete knowledge of the network structure would not jeopardize our conclusions. We mention these results in the body of our manuscript and include figures and description in the new supplementary material. - In page 4 the authors explain that a representative committee can be selected in two steps: first identifying cycles in the representative graphs and then by thresholding to select people by the number of votes they gained in their downstream tree structure. In my opinion, it is a possible scenario that a group of people agree in advance to bias the first step of this process to vote such that they form a cycle. This way they would increase the probability that some of them will be selected from the cycle in the committee. The authors address resilience issues in the end of the manuscript (starting from line 218) but miss to address the problem when fraud is not individual but organized between a larger group of people. R: Thanks for raising this interesting question. To address this issue we considered a slightly different situation. We introduced a fraction of individuals who refuse to transfer their votes, making them self-candidates to the resulting committee. Again we observe the system to be very robust and even for a very high fraction of self-declared individuals (0.8), we find it to behave properly, achieving committees with high levels of representativity and integrity. These results are also included in the new supplementary material and commented on the body of our manuscript. - In the paragraph starting from line 119 the authors discuss that they tested their algorithm on three conventional network models while they were concentrating on the dependencies of the selection outcome on generic network properties like degree heterogeneity, average connectivity, or shortest paths. One important characteristic missed here is community structure, which can largely influence the outcome of the committee selection algorithm. I would suggest to the authors to use one of the many (Planted L-partition model, NG benchmark, LFR benchmark) community network model to test the effect of intra/inter community link density on the outcomes. R: Thanks again, this is also a very interesting point. In order to answer this question, we generated random networks with a specific number of communities and modularity. The results are described in the new supplementary material and commented on the body of the article. Again the system presents a robust behavior allowing the conformation of a committee with high values of representativity and integrity even for highly modular networks. - For validation purposes it would be necessary that the authors explore their model via data-driven simulations where they take a real social network as an underlying structure and model the committee selection process on the top. Simulating the process only on synthetic overly simplified network models is important for exploration but may provide results far from reality. R: Thanks again. We ran new simulations considering the network structure of two online social networks: one from a music streaming service (Deezer) and the other from a free online social network (Orkut). In both cases the results are consistent with the ones obtained for synthetic networks. We include these results in the manuscript. Typos: Erdos -> Erd\\H{o}s Barabasi -> Barab\\'asi R: Typos where corrected, thank you very much for pointing out. Submitted filename: reply5.pdf Click here for additional data file. 11 Sep 2019 Analyzing a networked social algorithm for collective selection of representative committees PONE-D-19-14447R1 Dear Dr. Hernández, We are pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it complies with all outstanding technical requirements. Within one week, you will receive an e-mail containing information on the amendments required prior to publication. When all required modifications have been addressed, you will receive a formal acceptance letter and your manuscript will proceed to our production department and be scheduled for publication. Shortly after the formal acceptance letter is sent, an invoice for payment will follow. To ensure an efficient production and billing process, please log into Editorial Manager at https://www.editorialmanager.com/pone/, click the "Update My Information" link at the top of the page, and update your user information. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org. If your institution or institutions have a press office, please notify them about your upcoming paper to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, you must inform our press team as soon as possible and no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. With kind regards, Ginestra Bianconi Academic Editor PLOS ONE Additional Editor Comments (optional): Reviewers' comments: 13 Sep 2019 PCOMPBIOL-D-19-00976R2 Analyzing a networked social algorithm for collective selection of representative committees Dear Dr. Hernández: I am pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department. If your institution or institutions have a press office, please notify them about your upcoming paper at this point, to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org. For any other questions or concerns, please email plosone@plos.org. Thank you for submitting your work to PLOS ONE. With kind regards, PLOS ONE Editorial Office Staff on behalf of Dr. Ginestra Bianconi Academic Editor PLOS ONE

3 in total

Analyzing a networked social algorithm for collective selection of representative committees.

Introduction

The model

Schematic representation of the vote process.

Results and discussion

Representativity versus normalized committee size (left), Mean Committee integrity as a function of normalized committee size (right) for different values of p (the percentage of individuals with a distorted integrity perception).

Mean Committee Integrity for a normalized committee size F = 0.005 (I) as a function of the percentage of individuals with a distorted perception of the integrity (p).

Conclusion

Supplementary Material: Analyzing a networked social algorithm for collective selection of representative committees.

1. Emergence of scaling in random networks

2. Collective dynamics of 'small-world' networks.

3. A networked voting rule for democratic representation.