Submitted:
13 November 2024
Posted:
13 November 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Description of Data
3. Review of the Statistical Models
3.1. Conditional Inference Tree
3.2. Boruta Models
- ○
- Boruta begins by creating shadow or “randomized” features based on the original dataset. This is achieved by permuting values within each column to generate a set of synthetic features.
- ○
- A Random Forest model is then trained on the combined set of actual and shadow features. The importance scores of each feature are calculated based on metrics such as Gini impurity or information gain.
- ○
- Boruta performs a statistical test to determine whether the importance score of each actual feature is significantly higher than the maximum importance score observed for its corresponding shadow features. Features with significantly higher importance are labeled as “important,” while those failing the test are deemed “unimportant.”
- ○
- The process is iteratively repeated until all features are either labeled as “important” or “unimportant” or until a predefined number of iterations are reached. This ensures a comprehensive evaluation of feature importance across multiple iterations.
- ○
- Features marked as “important” contribute to the final subset of selected features. This refined feature set is then used for training machine learning models and promoting model efficiency and interpretability.
3.3. Generalized Linear Models
4. Analysis of Data and Discussion
4.1. Results
4.2. Discussion
5. Conclusions
References
- Downs, Anthony. “An economic theory of democracy.” Harper and Row 28 (1957).
- Riker, William H., and Peter C. Ordeshook. “A Theory of the Calculus of Voting.” American political science review 62, no. 1 (1968): 25-42. [CrossRef]
- Learning Analytics Data Analysis Laboratory (LADAL) https://ladal.edu.au/index.html.
- Casella, George, and Roger L. Berger. “Statistical lnference.” Duxbury press (2002).
- Blais, André, and Filip Kostelka. “The decision to vote or abstain in the 2014 European elections.” Revue européenne des sciences sociales 53, no. 1 (2015): 79-94. [CrossRef]
- Schraufnagel, Scot, Michael J. Pomante II, and Quan Li. “Cost of voting in the American states: 2020.” Election Law Journal: Rules, Politics, and Policy 19, no. 4 (2020): 503-509. [CrossRef]
- Rockefeller Institute of Government, https://rockinst.org/issue-areas/fiscal-analysis/balance-of-payments-portal/.
- Keane, Michael P., and Kenneth I. Wolpin. “The solution and estimation of discrete choice dynamic programming models by simulation and interpolation: Monte Carlo evidence.” The review of economics and statistics (1994): 648-672. [CrossRef]
- Abramovich, Felix, and Marianna Pensky. “Classification with many classes: Challenges and pluses.” Journal of Multivariate Analysis 174 (2019): 104536. [CrossRef]
- Blais, André, Jean-François Daoust, Ruth Dassonneville, and Gabrielle Péloquin-Skulski. “What is the cost of voting?.” Electoral Studies 59 (2019): 145-157. [CrossRef]
- Gelman, A., Silver, N., & Edlin, A. (2012). What is the probability your vote will make a difference? Economic Inquiry 2012, 50, 321–326. [CrossRef]
- Blais, André, Carol Galais, and Danielle Mayer. “Is It a Duty to Vote and to be Informed?.” Political Studies Review 17, no. 4 (2019): 328-339. [CrossRef]
- Blais, André, and Jean-François Daoust. The motivation to vote: Explaining electoral participation. UBC Press, 2020.
- Alvarez, R. Michael, Delia Bailey, and Jonathan N. Katz. “An empirical Bayes approach to estimating ordinal treatment effects.” Political Analysis 19, no. 1 (2011): 20-31. [CrossRef]
- Baron, Jonathan. “The ‘culture of honor’in citizens’ concepts of their duty as voters.” Rationality and Society 24, no. 1 (2012): 37-72. [CrossRef]
- Brennan, Jason, and Freiman, Christopher. “Why swing-state voting is not effective altruism: The bad news about the good news about voting.” The Journal of Political Philosophy 31, no. 1 (2023): 60–79. [CrossRef]
- Choi, Hyeong In. “Lectures on Machine Learning (Fall 2017). Lecture 9: Classi cation and Regression Tree (CART)”.
- Feddersen, Timothy, and Alvaro Sandroni. “A theory of participation in elections.” American Economic Review 96, no. 4 (2006): 1271-1282. [CrossRef]
- Feddersen, Timothy, and Alvaro Sandroni. “The calculus of ethical voting.”. International Journal of Game Theory, 2006; 35, 1–25. [CrossRef]
- Edlin, Aaron S., Andrew Gelman, and Noah Kaplan. “Vote for charity’s sake.” The Economists’ Voice 5, no. 6 (2008). [CrossRef]
- Facchini, François, and Louis Jaeck. “Ideology and the rationality of non-voting.” Rationality and Society 31, no. 3 (2019): 265-286. [CrossRef]
- Faravelli, Marco, Kenan Kalayci, and Carlos Pimienta. “Costly voting: A large-scale real effort experiment.” Experimental economics 23, no. 2 (2020): 468-492. [CrossRef]
- Feddersen, Timothy J. “Rational choice theory and the paradox of not voting.” Journal of Economic perspectives 18, no. 1 (2004): 99-112. [CrossRef]
- Feddersen, Timothy, and Alvaro Sandroni. “Ethical voters and costly information acquisition.” Quarterly Journal of Political Science 1, no. 3 (2006): 287-312. [CrossRef]
- Galais, Carol, and André Blais. “Beyond rationalization: Voting out of duty or expressing duty after voting?” International Political Science Review 37, no. 2 (2016): 213-229. [CrossRef]
- Gelman, Andrew, and Nate Silver. “What Do We Know at 7 PM on Election Night?” Mathematics Magazine 83, no. 4 (2010): 258-266. [CrossRef]
- Goldfarb, Robert S., and Lee Sigelman. “Does ‘civic duty’‘solve’the rational choice voter turnout puzzle?” Journal of Theoretical Politics 22, no. 3 (2010): 275-300. [CrossRef]
- Hansen, Stephen, Thomas R. Palfrey, and Howard Rosenthal. “The Downsian model of electoral participation: Formal theory and empirical analysis of the constituency size effect.” Public Choice 52, no. 1 (1987): 15-33. [CrossRef]
- Harsanyi, John C. “Morality and the theory of rational behavior.” Social research (1977): 623-656. [CrossRef]
- Henley, T., Brown S., Diawara, N., Hossain M.I. & Rivera G. (2024) “ Contemporary Voters Suppression: Impact on the 2020 General Election,” Ralph Bunche Journal of Public Affairs: Vol. 7: Iss. 1, Article 4.
- Law, Wilson. “Decomposing political advertising effects on vote choices.” Public Choice 188, no. 3 (2021): 525-547. [CrossRef]
- Heckelman, Jac C., and Nicholas R. Miller. “Introduction: issues in social choice and voting.” In Handbook of Social Choice and Voting, pp. 1-12. Edward Elgar Publishing, 2015.
- Feddersen, Timothy, and Alvaro Sandroni. The foundations of warm-glow theory. working paper, 2009.
- Ong Whaley, Carah and Goldberg, Abraham. Voting Trends Among Young People, Encyclopedia of American Political Parties and Elections, 2021. (https://virginia.academia.edu/CarahOngWhaley/Papers).
- Hayes, Danny, and Jennifer L. Lawless. “As local news goes, so goes citizen engagement: Media, knowledge, and participation in US House Elections.” The Journal of Politics 77, no. 2 (2015): 447-462. [CrossRef]
- Wolfinger, Raymond E., and Steven J. Rosenstone. Who votes?. Yale University Press, 1980.
- Lee, Dwight R. “Ignoring the Electoral College: why public choice economists understate the probability of decisive voters.” Public Choice 187, no. 3 (2021): 439-454. [CrossRef]
- Addeo, F., Ammirato, L., Delli Paoli, A., Fruncillo, D., Maddaloni, D. (2024). Voter.
- turnout: Overtime and regional trends in Europe. Working paper no. 2.3. TRUEDEM: Trust in.
- European Democracies Project (www.truedem.eu).
- Holbein, John B., and Hans Hassell. “George Floyd’s Murder Prompted Thousands of Americans to Register to Vote.” (2023).
- Barber, Michael, and John B. Holbein. “400 million voting records show profound racial and geographic disparities in voter turnout in the United States.” Plos one 17, no. 6 (2022): e0268134. [CrossRef]
- Srinivasan SM, Paat Y-F. A Data-Centric Approach to Understanding the 2020 U.S. Presidential Election. Big Data and Cognitive Computing. 2024; 8(9):111. [CrossRef]



| Variables | Not Voted | Voted | |
|---|---|---|---|
| Duty | Views voting as choice Views voting as duty |
154 (13.95%) (69.37%) 68 (3.58%)(30.63%) |
950 (86.05%) (34.12%) 1834 (96.42%) (65.88%) |
| Age | 18-34 35-49 50-64 65+ |
74 (12.80%) (33.33%) 71 (9.62%) (31.98%) 43 (5.26%) (19.36%) 34 (3.90%) (15.31%) |
504 (87.20%) (18.10%) 667 (90.38%) (23.95%) 775 (94.74%) (27.83%) 838 (96.10%) (30.10%) |
| Race | White Black Hispanic Asian Others |
136 (6.08%) (61.26%) 27 (10.00%) (12.16%) 40 (16.19%) (18.02%) 7 (6.86%) (3.13%) 12 (8.05%) (5.41%) |
2102 (93.92%) (75.50%) 243 (90.00%) (8.73%) 207 (83.81%) (7.44%) 958 (93.14%)) (3.41%) 137 (91.95%) (4.92%) |
| Gender | Male Female |
116 (8.54%) (52.49%) 105 (6.41%) (47.51%) |
1242 (91.46%) (44.79%) 1531 (93.59%) (55.21%) |
| Education | Less than high school High school graduate Some college or less than 4 years 4-year college degree Or Postgraduate degree |
30 (31.91%) (13.51%) 67 (15.22%) (30.18%) 72 (7.42%) (32.43%) 53 (3.53%) (23.87%) |
64 (68.09%) (2.3%) 373 (84.78%) (13.4%) 898 (92.58%) (32.26%) 1449 (96.26%) (52.05%) |
| Marital Status | Partnered Single |
90 (5.44%) (40.54%) 132 (9.78%) (59.46%) |
1565 (94.56%) (56.21%) 1219 (90.23%) (43.79%) |
| Occupation | Full Time Unemployed Part Time |
140 (7.46%) (63.06%) 73 (7.30%) (32.88%) 9 (6.92%) (4.05%) |
1736 (92.54%) (62.36%) 927 (92.7%) (33.3%) 121 (93.08%) (4.35%) |
| Home Ownership | Inapplicable Rent Own Others |
111 (7.06%) (50.00%) 49 (14.12%) (22.07%) 51 (5.07%) (22.97%) 11 (13.58%) (4.95%) |
1462 (92.94%) (52.51%) 298 (85.88%) (10.70%) 954 (94.93%) (34.27%) 70 (86.42%) (2.51%) |
| Number | Variable | Values |
|---|---|---|
| V201510 | Education *** | 1 = Less than high school, 2 = High school graduate, 3 = Some college or less than 4 years, 4 = 4-year college degree Or Postgraduate degree |
| V201508 | Marital Status | 1 = Partnered, 2 = Single |
| V201533x | Occupation | 1= Full Time, 2 = Unemployed, 3 = Part Time, |
| V201549x | Race * | 1 = White, 2 = Black, 3 = Hispanic, 4 = Asian, 5 = Others |
| V201581 | Home ownership ** | 1 = Inapplicable, 2 = Rent, 3 = Own, 4 = Others |
| V202064 | Party Registration | 1 = Inapplicable, 2 = Democrate, 3 = Republican, 4 = Others |
| V202219 | Votes Counted Fairly | 1 = Never, 2 = Some of the time, 3 = About half of the time, 4 = Most of the time, 5 = All of the time |
| V202017 | Monetary Contribution | 1 = Yes, 0 = No |
| V202051 | Registered to Vote | 1 = Inapplicable, 2 =Yes, 3 = No |
| V202352 | Social Class | 1 = Lower class, 2 = Working class, 3 = Middle class, 4 = Upper class |
| V201244 | Party better handling theCOVID-19 (Voting Choice) | 1 = R. would do a much better job, 2 = R. would do a somewhat better job, 3 = Not much difference between them, 4 = D. would do a somewhat better job, 5 = D. would do a much better job |
| COVI Index | Cost | Scot Schraufnagel, Michael J. Pomante II, and Quan Li (2020) |
| V201221 | Duty*** | 1 = Mainly a duty, 0 = Mainly a choice |
| Benefit | https://rockinst.org/issue-areas/fiscal-analysis/balance-of-payments-portal/ | |
| V202066 | Vote | 1 = Voted, 0 = Did not vote |
| Variables | Estimate | Std. Error | Pr(>|z|) | ||
|---|---|---|---|---|---|
| (Intercept) | -0.07203 | 0.5336 | 0.892622 | ||
| Education | High school graduate | 0.9767 | 0.2796 | 0.000478 | *** |
| Some college or less than 4 years | 1.148 | 0.2602 | 0.000010 | *** | |
| 4-year college degree or postgraduate degree | 1.44 | 0.2793 | 0.000000 | *** | |
| Marital Status | Single | -0.2024 | 0.1561 | 0.194813 | |
| Occupation | Unemployed | 0.3973 | 0.1664 | 0.016923 | * |
| Part Time | 0.03135 | 0.3462 | 0.927853 | ||
| Race | Black | -0.5125 | 0.2366 | 0.030284 | * |
| Hispanic | -0.7793 | 0.2179 | 0.000349 | *** | |
| Asian | -0.08955 | 0.4873 | 0.854213 | ||
| Others | -0.3193 | 0.303 | 0.291884 | ||
| Home | Rent | -0.193 | 0.2001 | 0.334891 | |
| Own | 0.3093 | 0.1796 | 0.085134 | . | |
| Others | -0.1249 | 0.3647 | 0.731934 | ||
| Party Registration | Democrat | -1.01 | 0.7072 | 0.153407 | |
| Republican | 0.1378 | 0.6401 | 0.829524 | ||
| Others | -1.099 | 0.5572 | 0.048607 | * | |
| Votes Count Fairly | 0.2888 | 0.0641 | 0.000006 | *** | |
| Monitory Contribution | 0.514 | 0.2466 | 0.037137 | * | |
| Registered to Vote | Yes | -1.013 | 0.3594 | 0.004802 | ** |
| No | -7.116 | 0.7224 | < 0.000000 | *** | |
| Social Class | 0.3496 | 0.1103 | 0.001529 | ** | |
| Voting Choice | Inapplicable | 0.5941 | 0.3359 | 0.076978 | . |
| Moderately strongly | 0.1762 | 0.3691 | 0.633076 | ||
| Very strongly | -0.2437 | 0.3486 | 0.484605 | ||
| Covid | -0.01712 | 0.06174 | 0.781538 | ||
| Cost | -0.2896 | 0.1089 | 0.007839 | ** | |
| Benefit | -0.00003864 | 0.00002192 | 0.077959 | . |
| Variables | Estimate | Std. Error | Pr.(>|z|) | ||
| (Intercept) | 0.750286 | 0.692661 | 0.278722 | ||
| Duty | 0.977474 | 0.237736 | 3.93E-05 | *** | |
| Education | High school graduate | 1.03235 | 0.462034 | 0.02546 | * |
| Some college or less than 4 years | 1.516267 | 0.44844 | 0.000722 | *** | |
| 4-year college degree Or Postgraduate degree | 1.521595 | 0.450853 | 0.000738 | *** | |
| Marital Status | Single | -0.1522 | 0.250999 | 0.544267 | |
| Occupation | Unemployed | 0.145875 | 0.263636 | 0.580045 | |
| Part Time | -0.36271 | 0.504709 | 0.472361 | ||
| Race | Black | -0.32022 | 0.4019 | 0.425594 | |
| Hispanic | -0.81911 | 0.345101 | 0.017618 | * | |
| Asian | 0.117249 | 0.663521 | 0.859739 | ||
| Others | -0.23948 | 0.546181 | 0.661048 | ||
| Home | Rent | -0.82618 | 0.308191 | 0.007346 | ** |
| Own | 0.013546 | 0.285243 | 0.962123 | ||
| Others | -1.18811 | 0.521745 | 0.022776 | * | |
| Party Registration | Democrat | -1.7505 | 1.364323 | 0.199472 | |
| Republican | 0.680638 | 1.248893 | 0.585758 | ||
| Others | -1.50016 | 0.856503 | 0.079862 | . | |
| Votes Count Fairly | 0.194254 | 0.108764 | 0.074098 | . | |
| Monitory Contribution | 0.621206 | 0.39193 | 0.112968 | ||
| Registered to Vote | Yes | -0.91516 | 0.542745 | 0.091764 | . |
| No | -20.1501 | 532.3398 | 0.969806 | ||
| Social Class | -0.00698 | 0.177184 | 0.968593 | ||
| Covid | 0.044846 | 0.097712 | 0.646265 |
| Variables | Estimate | Std. Error | Pr.(>|z|) | ||
| (Intercept) | 0.9565 | 0.7156 | 0.181377 | ||
| Duty | 0.995 | 0.2386 | 0.0000303 | *** | |
| Education | High school graduate | 1.038 | 0.4612 | 0.024375 | * |
| Some college or less than 4 years | 1.501 | 0.4471 | 0.000787 | *** | |
| 4-year college degree Or Postgraduate degree | 1.498 | 0.4495 | 0.000863 | *** | |
| Marital Status | Single | -0.1692 | 0.2517 | 0.501495 | |
| Occupation | Unemployed | 0.163 | 0.264 | 0.537112 | |
| Part Time | -0.356 | 0.5049 | 0.480769 | ||
| Race | Black | -0.2603 | 0.4048 | 0.520158 | |
| Hispanic | -0.8251 | 0.3521 | 0.019112 | * | |
| Asian | 0.0893 | 0.6727 | 0.894397 | ||
| Others | -0.2463 | 0.5451 | 0.651466 | ||
| Home | Rent | -0.8203 | 0.3086 | 0.007854 | ** |
| Own | 0.02517 | 0.2853 | 0.929716 | ||
| Others | -1.173 | 0.5249 | 0.025416 | * | |
| Party Registration | Democrat | -1.757 | 1.37 | 0.199749 | |
| Republican | 0.7439 | 1.255 | 0.553352 | ||
| Others | -1.526 | 0.8613 | 0.076419 | . | |
| Votes Count Fairly | 0.2077 | 0.1093 | 0.057385 | . | |
| Monitory Contribution | 0.61 | 0.3927 | 0.120267 | ||
| Registered to Vote | Yes | -0.9545 | 0.5471 | 0.081073 | . |
| No | -20.19 | 531.5 | 0.969697 | ||
| Social Class | -0.003236 | 0.1773 | 0.985443 | ||
| Covid | 0.02333 | 0.09928 | 0.814203 | ||
| Cost | -0.1157 | 0.1731 | 0.50382 | ||
| Benefit | -0.00004021 | 0.00003495 | 0.249965 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).