Submitted:
18 July 2025
Posted:
21 July 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Background
3. Study Motivation
- At least one daily volume is necessary for each day of the week (DOW) within a month.
- A minimum of 19 hourly observations must be recorded for each daily volume.
- The daily traffic volume should fall within 20% of the average for that specific day of the week in the month.
4. Study Approach
- i.
- Sampling technique I: In this technique, random days within a specific month are selected to represent the level of available (or missing) data. For instance, one week of available data would include one Monday, one Tuesday, one Wednesday, etc., all selected randomly within the month.
- ii.
- Sampling technique II: In this technique, the duration of the available data (one, two, or three weeks) is selected randomly within the month as a continuous period of time. This sampling approach seems more realistic as the periods when ATRs or WIMs are down, or malfunction tend to be continuous within any given month.
4. Data Collection
5. Analysis And Results
5.1. Base Condition (No missing Data)
5.2. Scenario 1 (Permanent stations with one week of data per month)
5.3. Scenario 2 (Permanent Stations with Two Weeks of Data Per Month)
5.4. Scenario 3 (Permanent Stations with Three Weeks of Data Per Month)
6. Summary And Conclusions
- i.
- Study results clearly show that the missing data has a consistent effect on the accuracy of AADT estimation, measured using the absolute percent discrepancy between the actual and estimated AADT. This finding supports the research hypothesis that the greater the amount of missing data, the less accurate the AADT estimation. However, this effect was not found statistically significant using the two-way ANOVA analysis at the 95% confidence level.
- ii.
- The increase in % discrepancy for AADT estimation was not linearly proportional to the increase in the amount of missing data. Despite the dramatic scenarios of missing data used in the analysis (31% to 77%), the change in the AADT approximation between the highest and lowest levels of missing data was in the order of 6%.
- iii.
- Given the extreme scenarios of missing data used in the study (all permanent stations missing significant amounts of data simultaneously) and the relatively low effect on % discrepancy in AADT estimation (less than 7% discrepancy for the most extreme scenario), it is reasonable to conclude that the current practice in treating missing data does not involve an important compromise in the accuracy of AADT estimation. The finding also suggests that the data containing at least one day of the week for each month can be utilized for developing daily and seasonal adjustment factors (i.e., MDOW factors) without the need for imputing missing data.
- iv.
- Sampling technique I of selecting random days within the month was associated with lower % discrepancy in AADT estimation compared with sampling technique II of selecting a random period within the month (one, two, or three weeks).
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| AADT | Annual Average Daily Traffic |
| MDOW | Monthly Day of the Week |
| MDT | Montana Department of Transportation |
| FHWA | Federal Highway Administration |
| ATR | Automatic Traffic Recorder |
| WIM | Weigh-in-Motion |
References
- Sharma SC, Lingras P, Liu GX, Xu F. Estimation of Annual Average Daily Traffic on Low-Volume Roads: Factor Approach Versus Neural Networks. Transportation Research Record: Journal of the Transportation Research Board. 2000 Jan 1;1719(1):103–11. [CrossRef]
- Zhao F, Chung S. Contributing Factors of Annual Average Daily Traffic in a Florida County: Exploration with Geographic Information System and Regression Models. Transportation Research Record: Journal of the Transportation Research Board. 2001 Jan 1;1769(1):113–22. [CrossRef]
- Sun X, Das S. Estimating Annual Average Daily Traffic for Low-Volume Roadways: A Case Study in Louisiana. In: 12th International Conference on Low-Volume Roads. 2019. Available from: https://trid.trb.org/View/1689977.
- Robichaud K, Gordon M. Assessment of Data-Collection Techniques for Highway Agencies. Transportation Research Record: Journal of the Transportation Research Board. 2003 Jan 1;1855(1):129–35. [CrossRef]
- Albright, D. History of estimating and evaluating annual traffic volume statistics. Transportation Research Record: Journal of the Transportation Research Board 1991; 1305:103–7. Available from: https://onlinepubs.trb.org/Onlinepubs/trr/1991/1305/1305-013.pdf.
- Fekpe E, Gopalakrishna D, Middleton D. Final Report Prepared for Office of Highway Policy Information Federal Highway Administration U.S. Department of Transportation Washington, D.C. 2004. Highway Performance Monitoring System Traffic Data for High-Volume Routes: Best Practices and Guidelines.
- Vandervalk-Ostrander A, American Association of State Highway and Transportation Officials, United States Federal Highway Administration, National Cooperative Highway Research Program. AASHTO Guidelines for Traffic Data Programs. Vol. 1. Washington, D.C.: PB - American Association of State Highway and Transportation Officials; 2009. Available from: http://dl1.wikitransport.ir/book/AASHTO_Guidelines_for_Traffic_Data_Programs_2009.pdf.
- Federal Highway Administration (FHWA). Traffic Monitoring Guide. 2022. Available from: https://rosap.ntl.bts.gov/view/dot/74643.
- Zhong M, Sharma S, Liu Z. Assessing Robustness of Imputation Models Based on Data from Different Jurisdictions: Examples of Alberta and Saskatchewan, Canada. Transportation Research Record: Journal of the Transportation Research Board 2005 Jan 1;1917(1):116–26. [CrossRef]
- AASHTO. AASHTO Guidelines for Traffic Data Programs - Table of Contents. 2009. Available from: http://dl1.wikitransport.ir/book/AASHTO_Guidelines_for_Traffic_Data_Programs_2009.pdf.
- Albright D. 1990 survey of traffic monitoring practices among state transportation agencies of the United States. In 1990. Available from: https://api.semanticscholar.org/CorpusID:168276711.
- Albright D. AN IMPERATIVE FOR, AND CURRENT PROGRESS TOWARD, NATIONAL TRAFFIC MONITORING STANDARDS. Ite Journal-institute of Transportation Engineers. 1991;61. Available from: https://api.semanticscholar.org/CorpusID:107206642.
- Zhong M, Sharma S, Liu Z. Assessing Robustness of Imputation Models Based on Data from Different Jurisdictions: Examples of Alberta and Saskatchewan, Canada. Transportation Research Record: Journal of the Transportation Research Board. 2005 Jan 1;1917(1):116–26. [CrossRef]
- Khan Z, Khan SM, Dey K, Chowdhury M. Development and Evaluation of Recurrent Neural Network-Based Models for Hourly Traffic Volume and Annual Average Daily Traffic Prediction. Transportation Research Record: Journal of the Transportation Research Board. 2019 Jun 2;2673(7):489–503. [CrossRef]
- Jia X, Dong X, Chen M, Yu X. Missing data imputation for traffic congestion data based on joint matrix factorization. Knowledge-based Systems. 2021; 225:107114. Available from: https://www.sciencedirect.com/science/article/pii/S0950705121003774.
- Shafique MA. Imputing Missing Data in Hourly Traffic Counts. Sensors. 2022 Dec 15;22(24):9876. Available from: https://www.mdpi.com/1424-8220/22/24/9876.
- Roll, J. Daily Traffic Count Imputation for Bicycle and Pedestrian Traffic: Comparing Existing Methods with Machine Learning Approaches. Transportation Research Record: Journal of the Transportation Research Board. 2021 Jul 27;2675(11):1428–40. [CrossRef]
- William Mendenhall RJBBMB. Introduction to Probability and Statistics. 14th ed. 2012. Available from: https://books.google.com/books/about/Introduction_to_Probability_and_Statisti.html?id=fQsKAAAAQBAJ.











| Stations | A-008 | A-046 | W-101 | W-110 | W-115 | W-116 | W-132 | W-144 | W-147 | W-149 |
|---|---|---|---|---|---|---|---|---|---|---|
|
Absolute Average Percent Discrepancy |
7.71 | 10.43 | 9.79 | 9.37 | 12.26 | 11.30 | 16.31 | 13.82 | 10.90 | 21.11 |
| One-Tailed t-Test | Two-Tailed t-Test | ||
|---|---|---|---|
| Stations | Base Condition vs Sampling Technique I | Base Condition vs Sampling Technique II | Sampling Technique I vs Sampling Technique II |
| A-008 | 0.030 | 0.019 | 0.818 |
| A-046 | 0.108 | 0.071 | 0.802 |
| W-101 | 0.078 | 0.043 | 0.759 |
| W-110 | 0.111 | 0.085 | 0.877 |
| W-115 | 0.135 | 0.104 | 0.879 |
| W-116 | 0.096 | 0.079 | 0.911 |
| W-132 | 0.469 | 0.470 | 0.997 |
| W-144 | 0.146 | 0.104 | 0.836 |
| W-147 | 0.112 | 0.123 | 0.955 |
| W-149 | 0.497 | 0.515 | 0.963 |
| One-Tailed t-Test | Two-Tailed t-Test | ||
|---|---|---|---|
| Stations | Base Condition vs Sampling Technique I | Base Condition vs Sampling Technique II | Sampling Technique I vs Sampling Technique II |
| A-008 | 0.317 | 0.213 | 0.737 |
| A-046 | 0.374 | 0.263 | 0.752 |
| W-101 | 0.321 | 0.231 | 0.783 |
| W-110 | 0.341 | 0.331 | 0.978 |
| W-115 | 0.365 | 0.319 | 0.891 |
| W-116 | 0.334 | 0.305 | 0.934 |
| W-132 | 0.505 | 0.516 | 0.978 |
| W-144 | 0.361 | 0.311 | 0.889 |
| W-147 | 0.358 | 0.395 | 0.923 |
| W-149 | 0.507 | 0.527 | 0.959 |
| One-Tailed t-Test | Two-Tailed t-Test | ||
|---|---|---|---|
| Stations | Base Condition vs Sampling Technique I | Base Condition vs Sampling Technique II | Sampling Technique I vs Sampling Technique II |
| A-008 | 0.427 | 0.327 | 0.788 |
| A-046 | 0.449 | 0.333 | 0.758 |
| W-101 | 0.427 | 0.310 | 0.754 |
| W-110 | 0.444 | 0.395 | 0.901 |
| W-115 | 0.449 | 0.365 | 0.827 |
| W-116 | 0.441 | 0.375 | 0.866 |
| W-132 | 0.551 | 0.527 | 0.948 |
| W-144 | 0.448 | 0.369 | 0.839 |
| W-147 | 0.453 | 0.490 | 0.926 |
| W-149 | 0.594 | 0.536 | 0.928 |
| Absolute Average % Discrepancies | |||
|---|---|---|---|
| Scenario 1 | Scenario 2 | Scenario 3 | |
| % Missing Data | 76.98 | 53.97 | 30.95 |
| Sampling Technique I | 13.017 [5.83] * |
12.517 [1.76] |
12.383 [0.68] |
| Sampling Technique II | 13.121 [6.67] |
12.603 [2.46] |
12.496 [1.59] |
| DF | Sum Sq | Mean Sq | F value | Pr (>F) | |
|---|---|---|---|---|---|
| Missing Data Scenario | 2 | 181 | 90.47 | 1.302 | 0.272 |
| Sampling Technique | 1 | 21 | 20.64 | .297 | 0.586 |
| Sampling Technique: Scenario | 2 | 0 | 0.03 | 0 | 1 |
| Residuals | 2184 | 151815 | 69.51 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).