Submitted:
10 May 2024
Posted:
14 May 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Literature Review
2.1. Adoption
2.2. Usage
| Serial | Title (Year) | Data | Variables Considered | Method(s) | Key Findings |
|---|---|---|---|---|---|
| 1 | Modelling real choice between conventional ad electric cars for homebased journeys (2015) | Data from 667 Danish households | Journey time, driving time, number of trip legs, journey distance, at least one charge, windspeed, precipitation, Citroen dummy, number of driving licenses, city dummy, first week dummy. | Logit model | The number of trip legs, the drivetime, requirement to charge the vehicle all had negative effects on the choice of EV. While precipitation, urban area had positive effects on the choice. |
| 2 | Electric vehicles in multi-vehicle households (2015) | Data from 446 vehicles in the Puget Sound Region in Washington | Range, DRA (days requiring adaptation)/Threshold for inconvenience | Trip Counting, Analytic estimations | Electric vehicles of the same range if deployed as a second car in two vehicle households would be electrify roughly twice as many miles as the deployment into one car households (replacing their only vehicle). |
| 3 | Are multi-car households better suited for battery electric vehicles? – Driving patterns and economics in Sweden and Germany (2016) | German household survey data (from 6339 vehicles) and Swedish GPS data (from 700 vehicles) | VKT (vehicles kilometers traveled), DRA (days requiring adaptation), range, capital expenditure, operating expenditure | Extrapolation and economic analysis | From the economic analysis, it was also found that BEVs are best suited for multi-car households. Secondary household cars in these households are better suited to be replaced by a BEV. |
| 4 | How are driving patterns adjusted to the use of a battery electric vehicle in two-car households? (2016) | GPS data from 10 Swedish households | DRA (days requiring adaptation), annual VKT, daily driving distance | Extrapolation | For most households, the EV is driven more than the replaced car. There exists a large heterogeneity in the usage and adaptation among the households. The EVs mainly replace the 40-70 km trips of the replaced car. |
| 5 | What are the value and implications of two-car households for the electric car? (2017) | GPS logging data for both cars in 64 commuting Swedish two-car households | VKT(vehicles kilometers traveled), SOC (State of Charge), TCO (Total Cost of Ownership) | Mixed integer quadratically constrained programming (MIQCP) | Two-car households in Sweden could derive a value of $7000 from the flexibility of owning an EV. This is because they can drive more on electricity, which is cheaper, and rely on their internal combustion engine vehicles for longer trips. |
| 6 | Exploring Factors that Influence Individuals' Choice Between Internal Combustion Engine Cars and Electric Vehicles (2020) | A dataset of 129 Swiss drivers over a period of 1 year | weekday/weekend, temperature, precipitation, sex, age, number of cars in household, work status, household size, long-distance trip leg, duration of activity, trip duration, household income, hour of day, month of year | Random Forest and Logit Model | The duration, distance, weekday/weekend has a larger effect than household size, but these variables do not possess a high predictive power. This indicated that the range of the vehicles is not a deciding factor in this choice. |
| 7 | Utilization of battery-electric vehicles in two-car households: Empirical insights from Gothenburg Sweden (2020) | GPS data from 20 Swedish two-car households | Range, VKT (vehicles kilometers traveled), Flexibility Utilization Index | Ex-post analysis | The electric vehicles performed a major share of the below range driving during the weekends. |
| 8 | Electrification of Vehicle Miles Traveled and Fuel Consumption within the Household Context A Case Study from California, U.S.A. (2022) | A dataset of 650 vehicles from 287 Californian households | range, charging frequency, frequency of long-distance travel, frequency of overlaps, household VMT, ICEV mileage | Statistical Analysis and Regression | A short-range PHEV can electrify up to 70% of the eVMT of long-range BEVs (Bolt and Model S). Hence, PHEVs with 35-mile all-electric range can be used as tools to decarbonize the transport sector. |
| 9 | How do users adapt to a short-range battery electric vehicle in a two-car household? Results from a trial in Sweden (2022) | GPS data from 25 Swedish two-car households | DRA (days requiring adaptation), daily driving distance | Quantitative, qualitative, and mixed methods | There exists a large heterogeneity in driving adaptation and behavior; some households use the electric vehicle more than the replaced car; some use it less. Some households change their driving style when they use the electric vehicle. |
| 10 | Integrating plug-in electric vehicles (PEVs) into household fleets- factors influencing miles traveled by PEV owners in California (2022) | Survey data of 4125 Californian Households with BEVs or PHEVs | PEV characteristics, other household vehicle characteristics, built environment variables, household characteristics, respondent characteristics, other factors influencing PEV use | OLS Regression, SUR Model, Hypothesis Tests | eVMT is correlated with traditional factors such as population density, attitudes towards technology and lifestyle preferences. PEVs are driven as much as ICEVs. The availability of level 2 charging at home greatly influences the eVMT. |
2.3. Summary
3. Data
3.1. NHTS 2017
3.2. Data Cleaning

3.3. Variable Selection
| Variable | Categories | Distribution | ||
|---|---|---|---|---|
| Household Attributes | Home Ownership (Binary) | Rent's a House | 91.26% | |
| Own's a House | 8.74% | |||
| Household Income (Discrete) | Low (<$50,000) | 9.73% | ||
| Medium ($50,000 - $150,000) | 57.24% | |||
| High (>$150,000) | 33.03% | |||
| Number of Household Workers (Discrete) | 0 | 17.53% | ||
| 1 | 25.61% | |||
| 2 | 45.92% | |||
| 3 | 8.49% | |||
| 4 | 2.12% | |||
| 5 | 0.33% | |||
| Children (Binary) | No children | 90.42% | ||
| 1 or more children | 9.58% | |||
| Driver Attributes | Driver's Age (continuous) | - | 52.49 ± 15.65* | |
| Driver's Education Level (Discrete) | No high school degree | 8.64% | ||
| High school or associate degree | 19.31% | |||
| Bachelor's degree or higher | 72.06% | |||
| Driver's Sex (Binary) | Female | 43.54% | ||
| Male | 56.46% | |||
| Built Environment Variables | Employment Density in Workers Per 0.01 Square Miles (Continuous) | - | 3.74 ± 4.52* | |
| Population Density in Persons Per 0.01 Square Miles (Continuous) | - | 1.54 ± 1.52* | ||
| **Trip Day Gas Price in cents per gallon (Continuous) | - | 246.86 ± 25.64* | ||
| Trip Attributes | Weekday/Weekend (Binary) | Weekday | 77.42% | |
| Weekend | 22.58% | |||
| Starting Time (Discrete) | 12 AM - 6 AM | 2.53% | ||
| 6 AM - 10AM | 30.06% | |||
| 10 AM - 3 PM | 38.06% | |||
| 3 PM - 7 PM | 24.44% | |||
| 7 PM - 12 AM | 4.9% | |||
| Home/Non-home Based (Binary) | Home-based trip | 49.42% | ||
| non-home-based trip | 50.58% | |||
| Trip Purpose (Discrete) | Errands | 16.01% | ||
| Others | 10.41% | |||
| Shopping or Dining | 38.28% | |||
| Social or recreational | 14.81% | |||
| Work | 20.49% | |||
| Dwelling time (Discrete) | 1-15 minutes | 29.31% | ||
| 15-50 minutes | 24.04% | |||
| 50-150 minutes | 25.08% | |||
| More than 150 minutes | 21.57% | |||
| Trip Distance (Discrete) | 0-2 miles | 27.82% | ||
| 2-5 miles | 28.41% | |||
| 5-15 miles | 27.35% | |||
| More than 15 miles | 16.41% | |||
| Number of Passengers (Discrete) | 1 passenger | 56.92% | ||
| 2-4 passengers | 41.42% | |||
| 5 - 10 passengers | 1.66% | |||
4. Methodology

4.1. Clustering Model
- (1)
- A set of observations from the dataset Q = [Q1, Q2, Q3 ….. Qk] is initialized as the cluster centroids using a density-based initialization algorithm (Cao et al., 2009). This initialization method helps avoid the necessity of running the algorithm multiple times to search for an effective solution. The observation with the maximum density is initialized as the first centroid. The remaining centroids are initialized based on density as well as the distance from other centroids.
- (2)
- Every trip/observation (denoted by X) outside Q, is assigned to a cluster from Q whose centroid had the smallest hamming distance (Pandit and Gupta, 2011) from X. Hamming distance can be defined as follows:
- (3)
- After every trip is assigned to a cluster, the cluster centroids in Q are updated based on the newly assigned trips. The hamming distances for each trip are recalculated and trips are assigned to new clusters based on the newly calculated hamming distances.
- (4)
- Step 3 is repeated until no trip in the dataset changes clusters. And the cost or the sum of the hamming distances for all the observations were recorded for the model with k clusters.
4.2. Classification Model
4.2.1. Decision Tree
4.2.2. Random Forest
- (1)
- At first a subset of the data is formed by bootstrapping (Breiman, 1996). In this step, a random sample of the data is drawn with replacement. This implies that some observations may be duplicated, and others may be left out of the sample.
- (2)
- Next, a decision tree is constructed using the bootstrapped sample and a set of randomly selected features. As recommended by previous studies (Liaw and Wiener, 2002), the number of randomly selected features was set as the square root of the total number of features rounded to the nearest integer. The decision tree is grown until the number of observations at a node reaches 1.
- (3)
- The number of decision trees to be grown was set to 1000; the first two steps were repeated 1000 times.
- (4)
- To make a prediction for a new observation, each decision tree in the forest is traversed and the predictions from each tree are recorded. Finally, the majority vote of predictions is taken as the final prediction.
4.2.3. Extreme Gradient Boosting
4.2.4. Binary Logit
4.2.5. Classification Model Interpretation
5. Results and Discussion
5.1. Clustering Model
5.1.1. Model Comparison

5.1.2. Model Interpretation
| Trip Attribute | Trip Clusters | |||||
|---|---|---|---|---|---|---|
| Cluster 1 |
Cluster 2 |
Cluster 3 |
Cluster 4 |
Cluster 5 |
||
| (8060 trips) | (4956 trips) | (2075 trips) | (2249 trips) |
(2485 trips) | ||
| Weekday/Weekend | Weekday trip | 86% | 90% | 88% | 74% | 20% |
| Weekend trip | 14% | 10% | 12% | 26% | 80% | |
| Starting time | 10 AM - 3 PM | 60% | 11% | 17% | 19% | 56% |
| 12 AM - 6 AM | 1% | 7% | 2% | 1% | 1% | |
| 3 PM - 7 PM | 18% | 6% | 64% | 60% | 17% | |
| 6 AM - 10 AM | 15% | 75% | 14% | 13% | 17% | |
| 7 PM - 12 AM | 6% | 2% | 3% | 6% | 8% | |
| Home-based/non-home based | Home-based trip | 22% | 81% | 76% | 25% | 75% |
| non-home-based trip | 78% | 19% | 24% | 75% | 25% | |
| Trip Purpose | Errands | 19% | 6% | 47% | 10% | 6% |
| Others | 7% | 13% | 13% | 11% | 12% | |
| Shopping or Dining | 58% | 6% | 23% | 24% | 64% | |
| Social or recreational | 6% | 12% | 15% | 51% | 16% | |
| Work | 9% | 63% | 2% | 4% | 2% | |
| Dwelling time (time spent at destination) | 1-15 minutes | 51% | 9% | 32% | 10% | 16% |
| 15-50 minutes | 29% | 8% | 49% | 16% | 25% | |
| 50-150 minutes | 14% | 18% | 16% | 60% | 52% | |
| More than 150 minutes | 5% | 66% | 4% | 14% | 8% | |
| Trip Distance | 0-2 miles | 33% | 29% | 13% | 27% | 21% |
| 2-5 miles | 43% | 17% | 19% | 19% | 19% | |
| 5-15 miles | 15% | 28% | 62% | 16% | 49% | |
| More than 15 miles | 9% | 26% | 6% | 38% | 11% | |
| Number of Passengers | 1 passenger | 67% | 82% | 63% | 11% | 11% |
| 2-4 passengers | 32% | 17% | 35% | 86% | 86% | |
| 5 - 10 passengers | 2% | 1% | 1% | 3% | 3% | |
5.2. Classification Model
5.2.1. Model Comparison
| Model | Cross-validation Accuracy | Training Accuracy | Testing Accuracy |
|---|---|---|---|
| Decision Tree | 88% | 98.9% | 87.3% |
| Random Forest | 84.5% | 98.9% | 82.6% |
| XG Boost | 87.1% | 97.2% | 86.2% |
| Binary Logit | 57.7% | 57.6% | 57.8% |
5.2.2. Model Interpretation
5.2.2.1. Variable Importance

5.2.2.2. Accumulated Local Effects


6. Conclusion
References
- Apley, D.W., Zhu, J., 2020. Visualizing the effects of predictor variables in black box supervised learning models. J R Stat Soc Series B Stat Methodol 82. [CrossRef]
- Bhat, C.R., 1997. Work Travel Mode Choice and Number of Non-Work Commute Stops. Transportation Research Part B: Methodological 31, 41–54ode Choice and Number of Non-Work Commute Stops. Transportation Research Part B: Methodological 31, 41–54. [CrossRef]
- Biau, G., Scornet, E., 2016. A random forest guided tour. Test 25, 197–227. [CrossRef]
- Breiman, L., 2001. Random Forests. Mach Learn 45, 5–32.
- Breiman, L., 1984. Classification and Regression Trees, 1st ed. Chapman and Hall/CRC.
- Brown, A., Cappellucci, J., Schayowitz, A., White, E., Heinrich, A., Cost, E., 2022. Electric Vehicle Charging Infrastructure Trends from the Alternative Fueling Station Locator: First Quarter 2022.
- Bucher, D., Martin, H., Hamper, J., Jaleh, A., Becker, H., Zhao, P., Raubal, M., 2020. Exploring Factors that Influence Individuals’ Choice Between Internal Combustion Engine Cars and Electric Vehicles. AGILE: GIScience Series 1, 1–23. [CrossRef]
- Cameron, A.C., Trivedi, P.K., 2010. Microeconometrics using stata. Stata press, College Station, TX.
- Cao, F., Liang, J., Bai, L., 2009. A new initialization method for categorical data clustering. Expert Syst Appl 36, 10223–10228. [CrossRef]
- Carlucci, F., Cirà, A., Lanza, G., 2018. Hybrid electric vehicles: Some theoretical considerations on consumption behaviour. Sustainability 10. [CrossRef]
- Caulfield, B., Farrell, S., McMahon, B., 2010. Examining individuals preferences for hybrid electric and alternatively fuelled vehicles. Transp Policy (Oxf) 17, 381–387. [CrossRef]
- Chakraborty, D., Hardman, S., Tal, G., 2022. Integrating plug-in electric vehicles (PEVs) into household fleets- factors influencing miles traveled by PEV owners in California. Travel Behav Soc 26, 67–83. [CrossRef]
- Chaturvedi, A., Green, P.E., Caroll, J.D., 2001. K-modes Clustering. J Classif 18, 35–55.
- Chen, T., Guestrin, C., 2016. XGBoost: A scalable tree boosting system, in: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. Association for Computing Machinery, pp. 785–794. [CrossRef]
- De Borger, B., Mulalic, I., Rouwendal, J., 2016. Substitution between cars within the household. Transp Res Part A Policy Pract 85, 135–156. [CrossRef]
- Egbue, O., Long, S., 2012. Barriers to widespread adoption of electric vehicles: An analysis of consumer attitudes and perceptions. Energy Policy 48, 717–729. [CrossRef]
- Friedman, J.H., 2001. Greedy Function Approximation: A Gradient Boosting Machine. The Annals of Statistics 29, 1189–1232.
- Ghandi, A., Paltsev, S., 2020. Global CO2 impacts of light-duty electric vehicles. Transp Res D Transp Environ 87. [CrossRef]
- Hackbarth, A., Madlener, R., 2013. Consumer preferences for alternative fuel vehicles: A discrete choice analysis. Transp Res D Transp Environ 25, 5–17. [CrossRef]
- Hardman, S., Chandan, A., Tal, G., Turrentine, T., 2017. The effectiveness of financial purchase incentives for battery electric vehicles – A review of the evidence. Renewable and Sustainable Energy Reviews 80, 1100–1111. [CrossRef]
- Huang, Z., 1998. Extensions to the k-Means Algorithm for Clustering Large Data Sets with Categorical Values. Data Min Knowl Discov 12, 283–304he k-Means Algorithm for Clustering Large Data Sets with Categorical Values. Data Min Knowl Discov 12, 283–304.
- Jakobsson, N., Gnann, T., Plötz, P., Sprei, F., Karlsson, S., 2016a. Are multi-car households better suited for battery electric vehicles? - Driving patterns and economics in Sweden and Germany. Transp Res Part C Emerg Technol 65, 1–15. [CrossRef]
- Jakobsson, N., Karlsson, S., Sprei, F., 2016b. How are driving patterns adjusted to the use of a battery electric vehicle in two-car households?, in: Electric Vehicle Symposium.
- Jakobsson, N., Sprei, F., Karlsson, S., 2022. How do users adapt to a short-range battery electric vehicle in a two-car household? Results from a trial in Sweden. Transp Res Interdiscip Perspect 15. [CrossRef]
- Jensen, A.F., Mabit, S.L., 2015. MODELLING REAL CHOICES BETWEEN CONVENTIONAL AND ELECTRIC CARS FOR HOME-BASED JOURNEYS, in: Annual Transport Conference at Aalborg University.
- Jia, J., 2019. Analysis of Alternative Fuel Vehicle (AFV) Adoption Utilizing Different Machine Learning Methods: A Case Study of 2017 NHTS. IEEE Access 7, 112726–112735. [CrossRef]
- Karlsson, S., 2020. Utilization of battery-electric vehicles in two-car households: Empirical insights from Gothenburg Sweden. Transp Res Part C Emerg Technol 120. [CrossRef]
- Kim, E.J., 2021. Analysis of Travel Mode Choice in Seoul Using an Interpretable Machine Learning Approach. J Adv Transp 2021. [CrossRef]
- Kim, S., Gudmundur F. Ulfarsson, 2004. Travel mode choice of the elderly- effects of personal, household, neighborhood, and trip Characteristics. Transp Res Rec 1894, 117–126. [CrossRef]
- Li, X., Liu, C., Jia, J., 2019. Ownership and usage analysis of alternative fuel vehicles in the United States with the 2017 National Household Travel Survey data. Sustainability 11. [CrossRef]
- Liaw, A., Wiener, M., 2002. Classification and Regression by randomForest.
- Liu, J., Khattak, A.J., Li, X., Fu, X., 2019. A spatial analysis of the ownership of alternative fuel and hybrid vehicles. Transp Res D Transp Environ 77, 106–119. [CrossRef]
- Mandev, A., Sprei, F., Tal, G., 2022. Electrification of Vehicle Miles Traveled and Fuel Consumption within the Household Context: A Case Study from California, U.S.A. World Electric Vehicle Journal 13. [CrossRef]
- Musti, S., Kockelman, K.M., 2011. Evolution of the household vehicle fleet: Anticipating fleet composition, PHEV adoption and GHG emissions in Austin, Texas. Transp Res Part A Policy Pract 45, 707–720. [CrossRef]
- Myles, A.J., Feudale, R.N., Liu, Y., Woody, N.A., Brown, S.D., 2004. An introduction to decision tree modeling. J Chemom. [CrossRef]
- Ozhegov, E.M., Ozhegova, A., 2019. Heterogeneity in demand and optimal price conditioning for local rail transport. arXiv:1905.12859v1 [econ.EM]. arXiv:1905.12859v1 [econ.EM].
- Pandit, S., Gupta, S., 2011. A comparative study on distance measuring approaches for clustering. International Journal of Research in Computer Science 2, 29–31. [CrossRef]
- Parker, N., Breetz, H.L., Salon, D., Conway, M.W., Williams, J., Patterson, M., 2021. Who saves money buying electric vehicles? Heterogeneity in total cost of ownership. Transp Res D Transp Environ 96D Transp Environ 96. [CrossRef]
- Perez, J., 2020. The Fastest Cars You Can Buy From Every Automaker [WWW Document]. URL https://www.motor1.com/features/428317/fastest-cars-from-every-automaker/.
- Rousseeuw, P.J., 1987a. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis, Journal of Computational and Applied Mathematics.
- Rousseeuw, P.J., 1987b. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J Comput Appl Math 20, 53–65.
- en, Chitradeep., 2010. Performance analysis of batteries used in electric and hybrid electric vehicles. University of Windsor, Windsor, Ontario, Canada.
- Sherman, L., 1980. Implications of current household vehicle ownership and use patterns on the feasibility of electric cars. Transportation (Amst) 9, 209–227. [CrossRef]
- Shin, H.-S., Farkas, Z.A., Nickkar, A., 2019. An Analysis of Attributes of Electric Vehicle Owners’ Travel and Purchasing Behavior: The Case of Maryland, in: International Conference on Transportation and Development 2019: Innovation and Sustainability in Smart Mobility and Smart Cities. American Society of Civil Engineers, Reston, VA, pp. 77–90.
- Singh, Virender, Singh, Vedant, Vaibhav, S., 2020. A review and simple meta-analysis of factors influencing adoption of electric vehicles. Transp Res D Transp Environ 86. [CrossRef]
- Soltani-Sobh, A., Heaslip, K., Stevanovic, A., Bosworth, R., Radivojevic, D., 2017. Analysis of the Electric Vehicles Adoption over the United States. Transportation Research Procedia 22, 203–212212. [CrossRef]
- Srinivasa Raghavan, S., Tal, G., 2021. Behavioral and technology implications of electromobility on household travel emissions. Transp Res D Transp Environ 94. [CrossRef]
- Tamor, M.A., Milačić, M., 2015. Electric vehicles in multi-vehicle households. Transp Res Part C Emerg Technol 56, 52–60. [CrossRef]
- Tanaka, M., Ida, T., Murakami, K., Friedman, L., 2014. Consumers’ willingness to pay for alternative fuel vehicles: A comparative discrete choice analysis between the US and Japan. Transp Res Part A Policy Pract 70, 194–209. [CrossRef]
- Tiikkaja, H., Liimatainen, H., 2021. Car access and travel behaviour among men and women in car deficient households with children. Transp Res Interdiscip Perspect 10. [CrossRef]
- Tompkins, M., Bunch, D., Santini, D., Bradley, M., Vyas, A., Poyer, D., 1998. Determinants of alternative fuel vehicle in the continental united states choice. Transp Res Rec 1641, 130–138. [CrossRef]
- U.S. Department of Energy, 2013. Alternative Fuels Data Center.
- U.S. Department of Transportation, 2017. 2017 National Household Travel Survey.
- ergis, S., Chen, B., 2014. Understanding variations in U.S. plug-in electric vehicle markets, Transportation Research Board 94th Annual Meeting. Transportation Research Board, Davis, CA.
- Wang, F., Ross, C.L., 2018. Machine Learning Travel Mode Choices: Comparing the Performance of an Extreme Gradient Boosting Model with a Multinomial Logit Model. Transp Res Rec 2672, 35–45. [CrossRef]
- Yang, Y., Wang, C., Liu, W., Zhou, P., 2018. Understanding the determinants of travel mode choice of residents and its carbon mitigation potential. Energy Policy 115, 486–493. [CrossRef]
- Zhao, X., Yan, X., Yu, A., Van Hentenryck, P., 2020. Prediction and behavioral analysis of travel mode choice: A comparison of machine learning and logit models. Travel Behav Soc 20, 22–35. [CrossRef]
- Zulinski, J., 2018. U.S. Leads in Greenhouse Gas Reductions, but Some States Are Falling Behind [WWW Document]. URL https://www.eesi.org/articles/view/u.s.-leads-in-greenhouse-gas-reductions-but-some-states-are-falling-behind.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).