2. Materials and Methods
2.1. Methodology
The studies used both quantitative and qualitative methods to analyze the safety and zebra crossing topics such as [
19], [
13], and [
7]. In addition, other qualitative techniques were used commonly in terms of graphical analysis importantly photography based on time-lapse photos [
20,
21], and mapping of data through overlaying of graphical features [
22,
23]. The quantitative methods and techniques applied the structured observation to study behavioral patterns of the users [
9,
24], on the other hand, the quantitative methods used the statistical data to sort based on the software analysis [
1,
10,
25]. Particularly, Chi-square and regression tests applied to measure two independents variable due to the association for the hypothesis through statistical tests [
26,
27,
28].
2.2. Aims, Sampling and Scoping
In order to assess and portray safety challenges on pedestrian crossings, a survey was made on various crossing facilities in the City of Kigali. It has been designed in an observatory manner, to assess how drivers abide by the pedestrian crossing markings. It is also noted that Pedestrian interaction with any type of vehicles (Two or four wheel types) generates a serious conflict on roads while crossing at crosswalks [
29]
Since the study area for this research was under the geographical scope of the city of Kigali, a non-probability purposive sampling was used to select pedestrian crossings. The journey started by surveying Kigali major road networks, in its main corridors. High traffic and low traffic areas were purposively sampled. Safety risk is hypothetically present in either situation.
2.3. Research Design
There are many pedestrian crossing facilities that exist and this research is only limited to crossings that fall in the category of white stripes on black road. They are commonly referred to as Zebra crossings. In the first place, the main research team made a tour of Kigali and identified more than 50 zebra crossings, in 3 districts that make Kigali. Nyarugenge is the most densely populated, second to Gasabo and Kicukiro. For uniformity in the data collection process, observation sites were reduced to the number of 15. The following criteria were employed to reduce the number of sites from 50 to 15. Selected crossings were those limited to the following factors:
Existence of relatively high pedestrian mobility (Hospital, Market, School, Church);
Existence of stop lines (white lines that mark the beginning of a Zebra crossing);
No existence of Traffic Police or Traffic Cameras;
No road humps before or after the crossing;
No intersection around the crossing;
No round-about around the crossing;
Only for 1 carriage way, 2-way traffic;
For site randomization purposes, data collection sites were reduced from 15 to 10 using random probability sampling. This gave sampling an edge of statistical reliability and reduced purposive bias.
2.4. Observation Protocol
Data was collected using handheld cameras fixed on tripods. These cameras were positioned in an angle that can capture all pedestrian crossing events. Cameras were mounted on tripods, in hidden locations, to allow an almost purely anonymous observation. The local traffic safety law enforcement institution (Rwanda National Police) was informed of the survey for research ethics. In addition, a clearance letter from the University of Rwanda was passed to the City Council to reiterate the same.
Collectively, there were 10 data collectors, deployed to 10 data collection sites each. Data were collected for one week (7 days) during peak hours (6 - 8 AM and 5 - 8 PM). This research was therefore cross-sectional in nature. In bulk, total observation time counted to 280 hours. That is, 4 hours of recording per day, per site for seven days on 10 sites. Mathematically, it is can be as:
Every video was timestamped for further analysis. Geographical meta data such as altitude, wind speed, Global Positioning System (GPS) coordinates, micro maps and live location were appended to videos as well. These were enforced to help with data quality, reliability and future research. All data have been digitally archived on mass and cloud storage repositories.
2.5. Data Analysis
Using online questionnaire tools, to capture pedestrian crossing events, basic observation and replay of videos helped with spotting and noting down various traffic flow variables. These included, but were not limited to, type of vehicles, crossing times, stopping times, stopping distances, count of vehicles per crossing event and more. Zebra characteristics, mostly geometric, were also captured in a separate questionnaire. Tape measures, to the precision of 1cm, have been used to measure road marking characteristics, including, but not limited to, carriage way width, zebra lines lengths, distance between center lines and stopping lines, etc. Having collected an overwhelmingly large amount of data, this study will only seek to evaluate driver’s stopping behaviors with regard to various traffic flow characteristics.
2.6. Statistical Tests
To begin with, all data was stored in a central dataset to perform cleaning and critical joins. Using spatial dimensions attached to each entry, a spatial analysis was made. Using Tableau (Version 2021.4), spatial visualizations have been built to compare population densities, traffic densities and pedestrian crossing compliance on each surveyed site. Descriptive statistics have been used to portray safety challenges caused by vehicles. Cross tabulations and Pearson Chi-square tests have been used to evaluate probable dependencies of subject variables. To wind up the analysis part, Binary logistic regression models were built and have been used to evaluate driver’s stopping behaviors against traffic flow characteristics.
2.6.1. Chi-Square
Chi-square tests of independence test whether two qualitative variables are independent, that is, whether there exists a relationship between two categorical variables [
27]. If the test shows no association between the two variables (i.e., the variables are independent), it means that knowing the value of one variable gives no information about the value of the other variable [
26].
The Chi-square test of independence is a hypothesis test so it has a null (H0) and an alternative hypothesis (H1):
H0: the variables are independent, there is no relationship between the two categorical variables. Knowing the value of one variable does not help to predict the value of the other variable
H1: the variables are dependent, there is a relationship between the two categorical variables. Knowing the value of one variable helps to predict the value of the other variable
The calculation aspects of this test stems from the logic of Mathematical Probability. For two independents events A and B, the following probability equation should always hold true:
Therefore, the Chi-square test of independence works by comparing the observed frequencies (so the frequencies observed in your sample) to the expected frequencies if there was no relationship between the two categorical variables (so the expected frequencies if the null hypothesis was true). This difference, referred as the test statistic (or t-stat) and denoted by χ2, and is computed as follows:
O represents the observed frequencies and E the expected frequencies.
The test statistic alone is not enough to conclude for independence or dependence between the two variables. As previously mentioned, this test statistic (which in some sense is the difference between the observed and expected frequencies) must be compared to a critical value to determine whether the difference is large or small.
The critical value can be found in the statistical table of the Chi-square distribution and depends on the significance level, denoted α, and the degrees of freedom, denoted . The significance level is usually set equal to 5%.
Figure 2.
Chi-square Distribution.
Figure 2.
Chi-square Distribution.
Like for many other statistical tests, when the test statistic is larger than the critical value, the null hypothesis is rejected at the specified significance level.
2.6.2. Binary Logistic Regression
Collected data was also analyzed using binary logistic regression. This one can predict the probability (P) of a specific event when the dependent variable is dichotomous [
30] . In this study, the occurrence of a specific stopping behavior can be considered as a binary response variable. The driver will either stop or continue driving during a pedestrian crossing event. The log odds (logit) of P equals the natural logarithm of P/(1-P). Therefore, Binary logistic regression estimates the log odds of the independent variables as a linear combination as shown in the equation below:
In this equation, P is the probability of the behavior occurrence; X
n is the independent variable; and β
n is the logistic regression coefficient. For each research question, the significance value of 0.05 was accepted as an influential predictor variable. For each model, the odds ratio was calculated to determine if there is an increase (>1) or decrease (0–1) in the probability of the behavior when the independent variables increase with one unit [
31]. The Cox & Snell R Squared and Nagelkerke R Squared tests have been used to assess each model’s goodness of fit. The test determines the percentage of variance caused by independent variables in the equation [
28]. SPSS 25.0 has been used for all statistical computations.
3. Results
3.1. Zebra Characteristics
According to the
Figure 3, using satellite imagery, it is easy to track subject zebra crossings. Using remote sensing, on the 6m scale, on precise coordinates, it’s possible to estimate zebra crossing characteristics and track asymmetries. Generally, in road design, it’s normal to get non-uniform dimensions depending on terrain nature. The focus of this research is on investigating the relationship between zebra characteristics and vehicle’s (un)stopping behaviors. Stopping behaviors of vehicles are subject of pedestrian safety.
Figure 3.
From left to right, satellite visualization of data collection sites: 15, 14 and 12. The last mockup shows various dimensions that were measured using a tape measure. It’s already obvious that stopping line are not uniform on each zebra crossing.
Figure 3.
From left to right, satellite visualization of data collection sites: 15, 14 and 12. The last mockup shows various dimensions that were measured using a tape measure. It’s already obvious that stopping line are not uniform on each zebra crossing.
A pedestrian crossing can be characterized by various aspects. Some are inherently due to the way the road infrastructure has been set up. Other factors may depend of how markings have been drawn. In both cases, data collectors have collected important information using tape measures.
While sites have been trimmed down to granular uniformity details, recorded measurements still show that there are clear inconsistencies in Zebra characteristics in the City of Kigali. For instance, one site (Site 9) did not have a stop line one side of the lane. Zebra lines lengths also greatly vary in measure with μ=3.7m (mean) and σ=0.96m (standard deviation). Zebra lines lengths are arguably designed depending on pedestrian traffic on both ends of the road facility.
Table 1.
Zebra characteristics for selected sites. Quick analyses involving spark lines and bars have been done. Additional columns to show average measurements and standard deviations is shown as well. All are measured in meters.
Table 1.
Zebra characteristics for selected sites. Quick analyses involving spark lines and bars have been done. Additional columns to show average measurements and standard deviations is shown as well. All are measured in meters.
One other major characteristic with variability is the distance between stopping lines and zebra lines themselves. They were measured, on both lanes with maximum μ=2.9m and σ=1.25m. Zebra lines markings vary from country to country. Like all other road markings, there is no fixed or correct length. Most markings are adapted to the traffic situation, to address factors like overtaking possibility, design speed, and more. Therefore, design standards suggest acceptable ranges. Nonetheless, it is of great interests, to investigate whether zebra characteristics have any, or, are in anyway associated to drivers stopping behaviors in cases of pedestrian crossings. According to our observations, most measurements fell into acceptable ranges with some exceptions. This was partly due to the fact that data collection sites were hand-picked upon various factors.
Pedestrian walkways may affect pedestrian safety at pedestrian crossings. According to collected data, at least 3 out of 10 sites, did not have a pedestrian walkway on either of its lanes. It’s unknown whether this characteristic has a positive or negative stopping behavior toward drivers, but it’s arguably a non-ignorable safety risk for pedestrians. In most cases, when pedestrians are crossing, they tend to feel safer when the crossing destination is safer. The lack of pedestrian walkways in some roads therefore poses a significant hazard toward pedestrian safety. At worst cases, the lack of walkways was associated with existence of non-covered water canals and drenches. It’s therefore undeniably risky to cross over such a facility.
3.2. Spatial Analysis
This study was carried out in the capital city of Rwanda, Kigali. This analysis aims to visualize traffic flow on zebra crossings across the capital. Data collection sites were purposively distanced.
Figure 4.
From the left: Data collection sites on Kigali Map, Vehicle frequencies by type on stopping occasions, Vehicle activity when pedestrian are crossing on each site.
Figure 4.
From the left: Data collection sites on Kigali Map, Vehicle frequencies by type on stopping occasions, Vehicle activity when pedestrian are crossing on each site.
Spatial analysis data almost corresponds to population density map portrayed earlier in
Figure 1. Transport by foot being the most used mode of transport to reach workplaces and schools, it shows on
Figure 3. It is as, for relatively dense areas, there will be a lot of pedestrian traffic, thus prompting multiple vehicles to stop on crossing occasions. This is particularly true for site 2 and 4.
On a second glance, it’s almost noticeable that buses and bicycles are rarely spotted, which motorcycles and cars are in abundance. For dense areas, motorcycles do greatly outnumber cars. One of the reasons is arguably due to the economic profile of densely populated households. In places where there are a few residents, for instance, site 9 and 6, there will be relatively more cars since residents are more affluent.
3.3. Descriptive Analysis
The collection of data was long and interesting. Over 22,000 vehicles were identified in 10,259 stopping occasions. Across 10 sites; drivers, vehicles, pedestrians and road characteristics were different. This variety of situations came to interesting findings summarized in the following table:
Table 2.
Cross tabulation of major frequencies.
Table 2.
Cross tabulation of major frequencies.
Variable Name |
Evening |
Morning |
Totals |
|
N |
% |
N |
% |
N |
% |
|
11,354 |
(51.5) |
10,704 |
(48.5) |
22,058 |
(100) |
Did the vehicle stop? |
|
|
|
|
|
|
No |
9,124 |
(41.4) |
9,049 |
(41.0) |
18,173 |
(82.4) |
Yes |
2,230 |
(10.1) |
1,655 |
(7.5) |
3,885 |
(17.6) |
Vehicle Type |
|
|
|
|
|
|
Bicycle |
95 |
(0.4) |
245 |
(1.1) |
340 |
(1.5) |
Motorbike |
7,499 |
(34.0) |
7,366 |
(33.4) |
14,865 |
(67.4) |
Bus |
89 |
(0.4) |
116 |
(0.5) |
205 |
(0.9) |
Car |
3,428 |
(15.5) |
2,401 |
(10.9) |
5,829 |
(26.4) |
Stopping Distance |
|
|
|
|
|
|
Breached |
9,124 |
(41.4) |
9,049 |
(41.0) |
18,173 |
(82.4) |
< 0.5 Meters |
142 |
(0.6) |
109 |
(0.5) |
251 |
(1.1) |
< 1.0 Meters |
120 |
(0.5) |
48 |
(0.2) |
168 |
(0.8) |
< 1.5 Meters |
147 |
(0.7) |
110 |
(0.5) |
257 |
(1.2) |
< 2.0 Meters |
344 |
(1.6) |
148 |
(0.7) |
492 |
(2.2) |
< 2.5 Meters |
296 |
(1.3) |
162 |
(0.7) |
458 |
(2.1) |
< 3.0 Meters |
429 |
(1.9) |
299 |
(1.4) |
728 |
(3.3) |
< 3.5 Meters |
265 |
(1.2) |
370 |
(1.7) |
635 |
(2.9) |
Greater than 4m |
487 |
(2.2) |
409 |
(1.9) |
896 |
(4.1) |
Week Day |
|
|
|
|
|
|
Monday |
1,230 |
(5.6) |
1,175 |
(5.3) |
2,405 |
(10.9) |
Tuesday |
1,400 |
(6.3) |
1,690 |
(7.7) |
3,090 |
(14.0) |
Wednesday |
1,536 |
(7.0) |
2,064 |
(9.4) |
3,600 |
(16.3) |
Thursday |
2,020 |
(9.2) |
1,937 |
(8.8) |
3,957 |
(17.9) |
Friday |
1,327 |
(6.0) |
1,863 |
(8.4) |
3,190 |
(14.5) |
Saturday |
2,241 |
(10.2) |
1,224 |
(5.5) |
3,465 |
(15.7) |
Sunday |
1,600 |
(7.3) |
751 |
(3.4) |
2,351 |
(10.7) |
At a glance, zebra compliance is arguably in a bad shape and safety at pedestrian crossings is not the best. In all observed occasions, from Monday to Sunday, in morning and evening peak hours, only 17.6% of drivers stopped to allow pedestrian to cross. Everyone else just continued to drive while pedestrians are crossing, not to mention that those who stopped didn’t necessary stop far away from pedestrians. According to recorded observations, only 896 vehicles (4.1%) stopped at least 4m away from the Zebra center line.
The survey has identified four types of vehicles. Big ones on more than four wheels (Cars and Buses) and two-wheeled ones (Motorcycles and Bicycles). In terms of frequency, it’s clear that motorcycles are the most present vehicles on surveyed roads, with a prominence of 14,865 counts (67.4% of all surveyed vehicles). Cars come second with about a quarter prominence while bicycles and buses are rare with a collective negligible percentage of about 1.5% and 0.9 % respectively.
Figure 5.
Integral visualization of vehicle stopping behavior on all sites throughout the week.
Figure 5.
Integral visualization of vehicle stopping behavior on all sites throughout the week.
In this study, researchers tried to visualize four variables of traffic flow and pedestrian crossing events in one graph. These include: vehicle types, days of the week, peak hour periods (hours, minutes and seconds) and stopping distances. Advanced statistical and data computational tools have been used to come up with such a graphs
Figure 4 shown above, is a clear and integral visualization of pedestrian safety on zebra crossings in Kigali. From the left, on the upper axis, there are consecutive days of the week when the survey was carried. On the bottom x-axis, there are stopping distances per zebra crossings. Distance 0 is recorded for every vehicle which crossed the center line of the zebra, thence breaching safety regulations and putting pedestrians lives at risk.
On the Y-axis, the graph portrays traffic peak hours in a continuous fashion. By trimming out outliers and other edge observations, the graph shows data from the morning at 6:00 AM till 7:00 PM in the evening. A 4 – semi-transparent color legend variable has been used to portray vehicle type, while circle radii were used to showcase the density of the same.
On the granular level, there are obvious observations that can be noted. For instance, on the traffic flow aspect, there are a few vehicles in weekend mornings. Same is for a Wednesday evening, nevertheless this could have been an anomaly since it rained heavily in that evening halting pedestrian mobility and data collection altogether.
In addition, on the first day of recording data, observers and data analysts were at first agnostic of vehicle type, until when they realized that vehicles behave differently upon various crossing events. As shows the videos, two wheeled vehicles are likely to continue moving while pedestrian are crossing, even when four-wheeled (or more) ones have stopped to allow safe passage of pedestrians. From this point forward, vehicle types were recorded as part of the survey.
3.4. Stopping Behaviors
To compute various equation elements of the Chi-Square equation, the following table has been made:
Table 3.
Cross Tabulation and Chi-Square Values (Expected values in Bold).
Table 3.
Cross Tabulation and Chi-Square Values (Expected values in Bold).
Did the vehicle stop? |
No |
Yes |
Totals |
|
N |
% |
N |
% |
N |
% |
|
18173 |
(82.4) |
3885 |
(17.6) |
22058 |
(100.0) |
1.Peak Hour Period – X2 (1) =66.3, p<.001 |
|
|
|
|
Evening |
9124 |
(41.4) |
(42.4) |
2230 |
(10.1) |
(9.1) |
11354 |
(51.5) |
Morning |
9049 |
(41.0) |
(40.0) |
1655 |
(7.5) |
(8.5) |
10704 |
(48.5) |
2.Vehicle Type – X2 (4) =2774, p<.001 |
|
|
|
|
|
Bicycles |
323 |
(1.5) |
(1.3) |
17 |
(0.1) |
(0.3) |
340 |
(1.5) |
Motorcycles |
13519 |
(61.3) |
(55.5) |
1346 |
(6.1) |
(11.9) |
14865 |
(67.4) |
Buses |
131 |
(0.6) |
(0.8) |
74 |
(0.3) |
(0.2) |
205 |
(0.9) |
Cars |
3522 |
(16.0) |
(21.8) |
2307 |
(10.5) |
(4.7) |
5829 |
(26.4) |
Null |
678 |
(3.1) |
(3.1) |
141 |
(0.6) |
(0.7) |
819 |
(3.7) |
3.Day of the Week – X2 (6) =259.2, p<.001 |
|
|
|
|
Monday |
1897 |
(8.6) |
(9.0) |
508 |
(2.3) |
(1.9) |
2405 |
(10.9) |
Tuesday |
2391 |
(10.8) |
(11.5) |
699 |
(3.2) |
(2.5) |
3090 |
(14.0) |
Wednesday |
2898 |
(13.1) |
(13.4) |
702 |
(3.2) |
(2.9) |
3600 |
(16.3) |
Thursday |
3197 |
(14.5) |
(14.8) |
760 |
(3.4) |
(3.2) |
3957 |
(17.9) |
Friday |
2625 |
(11.9) |
(11.9) |
565 |
(2.6) |
(2.5) |
3190 |
(14.5) |
Saturday |
3107 |
(14.1) |
(12.9) |
358 |
(1.6) |
(2.8) |
3465 |
(15.7) |
Sunday |
2058 |
(9.3) |
(8.8) |
293 |
(1.3) |
(1.9) |
2351 |
(10.7) |
4.Vehicle Density – X2 (2) =181.3, p<.001 |
|
|
|
|
Low |
3039 |
(13.8) |
(14.3) |
788 |
(3.6) |
(3.1) |
3827 |
(17.3) |
Medium |
4426 |
(20.1) |
(18.6) |
564 |
(2.6) |
(4.0) |
4990 |
(22.6) |
High |
10708 |
(48.5) |
(49.5) |
2533 |
(11.5) |
(10.6) |
13241 |
(60.0) |
5.Pedestrian Density – X2 (3) =138.4, p<.001 |
|
|
|
|
Less Dense |
6327 |
(28.7) |
(27.6) |
1061 |
(4.8) |
(5.9) |
7388 |
(33.5) |
Dense |
1138 |
(5.2) |
(5.3) |
291 |
(1.3) |
(1.1) |
1429 |
(6.5) |
Very Dense |
4863 |
(22.0) |
(23.2) |
1353 |
(6.1) |
(5.0) |
6216 |
(28.2) |
Extremely Dense |
5845 |
(26.5) |
(26.2) |
1180 |
(5.3) |
(5.6) |
7025 |
(31.8) |
Model Fit
A logistic regression was conducted to determine if five traffic flow variables (Vehicle type, peak hour period, day of the week, vehicle density and pedestrian density) are able to predict stopping behaviors of drivers to allow pedestrian use zebra crossings in safety. The results of the logistic regression were highly statistically significant, with X (13) =2954.6, p<.001, using the Omnibus Test.
Effect size
The variance in driver’s stopping behaviors predicted by traffic flow variables in the model ranges from 12.5% to 20.7% based on the Cox & Snell R Square or Nagelkerke R Square, respectively.
Classification Table
Without any predictor, the model would have predicted 82.4% of the stopping behavior correctly. On the flipside, with independent variables included, this number increases slightly with 0.50 basis points to 82.9%. Some statisticians refer this to percentage accuracy(PAC) in classification too.
Sensitivity
Percentage of cases in which Stopping at a Zebra crossing was observed ("yes") and were correctly predicted by the model. 9.5% of drivers who stopped were predicted by the model to have stopped. The positive predictive value is the percentage of correctly predicted cases with the observed characteristic (those who stopped) compared to the total number of cases that were predicted to stop. That is, of all cases predicted to stop, 93.7% were correctly predicted.
Specificity
Percentage of cases in which Stopping at a Zebra crossing was observed ("no") and were correctly predicted by the model. 98.6% of drivers who breached the zebra crossing were predicted by the model to have violated. The negative predictive value is the percentage of correctly predicted cases with the observed characteristic (those who breached) compared to the total number of cases that were predicted to breach. That is, of all cases predicted to violate, 99.9% were correctly predicted.
Table 4.
Logistic regression equation variables and their statistical significance.
Table 4.
Logistic regression equation variables and their statistical significance.
|
|
|
|
|
|
|
Lower |
Upper |
Variable |
B |
S.E. |
Wald |
df |
Sig. |
Exp(B) |
95% C.I. - Exp(B) |
Intercept |
-3.390 |
0.266 |
162.468 |
1 |
.000 |
0.034 |
|
|
1. Peak Hour Period |
-0.311 |
0.040 |
59.095 |
1 |
.000 |
0.733 |
0.677 |
0.793 |
2.Vehicle Type |
|
|
|
|
|
|
|
|
Bicycle (1) |
|
|
2289.644 |
4 |
.000 |
|
|
|
Moto |
0.415 |
0.252 |
2.704 |
1 |
.100 |
1.514 |
0.924 |
2.481 |
Bus |
2.208 |
0.290 |
57.838 |
1 |
.000 |
9.094 |
5.148 |
16.064 |
Car |
2.341 |
0.252 |
86.486 |
1 |
.000 |
10.389 |
6.343 |
17.014 |
Null |
0.992 |
0.273 |
13.157 |
1 |
.000 |
2.696 |
1.578 |
4.609 |
3.Day of the Week |
|
|
|
|
|
|
|
|
Monday (1) |
|
|
168.105 |
6 |
.000 |
|
|
|
Tuesday |
-0.078 |
0.081 |
0.923 |
1 |
.337 |
0.925 |
0.789 |
1.085 |
Wednesday |
-0.162 |
0.080 |
4.061 |
1 |
.044 |
0.851 |
0.727 |
0.996 |
Thursday |
-0.269 |
0.079 |
11.601 |
1 |
.001 |
0.764 |
0.655 |
0.892 |
Friday |
-0.276 |
0.083 |
10.936 |
1 |
.001 |
0.759 |
0.645 |
0.894 |
Saturday |
-0.884 |
0.088 |
100.973 |
1 |
.000 |
0.413 |
0.348 |
0.491 |
Sunday |
-0.599 |
0.093 |
41.426 |
1 |
.000 |
0.549 |
0.458 |
0.659 |
4.Vehicle Density* |
0.671 |
0.044 |
233.313 |
1 |
.000 |
1.956 |
1.795 |
2.132 |
5.Pedestrian Density*
|
-0.494 |
0.056 |
77.360 |
1 |
.000 |
0.610 |
0.547 |
0.681 |
Model Summary
The model (traffic variables name them) explained between 12.5% (Cox & Snell R2) and 20.7% (Nagelkerke R2) of the variance in Driver’s stopping behaviors and correctly classified 82.9% of cases. Sensitivity was 9.5% and Specificity was 98.6%. The positive predictive value was 93.7%. and the negative predictive value was 99.9%.
4. Discussion
Based on the results of the Nagelkerke R2 as an adjusted version of the Cox & Snell R2 to adjust the scale of the statistic to cover the full range from 0 to 1. By the Value of around 21% change in the dependent variables (Driver’s Stop or Driver’s Non-stop) can be accounted to the independent variables in the model. Only about 13% to 21% of recorded traffic variables were able to predict the variance in stopping behaviors model, according to the Cox & Snell and Nagelkerke R2 values, respectively.
The model sensitivity also called as the true positive rate of 9.5% here, represents the percentage of the observed cases to fall under the category of driver’s behavior to stop (Target group). In brief the drivers rate of 9.5% will be predicted to stop at zebra crossings to let pedestrians to cross.
The model specificity also known as the true negative rate of 98.6% standing for the percentage of the observed behavior of the driver not to stop but to breach traffic rules shows the fall under the non-target category. The drivers’ behavior at a rate of 98.6% will be predicted not to stop but breaching at the zebra crossings.ie preventing the pedestrian’s rights to cross.
4.1. Effect of Peak Hour Period
As per the statistical raw data in
Table 3 the peak hour period plays a minor role under descriptive statistics. Driver’s behaviors to give access to pedestrians to cross is better and more during evening times than mornings rated at. This may due to the fact that the evening period people are no longer stressed by the duty reporting delay but going at home. Hence still the probability of giving access to pedestrians is below 1 as per the odds ratio of 0.733 from
Table 4.
4.2. Effect of Vehicle type
Vehicles under this research, vehicles were classified under two broad categories of four wheel vehicles (Cars and buses) and two wheel vehicles (Motorcycles and bicycles). Statistically Motorcycles represent 67.4%, Buses cover less than 1%, bicycles have 1.5% and Cars counts 26.4% of the vehicle population and all have different impact on the traffic flow characteristics specifically for motorcycles and cars based on their volume and effect on the predicted driver’s behavior model.
Cars, buses and motorcycles (moto) have their respective odds ratio of 10.389, 9.094 and 1.514 respectively greater than 1; hence the probability of falling into the target group to give access to crossing pedestrians is greater. Vehicle type has a considerable effect on driver’s behavior to the model and give access to crossing pedestrians.
4.3. Effect of Week Days
Considering weekdays and their effect on the driver’s behavior, the model shows that the odds ratio varies between 0.413 to 0.925 and still are less than one (1) reason why the probability of falling in to target group of stopping and giving access to pedestrian is very less and limited regardless of any day of the week. It can be noted that vehicle flow number during the week was found almost uniformly distributed from Tuesday to Saturdays and present an abrupt fall of around 3% on Sunday as per
Table 3. This may be related to weekend break/off of activities.
4.4. Effect of Vehicle Density
Vehicle density is expressed in terms of number of vehicles per kilometer length of a roadway. Referring to
Table 3 and Considering High vehicle densities informs that individual vehicles are very close to each other, while low densities indicate that greater distances between vehicles is experienced. In general practice high density vehicles will move slowly with a great chance to stop at any time, giving right to pedestrians to cross (10.6%), but low vehicle density implies that vehicles are more spaced and tend to move at high speed with little chance of stopping (3.1%) and give access. As per our research outcome, it is realized that the odds ratio of this independent variable is 1.956 that is greater than 1, giving the probability of falling under the target group of stopping and give access to crossing pedestrian being greater, hence the vehicle density being low, medium and high affect the driver’s behavior.
4.5. Effect of Pedestrian Density
The Pedestrian density was estimated based on the counting of number of people crossing at a particular road width in a crossing event and classified as Less Dense, Dense, Very Dense and Extremely Dense. As per this research the odds ratio for the pedestrian density variable is 0.610 i.e. less than 1 and indicates that the probability of drivers’ behavior to stop is minimum/less.
5. Conclusions
The analysis of the relationship between the dependent variables of Driver’s behaviors of “stopping” and “not stopping” to give access to pedestrians to cross based on independent variable Traffic characteristics of peak hour period, vehicle type, day of the week, vehicle density and pedestrian density in the City of Kigali showed the followings:
Week days is found to be the less insignificant traffic independent variables to affect the driver’s behavior as per the odds ratio of 0.413;
Pedestrian density could not really show its significance to the model due to the lack of information about its classification as far as traffic classification is concerned, but it showed that its contribution is insignificant. Further research is needed to look in to its specific categorization;
Peak hours could play an importance in analysis of the driver’s behavior and this necessitates further research as morning weekends of data collection show some critical information to share;
Vehicle Density and types are very important independent variable characteristics that affect greatly the driver’s behavior to give access to pedestrian to cross and that research has proved that in the model by displaying the odds ratios greater than 1;
Therefore, further research is encouraged to uncover other relevant parameters that would influence a driver to stop or not to stop. These may include external factors like infrastructure characteristics or internal factors that may be unique for each driver.