7.1. Answering Questions Through Data Visualization
This section leverages the R programming language along with the ggplot2 and plotly packages to effectively visualize data and derive insights.
User Engagement Analysis
1. How does the time spent on different page categories (Administrative, Informational, ProductRelated) correlate with revenue generation? Which pages are most effective at engaging users?
The line plot indicates that Returning Visitors show the highest levels of engagement, spending more time on the site compared to New Visitors and Others. Engagement peaks in December and March, likely influenced by seasonal promotions or special events. In contrast, New Visitors consistently exhibit lower engagement throughout the year, highlighting the need for strategies to convert them into active users.
Figure 4.
Monthly user engagement by visitor type.
Figure 4.
Monthly user engagement by visitor type.
The violin plot illustrates the distribution of Time_on_Site for sessions that resulted in revenue (Revenue = TRUE) and those that did not (Revenue = FALSE). Interestingly, sessions without revenue tend to show longer visit durations, suggesting that extended time on the site does not necessarily lead to purchases. Conversely, revenue-generating sessions are more concentrated around shorter visit durations, implying that efficient navigation—especially on product-related pages—is linked to successful transactions. Excessive time on informational or administrative pages might hinder conversions.
Figure 5.
Violin plot: Time_on_Site distribution across revenue.
Figure 5.
Violin plot: Time_on_Site distribution across revenue.
Demographic-Based User Behavior Analysis
2. How does user behavior differ across demographics like Region, Operating Systems, and Browser in terms of revenue generation and engagement? Are there significant variations in user interactions based on these features?
A bar plot examining revenue generation by browser reveals that users of Browser 2 contribute significantly more revenue compared to other browsers. Despite Browser 1 having a large user base, Browser 2 achieves a higher conversion rate, suggesting a more engaged or purchase-ready audience. Browsers 3 through 13 exhibit minimal revenue generation, indicating lower interaction or conversion potential.
Figure 6.
Revenue distribution across different web browsers.
Figure 6.
Revenue distribution across different web browsers.
Revenue generation across operating systems reveals that Operating System 2 leads in total revenue, followed by Systems 1 and 3. Although Operating System 1 has a larger user base, its conversion rate is lower, suggesting that high interaction does not always equate to successful purchases. Operating Systems 4 and beyond contribute minimally, indicating platform-specific differences in user engagement and conversion.
Figure 7.
Revenue analysis across operating systems.
Figure 7.
Revenue analysis across operating systems.
The box plot shows the time spent on the site by region, categorized by revenue generation. Region 1 has wide variability in session durations, with many non-revenue sessions. Regions 2 and 3, though associated with shorter sessions, show higher revenue rates. This suggests that productive sessions—rather than lengthy ones—drive conversions in certain regions.
Figure 8.
Time spent on site by region and revenue.
Figure 8.
Time spent on site by region and revenue.
A bar plot of regional revenue distribution shows that Region 1, despite having the highest user count, contributes a lower percentage to overall revenue, possibly due to low conversion efficiency. In contrast, Regions 2 and 3, with smaller user bases, demonstrate higher conversion rates, underscoring regional differences in user engagement and transaction behavior.
Figure 9.
Revenue generation across regions.
Figure 9.
Revenue generation across regions.
Evaluating the Impact of Visitor Type and Special Events on Revenue
3. What is the impact of being a New Visitor versus a Returning Visitor on revenue generation, especially during significant holidays or special days? Are returning visitors more likely to generate revenue during special events?
The distribution of visitor types highlights that Returning Visitors account for the majority of sessions. This trend suggests strong customer loyalty and frequent repeat interactions. In contrast, New Visitors constitute a smaller portion and show minimal impact on revenue, indicating lower initial engagement or familiarity with the platform.
Figure 10.
Visitor type distribution across sessions.
Figure 10.
Visitor type distribution across sessions.
The revenue contribution from visitor types further underscores that Returning Visitors are responsible for the vast majority of transactions. While New Visitors are more numerous in some instances, they account for a significantly smaller share of revenue, reinforcing the importance of customer retention and long-term engagement.
This visual illustrates disparities in revenue generation across New, Returning, and Other Visitor categories, incorporating the influence of weekend sessions. The plotted data reveals that Returning Visitors consistently contribute a significantly higher share of income on all weekdays. Both weekend and weekday interactions reflect enhanced engagement and monetary yield from Returning Visitors. In contrast, New Visitors contribute less financially, and their session behavior remains relatively unaffected by the weekend factor. This indicates that weekend timing does not substantially alter the conduct of New Visitors, while Returning Visitors consistently demonstrate high value in terms of revenue.
Figure 11.
Revenue Patterns Across Visitor Type and Weekend Behavior.
Figure 11.
Revenue Patterns Across Visitor Type and Weekend Behavior.
To summarize, regardless of temporal factors such as weekdays, weekends, or proximity to promotional events, the analysis consistently identifies Returning Visitors as primary contributors to earnings. Their greater session volumes and improved conversion efficiency reinforce the necessity of emphasizing user retention strategies. On the contrary, New Visitors exhibit a lower propensity for financial conversion. This suggests that while audience expansion remains relevant, sustained engagement with the existing user base yields superior revenue optimization, particularly during high-traffic periods.
Analyzing the Impact of Bounce and Exit Metrics 1. In what manner do elevated BounceRates and ExitRates influence revenue across different geographic areas? Are there certain regions that display inflated exit or bounce figures that warrant enhancements in layout or content?
This facet scatter visualization conveys the interaction between BounceRates and ExitRates, categorized by whether a session generated revenue. Sessions without financial output exhibit a pronounced correlation between elevated BounceRates and ExitRates, implying that users frequently abandon the platform early, likely due to inadequate engagement. Conversely, sessions linked to revenue display markedly reduced bounce and exit figures, signifying that prolonged interaction is conducive to conversions. These patterns suggest that mitigating bounce and exit tendencies may bolster user persistence and elevate transactional success.
Figure 12.
Analyzing Bounce Rates vs. Exit Rates by Revenue Segments.
Figure 12.
Analyzing Bounce Rates vs. Exit Rates by Revenue Segments.
This bubble chart presents the association between BounceRates and ExitRates across various territories, where bubble size represents geographic segmentation and color denotes revenue classification. A dense clustering of red bubbles—denoting non-revenue sessions—emerges in the upper BounceRate/ExitRate quadrant, underscoring a negative relationship with revenue production. In contrast, blue bubbles, representative of revenue-positive interactions, predominantly inhabit areas characterized by diminished bounce and exit rates. These trends indicate that high-metric regions merit immediate attention for interface or content refinement to improve conversion likelihood.
Figure 13.
Exploring Regional Bounce and Exit Rates with Revenue Differences.
Figure 13.
Exploring Regional Bounce and Exit Rates with Revenue Differences.
The patterns observed in both visualizations affirm that elevated BounceRates and ExitRates undermine revenue outcomes. Territories exhibiting such characteristics are generally associated with reduced session value. Consequently, refining site experience and enhancing user interaction are essential measures for improving retention and encouraging transactional behavior.
This temporal line graph, mapping BounceRate fluctuations across months, unveils several behavioral insights. There are observable peaks—reaching up to 0.20—in months such as December, February, and November, indicating diminished user retention and higher single-page exit tendencies. This trend may be indicative of seasonal disengagement or misaligned content offerings.
Conversely, months like May and September demonstrate sudden dips in bounce percentages, potentially reflecting the impact of successful marketing initiatives or improved navigational structures that sustained visitor interest.
These findings underscore the necessity of strategic content or interface adjustments during months with heightened bounce statistics. Meanwhile, low-bounce periods may reflect optimized conditions, offering benchmarks for replicable design and outreach practices in future cycles.
Figure 14.
Time Series of BounceRates Over Months.
Figure 14.
Time Series of BounceRates Over Months.
Figure 15.
Visualizing Traffic Source Counts Across Types.
Figure 15.
Visualizing Traffic Source Counts Across Types.
Figure 16.
Monthly Conversion Trends by Traffic Type.
Figure 16.
Monthly Conversion Trends by Traffic Type.
Figure 17.
Proportion of Traffic Sources and Their Revenue Contributions.
Figure 17.
Proportion of Traffic Sources and Their Revenue Contributions.
Figure 18.
Conversion from Traffic Type to Revenue Sessions.
Figure 18.
Conversion from Traffic Type to Revenue Sessions.
Figure 19.
Traffic Type Breakdown- Revenue vs Non-Revenue Sessions.
Figure 19.
Traffic Type Breakdown- Revenue vs Non-Revenue Sessions.
Evaluating Funnel Efficiency via Traffic Source Analysis
Traffic data shows that Types 1, 2, and 5 dominate visits, with Type 1 exceeding 4000 sessions, while many sources contribute marginally. Revenue analysis reveals Types 1, 2, and 3 as the most profitable, though they also experience notable attrition, particularly Type 2, which has high traffic but modest conversion rates. Underperformers such as Types 14, 15, and 19 generate minimal or no revenue, indicating early drop-offs.
Temporal trends highlight promotional spikes for certain channels (e.g., Type 2 in November), while treemap analysis confirms a concentration of revenue in a few high-yield streams. The disparity between monetized and non-monetized sessions within top sources underscores the need to optimize landing experiences and retention strategies for high-traffic but low-conversion channels, while improving targeting for low-performing sources.
Feature Association Analysis
The correlation heatmap shows that ProductRelated_Duration has a slight negative relationship with BounceRates (), suggesting longer product engagement reduces early exits, while its weak positive link with PageValues () indicates minimal direct value contribution. PageValues also inversely relate to BounceRates (), implying valuable content may improve retention. Duration features show small negative correlations with BounceRates, notably for Administrative_Duration () and Informational_Duration ().
Inter-feature correlations among durations (Administrative–ProductRelated, ; Informational–ProductRelated, ) reveal that users often explore multiple sections in one session. While no single metric is a strong predictor, combined duration patterns and low bounce tendencies subtly indicate higher purchase intent.
Revenue Attribution and Page Value Dispersion
The PageValues distribution shows most pages contribute minimally to revenue, while a small subset—high-yield pages—drives a disproportionate share of transactions, reflecting an 80/20 dynamic. Prioritizing and refining these key pages can significantly boost profitability without major structural changes.
Analysis of ProductRelated_Duration versus PageValues reveals that high-value content, even with short engagement times, correlates strongly with conversions. Non-revenue sessions often have low PageValues despite longer product interaction, underscoring that content quality outweighs duration. Strategically placing and optimizing high-value assets can enhance conversions more effectively than prolonging visits.
Figure 20.
Correlation Matrix of Behavioral and Revenue-Linked Features.
Figure 20.
Correlation Matrix of Behavioral and Revenue-Linked Features.
Figure 21.
Distribution of PageValues.
Figure 21.
Distribution of PageValues.
Figure 22.
Effect of Product-Page Interaction Time on Value by Revenue Group.
Figure 22.
Effect of Product-Page Interaction Time on Value by Revenue Group.
Figure 23.
Pie Chart for VisitorType.
Figure 23.
Pie Chart for VisitorType.
Figure 24.
Tracking Cumulative Revenue by Visitor Type Across Months.
Figure 24.
Tracking Cumulative Revenue by Visitor Type Across Months.
Figure 25.
Cohort Analysis- Revenue Generation by VisitorType Over Time.
Figure 25.
Cohort Analysis- Revenue Generation by VisitorType Over Time.
Cohort Evaluation:
Analysis shows that 85.6% of sessions come from returning users, 13.7% from newcomers, and 0.68% from others, underscoring the dominance of loyal customers in platform engagement and revenue. Monthly trends confirm that recurring users consistently drive the majority of sales, with peaks in May, July, and November, while new visitors contribute minimally.
These insights highlight the strategic value of loyalty programs, retention incentives, and personalized engagement to sustain and grow this core revenue base. At the same time, targeted onboarding strategies are needed to convert new visitors into repeat customers, expanding the high-value user segment over time.
Geolocation-Based Evaluation:
1. What regional disparities exist in revenue production, and which locales demonstrate potential for expansion?
The accompanying diagram reveals region-specific performance by showing the share of sessions that resulted in transactions. Notably, regions 1, 2, 5, and 9 exhibit superior conversion percentages, designating them as key markets with high efficiency in generating revenue. In contrast, locations such as 3, 6, and 8 display comparatively lower conversion ratios, suggesting an opportunity to enhance commercial outcomes. These underutilized areas might benefit from geographically personalized marketing efforts, upgraded user experiences, or region-specific engagement campaigns. Identifying and addressing these gaps could unlock untapped revenue potential and strengthen the brand’s geographic footprint.
Figure 26.
Exploring Revenue Proportions Across Geographic Regions.
Figure 26.
Exploring Revenue Proportions Across Geographic Regions.
Revenue Trends by Visitor Classification:
1. How does financial input differ between repeat and initial users month by month? What measures could boost engagement and sales conversion for new users?
A heatmap visualizes monthly revenue distribution across visitor categories, revealing key behavioral insights. Returning users consistently emerge as top contributors, with significant spikes observed during May and November—periods likely influenced by promotional events or seasonal campaigns. This pattern confirms the value of nurturing client loyalty through long-term engagement strategies.
On the other hand, the lighter tones associated with first-time users imply relatively low financial participation. This discrepancy highlights the importance of improving introductory experiences, such as guided navigation, targeted content, or welcome incentives, to elevate conversion potential. Meanwhile, the “Other” user segment generates negligible revenue, indicating minimal influence and the need for further segmentation or analytical attention. Overall, the matrix underscores the combined impact of visitor loyalty, event-driven seasonality, and onboarding efficacy in optimizing revenue flow.
Figure 27.
Revenue Generation by Visitor Type and Month.
Figure 27.
Revenue Generation by Visitor Type and Month.
User Interaction with Content and Revenue Strategy:
1. In what ways do user engagements with commercial and informative content affect bounce rates and sales metrics? What refinements could be introduced to enhance both experience and financial outcomes?
The scatter plot offers a multi-dimensional representation of page-level user engagement and its relationship with financial performance. Notably, users interacting with a greater volume of product-oriented pages—indicated by teal points—are more inclined to make purchases. However, the presence of users navigating through multiple product pages without converting (red points) suggests that the mere quantity of product exposure does not equate to transaction success. Content quality and contextual relevance play a vital role.
Moreover, moderate engagement with both product and educational content is associated with increased conversion likelihood. Users who explore both page types seem more informed and, thus, more decisive. Consequently, blending informational elements with product showcases may bolster buyer confidence and shorten the decision-making timeline.
The bubble sizes further indicate bounce tendencies; larger circles—denoting higher bounce rates—are frequently associated with non-converting users, while successful transactions often correlate with lower exit frequencies. This dynamic stresses the need for immersive and compelling content to retain attention and steer users toward purchase paths.
In summation, optimizing both content categories and balancing them effectively can improve the overall conversion funnel, lower bounce rates, and amplify revenue.
Figure 28.
Exploring the Interaction Between Product and Informational Pages with Revenue.
Figure 28.
Exploring the Interaction Between Product and Informational Pages with Revenue.