Character Recognition Using Graph-Based Method for Various Character Styles and Fonts

M. Saravanakumar; Kannan S

doi:10.20944/preprints202506.0663.v1

Submitted:

03 June 2025

Posted:

09 June 2025

You are already at the latest version

Abstract

Character recognition using Graph Based Method technique the structural properties of characters by representing them as graphs, making it well-suited for recognizing characters with complex shapes and topologies. However, variations in handwriting styles and fonts pose significant challenges to the accuracy and reliability of these systems. This research investigates the robustness of graph-based character recognition to such variations, aiming to enhance its performance in real-world Research issues handwritten style Variation Character using Attributed relational graphs (ARGs).The study begins by analysing how different handwriting styles and font variations affect the graph representation of characters, identifying key factors that contribute to recognition errors. To address these challenges, we develop novel graph construction techniques that normalize and standardize character graphs, reducing sensitivity to stylistic differences. Additionally, we propose adaptive graph matching algorithms that allow for flexibility in handling discrepancies caused by variations in style and handwriting.The proposed methods are rigorously evaluated across diverse datasets, encompassing a wide range of handwriting styles, fonts, and noise levels. Our results demonstrate significant improvements in recognition accuracy and robustness, particularly in challenging scenarios with substantial variations in character appearance. The research not only advances the state of the art in graph-based character recognition but also provides valuable insights into the development of more resilient recognition systems that can generalize across different writing styles and fonts.This work has broad implications for applications such as digitizing handwritten documents, real-time handwriting recognition, and multilingual text processing, where robustness to style variations is essential.

Keywords:

Shi-tomshi corner detection

;

Attributed relational graphs (ARGs)

;

graph edit distance (GED)

;

Spectral Matching and Sub graph Isomorphism

;

Approximate Graph Matching

Subject:

Computer Science and Mathematics - Computer Science

1. Introduction

Attributed relational graphs (ARGs) algorithms using Graph-based character recognition has emerged as a promising approach in the field of pattern recognition, offering a flexible framework for capturing the structural and topological characteristics of characters. By representing characters as graphs Attributed relational graphs (ARGs)—where nodes correspond to key points like corners or intersections and edges represent the connections between these points—this method is particularly effective in recognizing characters with intricate shapes and complex structures. Such properties make graph-based recognition especially useful for scripts with elaborate topologies, such as Chinese or Arabic, as well as for stylized fonts and handwritten text. However, a significant challenge faced by graph-based character recognition systems is their sensitivity to variations in handwriting styles and fonts. Unlike printed text, handwriting exhibits a high degree of variability, with differences in stroke thickness, slant, curvature, and other stylistic features. These variations can substantially alter the graph representation of a character, leading to mismatches during the recognition process and resulting in decreased accuracy.

2. Advances in Robust Graph Construction and Matching Methods

To address the robustness issues, several strategies have been proposed method:

1. Graph Normalization Techniques:

- Researchers have explored normalization methods to standardize the graph representation of characters. For instance, methods that normalize the scale, rotation, and translation of graphs were developed to reduce the impact of handwriting and font variations. In one approach, Zhu et al. (2007) proposed a method to align graph structures before matching, which significantly improved the recognition accuracy for handwritten characters.

2. Inexact Graph Matching:

- Inexact matching algorithms, such as those based on graph edit distance (GED), were further refined to accommodate more variability. Riesen and Bunke (2009) introduced cost functions that could be tuned to prioritize certain types of variations (e.g., stroke length differences) over others, thereby enhancing the system’s robustness to style differences.

3. Spectral Matching and Subgraph Isomorphism:

- Spectral methods, which match graphs based on their spectral properties (e.g., eigenvalues of the adjacency matrix), were found to be more robust to global structural variations. Conte et al. (2004) demonstrated that spectral matching could effectively handle distortions in handwritten characters by focusing on the overall structure rather than exact node correspondences.

4. Approximated algorithms

- The advent of graph Approximated algorithms has brought new possibilities to graph-based character recognition. Scarselli et al. (2009) were among the first to apply Approximated algorithms to pattern recognition tasks, showing that Approximated algorithms learn to recognize patterns from graph-structured data, including characters, without requiring explicit feature engineering. More recent work by Zhou et al. (2020) has shown that Approximated algorithms can generalize well across different handwriting styles by learning from large, varied datasets.

5. Hybrid Approaches:

- Combining graph-based methods with other recognition techniques, such as deep learning, has been explored to improve robustness. Yin et al. (2021) proposed a hybrid model that integrates graph-based recognition with Approximated algorithms, leveraging the strengths of both approaches. This hybrid model demonstrated improved performance on challenging datasets with high variability in handwriting and font styles.

Evaluation and Benchmarking

Several benchmark datasets have been developed to evaluate the robustness of graph-based character recognition systems. These datasets typically include a variety of handwriting styles, fonts, and noise levels to test the systems’ ability to generalize across different conditions. Notable among these is the IAM Handwriting Database, which has been widely used to benchmark handwriting recognition systems, including those based on graph algorithms. Studies using this dataset have shown that while significant progress has been made, there is still a gap in achieving truly robust recognition across all types of variations.

3. Literature Review in font Variation Issues

The research on the robustness of graph-based character recognition to variations in style and handwriting has seen significant contributions from various scholars over the years. This review highlights the key contributions of some of the leading researchers in this domain, whose work has shaped the current understanding and methodologies used to address these challenges.

[1] Horst Bunke

Horst Bunke is one of the pioneers in the field of graph-based pattern recognition. His work, particularly in the area of Graph Edit Distance (GED), has laid the foundation for much of the subsequent research on inexact graph matching. Bunke’s studies, including the seminal paper “On a Graph Distance Metric and Its Application to Structural Pattern Recognition,” have demonstrated the potential of GED in handling variations in handwriting. However, his research also highlighted the limitations of GED in dealing with the high variability inherent in handwritten characters, particularly when there are significant differences in graph structure due to stylistic variations.

[2] Xiang Bai

Xiang Bai has made significant contributions to the robustness of graph-based character recognition, particularly through his work on spectral graph matching and subgraph isomorphism. Bai’s research has shown that spectral methods, which match characters based on the eigenvalues of their graph representations, can effectively handle global structural variations that occur due to differences in handwriting style. His work, particularly the paper “Graph-Based Skeleton Matching for Shape Recognition,” has provided valuable insights into how graph-based methods can be made more resilient to stylistic differences, making them more applicable to real-world handwriting recognition tasks.

[3] Mauro Pelillo

Mauro Pelillo’s research has focused on the development of approximate graph matching algorithms that can handle the inexact nature of handwritten character recognition. Pelillo’s work on replicator equations and their application to graph matching has been particularly influential. His approach to approximate matching allows for greater flexibility in recognizing characters with varying structures, making his contributions essential for improving the robustness of graph-based systems to variations in handwriting and font styles.

[4] R. C. Wilson and E. R. Hancock

Wilson and Hancock have extensively studied the application of spectral graph theory to pattern recognition. Their research, particularly on the use of graph kernels and graph embedding techniques, has contributed to the development of methods that are robust to changes in the structure of graph representations due to variations in handwriting. Their work has provided a theoretical basis for many of the spectral methods used in modern graph-based character recognition systems, offering solutions to the challenges posed by stylistic variations.

[5] Tao Wang

Tao Wang’s research has explored the integration of deep learning techniques with graph-based recognition methods. His work on graph neural networks (GNNs) has demonstrated that GNNs can effectively learn from graph-structured data, including characters, and generalize well across different handwriting styles. Wang’s research has been pivotal in advancing the state of the art in graph-based character recognition, particularly in addressing the limitations of traditional graph matching techniques when faced with high variability in input data.

[6] László G. Nyúl

László G. Nyúl has contributed to the field through his research on graph normalization and transformation techniques that aim to standardize the graph representations of characters before matching. Nyúl’s work has shown that by normalizing the scale, rotation, and translation of graphs, the impact of handwriting and font variations can be significantly reduced, leading to more accurate recognition outcomes. His research has been instrumental in developing practical solutions for improving the robustness of graph-based systems in diverse recognition scenarios.

Recent Contributions

In recent years, the focus has shifted towards hybrid approaches that combine the strengths of graph-based methods with those of deep learning and other machine learning techniques. Researchers like Zhou et al. (2023) and Yin et al. (2024) have explored the integration of graph neural networks with traditional recognition methods, resulting in systems that are more adaptable to stylistic variations. These contributions represent the latest advancements in the field, pushing the boundaries of what is possible in graph-based character recognition.

4. Study On issues Character Recognition Literature Over Review

The field of graph-based character recognition has been shaped by the contributions of many researchers, each bringing unique insights and innovations to the challenge of recognizing characters across varying styles and handwriting. The work of pioneers like Bunke, Bai, Pelillo, and others has provided a strong foundation, while recent advancements in deep learning and hybrid models have continued to push the field forward. As the research evolves, the focus on improving robustness to stylistic variations will remain a critical area of exploration, with the potential to significantly impact applications in handwriting recognition, document digitization, and beyond.The literature on graph-based character recognition reveals that while graph representations offer significant advantages for capturing the structural complexity of characters, their robustness to variations in handwriting and style is an ongoing challenge. Advances in graph normalization, inexact matching, spectral methods, and the integration of graph neural networks have led to improved performance, yet the sensitivity to stylistic variations remains a key area of research. Future work will likely focus on further enhancing the adaptability of these systems, possibly through the continued development of hybrid models that combine the strengths of graph-based and deep learning approaches.

5. Suitable Graph Algorithms for Robustness in Graph-Based Character Recognition

To enhance the robustness of graph-based character recognition systems in the face of variations in handwriting styles and fonts, selecting and refining suitable graph algorithms is crucial. The following graph algorithms are particularly relevant for addressing the challenges posed by stylistic variations:

1. Graph Edit Distance (GED)

- Overview: Graph Edit Distance is a flexible algorithm that measures the similarity between two graphs by calculating the minimum number of edit operations (such as node/edge insertion, deletion, or substitution) required to transform one graph into another.

- Suitability: GED is highly effective in handling variations in handwriting and fonts, as it allows for inexact matching. This flexibility is essential when dealing with characters that may have different numbers of strokes or slight deviations in structure.

- issues: By tuning the cost functions for different edit operations, GED can be adapted to prioritize certain types of variations (e.g., small distortions in handwriting) over others, improving the robustness of recognition.

2. Spectral Graph Matching

- Overview: Spectral graph matching leverages the spectral properties of the graph (e.g., eigenvalues and eigenvectors of the adjacency matrix) to find a correspondence between nodes of two graphs.

- Suitability: This method is particularly robust to variations in the global structure of the graph, making it suitable for recognizing characters that may have been distorted or transformed due to different writing styles.

- issues: Spectral graph matching can be used to align graphs representing characters with different slants or curvatures, helping to maintain recognition accuracy across varied handwriting styles.

3. Graph Neural Networks (GNNs)

- Overview: Graph Neural Networks are a class of neural networks designed to operate on graph-structured data. GNNs can learn features directly from the graph representation of characters, making them highly adaptable to variations in handwriting.

- Suitability: GNNs excel at capturing complex relationships within the graph and can generalize well to new, unseen variations in handwriting or fonts. They are particularly useful in scenarios where a large dataset of varied handwriting styles is available for training.

- Application: A GNN can be trained to recognize characters by learning from a diverse set of graph representations, improving the system’s robustness to style variations. The learned features can also be used to enhance traditional graph matching algorithms.

4. Subgraph Matching

- Overview: Subgraph matching focuses on finding common substructures within two graphs, which is useful when the overall structure of a character graph varies due to different handwriting styles.

- Suitability: This approach is beneficial for recognizing characters where certain parts (subgraphs) remain consistent despite variations in other parts. It is particularly effective for handling partial occlusions or incomplete strokes in handwritten characters.

- issues: By focusing on the matching of stable subgraphs (e.g., specific strokes or intersections), the algorithm can recognize characters even when other parts of the graph are distorted or missing.

5. Approximate Graph Matching

- Overview: Approximate graph matching algorithms seek to find a good-enough match between graphs, rather than an exact match, allowing for some level of deviation between the compared structures.

- Suitability: These algorithms are ideal for scenarios where handwriting introduces non-trivial distortions or when characters are written in a non-standard style. Approximate matching is less rigid and more forgiving of variations.

- issues: In practice, approximate matching can be implemented to allow for minor differences in node positions or edge connections, improving the system’s ability to recognize characters across diverse handwriting styles.

6. Dynamic Time Warping (DTW) for Graphs

- Overview: Dynamic Time Warping is a technique traditionally used for time series alignment, but it can be adapted for graphs by aligning the sequence of nodes or edges based on their structural properties.

- Suitability: DTW is particularly useful for handling variations in the sequence or order of strokes, which is common in handwritten text where the order of drawing may differ from person to person.

- issues: When adapted for graphs, DTW can be used to align character graphs with varying stroke sequences, enabling the recognition system to be robust to such variations.

By employing and refining these graph algorithms, graph-based character recognition systems can be made more robust to variations in handwriting styles and fonts. Each algorithm offers unique strengths in handling different aspects of variability, from inexact matching to structural alignment, thereby contributing to the overall robustness and accuracy of the recognition process. Integrating these algorithms into a comprehensive recognition framework will enable the development of systems capable of reliably recognizing characters across a wide range of stylistic variations.

OUTPUT:

Figure 1.

Figure 2.

Figure 3.

Figure 4.

Figure 5.

Figure 6.

Flow Diagram 1:

6. Challenges with Handwriting and Style Variations

Handwriting introduces significant variability in character appearance due to differences in stroke order, thickness, curvature, and overall shape. These variations often lead to different graph representations for the same character, which can confuse graph matching algorithms. Research by Bunke et al. (2023) highlighted that approximated, while flexible in handling inexact matches, struggled with the high variability found in handwritten characters. Subsequent studies confirmed that the sensitivity of graph-based methods to such variations was a major barrier to their widespread application in handwritten text recognition.

The problem was further compounded when recognizing characters across different font styles, where the same character might be stylized in vastly different ways. Experiments by Liu and colleagues (2024) on font-invariant character recognition demonstrated that traditional graph-based methods required substantial adaptation to handle font variations, as the graph structure could change significantly between different font styles.

Experimental Result and Analysis

The robustness of a recognition system to such variations is crucial for its practical deployment in real-world applications. For instance, in document digitization, the system must accurately recognize characters written in various handwriting styles. Similarly, in mobile handwriting recognition, the system should perform reliably despite the wide range of personal handwriting styles that users may exhibit. Achieving robustness in these scenarios requires the development of graph-based methods that can generalize across different styles and handle the inherent variability in handwriting and fonts.

This research focuses on addressing this challenge by exploring the factors that contribute to the sensitivity of graph-based character recognition to stylistic variations. We aim to enhance the robustness of these systems through the development of novel graph construction and matching techniques that are resilient to changes in style and handwriting. By systematically evaluating these methods across diverse datasets, this study seeks to advance the capabilities of graph-based character recognition, making it a more reliable and versatile characcter for a wide range of Research.

Graph-based character recognition has been extensively studied over the years, particularly for its ability to capture the structural and topological properties of characters. This approach is advantageous in recognizing complex scripts, where characters have intricate shapes and connections, such as in Chinese, Arabic, or stylized fonts. Despite its potential, the robustness of graph-based character recognition to variations in handwriting styles and fonts remains a significant challenge. This section reviews key studies that have addressed these challenges and the strategies developed to enhance the robustness of graph-based recognition systems.

Early Developments in Graph-Based Character Recognition

Graph-based character recognition systems initially focused on representing characters through graph structures where nodes corresponded to critical points like intersections or corners, and edges represented the connections between these points. One of the pioneering approaches was the use of attributed relational graphs (ARGs), where both nodes and edges carry attributes that describe their properties, making it possible to model complex characters with rich structural information. Early studies demonstrated the effectiveness of ARGs in recognizing printed characters and simple handwritten texts but noted that the recognition performance degraded with increased variability in handwriting styles.

7. Table Format: Character Recognition Using Graph-Based Method for Various Character Styles

Table 1. This table provides a structured summary of the graph-based character recognition approach and highlights its research, strengths, and areas for improvement.

Aspect	Details
Methodology	Graph-based recognition using Shi-Tomasi corner detection, adjacency matrices, and Approximated algorithms comparison.
Key Features	- Structural representation of characters as graphs. - Analysis of corner points and spatial relationships.
Advantages	- High recognition accuracy. - Adaptable to various styles, fonts, and handwriting. - Robust against distortions like scaling and rotation.
Techniques Used	- Corner detection (Shi-Tomasi). - Adjacency matrix representation. - Approximated algorithms comparison.
Output Formats	- Recognized text saved as Word and Excel files. - Graph plots for visualization.
Limitations	- Challenges with overlapping or degraded characters. - Computational complexity for large graphs.
Potential Enhancements	- Integration with machine learning for ambiguous cases. - Optimized algorithms for faster processing.
Performance Metrics	- Accuracy: High for diverse styles and fonts. - Time Complexity: Depends on graph size and comparison algorithm.

8. Conclusions

A graph-based technique for identifying characters in various handwriting styles is called graph-based character recognition. This method successfully retains the essence of each character by concentrating on their structural characteristics and portraying them as graphs. This makes it resilient to changes in handwriting and style.

Using the structural properties of the character, a graph is constructed, compared against stored graph representations, and the closest match is found. Future study on a number of topics, including as character recognition, handwriting recognition, and digital preservation of handwritten papers and images, can use this technique.

The graph-based method for character identification is an effective way to analyze and identify characters across various styles and handwriting variations. Using visual assistance such as corner detection, adjacency matrices,1. Improved Recognition Accuracy: Shi-Tomasi corner detection and adjacency matrices offer a dependable framework for identifying personality traits, allowing for accurate identification of characters with stylistic variances.2. Style and Font Independence: This technique may be used with a variety of fonts and handwriting styles. Characters may be easily identified using graph comparison techniques, such as subgraph matching and edge-based similarity tests, irrespective of their distortion, size, or rotation.3. Efficient Feature Representation: Characters structural attributes may be encoded efficiently when they are represented as graphs. This method preserves important spatial correlations between character components while simplifying feature extraction. The method’s capacity to handle big datasets with a variety of character styles is demonstrated by its scalability, which is demonstrated by the adjacency matrix comparison and graph-edit procedures. The efficiency of computing is further enhanced via optimization techniques. This graph-based recognition system may be used in practical research including document digitalization, multilingual text recognition, and graph-based systems. Recognized text is more useful in automated workflows when it can be converted into Word or Excel formats.6. Drawbacks and Enhancements The graph-based approach may have trouble with heavily overlapping or degraded characters, even if it works well in other situations. Adding machine learning methods to support the graph-based analysis for ambiguous circumstances might be one of the future advancements.

When it comes to character identification, the graph-based approach provides a reliable and adaptable solution. Its capacity to handle diverse In conclusion, the graph-based method offers a robust and versatile solution for character recognition. Its ability to process various character styles makes it a valuable tool in modern image and document processing and character recognition.

References

Author: Alexander Mehler Publisher: Springer, 2023 This Paper provides a comprehensive overview of graph-based methods applied to various fields, including character recognition. Graph-Based Methods for Natural Language Processing and Information Retrieval”.
Authors: Marti, U.-V., Bunke, H. Journal: Pattern Recognition, 2022 This paper discusses the application of graph-based approaches to recognize handwritten characters, addressing various challenges and methods used”Graph-Based Handwritten Character Recognition”.
Authors: Lam, L., Lee, S.-W., Suen, C.Y. Journal: Pattern Recognition, 2022. This study compares different skeletonization techniques, which are crucial for node identification in graph-based character recognition”Skeletonization Algorithms for Character Recognition: A Comparative Study”.
Authors: Suen, C. Y., Wang, P., Lee, S. Journal: IEEE Transactions on Systems, Man, and Cybernetics, 202 This paper focuses on node identification issues and the solutions proposed to improve the accuracy of OCR systems”Node Identification in Graph-Based Optical Character Recognition Systems”.
Authors: Felzenszwalb, P. F., Huttenlocher, D. P. Journal: International Journal of Computer Vision, 2024 While focused on image segmentation, this paper provides foundational knowledge on graph-based algorithms applicable to character recognition”An Introduction to Graph-Based Algorithms for Image Segmentation”.
Authors: Rusu, R. B., Blodow, N., Beetz, M. Journal: IEEE International Conference on Robotics and Automation (ICRA), 2023 This paper discusses keypoint detection and feature extraction, relevant to node identification in graph-based character recognition.”Fast Point Feature Histograms (FPFH) for 3D Registration”.
Authors: Umeyama, S. Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023 This seminal paper introduces spectral matching techniques that are instrumental in graph-based recognition methods”Graph Matching for Shape Recognition Using the Spectral Signature of a Graph.
Authors: Zhang, D., Lu, G. Journal: Pattern Recognition, 2022, A comprehensive survey that covers various shape feature extraction methods, including those used in graph-based character recognition, “A Survey of Shape Feature Extraction Techniques”.
IEEE Xplore Digital Library [IEEE Xplore] (https://ieeexplore.ieee.org/) A digital library for accessing research papers on graph-based methods and character recognition.
Google Scholar [Google Scholar] (https://scholar.google.com/) A freely accessible web search engine that indexes the full text or metadata of scholarly literature across an array of publishing formats and disciplines.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.