2. Methods
The SRCAF operates via a step-by-step procedure to detect and assess semantic conflicts across the roles of AI stakeholders. The framework identifies instances wherein users, developers, and lawmakers define important terminologies differently, thereby causing misunderstandings, mistrust, and noncompliance. Unlike traditional approaches that rely on legal definitions, SRCAF focuses on practical language interpretation. It integrates narrative modeling with quantitative measures to capture and compare role-specific meanings. The SRCAF process comprises five steps.
2.1. Role Definition
The analysis begins by identifying key stakeholder roles associated with the AI system, typically including developers, regulators, and end users. Each role interacts with the system differently and interprets terms based on its specific goals, responsibilities, and institutional expectations.
In most AI governance contexts, users, developers, and regulators bring differing priorities and communication norms. For example, a developer may focus on the system efficiency or model accuracy, whereas a regulator may emphasize legal compliance and public trust. By contrast, users may seek clarity and reassurance, particularly when providing sensitive data. These differences shape role-specific interpretations of system-related languages. In some contexts, data analysts or third-party partners may also influence interpretation depending on their access levels and responsibilities.
Accurate role identification is essential for modeling semantic divergence, as the interpretation of key terms often varies based on technical expertise, institutional perspectives, and legal accountability.
2.2. Narrative Construction
For each role, a brief narrative is constructed to describe how that role interprets a key term such as “consent” or “explanation.” These narratives reflect subjective expectations, and can be authored by analysts or derived from interviews, surveys, or user feedback. The objective is to represent the practical understanding of a stakeholder rather than recording legal definitions. For instance, a user narrative may describe “consent” as checking a box, whereas a regulator’s narrative may involve audit trails and documentation. To ensure consistency, each narrative is written in the first-person or neutral observational tone, typically spanning two to four sentences. Narratives may also include imagined objections or expected guarantees, which help extract nuanced keywords.
2.3. Keyword Extraction
From each narrative, three to five keywords representing the role’s understanding were extracted. These can include terms such as “revocable,” “informed,” or “broad use” that reflect expectations or legal meanings. For example, the user statement “I just want this data for myself,” may yield keywords such as “private,” “personal,” or “temporary.” Keywords can be selected manually or generated using basic neural linguistic programming tools such as keyword extractors. If narratives are collected in multiple languages, translation and contextual mapping are performed prior to extraction.
2.4. Overlap Score Calculation
The keyword sets were compared using an Overlap Score based on the Jaccard similarity formula. For the two roles, A and B, the score is calculated as:
where
and
are the keyword sets for each role. A lower score indicates weaker shared understanding. For example, an Overlap Score of zero indicates understanding. of shared keywords, suggesting complete semantic divergence.
This method enables nontechnical stakeholders to interpret divergent results. For example, if a user’s keywords are {private, temporary, anonymous} and the platform’s keywords are {informed, analytics, model training, the score is zero. If a regulator and the platform both mention “informed,” the score increases to 0.167, reflecting minimal overlap.
Although cosine similarity or word embeddings offer more detailed comparisons, they introduce challenges for nontechnical stakeholders and may require legal vetting. To preserve clarity and fidelity to each speaker’s expression, we avoided merging similar terms in this version.
2.5. Conflict Prioritization
Finally, the conflicts are ranked based on their overlapping scores and the risks associated with the terms involved. Terms such as “consent” or “data sharing,” which are legally sensitive, are prioritized when overlap is low. Guidance from frameworks, such as the AI RMF (NIST, 2023), focuses on terms related to privacy, transparency, and fairness.
The SRCAF methodology follows a structured five-step process, as illustrated in Figure 1. This linear workflow captures both qualitative and quantitative elements, enabling the identification of high-risk semantic divergences between roles.
By following these steps, the SRCAF framework offers a clear and practical way to identify when different roles may interpret key system terms in conflicting ways . This early detection enables teams to adjust language, clarify policies, or redesign elements of the system before misunderstandings escalate into user frustration, compliance issues, or regulatory action.
2.6. Role-Based Conflict in a Digital Health Compliance Context
To demonstrate the application of SRCAF, this case study analyzes a simplified scenario involving an AI online health check platform. This platform provides users with automated health risk assessments using a brief self-assessment form. Key stakeholders include the user who completes the form, the platform that collects and processes the data, and the regulator responsible for ensuring that consent mechanisms and communications meet health data privacy standards. Each role introduces a distinct perspective that influences system operation and understanding.
We focused on one critical term in this context: “data sharing.” All three roles have different interpretations of this term.
Users expect the data to be used temporarily and exclusively for personal feedback. Their typical understanding of “data sharing” includes terms such as:
The platform may interpret data sharing as the ability to store and use responses to improve algorithms or external analytic partnerships. The associated keywords were as follows:
A regulator expects sharing to be clearly disclosed and purpose-limited and provides informed consent. The sample keywords were as follows:
Using the Overlap Score, we calculate the Jaccard similarity between each pair of roles:
These results indicate that the user is completely misaligned with both the platform and regulator regarding the term “data sharing.” Although the platform and regulator share a small degree of overlap (0.167), this is not sufficient to ensure a system-wide semantic alignment. The complete disconnection of a user signals a high-risk conflict, particularly in terms of user trust and perceived fairness. To address this issue, the platform language must be redesigned to bridge both regulatory requirements and user expectations. Aligning with solely legal terms is insufficient if the user’s interpretation is fundamentally different.
To make the semantic distances easier to understand, Figure 2 shows a heatmap of Jaccard Overlap Scores between the keyword sets of each role. The matrix shows that the User and Platform roles share moderate similarity (0.33), while the User–Regulator and Platform–Regulator pairs have much lower overlap (0.14 and 0.11, respectively). These lighter cells indicate a higher risk of misunderstanding. In particular, the regulator’s expectations are not well aligned with either the user’s or the platform’s interpretation, which may lead to compliance problems if not addressed. This heatmap serves as a practical tool to identify which roles are most semantically disconnected and helps prioritize terms that require clarification or redesign before deployment. The values in Figure 2 were generated using a broader keyword extraction pipeline in Python, which includes additional terms. The manual scores shown here use a controlled keyword set of size 6 for transparency. This may result in slight numerical differences (e.g., 0.167 vs. 0.11 for Platform–Regulator).