Research on a General State Formalization Method from the Perspective of Logic

Siyuan Qiu; Jianfeng Xu

doi:10.20944/preprints202508.0939.v1

Submitted:

12 August 2025

Posted:

13 August 2025

You are already at the latest version

Abstract

As the world rapidly develops, information, as a vital resource, remains a subject of debate, with its definition and nature still being debated. To address this issue, Objective Information Theory proposes a set of axioms that rigorously define information. This paper aims to construct a formal system of mathematical logic using first-order and higher-order logic. Using well-formed formulas, it formalizes states and demonstrates that nearly all structures and states in various fields can be expressed. Finally, this paper proposes a universal state representation method, which improves the definition of state in Objective Information Theory and builds a bridge for the exchange and research of states across various fields.

Keywords:

objective information theory

;

logical systems

;

states

;

formal methods

Subject:

Computer Science and Mathematics - Logic

1. Introduction

Matter, energy and information are the three basic elements of nature.[1] In the current era of rapid development of information technology and artificial intelligence, information, as a bridge connecting the material world and the cognitive world, is becoming more and more important than ever. However, although people are talking about the “information revolution” with great interest and a large number of scholars are constantly devoting themselves to the research of informatics, the progress of information theory is not optimistic. The question of the nature of information has always troubled us[2,3,4].The concept of informatics, which takes information as its object, was proposed as early as the early 1990s, but has made little progress in the past 30 years and has failed to have a significant impact on the development of information technology and systems[5].The reasons for this are: first, there is a lack of a rigorous, universal definition of information; second, the phenomena encompassed by “information” are incredibly diverse, making it difficult to arrive at a comprehensive definition and establish a unified information theory; third, using only information quantity and information entropy as information metrics is too simplistic and ill-suited to the rich functionality of existing information systems; and fourth, current “information theory” is essentially still “communication coding theory.” While it covers several important aspects of information encoding, transmission, and storage, it largely ignores aspects of information understanding and processing. Therefore, the development of a comprehensive theoretical framework for informatics is imperative.

Starting from Wiener’s famous statement that “information is information, not matter, not energy”, Chinese scholar Xu Jianfeng regards information as the third basic category in the objective world, alongside matter and energy. He established a set of basic information postulates, mathematical definitions, a six-tuple model, five properties and 11 types of measurement systems based on the “reflective information view”, forming the theoretical system of Objective Information Theory (OIT). He proved that several previously unrelated classical information principles and formulas can be uniformly expressed through OIT[6,7,8].On this basis, the dynamic configuration and metric efficiency distribution of information systems were established, forming a theoretical framework for information system dynamics. In short, objective information theory, based on people’s common understanding of information, defines the information ontology set o, the ontology occurrence time set

T_{h}

, the ontology state set

S_{o} (o, T_{h})

, the objective carrier set c, the carrier reflection time set

T_{m}

, and the carrier reflection set

S_{c} (c, T_{m})

. Information I is the enabling mapping from the ontology state set

S_{o} (o, T_{h})

to the carrier reflection set

S_{c} (c, T_{m})

.

However, objective information theory lacks a rigorous mathematical definition of state, and its theoretical foundation remains incomplete. To address this issue, this paper aims to integrate the research findings of classical information theory, supplement the definition and model of information, and construct a formal representation of the object state, further clarifying its composition. Starting from the perspective of mathematical logic, this paper rigorously defines first-order and higher-order predicate logic systems, connects state with the interpretation of sets of well-formed formulas, and develops a comprehensive formal representation of state. This approach fills the existing theoretical gaps in OIT theory and further enhances research on the nature of information. Furthermore, from the perspectives of mathematics, economics, sociology, computer science, and natural linguistics, this paper abstracts typical phenomena from various disciplines, providing a universal symbolic and formal representation of the state within them.

Leveraging this universal formalization, first-order and higher-order logic become a bridge connecting the various fields. This reveals a profound unity in the development of modern science and technology: while different disciplines superficially deal with vastly different objects and phenomena, at a deeper level, they can all be uniformly expressed and mutually transformed through the language of logic.

Figure 1. The logical system has become a bridge for communication between various fields and is the most universal language.

2. Formal Expression of State

The information postulate emphasizes that the state of the ontology can be mapped to the state of the carrier. Therefore, establishing a formal representation of the ontology and carrier states is particularly important. Mathematical logic, based on axiomatic systems and symbolic language, enables rigorous and unambiguous characterization of concepts, propositions, and their reasoning. It avoids the ambiguity and uncertainty of natural language expressions and provides a precise descriptive tool for scientific research. Professor Xu Jianfeng of Shanghai Jiao Tong University provided me with a reference for this approach, and based on his work, I conducted further research and expansion. Let’s first limit the object language to first-order languages.

2.1. First-Order Formal System Definition

First-order predicate logic language has become a core tool for formal modeling and automatic reasoning in fields such as mathematics, information science, and artificial intelligence due to its strong expressiveness, clear structure, rigorous reasoning, good computability, and strong versatility[9].First, we define the most basic symbols in the first-order formal system:

Definition 1

(Symbols in

L^{(1)}

).

L^{(1)}

contains the following symbols:

First-order variables: $x_{1}^{(1)}, x_{2}^{(1)}, \dots$
First-order constants: $a_{1}^{(1)}, a_{2}^{(1)}, \dots$
First-order function symbols: $f_{1}^{(1) 1}, f_{2}^{(1) 1}, \dots, f_{1}^{(1) 2}, f_{2}^{(1) 2}, \dots$
brackets:(, )
First-order predicate symbols: $A_{1}^{(1) 1}, A_{2}^{(1) 1}, \dots, A_{1}^{(1) 2}, A_{2}^{(1) 2}, \dots$
Logical connectives:∼ (Negation),→ (Implication)
Quantifiers:∀ (Universal quantifier)

Based on logical theory, symbols such as conjunction ∧, disjunction ∨, if and only if ↔, and existence ∃ can be naturally introduced. In particular, some common mathematical symbols, such as

= a n d \in

, are also considered universal predicate symbols and will not be discussed further.

Terms in a language are similar to nouns or noun phrases in a natural language, but terms and nouns (phrases) are not exactly the same. The main difference is that terms contain variables and are “compound” items constructed using variables.

Definition 2

(Terms in

L^{(1)}

). The terms in

L^{(1)}

are generated as follows:

(1): Variables and constants are terms:
(2): If $f_{i}^{(1) n}$ ( $n > 0, i > 0$ ) is a function symbol in $L^{(1)}$ and $u_{1}, \dots, u_{n}$ is a term in $L^{(1)}$ , then $f_{i}^{(1) n} (u_{1}, \dots, u_{n})$ is also a term in $L^{(1)}$ .

Next, we define atomic formulas. Atomic formulas are the most basic formulas in the language.

Definition 3

(Atomic formulas in

L^{(1)}

). If

A_{i}^{(1) n}

(

n > 0, i > 0

) is a predicate symbol in

L^{(1)}

and

u_{1}, \dots, u_{n}

is a term in

L^{(1)}

, then

A_{i}^{(1) n} (u_{1}, \dots, u_{n})

is an atomic formula in

L^{(1)}

.

Definition 4

(Well-formed formulas in

L^{(1)}

). The well-formed formula in

L^{(1)}

is defined as follows:

(1): Each atomic formula is a well-formed formula in $L^{(1)}$ ;
(2): If $A$ and $B$ are well-formed formulas in $L^{(1)}$ , then $\sim A$ and $A \to B$ are both well-formed formulas in $L^{(1)}$ ;
(3): If $A$ is a well-formed formula in $L^{(1)}$ and u is a variable or function symbol in $L^{(1)}$ , then $(\forall u) A$ is a well-formed formula in $L^{(1)}$ .

2.2. Recursive Definition of Higher-Order Formal Systems

Considering that first-order logic still has many shortcomings in terms of quantified objects, recursive induction, and other issues, such as its inability to fully express the concept of a set and its lack of direct characterization of higher-order properties, we need to expand the characterization capabilities of logical systems.Higher-order predicate logic (HOL) is an extension of first-order predicate logic. It allows quantification over predicates, functions, and even predicates about predicates. It offers greater expressive power, can formalize complex semantics in natural language and mathematics, and supports a richer set of logical tools and theoretical frameworks. Next, we will define higher-order formal systems.

Assume that for

1, 2, \dots, k - 1

, the corresponding formal systems

L^{(1)}, L^{(2)}, \dots, L^{(k - 1)}

have been defined. Then the symbols in the k-order formal system

L^{(k)}

can be recursively defined[10].

Definition 5

(Symbols in

L^{(k)}

). The symbols in

L^{(k)}

include:

All symbols of $L^{(k - 1)}$ ;
k-order variables: $x_{1}^{(k)}, x_{2}^{(k)}, \dots$
k-order constants: $a_{1}^{(k)}, a_{2}^{(k)}, \dots$
k-order predicate variables: $P_{1}^{(k) 1}, P_{2}^{(k) 1}, \dots, P_{1}^{(k) 2}, P_{2}^{(k) 2}, \dots$
k order function symbols: $f_{1}^{(k) 1}, f_{2}^{(k) 1}, \dots, f_{1}^{(k) 2}, f_{2}^{(k) 2}, \dots$
k-order predicate symbols: $A_{1}^{(k) 1}, A_{2}^{(k) 1}, \dots, A_{1}^{(k) 2}, A_{2}^{(k) 2}, \dots$

Similarly, we can recursively define the terms in

L^{(k)}

, atomic formulas and well-formed formulas.

Definition 6

(Terms in

L^{(k)}

). The terms in

L^{(k)}

are defined as follows:

(1): All terms of $L^{(k - 1)}$ ;
(2): If $f_{i}^{(k) n}$ ( $n > 0, i > 0$ ) is a k-order function symbol in $L^{(k)}$ and $u_{1}, \dots, u_{n}$ are variables, constants, or functions in $L^{(k)}$ , then $f_{i}^{(k) n} (u_{1}, \dots, u_{n})$ is a k-order term in $L^{(k)}$ .

Definition 7

(Atomic formulas in

L^{(k)}

). The atomic formula in

L^{(k)}

is defined as follows:

(1): All atomic formulas of $L^{(k - 1)}$ ;
(2): If $A_{i}^{(k) n}$ ( $n > 0, i > 0$ ) is a predicate symbol of order k in $L^{(k)}$ and $u_{1}, \dots, u_{n}$ are terms in $L^{(k)}$ , then $A_{i}^{(k) n} (u_{1}, \dots, u_{n})$ is an atomic formula of order k in $L^{(k)}$ .

Definition 8

(Well-formed formula in

L^{(k)}

). The well-formed formula in

L^{(k)}

is defined as follows:

(1): All well-formed formulas for $L^{(k - 1)}$ ;
(2): If $A$ and $B$ are well-formed formulas in $L^{(k)}$ , then $\sim A$ and $A \to B$ are both well-formed formulas in $L^{(k)}$ ;
(3): If $A$ is a well-formed formula in $L^{(k)}$ and u is an argument or function symbol in $L^{(k)}$ , then $(\forall u) A$ is a well-formed formula in $L^{(k)}$ .

2.3. Interpretation of Formal Systems

Next, we can define the interpretation of the formal system.

Definition 9

(Interpretation of formal systems). An interpretation E of the formal system

L

is a two-tuple

E = 〈 D_{E}, J 〉

. Where:

The domain $D_{E}$ is a non-empty set that contains the value range of all elements in $L$ , including individuals, properties, relations, and functions.
The interpretation function J is a mapping that maps symbols in $L$ to concrete semantics in the domain $D_{E}$ and is defined as follows:

–

Interpretation of constants and variables: Each constant $a_{i}^{(k)}$ is interpreted as an element in $D_{E}$ , i.e., $J (a_{i}^{(k)}) \in D_{E}$ ; each variable $x_{i}^{(k)}$ is interpreted as an element in $D_{E}$ , i.e., $J (x_{i}^{(k)}) \in D_{E}$ .

–

Interpretation of function symbols: Each function symbol $f_{i}^{(k) n}$ is interpreted as a mapping from $D_{E}^{n}$ to $D_{E}$ , that is, $J (f_{i}^{(k) n}) : D_{E}^{n} \to D_{E}$ .

–

Interpretation of predicate symbols: Each predicate symbol $A_{i}^{(k) n}$ is interpreted as a mapping from $D_{E}^{n}$ to ${True, False}$ , i.e., $J (A_{i}^{(k) n}) : D_{E}^{n} \to {True, False}$ .

–

Interpretation of the terms:

$J (u) = \{\begin{matrix} J (a_{i}^{(k)}), & if u is constant a_{i}^{(k)} \\ J (x_{i}^{(k)}), & if u is variable x_{i}^{(k)} \\ J (f_{i}^{(k) n}) (J (u_{1}), \dots, J (u_{n})), & if u = f_{i}^{(k) n} (u_{1}, \dots, u_{n}) \end{matrix}$

–

Interpretation of atomic formula:

$if A = A_{i}^{(k) n} (u_{1}, \dots, u_{n}), then J (A) = J (A_{i}^{(k) n}) (J (u_{1}), \dots, J (u_{n})) .$

–

Interpretation of logical connectives:

$J (\sim A) = True \Leftrightarrow J (A) = False$

$J (A \to B) = True \Leftrightarrow (J (A) \to J (B)) = True$

–

Interpretation of quantifiers: If $A = (\forall u) B$ , where u is a variable or function symbol, then

$J (A) = True \Leftrightarrow \forall d \in D_{E}, when u is interpreted as d, J (B) = True$

The interpretation of formal systems transforms the abstract syntax of formal logic into concrete semantics, serving as a crucial link between the “symbolic world” and the “real world.” It not only renders logical language meaningful but also provides a theoretical foundation for correct reasoning, modeling, verification, and automated applications. It is an essential concept in mathematical logic and information science.

2.4. Axiom System for Logical Expression of Ontology Components Under State Decomposition

Let X denote a set of objects, T denote a set of time points or intervals, and L denote a higher-order formal system. We propose the following four axioms[11].

Parameter Reference Axiom : Every object $x \in X$ , every moment or period $t \in T$ , and every function $f \in F$ is represented by a unique constant or term $c_{x}, c_{t}, c_{f}$ in L.
Property Expressibility Axiom : The properties, form, value, relationship and other attributes of a set of objects in the entire domain can be expressed through functions and predicates in the formal system.
Logical Combination and the Closure Axiom

The generation rules of the state space $S$ are limited to the following logical operations:
- Implication:If $S_{1}, S_{2} \in S$ ,then it implies that $S_{1} \to S_{2}$ is also a state of $S$ ;
- Negation:If $S \in S$ , then $\neg S \in S$ ;
- Quantification: For any state predicate $S (x)$ , $\forall x S (x)$ and $\exists x S (x)$ also belong to $S$ .
In other words, $S$ is closed with respect to the above logical operations and only allows new states to be generated through a finite number of implications, negations, and quantifications.
Axiom of temporal causality:When any attribute, relationship, or state is established at a certain moment, its change or evolution at subsequent moments can be described by the formula in L.

Theorem 1.

If for any

x \in X, t \in T

, there exists at least one attribute, relation, or property that can be expressed by L, and satisfies the axioms of parameter reference, attribute expressibility, logical combination closure, and temporal causality, then all states of any object x at any time t can be uniquely characterized by a set of well-formed formulas

φ_{S (x, t)}

in L.

Proof of Theorem1.

According to the parameter reference axiom, the object x, time t, and the function f involved in the set can all be represented by unique terms

c_{x}, c_{t}, c_{f}

in L.

Next, according to the property expressibility axiom, the various properties of the state set

S (x, t)

can be expressed using functions and predicates in the formal system.

By definition, all expressions in the above state sets are atomic formulas in L. By logical combination and the closure axiom, any complex ontological state G can be recursively constructed from the base state by applying a finite number of generation rules expressible in L. Each generation rule is uniquely described by a well-formed formula and inference rule in L. Therefore, for any object x and time t, all its ontological states

S (x, t)

can be uniquely mapped and characterized in L by the corresponding set of formulas

φ_{S (x, t)}

, where

φ_{S (x, t)}

is recursively generated from the atomic formulas using logical rules.

In terms of uniqueness, the construction of

S (x, t)

depends solely on x, t, and the set of ontological components. The representation of all predicates, functions, and parameters in L is uniquely determined by the axiomatic system. Therefore,

φ_{S (x, t)}

uniquely corresponds to

S (x, t)

within L. If

φ^{'}, φ^{''} \in L

both characterize

S (x, t)

, then by the logical equivalence relation in L,

φ^{'} \equiv φ_{S (x, t)} \equiv φ^{''}

, guaranteeing uniqueness.

Furthermore, the temporal causality axiom states that any time t can be expressed by a recursive or evolutionary formula in L. Specifically, for any

x \in X

, there exists a formula

ψ (x, t^{'}, x, t)

in L such that

S (x, t^{'})

is uniquely determined by

S (x, t)

and related laws. Thus, any dynamic evolution of a system can be recursively expressed by a chain of well-formed formulas in L, and the history and future of its state can be reduced to the logical deduction of a set of formulas.

In summary,

S (x, t)

can always be rigorously characterized by a unique set of well-formulated formulas in L under interpretation, and this expression holds true for any dynamic evolution. The theorem is proved. □

2.5. The State of an Object at a Specific Time

Then, according to the theorem, we give the definition of state:

Definition 10

(state). The state

S (x, t)

of a set of objects x at a particular time set t is an interpretation of a set of well-formed formulas in the formal system

L

on the universe

x \times t

. The specific properties of x and t, as well as the choice of formula set and the definition of the interpretation, are determined by the specific application scenario.

Theorem 1 and the definition of state demonstrate that state can be well described using first-order and higher-order logic. The formal representation of state is not only a technical tool but also a fundamental way for humans to understand and transform the world. It transforms intuitive concepts into precise mathematical objects, enabling rigorous reasoning and systematic analysis. As science and technology become increasingly complex, this formalization capability will continue to be a vital force driving the development of informatics and the progress of human civilization.

3. Mathematical Field State Expression

Mathematics, as a fundamental discipline, encompasses a wide range of branches and a vast system. Fundamental fields such as number theory focus on the properties of integers; algebra encompasses linear algebra (vector spaces and matrices), abstract algebra (groups, rings, and fields), and polynomial theory; and geometry includes Euclidean geometry and differential geometry (manifolds and curvature). Applied mathematics encompasses topology, probability theory and statistics (such as stochastic processes, Bayesian statistics, and the foundations of machine learning), and computational mathematics (numerical analysis, algorithm design, and scientific computing). Furthermore, new problems continue to emerge in discrete mathematics, mathematical physics, logic, and set theory.

From elementary arithmetic to cutting-edge research, mathematics demonstrates a progression in depth and abstraction, with numerous fields intersecting and integrating. Theorems, propositions, and formulas within each branch can be viewed as characterizing the “state” of certain mathematical objects. Broadly speaking, the state of mathematical objects is a core concept for understanding the dynamics, contextual dependence, and inherent connections of mathematical structures. Studying mathematical states not only helps focus on key properties and ignore minor details, but also helps grasp the essence of a problem, forming a crucial foundation for the development of mathematical theory.

3.1. Formalization of Finite Mathematical Structures

Finite structures are the foundation of discrete mathematics. Problems such as finite sets and their subsets, and the connectivity, coloring, and matching of graphs in finite graph theory are all inseparable from the study of finite structures. In addition, many “infinite” mathematical concepts originate from the generalization of finite structures[12,13,14].

Finite mathematical structures are not only an essential component of mathematics but also fundamental tools for understanding the complex world, solving practical problems, and advancing science and technology. In a sense, finite structures are “real mathematics”—both theoretically elegant and practically useful. Here, we provide a rigorous proof that finite mathematical structures can be formalized in a first-order manner.

Definition 11

(Finite structure). Let

A = 〈 A, R_{1}^{A}, \dots, R_{m}^{A}, f_{1}^{A}, \dots, f_{n}^{A}, c_{1}^{A}, \dots, c_{k}^{A} 〉

be a finite structure, where: A is a finite set,

| A | = N

,

R_{i}^{A} \subseteq A^{a_{i}}

is an

a_{i}

meta-relation,

f_{j}^{A} : A^{b_{j}} \to A

is a

b_{j}

meta-function, and

c_{l}^{A} \in A

It’s constant.

Theorem 2

(First-order complete characterization of finite structures). If

A

is a finite structure, then there exists a first-order language L and a set of L-sentences Γ such that for any L-structure

B

:

B ⊧ Γ if and only if B ≅ A

(1)

Proof of Theorem2.

The definition L includes:

Relation symbols: $R_{1}, \dots, R_{m}$ (with arity $a_{1}, \dots, a_{m}$ respectively)
Function symbols: $f_{1}, \dots, f_{n}$ (with arity $b_{1}, \dots, b_{n}$ respectively)
Constant symbols: $c_{1}, \dots, c_{k}$
Individual constants: $d_{1}, \dots, d_{N}$ (corresponding to each element in A)

Let

A = {α_{1}, α_{2}, \dots, α_{N}}

, construct the following statement:

( $Γ_{1}$ ) domain restriction statement:

\forall x (x = d_{1} \lor x = d_{2} \lor \dots \lor x = d_{N})

(2)

( $Γ_{2}$ ) element-wise distinction statement:

d_{i} \neq d_{j} (for all 1 \leq i < j \leq N)

(3)

( $Γ_{3}$ ) relation characterization statement:

For each relation symbol

R_{i}

and each tuple

(α_{j_{1}}, \dots, α_{j_{a_{i}}}) \in A^{a_{i}}

:

\{\begin{matrix} R_{i} (d_{j_{1}}, \dots, d_{j_{a_{i}}}) & If (α_{j_{1}}, \dots, α_{j_{a_{i}}}) \in R_{i}^{A} \\ \neg R_{i} (d_{j_{1}}, \dots, d_{j_{a_{i}}}) & If (α_{j_{1}}, \dots, α_{j_{a_{i}}}) \notin R_{i}^{A} \end{matrix}

(4)

( $Γ_{4}$ ) function characterization statement :

For each function symbol

f_{j}

and each tuple

(α_{k_{1}}, \dots, α_{k_{b_{j}}}) \in A^{b_{j}}

:

f_{j} (d_{k_{1}}, \dots, d_{k_{b_{j}}}) = d_{l}

(5)

Where l satisfies

f_{j}^{A} (α_{k_{1}}, \dots, α_{k_{b_{j}}}) = α_{l}

( $Γ_{5}$ ) Constant Characterization Statement:

c_{i} = d_{j} where j satisfies c_{i}^{A} = α_{j}

(6)

We define

Γ = {Γ_{1}, Γ_{2}, Γ_{3}, Γ_{4}, Γ_{5}}

. Next, we prove two lemmas.

Lemma 1.

If

B ⊧ Γ

, then

| B | = N

.

Proof of Lemma1.

By (

Γ_{1}

),

\forall x \in B, \exists i \in {1, \dots, N}, x = d_{i}^{B}

, so

| B | \leq N

. By (

Γ_{2}

),

d_{i}^{B} \neq d_{j}^{B}

for all

i \neq j

, so

| B | \geq N

. Therefore,

| B | = N

. □

Lemma 2.

If

B ⊧ Γ

, then the map

h : A \to B

defined as

h (α_{i}) = d_{i}^{B}

is a bijection.

Proof of Lemma2.

This follows directly from Lemma 1 and (

Γ_{2}

). □

Next we can prove the consequence of the Theorem2:

(⇒) If $B ≅ A$ , then $B ⊧ Γ$ :

Let

g : A \to B

be an isomorphic mapping. Definition of

B

:

$d_{i}^{B} = g (α_{i})$
$R_{i}^{B}, f_{j}^{B}, c_{l}^{B}$ are defined by isomorphic correspondences.

By the definition of isomorphism,

B

satisfies all statements in

Γ

.

(⇐) If $B ⊧ Γ$ , then $B ≅ A$ :

By Lemma 2,

h : A \to B

is defined as

h (α_{i}) = d_{i}^{B}

, which is a bijection.

Verify that h maintains the relationship:

For any

(α_{j_{1}}, \dots, α_{j_{a_{i}}}) \in A^{a_{i}}

:

\begin{matrix} (α_{j_{1}}, \dots, α_{j_{a_{i}}}) \in R_{i}^{A} \\ \Leftrightarrow B ⊧ R_{i} (d_{j_{1}}, \dots, d_{j_{a_{i}}}) (by (Γ_{3})) \\ \Leftrightarrow (d_{j_{1}}^{B}, \dots, d_{j_{a_{i}}}^{B}) \in R_{i}^{B} \\ \Leftrightarrow (h (α_{j_{1}}), \dots, h (α_{j_{a_{i}}})) \in R_{i}^{B} \end{matrix}

(7)

Verify that h holds:

For any

(α_{k_{1}}, \dots, α_{k_{b_{j}}}) \in A^{b_{j}}

, let

f_{j}^{A} (α_{k_{1}}, \dots, α_{k_{b_{j}}}) = α_{l}

From (

Γ_{4}

):

\begin{matrix} B ⊧ f_{j} (d_{k_{1}}, \dots, d_{k_{b_{j}}}) = d_{l} \\ \Leftrightarrow f_{j}^{B} (d_{k_{1}}^{B}, \dots, d_{k_{b_{j}}}^{B}) = d_{l}^{B} \\ \Leftrightarrow f_{j}^{B} (h (α_{k_{1}}), \dots, h (α_{k_{b_{j}}})) = h (α_{l}) \\ \Leftrightarrow f_{j}^{B} (h (α_{k_{1}}), \dots, h (α_{k_{b_{j}}})) = h (f_{j}^{A} (α_{k_{1}}, \dots, α_{k_{b_{j}}})) \end{matrix}

(8)

Verify that h remains constant:This is directly derived from (

Γ_{5}

).

Thus, h is an isomorphism,

B ≅ A

.

We can then draw the following inferences

Corollary 1

(Uniqueness). The set of axioms Γ uniquely determines

A

in the sense of logical equivalence.

Corollary 2

(Completeness). For any first-order property φ on

A

, either

Γ ⊧ φ

or

Γ ⊧ \neg φ

.

Proof of Corollary2.

Let

φ

be any first-order sentence. Since

φ

is a sentence, either

A ⊧ φ

or

A ⊧ \neg φ

must hold.

If

A ⊧ φ

, then by Theorem 1, any structure

B

satisfying

Γ

is isomorphic to

A

, so

B ⊧ φ

. Therefore,

Γ ⊧ φ

.

If

A ⊧ \neg φ

, then similarly,

Γ ⊧ \neg φ

. □

Corollary 3

(Decidability). The set

{φ : Γ ⊧ φ}

is decidable.

Proof of Corollary2.

By Corollary 2, for any sentence

φ

, we can directly verify that

A ⊧ φ

on the finite structure

A

. If true, then

Γ ⊧ φ

; otherwise,

Γ ⊧ \neg φ

. □

3.2. Previous Research on the Formalization of Infinite Structures

Naturally, we will wonder whether all infinite structures, except finite ones, can be completely characterized by a set of first-order logic formulas.

Generally speaking, the answer is no. In fact, according to the research results of Skolem et al., even countable structures cannot be guaranteed to be fully described by first-order logic[15].

Theorem 3

(Skolem). The standard natural numbers are countable structures that cannot be characterized by first-order theoretical categories.

We will not give a detailed proof here. The main idea of the proof is to introduce infinite elements through extension theory. Then, we use the compactness theorem to derive a non-standard model and conclude.

Of course, if we expand the tools from first-order logic to higher-order logic, we can expand the characterization capabilities of the logical language[16,17]:

Theorem 4

(Peano). Under standard second-order semantics, the second-order Peano axioms categorically characterize the structure of natural numbers. That is, if quantification over set variables is allowed, then the sequence of natural numbers can be uniquely characterized by second-order logic.

This is truly a profound mathematical result, rigorously proven. It not only solves the problem of characterizing natural numbers but also reveals a fundamental property of the expressive power of logical systems—higher-order logic possesses greater expressiveness than first-order logic.

However, despite its greater expressiveness, higher-order logic still cannot represent all countable structures. The boundaries of the logical structures that higher-order logic can represent remain unresolved. This result reflects the fundamental tension between computability and logical expressiveness. Even the most powerful logical systems cannot fully “tame” the complexity of infinite structures. This perhaps reveals a certain irreducible complexity of mathematical reality.

At present, the problem of expressing mathematical structures still depends on the work done by Scott in 1965[18].

Scott first introduced the concept of infinite logic:

Definition 12

(infinite logic

L_{ω_{1} ω}

). The language

L_{ω_{1} ω}

is defined by the following rules:

1.: Contains all atomic formulas of first-order logic
2.: If ${ϕ_{i} : i \in I}$ is a set of formulas and $| I | \leq ℵ_{0}$ , then $⋀_{i \in I} ϕ_{i}$ and $⋁_{i \in I} ϕ_{i}$ are also formulas.
3.: If ϕ is a formula and x is a variable, then $\exists x ϕ$ and $\forall x ϕ$ are formulas.
4.: Every formula contains only a finite number of free variables.

Furthermore, he proposed the crucial isomorphism theorem in the article.

Theorem 5

(Scott’s isomorphism theorem, 1965). Let

A

be a countable structure and

L

be a countable language. Then there exists a

L_{ω_{1} ω}

statement

ϕ_{A}

(called a Scott statement of

A

) such that:

For any structure

B

,

B ⊧ ϕ_{A} \Leftrightarrow B ≅ A

(9)

That is,

ϕ_{A}

completely characterizes the structure

A

in an isomorphic sense.

Scott’s isomorphism theorem is more than just a technical result; it reveals that infinitely long formulas are a natural tool for dealing with infinite structures, and that abstract existence can be transformed into concrete constructions.

Scott’s isomorphism theorem not only solves a specific mathematical problem but, more importantly, opens up a whole new research paradigm, influencing multiple branches of mathematics and still guiding development in related fields today. This makes it one of the most important achievements in mathematical logic of the 20th century.

3.3. Formalization of Conditional Infinite Structures

To address the problem that infinite structures are difficult to characterize using logic, we present and prove a slightly weaker but still highly universal theorem. First, we provide several definitions.

Definition 13

(Relationship maintenance). Let

M = (M, R_{1}, R_{2}, \dots, R_{k})

be a structure where

R_{i}

is a

n_{i}

-ary relation. Let

{M_{j} = (M_{j}, R_{1}^{j}, R_{2}^{j}, \dots, R_{k}^{j})}_{j \in N}

be an approximate sequence.

Relationship maintenance means:

\forall i \in {1, 2, \dots, k}, \forall j \in N : R_{i}^{j} = R_{i} ↾ M_{j}^{n_{i}}

(10)

then

R_{i}^{j} = {(a_{1}, \dots, a_{n_{i}}) \in M_{j}^{n_{i}} : (a_{1}, \dots, a_{n_{i}}) \in R_{i}}

(11)

Definition 14

(A precise definition of recursive approximation). The structure

M = (M, R_{1}, \dots, R_{k})

satisfies recursive approximation if and only if:

There exists a sequence

{M_{n}}_{n \in N}

where every

M_{n} = (M_{n}, R_{1}^{n}, \dots, R_{k}^{n})

satisfies:

Monotonicity: $M_{n} \subseteq M_{n + 1} \subseteq M$
Countability: $| M_{n} | = ℵ_{0}$ for all n
Recursion: There exists a recursive function that computes the Scott statement for each $M_{n}$
Density: $\bar{⋃_{n} M_{n}} = M$ (in appropriate topology)
Relationship maintenance: $R_{i}^{n} = R_{i} ↾ M_{n}^{a r (R_{i})}$ For all $i, n$ , where $a r (R_{i})$ denotes the number of elements of the relation $R_{i}$
Asymptotic uniqueness: Any two sequences are isomorphic to themselves or to each other after adding a finite number of elements from M.

Definition 15

(Topology of Scott’s statements space). We define a topology on the space of Scott statements, which makes the convergence exact. We assume that the space is complete under this topology.

Define

S

as the space of all Scott statements. For

ϕ, ψ \in S

, define the distance:

d (ϕ, ψ) = \sum_{k = 1}^{\infty} 2^{- k} \cdot d_{k} (ϕ, ψ)

(12)

Where:

d_{k} (ϕ, ψ) = \frac{| {Tp}_{k} (ϕ) ▵ {Tp}_{k} (ψ) |}{| {Tp}_{k} (ϕ) \cup {Tp}_{k} (ψ) |}

(13)

Here

{Tp}_{k} (ϕ)

denotes the set of k-types that occur in a structure satisfying ϕ, and

{Tp}_{k} (M)

contains the “complete description” of all possible k-tuples in the structure

M

.

Definition 16

(Local finiteness).

\forall k, \forall l \in N, | {Tp}_{k} (ϕ_{l}) | = ℵ_{0}

(14)

and

\forall k, \forall l \in N, | {Tp}_{k} (ϕ_{l + 1}) - {Tp}_{k} (ϕ_{l}) | < \infty

(15)

Theorem 6

(Higher-order characterization theorems for recursive approximate structures). Suppose

M, N

is an uncountable structure that satisfies recursive approximation and local finiteness. Then there exists a higher-order logic theory

T_{M}

such that:

\forall N (N ⊧ T_{M} \Leftrightarrow N ≅ M)

(16)

Here, we assume that higher-order logic can be infinitely quantified, that is, it satisfies the properties of infinite logic. We prove this conclusion step by step. First, we prove that the recursive approximation sequence is inherently unique.

Lemma 3

(Normality of approximate sequences). Assume

M

satisfies recursive approximation, and

{M_{n}}

and

{M_{n}^{'}}

are two approximate sequences that satisfy the condition. Then there exists an increasing function

h : N \to N

such that:

M_{n} ≅ M_{h (n)}^{'} for all n

(17)

Proof of Lemma3.

By density, monotonicity, and asymptotic uniqueness, for any

M_{n}

, there exists a sufficiently large m such that

M_{n}

can be embedded in

M_{m}^{'}

.

Vice versa. Combined with countability, we obtain an isomorphism. □

Next we define limit operations and related concepts.

Definition 17

(Limits of structural sequences). Let

{M_{n}}

be an increasing countable sequence of structures. Definition:

lim_{n \to \infty} M_{n} = (⋃_{n} M_{n}, ⋃_{n} R_{1}^{n}, \dots, ⋃_{n} R_{k}^{n})

(18)

If the union of every relation is well-defined in the limit.

Lemma 4

(Existence and uniqueness of limits). If

{M_{n}}

satisfies the conditions for recursive approximation, then the limit exists and is isomorphic to the original structure

M

.

Proof of Lemma4.

Existence: By monotonicity,

⋃_{n} M_{n}

is well-defined. The union of relations is well-defined by the relation-preserving property.

Uniqueness: By density,

⋃_{n} M_{n}

is dense in M. If

M

has appropriate continuity (implied by the recursive approximation), then it is uniquely determined by the dense substructure. □

Next, we need to verify the convergence of Scott’s sequence of statements.

Theorem 7

(Convergence Theorem). Under the recursive approximation, if the local finiteness condition is additionally satisfied, then the Scott statement sequence

{ϕ_{n}}

converges to

ϕ_{\infty}

in the defined topology.

Proof of Theorem7.

For fixed k, consider the sequence:

{Tp}_{k} (ϕ_{1}) \subseteq {Tp}_{k} (ϕ_{2}) \subseteq \dots \subseteq {Tp}_{k} (ϕ_{k_{m}})

(19)

For any

ϵ

, we can choose K so that

\sum_{k = K + 1}^{\infty} 2^{- k} < ε / 2

.

Since each inclusion is a subset relation, we can exploit local finiteness. Let

N_{k}

exist such that when

m, n \geq N_{k}

:

d_{k} (ϕ_{m}, ϕ_{n}) < 2^{(k - 1)} ϵ / K

(20)

So there exists

N = max {N_{1}, N_{2}, \dots, N_{K}}

such that when

m, n \geq N

:

d_{k} (φ_{n}, φ_{)} < 2^{(k - 1)} ϵ / N for all k \leq K

(21)

Thus:

\begin{matrix} d (φ_{n}, φ_{m}) & = \sum_{k = 1}^{\infty} 2^{- k} \cdot d_{k} (φ_{n}, φ_{m}) \\ = \sum_{k = 1}^{K} 2^{- k} \cdot 2^{(k - 1)} ε / K + \sum_{k = K + 1}^{\infty} 2^{- k} \cdot d_{k} (φ_{n}, φ_{\infty}) \\ \leq ε / 2 + \sum_{k = K + 1}^{\infty} 2^{- k} \cdot 1 \\ < ε / 2 + ε / 2 = ε \end{matrix}

(22)

Thus, we obtain a Cauchy sequence. Based on completeness, we prove that this sequence has a limit. Let’s assume that its limit is

ϕ_{\infty}

. □

Next, we can construct a complete characterization theory:

T_{M} = T_{approximation} \cup T_{limit} \cup T_{unique} \cup ϕ_{\infty}

(23)

T_{approximation} = \{\exists {M_{n}}_{n} . Satisfies the recursive approximation condition\}

(24)

T_{limit} = \{M = lim_{n \to \infty} M_{n}\}

(25)

T_{unique} = \{The approximation sequence is unique under isomorphism\}

(26)

Finally, we prove the Theorem6;

Proof of Theorem6.

Assume

N ⊧ T_{M}

, then

N

has an approximate sequence

{N_{n}}

that satisfies the same conditions.

By

ϕ_{1}, \dots, ϕ_{\infty}

, the Scott statement for each

N_{n}

is identical to the corresponding

M_{n}

. By Scott’s theorem,

N_{n} ≅ M_{n}

for all n.

Construct an isomorphic sequence

f_{n} : M_{n} \to N_{n}

. By monotonicity, consistency, and Lemma4,

{f_{n}}

can be combined into a global isomorphism:

f = ⋃_{n} f_{n} : M \to N

(27)

Verifying that f is indeed an isomorphism: - Injectivity: by the injectivity and density of each

f_{n}

- Surjectivity: by the surjectivity and limit properties of each

f_{n}

- Homomorphism: by the preservation of relations. □

At this point, we have rigorously proved the theorem and obtained the most abstract formula for

M

,

ϕ_{\infty}

. Clarifying

ϕ_{\infty}

will help researchers understand the logical nature of infinite structures and facilitate deeper research.

3.4. Formalization of Phenomena in Mathematics

Based on the formal characterization of finite and infinite mathematical structures, we finally conclude:

Theorem 8.

Within the first-order and higher-order theoretical framework, almost all mathematical phenomena and mathematical structures can be effectively logically characterized.

Our theory demonstrates that, within the appropriate framework, nearly all mathematical phenomena—particularly discrete, algebraic, and finitely generated phenomena—can be meaningfully logically characterized. This is an important theoretical achievement that expands our understanding of the extent to which mathematics can be formalized.

However, the richness and complexity of mathematics mean that finding a precise logical expression is difficult. As mathematics develops, modern mathematics presents new challenges. Problems such as the explosion of parameter space, the breakdown of intuition, and insufficient tools make characterizing high-dimensional and abstract problems particularly difficult. This reminds us that mathematics has both a formal side and a side beyond formalism. A perfect logical characterization may be a guiding principle, guiding us to continuously deepen our understanding, but it should not be mistaken for a fully achievable ultimate goal.

4. State Expression in Economics and Sociology

The logical formalization of economic and sociological phenomena is of great significance. First, it provides a scientific theoretical foundation for these disciplines. By using precise mathematical language to eliminate the ambiguity and ambiguity inherent in traditional textual descriptions, it enables rigorous definitions and operational measurement standards for abstract concepts such as market efficiency, social capital, and institutional change. Second, logical formalization builds a bridge for interdisciplinary dialogue, enabling the comparison, integration, and mutual learning of rational choice theory in economics, social network analysis in sociology, and institutional theory in political science within a unified mathematical framework, thus promoting the comprehensive development of the social sciences. Third, it significantly enhances the scientific nature of empirical research. By formalizing theoretical assumptions, it makes research replicable and verifiable, and provides essential mathematical tools for computational social science in the big data era. Finally, at the level of policymaking and social governance, logical formalization enables complex socioeconomic policies to be based on rigorous logical reasoning and mathematical modeling, improving the scientific nature of decision-making and the accuracy of predictions. Overall, logical formalization is driving the transformation of economics and sociology from descriptive disciplines to predictive and explanatory sciences, providing more powerful theoretical tools for understanding and improving human society.

4.1. Logical Characterization in the Field of Economics

The logical characterization of economics is of fundamental significance to the development of the discipline and is also one of the hot issues in research[19,20,21,22]. First, logical representation can eliminate ambiguity and vagueness in economic theory. Traditional textual descriptions often allow for multiple interpretations, while logical formalization requires precise definitions of each concept and relationship, forcing theorists to clearly express their assumptions and reasoning. For example, when we say “demand is negatively correlated with price,” a logical representation requires us to specify under what conditions, for which goods, and over what timeframe this relationship holds true.

Furthermore, economics deals with complex systems involving multiple agents, multiple levels, and multiple variables, involving interactions between diverse actors such as consumers, businesses, and governments[23]. When a theory becomes complex, natural language descriptions often fail to accurately capture all logical relationships and constraints. Logical representations provide a structured approach to organizing these complex relationships, ensuring the internal consistency and integrity of the theory[24]. For example, when analyzing market equilibrium, we need to simultaneously consider multiple constraints, such as the supply equation, the demand equation, and market-clearing conditions. Logical representations can clearly demonstrate the logical dependencies between these conditions.

At the same time, considering that there are multiple schools and theoretical frameworks within economics, such as neoclassical economics, Keynesianism, institutional economics, etc[25].Different theoretical frameworks utilize different conceptual systems and analytical methods, making academic dialogue difficult. Logical representation provides a unified language for different theories, enabling theoretical comparison, integration, and synthesis. Researchers can more easily identify commonalities and divergences between different theories, promoting the integration and development of theories.

Therefore, the importance of logical characterization in the field of economics is self-evident. Here we first consider the logical characterization of finite economic structures.

Definition 18

(Economic structure). An economic structure

S

can be represented as follows:

S = (A, R_{1}, R_{2}, \dots, F_{1}, F_{2}, \dots, P_{1}, P_{2}, \dots)

(28)

Where:

A is a set of agents (individuals, enterprises, institutions, etc.)
$R_{i}$ is a relationship (social network, hierarchy, transaction relationship, etc.)
$F_{j}$ is a function (utility function, production function, decision rule, etc.)
$P_{k}$ is a process (market mechanism, institutional evolution, information dissemination, etc.)

Definition 19

(The logical language of economic structure). Basic language

L_{S E}

Contains:

Individual constants:

$a_{1}, a_{2}, \dots$ represent specific agent individuals, enterprises, organizations, etc.

Variables:

$x, y, z$ represent agent variables, t represents time variables, and s represents state variables

Predicate symbols:

$A g e n t (x)$ : x is an agent
$T r a n s i t i o n_{P_{k}} (s, s^{'})$ : represents the transition from state s to state $s^{'}$ under process $P_{k}$
$T r a n s i t i o n C o n d i t i o n_{k} (s, s^{'})$ : indicates that the transition from state s to state $s^{'}$ under process $P_{k}$ satisfies the transition condition.

Theorem 9

(First-order representability of finite economic structures). Let

S = (A, R_{1},

R_{2}, \dots, F_{1}, F_{2}, \dots, P_{1}, P_{2}, \dots)

be a finite economic structure, that is:

1.: $| A | < \infty$ (Finite Agents)
2.: Every relation $R_{i}$ and function $F_{j}$ is defined over a finite field and has corresponding predicate and function representations in the base language.
3.: The process $P_{k}$ involves finite states and finite time.

Then there exists a set of first-order formulas Φ such that:

\forall T, T ⊧ Φ \Leftrightarrow T ≅ S

(29)

Proof of Theorem9.

Domain characterization:

ϕ_{domain} = \forall x (A g e n t (x) \to ⋁_{i = 1}^{| A |} x = a_{i}) \land \underset{i \neq j}{⋀} a_{i} \neq a_{j}

(30)

Characterization of relationships: For each relation

R_{i} \subseteq A^{n_{i}}

:

ϕ_{R_{i}} = \forall x_{1} \dots x_{n_{i}} (R_{i} (x_{1}, \dots, x_{n_{i}}) \leftrightarrow \underset{(a_{j_{1}}, \dots, a_{j_{n_{i}}}) \in R_{i}}{⋁} (x_{1} = a_{j_{1}} \land \dots \land x_{n_{i}} = a_{j_{n_{i}}}))

(31)

Function description: For each function

F_{j} : A^{m_{j}} \to D_{j}

(where the range

D_{j}

is finite):

ϕ_{F_{j}} = \forall x_{1} \dots x_{m_{j}} \exists! y (F_{j} (x_{1}, \dots, x_{m_{j}}) = y \land y \in D_{j})

(32)

Description of the process: For each process

P_{k}

, if it involves a finite state transition:

ϕ_{P_{k}} = \forall s \forall s^{'} (T r a n s i t i o n_{P_{k}} (s, s^{'}) \to {TransitionCondition}_{k} (s, s^{'}))

(33)

Complete formula:

Φ = {ϕ_{domain}, ϕ_{R_{1}}, ϕ_{R_{2}}, \dots, ϕ_{F_{1}}, ϕ_{F_{2}}, \dots, ϕ_{P_{1}}, ϕ_{P_{2}}, \dots}

(34)

Since all components are finite, every formula is first-order, and

Φ

completely characterizes the structure

S

, we have proved the result. □

From the proof, we can see that structural finiteness has a profound impact on the construction of economic theory. It means that economic models need to pay more attention to boundary conditions, constrained optimization, and finite games. At the same time, finiteness also provides a more realistic foundation for economic analysis, making theoretical predictions more closely aligned with actual economic phenomena.

Given that in the real world, the number of market participants, the variety of resources and commodities, and time and space are all finite, we can further conclude.

Theorem 10.

Within the theoretical framework of first-order and higher-order logic, almost all economic structures can be fully characterized.

Here, we give an example to verify how logic represents economic phenomena and structures. Lowercase letters represent variables.

Table 1. Predicate Definition.

symbol	definition
$D e m a n d (i, g, p, q)$	The quantity q demanded by consumer i for good g at price p is q
$C o n s u m e r (i)$	i is a consumer
$G o o d (g)$	g is a commodity
$P r i c e (p)$	p is a valid price (non-negative)
$Q u a n t i t y (q)$	q is a valid quantity (non-negative)
$M a x i m i z e s U t i l i t y (i, b, c o n s t r a i n t)$	Consumer i chooses a bundle of goods b to maximize utility under constraints
$B u d g e t C o n s t r a i n t (i, p, i n c o m e)$	Consumer i’s budget constraint under price p and income $i n c o m e$
$I n c o m e (i)$	Income of consumer i
$C o n t a i n s (b, g, q)$	The bundle b contains a quantity q of the good g
$S u p p l y (f, g, p, q)$	The supply of good g by firm f at price p is q
$F i r m (f)$	f is an enterprise (production unit)
$P r o f i t M a x i m i z i n g (f, v, q_{b u n d l e}, p, w)$	Firm f chooses input v and output $q_{b u n d l e}$ to maximize profit under price p and factor price w¹

Definition 20.

Definition of Supply and Demand:

\begin{matrix} D e m a n d (i, g, p, q) & \leftrightarrow C o n s u m e r (i) \land G o o d (g) \land P r i c e (p) \land Q u a n t i t y (q) \land \\ \exists b (M a x i m i z e s U t i l i t y (i, b, B u d g e t C o n s t r a i n t (i, p, Income (i))) \land \\ C o n t a i n s (b, g, q)) \end{matrix}

(35)

\begin{matrix} S u p p l y (f, g, p, q) & \leftrightarrow F i r m (f) \land G o o d (g) \land P r i c e (p) \land Q u a n t i t y (q) \land \\ \exists v, q_{b u n d l e} (P r o f i t M a x i m i z i n g (f, v, q_{b u n d l e}, p, w) \land \\ C o n t a i n s (q_{b u n d l e}, g, q)) \end{matrix}

(36)

4.2. Logical Characterization in the Field of Sociology

The reason why social structures require logical representations is closely related to their inherent complexity and abstractness, and these needs are even more pressing than in economics[26,27,28].

The need to handle relational complexity is a core motivation for logical representations of social structure. Social structure is essentially composed of a network of multiple relationships between individuals, including kinship, power, economic, and cultural relationships. These relationships often intersect and overlap, forming a multidimensional, complex network. Natural language struggles to accurately describe structural features such as transitivity, symmetry, and hierarchy within such networks. Logical representations can formalize the precise properties of these relationships, such as the transitive relationship “If A is B’s superior, and B is C’s superior, then A is C’s superior.”

The need to operationalize abstract concepts is another key factor. Sociology is replete with abstract concepts, such as social status, cultural capital, social cohesion, and institutional legitimacy. These concepts often lack intuitive physical counterparts, and their meanings can shift in different contexts. Logical representations force researchers to clearly define the connotations and extensions of these abstract concepts and establish logical relationships between them, making theoretical discussions more precise and operational[29].

The first-order logic description of social structure is similar to that of economic structure. Here, we only give the theorem.

Theorem 11

(First-order representability of finite social structures). Let

S = (A, R_{1}, R_{2},

\dots, F_{1}, F_{2}, \dots, P_{1}, P_{2}, \dots)

be a finite social structure, that is:

1.: $| A | < \infty$ (finite agent)
2.: Each relation $R_{i}$ and function $F_{j}$ is defined over a finite domain and has corresponding predicate and function representations in the base language.
3.: The process $P_{k}$ involves finite states and finite time.

Then there exists a set of first-order formulas Φ such that:

\forall T, T ⊧ Φ \Leftrightarrow T ≅ S

(37)

Similarly, considering the limitations of social structure, we can draw a conclusion.

Theorem 12.

Within the theoretical framework of first-order and higher-order logic, almost all social structures can be fully characterized.

Through logical representation, sociology can not only describe social phenomena more accurately, but also discover hidden social laws, providing more powerful theoretical tools for understanding and improving social structure.

Here, we give an example of logical representation.

Table 2. Semantic interpretation table of sociological predicates.

predicate	Semantic meaning
$F r i e n d (x, y)$	x and y are friends
$S m o k e s (x)$	x smoking
$I n f l u e n c e d (x, y)$	x is affected by y
$H i g h e r S m o k i n g P r o b a b i l i t y (x)$	x has a higher probability of smoking
$S o c i a l N e t w o r k (x, y)$	x and y are in the same social network

Through the given predicate, we can express the state of the smoking phenomenon in sociology.Here, HigherSmokingProbability(x) is abbreviated as HSP(x).

Definition 21

(Expressions related to smoking).

\begin{matrix} ϕ_{1} & = \forall x, y (F r i e n d (x, y) \to F r i e n d (y, x)) (Friendship is symmetrical) \end{matrix}

(38)

\begin{matrix} ϕ_{2} & = \forall x, y (F r i e n d (x, y) \land S m o k e s (y) \to I n f l u e n c e d (x, y)) (Friends ’ smoking has an impact) \end{matrix}

(39)

\begin{matrix} ϕ_{3} & = \forall x (I n f l u e n c e d (x, y) \to H S P (x)) (Increases the probability of smoking) \end{matrix}

(40)

\begin{matrix} ϕ_{4} & = \forall x, y, z (F r i e n d (x, y) \land F r i e n d (y, z) \to S o c i a l N e t w o r k (x, z)) (Forming a social network) \end{matrix}

(41)

5. Computer Field State Expression

In computer science, logic plays an irreplaceable role as a fundamental tool for expressing states. In program verification, Hoare logic precisely describes the state of each execution point of a program through preconditions, postconditions, and invariants, enabling rigorous proof of program correctness. In database systems, first-order logic is not only used to define the semantics of query languages, but also characterizes the legal state space of data through integrity constraints. In the field of formal methods, temporal logic (such as LTL and CTL) can express the dynamic behavior and safety properties of systems in the time dimension, providing a mathematical foundation for modeling concurrent and real-time systems. Planning problems in artificial intelligence are essentially about finding a path from an initial state to a target state in the state space described by logic. In hardware design, Boolean logic directly corresponds to the physical state of circuits, enabling the design of complex digital systems. This abstract expressive power of logic not only provides a unified mathematical framework for modeling complex systems but, more importantly, enables automated reasoning and verification. From compiler optimization analysis to operating system resource management, from network protocol correctness verification to interpretability analysis of machine learning models, logic plays a critical role in translating intuitive concepts into computable forms[30,31,32].

5.1. Boolean Algebra and the Formalization of Computer Systems

Boolean algebra and logical representation are core areas of computer science and mathematical logic[33,34]. First, we verify that Boolean algebra can be formalized using first-order logic.

Theorem 13

(The expressive power of Boolean algebra and first-order logic). All axioms and operations of Boolean algebra can be expressed using first-order logic (FOL).

Proof of Theorem13.

Boolean algebra is defined as: a set B, binary operations

\land, \lor

, unary operations ¬, constants

0, 1

, and the following axioms:

\begin{matrix} \forall a, b \in B : a \land b = b \land a \end{matrix}

(42)

\begin{matrix} \forall a \in B : a \lor \neg a = 1 \end{matrix}

(43)

\begin{matrix} \forall a, b, c \in B : a \land (b \lor c) = (a \land b) \lor (a \land c) etc . \end{matrix}

(44)

(45)

Given that Boolean algebra is a finite mathematical structure, all its structures and properties can be formalized using first-order logic. Therefore, the theorem is proved. □

Boolean algebra is the foundation of computer systems. The logic gates in digital circuits, conditional branching in programs, propositional calculus, and automata are all based on Boolean algebra. Operations, states, and transitions can all be reduced to Boolean algebraic expressions.

Because the entire content of Boolean algebra can be expressed using first-order logic, and computer systems can be reduced to Boolean algebraic structures, its core content can also be expressed using first-order logic. Practical applications such as model checking, hardware verification, and theorem proving have extensively employed first-order logic for modeling and reasoning.

Theorem 14.

Computer systems can be formally expressed in first-order logic.

5.2. Predicate Logic Description of a Turing Machine (TM)

Definition 22.

Similar to finite state machines, Turing machines can be formally expressed. A deterministic Turing machine (DTM) can be represented as a seven-tuple:

M = (Q, Γ, Σ, δ, q_{0}, q_{accept}, q_{reject})

(46)

Where:

Q is a finite state set.
Γ is the set of tape symbols (including the blank symbol ⊔).
$Σ \subseteq Γ$ is the set of input symbols (excluding ⊔).
$δ : Q \times Γ \to Q \times Γ \times {L, R}$ is the state transition function, where L and R indicate whether the read/write head moves left or right.
$q_{0} \in Q$ is the initial state.
$q_{accept}, q_{reject} \in Q$ are the accept and reject states, respectively.

To represent a Turing machine, we can define the following predicates and give the corresponding state expressions:

Definition 23

(Predicate Definition and State Expression). The predicates and states in a Turing machine can be expressed as:

State: $State (q)$ indicates that q is a state.

$ϕ_{1} = \forall q (State (q) \to q \in Q)$

(47)
Tape Symbol: $Typesymbol (a)$ indicates that a is a tape symbol.

$\forall a (TypeSymbol (a) \to a \in Γ)$

(48)
Tape content: $Cell (t, p, a)$ indicates that a is the tape symbol at time t and position p.

$\forall t \forall p \forall a (Cell (t, p, a) \to a \in Γ)$

(49)
Read/Write Head Position: $Head (t, p)$ indicates that the head is at position p at time t.

$\forall t \exists p Head (t, p)$

(50)
Transition function: $T r a n s t i o n (q, a, q^{'}, a^{'}, d)$ represents state transition, where d is the direction.

$\forall q \forall a \forall q^{'} \forall a^{'} \forall d (Transition (q, a, q^{'}, a^{'}, d) \leftrightarrow δ (q, a) = (q^{'}, a^{'}, d))$

(51)
Initial state: $I n i t i a l (q)$ represents the initial state q.

$\exists q_{0} (Initial (q_{0}) \land State (q_{0}))$

(52)
Accept state (similar to the rejection state): $A c c e p t (q)$ indicates that q is an accepting state.

$\forall q (Accept (q) \leftrightarrow q = q_{accept})$

(53)

Based on the above definition of predicates and the corresponding Turing machine state representation and state transition representation, we derive the theorem:

Theorem 15

(Turing Machine State Representation). For any Turing machine, its input, output, and state transition behavior over a relevant time set can be described using a state set.

Before Turing, concepts such as “computation,” “algorithm,” and “efficient process” were intuitive. Mathematicians knew what computation was, but could not give a rigorous mathematical definition. The logical representation of the Turing machine was the first to transform these intuitive concepts into precise mathematical objects: state sets, symbol sets, transition functions, initial states, and accepting states. This formalization enables us to use mathematical methods to study the properties of computation itself[35].

5.3. Mathematical Formalization of Neural Networks

Neural networks (NNs) are machine learning models that mimic the structure and function of biological neural systems, capable of learning complex patterns from data and making predictions or decisions[36]. They are a core technology in deep learning and are widely used in fields such as computer vision, natural language processing, and speech recognition.

First, let’s briefly introduce neural networks. A neural network can be represented as a function

N : R^{n} \to R^{m}

, whose hierarchical structure is decomposed into:

N (x) = σ_{L} (W_{L} \cdot σ_{L - 1} (W_{L - 1} \dots σ_{1} (W_{1} x + b_{1}) \dots) + b_{L})

(54)

Where: input layer:

x \in R^{n}

, hidden layer:

h_{l} = σ_{l} (W_{l} h_{l - 1})

, output layer:

N (x) \in R^{m}

, weight matrix:

W_{l} \in R^{d_{l} \times d_{l - 1}}

, bias vector:

b_{l} \in R^{d_{l}}

, activation function:

σ_{l} : R^{d_{l}} \to R^{d_{l}}

Single neuron calculation:

σ (w^{T} x + b) = σ (\sum_{i = 1}^{n} w_{i} x_{i} + b)

(55)

Neural networks, by simulating the connections of the human brain, enable efficient modeling of complex data. With advances in computing power and algorithms (e.g., GPUs and attention mechanisms), their capabilities are continuously expanding, becoming the driving force behind the AI revolution.

Next, we demonstrate that the relevant aspects of neural networks can be formally expressed.

Proposition 1.

Neural networks can be formally expressed in first-order and higher-order logic.

Proof of Proposition1.

The first step is to logically represent a single neuron in a neural network.

Definition 24

(Neuron Triplet). A neuron can be represented as

N = (w, b, σ)

, where

w \in R^{n}

represents the weight vector,

b \in R

represents the bias term, and

σ : R \to R

represents the activation function.

Using the neuron triple model, we can see that the neuron input-output relationship can be logically represented as follows:

\forall x \in R^{n}, \exists z, a \in R, (z = \sum_{i = 1}^{n} w_{i} x_{i} + b) \land (a = σ (z))

(56)

Next, we express the activation function in the neural network. Because activation functions possess strong mathematical properties and are finite mathematical structures, they are naturally amenable to logical expression.

Finally, we demonstrate the logical representation of the network topology. Taking a simple fully connected layer as an example:

Feedforward Network: $C o n n e c t e d (u, v)$ indicates the connection between $u, v$ , and $L a y e r_{l}$ indicates the lth layer.

$\begin{matrix} \forall l \in {1, . . ., L}, \forall u \in {Layer}_{l}, \forall v \in {Layer}_{l + 1}, \\ Connected (u, v) \land \neg \exists k < l, Connected (v, {Layer}_{k}) . \end{matrix}$

(57)
Hierarchical Combination: $N e t w o r k (x)$ indicates the hierarchical combination structure of x.

$\exists Network : (R^{d_{0}} \to R^{d_{L}}), Network (x) = σ_{L} \circ W_{L} \circ \dots \circ σ_{1} \circ W_{1} (x)$

(58)
Combinatorial Completeness: If the lth layer can be represented as $Φ_{l}$ , then the $l + 1$ th layer can be represented as:

$Φ_{l + 1} = \exists y_{l}, Φ_{l} (x, y_{l}) \land (y_{l + 1} = σ_{l + 1} (W_{l + 1} y_{l} + b_{l + 1}))$

(59)

Thus, we have proved the conclusion. □

According to the theorem, from a theoretical perspective, there is a profound equivalence between neural networks and logical systems. Neural networks can discover strategic patterns that humans have never discovered. If these patterns can be expressed in logical form, they may be transformed into verifiable scientific theories. This fusion is not just a technological advancement; it also represents a deepening of our understanding of the nature of intelligence: intelligence requires both the ability to learn from experience and the ability to reason based on rules, and the logical representation of neural networks is the key bridge connecting the two.

5.4. Formal Expression of States in Computer Science

According to previous proofs, computer systems, Turing machines, and neural networks can all be formalized using first-order and higher-order logic. Scholars have also formalized phenomena such as computational concepts, programming languages, and algorithmic processes.[11,37,38]. Furthermore, emerging problems in computer science can be gradually formalized using the proof process of neural networks. Based on this, we draw a comprehensive conclusion.

Theorem 16.

All phenomena in computer science can be formalized using first-order and higher-order logic.

Theorem 16 demonstrates that logic provides a unified framework for expressing different computational models. Logical formalization has transformed computer science from an engineering craft into a rigorous scientific discipline, providing powerful tools for understanding and controlling complex systems.

6. State Expression in Natural Language Domain

Natural languages, such as Chinese, English, and Arabic, are the languages humans use in everyday life. They are essential tools for communicating ideas and conveying information. In contrast to formal languages, they possess unique properties. Natural languages are richly expressive, capable of expressing the myriad worlds, complex emotions, and abstract concepts. This chapter introduces Montague semantics, explores the rules governing the grammar and semantic translation of natural languages, and achieves a formal understanding of natural languages through the principles of formalized mathematical logic.

6.1. Montague Semantics

Montagu semantics, also known as Montagu grammar, is a formalized approach to the study of natural semiotics, particularly intensional semantics. It represents a new stage in the development of linguistics and logic.

Montagu’s research began with the concept of categories, dividing English syntax into distinct categories. He then established 17 general rules for the formation of all English syntax. He then defined meaningful expressions, used recursion to represent meaningful sets, and explained them using model theory. Finally, Montagu defined 17 corresponding rules for semantic translation, corresponding to the 17 rules for the formation of syntax[39,40].

Based on Montagu’s work, Bennett conducted further research. He refined the English categories, dividing adjectives into separate categories; introduced more complex grammatical and semantic translation rules, expanding the rules from 17 to 35, and provided a solid theoretical foundation for language research using Montagu semantics[41].

6.2. Study of English Ambiguity

Correctly handling linguistic ambiguity is an important indicator of semantic comprehension. Here, we use Montague Grammar to provide an interpretation of ambiguity.

“At least one person likes the book” is a common ambiguous phrase in English. The ambiguity of “at least one person likes the book” stems from the ambiguity of the quantifier scope. Without context, “the book” could refer to a specific book or to a general term.

The semantics of the sentence “At least one person likes the book” can be formally modeled using the

λ

calculus, with two possible interpretations: a broad interpretation and a narrow interpretation.

In the broad interpretation, “the book” refers to a specific book b. The logical form of the entire sentence is:

\exists x [m a n^{'} (x) \land l i k e^{'} (\hat{} b^{'}) (x)]

(60)

Where:

$m a n^{'} (x)$ means x is a person;
$l i k e^{'} (\hat{} b^{'}) (x)$ means x likes a specific book b.

The specific

λ

calculus combination process is as follows:

\begin{matrix} Like & : l i k e^{'} \\ The book & : b^{'} \\ Like the book & : l i k e^{'} (\hat{} b^{'}) \\ At least one person & : λ P . \exists x [m a n^{'} (x) \land P {x}] \\ Combination result & : λ P . \exists x [m a n^{'} (x) \land P {x}] (\hat{} l i k e^{'} (\hat{} b^{'})) \\ λ transposition & : \exists x [m a n^{'} (x) \land \hat{} l i k e^{'} (\hat{} b^{'}) {x}] \\ Bracket convention & : \exists x [m a n^{'} (x) \land \overset{ˇ}{} \hat{} l i k e^{'} (\hat{} b^{'}) (x)] \\ Top and bottom elimination & : \exists x [m a n^{'} (x) \land l i k e^{'} (\hat{} b^{'}) (x)] \end{matrix}

In the narrow-scope interpretation, “the book” is not a specific object but a quantifiable range. The logical form of the sentence is:

\exists x [m a n^{'} (x) \land (l i k e^{'} (x, \hat{} λ Q . \exists b [b o o k^{'} (b) \land Q {b}]))]

(61)

Where:

$b o o k^{'} (b)$ indicates that b is a book;
$(l i k e^{'} (x, \hat{} λ Q . \exists b [b o o k^{'} (b) \land Q {b}]))$ indicates that there exists at least one person x, there exists a book b, and this person likes book b.

The specific

λ

calculus combination process is as follows:

\begin{matrix} Like & : l i k e^{'} \\ At least one person & : λ P . \exists x [m a n^{'} (x) \land P {x}] \\ The book exists & : λ Q . \exists b [b o o k^{'} (b) \land Q {b}] \\ Like the book & : l i k e^{'} (\hat{} λ Q . \exists b [b o o k^{'} (b) \land Q {b}]) \\ Combination result & : λ P . \exists x [m a n^{'} (x) \land P {x}] (\hat{} l i k e^{'} (\hat{} λ Q . \exists b [b o o k^{'} (b) \land Q {b}])) \\ λ Transposition & : \exists x [m a n^{'} (x) \land (\hat{} l i k e^{'} (\hat{} λ Q . \exists b [b o o k^{'} (b) \land Q {b}])) {x}] \\ Bracket convention & : \exists x [m a n^{'} (x) \land (\overset{ˇ}{} \hat{} l i k e^{'} (\hat{} λ Q . \exists b [b o o k^{'} (b) \land Q {b}])) (x)] \\ Up and Down Elimination & : \exists x [m a n^{'} (x) \land (l i k e^{'} (\hat{} λ Q . \exists b [b o o k^{'} (b) \land Q {b}])) (x)] \\ Relational notation & : \exists x [m a n^{'} (x) \land (l i k e^{'} (x, \hat{} λ Q . \exists b [b o o k^{'} (b) \land Q {b}]))] \end{matrix}

By modeling the

λ

calculus, the sentence “At least one person likes this book” can capture the following two semantic interpretations:

Broad interpretation: There is at least one person who likes this particular book:

$\exists x [m a n^{'} (x) \land l i k e^{'} (\hat{} b^{'}) (x)]$

(62)
Narrow interpretation: There is a book and at least one person likes it:

$\exists x [m a n^{'} (x) \land (l i k e^{'} (x, \hat{} λ Q . \exists b [b o o k^{'} (b) \land Q {b}]))]$

(63)

Similarly, the ambiguous phenomenon of “Every student read a book” has been extensively studied and interpreted. The quantifier scope ambiguity involved in this issue is an active frontier in semantic research, continuing to drive theoretical and technological developments[42]. Studying such issues can further advance the research and development of natural semantics.

6.3. Optimizing Syntax and Semantic Translation Rules

Although Montagu, Bennett, and others have established detailed rules for English semantics and grammar, which later generations can simply apply directly, many issues still need to be resolved in the actual translation process.

6.3.1. Conjunction Rules

Among Montagu’s 17 rules, S11 and S12 deal with conjunction, defining parallel sentences and parallel verbs, respectively. Among Bennett’s 35 rules, S28 deals with conjunction, defining only parallel verbs. However, as we know, conjunctions of nouns and noun phrases occur very frequently in natural language. Surprisingly, neither Montagu nor Bennett provide corresponding grammatical rules for this phenomenon. This is because conjunctions of nouns and noun phrases require the verb to become plural, and to simplify expression, no corresponding grammatical rules were defined. However, given the high frequency and widespread use of nouns and noun phrases, and to help beginners better grasp the relevant content, we provide the following additional rules:

Definition 25

(Grammatical Rules for Conjunction of Nouns and Noun Phrases). If

α, β \in P_{C N}

, then

F (α, β) \in P_{C N}

. If

α, β \in P_{T}

, then

F (α, β) \in P_{T}

. Here,

F (α, β) = α a n d β

. And when the object of the conjunction becomes the subject, the corresponding verb becomes plural.

Definition 26

(Translation Rules for Conjunctions of Nouns and Noun Phrases). If

α, β \in P_{C N}

or

α, β \in P_{T}

, and

α, β

is translated as

α^{'}, β^{'}

, then

α a n d β

is translated as

λ P [α^{'} (P) \land β^{'} (P)]

. When the conjunction object serves as the subject, the corresponding verb is translated into its plural form.

6.3.2. Adjective Rules

Among Bennett’s 35 rules, the one concerning adjectives is S10. We give its original definition:

Definition 27

(S10). If

γ \in P_{A J}

and

ζ \in P_{C N}

, then

F_{9} (γ, ζ) \in P_{C N}

, where

(a): if γ contains an occurrence of a member of $B_{A J / T}$ , then $F_{φ} (γ, ζ) = ζ γ$ ;
(b): otherwise $F_{9} (γ, ζ) = γ ζ$ .

Bennett argues that using S10 can resolve the English grammatical phenomenon of adjective + noun. However, let’s consider the following example: John’s mother. According to Bennett’s definition, John’s does not fall into the basic category of adjectives, so S10 cannot be used for translation. We must instead use S5.

Definition 28

(S5). S5.If

ζ \in P_{C N / T}

and

α \in P_{T}

, then

F_{5} (ζ, α) \in P_{C N}

, where

(a): if $α = h e_{n}$ , then $F_{5} (ζ, α) = ζ * h i m$ ;
(b): otherwise $F_{5} (ζ, α) = ζ α$ .

Consider John’s mother to be equivalent to mother of John. This can be translated:

\begin{matrix} mother & : m o t h e r^{'} \\ John & : λ P [P {j}] \\ mother of John & : m o t h e r^{'} (\hat{} (λ P [P {j}])) \end{matrix}

While there’s certainly nothing wrong with translating “John’s mother” this way, treating “John’s mother” and “mother of John” as equivalent loses the distinction between the two grammatical structures. Furthermore, the resulting translation is often less concise and clear, hindering the reader’s intuitive understanding. Therefore, we provide supplementary rules for these situations.

Definition 29

(Grammar rule). If

α \in P_{C N}

or

α \in P_{T}

, then

G_{2} (α) \in P_{A J}

, where

(a): If $α = h e_{n}$ , then $G_{2} (α) = h i s_{n}$ ;
(b): Otherwise $G_{2} (α) = α^{'} s$ .

Definition 30

(Semantic Translation Rules). If

α \in P_{C N}

or

α \in P_{T}

, and α is translated as

α^{''}

, then

(a): $h i s_{n}$ is translated as ${h i s_{n}}^{'}$ ;
(b): John’s is translated as $λ P [P {j^{'} s}]$ .
(b): In other cases, $α^{'} s$ is translated as $α^{''} s^{'}$ .

Using the new rules, we can retranslate John’s mother:

\begin{matrix} mother & : m o t h e r^{'} \\ John ’ s & : λ P [P {j^{'} s}] \\ John ’ s mother & : λ P [P {j}] (\hat{} m o t h e r^{'}) \\ λ transposition & : \hat{} m o t h e r^{'} {j} \\ Brackets, upper and lower rules & : m o t h e r^{'} (j^{'} s) \end{matrix}

In comparison, the translation using the new rules is more concise and clear, making it easier for readers to grasp the grammatical structure.

6.3.3. Clause Rules

For the sake of brevity, neither Montagu nor Bennett discussed clause rules in detail, instead focusing on the typical relation “such that.” Some might question whether each clause has a specific function and introductory phrase, expressing different logical relationships depending on the context. Since their functions are not identical to “such that,” does this mean that the same rules cannot be applied universally?

Indeed, “such that” primarily expresses a result or condition. Commonly used attributive clauses, such as modifiers and adverbial clauses, often express time or cause, differ significantly from “such that.” However, these commonly used clauses are structurally simpler than standard relative clauses. Generally speaking, one can emulate Montagu’s approach to translation by modifying or simplifying Montagu’s rules or by transforming the clause form.

Here, we provide a case study: the man whom Mary loves.

\begin{matrix} man : m a n^{'} \\ the man : λ P \exists y [\forall x [m a n^{'} (x) \leftrightarrow x = y] \land P {y}] \\ Mary : λ Q [Q {m}] \\ loves : l o v e^{'} \\ loves the man : l o v e^{'} (\hat{} (λ P \exists y [\forall x [m a n^{'} (x) \leftrightarrow x = y] \land P {y}])) \\ Mary loves the man : λ Q [Q {m}] (\hat{} l o v e^{'} (\hat{} (λ P \exists y [\forall x [m a n^{'} (x) \leftrightarrow x = y] \land P {y}]))) \\ λ Transposition : \hat{} λ P \exists y [\forall x [m a n^{'} (x) \leftrightarrow x = y] \land P {y}])) {m} \\ Brackets, upper and lower rules : λ P \exists y [\forall x [m a n^{'} (x) \leftrightarrow x = y] \land P {y}])) (m) \\ λ Transposition : \exists y [\forall x [m a n^{'} (x) \leftrightarrow x = y] \land m {y}])) \end{matrix}

Combined results:

\begin{matrix} λ x_{n} [(λ P \exists y [\forall x [m a n^{'} (x) \leftrightarrow x = y] \land P {y}]) (x_{n}) \land \\ (\exists y [\forall x [m a n^{'} (x) \leftrightarrow x = y] \land m {y}]))] \end{matrix}

(64)

After

λ

transposition, brackets, and upper and lower rules, the final result is:

\begin{matrix} λ x_{n} [\exists y [\forall x [m a n^{'} (x) \leftrightarrow x = y] \land \overset{ˇ}{} x_{n} (y)]) \land \\ (\exists y [\forall x [m a n^{'} (x) \leftrightarrow x = y] \land \overset{ˇ}{} m (y)]))] \end{matrix}

(65)

The core insight of Montague semantics lies in placing natural language within a formal framework as rigorous as that of mathematics and logic. By supplementing these rules, scholars can better understand natural language through logic. Formal logic is not just an abstract mathematical tool; it is a key approach to understanding and modeling the most complex human cognitive abilities. As artificial intelligence progresses toward general intelligence, the formal methodology represented by Montague semantics will continue to play an irreplaceable role.

6.4. Formalization of Natural Languages

Scholars such as Montague have provided comprehensive tools and detailed introductions to the formalization of English, a natural language. Furthermore, combined with the rules subsequently added by scholars, it can be argued that English can be fully formalized. Similarly, other natural languages can be formalized using similar methods. Alternatively, given the intertranslatability between English and other languages, they can be directly converted into English for formalization.

In short, we can conclude.

Theorem 17.

All natural languages can be fully formalized using first-order and higher-order logic with appropriate expansion of symbols and rules.

The importance of formalization of natural languages ultimately lies in the scientific method it provides for understanding the essence of human intelligence. Language is not only a tool for communication but also a vehicle for thought, a container for knowledge, and a medium for the transmission of culture. Formalizing language is a mathematical modeling of human cognitive abilities and a scientific exploration of the essence of intelligence.

7. Conclusion and Outlook

This paper systematically investigates the logical formalization of object states, proposing and demonstrating a rigorous and universally applicable formalization framework for revealing the nature of information and its interdisciplinary applications. The paper first reviews classical information theory and its shortcomings, noting the current lack of a unified and rigorous mathematical definition of the core concept of “state.”

To this end, this paper establishes a universal representation system for information states based on first-order and higher-order predicate logic, combined with modal logic and calculus, addressing the current lack of formal representations for states. This paper enumerates typical states from various fields, including mathematics, economics, sociology, computer science, and natural language, and rigorously proves that these states can be formalized using first-order and higher-order logic. Because the proof methods used in these fields can be extended to other fields, this paper demonstrates a universally applicable method for representing states in any domain: formalization using first-order and higher-order predicate logic.

Thus, first-order and higher-order logic are not merely technical tools but also cognitive bridges for understanding the unity of the world. They integrate scattered fragments of knowledge into coherent theoretical systems, elevate local understandings of phenomena into global conceptual frameworks, and transform static descriptions of states into dynamic reasoning processes. In this sense, logic has truly become a universal bridge connecting states across various fields, a universal language that enables communication across fields and provides humanity with the most fundamental and powerful mathematical tools for understanding and transforming the world.

Through the formalization of states, objective information theory has been further refined and developed, deepening research on the nature of information and expanding its scope from the classic Shannon framework. Many pressing problems in information science can be transformed into logical problems. By studying the properties of logical language and drawing on proven conclusions and axioms from logic, we can clarify the meaning of information and guide the development of information research.

Funding

This research received no external funding

Acknowledgments

I would like to thank Professor Xu for his guidance all the time. He has provided me with great help in writing, content organization, topic selection, etc. Without him, I would not have been able to complete this paper.

Conflicts of Interest

The authors declare no conflicts of interest. The authors have identified and declared that there are no personal circumstances or interests that may be perceived asinappropriately influencing the representation or interpretation of the reported research results. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Abbreviations

The following abbreviations are used in this manuscript:

OIT	Objective Information Theory
FOL	First-order predicate logic
HOL	Higher-order predicate logic

References

N. WIENER. Cybernetics or Control and Communication in the Animal and the Machine. The MIT Press, Cambridge, 2019.
C. E. Shannon. The mathematical theory of communication. Bell Syst Tech, 27:379–423, 1948. [CrossRef]
J. von NEUMANN. Mathematische Grundlagen der Quantenmechanik, volume 38. Springer Berlin Heidelberg, Berlin, Heidelberg, 1971.
A. N. KOLMOGOROV. Three approaches to the quantitative definition of information. International journal of computer mathematics, 2(1-4):157–168, 1968. [CrossRef]
MCGOWAN A T. User and information dynamics: Managing change. Bulletin of the Medical Library Association, 78:327–329, 1990.
XU. J, MA. X, SHEN. Y, et al. Objective information theory: A sextuple model and 9 kinds of metrics. 2014 Science and information conference. IEEE, pages 793–802, 2014. [CrossRef]
XU. J, MA. X, TANG. J, et al. Research on model and measurement of objective information. Science China Information Sciences, 45(3):336–353 (In Chinese), 2015.
XU. J, LIU. Z, WANG. S, et al. Foundations and applications of information systems dynamics. Engineering(Beijing, China), 27:254–265, 2023. [CrossRef]
Alan G Hamilton. Logic for mathematicians. Cambridge University Press, 1988.
P. B. ANDREWS. An Introduction to Mathematical Logic and Type Theory: To Truth Through Proof: vol 27. Dordrecht: Springer Netherlands, 2002. [CrossRef]
Jianfeng Xu. Information science principles of machine learning: A causal chain meta-framework based on formalized information mapping. arXiv preprint arXiv:2505.13182, 2025. [CrossRef]
Richard G Swan. K-theory of finite groups and orders, volume 149. Springer, 2006.
Rudolf Lidl and Harald Niederreiter. Finite fields. Number 20. Cambridge university press, 1997.
Leonid Libkin. Elements of finite model theory, volume 41. Springer, 2004. [CrossRef]
Chen Chung Chang and H Jerome Keisler. Model theory, volume 73. Elsevier, 1990.
Richard Dedekind. Was sind und was sollen die zahlen? In Was sind und was sollen die Zahlen?. Stetigkeit und Irrationale Zahlen, pages 1–47. Springer, 1965. [CrossRef]
Giuseppe Peano. Arithmetices principia: Nova methodo exposita. Fratres Bocca, 1889.
Dana Scott. Logic with denumerably long formulas and finite strings of quantifiers. In The theory of models, pages 329–341. Elsevier, 2014. [CrossRef]
Gerard Debreu. Theory of value: An axiomatic analysis of economic equilibrium, volume 17. Yale University Press, 1959.
Yoav Shoham and Kevin Leyton-Brown. Multiagent systems: Algorithmic, game-theoretic, and logical foundations. Cambridge University Press, 2008.
John Geanakoplos. Three brief proofs of arrow’s impossibility theorem. Economic Theory, 26(1):211–215, 2005. [CrossRef]
Ulle Endriss. Logic and social choice theory. 2012.
W Brian Arthur, Steven N Durlauf, and David A Lane. The economy as an evolving complex system ii. adison wesley. Reading, MA, 1997.
Jaakko Hintikka and Jack Kulas. Anaphora and Definite Descriptions: Two Applications of Game-Theoretical Semantics, volume 26. Springer Science & Business Media, 1985. [CrossRef]
Daniel M Hausman. The inexact and separate science of economics. Cambridge University Press, 2023.
James Samuel Coleman. Introduction to mathematical sociology. 1964.
Stanley Wasserman and Katherine Faust. Social network analysis: Methods and applications. 1994.
Patricia H Thornton, William Ocasio, and Michael Lounsbury. The institutional logics perspective: A new approach to culture, structure, and process. Oxford University Press, 2012.
Stephen P Borgatti, Martin G Everett, Jeffrey C Johnson, and Filip Agneessens. Analyzing social networks using R. Sage, 2022.
Michael Huth and Mark Ryan. Logic in Computer Science: Modelling and reasoning about systems. Cambridge university press, 2004.
Charles Antony Richard Hoare. An axiomatic basis for computer programming. Communications of the ACM, 12(10):576–580, 1969. [CrossRef]
Benjamin C Pierce. Types and programming languages. MIT press, 2002.
George Boole. The mathematical analysis of logic. CreateSpace Independent Publishing Platform, 1847.
George Boole. An investigation of the laws of thought: on which are founded the mathematical theories of logic and probabilities, volume 2. Walton and Maberly, 1854.
Alan Mathison Turing et al. On computable numbers, with an application to the entscheidungsproblem. J. of Math, 58(345-363):5, 1936. [CrossRef]
Warren S McCulloch and Walter Pitts. A logical calculus of the ideas immanent in nervous activity. The bulletin of mathematical biophysics, 5(4):115–133, 1943. [CrossRef]
Glynn Winskel. The formal semantics of programming languages: an introduction. MIT press, 1993.
Hartley Rogers Jr. Theory of recursive functions and effective computability. MIT press, 1987.
Richard Montague. English as a formal language. 1970.
Richard Montague. The proper treatment of quantification in ordinary english. In Approaches to natural language: Proceedings of the 1970 Stanford workshop on grammar and semantics, pages 221–242. Springer, 1973. [CrossRef]
Michael Bennett. A variation and extension of a montague fragment of english. In Montague Grammar, pages 119–163. Elsevier, 1976. [CrossRef]
Robin Cooper. Quantification and syntactic theory, volume 21. Springer Science & Business Media, 2013. [CrossRef]

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

Research on a General State Formalization Method from the Perspective of Logic

Abstract

Keywords:

Subject:

1. Introduction

2. Formal Expression of State

2.1. First-Order Formal System Definition

2.2. Recursive Definition of Higher-Order Formal Systems

2.3. Interpretation of Formal Systems

2.4. Axiom System for Logical Expression of Ontology Components Under State Decomposition

2.5. The State of an Object at a Specific Time

3. Mathematical Field State Expression

3.1. Formalization of Finite Mathematical Structures

3.2. Previous Research on the Formalization of Infinite Structures

3.3. Formalization of Conditional Infinite Structures

3.4. Formalization of Phenomena in Mathematics

4. State Expression in Economics and Sociology

4.1. Logical Characterization in the Field of Economics

4.2. Logical Characterization in the Field of Sociology

5. Computer Field State Expression

5.1. Boolean Algebra and the Formalization of Computer Systems

5.2. Predicate Logic Description of a Turing Machine (TM)

5.3. Mathematical Formalization of Neural Networks

5.4. Formal Expression of States in Computer Science

6. State Expression in Natural Language Domain

6.1. Montague Semantics

6.2. Study of English Ambiguity

6.3. Optimizing Syntax and Semantic Translation Rules

6.3.1. Conjunction Rules

6.3.2. Adjective Rules

6.3.3. Clause Rules

6.4. Formalization of Natural Languages

7. Conclusion and Outlook

Funding

Acknowledgments

Conflicts of Interest

Abbreviations

References

MDPI Initiatives

Important Links

Subscribe