A Mathematical Approach to the Theory of Finite Automata

Chac Kwan

doi:10.20944/preprints202509.0560.v1

Submitted:

05 September 2025

Posted:

09 September 2025

You are already at the latest version

Abstract

There is a lack of rigorous mathematical treatment in the theory of finite automata. This paper provides a rigorous mathematical approach to automata theory which doesn’t currently exist in the literature of theoretical computer science. Basic definitions are developed in mathematical terms and used as the foundation for constructing mathematical proofs for theorems. It provides a model for instructors to write better lecture notes and authors to write better textbooks for educational purpose. It also corrects some critical errors and erroneous arguments that can be found in many textbooks which are widely used in the education of theoretical computer science.

Keywords:

theoretical computer science

;

computability theory

;

finite automata

;

Pumping Lemma

;

discrete mathematics

;

Myhill-Nero theorem

Subject:

Computer Science and Mathematics - Computer Science

1. Deterministic Finite Automaton (DFA)

Definition 1.

A deterministic finite automaton denoted by

D F A

is a 5-tuple,

M = (Q, Σ, δ, q_{0}, F),

where

(i) Q is a finite set of states;

(ii) Σ is a finite alphabet;

(iii)

δ : Q \times Σ ⟶ Q

is the transition function;

(iv)

q_{0} \in Q

is the start state; and

(v)

F \subset Q

is the set of accept states.

Let

w = w_{1} w_{2} w_{3}, \dots, w_{n}

be a string over Σ where each

w_{i} \in Σ

and

n \geq 1

.

M accepts w if and only if

\exists r_{0}, r_{1}, r_{2}, \dots, r_{n} \in Q

s.t. the following conditions are satisfied:

(a)

r_{0} = q_{0}

;

(b)

δ (r_{i}, w_{i + 1}) = r_{i + 1}

for

i = 0, 1, 2, \dots, n - 1

; and

(c)

r_{n} \in F

For

n = 0, w = ϵ .

Only conditions (a) and (c) are applicable and they become

r_{0} = q_{0}

and

r_{0} \in F

. We therefore define M to accept ϵ if the start state is also an accept state.

On the other hand, since there is no ϵ-movement in a

D F A

, the only way the

D F A

can accept an empty string is to accept it at the start state.

Accordingly, M accepts ϵ if and only if the start state is also an accept state.

If we write

r_{i} \overset{w_{i + 1}, δ}{⟶} r_{i + 1}

instead of

δ (r_{i}, w_{i + 1}) = r_{i + 1}

for

i = 0, 1, 2, \dots, n - 1,

then conditions (a), (b) and (c) can be written as follows:

q_{0} = r_{0} \overset{w_{1}, δ}{⟶} r_{1} \overset{w_{2}, δ}{⟶} r_{2} \dots r_{n - 1} \overset{w_{n}, δ}{⟶} r_{n}, r_{n} \in F

We say M recognizes language A if

A = {w \in Σ^{*} ∣ M

accepts

w}

and it is written as

L (M) = A

Definition 2.

A language is called regular if it is recognized by a

D F A

.

Definition 3.

For any language L,

L^{0} = {ϵ}, L^{1} = L, L^{2} = L L, \dots, L^{m + 1} = L^{m} L

for

m > 0

.

L^{*} = L^{0} \cup L^{1} \cup L^{2} \cup \dots = ⋃_{k = 0}^{\infty} L^{k}

= {w

|

w = w_{1} w_{2} w_{3} \dots w_{n}; w_{i} \in L

for

1 \leq i \leq n; n \geq 1} \cup {ϵ}

Definition 4.

Inductive Transition Function

Let

M = (Q, Σ, δ, q_{0}, F)

be a

D F A

.

\hat{δ} : Q \times Σ^{*} ⟶ Q

s.t.

(i)

\hat{δ} (q, ϵ) = q \forall q \in Q

(ii)

\hat{δ} (q, w a) = δ (\hat{δ} (q, w), a) \forall a \in Σ, w \in Σ^{*}, q \in Q

Definition 5.

\forall p, q \in Q, w \in Σ^{*}, p \overset{w, \hat{δ}}{⟶} q \overset{d e f}{⟺} q = \hat{δ} (p, w)

Proposition 1.

\hat{δ} (q, a) = δ (q, a)

\forall q \in Q, a \in Σ

< P r o o f >

\begin{array}{l} \hat{δ} (q, a) = \hat{δ} (q, ϵ a) \\ = δ (\hat{δ} (q, ϵ), a) & (D e f i n i t i o n 1.4 (i i)) \\ = δ (q, a) & (D e f i n i t i o n 1.4 (i)) \end{array}

Theorem 1.

(DFA Acceptance)

For any

D F A

,

M = (Q, Σ, δ, q_{0}, F)

\hat{δ} (q_{0}, w) \in F ⟺ M

accepts

w \forall w \in Σ^{*}

< P r o o f >

Claim: If

w = w_{1} w_{2} \dots w_{n}

where

n \geq 0

and

q_{0} = r_{0} \overset{w_{1}, δ}{⟶} r_{1} \overset{w_{2}, δ}{⟶} r_{2} \dots r_{i} \overset{w_{i + 1}, δ}{⟶} r_{i + 1} \dots r_{n - 1} \overset{w_{n}, δ}{⟶} r_{n},

then

\hat{δ} (r_{0}, w) = r_{n}

This Claim can be proved by induction on n.

For

n = 0, w = ϵ

and the computation becomes

q_{0} = r_{0}

.

\hat{δ} (q_{0}, w) = \hat{δ} (q_{0}, ϵ)

= q_{0}

(By Definition 1.4(i))

= r_{0}

Therefore, the statement is true for

n = 0

.

Assume the statement is true for

n = k

, where

k \geq 0

.

That is,

\hat{δ} (r_{0}, w_{1} w_{2} \dots w_{k}) = r_{k}

\begin{matrix} \hat{δ} (r_{0}, w_{1} w_{2} \dots w_{k} w_{k + 1}) = δ (\hat{δ} (r_{0}, w_{1} w_{2} \dots w_{k}), w_{k + 1}) & (Definition 1.4 (i i)) \\ = δ (r_{k}, w_{k + 1}) & (Induction Hypothesis) \\ = r_{k + 1} & (Definition of r_{i + 1}) \end{matrix}

Therefore, the statement is true for

n = k + 1

.

If M accepts

w = w_{1} w_{2} \dots w_{n},

where

w_{i} \in Σ

for

1 \leq i \leq n

and

n \geq 1

or

(w = ϵ

and

n = 0)

\exists r_{0}, r_{1}, r_{2}, \dots r_{n} \in Q

st

q_{0} = r_{0} \overset{w_{1}, δ}{⟶} r_{1} \overset{w_{2}, δ}{⟶} r_{2} \dots r_{i} \overset{w_{i + 1}, δ}{⟶} r_{i + 1} \dots r_{n - 1} \overset{w_{n}, δ}{⟶} r_{n},

r_{n} \in F

By Claim,

\hat{δ} (r_{0}, w_{1} w_{2} \dots w_{n}) = r_{n}

Therefore,

\hat{δ} (q_{0}, w) = r_{n}

(

r_{0} = q_{0}; w = w_{1} w_{2} \dots w_{n}

)

Since

r_{n} \in F,

\hat{δ} (q_{0}, w) \in F

Therefore, M accepts

w ⟹ \hat{δ} (q_{0}, w) \in F

Conversely, if

\hat{δ} (q_{0}, w) \in F

\hat{δ} (q_{0}, w_{1} w_{2} \dots w_{n}) \in F

Take

r_{0} = q_{0}

r_{i + 1} = δ (r_{i}, w_{i + 1}) \forall i = 0, 1, 2, \dots n - 1

q_{0} = r_{0} \overset{w_{1}, δ}{⟶} r_{1} \overset{w_{2}, δ}{⟶} r_{2} \dots r_{i} \overset{w_{i + 1}, δ}{⟶} r_{i + 1} \dots r_{n - 1} \overset{w_{n}, δ}{⟶} r_{n}

By Claim,

\hat{δ} (r_{0}, w_{1} w_{2} \dots w_{n}) = r_{n}

Since

\hat{δ} (q_{0}, w_{1} w_{2} \dots w_{n}) \in F, r_{n} \in F

.

Therefore, M accepts w.

Therefore,

\hat{δ} (q_{0}, w) \in F ⟹ M

accepts w.

Therefore,

\hat{δ} (q_{0}, w) \in F ⟺ M

accepts w.

This completes the proof.

Theorem 2.

For any

D F A s, M

and

M^{'}

where

M = (Q, Σ, δ, q_{0}, F)

M^{'} = (Q, Σ, δ^{'}, q_{0}, F^{'})

\forall q \in Q, a \in Σ, w \in Σ^{*}

δ^{'} (q, a) = δ (q, a) ⟹ \hat{δ^{'}} (q, w) = \hat{δ} (q, w)

< P r o o f >

The proof is by induction on

| w | \geq 0

.

For

| w | = 0, w = ϵ

.

By Definition 1.4(i),

\hat{δ} (q, ϵ) = q

and

\hat{δ^{'}} (q, ϵ) = q

Therefore,

\hat{δ} (q, ϵ) = \hat{δ^{'}} (q, ϵ)

Assume the statement is true for

| w | = k \geq 0

.

\begin{matrix} \hat{δ} (q, w a) = δ (\hat{δ} (q, w), a) \\ = δ^{'} (\hat{δ} (q, w), a) & (δ^{'} (q, a) = δ (q, a)) \\ = δ^{'} (\hat{δ^{'}} (q, w), a) & (Induction Hypothesis) \\ = \hat{δ^{'}} (q, w a) & (Definition 1.4 (i i)) \end{matrix}

The statement is also true for

| w | = k + 1

2. Nondeterministic Finite Automaton (NFA)

Definition 6.

A nondeterministic finite automaton (

N F A

) is a 5-tuple,

N = (Q, Σ, δ, q_{0}, F)

, where

(i) Q is a finite set of states;

(ii) Σ is a finite alphabet;

(iii)

δ : Q \times Σ_{ϵ} ⟶ P (Q)

is the transition function, where

Σ_{ϵ} = Σ \cup {ϵ}, P (Q) =

the power set of

Q = {S ∣ S \subset Q}

.

(iv)

q_{0} \in Q

is the start state; and

(v)

F \subset Q

is the set of accept states.

Let

w = w_{1} w_{2} w_{3} \dots w_{m}

where

w_{i} \in Σ_{ϵ}

for

1 \leq i \leq m

and

m \geq 1

.

N accepts w if and only if

\exists r_{0}, r_{1}, r_{2}, \dots, r_{m} \in Q

s.t. the following conditions are satisfied:

(a)

r_{0} \in {q_{0}}

(b)

r_{i + 1} \in δ (r_{i}, w_{i + 1})

for

i = 0, 1, 2, \dots, m - 1

(c)

r_{m} \in F

For

m = 0, w = ϵ .

Only conditions (a) and (c) are applicable and they become

r_{0} = q_{0}

and

r_{0} \in F

.

We therefore define N to accept ϵ if the start state is also an accept state.

If we write

r_{i} \overset{w_{i + 1}, δ}{⟶} r_{i + 1}

instead of

r_{i + 1} \in δ (r_{i}, w_{i + 1})

for

i = 0, 1, 2, \dots, m - 1

, then conditions (a), (b) and (c) can be written as follows:

q_{0} = r_{0} \overset{w_{1}, δ}{⟶} r_{1} \overset{w_{2}, δ}{⟶} r_{2} \dots r_{i} \overset{w_{i + 1}, δ}{⟶} r_{i + 1} \dots r_{m - 1} \overset{w_{m}, δ}{⟶} r_{m}, r_{m} \in F

.

Note that when

m = 0,

this computation becomes

q_{0} = r_{0}

and

r_{0} \in F .

Definition 7.

(Inductive Transition Function)

Let

N = (Q, Σ, δ, q_{0}, F)

be an

N F A

.

\hat{δ} : P (Q) \times Σ_{ϵ}^{*} ⟶ P (Q)

such that

(i)

\hat{δ} (A, ϵ) = A \forall A \in P (Q)

(ii)

\hat{δ} (A, w a) = ⋃_{q \in \hat{δ} (A, w)} δ (q, a) \forall a \in Σ_{ϵ}, w \in Σ_{ϵ}^{*}, A \in P (Q)

.

Definition 8.

\forall p, q \in Q, w \in Σ_{ϵ}^{*}, p \overset{w, \hat{δ}}{⟶} q \overset{d e f}{⟺} q \in \hat{δ} ({p}, w)

.

Proposition 2.

If

N = (Q, Σ, δ, q_{0}, F)

is an

N F A

, then

\forall a \in Σ_{ϵ}, p \in Q, \hat{δ} ({p}, a) = δ (p, a) .

< P r o o f >

\hat{δ} ({p}, a) = \hat{δ} ({p}, ϵ a)

= ⋃_{q \in \hat{δ} ({p}, ϵ)} δ (q, a)

(Definition 1.10 (ii))

= ⋃_{q \in {p}} δ (q, a)

(Definition 1.10 (i))

= δ (p, a)

Proposition 3.

If

N = (Q, Σ, δ, s_{0}, F)

is an

N F A

,

\forall w \in Σ_{ϵ}^{*}

where

w = w_{1} w_{2} w_{3} \dots w_{n}

;

w_{i} \in Σ_{ϵ}

for

1 \leq i \leq n

and

n \geq 1

or

w = ϵ

for

n = 0 .

(\exists r_{0}, r_{1}, r_{2}, \dots, r_{n} \in Q

s.t.

s_{0} = r_{0} \overset{w_{1}, δ}{⟶} r_{1} \overset{w_{2}, δ}{⟶} r_{2} \dots r_{n - 1} \overset{w_{n}, δ}{⟶} r_{n}) ⟺ r_{n} \in \hat{δ} ({s_{0}}, w)

< P r o o f >

This proposition can be proved by induction on n.

Let

P (n)

denote the statement:

(\exists r_{0}, r_{1}, r_{2}, \dots, r_{n} \in Q

s.t.

s_{0} = r_{0} \overset{w_{1}, δ}{⟶} r_{1} \overset{w_{2}, δ}{⟶} r_{2} \dots r_{n - 1} \overset{w_{n}, δ}{⟶} r_{n})

; and

Q (n)

denote the statement:

r_{n} \in \hat{δ} ({s_{0}}, w)

.

For

n = 0, w = ϵ

.

P (0) ⟺ (\exists r_{0} \in Q

s.t.

s_{0} = r_{0})

⟺ r_{0} \in {s_{0}}

⟺ r_{0} \in \hat{δ} ({s_{0}}, ϵ)

(Definition 1.10(i))

⟺ r_{0} \in \hat{δ} ({s_{0}}, w)

(

w = ϵ

)

⟺ Q (0)

Assume

P (k) ⟺ Q (k)

for any

k \geq 0

.

P (k + 1) ⟺ (\exists r_{0}, r_{1}, r_{2}, \dots r_{k}, r_{k + 1} \in Q

s.t.

s_{0} = r_{0} \overset{w_{1}, δ}{⟶} r_{1} \overset{w_{2}, δ}{⟶} r_{2} \dots r_{k - 1} \overset{w_{k}, δ}{⟶} r_{k} \overset{w_{k + 1}, δ}{⟶} r_{k + 1})

P (k + 1) ⟹ P (k)

(From computation path of

P (k + 1)

)

⟹ Q (k)

(Induction Hypothesis)

⟹ r_{k} \in \hat{δ} ({s_{0}}, w)

where

w_{1} w_{2} w_{3} \dots w_{k} = w

(Definition of

Q (k)

)

Since

\hat{δ} ({s_{0}}, w w_{k + 1}) = ⋃_{q \in \hat{δ} ({s_{0}}, w)} δ (q, w_{k + 1})

(Definition 1.10(ii)

and

r_{k} \in \hat{δ} ({s_{0}}, w)

,

δ (r_{k}, w_{k + 1}) \subset \hat{δ} ({s_{0}}, w w_{k + 1})

r_{k} \overset{w_{k + 1}, δ}{⟶} r_{k + 1}

(From computation path of

P (k + 1)

)

Therefore,

r_{k + 1} \in δ (r_{k}, w_{k + 1}) \subset \hat{δ} ({s_{0}}, w w_{k + 1})

Therefore,

r_{k + 1} \in \hat{δ} ({s_{0}}, w w_{k + 1})

Therefore,

P (k + 1) ⟹ Q (k + 1)

.

Conversely,

Q (k + 1) ⟹ r_{k + 1} \in \hat{δ} ({s_{0}}, w w_{k + 1})

⟹ r_{k + 1} \in ⋃_{q \in \hat{δ} ({s_{0}}, w)} δ (q, w_{k + 1})

r_{k + 1} \in δ (r_{k}, w_{k + 1})

for some

r_{k} \in \hat{δ} ({s_{0}}, w)

r_{k} \in \hat{δ} ({s_{0}}, w) ⟹ Q (k)

⟹ P (k)

(Induction Hypothesis)

⟹ (\exists r_{0}, r_{1}, r_{2}, \dots r_{k} \in Q

s.t.

s_{0} = r_{0} \overset{w_{1}, δ}{⟶} r_{1} \overset{w_{2}, δ}{⟶} r_{2} \dots r_{k - 1} \overset{w_{k}, δ}{⟶} r_{k})

r_{k + 1} \in δ (r_{k}, w_{k + 1}) ⟹ r_{k} \overset{w_{k + 1}, δ}{⟶} r_{k + 1}

Combining the two computation paths,

s_{0} = r_{0} \overset{w_{1}, δ}{⟶} r_{1} \overset{w_{2}, δ}{⟶} r_{2} \dots r_{k - 1} \overset{w_{k}, δ}{⟶} r_{k} \overset{w_{k + 1}, δ}{⟶} r_{k + 1}

Therefore,

Q (k + 1) ⟹ P (k + 1)

and the proof is complete.

Proposition 4.

\forall x, y \in Σ_{ϵ}^{*}

&

A \in P (Q)

,

\hat{δ} (A, x y) = \hat{δ} (\hat{δ} (A, x), y)

< P r o o f >

The proof is by induction on

n = | y |

.

Let

T (n)

denote the statement corresponding to

n = 0, 1, 2, \dots

For

| y | = 0

,

y = ϵ

.

\hat{δ} (A, x ϵ) = \hat{δ} (A, x)

= \hat{δ} (\hat{δ} (A, x), ϵ)

(Definition 1.10(i))

T (0)

is true.

Assume

T (k)

is true for

| y | = k \geq 0

.

That is

\hat{δ} (A, x y) = \hat{δ} (\hat{δ} (A, x), y)

for

| y | = k \geq 0

For any

a \in Σ_{ϵ}

,

y \in Σ_{ϵ}^{*}

,

| y | = k

LHS of

T (k + 1) = \hat{δ} (A, x y a)

= ⋃_{q \in \hat{δ} (A, x y)} δ (q, a)

(By Definition 1.10(ii))

= ⋃_{q \in \hat{δ} (\hat{δ} (A, x), y)} δ (q, a)

(By Induction Hypothesis)

= \hat{δ} (\hat{δ} (A, x), y a)

(By Definition 1.10(ii))

=

RHS of

T (k + 1)

Therefore,

T (k) ⟹ T (k + 1)

.

Proposition 5.

\forall A_{i} \subset Q, x \in Σ_{ϵ}^{*}, i = 1, 2, \dots n, n \in N, \hat{δ} (⋃_{i = 1}^{n} A_{i}, x) = ⋃_{i = 1}^{n} \hat{δ} (A_{i}, x)

< P r o o f >

The proof is by induction on

| x |

.

For

| x | = 0

,

x = ϵ

.

\hat{δ} (⋃_{i = 1}^{n} A_{i}, ϵ) = ⋃_{i = 1}^{n} A_{i}

(Definition 1.10(i))

= ⋃_{i = 1}^{n} \hat{δ} (A_{i}, ϵ)

(Definition 1.10(i))

Claim:

\forall n \in N

, sets

A_{i}

and

S_{x}

⋃_{x \in \cup_{i = 1}^{n} A_{i}} S_{x} = ⋃_{i = 1}^{n} (⋃_{x \in A_{i}} S_{x})

LHS

= ⋃_{x \in A_{1} \cup A_{2} \cup \dots A_{n}} S_{x}

= (⋃_{x \in A_{1}} S_{x}) \cup (⋃_{x \in A_{2}} S_{x}) \cup \dots \cup (⋃_{x \in A_{n}} S_{x})

= ⋃_{i = 1}^{n} (⋃_{x \in A_{i}} S_{x})

=

RHS

Assume the statement is true for

| x | = k

for

k \geq 0

.

\forall a \in Σ_{ϵ}

,

| x a | = k + 1

.

\hat{δ} (⋃_{i = 1}^{n} A_{i}, x a) = ⋃_{p \in \hat{δ} ((\cup_{i = 1}^{n} A_{i}), x)} δ (p, a)

(Definition 1.10(ii))

= ⋃_{p \in \cup_{i = 1}^{n} \hat{δ} (A_{i}, x)} δ (p, a)

(Induction Hypothesis)

= ⋃_{i = 1}^{n} (⋃_{p \in \hat{δ} (A_{i}, x)} δ (p, a))

(Claim)

= ⋃_{i = 1}^{n} \hat{δ} (A_{i}, x a)

(Definition 1.10(ii))

Therefore, the statement is also true for

| x | = k + 1

.

Proposition 6.

\hat{δ} (A, x) = ⋃_{q \in A} \hat{δ} ({q}, x)

for all

A \subset Q

.

< P r o o f >

LHS

= \hat{δ} (⋃_{q \in A} {q}, x)

= ⋃_{q \in A} \hat{δ} ({q}, x)

(Proposition 1.15)

= RHS

Proposition 7.

\forall A, B,

where

A \subset B \subset Q, \hat{δ} (A, x) \subset \hat{δ} (B, x)

< P r o o f >

B = A \cup (B ∖ A)

(Set Theory)

\hat{δ} (B, x) = \hat{δ} (A \cup (B ∖ A), x)

= \hat{δ} (A, x) \cup \hat{δ} (B ∖ A, x)

(Proposition 1.15)

Therefore,

\hat{δ} (A, x) \subset \hat{δ} (B, x)

Proposition 8.

For any two

N F A s

N_{1}

and

N_{2}

, where

N_{1} = (Q_{1}, Σ, δ_{1}, q_{1}, F_{1})

N_{2} = (Q_{2}, Σ, δ_{2}, q_{2}, F_{2})

and

Q_{1} \subset Q_{2}

\forall q \in Q_{1}, a \in Σ_{ϵ}, δ_{1} (q, a) \subset δ_{2} (q, a) \Rightarrow \hat{δ_{1}} ({q}, w) \subset \hat{δ_{2}} ({q}, w) \forall w \in Σ_{ϵ}^{*}

< P r o o f >

The proof is by induction on

| w |

.

For

| w | = 0

,

w = ϵ

.

\hat{δ_{1}} ({q}, ϵ) = {q}

and

\hat{δ_{2}} ({q}, ϵ) = {q}

(By Definition 1.10(i))

Therefore,

\hat{δ_{1}} ({q}, ϵ) \subset \hat{δ_{2}} ({q}, ϵ) .

The statement is true for

| w | = 0

.

Assume the statement is true for

| w | = k \geq 0

.

That is,

\hat{δ_{1}} ({q}, w) \subset \hat{δ_{2}} ({q}, w)

for

| w | = k \geq 0

.

For

k + 1

,

\hat{δ_{1}} ({q}, w a) = ⋃_{p \in \hat{δ_{1}} ({q}, w)} δ_{1} (p, a)

\subset ⋃_{p \in \hat{δ_{2}} ({q}, w)} δ_{2} (p, a)

(By Induction Hypothesis and

δ_{1} (q, a) \subset δ_{2} (q, a)

)

= \hat{δ_{2}} ({q}, w a)

(By Definition 1.10(ii))

Theorem 3.

(NFA acceptance)

N = (Q, Σ, δ, s_{0}, F)

is an

N F A

.

\forall w \in Σ_{ϵ}^{*}

where

w = w_{1} w_{2} w_{3} \dots w_{n};

and

(w_{i} \in Σ_{ϵ}

for

1 \leq i \leq n

and

n \geq 1)

or

(w = ϵ

and

n = 0) .

N accepts w if and only if

\hat{δ} ({s_{0}}, w) \cap F \neq \emptyset

In other words, N accepts w if and only if

(\exists r \in F

s.t.

s_{0} \overset{w, \hat{δ}}{⟶} r)

< P r o o f >

If N accepts w

\exists r_{0}, r_{1}, r_{2}, \dots r_{n} \in Q

s.t.

s_{0} = r_{0} \overset{w_{1}, δ}{⟶} r_{1} \overset{w_{2}, δ}{⟶} r_{2} \dots r_{n - 1} \overset{w_{n}, δ}{⟶} r_{n}

and

r_{n} \in F

.

r_{n} \in \hat{δ} ({s_{0}}, w)

(By Proposition 1.13)

Since

r_{n}

is also in F,

\hat{δ} ({s_{0}}, w) \cap F \neq \emptyset

Conversely, if

\hat{δ} ({s_{0}}, w) \cap F \neq \emptyset

,

\exists r_{n} \in \hat{δ} ({s_{0}}, w)

and

r_{n} \in F

.

\exists r_{0}, r_{1}, r_{2}, \dots r_{n} \in Q

s.t.

s_{0} = r_{0} \overset{w_{1}, δ}{⟶} r_{1} \overset{w_{2}, δ}{⟶} r_{2} \dots r_{n - 1} \overset{w_{n}, δ}{⟶} r_{n}

;

r_{n} \in F

(Proposition 1.13)

Therefore, N accepts w.

3. Epsilon-Closure

The

ϵ

-Closure of a set of states is a collection of states that can be reached from a member of the given set of states via zero or a finite number of

ϵ

transitions.

Formally, we define

ϵ

-Closure as follows.

Definition 9.

Let

N = (Q, Σ, δ, s_{0}, F)

be an

N F A

.

For any

R \subset Q,

the ϵ-Closure of R is

E (R) = {q \in Q ∣ p \overset{ϵ^{i}, \hat{δ}}{⟶} q

for some

p \in R}

where

i is an integer

\geq 0

and

p \overset{ϵ^{0}, \hat{δ}}{⟶} q

means

p = q

.

Proposition 9.

\forall A_{i} \subset Q, i = 1, 2, \dots n, n \in N, E (⋃_{i = 1}^{n} A_{i}) = ⋃_{i = 1}^{n} E (A_{i})

< P r o o f >

Claim.

E (A_{1} \cup A_{2}) = E (A_{1}) \cup E (A_{2})

q \in E (A_{1} \cup A_{2}) \Leftrightarrow \exists p \in A_{1} \cup A_{2}

s.t.

p \overset{ϵ^{i}, \hat{δ}}{⟶} q

where

i \geq 0

\Leftrightarrow ((\exists p \in A_{1}) \lor (\exists p \in A_{2})) \land (p \overset{ϵ^{i}, \hat{δ}}{⟶} q

where

i \geq 0)

\Leftrightarrow (q \in E (A_{1})) \lor (q \in E (A_{2}))

\Leftrightarrow q \in E (A_{1}) \cup E (A_{2})

Therefore,

E (A_{1} \cup A_{2}) = E (A_{1}) \cup E (A_{2})

With this Claim and an induction argument, we can conclude Proposition 1.21.

4. The Equivalence of DFA and NFA

Lemma 1.

Let

N = (Q, Σ, δ, q_{0}, F)

be an

N F A

,

M = (Q^{'}, Σ, δ^{'}, q_{0}^{'}, F^{'})

be a

D F A

.

Q^{'} = P (Q), q_{0}^{'} = E ({q_{0}}), F^{'} = {R \in Q^{'} ∣ R \cap F \neq \emptyset}

δ^{'} : Q^{'} \times Σ ⟶ Q^{'}

such that

δ^{'} (R, a) = ⋃_{r \in R} E (δ (r, a)) \forall a \in Σ, R \in Q^{'}

Let

w = w_{1} w_{2} w_{3} \dots w_{n}

such that if

n = 0, w = ϵ

and if

n \geq 1,

then

w_{i} \neq ϵ \forall 1 \leq i \leq n

.

Let

i_{0}, i_{1}, i_{2}, \dots i_{n}

be integers

\geq 0, q_{0}, q_{1}, q_{2}, \dots q_{n} \in Q, p_{1}, p_{2}, p_{3}, \dots p_{n} \in Q

and

q \in Q .

The following holds:

q_{0} \overset{ϵ^{i_{0}}, \hat{δ}}{⟶} q_{1} \overset{w_{1}, δ}{⟶} p_{1} \overset{ϵ^{i_{1}}, \hat{δ}}{⟶} q_{2} \overset{w_{2}, δ}{⟶} p_{2} \overset{ϵ^{i_{2}}, \hat{δ}}{⟶} q_{3} \dots q_{n - 1} \overset{w_{n - 1}, δ}{⟶} p_{n - 1} \overset{ϵ^{i_{n - 1}}, \hat{δ}}{⟶} q_{n} \overset{w_{n}, δ}{⟶} p_{n} \overset{ϵ^{i_{n}}, \hat{δ}}{⟶} q .

⟺ q \in \hat{δ^{'}} (q_{0}^{'}, w)

< P r o o f >

Proof is by induction on

| w | = n .

Let

P (n)

denote the statement of

q_{0} \overset{ϵ^{i_{0}}, \hat{δ}}{⟶} q_{1} \overset{w_{1}, δ}{⟶} p_{1} \overset{ϵ^{i_{1}}, \hat{δ}}{⟶} q_{2} \overset{w_{2}, δ}{⟶} p_{2} \overset{ϵ^{i_{2}}, \hat{δ}}{⟶} q_{3} \dots q_{n - 1} \overset{w_{n - 1}, δ}{⟶} p_{n - 1} \overset{ϵ^{i_{n - 1}}, \hat{δ}}{⟶} q_{n} \overset{w_{n}, δ}{⟶} p_{n} \overset{ϵ^{i_{n}}, \hat{δ}}{⟶} q .

and

Q (n)

denote the statement of

q \in \hat{δ^{'}} (q_{0}^{'}, w)

corresponding to

n \geq 0

.

For

| w | = n = 0, w = ϵ

.

P (0) ⟺ q_{0} \overset{ϵ^{i_{0}}, \hat{δ}}{⟶} q ⟺ q \in E ({q_{0}}) ⟺ q \in q_{0}^{'} (q_{0}^{'} = E ({q_{0}})) ⟺ q \in \hat{δ^{'}} (q_{0}^{'}, ϵ)

(Definition 1.4(i))

⟺ q \in \hat{δ^{'}} (q_{0}^{'}, w) (w = ϵ)

⟺ Q (0)

Assume

P (k) \Leftrightarrow Q (k)

for

k \geq 0

.

P (k + 1)

\Leftrightarrow q_{0} \overset{ϵ^{i_{0}}, \hat{δ}}{⟶} q_{1} \overset{w_{1}, δ}{⟶} p_{1} \overset{ϵ^{i_{1}}, \hat{δ}}{⟶} q_{2} \overset{w_{2}, δ}{⟶} p_{2} \overset{ϵ^{i_{2}}, \hat{δ}}{⟶} q_{3} \dots q_{k} \overset{w_{k}, δ}{⟶} p_{k} \overset{ϵ^{i_{k}}, \hat{δ}}{⟶} q_{k + 1} \overset{w_{k + 1}, δ}{⟶} p_{k + 1} \overset{ϵ^{i_{k + 1}}, \hat{δ}}{⟶} q .

\Leftrightarrow P (k)

&

q_{k + 1} \overset{w_{k + 1}, δ}{⟶} p_{k + 1} \overset{ϵ^{i_{k + 1}}, \hat{δ}}{⟶} q

\Leftrightarrow Q (k)

&

q_{k + 1} \overset{w_{k + 1}, δ}{⟶} p_{k + 1} \overset{ϵ^{i_{k + 1}}, \hat{δ}}{⟶} q

(Induction Hypothesis)

\Leftrightarrow q_{k + 1} \in \hat{δ^{'}} (q_{0}^{'}, w)

where

| w | = k

&

q_{k + 1} \overset{w_{k + 1}, δ}{⟶} p_{k + 1} \overset{ϵ^{i_{k + 1}}, \hat{δ}}{⟶} q

\Leftrightarrow q_{k + 1} \in \hat{δ^{'}} (q_{0}^{'}, w)

where

| w | = k

&

q \in E (δ (q_{k + 1}, w_{k + 1}))

\Leftrightarrow q \in ⋃_{r \in \hat{δ^{'}} (q_{0}^{'}, w)} E (δ (r, w_{k + 1}))

where

| w | = k

\Leftrightarrow q \in δ^{'} (\hat{δ^{'}} (q_{0}^{'}, w), w_{k + 1})

where

| w | = k

(Consider

R = \hat{δ^{'}} (q_{0}^{'}, w), w_{k + 1} = a

&

δ^{'} (R, a) \overset{d e f}{=} ⋃_{r \in R} E (δ (r, a))

)

\Leftrightarrow q \in \hat{δ^{'}} (q_{0}^{'}, w w_{k + 1})

where

| w | = k

(Definition 1.4(ii))

\Leftrightarrow Q (k + 1)

This completes the proof of Lemma 1.22.

Theorem 4.

Every

N F A

can be converted to an equivalent

D F A

.

< P r o o f >

Let

N = (Q, Σ, δ, q_{0}, F)

be an

N F A

.

Construct a DFA as follows.

M = (Q^{'}, Σ, δ^{'}, q_{0}^{'}, F^{'})

where

Q^{'} = P (Q), q_{0}^{'} = E ({q_{0}}), F^{'} = {R \in Q^{'} ∣ R \cap F \neq \emptyset}

δ^{'} : Q^{'} \times Σ ⟶ Q^{'}

such that

δ^{'} (R, a) = ⋃_{r \in R} E (δ (r, a)) \forall a \in Σ, R \in Q^{'}

We claim that N and M are equivalent by showing that

\forall w \in Σ_{ϵ}^{*}, N

accepts

w \Leftrightarrow M

accepts w

The proof is divided into two cases, one with

w = ϵ

and one with

w \neq ϵ

.

(i)

w = ϵ

If N accepts w,

\exists j \geq 0

s.t.

q_{0} \overset{ϵ^{j}, \hat{δ}}{⟶} p

and

p \in F

.

Therefore,

p \in E ({q_{0}})

&

p \in F

.

Therefore,

p \in q_{0}^{'}

&

p \in F

.

Therefore,

q_{0}^{'} \cap F \neq \emptyset

.

Therefore,

q_{0}^{'} \in F^{'}

.

Therefore, the start state of M is also an accept state of M.

By definition, M accepts

ϵ (= w)

.

Conversely, if M accepts

w = ϵ

,

q_{0}^{'} \in F^{'}

(A

D F A

accepts ϵ iff its start state is also an accept state.)

q_{0}^{'} \cap F \neq \emptyset

(By definition of

F^{'}

)

\exists p \in q_{0}^{'}

and

p \in F

.

Since

q_{0}^{'} = E ({q_{0}}), q_{0} \overset{ϵ^{j}, \hat{δ}}{⟶} p

for some

j \geq 0

.

Since

p \in F

, N accepts

ϵ^{j}

, which is same as ϵ.

(ii)

w \neq ϵ

\exists w_{i} \neq ϵ, \forall 1 \leq i \leq n, n \geq 1

and

w = ϵ^{i_{0}} w_{1} ϵ^{i_{1}} w_{2} ϵ^{i_{2}} w_{3} ϵ^{i_{3}} \dots w_{n} ϵ^{i_{n}}

for some integers

i_{0}, i_{1}, i_{2}, \dots i_{n} \geq 0

If N accepts w,

\exists q_{0}, q_{1}, q_{2}, \dots q_{n} \in Q, p_{1}, p_{2}, p_{3}, \dots p_{n} \in Q

and

q \in Q .

s.t.

q_{0} \overset{ϵ^{i_{0}}, \hat{δ}}{⟶} q_{1} \overset{w_{1}, δ}{⟶} p_{1} \overset{ϵ^{i_{1}}, \hat{δ}}{⟶} q_{2} \overset{w_{2}, δ}{⟶} p_{2} \overset{ϵ^{i_{2}}, \hat{δ}}{⟶} q_{3} \dots q_{n - 1} \overset{w_{n - 1}, δ}{⟶} p_{n - 1} \overset{ϵ^{i_{n - 1}}, \hat{δ}}{⟶} q_{n} \overset{w_{n}, δ}{⟶} p_{n} \overset{ϵ^{i_{n}}, \hat{δ}}{⟶} q

&

q \in F

.

By Lemma 1.22,

q \in \hat{δ^{'}} (q_{0}^{'}, w)

where

w = w_{1} w_{2} w_{3} \dots w_{n}

.

Therefore,

\hat{δ^{'}} (q_{0}^{'}, w) \cap F \neq \emptyset

.

Therefore,

\hat{δ^{'}} (q_{0}^{'}, w) \in F'

.

Therefore, M accepts

w

(

D F A

acceptance)

Conversely, if M accepts

w = w_{1} w_{2} w_{3} \dots w_{n}

,

\hat{δ^{'}} (q_{0}^{'}, w) \in F'

(

D F A

acceptance)

\hat{δ^{'}} (q_{0}^{'}, w) \cap F \neq \emptyset

(Definition of

F'

)

\exists q \in \hat{δ^{'}} (q_{0}^{'}, w)

and

q \in F

.

By Lemma 1.22,

q_{0} \overset{ϵ^{i_{0}}, \hat{δ}}{⟶} q_{1} \overset{w_{1}, δ}{⟶} p_{1} \overset{ϵ^{i_{1}}, \hat{δ}}{⟶} q_{2} \overset{w_{2}, δ}{⟶} p_{2} \overset{ϵ^{i_{2}}, \hat{δ}}{⟶} q_{3} \dots q_{n - 1} \overset{w_{n - 1}, δ}{⟶} p_{n - 1} \overset{ϵ^{i_{n - 1}}, \hat{δ}}{⟶} q_{n} \overset{w_{n}, δ}{⟶} p_{n} \overset{ϵ^{i_{n}}, \hat{δ}}{⟶} q

&

q \in F

.

Therefore, N accepts

w = ϵ^{i_{0}} w_{1} ϵ^{i_{1}} w_{2} ϵ^{i_{2}} w_{3} ϵ^{i_{3}} \dots w_{n} ϵ^{i_{n}}

This completes the proof of Theorem 1.23.

Corollary 1.

A language is regular iff some

N F A

recognizes it.

5. Regular Operators

Regular Languages are closed under the operation of Regular Operators.

Theorem 5.

L is regular

\Rightarrow Σ^{*} ∖ L

is regular.

< P r o o f >

Let

M = (Q, Σ, δ, q_{0}, F)

be the

D F A

that recognizes L.

That is,

L (M) = L

.

Define

M^{'} = (Q, Σ, δ^{'}, q_{0}, Q ∖ F)

where

δ^{'} : Q \times Σ ⟶ Q

s.t.

\forall q \in Q, a \in Σ, δ^{'} (q, a) = δ (q, a)

\forall w \in Σ^{*} ∖ L

,

w \notin L ⟹ \hat{δ} (q_{0}, w) \notin F

⟹ \hat{δ} (q_{0}, w) \in Q ∖ F

⟹ \hat{δ^{'}} (q_{0}, w) \in Q ∖ F

(Theorem 1.8)

⟹ M^{'}

accepts w

Conversely, if

M^{'}

accepts w,

\hat{δ^{'}} (q_{0}, w) \in Q ∖ F

\hat{δ} (q_{0}, w) \in Q ∖ F

(Theorem 1.8)

Therefore,

\hat{δ} (q_{0}, w) \notin F

Therefore,

w \notin L

(because

w \in L ⟹ M

accepts

w ⟹ \hat{δ} (q_{0}, w) \in F

)

w \in Σ^{*} ∖ L

Therefore,

w \in Σ^{*} ∖ L ⟺ M^{'}

accepts w.

L (M^{'}) = Σ^{*} ∖ L

Σ^{*} ∖ L

is regular.

Theorem 6.

L_{1}

and

L_{2}

are regular

⟹ L_{1} \cap L_{2}

is regular.

< P r o o f >

\exists D F A s

M_{1}

and

M_{2}

s.t.

L (M_{1}) = L_{1}

and

L (M_{2}) = L_{2}

Let

M_{1} = (Q_{1}, Σ, δ_{1}, s_{0}, F_{1})

M_{2} = (Q_{2}, Σ, δ_{2}, s_{0}^{'}, F_{2})

Define

M_{3}

as follows.

M_{3} = (Q_{3}, Σ, δ_{3}, s_{0}^{″}, F_{3})

where

s_{0}^{″} = (s_{0}, s_{0}^{'}), Q_{3} = Q_{1} \times Q_{2}, F_{3} = F_{1} \times F_{2}

δ_{3} : Q_{3} \times Σ ⟶ Q_{3}

s.t.

δ_{3} ((q_{1}, q_{2}), a) = (δ_{1} (q_{1}, a), δ_{2} (q_{2}, a)) \forall q_{1} \in Q_{1}, q_{2} \in Q_{2}, a \in A

.

Claim.

\forall n \in N \cup {0}, w \in Σ^{*}

, where

| w | = n

, if

(i)

s_{0} = r_{0} \overset{w_{1}, δ_{1}}{⟶} r_{1} \overset{w_{2}, δ_{1}}{⟶} r_{2} \dots r_{n - 1} \overset{w_{n}, δ_{1}}{⟶} r_{n}

(ii)

s_{0}^{'} = r_{0}^{'} \overset{w_{1}, δ_{2}}{⟶} r_{1}^{'} \overset{w_{2}, δ_{2}}{⟶} r_{2}^{'} \dots r_{n - 1}^{'} \overset{w_{n}, δ_{2}}{⟶} r_{n}^{'}

(iii)

s_{0}^{″} = r_{0}^{″} \overset{w_{1}, δ_{3}}{⟶} r_{1}^{″} \overset{w_{2}, δ_{3}}{⟶} r_{2}^{″} \dots r_{n - 1}^{″} \overset{w_{n}, δ_{3}}{⟶} r_{n}^{″}

then

r_{n}^{″} = (r_{n}, r_{n}^{'})

.

Proof of Claim is by induction on n.

For

n = 0

, (i), (ii) and (iii) become

s_{0} = r_{0}, s_{0}^{'} = r_{0}^{'}

, and

s_{0}^{″} = r_{0}^{″}

.

s_{0}^{″} = (s_{0}, s_{0}^{'})

(By definition of

M_{3}

.)

Therefore,

r_{0}^{″} = (r_{0}, r_{0}^{'})

Assume the statement is true for

n = k \geq 0

.

(i), (ii) & (iii) for

n = k + 1 \Rightarrow

s_{0} = r_{0} \overset{w_{1}, δ_{1}}{⟶} r_{1} \overset{w_{2}, δ_{1}}{⟶} r_{2} \dots r_{k - 1} \overset{w_{k}, δ_{1}}{⟶} r_{k} \overset{w_{k + 1}, δ_{1}}{⟶} r_{k + 1}

s_{0}^{'} = r_{0}^{'} \overset{w_{1}, δ_{2}}{⟶} r_{1}^{'} \overset{w_{2}, δ_{2}}{⟶} r_{2}^{'} \dots r_{k - 1}^{'} \overset{w_{k}, δ_{2}}{⟶} r_{k}^{'} \overset{w_{k + 1}, δ_{2}}{⟶} r_{k + 1}^{'}

s_{0}^{″} = r_{0}^{″} \overset{w_{1}, δ_{3}}{⟶} r_{1}^{″} \overset{w_{2}, δ_{3}}{⟶} r_{2}^{″} \dots r_{k - 1}^{″} \overset{w_{k}, δ_{3}}{⟶} r_{k}^{″} \overset{w_{k + 1}, δ_{3}}{⟶} r_{k + 1}^{″}

⇒ (i), (ii) & (iii) for

n = k

&

r_{k} \overset{w_{k + 1}, δ_{1}}{⟶} r_{k + 1}

&

r_{k}^{'} \overset{w_{k + 1}, δ_{2}}{⟶} r_{k + 1}^{'}

&

r_{k}^{″} \overset{w_{k + 1}, δ_{3}}{⟶} r_{k + 1}^{″}

\Rightarrow r_{k}^{″} = (r_{k}, r_{k}^{'})

&

r_{k + 1} = δ_{1} (r_{k}, w_{k + 1})

&

r_{k + 1}^{'} = δ_{2} (r_{k}^{'}, w_{k + 1})

&

r_{k + 1}^{″} = δ_{3} (r_{k}^{″}, w_{k + 1})

(Induction Hypothesis)

\Rightarrow r_{k + 1} = δ_{1} (r_{k}, w_{k + 1})

&

r_{k + 1}^{'} = δ_{2} (r_{k}^{'}, w_{k + 1})

&

r_{k + 1}^{″} = δ_{3} ((r_{k}, r_{k}^{'}), w_{k + 1})

\Rightarrow r_{k + 1} = δ_{1} (r_{k}, w_{k + 1})

&

r_{k + 1}^{'} = δ_{2} (r_{k}^{'}, w_{k + 1})

&

r_{k + 1}^{″} = (δ_{1} (r_{k}, w_{k + 1}), δ_{2} (r_{k}^{'}, w_{k + 1}))

(Definition of

δ_{3})

\Rightarrow r_{k + 1}^{″} = (r_{k + 1}, r_{k + 1}^{'})

We now need to show

L_{1} \cap L_{2} = L (M_{3})

.

\forall w \in L_{1} \cap L_{2}, w \in L_{1}

and

w \in L_{2}

.

w \in L_{1} \Rightarrow \exists s_{0} = r_{0}, r_{1}, r_{2}, \dots r_{n}

s.t.

s_{0} = r_{0} \overset{w_{1}, δ_{1}}{⟶} r_{1} \overset{w_{2}, δ_{1}}{⟶} r_{2} \dots \dots r_{n - 1} \overset{w_{n}, δ_{1}}{⟶} r_{n}

&

r_{n} \in F_{1}

w \in L_{2} \Rightarrow \exists s_{0}^{'} = r_{0}^{'}, r_{1}^{'}, r_{2}^{'}, \dots r_{n}^{'}

s.t.

s_{0}^{'} = r_{0}^{'} \overset{w_{1}, δ_{2}}{⟶} r_{1}^{'} \overset{w_{2}, δ_{2}}{⟶} r_{2}^{'} \dots \dots r_{n - 1}^{'} \overset{w_{n}, δ_{2}}{⟶} r_{n}^{'}

&

r_{n}^{'} \in F_{2}

Let

r_{0}^{″} = s_{0}^{″} = (s_{0}, s_{0}^{'})

r_{1}^{″} = δ_{3} (r_{0}^{″}, w_{1}), \dots r_{i + 1}^{″} = δ_{3} (r_{i}^{″}, w_{i + 1}), \dots r_{n}^{″} = δ_{3} (r_{n - 1}^{″}, w_{n})

.

Therefore,

s_{0}^{″} = r_{0}^{″} \overset{w_{1}, δ_{3}}{⟶} r_{1}^{″} \overset{w_{2}, δ_{3}}{⟶} r_{2}^{″} \dots r_{n - 1}^{″} \overset{w_{n}, δ_{3}}{⟶} r_{n}^{″}

By Claim,

r_{n}^{″} = (r_{n}, r_{n}^{'})

Since

r_{n} \in F_{1}

and

r_{n}^{'} \in F_{2}, r_{n}^{″} \in F_{1} \times F_{2} = F_{3}

.

Therefore,

M_{3}

accepts w.

w \in L (M_{3})

L_{1} \cap L_{2} \subset L (M_{3})

Conversely, if

w \in L (M_{3})

,

\exists r_{0}^{″}, r_{1}^{″}, r_{2}^{″}, \dots r_{n}^{″} \in Q_{3}

s.t.

s_{0}^{″} = r_{0}^{″} \overset{w_{1}, δ_{3}}{⟶} r_{1}^{″} \overset{w_{2}, δ_{3}}{⟶} r_{2}^{″} \dots \dots r_{n - 1}^{″} \overset{w_{n}, δ_{3}}{⟶} r_{n}^{″}

&

r_{n}^{″} \in F_{3}

Take

r_{0} = s_{0}

;

r_{i + 1} = δ_{1} (r_{i}, w_{i + 1}) \forall i = 0, 1, 2, \dots n - 1

;

r_{0}^{'} = s_{0}^{'}

;

r_{i + 1}^{'} = δ_{2} (r_{i}^{'}, w_{i + 1}) \forall i = 0, 1, 2, \dots n - 1

.

Therefore,

s_{0} = r_{0} \overset{w_{1}, δ_{1}}{⟶} r_{1} \overset{w_{2}, δ_{1}}{⟶} r_{2} \dots \dots r_{n - 1} \overset{w_{n}, δ_{1}}{⟶} r_{n}

s_{0}^{'} = r_{0}^{'} \overset{w_{1}, δ_{2}}{⟶} r_{1}^{'} \overset{w_{2}, δ_{2}}{⟶} r_{2}^{'} \dots \dots r_{n - 1}^{'} \overset{w_{n}, δ_{2}}{⟶} r_{n}^{'}

By Claim,

r_{n}^{″} = (r_{n}, r_{n}^{'})

Since

r_{n}^{″} \in F_{3} = F_{1} \times F_{2}, r_{n} \in F_{1}

and

r_{n}^{'} \in F_{2}

.

M_{1}

accepts w and

M_{2}

accepts w.

w \in L (M_{1})

and

w \in L (M_{2})

w \in L_{1}

and

w \in L_{2}

w \in L_{1} \cap L_{2}

L (M_{3}) \subset L_{1} \cap L_{2}

Combining both directions,

L (M_{3}) = L_{1} \cap L_{2}

L_{1} \cap L_{2}

is regular.

Theorem 7.

L_{1}

and

L_{2}

are regular

⟹ L_{1} \cup L_{2}

is regular.

< P r o o f >

From set theory,

Σ^{*} ∖ (L_{1} \cup L_{2}) = (Σ^{*} ∖ L_{1}) \cap (Σ^{*} ∖ L_{2})

L_{1}

is regular

⟹ Σ^{*} ∖ L_{1}

is regular. (Theorem 1.25)

L_{2}

is regular

⟹ Σ^{*} ∖ L_{2}

is regular. (Theorem 1.25)

Σ^{*} ∖ L_{1}

and

Σ^{*} ∖ L_{2}

are regular

⟹ (Σ^{*} ∖ L_{1}) \cap (Σ^{*} ∖ L_{2})

is regular. (Theorem 1.26)

Therefore,

Σ^{*} ∖ (L_{1} \cup L_{2})

is regular.

Therefore,

L_{1} \cup L_{2}

is regular. (Theorem 1.25)

Theorem 8.

Every

N F A

can be converted to another

N F A

with the following properties.

(i) There is only one accept state which has transition arrows coming in and no

transition arrows going out.

(ii) The accept state is different from the start state.

(iii) The start state has no arrows coming in from other states but only transition

arrows going out.

< P r o o f >

Let

N_{1} = (Q_{1}, Σ, δ_{1}, q_{1}, F_{1})

be the

N F A

to be converted.

Define

N F A

,

N = (Q, Σ, δ, q_{0}, {q_{a}})

where

Q = Q_{1} \cup {q_{0}, q_{a}}, q_{0} \neq q_{a}

and

δ (q, x) = \{\begin{matrix} {q_{1}} & i f & (q, x,) = (q_{0}, ϵ) \\ \emptyset & i f & q = q_{0} a n d x \neq ϵ \\ \emptyset & i f & q = q_{a} \\ δ_{1} (q, x) & i f & q \in Q_{1} ∖ F_{1} \\ δ_{1} (q, x) & i f & q \in F_{1} a n d x \neq ϵ \\ δ_{1} (q, x) \cup {q_{a}} & i f & q \in F_{1} a n d x = ϵ \end{matrix}

It is clear that N satisfies conditions (i), (ii) and (iii).

Furthermore,

δ_{1} (q, x) \subset δ (q, x) \forall x \in Σ_{ϵ}, q \in Q_{1}

and hence

{\hat{δ}}_{1} ({q}, w) \subset \hat{δ} ({q}, w) \forall w \in Σ_{ϵ}^{*}

by Proposition 1.18.

It remains to show that

\forall w \in Σ_{ϵ}^{*}, N_{1}

accepts

w \Leftrightarrow N

accepts w.

For forward direction

" \Rightarrow "

,

Let

N_{1}

accepts w.

q_{1} \overset{w, {\hat{δ}}_{1}}{⟶} r

,

r \in F_{1}

.

Since

{\hat{δ}}_{1} (q_{1}, w) \subset \hat{δ} (q_{1}, w), q_{1} \overset{w, \hat{δ}}{⟶} r, r \in F_{1}

.

Since

δ (q_{0}, ϵ) = {q_{1}}, q_{0} \overset{ϵ, δ}{⟶} q_{1}

.

Furthermore, since

δ (q, ϵ) = δ_{1} (q, ϵ) \cup {q_{a}} \forall q \in F_{1}

,

δ (r, ϵ) = δ_{1} (r, ϵ) \cup {q_{a}}

.

Therefore,

q_{a} \in δ (r, ϵ)

.

That is,

r \overset{ϵ, δ}{⟶} q_{a}

Therefore,

q_{0} \overset{ϵ, δ}{⟶} q_{1} \overset{w, \hat{δ}}{⟶} r \overset{ϵ, δ}{⟶} q_{a}

.

Therefore N accepts

ϵ w ϵ

which is the same as w.

Therefore,

N_{1}

accepts

w \Rightarrow N

accepts w.

Conversely, if N accepts

w = x_{1} x_{2} \dots x_{n}

where

x_{i} \in Σ_{ϵ}

for

n \geq 1

&

1 \leq i \leq n

.

(Note that

w = ϵ

if

x_{i} = ϵ \forall i

.)

\exists r_{0}, r_{1}, r_{2}, \dots r_{n} \in Q

s.t.

q_{0} = r_{0} \overset{x_{1}, δ}{⟶} r_{1} \overset{x_{2}, δ}{⟶} r_{2} \dots r_{n - 1} \overset{x_{n}, δ}{⟶} r_{n}

&

r_{n} \in {q_{a}}

Since the only way to transition to

q_{a}

using δ is from a state in

F_{1}

via the ϵ arrow, we must have

r_{n - 1} \in F_{1}

&

x_{n} = ϵ

.

Since the only way to transition out of

q_{0} (= r_{0})

using δ is via an ϵ arrow, we must have

x_{1} = ϵ

.

Since

δ (q_{0}, ϵ) = {q_{1}}

, we must have

r_{1} = q_{1}

.

We now can rewrite the above computation as

q_{0} = r_{0} \overset{ϵ, δ}{⟶} q_{1} \overset{x_{2}, δ}{⟶} r_{2} \dots r_{n - 2} \overset{x_{n - 1}, δ}{⟶} r_{n - 1} \overset{ϵ, δ}{⟶} q_{a}

&

r_{n - 1} \in F_{1}

.

For all

1 \leq j \leq n - 2, r_{j} \notin {q_{0}, q_{a}}

because

r_{j}

has both incoming and outgoing arrows.

Therefore,

r_{j} \in Q_{1}

.

Claim.

r_{j} \overset{x_{j + 1}, δ_{1}}{⟶} r_{j + 1} \forall 1 \leq j \leq n - 2

.

Since

r_{j} \in Q_{1}, δ (r_{j}, x_{j + 1}) = δ_{1} (r_{j}, x_{j + 1})

or

δ_{1} (r_{j}, x_{j + 1}) \cup {q_{a}}

by definition of δ.

r_{j} \overset{x_{j + 1}, δ}{⟶} r_{j + 1}

\Rightarrow r_{j + 1} \in δ (r_{j}, x_{j + 1})

\Rightarrow r_{j + 1} \in δ_{1} (r_{j}, x_{j + 1})

or

r_{j + 1} \in δ_{1} (r_{j}, x_{j + 1}) \cup {q_{a}}

\Rightarrow r_{j + 1} \in δ_{1} (r_{j}, x_{j + 1})

or

r_{j + 1} \in δ_{1} (r_{j}, x_{j + 1})

(because

r_{j + 1} \neq q_{a}

)

\Rightarrow r_{j + 1} \in δ_{1} (r_{j}, x_{j + 1})

\Rightarrow r_{j} \overset{x_{j + 1}, δ_{1}}{⟶} r_{j + 1}

The computation now becomes

q_{0} = r_{0} \overset{ϵ, δ}{⟶} q_{1} \overset{x_{2}, δ_{1}}{⟶} r_{2} \dots r_{n - 2} \overset{x_{n - 1}, δ_{1}}{⟶} r_{n - 1} \overset{ϵ, δ}{⟶} q_{a}

&

r_{n - 1} \in F_{1}

.

Therefore,

q_{1} \overset{x_{2}, δ_{1}}{⟶} r_{2} \dots r_{n - 2} \overset{x_{n - 1}, δ_{1}}{⟶} r_{n - 1}

&

r_{n - 1} \in F_{1}

.

Therefore,

N_{1}

accepts

x_{2} x_{3} \dots \dots x_{n - 1}

.

Therefore,

N_{1}

accepts

w = x_{1} x_{2} x_{3} \dots \dots x_{n - 1} x_{n}

because

x_{1} = x_{n} = ϵ

.

Therefore, N accepts

w \Rightarrow N_{1}

accepts w.

This completes the proof of Theorem 1.28.

Theorem 9.

For any regular languages

L_{1}

and

L_{2}

, the language

L_{1} L_{2}

is regular.

< P r o o f >

Since

L_{1}

and

L_{2}

are regular, there exist

N F A s

N_{1}, N_{2}

that recognize

L_{1}

and

L_{2}

.

By Theorem 1.28, we can start with

N_{1}

and

N_{2}

defined as follows.

N_{1} = (Q_{1}, Σ, δ_{1}, q_{1 s}, {q_{1 a}})

where

q_{1 s} \neq q_{1 a}, q_{1 s} \notin δ_{1} (q, x) \forall q \in Q_{1}, x \in Σ_{ϵ}

and

δ_{1} (q_{1 a}, x) = \emptyset \forall x \in Σ_{ϵ}

.

N_{2} = (Q_{2}, Σ, δ_{2}, q_{2 s}, {q_{2 a}})

where

q_{2 s} \neq q_{2 a}, q_{2 s} \notin δ_{2} (q, x) \forall q \in Q_{2}, x \in Σ_{ϵ}

and

δ_{2} (q_{2 a}, x) = \emptyset \forall x \in Σ_{ϵ}

.

We can further assume that

Q_{1} \cap Q_{2} = \emptyset

because we can always replace

Q_{1}

with a set of objects which are completely different from those in

Q_{2}

without affecting the function of

N_{1}

.

Now construct

N = (Q, Σ, δ, q_{1 s}, {q_{2 a}})

where

Q = Q_{1} \cup Q_{2}

.

δ (q, x) = \{\begin{matrix} δ_{1} (q, x) & i f & q \in Q_{1} ∖ {q_{1 a}} \\ δ_{1} (q_{1 a}, x) & i f & q = q_{1 a} & x \neq ϵ \\ δ_{1} (q_{1 a}, x) \cup {q_{2 s}} & i f & q = q_{1 a} & x = ϵ \\ δ_{2} (q, x) & i f & q \in Q_{2} \end{matrix}

We now need to show

L (N) = L_{1} L_{2}

.

If

w \in L_{1} L_{2}, w = w_{1} w_{2}

where

w_{1}, w_{2} \in Σ_{ϵ}^{*}

and

w_{1} \in L_{1}, w_{2} \in L_{2}

.

Since

N_{1}

recognizes

L_{1}

and

N_{2}

recognizes

L_{2}, N_{1}

accepts

w_{1}

and

N_{2}

accepts

w_{2}

.

\exists r_{1} \in {q_{1 a}}

and

r_{2} \in {q_{2 a}}

such that

q_{1 s} \overset{w_{1}, {\hat{δ}}_{1}}{⟶} r_{1}

and

q_{2 s} \overset{w_{2}, {\hat{δ}}_{2}}{⟶} r_{2}

(By Theorem 1.19 of

N F A

Acceptance)

q_{1 s} \overset{w_{1}, \hat{δ}}{⟶} q_{1 a}

and

q_{2 s} \overset{w_{2}, \hat{δ}}{⟶} q_{2 a}

(Proposition 1.18 and

r_{1} = q_{1 a}; r_{2} = q_{2 a}

).

By definition of

δ, q_{2 s} \in δ (q_{1 a}, ϵ)

.

Therefore,

q_{1 s} \overset{w_{1}, \hat{δ}}{⟶} q_{1 a} \overset{ϵ, δ}{⟶} q_{2 s} \overset{w_{2}, \hat{δ}}{⟶} q_{2 a}

.

Therefore, N accepts

w_{1} ϵ w_{2},

which is the same as

w_{1} w_{2}

.

L_{1} L_{2} \subset L (N)

Conversely, if N accepts

w = x_{1} x_{2} \dots x_{n},

where

x_{1}, x_{2}, \dots x_{n} \in Σ_{ϵ}

for

n \geq 1

,

\exists r_{0}, r_{1}, r_{2}, \dots r_{n} \in Q

such that

q_{1 s} = r_{0} \overset{x_{1}, δ}{⟶} r_{1} \overset{x_{2}, δ}{⟶} r_{2} \dots r_{n - 1} \overset{x_{n}, δ}{⟶} r_{n}

&

r_{n} = q_{2 a}

.

(Note that

w = ϵ

if

x_{i} = ϵ \forall i

).

Since the only way to transition from a state of

N_{1}

to a state of

N_{2}

is via

q_{1 a}

to

q_{2 s}

using the ϵ arrow, ∃ an

r_{i} = q_{1 a}

and

r_{i + 1} = q_{2 s}

such that

x_{i + 1} = ϵ

and the computation becomes

q_{1 s} = r_{0} \overset{x_{1}, δ}{⟶} r_{1} \overset{x_{2}, δ}{⟶} r_{2} \dots r_{i - 1} \overset{x_{i}, δ}{⟶} q_{1 a} \overset{ϵ, δ}{⟶} q_{2 s} \overset{x_{i + 2}, δ}{⟶} r_{i + 2} \dots r_{n - 1} \overset{x_{n}, δ}{⟶} r_{n}; r_{n} = q_{2 a}

.

Claim 1.

r_{0}, r_{1}, r_{2}, \dots r_{i - 1} \in Q_{1}

.

q_{1 s} = r_{0} \Rightarrow r_{0} \in Q_{1}

.

Assume for contradiction that

r_{i - 1} \notin Q_{1}

.

Then

r_{i - 1} \in Q_{2}

.

r_{i - 1} \overset{x_{i}, δ}{⟶} q_{1 a}

\Rightarrow r_{i - 1} \overset{x_{i}, δ_{2}}{⟶} q_{1 a}

(

δ (q, x) = δ_{2} (q, x)

if

q \in Q_{2}

)

\Rightarrow q_{1 a} \in Q_{2}

⇒ Contradiction

Therefore,

r_{i - 1} \in Q_{1}

.

With similar and inductive argument, we can conclude

r_{i - 2}, \dots r_{2}, r_{1}

are all in

Q_{1}

.

Claim 2.

r_{j} \neq q_{1 a} \forall 0 \leq j \leq i - 1

.

Assume for contradiction

r_{j} = q_{1 a}

for some

0 \leq j \leq i - 1

.

Therefore,

r_{j} \overset{x_{j + 1}, δ}{⟶} r_{j + 1} \Leftrightarrow q_{1 a} \overset{x_{j + 1}, δ}{⟶} r_{j + 1} \Leftrightarrow r_{j + 1} \in δ (q_{1 a}, x_{j + 1})

.

By definition of

δ, δ (q_{1 a}, x_{j + 1})

= δ_{1} (q_{1 a}, x_{j + 1})

or

δ_{1} (q_{1 a}, x_{j + 1}) \cup {q_{2 s}}

= \emptyset

or

\emptyset \cup {q_{2 s}}

= \emptyset

or

{q_{2 s}}

Therefore,

r_{j + 1} \in \emptyset

or

r_{j + 1} \in {q_{2 s}}

.

Either of these leads to a contradiction.

Therefore,

r_{j} \neq q_{1 a} \forall 0 \leq j \leq i - 1

.

Combining Claim 1 and Claim 2,

r_{j} \in Q_{1} ∖ {q_{1 a}} \forall 0 \leq j \leq i - 1

.

By definition of

δ, δ (r_{j}, x) = δ_{1} (r_{j}, x) \forall 0 \leq j \leq i - 1

.

Therefore, computation

q_{1 s} = r_{0} \overset{x_{1}, δ}{⟶} r_{1} \overset{x_{2}, δ}{⟶} r_{2} \dots r_{i - 1} \overset{x_{i}, δ}{⟶} q_{1 a}

can be replaced by computation

q_{1 s} = r_{0} \overset{x_{1}, δ_{1}}{⟶} r_{1} \overset{x_{2}, δ_{1}}{⟶} r_{2} \dots r_{i - 1} \overset{x_{i}, δ_{1}}{⟶} q_{1 a}

.

Therefore,

N_{1}

accepts

w_{1} = x_{1} x_{2} \dots x_{i}

.

w_{1} \in L (N_{1}) = L_{1}

.

Claim 3.

r_{j} \in Q_{2} \forall i + 2 \leq j \leq n - 1

.

q_{2 s} \overset{x_{i + 2}, δ}{⟶} r_{i + 2}

\Rightarrow r_{i + 2} \in δ (q_{2 s}, x_{i + 2})

\Rightarrow r_{i + 2} \in δ_{2} (q_{2 s}, x_{i + 2}) (δ (q, x) = δ_{2} (q, x)

if

q \in Q_{2})

\Rightarrow q_{2 s} \overset{x_{i + 2}, δ_{2}}{⟶} r_{i + 2}

\Rightarrow r_{i + 2} \in Q_{2}

.

With similar and inductive argument, we can show that

r_{i + 3}, \dots r_{n - 1}

are all in

Q_{2}

.

Therefore, computation

q_{2 s} \overset{x_{i + 2}, δ}{⟶} r_{i + 2} \dots r_{n - 1} \overset{x_{n}, δ}{⟶} r_{n}; r_{n} = q_{2 a}

can be replaced by computation

q_{2 s} \overset{x_{i + 2}, δ_{2}}{⟶} r_{i + 2} \dots r_{n - 1} \overset{x_{n}, δ_{2}}{⟶} r_{n}; r_{n} = q_{2 a}

Therefore,

N_{2}

accepts

w_{2} = x_{i + 2} x_{i + 3} \dots x_{n}

.

w_{2} \in L (N_{2}) = L_{2}

.

w_{1} w_{2} \in L_{1} L_{2}

.

w = x_{1} x_{2} \dots x_{i} x_{i + 1} x_{i + 2} \dots x_{n}

= w_{1} x_{i + 1} w_{2}

= w_{1} w_{2} (x_{i + 1} = ϵ)

Therefore,

w \in L_{1} L_{2}

.

Therefore,

L (N) \subset L_{1} L_{2}

.

Combining both directions,

L (N) = L_{1} L_{2}

.

Theorem 10.

For any regular language

L, L^{*}

is regular.

< P r o o f >

Let

N_{1}

be the

N F A

that recognizes L.

By Theorem 1.28, we can start with an

N_{1}

defined as follows.

N_{1} = (Q_{1}, Σ, T_{1}, q_{1}, {q_{a}})

where

q_{1} \neq q_{a}, q_{1} \notin T_{1} (q, x) \forall q \in Q_{1}, x \in Σ_{ϵ}

and

T_{1} (q_{a}, x) = \emptyset \forall x \in Σ_{ϵ}

.

Let

N = (Q, Σ, T, q_{0}, {q_{a}, q_{0}})

such that

Q = Q_{1} \cup {q_{0}}

.

T (q, x) = \{\begin{matrix} T_{1} (q, x) & i f & q \in Q_{1} ∖ {q_{a}} \\ {q_{1}} \cup T_{1} (q_{a}, ϵ) & i f & q = q_{a} & x = ϵ \\ T_{1} (q_{a}, x) & i f & q = q_{a} & x \neq ϵ \\ {q_{1}} & i f & q = q_{0} & x = ϵ \\ \emptyset & i f & q = q_{0} & x \neq ϵ \end{matrix}

We need to show

w \in L^{*} \Leftrightarrow N

accepts w.

If

w \in L^{*}

,

w \in L^{M}

for some

M \geq 0

.

If

M = 0, w \in L^{0} = {ϵ}

.

Therefore,

w = ϵ

.

ϵ is accepted by N because N has a start state that is also an accept state.

For

M \geq 1

,

let

w = w_{1} w_{2} \dots w_{M}

with each

w_{i} \in L

for

1 \leq i \leq M

.

Therefore,

N_{1}

accepts

w_{i}

for each i.

For each

i, q_{1} \overset{w_{i}, \hat{T_{1}}}{⟶} q_{a}

(By Theorem 1.19 of

N F A

Acceptance)

For each

i, q_{1} \overset{w_{i}, \hat{T}}{⟶} q_{a}

(Proposition 1.18)

Since

T (q_{a}, ϵ) = {q_{1}} \cup T_{1} (q_{a}, ϵ)

,

q_{1} \in {q_{1}} \cup T_{1} (q_{a}, ϵ) \Rightarrow q_{1} \in T (q_{a}, ϵ) \Rightarrow q_{a} \overset{ϵ, T}{⟶} q_{1}

.

Therefore,

q_{0} \overset{ϵ, T}{⟶} q_{1} \overset{w_{1}, \hat{T}}{⟶} q_{a} \overset{ϵ, T}{⟶} q_{1} \overset{w_{2}, \hat{T}}{⟶} q_{a} \overset{ϵ, T}{⟶} q_{1} \dots q_{a} \overset{ϵ, T}{⟶} q_{1} \overset{w_{M}, \hat{T}}{⟶} q_{a}

Therefore, N accepts

ϵ w_{1} ϵ w_{2} \dots ϵ w_{M} = w_{1} w_{2} \dots w_{M} = w

.

Therefore,

w \in L^{*} \Rightarrow N

accepts w.

Conversely, if N accepts

w = x_{1} x_{2} x_{3} \dots x_{n}

where

x_{i} \in Σ_{ϵ}

for

1 \leq i \leq n & n \geq 1

.

(Note that

w = ϵ

if

x_{i} = ϵ \forall i

.)

\exists r_{0}, r_{1}, r_{2}, \dots r_{n} \in Q

such that

q_{0} = r_{0} \overset{x_{1}, T}{⟶} r_{1} \overset{x_{2}, T}{⟶} r_{2} \dots r_{n - 1} \overset{x_{n}, T}{⟶} r_{n}

&

r_{n} \in {q_{0}, q_{a}}

.

Since

T (q_{0}, x) = \emptyset

if

x \neq ϵ, x_{1} = ϵ

.

Furthermore,

T (q_{0}, ϵ) = {q_{1}}

.

Therefore,

r_{1} = q_{1}

.

r_{n} \in {q_{0}, q_{a}} \Rightarrow r_{n} = q_{a}

because

q_{0}

has no incoming arrows.

The computation now becomes

q_{0} \overset{ϵ, T}{⟶} q_{1} \overset{x_{2}, T}{⟶} r_{2} \dots r_{n - 1} \overset{x_{n}, T}{⟶} q_{a}

.

Therefore,

q_{1} \overset{x_{2}, T}{⟶} r_{2} \dots r_{n - 1} \overset{x_{n}, T}{⟶} q_{a}

.

Claim 1:

For the computation,

q_{0} = r_{0} \overset{x_{1}, T}{⟶} r_{1} \overset{x_{2}, T}{⟶} r_{2} \dots r_{i} \overset{x_{i + 1}, T}{⟶} r_{i + 1} \dots r_{n - 1} \overset{x_{n}, T}{⟶} r_{n}

&

r_{n} = q_{a}

,

if

\exists r_{i} = q_{a}

for

1 < i < n - 1

, then

r_{i + 1} = q_{1}

&

x_{i + 1} = ϵ

.

r_{i} \overset{x_{i + 1}, T}{⟶} r_{i + 1} \Rightarrow q_{a} \overset{x_{i + 1}, T}{⟶} r_{i + 1} \Rightarrow r_{i + 1} \in T (q_{a}, x_{i + 1})

.

T (q_{a}, x_{i + 1})

= T_{1} (q_{a}, x_{i + 1})

or

T_{1} (q_{a}, x_{i + 1}) \cup {q_{1}}

(by definition of T)

= \emptyset

or

\emptyset \cup {q_{1}}

(by definition of

N_{1}

)

= \emptyset

or

{q_{1}}

Therefore,

r_{i + 1} \in \emptyset

or

r_{i + 1} \in {q_{1}}

.

Therefore,

r_{i + 1} \in {q_{1}}

and hence

r_{i + 1} = q_{1}

.

Therefore,

r_{i} \overset{x_{i + 1}, T}{⟶} r_{i + 1}

\Rightarrow q_{a} \overset{x_{i + 1}, T}{⟶} q_{1}

\Rightarrow q_{1} \in T_{1} (q_{a}, x_{i + 1})

if

x_{i + 1} \neq ϵ

\Rightarrow q_{1} \in \emptyset

if

x_{i + 1} \neq ϵ

(by definition of

N_{1}

)

⇒ Contradiction if

x_{i + 1} \neq ϵ

.

Therefore,

x_{i + 1} = ϵ

.

Claim 2:

For any computation

q_{1} \overset{x_{1}, T}{⟶} s_{1} \overset{x_{2}, T}{⟶} s_{2} \dots s_{i} \overset{x_{i + 1}, T}{⟶} s_{i + 1} \dots s_{n - 1} \overset{x_{n}, T}{⟶} s_{n} \overset{x_{n + 1}, T}{⟶} q_{a}

,

if ∃ no

q_{a}

in between

q_{1}

and

q_{a}

, that is

s_{i} \neq q_{a}

for

1 \leq i \leq n

, then

q_{1} \overset{w, \hat{T_{1}}}{⟶} q_{a}

for some

w \in Σ_{ϵ}^{*}

.

q_{0} \notin {s_{1}, s_{2}, \dots s_{n}}

because

q_{0}

has no incoming arrows.

Therefore,

s_{1}, s_{2}, \dots s_{n} \in Q_{1}

.

Therefore,

s_{1}, s_{2}, \dots s_{n} \in Q_{1} ∖ {q_{a}}

.

By definition of

T, T (q, x) = T_{1} (q, x)

if

q \in Q_{1} ∖ {q_{a}}

.

Therefore,

T (s_{i}, x) = T_{1} (s_{i}, x)

for

1 \leq i \leq n

&

x \in Σ_{ϵ}

.

The given computation can be replaced by

q_{1} \overset{x_{1}, T_{1}}{⟶} s_{1} \overset{x_{2}, T_{1}}{⟶} s_{2} \dots s_{i} \overset{x_{i + 1}, T_{1}}{⟶} s_{i + 1} \dots s_{n - 1} \overset{x_{n}, T_{1}}{⟶} s_{n} \overset{x_{n + 1}, T_{1}}{⟶} q_{a}

,

q_{1} \overset{w, \hat{T_{1}}}{⟶} q_{a}

where

w = x_{1} x_{2} x_{3} \dots x_{n + 1}

.

Back to computation

q_{1} \overset{x_{2}, T}{⟶} r_{2} \overset{x_{3}, T}{⟶} r_{3} \dots r_{n - 1} \overset{x_{n}, T}{⟶} q_{a}

.

Let m be the number of

q_{a}

’s in between

q_{1}

&

q_{a}

.

If

m = 0

, by Claim 2,

q_{1} \overset{w^{'}, \hat{T_{1}}}{⟶} q_{a}

where

w^{'} = x_{2} x_{3} \dots x_{n}

.

N_{1}

accepts

w^{'}

.

w = x_{1} w^{'} = ϵ w^{'} = w^{'}

.

N_{1}

accepts w.

w \in L \subset L^{*}

.

For

m \geq 1, \exists r_{j_{1}} = r_{j_{2}} = \dots r_{j_{m}} = q_{a}

.

By Claim 1,

r_{j_{1} + 1} = r_{j_{2} + 1} = \dots r_{j_{m} + 1} = q_{1}

.

q_{1} \overset{w_{1}, \hat{T_{1}}}{⟶} r_{j_{1}} = q_{a}

(Claim 2)

q_{a} = r_{j_{1}} \overset{ϵ, T}{⟶} r_{j_{1} + 1} = q_{1}

(Claim 1)

q_{1} = r_{j_{1} + 1} \overset{w_{2}, \hat{T_{1}}}{⟶} r_{j_{2}} = q_{a}

(Claim 2)

q_{a} = r_{j_{2}} \overset{ϵ, T}{⟶} r_{j_{2} + 1} = q_{1}

(Claim 1)

⋮

q_{1} = r_{j_{m - 1} + 1} \overset{w_{m}, \hat{T_{1}}}{⟶} r_{j_{m}} = q_{a}

(Claim 2)

q_{a} = r_{j_{m}} \overset{ϵ, T}{⟶} q_{1} \overset{w_{m + 1}, \hat{T_{1}}}{⟶} q_{a}

(Claim 1 & Claim 2)

Therefore,

N_{1}

accepts

w_{1}, w_{2}, \dots w_{m}, w_{m + 1}

.

w_{1}, w_{2}, \dots w_{m}, w_{m + 1} \in L

.

w_{1} w_{2} \dots w_{m} w_{m + 1} \in L^{m + 1}

However,

x_{2} x_{3} \dots x_{n} = w_{1} ϵ w_{2} ϵ \dots ϵ w_{m} ϵ w_{m + 1} = w_{1} w_{2} \dots w_{m} w_{m + 1}

.

w = x_{1} x_{2} x_{3} \dots x_{n} = ϵ x_{2} x_{3} \dots x_{n} = x_{2} x_{3} \dots x_{n} = w_{1} w_{2} \dots w_{m} w_{m + 1}

.

Therefore,

w \in L^{m + 1} \subset L^{*}

.

Therefore, N accepts

w \Rightarrow w \in L^{*}

.

Combining both directions,

w \in L^{*} \Leftrightarrow N

accepts w.

This completes the proof of Theorem 1.30.

Definition 10.

For any string

w = x_{1} x_{2} \dots x_{n}

, where

x_{i} \in Σ_{ϵ}

for each i, the reverse of w, written

w^{R}

is the string

x_{n} x_{n - 1} \dots x_{1}

.

For any language A,

A^{R} \overset{d e f}{=} {w^{R} ∣ w \in A}

.

Theorem 11.

For any language A, A is regular iff

A^{R}

is regular.

< P r o o f >

Since A is regular, there is an

N F A

,

N_{A}

that recognizes it.

Let

N_{A} = (Q_{A}, Σ, δ_{A}, q_{A}, F_{A})

.

Construct

N_{A^{R}} = (Q_{A} \cup {q_{s}}, Σ, δ_{A^{R}}, q_{s}, {q_{A}})

where

q_{s} \notin Q_{A}

such that

δ_{A^{R}} (q, x) = \{\begin{matrix} F_{A} & i f & (q, x) = (q_{s}, ϵ) \\ \emptyset & i f & q = q_{s} & x \neq ϵ \\ {p \in Q_{A} ∣ q \in δ_{A} (p, x)} & i f & q \in Q_{A} \end{matrix}

From the third row of this definition, it immediately follows that

p \in δ_{A^{R}} (q, x) \Leftrightarrow q \in δ_{A} (p, x)

or

q \overset{x, δ_{A^{R}}}{⟶} p \Leftrightarrow p \overset{x, δ_{A}}{⟶} q \dots \dots (*)

.

Claim: ∃ a computation path for w from p to q via transition function

δ_{A}

iff ∃ a computation path for

w^{R}

from q to p via transition function

δ_{A^{R}}

.

That is,

p \overset{w, \hat{δ_{A}}}{⟶} q \Leftrightarrow q \overset{w^{R}, \hat{δ_{A^{R}}}}{⟶} p

.

This Claim can be proved by induction on

| w |

.

For

| w | = 1

,

w = w^{R} = x

where

x \in Σ_{ϵ}

.

From

(*)

,

q \overset{x, δ_{A^{R}}}{⟶} p \Leftrightarrow p \overset{x, δ_{A}}{⟶} q

Therefore, the statement is true for

| w | = 1

.

Assume the statement is true for

| w | = k

where

k \geq 1

.

That is ,

p \overset{w, \hat{δ_{A}}}{⟶} q \Leftrightarrow q \overset{w^{R}, \hat{δ_{A^{R}}}}{⟶} p

for

| w | = k

.

p \overset{w x, \hat{δ_{A}}}{⟶} q

\Leftrightarrow p \overset{w, \hat{δ_{A}}}{⟶} q^{'} \overset{x, δ_{A}}{⟶} q

(Proposition 1.12)

\Leftrightarrow p \overset{w, \hat{δ_{A}}}{⟶} q^{'}

and

q^{'} \overset{x, δ_{A}}{⟶} q

\Leftrightarrow q^{'} \overset{w^{R}, \hat{δ_{A^{R}}}}{⟶} p

and

q \overset{x, δ_{A^{R}}}{⟶} q^{'}

(Induction Hypothesis and (*))

\Leftrightarrow q \overset{x, δ_{A^{R}}}{⟶} q^{'} \overset{w^{R}, \hat{δ_{A^{R}}}}{⟶} p

\Leftrightarrow q \overset{x w^{R}, \hat{δ_{A^{R}}}}{⟶} p

(Proposition 1.12)

\Leftrightarrow q \overset{{(w x)}^{R}, \hat{δ_{A^{R}}}}{⟶} p

(

x w^{R} = {(w x)}^{R}

)

The statement is true for

| w | = k + 1

and the proof of Claim is complete.

To prove that

A^{R}

is regular, we need to prove that

w^{R} \in A^{R}

iff

N_{A^{R}}

accepts

w^{R}

.

If

w^{R} \in A^{R}

,

w \in A

.

Since

N_{A}

accepts w,

q_{A} \overset{w, \hat{δ_{A}}}{⟶} q

,

q \in F_{A}

(Theorem 1.19 –

N F A

acceptance)

By Claim,

q \overset{w^{R}, \hat{δ_{A^{R}}}}{⟶} q_{A}

Since

δ_{A^{R}} (q_{s}, ϵ) = F_{A}

, and

q \in F_{A}

,

q_{s} \overset{ϵ, δ_{A^{R}}}{⟶} q

.

Therefore,

q_{s} \overset{ϵ, δ_{A^{R}}}{⟶} q \overset{w^{R}, \hat{δ_{A^{R}}}}{⟶} q_{A}

.

N_{A^{R}}

accepts

ϵ w^{R}

(Theorem 1.19 –

N F A

acceptance)

N_{A^{R}}

accepts

w^{R}

.

Conversely, if

N_{A^{R}}

accepts

w^{R}

,

N_{A^{R}}

accepts

ϵ w^{R}

.

q_{s} \overset{ϵ w^{R}, \hat{δ_{A^{R}}}}{⟶} q_{A}

(Theorem 1.19 –

N F A

acceptance)

q_{s} \overset{ϵ, δ_{A^{R}}}{⟶} q \overset{w^{R}, \hat{δ_{A^{R}}}}{⟶} q_{A}

(Proposition 1.12)

Since

δ_{A^{R}} (q_{s}, ϵ) = F_{A}

,

q \in F_{A}

.

q_{A} \overset{w, \hat{δ_{A}}}{⟶} q

, and

q \in F_{A}

(Claim)

Therefore,

N_{A}

accepts

w

(Theorem 1.19 –

N F A

acceptance)

w \in A

(

N_{A}

recognizes A)

w^{R} \in A^{R}

.

w^{R} \in A^{R}

iff

N_{A^{R}}

accepts

w^{R}

.

We have proved that A is regular

\Rightarrow A^{R}

is regular.

On the other hand, sine

{(A^{R})}^{R} = A

,

A^{R}

is regular

\Rightarrow {(A^{R})}^{R}

is regular

\Rightarrow A

is regular.

Therefore, A is regular iff

A^{R}

is regular.

6. Regular Expression

Definition 11.

(Regular Expression)

Let Σ be a finite alphabet.

ℜ_{Σ}

is a set with the following properties:

(a)

R \in ℜ_{Σ}

iff R is one of the following:

(i) a for some

a \in Σ

(ii)

\hat{ϵ}

(iii)

\hat{\emptyset}

(iv)

R_{1} \hat{\cup} R_{2}

for some

R_{1}, R_{2} \in ℜ_{Σ}

(v)

R_{1} \hat{•} R_{2}

for some

R_{1}, R_{2} \in ℜ_{Σ}

(vi)

R_{1}^{\hat{*}}

for some

R_{1} \in ℜ_{Σ}

where

\hat{\cup}, \hat{•}

and

\hat{*}

are operations in

ℜ_{Σ}

with

\hat{\cup} : ℜ_{Σ} \times ℜ_{Σ} ⟶ ℜ_{Σ}

\hat{•} : ℜ_{Σ} \times ℜ_{Σ} ⟶ ℜ_{Σ}

\hat{*} : ℜ_{Σ} ⟶ ℜ_{Σ}

(b) ∃ an injective (one-to-one) mapping

L : ℜ_{Σ} ⟶ P (Σ^{*})

s.t.

(i)

L (a) = {a} \forall a \in Σ

(ii)

L (\hat{ϵ}) = {ϵ}

(iii)

L (\hat{\emptyset}) = \emptyset

(iv)

L (R_{1} \hat{\cup} R_{2}) = L (R_{1}) \cup L (R_{2}) \forall R_{1}, R_{2} \in ℜ_{Σ}

(v)

L (R_{1} \hat{•} R_{2}) = L (R_{1}) • L (R_{2}) \forall R_{1}, R_{2} \in ℜ_{Σ}

(vi)

L (R_{1}^{\hat{*}}) = {(L (R_{1}))}^{*} \forall R_{1} \in ℜ_{Σ}

ℜ_{Σ}

is called the set of all regular expressions over the alphabet Σ.

Any member of

ℜ_{Σ}

is called a regular expression over Σ.

For any regular expression R,

L (R)

is called the language described by R.

While

\hat{\cup}, \hat{•}

and

\hat{*}

are operations in

ℜ_{Σ}

,

\cup, •

and * are set operations in

P (Σ^{*})

.

When there is no danger of confusion,

\hat{\cup}, \hat{•}

and

\hat{*}

are usually written same as

\cup, •

and *.

While

\hat{ϵ}

and

\hat{\emptyset}

are regular expressions, ϵ is the empty string and ∅ is the empty language. When there is no danger of confusion, they are all written as ϵ and ∅.

Proposition 10.

Let Σ be a finite alphabet and

ℜ_{Σ}

be the set of all regular expressions over Σ.

The following statements are true.

(a)

\forall R_{1}, R_{2} \in ℜ_{Σ}, R_{1} \cup R_{2} = R_{2} \cup R_{1}

(b) ∃ regular expressions

\hat{Σ}

and

\hat{Σ^{*}}

such that

L (\hat{Σ}) = Σ

and

L (\hat{Σ^{*}}) = Σ^{*}

. When there is no danger of confusion,

\hat{Σ}

and

\hat{Σ^{*}}

are usually written same as Σ and

Σ^{*}

.

< P r o o f >

(a)

L (R_{1} \cup R_{2}) = L (R_{1}) \cup L (R_{2}

L (R_{2} \cup R_{1}) = L (R_{2}) \cup L (R_{1})

L (R_{1}) \cup L (R_{2}) = L (R_{2}) \cup L (R_{1})

from set theory.

Therefore,

L (R_{1} \cup R_{2}) = L (R_{2} \cup R_{1})

Therefore,

R_{1} \cup R_{2} = R_{2} \cup R_{1}

(L is one-one)

(b) Define

\hat{Σ} = ⋃_{a \in Σ} a

\hat{Σ}

is a regular expression by Definition 1.33(a)(i) and 1.33(a)(iv).

By Definition 1.33(b)(iv)

L (\hat{Σ}) = L (⋃_{a \in Σ} a) = (⋃_{a \in Σ} L (a)) = (⋃_{a \in Σ} {a}) = Σ

Define

\hat{Σ^{*}} = {(\hat{Σ})}^{\hat{*}}

.

\hat{Σ^{*}}

is a regular expression by Definition 1.33(a)(vi).

By Definition 1.33(b)(vi),

L (\hat{Σ^{*}}) = L ({(\hat{Σ})}^{\hat{*}}) = {(L (\hat{Σ}))}^{*} = Σ^{*} (L (\hat{Σ}) = Σ)

Example 1.

Find the language described by

Σ^{*} 1 Σ^{*}

where

Σ = {0, 1}

.

L (Σ^{*} 1 Σ^{*}) = L (Σ^{*}) L (1) L (Σ^{*}) = Σ^{*} {1} Σ^{*} = {w ∣ w h a s a t l e a s t o n e 1}

.

Example 2.

Find the language described by

{(Σ Σ Σ)}^{*}

where

Σ = {0, 1}

.

L ({(Σ Σ Σ)}^{*}) = {(L (Σ Σ Σ))}^{*} = {(L (Σ) L (Σ) L (Σ))}^{*} = {(Σ Σ Σ)}^{*}

= {x y z ∣ x, y, z \in Σ}^{*} = {w ∣ | w | i s a m u l t i p l e o f t h r e e}

.

Lemma 2.

If a language is described by a regular expression, then it is regular. That is, if

A = L (R)

for some

R \in ℜ_{Σ}

, then

A = L (N)

for some finite automaton N.

< P r o o f >

From the formal definition of regular expressions, R is one of the following:

(i) a for some

a \in Σ

(ii)

\hat{ϵ}

(iii)

\hat{\emptyset}

(iv)

R_{1} \hat{\cup} R_{2}

for some

R_{1}, R_{2} \in ℜ_{Σ}

(v)

R_{1} \hat{•} R_{2}

for some

R_{1}, R_{2} \in ℜ_{Σ}

(vi)

R_{1}^{\hat{*}}

for some

R_{1} \in ℜ_{Σ}

In case (i),

L (a) = {a}

and

{a}

can be recognized by the

N F A

defined as follows:

N = ({q_{1}, q_{2}}, Σ_{ϵ}, δ, q_{1}, {q_{2}})

such that

δ (q_{1}, a) = {q_{2}}

,

δ (q, b) = \emptyset \forall q \neq q_{1}, b \neq a

.

In case (ii),

L (ϵ) = {ϵ}

and

{ϵ}

can be recognized by the following

N F A

:

N = ({q_{1}}, Σ_{ϵ}, δ, q_{1}, {q_{1}})

, where

δ (q_{1}, b) = \emptyset \forall b \neq ϵ

and

δ (q_{1}, ϵ) = {q_{1}}

.

In case (iii),

L (\emptyset) = \emptyset

, which is recognized by the following

N F A

:

N = ({q}, Σ_{ϵ}, δ, q, \emptyset)

where

δ (q, b) = \emptyset \forall b \in Σ_{ϵ}

.

In cases (iv), (v) and (vi), R is repeated operations of

\hat{\cup}, \hat{•}

and

\hat{*}

on a, ϵ and ∅. Since we have shown above

L (a)

,

L (ϵ)

and

L (\emptyset)

are regular and we have proved before that regular languages are closed under

\cup, •

and * ,

L (R)

is regular.

Definition 12.

A generalized nondeterministic finite automaton (denoted by

G N F A

) has all the properties as described in Theorem 1.28 and is a 5-tuple,

(Q, Σ, δ, q_{s t a r t}, {q_{a c c e p t}})

where

(i) Q is a finite set of states;

(ii) Σ is a finite alphabet;

(iii)

δ : (Q ∖ {q_{a c c e p t}}) \times (Q ∖ {q_{s t a r t}}) ⟶ ℜ_{Σ}

is the transition function;

(iv)

q_{s t a r t}

is the start state; and

(v)

q_{a c c e p t}

is the accept state.

A

G N F A

accepts a string

w \in Σ^{*}

, if

w = w_{1} w_{2} \dots w_{n}

, where each

w_{i}

is in

Σ^{*}

and a sequence of states

q_{0}, q_{1}, q_{2}, \dots q_{n}

exist such that

(1)

q_{0} = q_{s t a r t}

;

(2)

q_{n} = q_{a c c e p t}

; and

(3) For each i,

w_{i} \in L (R_{i})

where

R_{i} = δ (q_{i - 1}, q_{i})

and

L (R_{i})

is the language described by expression

R_{i}

.

If we write

q_{i} \overset{R, δ}{⟶} q_{j}

instead of

δ (q_{i}, q_{j}) = R

, the definition of acceptance can be written as

q_{s t a r t} = q_{0} \overset{R_{1}, δ}{⟶} q_{1} \overset{R_{2}, δ}{⟶} q_{2} \dots \overset{R_{n}, δ}{⟶} q_{n} = q_{a c c e p t}

with

w_{i} \in L (R_{i})

for

i = 1, 2, \dots, n

.

Lemma 3.

Every

N F A

can be converted into an equivalent

G N F A

.

< P r o o f >

Because of Theorem 1.28, we can start with an

N F A

defined as follows.

N = (Q, Σ, δ, q_{s t a r t}, {q_{a c c e p t}})

where

q_{s t a r t} \neq q_{a c c e p t}

;

δ (q_{a c c e p t}, a) = \emptyset \forall a \in Σ_{ϵ}

; and

q_{s t a r t} \notin δ (q, a) \forall a \in Σ_{ϵ}, q \in Q

.

Define

G N F A

,

N_{G}

as follows:

N_{G} = (Q, Σ, δ_{G}, q_{s t a r t}, {q_{a c c e p t}})

where

δ_{G} : (Q ∖ {q_{a c c e p t}}) \times (Q ∖ {q_{s t a r t}}) ⟶ ℜ_{Σ}

such that:

\forall (q_{i}, q_{j}) \in (Q ∖ {q_{a c c e p t}}) \times (Q ∖ {q_{s t a r t}})

δ_{G} (q_{i}, q_{j}) = R_{i, j}

where

R_{i, j} = ⋃_{w \in S_{i, j}} w

; and

S_{i, j} = {w \in Σ^{*} ∣ q_{i} \overset{w, \hat{δ}}{⟶} q_{j}}

.

Note that if

i = j

,

δ_{G} (q_{i}, q_{i}) = R_{i, i}

,

S_{i, i} = {w \in Σ^{*} ∣ q_{i} \overset{w, \hat{δ}}{⟶} q_{i}}

; and

R_{i, i} = ⋃_{w \in S_{i, i}} w^{*}

\forall (q_{i}, q_{j}), S_{i, j}

is unique and therefore

R_{i, j}

is unique.

Since w is the concatenation of symbols from Σ, and every symbol in Σ is a regular expression, w is a regular expression.

Therefore,

R_{i, j} = ⋃_{w \in S_{i, j}} w

is a regular expression.

Therefore,

δ_{G} (q_{i}, q_{j}) = R_{i, j}

is well defined.

Claim 1. For any string w in

Σ^{*}

,

L (w) = {w}

.

< Proof of Claim 1 >

L (w) = L (a_{1} a_{2} \dots a_{n})

where

a_{i} \in Σ

= L (a_{1}) L (a_{2}) \dots L (a_{n})

= {a_{1}} {a_{2}} \dots {a_{n}}

= {a_{1} a_{2} \dots a_{n}}

= {w}

Claim 2.

\forall w \in Σ^{*}

, N accepts

w \Leftrightarrow N_{G}

accepts w.

< Proof of Claim 2 >

For forward direction

" \Rightarrow "

Let N accepts w where

w = w_{1} w_{2} \dots w_{n}

,

n \geq 1

, and each

w_{i}

is in

Σ^{*}

for

1 \leq i \leq n

.

By theorem of acceptance,

\exists q_{0}, q_{1}, q_{2}, \dots q_{n} \in Q

such that

q_{s t a r t} = q_{0} \overset{w_{1}, \hat{δ}}{⟶} q_{1} \overset{w_{2}, \hat{δ}}{⟶} q_{2} \dots q_{n - 1} \overset{w_{n}, \hat{δ}}{⟶} q_{n} = q_{a c c e p t}

.

Since

q_{i - 1} \overset{w_{i}, \hat{δ}}{⟶} q_{i}, w_{i} \in S_{i - 1, i}

.

By definition of

δ_{G}

,

δ_{G} (q_{i - 1}, q_{i}) = R_{i - 1, i} = ⋃_{w \in S_{i - 1, i}} w

L (δ_{G} (q_{i - 1}, q_{i}))

= L (R_{i - 1, i})

= L (⋃_{w \in S_{i - 1, i}} w)

= ⋃_{w \in S_{i - 1, i}} L (w)

= ⋃_{w \in S_{i - 1, i}} {w}

(By Claim 1)

= S_{i - 1, i}

.

Since

w_{i} \in S_{i - 1, i}

,

w_{i} \in L (R_{i - 1, i})

.

Since

q_{i - 1} \overset{R_{i - 1, i}, δ_{G}}{⟶} q_{i} \forall i = 1, 2, \dots n

,

q_{s t a r t} = q_{0} \overset{R_{0, 1}, δ_{G}}{⟶} q_{1} \overset{R_{1, 2}, δ_{G}}{⟶} q_{2} \dots q_{i - 1} \overset{R_{i - 1, i}, δ_{G}}{⟶} q_{i} \dots q_{n - 1} \overset{R_{n - 1, n}, δ_{G}}{⟶} q_{n} = q_{a c c e p t}

.

N_{G}

accepts w.

Conversely, if

N_{G}

accepts w for

w = w_{1} w_{2} \dots w_{n}

,

n \geq 1

, and each

w_{i}

is in

Σ^{*}

,

\exists q_{0}, q_{1}, q_{2}, \dots q_{n} \in Q

such that

q_{s t a r t} = q_{0} \overset{R_{0, 1}, δ_{G}}{⟶} q_{1} \overset{R_{1, 2}, δ_{G}}{⟶} q_{2} \dots q_{i - 1} \overset{R_{i - 1, i}, δ_{G}}{⟶} q_{i} \dots q_{n - 1} \overset{R_{n - 1, n}, δ_{G}}{⟶} q_{n} = q_{a c c e p t}

with

w_{i} \in L (R_{i - 1, i}) \forall i \in {1, 2, 3, \dots n}

,

R_{i - 1, i} = ⋃_{w \in S_{i - 1, i}} w

and

S_{i - 1, i} = {w \in Σ^{*} ∣ q_{i - 1} \overset{w, \hat{δ}}{⟶} q_{i}}

L (R_{i - 1, i})

= L (⋃_{w \in S_{i - 1, i}} w)

= ⋃_{w \in S_{i - 1, i}} L (w)

= ⋃_{w \in S_{i - 1, i}} {w}

(By Claim 1)

= S_{i - 1, i}

.

\forall i \in {1, 2, 3, \dots n}

,

w_{i} \in L (R_{i - 1, i})

\Rightarrow w_{i} \in S_{i - 1, i}

\Rightarrow q_{i - 1} \overset{w_{i}, \hat{δ}}{⟶} q_{i}

(Definition of

S_{i, j}

)

Therefore,

q_{s t a r t} = q_{0} \overset{w_{1}, \hat{δ}}{⟶} q_{1} \overset{w_{2}, \hat{δ}}{⟶} q_{2} \dots q_{n - 1} \overset{w_{n}, \hat{δ}}{⟶} q_{n} = q_{a c c e p t}

.

Therefore, N accepts

w = w_{1} w_{2} \dots w_{n}

.

N and

N_{G}

are equivalent and the Lemma is proved.

Lemma 4.

Every

G N F A

of n states (

n \geq 2

) can be reduced to an equivalent

G N F A

of 2 states.

< P r o o f >

This lemma can be proved by induction on n.

It is trivial that the statement is true for

n = 2

.

Assume that the statement is true for

n = k \geq 2

.

Let

G = (Q, Σ, δ, q_{s t a r t}, {q_{a c c e p t}})

be a

G N F A

with

k + 1

states.

\exists q_{r i p} \in Q ∖ {q_{s t a r t}, q_{a c c e p t}}

because

k + 1 \geq 3

.

Construct

G^{'} = (Q^{'}, Σ, δ^{'}, q_{s t a r t}, {q_{a c c e p t}})

such that

Q^{'} = Q ∖ {q_{r i p}}

\forall (q_{i}, q_{j}) \in (Q ∖ {q_{a c c e p t}}) \times (Q ∖ {q_{s t a r t}})

,

δ^{'} (q_{i}, q_{j}) = δ (q_{i}, q_{r i p}) {(δ (q_{r i p}, q_{r i p}))}^{*} δ (q_{r i p}, q_{j}) \cup δ (q_{i}, q_{j})

.

Therefore,

Q^{'}

is a

G N F A

with k states.

Let G accept

w = w_{1} w_{2} \dots w_{n}

where each

w_{i} \in Σ^{*}

.

\exists q_{0}, q_{1}, q_{2}, \dots q_{n} \in Q

such that

q_{s t a r t} = q_{0} \overset{R_{1}, δ}{⟶} q_{1} \overset{R_{2}, δ}{⟶} q_{2} \dots q_{i - 1} \overset{R_{i}, δ}{⟶} q_{i} \dots q_{n - 1} \overset{R_{n}, δ}{⟶} q_{n} = q_{a c c e p t}

; and

w_{i} \in L (R_{i}) = L (δ (q_{i - 1}, q_{i}))

.

If none of

q_{0}, q_{1}, q_{2}, \dots q_{n}

is

q_{r i p}

, then they are all in

Q^{'}

.

Also,

w_{i} \in L (δ (q_{i - 1}, q_{i}))

\Rightarrow w_{i} \in L (δ (q_{i - 1}, q_{r i p}) {(δ (q_{r i p}, q_{r i p}))}^{*} δ (q_{r i p}, q_{i})) \cup L (δ (q_{i - 1}, q_{i}))

\Rightarrow w_{i} \in L (δ (q_{i - 1}, q_{r i p}) {(δ (q_{r i p}, q_{r i p}))}^{*} δ (q_{r i p}, q_{i}) \cup δ (q_{i - 1}, q_{i}))

\Rightarrow w_{i} \in L (δ^{'} (q_{i - 1}, q_{i}))

\Rightarrow w_{i} \in L (R_{i}^{'})

where

R_{i}^{'} = δ^{'} (q_{i - 1}, q_{i})

q_{s t a r t} = q_{0} \overset{R_{1}^{'}, δ^{'}}{⟶} q_{1} \overset{R_{2}^{'}, δ^{'}}{⟶} q_{2} \dots q_{i - 1} \overset{R_{i}^{'}, δ^{'}}{⟶} q_{i} \dots q_{n - 1} \overset{R_{n}', δ^{'}}{⟶} q_{n} = q_{a c c e p t}

with

w_{i} \in L (R_{i}^{'})

.

Therefore,

G^{'}

accepts

w = w_{1} w_{2} \dots w_{n}

.

If ∃ some q’s in the sequence

q_{0}, q_{1}, q_{2}, \dots q_{n}

which are

q_{r i p}

,

let

q_{i}

be the first such

q_{r i p}

and

q_{j}

be the first state in the sequence after

q_{i}

such that

q_{j} \neq q_{r i p}

.

q_{i - 1} \overset{R_{i}}{⟶} q_{i} = q_{r i p} \overset{R_{i + 1}}{⟶} q_{r i p} \dots q_{r i p} \overset{R_{j - 1}}{⟶} q_{r i p} \overset{R_{j}}{⟶} q_{j}

.

R_{i + 1} = δ (q_{i}, q_{i + 1}) = δ (q_{r i p}, q_{r i p}) \Rightarrow w_{i + 1} \in L (δ (q_{r i p}, q_{r i p}))

⋮

R_{j - 1} = δ (q_{j - 2}, q_{j - 1}) = δ (q_{r i p}, q_{r i p}) \Rightarrow w_{j - 1} \in L (δ (q_{r i p}, q_{r i p}))

w_{i + 1} \dots w_{j - 1} \in L^{j - i - 1} (δ (q_{r i p}, q_{r i p}))

w_{i + 1} \dots w_{j - 1} \in L^{*} (δ (q_{r i p}, q_{r i p}))

Let

w_{j}^{'} = w_{i} w_{i + 1} \dots w_{j - 1} w_{j}

w_{i} \in L (δ (q_{i - 1}, q_{i}))

and

q_{i} = q_{r i p} \Rightarrow w_{i} \in L (δ (q_{i - 1}, q_{r i p}))

w_{j} \in L (δ (q_{j - 1}, q_{j}))

and

q_{j - 1} = q_{r i p} \Rightarrow w_{j} \in L (δ (q_{r i p}, q_{j}))

{w_{j}}^{'} \in L (δ (q_{i - 1}, q_{r i p})) L^{*} (δ (q_{r i p}, q_{r i p})) L (δ (q_{r i p}, q_{j}))

w_{j}^{'} \in L (δ (q_{i - 1}, q_{r i p})) L^{*} (δ (q_{r i p}, q_{r i p})) L (δ (q_{r i p}, q_{j})) \cup L (δ (q_{i - 1}, q_{j}))

Therefore,

w_{j}^{'} \in L (δ (q_{i - 1}, q_{r i p}) {(δ (q_{r i p}, q_{r i p}))}^{*} δ (q_{r i p}, q_{j}) \cup δ (q_{i - 1}, q_{j}))

w_{j}^{'} \in L (δ^{'} (q_{i - 1}, q_{j}))

w_{j}^{'} \in L (R_{j}^{'})

where

R_{j}^{'} = δ^{'} (q_{i - 1}, q_{j})

If there are no more

q_{r i p}

’s in the sequence,

q_{s t a r t} = q_{0} \overset{R_{1}^{'}, δ^{'}}{⟶} q_{1} \overset{R_{2}^{'}, δ^{'}}{⟶} q_{2} \dots q_{i - 1} \overset{R_{j}^{'}, δ^{'}}{⟶} q_{j} \overset{R_{j + 1}^{'}, δ^{'}}{⟶} q_{j + 1} \dots q_{n - 1} \overset{R_{n}^{'}, δ^{'}}{⟶} q_{n} = q_{a c c e p t}

is the path of acceptance in

G^{'}

for

(w_{1} w_{2} \dots w_{i - 1}) (w_{j}^{'}) (w_{j + 1} \dots w_{n})

,

which is the same as

(w_{1} w_{2} \dots w_{i - 1}) (w_{i} w_{i + 1} \dots w_{j - 1} w_{j}) (w_{j + 1} \dots w_{n})

because

w_{j}^{'} = w_{i} w_{i + 1} \dots w_{j - 1} w_{j}

.

Therefore,

G^{'}

accepts

w = w_{1} w_{2} \dots w_{n}

.

If there are some more

q_{r i p}

’s in the sequence, repeat the above process until all

q_{r i p}

’s are removed and the resulting computation path is the path of acceptance of w in

G^{'}

.

Conversely, if

G^{'}

accepts

w = w_{1} w_{2} \dots w_{n}

where

w_{i} \in Σ^{*}

,

\exists q_{0}, q_{1}, q_{2}, \dots q_{n} \in Q^{'}

such that

q_{s t a r t} = q_{0} \overset{R_{1}^{'}, δ^{'}}{⟶} q_{1} \overset{R_{2}^{'}, δ^{'}}{⟶} q_{2} \dots q_{i - 1} \overset{R_{i}^{'}, δ^{'}}{⟶} q_{i} \dots q_{n - 1} \overset{R_{n}^{'}, δ^{'}}{⟶} q_{n} = q_{a c c e p t}

with

w_{i} \in L (R_{i}^{'})

where

R_{i}^{'} = δ^{'} (q_{i - 1}, q_{i})

.

Therefore,

w_{i} \in L (δ (q_{i - 1}, q_{r i p}) {(δ (q_{r i p}, q_{r i p}))}^{*} δ (q_{r i p}, q_{i}) \cup δ (q_{i - 1}, q_{i}))

Therefore,

w_{i} \in L (δ (q_{i - 1}, q_{r i p}) {(δ (q_{r i p}, q_{r i p}))}^{*} δ (q_{r i p}, q_{i}))

or

w_{i} \in L (δ (q_{i - 1}, q_{i}))

.

If

w_{i} \in L (δ (q_{i - 1}, q_{i}))

,

q_{s t a r t} = q_{0} \overset{R_{1}, δ}{⟶} q_{1} \overset{R_{2}, δ}{⟶} q_{2} \dots q_{i - 1} \overset{R_{i}, δ}{⟶} q_{i} \dots q_{n - 1} \overset{R_{n}, δ}{⟶} q_{n} = q_{a c c e p t}

where

w_{i} \in L (R_{i})

is the acceptance path for

w = w_{1} w_{2} \dots w_{n}

in G.

If

w_{i} \in L (δ (q_{i - 1}, q_{r i p}) {(δ (q_{r i p}, q_{r i p}))}^{*} δ (q_{r i p}, q_{i}))

,

let

w_{i} = w_{i 1} w_{i 2} w_{i 3}

where

w_{i 1} \in L (δ (q_{i - 1}, q_{r i p})) = L (R_{i - 1, r i p})

,

w_{i 2} \in L^{*} (δ (q_{r i p}, q_{r i p})) = L^{*} (R_{r i p, r i p})

, and

w_{i 3} \in L (δ (q_{r i p}, q_{i})) = L (R_{r i p, i})

.

\exists m \geq 0

such that

w_{i 2} \in L^{m} (δ (q_{r i p}, q_{r i p}))

.

w_{i 2} = w_{i 2} (1) w_{i 2} (2) \dots w_{i 2} (m)

where each

w_{i 2} (j) \in L (δ (q_{r i p}, q_{r i p})) = L (R_{r i p, r i p})

.

q_{i - 1} \overset{R_{i - 1, r i p}, δ}{⟶} q_{r i p} \overset{R_{r i p, r i p}, δ}{⟶} q_{r i p} \dots q_{r i p} \overset{R_{r i p, i}, δ}{⟶} q_{i}

is a computation path in G for

w_{i 1} w_{i 2} w_{i 3} = w_{i}

.

This is true for all

1 \leq i \leq n

.

Therefore, there is a computation path in G from

q_{0}

to

q_{n}

for

w_{1} w_{2} \dots w_{n} = w

.

Therefore, G accepts

w = w_{1} w_{2} \dots w_{n}

.

So G and

G^{'}

are equivalent.

Since

G^{'}

has k states, by induction hypothesis,

G^{'}

can be reduced to an equivalent

G N F A

of 2 states.

Hence, G can be reduced to an equivalent

G N F A

of 2 states.

This completes the proof.

Lemma 5.

If an

N F A

,

N = (Q, Σ, δ, q_{0}, F)

is equivalent to a 2-state

G N F A

,

N_{G} = (Q, Σ, δ_{G}, q_{s t a r t}, {q_{a c c e p t}})

, then

L (N) = L (R)

where

R = δ_{G} (q_{s t a r t}, q_{a c c e p t})

.

< P r o o f >

w \in L (N)

\Leftrightarrow N

accepts w

\Leftrightarrow N_{G}

accepts

w

(N and

N_{G}

are equivalent.)

\Leftrightarrow w \in L (R)

(

R = δ_{G} (q_{s t a r t}, q_{a c c e p t})

)

By Lemmas 1.39, 1.40, 1.41, we have the following conclusion:

Lemma 6.

If a language is regular, it is described by a regular expression.

By Lemma 1.37 and Lemma 1.42, we have the following theorem.

Theorem 12.

A language is regular iff some regular expression describes it.

7. Pumping Lemma

Theorem 13.

- Pumping Lemma

Let A be a language.

Let

(S)

denote the following statement:

∃ a number p (the pumping length) where, if s is any string in A of length at least p, then s may be divided into three pieces,

s = x y z

, satisfying the following conditions:

(1) For each

i \geq 0

,

x y^{i} z \in A

,

(2)

| y | > 0

, and

(3)

| x y | \leq p

.

The Pumping Lemma states that A is regular

\Rightarrow (S)

.

< P r o o f >

Since A is regular, there exists a finite automaton

M = (Q, Σ, δ, q_{0}, F)

that recognizes A.

That is,

A = L (M)

.

Let p be the number of states in M.

Let

s = s_{1} s_{2} \dots s_{n}

where each

s_{i} \in Σ

and

0 \leq p \leq n

.

\exists r_{0}, r_{1}, \dots r_{n} \in Q

, such that

q_{0} = r_{0} \overset{s_{1}, δ}{⟶} r_{1} \overset{s_{2}, δ}{⟶} r_{2} \dots r_{n - 1} \overset{s_{n}, δ}{⟶} r_{n}, r_{n} \in F

.

Since

p \leq n, q_{0} = r_{0} \overset{s_{1}, δ}{⟶} r_{1} \overset{s_{2}, δ}{⟶} r_{2} \dots r_{p - 1} \overset{s_{p}, δ}{⟶} r_{p}

is a sub path with

p + 1

states.

Since M has only p states, by the pigeonhole principle,

\exists k, l

such that

0 \leq k < l \leq p

and

r_{k} = r_{l}

.

Let

x = s_{1} s_{2} \dots s_{k}, y = s_{k + 1} s_{k + 2} \dots s_{l}

and

z = s_{l + 1} s_{l + 2} \dots s_{n}

.

Therefore,

r_{0} \overset{x, \hat{δ}}{⟶} r_{k} \overset{y, \hat{δ}}{⟶} r_{l} \overset{z, \hat{δ}}{⟶} r_{n}

.

Since

r_{k} = r_{l}

,

r_{k} \overset{y^{i}, \hat{δ}}{⟶} r_{l} \forall i \geq 0

.

Therefore,

r_{0} \overset{x, \hat{δ}}{⟶} r_{k} \overset{y^{i}, \hat{δ}}{⟶} r_{l} \overset{z, \hat{δ}}{⟶} r_{n}

with

r_{n} \in F

.

Therefore, M accepts

x y^{i} z

.

Therefore,

x y^{i} z \in A

.

Since

k < l

,

| y | > 0

.

| x y | = | x | + | y | = k + l - k = l \leq p

.

This completes the proof of the Pumping Lemma.

Theorem 14.

- Pumping Lemma (contra positive form)

\neg (S) \Rightarrow A

is not regular where

\neg (S)

is equivalent to:

\forall p \geq 1, \exists s \in A

with

| s | \geq p

such that whenever

s = x y z

, at least one of the conditions

(1), (2),

or

(3)

cannot be satisfied.

The contra positive form of the Pumping Lemma is used to prove a language is not regular. The general strategy is to find an

s \in A

with

| s | \geq p

for any given

p \geq 1

so that whenever s is broken into

s = x y z

, at least one of the conditions of

(1), (2),

or

(3)

must be false. This can be usually accomplished by showing one of the following:

(i) Condition 1 alone is false.

(ii) Condition 3

\Rightarrow \neg

(Condition 1)

(iii) (Condition 2 and Condition 3)

\Rightarrow \neg

(Condition 1).

Example 3.

Show that

A = {0^{n} 1^{n} ∣ n \geq 0}

is not regular.

The strategy is to create an s that will force y to contain all 0’s or all 1’s so that when y is pumped indefinitely,

x y^{i} z

will contain too many 0’s or 1’s to make it impossible for

x y^{i} z

to remain in A.

Since Condition 3 requires

| x y | \leq p

, a prefix of

0^{p}

in s will achieve that purpose.

Formally, we make the argument as follows.

\forall p \geq 1

, let

s = 0^{p} 1^{p}

.

s \in A

and

| s | \geq p

.

If

s = x y z

, then

x y z = 0^{p} 1^{p}

.

Condition 3

\Rightarrow | x y | \leq p

\Rightarrow x y

consists of only 0’s

\Rightarrow y

consists of only 0’s.

| x y y z | = | x y z | + | y |

.

Since Condition 2 requires

| y | > 0

,

x y y z

adds a positive number of 0’s to

x y z

.

Since

x y z

has equal numbers of 0’s and 1’s,

x y y z

must have more 0’s than 1’s and hence is not in A.

Therefore, (Condition 2 + Condition 3)

\Rightarrow \neg

(Condition 1) and hence A is not regular.

Example 4.

Show that

A = {w w ∣ w \in {0, 1}^{*}}

is not regular.

The strategy is to create an s with some leading 0’s on the left, say

0^{m}

but we also want to make sure that

0^{m}

is long enough to force

x y

to contain all 0’s in it so that when y is pumped up indefinitely, it will create too many 0’s to make it impossible for

s = w w

.

Since Condition 3 requires

| x y | \leq p

, we want to make

m \geq p

.

A natural candidate for s is therefore

0^{p} 10^{p} 1

.

To prove that this construction works, however, requires some algebraic manipulation.

Formally, we make the argument as follows.

\forall p \geq 1

, take

s = 0^{p} 10^{p} 1

.

If

s = x y z

, then

x y z = 0^{p} 10^{p} 1

.

Condition 3

\Rightarrow | x y | \leq p

\Rightarrow x y

consists of only 0’s

\Rightarrow y

consists of only 0’s.

Let

x y^{i} z = 0^{p^{'}} 10^{p} 1

where

p^{'} - p = (i - 1) | y |

or

p^{'} = p + (i - 1) | y |

.

For

i > 3

,

p^{'} > p + (3 - 1)

(

| y | \geq 1

by Condition 2)

Therefore,

p^{'} > p + 2

for

i > 3

.

Assume for contradiction that

\forall i \geq 0

,

x y^{i} z \in A

.

That is,

x y^{i} z = 0^{p^{'}} 10^{p} 1 = w w

.

For all

i > 3

,

| w |

= \frac{| 0^{p^{'}} 10^{p} 1 |}{2}

= \frac{p^{'} + p + 2}{2}

> \frac{p + 2 + p + 2}{2}

(

p^{'} > p + 2

for

i > 3

)

= p + 2

Therefore,

| w | > | 10^{p} 1 |

.

This implies w consists of at least two 1’s.

On the other hand,

p^{'} + p + 2 = 2 | w |

.

p^{'} - | w | = | w | - (p + 2) > 0

p^{'} > | w |

This implies w must consist of all 0’s.

This leads to a contradiction.

Therefore, (Condition 2 + Condition 3)

\Rightarrow \neg

(Condition 1) and hence A is not regular.

Example 5.

Show that

A = {1^{n^{2}} ∣ n \geq 0}

is not regular.

The idea behind this problem is every time we pump up y, we increase the length of s by an amount of

| y |

which is bounded by p and p is fixed. On the other hand, s has to be the square of a natural number and the difference between two consecutive squares, say

n^{2}

and

{(n + 1)}^{2}

will grow to infinity as n goes to infinity. In this case, we don’t have to worry about how to create more 0’s in s so as to outnumber the 1’s or vice versa. This particular nature of s will automatically lead to a contradiction to Condition 1 as

| s |

grows to infinity.

Proving this to work requires some algebraic manipulation.

The formal argument is made as follows.

\forall p \geq 1

, take

s = 1^{p^{2}}

p \geq 1

\Rightarrow p (p - 1) \geq 0

\Rightarrow p^{2} \geq p

\Rightarrow | 1^{p^{2}} | \geq | 1^{p} | = p

Therefore,

| s | \geq p

.

Assume for contradiction that Condition 1 is true.

That is,

\forall i \geq 0

,

x y^{i} z \in A

.

Both

x y^{i} z

and

x y^{i + 1} z

are in A.

Let

x y^{i} z = 1^{n^{2}}

and

x y^{i + 1} z = 1^{m^{2}}

where m and n are positive integers.

| x y^{i} z | = n^{2}

and

| x y^{i + 1} z | = m^{2}

.

By Condition 2,

| y | \geq 1

\Rightarrow | y^{i + 1} | > | y^{i} |

\Rightarrow | x y^{i + 1} z | > | x y^{i} z |

\Rightarrow m^{2} > n^{2}

\Rightarrow m > n

\Rightarrow m \geq n + 1

By Condition 3,

| x y | \leq p \Rightarrow | y | \leq p

.

Therefore,

| x y^{i + 1} z | - | x y^{i} z | = | y | \leq p

.

Therefore,

m^{2} - n^{2} \leq p

.

{(n + 1)}^{2} - n^{2} \leq m^{2} - n^{2} \leq p

.

2 n + 1 \leq p

.

n \leq \frac{p - 1}{2} \dots \dots (1)

where

(1)

is true for all i.

On the other hand,

Condition 2

\Rightarrow | y | \geq 1 \Rightarrow | y^{i} | \geq i

.

n^{2} = | x | + | y^{i} | + | z | \geq | y^{i} | \geq i

.

n \geq \sqrt{i}

for all i.

For

i > \frac{{(p - 1)}^{2}}{4}

,

\sqrt{i} > \frac{p - 1}{2}

and

n > \frac{p - 1}{2}

This contradicts

(1)

which is true for all i.

Therefore, (Condition 2 + Condition 3)

\Rightarrow \neg

(Condition 1) and hence A is not regular.

8. Myhill-Nerode Theorem

Definition 13.

\forall x, y \in Σ^{*}, L \subset Σ^{*}

,

we say that x and y are indistinguishable by L iff

\forall z \in Σ^{*}, x z \in L \Leftrightarrow y z \in L

.

We say that x and y are distinguishable by L iff there exists

z \in Σ^{*}

such that exactly one of

x z

and

y z

is in L.

If x and y are indistinguishable by L, we write

x \equiv_{L} y

.

Proposition 11.

\equiv_{L}

is an equivalence relation.

< P r o o f >

\forall x \in L, x z \in L \Leftrightarrow x z \in L \forall z \in Σ^{*}

x \equiv_{L} x

\equiv_{L}

is reflexive.

\forall x, y \in L

,

x \equiv_{L} y

\Rightarrow (\forall z \in Σ^{*}, x z \in L \Leftrightarrow y z \in L)

\Rightarrow (\forall z \in Σ^{*}, y z \in L \Leftrightarrow x z \in L)

\Rightarrow y \equiv_{L} x

\equiv_{L}

is symmetric.

\forall x, y, w \in Σ^{*}

,

(x \equiv_{L} y) \land (y \equiv_{L} w)

\Rightarrow (\forall z \in Σ^{*}, x z \in L \Leftrightarrow y z \in L) \land (\forall z \in Σ^{*}, y z \in L \Leftrightarrow w z \in L)

\Rightarrow (\forall z \in Σ^{*}, x z \in L \Leftrightarrow w z \in L)

\Rightarrow x \equiv_{L} w

\equiv_{L}

is transitive.

Proposition 12.

\equiv_{L}

is right congruence. That is

x \equiv_{L} y \Rightarrow x a \equiv_{L} y a \forall a \in Σ

.

< P r o o f >

\forall z \in Σ^{*}, a \in Σ

,

x a z \in L \Leftrightarrow y a z \in L

(

x \equiv_{L} y

)

x a \equiv_{L} y a

(Definition of

\equiv_{L}

)

Proposition 13.

\forall x, y \in Σ^{*}, (x \equiv_{L} y) \Rightarrow (x \in L \Leftrightarrow y \in L)

< P r o o f >

Take

z = ϵ

.

x ϵ \in L \Leftrightarrow y ϵ \in L

Therefore,

x \in L \Leftrightarrow y \in L

.

Theorem 15.

- Myhill-Nerode Theorem

Let

L \subset Σ^{*}, X \subset Σ^{*}

.

X is said to be pairwise distinguishable by L iff every two distinct strings in X are

distinguishable by L.

The index of L is defined as

I n d e x L = m a x {| X | ∣ X i s p a i r w i s e d i s t i n g u i s h a b l e b y L}

.

The following statements are true:

(a)

If L is recognized by a

D F A

with k states, L has an index at most k.

(b)

If the index of L is a finite number k, it is recognized by a

D F A

with k states.

(c)

L is regular iff it has finite index. Moreover, its index is the size of the smallest

D F A

recognizing it.

< P r o o f >

(a)

Let

M = (Q, Σ, δ, q_{0}, F)

be a

D F A

with k states that recognizes L.

Assume for contradiction that L has an index greater than k.

\exists X

(pairwise distinguishable by L) that has more than k members.

Let

s_{1}, s_{2}, s_{3} \dots s_{k + 1}

be

k + 1

distinct and pairwise distinguishable members in X.

\hat{δ} (q_{0}, s_{1}), \hat{δ} (q_{0}, s_{2}), \hat{δ} (q_{0}, s_{3}), \dots \hat{δ} (q_{0}, s_{k + 1})

are

k + 1

states in Q.

Since

| Q | = k

, by the pigeonhole principle, there are

i, j

where

1 \leq i < j \leq k + 1

s.t.

\hat{δ} (q_{0}, s_{i}) = \hat{δ} (q_{0}, s_{j})

.

\forall z \in Σ^{*}

,

s_{i} z \in L

\Leftrightarrow \hat{δ} (q_{0}, s_{i} z) \in F

(M recognizes L)

\Leftrightarrow \hat{δ} (\hat{δ} (q_{0}, s_{i}), z) \in F

(Proposition 1.14)

\Leftrightarrow \hat{δ} (\hat{δ} (q_{0}, s_{j}), z) \in F

\Leftrightarrow \hat{δ} (q_{0}, s_{j} z) \in F

(Proposition 1.14)

\Leftrightarrow s_{j} z \in L

(M recognizes L)

Therefore,

s_{i} \equiv_{L} s_{j}

(Definition of

\equiv_{L}

)

This contradicts the assumption that X is pairwise distinguishable by L.

(b)

Let

X = {s_{1}, s_{2} \dots, s_{k}}

be pairwise distinguishable by L.

Claim 1.

I n d e x

L \geq 2 \Rightarrow L \neq \emptyset

and hence

L = \emptyset \Rightarrow I n d e x

L = 1

.

I n d e x

L \geq 2

\Rightarrow \exists X

(pairwise distinguishable by L) that has at least 2 members.

\Rightarrow \exists s_{i}, s_{j} \in X

where

s_{i} \neq s_{j}

and

s_{i}, s_{j}

are distinguishable by L.

\Rightarrow \exists z \in Σ^{*}

s.t.

s_{i} z \in L

and

s_{j} z \notin L

or vice versa.

\Rightarrow L \neq \emptyset

.

Since

L = \emptyset \Rightarrow I n d e x

L < 2

or

I n d e x

L = 1

,

I n d e x

L is defined to be 1 whenever

L = \emptyset

.

Claim 2.

\forall w \in Σ^{*}

, there is one and only one

s_{w} \in X

s.t.

w \equiv_{L} s_{w}

. Hence by taking

w = ϵ

,

there is one and only one

s_{ϵ} \in X

s.t.

ϵ \equiv_{L} s_{ϵ}

.

Either

w \in X

or

w \notin X

.

If

w \in X, \exists s_{i} \in X

s.t.

w = s_{i}

.

Call this

s_{w}

so that

w = s_{i} = s_{w}

.

Since

\equiv_{L}

is reflexive, it follows that

w \equiv_{L} s_{w}

.

If

w \notin X

, w must be indistinguishable with a member of X otherwise it will contradict

the assumption that

I n d e x

L = k

.

Therefore,

w \equiv_{L} s_{w}

for some

s_{w} \in X

.

Either case,

w \equiv_{L} s_{w}

for some

s_{w} \in X

.

If there is another

s_{w}^{'} \in X

s.t.

w \equiv_{L} s_{w}^{'}

, then

s_{w} \equiv_{L} s_{w}^{'}

because

\equiv_{L}

is transitive.

This contradicts the assumption that X is pairwise distinguishable by L.

Therefore,

s_{w}

is unique.

Claim 3. If

L \neq \emptyset

then

L \cap X \neq \emptyset

L \neq \emptyset \Rightarrow \exists w \in L

By Claim 2, there is one and only one

s_{w} \in X

s.t.

w \equiv_{L} s_{w}

.

By Proposition 1.52,

w \in L \Leftrightarrow s_{w} \in L

.

Therefore,

s_{w} \in L \cap X

.

Therefore,

L \cap X \neq \emptyset

.

This completes proof of Claim 3.

If Index

L = k = 1

,

L = \emptyset

which is recognized by the one-state

D F A

,

M = ({q}, Σ, δ, q, \emptyset)

where

δ (q, b) = \emptyset \forall b \in Σ

.

If

I n d e x

L = k \geq 2

,

\exists X = {s_{1}, s_{2}, s_{3} \dots s_{k}}

where X is pairwise distinguishable by L.

Let

Q = {q_{1}, q_{2}, q_{3} \dots q_{k}}

Let

f : X ⟶ Q

such that

f (s_{i}) = q_{i} \forall i

with

1 \leq i \leq k

f is bijective (one-one and onto).

\forall q_{i} \in Q, \exists

a unique

s_{i} \in X

s.t.

f (s_{i}) = q_{i}

since f is bijective.

\forall a \in Σ, \exists

a unique

s_{j} \in X

s.t.

s_{i} a \equiv_{L} s_{j}

by Claim 2.

Since f is a bijective mapping, there is a unique

q_{j}

such that

f (s_{j}) = q_{j}

.

Let

M = (Q, Σ, δ, q_{0}, F)

where

δ : Q \times Σ ⟶ Q

s.t.

δ (q_{i}, a) = q_{j}

where

a \in Σ, q_{i}, q_{j} \in Q

s.t.

f (s_{i}) = q_{i}, f (s_{j}) = q_{j}

where

s_{i} \in X, s_{j} \in X

and

s_{i} a \equiv_{L} s_{j}

.

If there is another

q_{k} \in Q

such that

δ (q_{i}, a) = q_{k}, \exists s_{k} \in X

such that

f (s_{k}) = q_{k}

and

by definition of

δ, s_{i} a \equiv_{L} s_{k}

.

Since

\equiv_{L}

is transitive,

s_{j} \equiv_{L} s_{k}

.

This contradicts that both

s_{j}

and

s_{k}

are in X and hence must be distinguishable by L.

Therefore,

δ (q_{i}, a) = q_{j}

is uniquely defined.

q_{0} = q_{ϵ}

where

q_{ϵ} = f (s_{ϵ})

and

s_{ϵ}

is defined in Claim 2.

F = {f (s) ∣ s \in L \cap X}

F \neq \emptyset

because of Claim 1 and Claim 3.

Claim 4.

\forall w \in Σ^{*}

,

\hat{δ} (q_{ϵ}, w) = q_{i} \Leftrightarrow w \equiv_{L} s_{i}

where

f (s_{i}) = q_{i}

.

Claim 4 can be proved by induction on

| w |

.

For

w = ϵ

, there exists one and only one

s_{ϵ} \in X

s.t.

ϵ \equiv_{L} s_{ϵ}

by Claim 2.

\hat{δ} (q_{ϵ}, w) = q_{i}

\Leftrightarrow \hat{δ} (q_{ϵ}, ϵ) = q_{i}

(

w = ϵ

)

\Leftrightarrow q_{ϵ} = q_{i}

(Definition of 1.4(i))

\Leftrightarrow f (s_{ϵ}) = q_{i}

(Definition of

q_{ϵ}

)

\Leftrightarrow f (s_{ϵ}) = f (s_{i})

(Definition of

q_{i}

)

\Leftrightarrow s_{ϵ} = s_{i}

(f is bijective)

\Leftrightarrow ϵ \equiv_{L} s_{i}

(

ϵ \equiv_{L} s_{ϵ}

by Claim 2)

\Leftrightarrow w \equiv_{L} s_{i}

(

w = ϵ

)

The statement is true for

w = ϵ

.

Let

\hat{δ} (q_{ϵ}, w a) = f (s_{i}) = q_{i}

.

δ (\hat{δ} (q_{ϵ}, w), a) = f (s_{i}) = q_{i}

.

\exists q_{j}

s.t.

\hat{δ} (q_{ϵ}, w) = q_{j}

\exists s_{j}

s.t.

f (s_{j}) = q_{j}

and

w \equiv_{L} s_{j}

(By induction hypothesis)

w a \equiv_{L} s_{j} a

(

\equiv_{L}

is right congruence by Proposition 1.51)

δ (q_{j}, a) = q_{i}

\Rightarrow s_{j} a \Rightarrow_{L} s_{i}

(Definition of δ)

\Rightarrow w a \equiv_{L} s_{i}

(

\equiv_{L}

is transitive)

Conversely, if

w a \equiv_{L} s_{i}

for some

s_{i} \in X

,

\hat{δ} (q_{ϵ}, w a)

= δ (\hat{δ} (q_{ϵ}, w), a)

= δ (q_{j}, a)

where

q_{j} = \hat{δ} (q_{ϵ}, w)

By induction hypothesis,

w \equiv_{L} s_{j}

because

\hat{δ} (q_{ϵ}, w) = q_{j}

.

w a \equiv_{L} s_{j} a

(Right congruence by Proposition 1.51)

Let

δ (q_{j}, a) = q_{k}

s_{j} a \equiv_{L} s_{k}

(By definition of δ)

w a \equiv_{L} s_{k}

(

\equiv_{L}

is transitive)

w a \equiv_{L} s_{i}

(Assumption)

s_{k} = s_{i}

(Claim 2)

f (s_{k}) = f (s_{i})

q_{k} = q_{i}

\hat{δ} (q_{ϵ}, w a)

= δ (q_{j}, a)

= q_{k}

= q_{i}

This completes the proof of Claim 4.

It remains to prove

L = L (M)

.

\forall w \in L, \exists

one and only one

s_{i} \in X

s.t.

w \equiv_{L} s_{i}

(By Claim 2)

w \in L \Leftrightarrow s_{i} \in L

(Proposition 1.52)

Therefore,

s_{i} \in L

(

w \in L

)

Since

s_{i} \in L \cap X

and

q_{i} = f (s_{i})

,

q_{i} \in F

(Definition of F)

\hat{δ} (q_{ϵ}, w) = q_{i}

(Claim 4)

\hat{δ} (q_{0}, w) = q_{i}

(

q_{0} = q_{ϵ}

)

M accepts

w

(

q_{i} \in F

)

Conversely, if M accepts w,

\hat{δ} (q_{ϵ}, w) = q_{i}

and

q_{i} \in F

(

q_{0} = q_{ϵ}

)

w \equiv_{L} s_{i}

where

q_{i} = f (s_{i})

(Claim 4)

w \in L \Leftrightarrow s_{i} \in L

(Proposition 1.52)

Since

q_{i} \in F

and

q_{i} = f (s_{i})

,

s_{i} \in L \cap X

by definition of F.

Therefore,

s_{i} \in L

.

Therefore,

w \in L

.

L = L (M)

and M has k states.

(c)

L is regular

\Rightarrow \exists M

s.t.

L = L (M)

\Rightarrow I n d e x

L \leq k

where

k =

the number of states in

M

(by (a))

\Rightarrow L

has a finite index

L has a finite index

\Rightarrow I n d e x

L = k

\Rightarrow L = L (M)

for some k-state

D F A

M

(by (b))

\Rightarrow L

is regular

Assume for contradiction that there is a

k^{'}

-state

D F A

accepting L where

k^{'} < k

.

By (a),

I n d e x

L \leq k^{'}

.

This would contradict

k^{'} < k = I n d e x

L.

9. An Application of the Myhill-Nerode Theorem

The Myhill-Nerode Theorem can be used to determine whether a language L is regular

or non-regular by determining the number of members in X, the set that is pairwise

distinguishable by L.

Example 6.

Determine if

L = {a^{n} b^{n} ∣ n \geq 0}

is regular.

Consider

X = {a, a^{2}, a^{3} \dots}

∀ distinct

x, y \in X, x = a^{i}, y = a^{j}

where

1 \leq i < j < \infty

\exists z = b^{i}

such that

x z = a^{i} b^{i} \in L

and

y z = a^{j} b^{i} \notin L

.

x and y are distinguishable by L. (

x {\neg \equiv}_{L} y

)

X is pairwise distinguishable by L.

I n d e x

L \geq | X |

I n d e x

L is infinite.

L is not regular.

References

Sipser, Michael. Introduction to the Theory of Computation, Third Edition.
Dexter C. Kozen. Automata & Computability.
John E. Hopcroft, Rajeev Motwani, Jeffrey D Ullman. Introduction to Automata Theory, Languages, & Computation, Third Edition.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

A Mathematical Approach to the Theory of Finite Automata

Abstract

Keywords:

Subject:

1. Deterministic Finite Automaton (DFA)

2. Nondeterministic Finite Automaton (NFA)

3. Epsilon-Closure

4. The Equivalence of DFA and NFA

5. Regular Operators

6. Regular Expression

7. Pumping Lemma

8. Myhill-Nerode Theorem

9. An Application of the Myhill-Nerode Theorem

References

MDPI Initiatives

Important Links

Subscribe