Functional Language Logic

Vincenzo Manca

doi:10.20944/preprints202412.2423.v1

Submitted:

27 December 2024

Posted:

30 December 2024

You are already at the latest version

Abstract

The formalism of Functional Language Logic (FLL) is presented, which is an extension of the logical formalism introduced in "Information MDPI" (15, 1, 64, 2024) for representing sentences in natural languages. In the FLL framework, a sentence is represented by aggregating primitive predicates corresponding to words of a fixed language (English in the given examples). The FLL formalism constitutes a bridge between mathematical logic (high-order predicate logic) and classical logical analysis of discourse, rooted in the Western linguistic tradition. Namely, FLL representations reformulate on a rigorous logical basis many fundamental classical concepts (complementation, modification, determination, distribution, \ldots), becoming, at the same time, a natural way of introducing mathematical logic through natural language representations, where the logic of linguistic phenomena is analyzed independently from the single syntactical and semantical choices of particular languages. In FLL, twenty logical operators express the mechanisms of logical aggregation underlying meaning constructions. The relevance of FLL in Chatbot interaction is considered, and the relationship between embedding vectors of LLM (Large Language Models) transformers and FLL representations is outlined.

Keywords:

Natural Language Processing

;

Logical Semantics

;

High-Order Logic

;

Machine Learning

;

Large Language Models

;

Transformers

Subject:

Computer Science and Mathematics - Artificial Intelligence and Machine Learning

1. Introduction

The logic of natural language is an old investigation field going back to Aristotile’s logic, the middle-age Scholastic philosophy, and Leibniz’s investigation at the beginning of mathematical logic [34]. In his book about the mathematical analysis of logic [4], George Boole emphasizes the logical basis of natural language. In 1979 Gottlob Frege [13] defined First-order Predicate Logic as a complete conceptual framework. Frege’s language includes predicates of any number of arguments, individual constants and variables, propositional connectives, and quantifiers (universal and existential).

In Principia Mathematica [37], Bertrand Russell and Alfred Withehead introduced the theory of logical types as a remedy to the logical paradoxes discovered within the foundation of mathematics.

In 1928, Alonzo Church introduced the lambda notation, and in 1940 the lambda typed calculus [6]. However, the notion of function, defined by a mathematical formula, goes back to Leonard Euler [10] and Gottlob Frege [13], who realized the functional nature of predicates. Mathematical function resulted in a powerful foundational concept, in mathematical logic, in computability, up to the new frontiers of artificial intelligence [3,6,14,19,23,25,26,31,33,40]. Hans Reichenbach developed a logical analysis of the conversation language in a chapter of his book on mathematical logic [36].

In 1970, Richard Montague wrote the paper “English as a formal language" where typed lambda calculus and high-order logic are combined to represent ordinary discourse, and several papers on the same line followed [9,27,28,29,39].

The elimination of variables is a problem intensively investigated in mathematical logic by many authors, such as Moses Schönfinkel, Harshel Curry, Robert Feys, Alfred Tarski, and Leon Henkin [7,16]. In natural language, neither apparent nor free variables are used, therefore a logical analysis of language has to cope with this phenomenon for a full comprehension of its internal mechanisms. We will show, that this aspect is strictly related to the monadic nature of predicates and the possibility of having high-order predicates.

In [22] a formalism of logical semantics for natural languages was introduced within the High-order Monadic Logic HML, which is essentially a typed lambda calculus based on unary functions with a new logical operator of “Predicate Abstraction" making logical representations of sentences completely adherent to the usual linguistic constructions. In the same paper, an experiment is reported on teaching the given logical formalism to ChatCPT3.5. This shows interesting perspectives on the interaction with chatbots, revealing a surprising ability to use such logical formalism.

In this paper, the approach of [22] is developed, by defining the more complete and motivated formalism of Functional Language Logic (FLL), strictly related to the classical logical analysis of sentences. Any word in a given dictionary is a unary predicate, a function from individuals to truth values. Twenty logical operators express the logical aggregations underlying the main linguistic constructions (Table 23 ).

The functional types for the predicates, individuals, substantives, propositions, hyper-predicates, and ad-predicates are introduced. Hyper-predicates are predicates that apply to predicates and give new propositions, whereas ad-predicates apply to predicates and give new predicates.

The following sections are devoted to specific linguistic phenomena. Direct and indirect complementations are reduced to the application of a complementation operator that, in the case of a direct complementation, takes a substantive, producing a new predicate, while in the case of an indirect complementation, takes an atomic proposition, again giving a new predicate. Modification is the operator transforming a predicate into an ad-predicate.

Predication, complementation, and modification are the main linguistic constructs. Specification is a case of complementation, which is considered apart due to its generality and importance.

Descriptive operators apply to predicates and provide substantives. arguments are then considered.

Finally, operators for managing with context, references, and performatives are considered.

Chatbots, preconceived in [40], are a frontier of artificial intelligence, their acquisition of complex and articulated competencies in dialogic activity with humans confirms the essential role of natural language in constructing the conceptual organization of cognitive systems. Namely, Greeks used the same word, “Logos" meaning either language or reason. This consideration suggests that FLL could be a strategic tool in teaching chatbots to acquire sophisticated competencies in logical analysis [22,24].

In conclusion, a topic for further research is addressed, which is related to the relationship between FFL logical representation of meanings and the embedding vectors of LLM transformers in modern conversational systems.

2. Material and Methods

2.1. Logical Symbols and Operators

Given an expression

E (x)

built with operations applied to constant and variables, where the variable x ranges on the class A, and taking values in the class B, we denote by

x . E (x)

the function from A to B associating to any element

a \in A

the value

E (b)

assumed by

E (x)

when x takes the value a. Alonzo Church introduced the lambda notation

λ x . E (x)

to stress that the function does not depend on the chosen variable, we omit the symbol

λ

for a shorter notation. If in

x . E (x)

variable x is replaced by y (which takes the same values as x does), for every a:

(x . E (x)) (a) = (y . E (y)) (a) = E (a)

therefore:

x . E (x) = y . E (y)

that is, the two

λ

-expressions are different names for the same function.

A predicate is a symbol denoting a function from a set of individuals to a set of two truth values, we denote by

T, ⊥

. In the following, we will use predicates with only one argument (monadic) or zero arguments. Monadic predicates are called Properties, while predicates with no arguments are Propositions, which can also be considered symbols for truth values. Letters

P, Q, R, \dots

, possibly with indices, will denote predicates. Symbols

a, b, c, \dots

, possibly with indices, are individual constants (names of particular individuals), and symbols

x, y, z, \dots

, possibly with indices, are individual variables. Letters

X, Y, Z, \dots

, possibly with indices, are predicate variables.

Symbols

\neg, \to, \land, \lor, \leftrightarrow

, called connectives, are operations on truth values with the following meanings:

\neg T = ⊥, \neg ⊥ = T

;

P \to Q = ⊥

iff

P = T, Q = ⊥

;

P \land Q = T

iff

P = T, Q = T

;

P \lor Q = ⊥

iff

P = ⊥, Q = ⊥

(whence

T = \neg P \lor P

);

P \leftrightarrow Q

iff

P = Q

.

Symbols

\forall, \exists

, called quantifiers, are operations such that

\forall P (x) = T

iff

x . P (x) = x . T

;

\exists P (x) = ⊥

iff

x . P (x) = x . ⊥

.

Connectives and quantifiers can also be easily seen as operators over predicates, for example,

P \to Q = x . (P (x) \to Q (x))

,

\forall P = \forall x . P (x)

.

2.2. Abstraction Operators

The operator of class abstraction transforms a monadic predicate P in the class A of the values on which the predicate holds (gives truth value T). This operator was initially defined by Georg Cantor and formalized by Bertrand Russed with the notation

\hat{x} . P (x)

. Nowadays, the commonly used notation for classes is

A = {x | P (x)}

.

The notion of type is analogous to that of a class; it is used in many contexts and with many specific senses. We write

a : t

to denote that a has a type of t. Of course, the elements of a given type provide a class, and analogously, having a type is a property. Therefore, a type can be assimilated into the concepts of class and property, even if its meaning is more related to that one of a symbolic mark to attach to objects for categorizing them.

However, it is useful to distinguish similar concepts because, in many complex analyses, these notions refer to different levels of a discourse that are useful to consider separately. For example, the names of things are different from things in themselves, and operating with names can provide useful possibilities better dealt with in specific contexts by avoiding any confusion with things and operations on them. If

x : s

and

E (x) : t

, we denote by

(s \mapsto t)

the type of

x . E (x)

; if A is a class of elements of type t, then we denote by

[t]

the type of A. Therefore, types can be arranged in expressions of increasing complexity:

t, (s \mapsto t), [s], ([s] \mapsto t), \dots

. The maximum number of nested pairs of parentheses or brackets in the expression of a type provides the logical order of that type.

In 1901, Bertrand Russel discovered a logical paradox related to the intuitive notion of class. Namely, some autoreferential conditions (the class of classes that do not belong to themselves) are contradictory. Axiomatic set theories were developed, which define sets as special classes regulated by axioms, avoiding paradoxes. Type theory, elaborated by Bertrand Russel and Alfred Whitehead, overcomes paradoxes by assigning types for dealing with high-order predicates that apply to predicates as arguments [18]. In natural language, expressions such as

P a s t (P)

or

Y e s t e r d a y (P)

are typical examples of predicates taking predicates as arguments.

Two expressions denoting individuals are equal when they denote the same individual. We can express equality by using monadic properties according to the following trick. We add for every individual a a predicate

E_{a}

that holds on an individual b if it denote the individual denoted by a:

E_{a} (b) \leftrightarrow a = b .

Analogously, a binary operation, such as +, can be expressed by using the monadic operation

S u m_{3} = x . 3 + x

, namely:

S u m_{3} (5) = 8

In this way, an operation’s argument becomes a parameter embedded in a unary operation, giving the same result as a binary operation on two arguments. This phenomenon is crucial in natural language, allowing for a monadic representation of all predicates occurring in linguistic constructions.

The operator of predicative abstraction allows for raising the logical order of a predicate. If

P r e d

is a predicate, we denote by

{}^{^}P r e d

its predicative abstraction, for which:

{}^{^}P r e d (P) \leftrightarrow (P \to P r e d)

that is,

{}^{^}P r e d

is a hyperpredicate (with respect to

P r e d

), which holds over all predicates that imply

P r e d

. This means that the implication

P \to P r e d

becomes the application of a hyper-predicate

{}^{^}P r e d (P)

telling that P is an implicant of

L o v e

, which is different from

L o v e (a)

, where a is a loving individual. Namely, P is a predicate, then it does not love, being

L o v e

a property of individuals.

Figure 1. A graphical representation of the predication Good(a) in a direct way (bottom) and trough predicative abstraction (Top).

In the following, we will write $L o v e (P)$ for abbreviating ${}^{^}L o v e (P)$ because the type raising is implicitly deduced by the argument, which is a predicate rather than an individual.

2.3. Complementation and Descriptive operators

Some other operators will be introduced in the next sections. Two of them change the logical type of expression and correspond to linguistic constructs that change the linguistic category of expressions. They are the complementation operator, denoted by underscore _, and the specification operator (a special case of complementation) denoted by a suffix-dot. The other two operators are descriptive because tale predicates and provide substantives. They are determination operator

ι

, due to Giuseppe Peano [35], and choice operator

ε

, due to David Hilbert [18] complete our list.

2.4. Comparison with Related Logical Formalisms

Many formalisms are aiming at representing sentences logically. After the mentioned approach inaugurated by Richard Montague, in many fields, such as logic, linguistics, philosophy, computer science, knowledge representation, and other related fields, the search for logical representations coupling rigor with simplicity and adequacy was always very active and oriented to many specific requirements of some applicative contexts. In the setting of logical approaches, let us mention the works in the context of the CSLI (Center for the Study of Language and Information), especially the Situation Logic, the Natural Language Semantics, and the intensional logic [2,8,11,12]. For many aspects, FLL has common features with these approaches, but two important characteristics distinguish it properly. It is directly related to the traditional logical analysis of the discourse, which is very popular in the educational curricula, especially in the context of classical dead languages (Greek, Latin); moreover, it is based on a limited number of logical symbols applied to the words of fixed dictionaries, which makes it very simple to learn, even without entering in its complex logical basis.

The formalism of FLL is concerned with the logical representations of sentences. However, for completeness, we want to mention some topics that are important in logical formalisms but outside the scope of the paper. One of these topics is concerned with formal deductions (Proof Theory), realized by suitable deductive algorithms; the other refers to the interpretations of formulas within mathematical structures (Model Theory) [6,18,21,38].

The first logical calculus was elaborated for Predicative Logic by Gottlob Frege for Predicate Logic [13]. This logic has constant and individual variables, predicative constants denoting relations of any number of arguments on individuals, together with connectives and quantifiers. Frege’s predicative calculus, and many other equivalents to it, resulted to be complete, that is, it can deduce all the logical consequences deriving from a list of axioms. A logical consequence of a set of propositions

T

is a formula true in all the models where the propositions of

T

are true.

In Model Theory, a model M is associated with a class

T

of formulae (a theory) when all the formulae of

T

, according to suitable interpretation rules, are true in M. For Predicative Logic, a fundamental result, known as Löwenheim-Skolem Theorem, holds according to which any coherent theory (where ⊥ cannot be deduced) can be interpreted in the domain of natural numbers [6,18].

3. Results

Let us assume that all words of a given language, in our case, English (written with capital initial letters), are monadic predicates of some logical order. Sometimes, for a better reading of complex formulas, we will use the inverse parentheses notation by writing

) a (P

instead of

P (a)

.

3.1. Categories and Functional Types

The formalism FLL has the following categories of expressions, and some of them will receive some types:

1) Arguments, of type

a r g

, is the category of any expression that occurs as an argument of a function;

2) Individuals, of type

i n d

, is the category of denotations of costants

a, b, c, \dots

, which can be considered as indexes o indicals of objects assumed in a discourse;

3) Propositions, of type

b o o l

, is the category of denotations of truth values, also seen as predicates of zero arguments. An expression

P (a)

is an atomic proposition, or a simple predication while

(P \land Q) (a)

, for example (Eat and Drink)(a), where the predicate is the conjunction of two predicates, or

(P (a) \land Q (a)

, which is the conjunction of two propositions, are not atomic predications. We will indicate by

a t o m

the type of atomic propositions;

4) Predicates, of type

p r e d

, is the category of monadic predicates, that is, functions from arguments to truth values:

p r e d = (i n d \mapsto b o o l)

in this category predicative constants

P, Q, R, \dots

are included;

5) Hyper-predicates, of type

h y p e r p r e d

, is the category of predicates that apply to predicates and provide propositions:

h y p e r p r e d = (p r e d \mapsto b o o l)

hyper-predicates can be considered second-order predicates, and analogously, third-order predicates can be considered and, in general, higher-order predicates for further levels;

6) Ad-predicates, of type

a d p r e d

, is the category of functions that apply to predicates and provide predicates:

a d p r e d = (p r e d \mapsto p r e d)

7) Substantives, of type

s u b s t

, is the category of individuals, predicative constants, and any expression that can be equated to them. Also, capital letters

A, B, C . \dots

denoting classes (possibly with indexes) are substantives. Equating an expression to a constant provides a substativation;

8) Logical Operators are the symbols expressing operations on predicates;

9) Performatives are the symbols expressing discourse functionalities.

In the sequel, we provide examples of FLL representations for small texts in the natural language (English). From these representations, twenty logical operators emerge that can describe the meaning of these texts by composing the meanings of the single words. In this reduction, we get the comprehension of texts from the predicates associated with the lemmas of a dictionary.

We want to recall that the usual grammatical categories of Verbs, Nouns, and Adjectives are based on spatiotemporal features. Adverbs realize hyper predicates on predicates (negation, intensification, modality, …). At the same time, the other linguistic units are “empty words", having meanings driven by the contexts in playing roles that correspond to logical operators of FLL.

3.2. FLL Representation of Simple Sentences

Let us start with a simple sentence: “John is good." Its FLL representation is given in Table 1.

John is a person name, then John(a) means that there is an individual a who saisfies the property of having the name “John", and a satisfies the property “Good".

The sentence “John loves Mary" has the FLL representation given in Table 2.

Here a new operator appears, with postfix notation, indicated by _ and called of complementaton, which transforms a predicate, such as Love, into the predicate Love_(b), completing the meaning of Love with the object b. When Love_(b) applies to the individual a we get (Love_(b))(a), or simply Love_(b)(a), expressing “a love b". Therefore, complementing a monadic predicate, we express a binary predicate. In general, given a predicate

P r e d

, the expression

P r e d_

is a function of three possible types, according to the following list:

P r e d_: (s u b s t \mapsto p r e d)

P r e d_: (a t o m \mapsto p r e d)

P r e d_: (p r e d \mapsto p r e d)

the example above falls in the first case and is called direct complementation; the second case is called indirect complementation; the third case is called modification, or with traditional terminology, predicative complementation.

For a better reading of formulas, we avoid application parentheses after _ in complementations, and we assume that the complementation operator applies with left priority. Firstly, the leftmost operator applies, then the operator _ following it on the right, and so on, up to the rightmost complementation operator. In this way, in the usual notation

P (a)

, the subject of the predicate is at the end, on the right, while in the inverse parentheses notation

) a (P

, the subject is at the beginning on the left.

The sentence “John goes home with a bike" has the representation given in Table 3.

in this representation, the operator _ transforms Go into Go_, a function taking an atomic proposition and producing a predicate. Analogously Go

_P l a c e (b)

is a predicate to which the operator _ applies, and Go

_P l a c e (b)_

takes as an argument

I n s t r u m e n t (c)

and becomes Go

_P l a c e (b)_I n s t r u m e n t (c)

. In conclusion, the resulting predicate applies to the individual a providing a proposition. Atomic propositions Place(b) and Instrument(c) define the roles of complements

b, c

which complete the predicate

G o

.

Figure 2 visualizes the representation of Table 3 by a labeled graph: constants or words label nodes. Simple arrows denote predication, while labeled arrows express the operator indicated in the label. We remark that the graph is a second-order graph (in more complex cases, third or fourth orders are necessary) because there are nodes including subgraphs (represented by surrounding curves)

A different way of expressing complementation is through arguments that are sequences, as in Table 4. However, the method based on the complementation operator is more adherent to the linguistic mechanism of complementation therefore, in the sequel, we follow it.

3.3. Complementation

Traditional linguistic analysis is focused on a long list of possible complements: object, specification, place, time, instrument, …. In a list used in the schools, it is possible to find fifty different types of complements. However, such lists result, to a large extent, arbitrary and incomplete. The linguistic form of complementation depends on specific syntactic features. In a logical representation, it is important only to identify the elements completing a predicate by distinguishing each one from the others. Let us consider the sentence “John gives a pen to Mary." The following FLL representation of this sentence is given in Table 5.

However, different predicates (Take, Accept, Destination, Target) could be used instead of "Receive" to adequately express the role of constant c, apart from specific syntactical realizations of the sentence.

The sentence “People elected John as major" is represented in Table 6.

3.4. Specification

The specification is a frequent kind of indirect complementation, putting in some relationship a predicate with a substantive (membership, inclusion, pertinence, possess, …). It is useful to introduce for it a special symbol:

P r e d . a

completing

P r e d

with the argument a as a specification complement. The sentence “John goes home with his bike" is given in Table 7 where a predicate constant P and predicative abstraction is used. Figure 3 visualizes this FLL representation (in the following examples the symbol of predicative abstraction will be tacitly intended).

The sentence “John asked Mary for information on the train timetable" is in Table 8.

The dot notation for specification suggests reducing all cases of indirect complementation to specification (in some languages, such as Arabic, there are only the object complement and the specification complement). For example, “John goes home with his bike" is represented in Table 9 using the specification operator for expressing complementation.

A more complex example is the sentence “Yesterday I was walking without shoes", in Table 10, which has a complex ad-predicate realized by modifications.

Equivalent representations are given in Table 11,Table 12.

These examples show clearly that the same sentence can be represented in many ways. Each representation has advantages or inadequacies concerning the others. The right choice depends on the kind of the intended application of the representation.

3.5. Modification

In the previous examples, we used the modification operator to transform a predicate into an ad-predicate. The typical case ad-predicates are the adverbs modifying verbs, or special verbs, such as begin, finish, interrupt, can, will, must, appear, seem, …, are modifiers of other verbs, as it happens in the representation of Table 13.

An analogous modification occurs when a noun or adjective modifies an adjective, as it is shown in Table 14, and Figure 4.

of course “John is good" and “John is a good policeman" use “good" in two completely different ways, making it evident that the same word, in different contexts, can exhibit different logical types. Namely, in the first case, Good is a predicate, while in the second one, it is an ad-predicate.

3.6. Determiners, Indefinites, Plurals

The

ι

operator of determination was introduced by Giuseppe Peano [35]. Let P be a predicate that is satisfied only by one individual; then, this individual is denoted by

ι P

. Hence:

ι P = a \leftrightarrow P (a) \land (\neg (a = b) \to \neg P (b))

The expression

ι P

corresponds to the definite article of natural languages. If we write

a = ι B o y

, we mean that in the given context, a boy is univocally determined and is identified by the individual constant a. If more than one value satisfies P, all the propositions where

ι P

occurs are false.

The

ε

operator choice has been introduced by David Hilbert [18], it provides a chosen indefinite value that satisfies P. If no argument satisfies P, all the propositions where

ε P

occurs are false. Hence:

a = ε P \leftrightarrow P (a)

and:

P (ε P) \leftrightarrow \forall x P (x) .

Using

ε P

we can put:

{ε P} = {x | P (x)} .

Operators

ι

and

ε

have both type

(p r e d \mapsto s u b s t)

.

Different occurrences of

ε P

may denote different individuals. If we say Any man who loves a woman is happy, we refer to an indefinite man. If we say that Any man who loves a woman is happy, but any man who does not love any woman is searching for a woman whom he can love, clearly, the two occurrences of “any man" have to denote different persons. Otherwise, the sentence is meaningless.

Proposition

Q (ε P)

implies the following propositions, where constants cover all the values satisfied by P:

a_{1} = ε P \to Q (a_{1})

a_{2} = ε P \to Q (a_{2})

…

Therefore, the choice operator

ε

provides universal quantification and the constructions distributing the values of a predicate over other predicates (every man is mortal).

We can extend

ε

notation with numeric indexes so that

ε

expressions with the same index denote the same individual. In this way, expressions such as

ε_{i} P

can be used as usual variables. For example, lambda expressions can be expressed by:

x . E (x) = ε_{1} . E (ε_{1}) .

Indefinite values expressed by

ε

expressions are different from generic indeterminate values that, in many languages, correspond to undeterminate articles. In FLL, particular values are denoted by individual constants. Namely, when we write

P (a)

, we mean that there exists a value that satisfies P, and we call it a.

Relative clauses are of two kinds: descriptive e restrictive. If we say “John, who lives in Rome, will not come to the meeting", the relative clause (introduced by who) adds information that can be equivalently given by saying: “John will not come to the meeting, he lives in Rome".

Conversely, “John is searching for a pen that writes green" is a restrictive relative clause because characterizes what John is searching for. The FLL representation of Table 15 is obtained using the choice Hilbert operator.

Now we give an example using the

ε

operator to express a consecutive construction.

“The bag is so heavy that I cannot bring it" in FLL provides the representation of Table 16.

In the last equation

ε

applies to the predicate within parentheses, and Greater_c’(c) means that: “The weight c (of the bag) overcomes

c^{'}

, which is any weight that a can bring.

In FLL numerals: 0, 1, 2, …(in decimal notation) and ordinals:

1^{o}, 2^{o}, 3^{o}, \dots

, with the usual symbols of arithmetic operations and relations, are available.

Modification with numerals (0, 1, 2, …) allows for a simple representation of plurals. Given a predicate

P r e d

, the expression

2_P r e d

means a couple of individuals that satisfy

P r e d

. Analogously,

(> 1) P r e d

denotes a plurality of individuals satisfying

P r e d

.

Modifications such as

2^{o}_P r e d

denote ordinals (“the second which satisfies Pred") assuming an order, specified by the context or previously given. For example, the following is a representation that refers to two boys; the first speaks, and the second listens:

––––––––––––––––––

2_B o y (a)

1^{o} . a (b)

2^{o} . a (c)

Speak(b)

Listen(c)

––––––––––––––––––

3.7. Contexts, References, and Performatives

Deixis (Greek etymology) refers to all the aspects of a sentence’s spatiotemporal context. Words such as this, that, now, I, and you are deictic words assuming meanings that refer to their specific context. A situation consists of all elements necessary to the correct meaning of a sentence, including deixis and other aspects, such as presuppositions that a speaker assumes about the persons, things, facts, and habits on which a specific communication is based. Moreover, other aspects regarding persons involved in communication can be relevant, and in many languages, these aspects can remarkably influence the expressions used. The register (familiar, formal, institutional, …), especially in some languages, can direct even the choice of the words of sentences.

Anaphora (Greek etymology) refers to the linguistic elements pointing to words and expressions already occurring in sentences in the linear order of their generation. Pronouns are the typical elements playing this role. The concordance is the mechanism on which anaphora is based. Moreover, the same mechanism is also responsible for the aggregation of linguistic expressions in bigger units, including them as components.

Concordance is realized using grammatical marks expressing features (gender, number, person, time, …). The system of grammatical features can change in different languages (form, color, localization, distribution, consistency, …). A pronoun can be seen as an aggregation of marks. In this way, it refers to the closest linguistic expression preceding it and having the same marks. Grammatical features alter linguistic forms using inflection and conjugation so that elements with the same marks are aggregated in bigger units.

In the FLL representations, the individual constants realize pronouns, while parentheses realize aggregation. If we consider the complexity of phenomena realizing anaphora and concordance, we can appreciate FLL’s great advantage over natural languages.

A class of sentences widely analyzed by logicians since the Middle Ages are donkey sentences, so-called for an example reported in an ancient treatise of logical analysis of language (Every man who owns a donkey will beat). The problem with these sentences is the pronoun reference in the context of a universal quantification.

The sentence “Every man loves the woman who loves him". In predicative logic becomes:

\forall x, y ((M a n (x) \land W o m a n (y) \land L o v e (y)_(x)) \to L o v e (x)_(y))

where a reference hereditates the distributive nature of the referred term (

E v e r y_m a n / w h o

).

If we express universal quantification with the

ε

operator, we get:

L o v e_ε M a n (ε W o m a n) \to L o v e_ι W o m a n (ι M a n)

where iota operator refers to the individuals chosen on the left of implication. Using

ε

with indexes:

L o v e_ε_{1} M a n (ε_{1} W o m a n) \to L o v e_ε_{1} W o m a n (ε_{1} M a n)

however, a form more adherent to the linguistic form and using once

ε

is the following:

((a = ε M a n) \land W o m a n (b) \land L o v e_a (b) \to L o v e_b (a) .

“Any man loved by a woman loves her", which we can also represent by Table 17 and Table 18 (the choice is intended in the class of substantives).

Let us consider the sentence “The boys were entering two at a time." Traditional logical analysis tells us that “two at a time" is a complement of "distribution." However, this does not completely clarify its underlying logical mechanism, which is completely represented in Table 19.

The values of

ε A

change with the choices within the class A, and for each pair, there is an entrance time.

We can further explicit the distribution process. Let

a_{1}, a_{2}, \dots

be the choices

ε P

and

b_{1}, b_{2}, \dots

the choices

ε Q

(covering the boys and the times). Then, the FLL representation is equivalent to the sequence of propositions:

E n t e r_T i m e (b_{1}) (a_{1})

E n t e r_T i m e (b_{2}) (a_{2})

…

It is important to remark on the continuative character of the verb “were entering" because it tells us that the process is developed in a time interval along a sequence of steps. Therefore, the distribution expresses a modality of realization of the process associated with the verb enter. Table 20 shows the associated FLL representation.

which we can read: “The boys were entering distributing in two". In this way, we are very close to the linguistic form of the sentence through an analysis of the deep structure of the sentence.

Coordination and subordination between propositions consist of predications over propositions. In the sentence: “While the boys were entering the classroom, the teacher was writing on the blackboard", a relationship expressed by while" occurs between propositions

P_{1}, P_{2}

representable by the predication:

W h i l e_P_{1} (P_{2})

where:

P_{1} =

The boys were entering the classroom;

P_{2} =

The teacher was writing on the blackboard.

Conjunctions of temporal and situational nature (concessive, adversative, consecutive, final, causal, …) express relationships in typical subordinative clauses.

An FLL representation can be always expressed by a predication such as

P (a)

, where all the specific information about P and a are given in the remaining part of the representation. In a sense, all the components of the sentence representation converge into

P (a)

. We may use the assertion symbol ⊧ to stress this special role. In the linguistic terminology,

⊧ P (a)

means that

P (a)

is the principal proposition of the sentence, to which the other propositions refer in determining their subordinative relationships.

Languages allow for describing facts but also giving commands, and asking questions.

Performatives are linguistic elements responsible for indicating the specific functionality of statements. We give only two examples in FLL here: "Go home!" and "Where do you go?" of Table 21 and Table 22, respectively.

The interrogative symbol before the assertion symbol tells us that the expression is a question and the same symbol before the constant confers to the constant the role of the interrogative pronoun. Analogously the exclamation mark expresses orders and before the constant it indicates the individual to which the order is directed.

In conclusion, the FLL logical representation of language is based on predication with predicate and arguments at different logical orders. The presence of different logical orders provides the main complexity of linguistic expressions. When people learn to speak, they implicitly acquire the capability of analysis and synthesis that allows for correct and efficient use of the integrated system of predications underlying FLL representations. Three of four logical orders are very often present in the ordinary discourse (“your beauty fascinates me").

Logical symbols of FLL can be reduced to 20. No variable symbols are present, but individual constants

a, b, c \dots

. possibly indexed, and predicative constants

P, Q, R, \dots

or class constants

A, B, C, \dots

(possibly indexed). Table 23 summarizes all FLL operators. Table 24 will show the interlingual character of FLL representations.

4. Conclusions

The adequacy of FLL in representing the logic of natural languages highlights, in terms of mathematical logic, the role of traditional logical analysis developed within the classical linguistic tradition, linked to the study of ancient languages and based on Aristotile’s schema of predication. Namely, FLL logically puts on a rigorous basis the main concepts on which logical analysis is built by using twenty logical operators (Table 23). In this sense, the logic of the natural language results in a link between mathematical logic and linguistics, and also a natural way to approach the first using the second one. The monadic nature of FLL is a crucial aspect concerning the elimination of variables, coupled with the use of high-order predicates.

In previous work, conversations with ChatGPT were reported, which show the ability of these systems to learn a logical formalism similar to FLL and acquire the capability of providing correct logical representations of given texts. However, we know these chatbots are based on transformers, and then linguistic meanings are reduced to embedding vectors, as numerical vectors of many thousands of components.

The idea of an embedding vector is rooted in a long linguistic tradition [15], which emerged in the 20th century, with the origin in structural linguistics concerning phonology, where a phoneme is a set of pertinent features that exclusively identify it in opposition to all the other phonemes of a language. The same intuition can be exported to word semantics because, given a document corpus, a word can be identified by all the documents where it occurs and by all the positions where in these documents it occurs.

A further investigation could be focused on the relationship between embedding vectors for sentences and discourses and their corresponding FLL representations. The two methods correspond to (geometric) synthetic versus (logic) analytic comprehension. Specific aspects of a detailed analysis of their comparison could provide crucial elements for a deep understanding of the related cognitive process on which knowledge is based [24].

Mathematical logic, with the notions of class, symbol, number, variable, operation, equation, relation, function, predicate, set, type, proposition, truth value, connective, variable abstraction, and predicate abstraction provides a powerful and universal system of conceptualization, which surely is one of the most relevant successes of mathematics. Teaching chatbots mathematical logic could improve their semantic mechanisms by acquiring theoretical competencies in their internal knowledge organization.

A line of development of the paper could be the analysis of chatbot interactions in learning and exhibiting FLL representations, along with the experience presented in [22]. The levels, times, and strategies of FLL training could provide tools for evaluating the logical competencies of future conversational systems.

References

Bahdanau, D., Cho, K., Bengio, Y., Neural Machine Translation by Jointly Learning to Align and Translate, arXiv:1409.0473 (2014).
Barwise, J., The Situation in Logic, CSLI Lecture Notes,17 (1989).
Goodfellow, I., Bengio, Y. Courville A., Deep Learning. MIT Press (2016) Situations, Language Logic. Studies in Linguistics and Philosophy (SLAP, vol. 34), D. Reidel Publishing Company, Dordrecht, Holland (1987).
Boole, G.: The mathematical analysis of logic. Cambridge: MacMillan, Barclay; & Macmillan, London: George Bell (1847). [CrossRef]
Brown T. B. et al.: Language Models are Few-Shot Learners, NEURIPS, 33, 1877–1901 (2020).
Church, A.: Introduction to Mathematical Logic, Princeton University Press (1956). [CrossRef]
Curry H. B., Feys, R. Combinatory Logic, North-Holland Publishing Company (1958).
Devlin, K., Situation theory and situation semantics, in: Handbook of the History of Logic Vol. 7, 2006, 601-664, Elsevier, Amsterdam, Netherlands (2006).
Dowty, D. R., Wall, R. E. (ed.): Introduction to Montague semantics, D. Reidel (1989).
Euler, L., Introductio in Analysin Infinitorum. M. M. Bousquet, Lausanne (1948).
Fenstad, J. E., Halvorsen, PK., Langholm, T., Benthem, J., Situations, Language and Logic. Studies in Linguistics and Philosophy, vol 34. Springer, Dordrecht (1987) Available online:. [CrossRef]
Fitting, M. , Intensional Logic in: The Stanford Encyclopedia of Philosophy, Available online:. Available online: https://plato.stanford.edu/archives/win2022/entries/logic-intensional/, Metaphysics Research Lab, Stanford University (accessed on day month year).
Frege, G., Begriffsschrift, eine der arithmetischen nachgebildete Formelsprache des reinen Denkens, Halle an der Saale, Verlag von Louis Nebert (1879).
Gelb, W., Kirsch, B., The Evolution of Artificial Intelligence: From Turing to Modern Chatbots, Tulane University, Archives, https://aiinnovatorsarchive.tulane.edu/2024/ (2024).
Harris, Z., S., Distributional Structure, WORD, 10:2-3, 146-162 (1954).
Henkin L., Monk J. D., Tarski A., Cylindric Algebras, Vol. 1, North-Holland (985).
Hilbert, D.: Über das Unendliche, Mathematische Annalen 95, 161-190 (1926).
Hilbert, D., Ackermann, W.: Principles of to Mathematical Logic (tr. from German, 1928), AMS Chelsea Publishing (1991).
Hornick, K., Stinchcombe, M., White, M.: Multilayer feedforward networks are universal approximators, Neural Networks, 2, 359-366 (1989).
Kaplan, J. et al.: Scaling Laws for Neural Language Models arXiv:2001.08361 (2020).
Manca, V: A Metagrammatical Logical Formalism, in C. Martín-Vide (ed.), Mathematical and Computational Analysis of Natural Language, John Benjamins (1998).
Manca, V. Agile Logical Semantics for Natural Languages. Information, 15, 1, 64 (2024). [CrossRef]
Manca, V., Artificial Neural Network Learning, Attention, and Memory, Information, 15, 387 (2024).
Manca, V., On the functional nature of cognitive-systems , Information, 15, 807 (2024). [CrossRef]
Mitchell, T., Machine Learning, McGraw Hill (1997).
Minsky, M., Computation. Finite and Infinite Machines, Prentice-Hall Inc. (1967). [CrossRef]
Montague, R.: Universal Grammar, Theoria, 36, 373-398 (1970).
Montague, R. English as a formal language, In B. Visentini et al. (eds) I Linguaggi nella Società e nella Tecnica, Milan (1970).
Montague, R. The Proper Treatment of Quantification in Ordinary English, https://www.cs.rhul.ac.uk/ zhaohui/montague73.pdf.
Neumann, von, J.: The Computer and the Brain, Yale University Press (2012).
Nielsen, M. Neural Networks and Deep Learning. Online (2013).
OpenAI, GPT4-Technical Teport, ArXiv: submit/4812508 [cs.CL] 27 Mar (2023).
Parker, D.B., Learning logic. Technical Report TR-47. Center for Computational Research in Economics and Management Science, MIT, Cambridge, MA. (1985).
Parkinson, G. H. R.: Leibniz Logical Papers, Clarendon Press (1966).
Peano, G.: Opere Scelte, vol. II: Logica Matematica. Interlingua ed Algebra della Grammatica, Edizioni Cremonese (1958).
Reichenbach, H.: Elements of Symbolic Logic, MacMillan Limited (1947).
Russell, B., Whitehead, A. N., Principia Mathematica, Cambridge University Press (1910-13).
Tarski, A.: The semantic concept of truth and the foundation of semantics. Philosophy and Phenomenological Research 4, University of California, Berkley (1944). [CrossRef]
Thomason, R. H. (ed.): Formal Philosophy, Yale University Press (1974).
Turing, A. M.: Computing Machinery and Intelligence, Mind, London, N. S. 59,433-460 (1950).

Figure 2. A graphical representation of sentence given in Table 3

Figure 3. A graphical representation of sentence given in Table 7.

Figure 4. A graphical representation of "Good(a)" (Top) and "Good_Policeman(a)" (Bottom).

Table 1. “John is good."

John(a)

Good(a)

Table 2. “John loves Mary."

John(a)

Mary(b)

Love_(b)(a)

Table 3. “John goes home with a bike."

John(a)

Home(b)

Bike(c)

)a( Go_Place(b)_Instrument(c)

Table 4. “John goes home with the bike."

John(a)

Home(b)

Bike(c)

u =(a,b,c)

Go(u) ∧ Subject(a) ∧ Place(b) ∧ Instrument(c)

Table 5. “John gives a pen to Mary."

John(a)

Pen(b)

Mary(c)

)a( Give_(b)_Receive(c)

Table 6. “People elected John as major."

A =PeopleJohn(b)

Elect(P)

Past(P)

P_Major(b)(A)

Table 7. “John goes home with his bike."

John(a)

Home.a(b)

Bike.a(c)

^bGo(P)

)a( (P_Place(b)_Instrument(c)

Table 8. `John asked Mary for information on the train timetable."

John(a)

Timetable_Train(b)

Information.b(c)

Mary(d)

Ask(P)

Past(P)

)a( P_d_c

Table 9. “John goes home with his bike."

John(a)

Home.a(b)

Bike.a(c)

Go(P)

Direction.P(b)

Instrument.P(c)

P(a)

Table 10. “Yesterday I was walking without shoes."

Me(a)

Walk(P)

Without_(2_Shoe)(P)

Past(P)

Progressive(P)

Yesterday(P)

P(a)

Table 11. “Yesterday I was walking without shoes."

Me(a)

Walk(P)

Past(P)

Progressive(P)

Yesterday(P)

c = (a₁, a₂)

Shoe.a(a₁)

Shoe.a(a₂)

Pair_a₂(a₁)

)a( P_Without(c)

Table 12. “Yesterday I was walking without shoes."

Me(a)

Walk(P)

Past(P)

Progressive(P)

Yesterday(P)

)a( P_(¬(Wear_(2_Shoe))

Table 13. “John wanted to speak."

John(a)

(Want_Speak)(P)

Past(P)

Progressive(P)

P(a)

Table 14. “John is a good policeman."

John(a)

(Good_Policeman)(a)

Table 15. “John is searching for a pen that writes green."

John(a)

Pen(P)

Green_Write(P)

P(a)

Table 16. “The bag is so heavy that I cannot bring it."

Me(a)

b = ιBag

(Weight_Quantity).b(c)

c′ = ε(x.Quantity_(Weight)(x) Λ Can_(Bring)_x(a))

Greater_c′(c)

Table 17. “Any man loved by a woman loves her."

a = εMan

(Woman(b) Λ Love_a(b)) → Love_b(a)

Table 18. “Any man loved by a woman loves her."

a = εMan

Love(P)

Love(Q)

(Woman(b) Λ Love_a(b)) → Love_b(a)

Table 19. “The boys were entering two at a time."

A = ι(Class_(2Boy))

a = εA

Time.a(b)

Enter_Time(b)(a)

Table 20. “The boys were entering two at a time."

A = ι((>1)_ Boy))

Enter(P)

Continuative(P)

P_Distribution(2)(A)

Table 21. “Go home."

You(a)

b = ιHome

Go(P)

!a |= P_b(a)

Table 22. “Where do you go?"

You(a)

Place(b)

Go(P)

? |= P_Place(?b)(a)

Table 23. FLL Operators

Table 24. An FLL representation of the Cinese sentence: “Iesterday I was walking along the sea."

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

Functional Language Logic

Abstract

Keywords:

Subject:

1. Introduction

2. Material and Methods

2.1. Logical Symbols and Operators

2.2. Abstraction Operators

2.3. Complementation and Descriptive operators

2.4. Comparison with Related Logical Formalisms

3. Results

3.1. Categories and Functional Types

3.2. FLL Representation of Simple Sentences

3.3. Complementation

3.4. Specification

3.5. Modification

3.6. Determiners, Indefinites, Plurals

3.7. Contexts, References, and Performatives

4. Conclusions

References

MDPI Initiatives

Important Links

Subscribe