Human Sentence Processing

Some Assumptions

No one knows how the human brain processes language but certain reasonable assumptions can be made about human sentence processing.

These assumptions, whether valid or not, presented a useful basis for modeling the Logos System.

Among these assumptions are:

First Assumption

Human sentence processing is opportunistic rather than algorithmic.

There is not some master control circuitry–some homunculus sitting somewhere in the brain–that “manages” human sentence processing in the manner of an algorithm (i.e. a logic-driven sequence of steps).

  • In the Logos System, sentence processing is also non-algorithmic, i.e. it is based on declarative information about language rather than upon sequences of procedural steps.

  • In the Logos System, the question of how such declarative information is to be applied to the input stream is solved in a novel, symbolic quasi-neural net paradigm. (See Logos Model/neural net architecture.)

  • In both the brain and the Logos System, the only thing that might be said to control sentence processing is the sentence itself.

  • Effectively, the sentence is the algorithm.

Second Assumption

Human sentence processing is effected by memory associations stored in human memory–i.e. by the content and organization of human memory and all its associations–as memory reacts to the input signal.

The process has nothing to do with fixed sequences of logical operations.

Thus, human sentence processing will vary from sentence to sentence as the memory associations vary. For one linguistic situation, the brain may rely more upon semantic associations, for another, upon syntactic associations, for still another, upon stored chunk of hybrid linguistic data.

In this latter case, associations involving the chunks have become so familiar that short cuts to them have sprung up (to satisfy the law of least effort).

  • In the Logos System, sentence processing is also effected solely by the content and organization of associative memory.

  • In the Logos system, by associative memory is meant the words and structures of a given language (and their associations) that have been learned through exposure to that language and assimilated into memory.

  • Rules in the Logos System used to process sentences are comprised entirely of semantico-syntactic patterns. These patterns capture the semantico-syntactic associations learned regarding that language.

  • Here too, the parts and types of memory involved in processing a sentence vary from from one linguistic situation to another, in true opportunistic fashion, now veering more towards syntax, now towards semantics.

  • In sum, in both the brain and the Logos System, associative memory itself is the processor, as this memory reacts opportunistically to the input signals.

Third Assumption

The brain abhors complexity and seeks wherever possible to simplify its processes, driven in this by the law of least effort.

The brain achieves simplification largely through the use of abstraction.

The brain cell (neuron) by its very architecture would appear to be an abstraction device, with many dendrites for input and a single axon for output.

The brain clearly employs abstraction in dealing with the complexity of natural language. No one knows how the brain represents language internally, but there are hints from language acquisition (click here for discussion).

  • The Logos System relies upon abstraction as its chief device for coping with complexity.

  • Abstraction is the principle upon which the SAL Representation Language of the Logos System is based.

  • The semantico-syntactic patterns of the Logos System knowledge base are patterns at several levels of abstraction and thus enjoy the power of generality.

  • In both the brain and the Logos System, the problem of complexity of natural language is handled by dealing with natural language at the level of semantico-syntactic abstractions.

Fourth Assumption

Humans acquire language by means of the general mechanism for learning, viz. the assimilation of what is unknown to what is already known.

Language learning begins when an unfamiliar linguistic pattern is seen to be analogous to some already familiar linguistic pattern.

Language learning dramatically advances when the brain discerns commonality in differing, already familiar linguistic patterns and thus, through the power of abstraction, begins to generalize these patterns.

The language learner assimilates new, unfamiliar patterns by associating them with already familiar patterns. This assimilation occurs when some resemblance between the unfamiliar and the already familiar is recognized. When this resemblance-recognition is effected at a sufficiently abstract abstract level, the learning process becomes very efficient.

This mechanism, in part at least, might well explain how virtually infinite language capacity may result from merely finite language exposure.

  • The assimilation of the unfamiliar to the already familiar perfectly describes the Logos approach to sentence analysis.

  • In effect, an unfamiliar input sentence is “learned” (decoded) by associating the various parts of the unfamiliar sentence with already familiar (stored) semantico-syntactic patterns which, at some level of abstraction, they resemble.

  • These already familiar, stored patterns are the product of prior, finite exposure to language by Logos System developers ) who have generalized what they have seen in language (simulating presumed brain processes).

  • The capacity to discern analogy among diverse phenomena is fundamental to this generalization process.

  • Seeing that x (the unfamiliar) is like y (the already familiar) entails seeing them under a common (more general or abstract) classification.

  • The SAL Representation Language, being an abstraction language, supplies the terms by which unfamiliar input patterns are likened to those already familiar in an efficient manner.

  • The stored, semantico-syntactic (SAL) patterns of the system’s memory, being abstract, encompass virtually all actual and potential sentences of a given source language.

  • Thus, after generalizing its language exposure as a finite set of abstract, semantico-syntactic patterns, the system’s memory is prepared to deal with (decode) an infinite variety of new, previously unseen sentences.

Fifth Assumption

In the brain, syntactic processing and semantic processing are integrated rather than separate, sequential processes.

Evidence for this is seen in the fact that we often understand half-uttered sentences, and can often complete the sentences begun by others.

This further suggests that human sentence processing must be deterministic, producing a single parse rather than a syntactic parse forest that must then be semantically pruned.

  • In the Logos System, the SAL Representation Language integrates semantics and syntax, seeing them as the two extremes of a continuum.

  • The integration of syntax and semantics in SAL allows syntactic processing and semantic processing to be conducted simultaneously.

  • By applying semantics to the syntactic parse, the Logos System is able to produce a single, deterministic parse rather than a parse forest which must then be pruned by means of semantics.