The Listening Comprehension Construct
The Nature of Spoken Language
When making language tests, the first place to start is the language. Processing spoken texts requires skills not required when processing written texts:
- The information is conveyed through sound, which requires a knowledge of the phonetics of the language.
- There is considerable phonological modification (assimilation, elision, vowel reduction, etc.); which requires a knowledge of the phonology of the language
- It must be processed in fast real time, not controlled by the listener, which means processing must be fast and efficient
Linguistically, spoken language differs considerably from written language:
- Not sentences, but idea units
- Idea units are shorter
- Connected more by conjunction than embedding, and connections are often not explicit
- There are many hesitations: pauses, fillers, repairs, repetitions
- Is far more colloquial, less conservative, more slang
- More personal involvement
- Far less concern for truth, and more poetic license.
Processing Spoken Language
Meaning is not found in the text, but is constructed by the listener, using:
- Linguistic knowledge
- The co-text
- The context of situation
- General background or topic specific knowledge
There is no fixed bottom-up order of processing: expectations and inferences lead to top-down interpretations
- Speech is generally very fast, and hence processing must be automatic.
- Faster speech rates are more difficult, and all listeners have a cut-off point, beyond which processing breaks down completely.
- Listeners don't get all the meanings in a text, they get a reasonable interpretation.
- Listeners extract the meaning, or gist, which they store in memory, and then forget the form.
Remember, listening comprehension is not understanding what words mean, but what speakers mean.
The Functions of Speech
There are two main functions of spoken language:
- Interactional — to maintain social relationships: phatic, small talk, the personal contact is the motivation for the message
- Transactional — to transmit propositional information: the content is the motivation for the message
People virtually never intentionally listen to interactional language: to do so is boring and pointless.
Listening Situations
There are two main types of listening:
- Collaborative listening — the listener has the responsibility to cooperate to construct the discourse, provide back channeling, monitor their comprehension, and take turns where appropriate. Note: The listener can ask for repetition, but always gets a re-statement.
- Non-collaborative listening — the listener has no chance to influence the discourse, or seek clarification. Note: There is no chance to seek repetition.
Recently possible:
- Non-collaborative listening to recordings — no chance to influence discourse, but can get a (perpetual) repetition of exactly the same words.
Listener can be:
- Addressee
- Participant, but non-addressee
- Overhearer
Listening is usually much easier for the addressee, who shares knowledge with the speaker, which is taken into account, whereas the overhearer often lacks the shared knowledge that makes the discourse comprehensible.
Speaker can be in a:
- Monologic situation
- Dialogic situation
- Group situation
Context and Cognitive Environment
Listeners use the context to help them interpret meanings. i.e.
- Context of situation
- Co-text
But the most important context is what is in the mind of the listener. This consists of:
- Their purpose for listening
- Their understanding up to this point—accurate or not
- What they are thinking about—which can differ considerably from one listener to another
- The mental images they create—often very idiosyncratic, and often what is actually stored in memory
It is the cognitive environment that provides the real context for interpretation: i.e. what's in the listener's mind.
Causes of Comprehension Breakdown
- Lack of knowledge of the linguistic code
- Mislexicalizations, i.e. slips of the ear
- Lack of cohesion, often due to too many ‘gaps' in comprehension so far
- Processing breakdowns; listeners have ‘knowledge' of the language, but fail to apply it efficiently, due to: Lack of processing speed; Lack of sufficient effort or attention; Inability to engage in other necessary cognitive activities while listening
Note: Almost all processing breakdowns occur because processing is not sufficiently automatic.
Monitoring of Comprehension
If listeners aim for a reasonable interpretation, then they must have some mechanism to check, or monitor, whether their interpretation is reasonable.
- Listeners often have a reasonably accurate idea of how well they have understood, but not always
- Monitoring seems to work on the principle that the interpretation is accepted as alright as long as there are no obvious contradictions or discrepancies
- Alertness and mindfulness of possible discrepancies can vary due to interest, motivation, and, no doubt, ‘general intelligence'
Oral-literate Continuum
Differences between listening and reading are really a question of degree. So texts can be ranged along an oral-literate continuum:
- Oral texts: more characteristics of spoken language; associated with casual conversation
- Literate texts: more the characteristics of written language, especially, expository prose
Features that determine the position on the continuum include:
- Linguistic features (the sound system, phonology, stress, intonation)
- Paralinguistic features (tone of voice, gestures, and facial expressions)
- Vocabulary choice
- Grammar Degree of planning
- The type and degree of shared context between the participants
- The function of the text
Spoken texts differ in their degree of orality depending on the listening situation.
- Informal situations tend to have more oral features
- Formal situations tend to have more literate features.
Planned vs. Unplanned Discourse
One variable that relates to the degree of orality is whether the discourse is planned or not.
Unplanned Discourse
- Initial ideas and first reactions
- Loosely or poorly organized
- Fragments of language, with hesitations, false starts, restatements, vocabulary repair, and grammatically ‘incorrect sentences'
- Context is used to connect propositions
- Referents are often missing
Planned Discourse
- Polished worked text
- Carefully structured
- Appropriate vocabulary and grammar
- Coherent and cohesive
Most informal texts are unplanned; formal texts tend to be more planned. We will talk later about the need to plan, unplanned discourse, and the dilemma this creates for test development.
Conclusions on Construct Definition
If we examine listening tests, we can see a number of different listening constructs, from the most narrowly defined, usually in older tests, up to the broadest, usually in more modern tests:
- Construct 1: Competence in knowing and using the sound system.
- Construct 2: Competence in understanding short utterances on a literal linguistic level.
- Construct 3: Competence in using the grammatical system as well as discourse competence.
- Construct 4: Competence with all the linguistic, pragmatic or sociolinguistic aspects of language.
It is probably fair to say that these constructs derive from the work on communicative competence, or at least they can be understood within that framework. But in many cases, even the most ‘communicative' of these will often lack the oral characteristics of realistic spoken language, and often will not require automatic processing.
I prefer a different approach to construct definition. Listening Comprehension is a very complex process. But normally three things need to be stressed for construct validity:
- The need for realistic texts: you have to test whether they can understand spoken language
- The need for realistic tasks: you have to test whether they can do what listeners normally do
- The need for language processing to be completely automatic: you have to test whether they can do this automatically, while paying no attention to their listening processes.
A good default listening construct might be:
The ability to process extended samples of realistic spoken language, automatically and in real time, to understand the linguistic information that is unequivocally included in the text, and to make whatever inferences are unambiguously implicated by the content of the passage.