Modelling Bilingual Representation and Processing

Authors Avatar

Modelling Bilingual Representation and Processing

The bilingual’s ability to process several languages almost simultaneously and overcome the conflicts across languages is striking. The main topic of interest here is how a non-target language affects target word identification under various experimental circumstances. In this research, empirical investigation and computer simulation go hand in hand. To account for collected empirical data, several models of bilingual representation and processing have been developed. This essay will examine the Bilingual Interactive Activation (BIA) model which simulates orthographic level of representation, the Distributed Feature Model (DFM) which specifies the semantics (i.e., meaning) of isolated words, and the Revised Hierarchical Model (RHM) which accounts for the interlanguage connections between lexicon and concepts/semantics as a function of L2 learners’ proficiency. The strengths and weaknesses of these three models will be evaluated on an empirical stand and the author argues that a comprehensive model (e.g., BIA+ model) is needed to simulate and account for all the above perspectives (i.e., orthographic, semantic as well as phonological representations and individual differences in terms of bilinguals’ L2 proficiency).

        

The BIA model (Dijkstra & van Heuven, 1998; van Heuven, 2000) is a bilingual extension of the monolingual Interactive Activation (IA) model for visual word recognition (McClelland & Rumelhart, 1981). In the IA model, there are three levels of nodes, with ascending complexity: (1) features of a letter such as curves, straight lines, or crossbars, (2) individual letters, and (3) words. Information at all levels can interact with each other during the word recognition process, which may flow both ‘bottom-up’ (features to letters to words) and ‘top-down’ (words to letters to features). Within levels, nodes compete for activation (thus inhibiting each other); across levels, nodes either inhibit or excite each other. According to IA, it is these inhibitory and excitatory connections that give rise to the appropriate activation of patterns that correspond to the perception of words. As a straightforward extension of the IA model, the BIA model consists of four levels of nodes: features, letters, words, and languages. All nodes at the word level are interconnected with mutual inhibition. Feature units activate appropriate letters, and letter units activate appropriate words in the appropriate language. The model incorporates an integrated lexicon and specially adds two language nodes (one for English and one for Dutch).  Each language node collects activation of all words from one lexicon and suppresses all words in the other lexicon. The relative prominence of one language or the other under particular circumstances is reflected in the activity of two language nodes.

The BIA model proposes a precise mechanism for the way in which orthographic forms are activated in two languages in bilingual visual word recognition. The model is able to simulate a series of experiments that exploit the words that share lexical features across languages. This includes cross-language neighbours that are orthographically similar in two languages but unrelated (e.g., Jared & Kroll, 2001; van Heuven et al., 1998), interlingual homographs that share lexical form but not meaning (Dijkstra et al., 1998; Jared & Szucs, 2002; von Studnitz & Green, 2002) and cognates that are translation equivalents with similar spelling and sound (e.g., Dijkstra et al., 1998; van Hell & Dijkstra, 2002). All the listed studies support the main claim of the BIA model that proficient bilinguals activate information about words in both languages in parallel. This is in line with the widely accepted account that the bilingual lexicon is integrated across languages and is accessed in a language non-selective way.  

For different experiment paradigms, the model was also able to capture the effects on target recognition of a number of well-known bilingual factors. Among these are person-related factors such as language proficiency and subjective word frequency (Bijeljac-Babic et al., 1997), and contextually determined factors such as language presentation order and language of a previously presented prime (von Studnitz & Green, 1997). However, BIA only incorporates orthographic representation and fails to fully characterize word recognition in and out of meaningful context. A complete model must specify semantics (Francis, 1999). De Groot and colleagues (de Groot, 1992, de Groot et al., 1994; van Hell &  de Groot, 1998) proposed a model of bilingual semantics, namely the Distributed Feature Model (DFM). The model assumes that a word’s lexical category (e.g., concrete or abstract, cognates or noncognates) is determined by the degree to which semantic representations are shared (i.e., semantic overlap) across languages. More specifically, representation for concrete words and cognates are expected to be similar across languages, but not the case for abstract words and noncognates. The supportive evidence for the DFM mainly comes from translation performance as translation requires semantic processing. It’s found  by the DFM modellers that the time to recognize and produce translation equivalents is faster when the word pairs are concrete nouns and cognates because concrete words and cognates have more degrees of semantic overlap across languages than abstract words and noncognates. De Groot and her colleagues’ studies on word concreteness however, have been criticized by several researchers. They generally argue that concrete words are likely to have fewer translations across languages than abstract words. So it is not surprising that abstract words will have considerably longer translation latency due to their alternative translations (Tokowicz et al., 2002).

Join now!

Neither BIA model nor DFM accounts for the developmental perspective of bilinguals in terms of their developing proficiency in their second language acquisition. To explain how the connection between words (lexical representation) and their meanings (conceptual/semantic representation) develop with increasing proficiency in the L2, Kroll & Stewart (1994) proposed the Revised Hierarchical Mode (RHM). Different from the BIA model, the RHM does not specify the precise dynamics of lexical recognition, instead, it focuses on how mappings from word to concept are developed and accessed during language processing. The model assumes independent lexical representations for words in each language but ...

This is a preview of the whole essay