Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Sentential inference bridging between lexical/grammatical knowledge and text comprehension among native Chinese speakers learning Japanese

  • Katsuo Tamaoka,

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Validation, Visualization, Writing – original draft, Writing – review & editing

    Affiliations Hunan University, Changsha, China, Nagoya University, Nagoya, Japan

  • Hiromu Sakai,

    Roles Data curation, Writing – review & editing

    Affiliation Waseda University, Shinjuku City, Japan

  • Yayoi Miyaoka,

    Roles Data curation, Writing – review & editing

    Affiliation Hiroshima University of Economics, Hiroshima, Japan

  • Hajime Ono,

    Roles Data curation, Writing – review & editing

    Affiliation Tsuda University, Kodaira, Japan

  • Michiko Fukuda,

    Roles Data curation, Writing – review & editing

    Affiliation Bunkyo University, Koshigaya, Japan

  • Yuxin Wu,

    Roles Data curation, Investigation, Project administration, Writing – review & editing

    Affiliation Xi’an International Studies University, Xi’an, China

  • Rinus G. Verdonschot

    Roles Conceptualization, Formal analysis, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing

    rinus.verdonschot@mpi.nl

    Affiliation Max Planck Institute for Psycholinguistics in Nijmegen, Nijmegen, The Netherlands

Abstract

The current study explored the role of sentential inference in connecting lexical/grammatical knowledge and overall text comprehension in foreign language learning. Using structural equation modeling (SEM), causal relationships were examined between four latent variables: lexical knowledge, grammatical knowledge, sentential inference, and text comprehension. The study analyzed 281 Chinese university students learning Japanese as a second language and compared two causal models: (1) the partially-mediated model, which suggests that lexical knowledge, grammatical knowledge, and sentential inference concurrently influence text comprehension, and (2) the wholly-mediated model, which posits that both lexical and grammatical knowledge impact sentential inference, which then further affects text comprehension. The SEM comparison analysis supported the wholly-mediated model, showing sequential causal relationships from lexical knowledge to sentential inference and then to text comprehension, without significant contribution from grammatical knowledge. The results indicate that sentential inference serves as a crucial bridge between lexical knowledge and text comprehension.

Introduction

In most written or spoken texts, a specific message is conveyed through a series of semantically related sentences that form meaningful connections. To understand these texts, one must grasp the meaning of the individual words [14] and have proficiency in the grammatical rules that prescribe word arrangement within sentences [59]. As a result, a combination of lexical and grammatical knowledge is essential for effective sentence comprehension.

However, there is a significant difference between understanding a single sentence and comprehending the overall message conveyed by a text. A text typically consists of multiple sentences, and the comprehension of its message relies on the rules of inference that connect one sentence to another. Sentential inference is thought to facilitate the semantic connections between previously introduced and new sentence information [1014], ultimately leading to the comprehension of a text’s complete message [1516].

The current study aims to investigate the role of sentential inference as a mediator between lexical and grammatical knowledge and overall text comprehension.

Background

The role of lexical knowledge during text comprehension

The role of lexical knowledge in text comprehension has been extensively studied in first and second language acquisition. Research on language acquisition in native English-speaking children [1, 1719] has demonstrated that lexical knowledge is a significant contributor to text understanding and reading comprehension. LaBerge and Samuels [2] introduced the concept of ‘automaticity,’ which suggests that proficient readers can process words in a text effortlessly. They suggested this ultimately allows children to focus their cognitive efforts on higher-order comprehension skills once they have mastered automatic word processing.

Beck, Perfetti, and McKeown [1] examined the connection between word processing and reading comprehension as well. Over a five-month period, they trained fourth-grade students to establish semantic networks, leading to improved word processing efficiency and enhanced reading comprehension compared to a control group. Similarly, using Cantonese, Leong and Ho [20] found that lexical knowledge significantly influenced Chinese reading comprehension in a study involving 361 Chinese secondary school students.

In a study of Japanese language acquisition, Tamaoka, Leong & Hatta [4] found that word processing speed, an indicator of automaticity, greatly influenced reading comprehension. They observed significant differences in processing alphabetic loanwords (katakana) and Chinese-origin words (kanji) between skilled and less-skilled readers from grades four to six. Skilled readers were better at deciphering unknown pronunciations of difficult kanji characters using phonetic cues. This highlights the importance of phonological and orthographic morphemes in reading comprehension (see also [21]).

Similarly, Takahashi [3] determined that lexical knowledge was the most crucial factor in reading comprehension for Japanese-speaking children in grade five. In a subsequent longitudinal study [22], Japanese-speaking children from grades one to five were studied and it was discovered that three primary factors—word pronunciation speed, reading memory span, and lexical knowledge—significantly impacted reading comprehension in younger children (grades one to three). However, upon reaching grade five, lexical knowledge emerged as the sole influential factor, with word pronunciation speed and reading memory span no longer having a significant effect on reading comprehension. Takahashi [22] attributed this change to the increasing complexity of reading contexts and the expanding range of vocabulary used in texts as children progress through the grades.

So far, our primary focus has been on the text comprehension of monolinguals. Nevertheless, it seems necessary to point out that also for bilinguals, strategies to text comprehension might overlap and warrant an investigation. In research exploring the acquisition of English as a second language by native Chinese speakers, Zhang [23] employed Structural Equation Modeling (SEM) to investigate the contributions of lexical and grammatical knowledge to reading comprehension. Zhang’s findings revealed that lexical knowledge had a significant impact on reading comprehension, whereas grammatical knowledge offered a more modest contribution.

Considering the significant role lexical knowledge seems to play in reading comprehension, some have introduced the concept of a ‘lexical threshold’ to determine the minimum lexical knowledge necessary for accurate text comprehension. Hu and Nation [24], Nation [25], and Stahl and Nagy [26] posited that understanding 98% of the words in a written English text is required to achieve accurate comprehension. In the case of native Chinese speakers learning Japanese, Komori, Mikuni, and Kondo [27] estimated that recognizing 96% of the words in a general written text is essential for comprehension. Note that this number still indicates that about 4% of the vocabulary in a text would be unknown. Additionally, Mikuni et al. [28] calculated the threshold for listening comprehension to be around 93%. As spoken language is typically less lexically dense than written language, it is possible that listening comprehension can be achieved with a lower lexical threshold than reading comprehension. However, it is evident that a high level of lexical knowledge is crucial for attaining text comprehension in both spoken and written Japanese.

In a second language study by Zhang, Zhang, Li, and Zhang [29], factors impacting Chinese reading comprehension were examined by testing 447 English-speaking Chinese learners from college-level study-abroad programs. The study found that morphological awareness and lexical knowledge both influenced reading comprehension through lexical inference. As Chinese texts are typically composed of morphological (logographic) characters called hànzì, which do not contain inflections for verbs or adjectives, knowledge of hànzì morphology and compound words is essential to understand the text. Approximately 75–80% of Chinese words consist of two or more hànzì characters [30]. Zhang et al. [29] identified lexical inference as a mediating factor between morphological/lexical knowledge and reading comprehension, indicating a sequential relationship between the understanding of hànzì characters, their compound words, and the lexical relations of multiple words. Consequently, morphological/lexical knowledge and lexical inference seem to play crucial roles in Chinese reading comprehension.

Other research has shown that adult native Chinese speakers learning Japanese also make use of morphological/lexical knowledge when learning English. For example, Koda (2000) examined native Chinese and Korean speakers learning English and found that while both groups showed similar sensitivity to English morphological structure, Chinese learners were more efficient in integrating morphological and contextual information during English sentence processing. This strong contribution of morphological awareness to reading comprehension was also evident among native Chinese children in elementary school who learnt English as a second language. Wang, Cheng, and Chen [31] reported that both Chinese and English morphological awareness contributed to Chinese reading comprehension in these children, referring to this phenomenon as "cross-language morphological transfer." Specifically, Native Chinese speakers initially learn to use hànzì morphological awareness for text comprehension in their first language (Chinese). This morphological/lexical approach, which they acquired in Chinese, can then potentially be applied through lexical inference to comprehend English text as well.

Koda’s [32] findings can also be relevant to native English speakers learning Chinese as a second language. For example, Zhang et al. [29] suggested that morphological awareness and lexical knowledge, through lexical inference, significantly contribute to the comprehension of Chinese text. Due to the linguistic characteristics of the Chinese language, native English speakers likely rely on hànzì for morphological awareness and utilize lexical inferences to understand Chinese text. Consequently, the morphological awareness approach is employed in both cases: from Chinese as a first language to English as a second language [32], and from English as a first language to Chinese as a second language [29, 31]. When learning Chinese as a second or foreign language, morphological awareness of hànzì is essential for text comprehension. Note that although there are many inconsistencies in the usage of this terminology, typically, a “foreign” language is not spoken in the region where it is taught whereas a second language is. Therefore, in both directions of language acquisition, morphological awareness appears to be a critical factor.

Furthermore, lexical knowledge plays a crucial role in Japanese text comprehension for native Chinese speakers learning Japanese as a foreign language. Chen [33] found that among 4,600 Japanese kanji-compound words, 54.5% had the same characters and meanings as in Mandarin Chinese (e.g., 数学 “mathematics”), while 14.9% shared the same characters but had similar meanings (e.g., 高校 “college” in Chinese; “high school” in Japanese). Only 4.1% of the words had the same characters but different meanings (e.g., 暗算 “mental arithmetic” in Japanese, “to plot” in Chinese). If minor orthographic differences between Chinese hànzì and Japanese kanji characters are disregarded, native Chinese speakers can understand approximately 98.1% of commonly used Japanese kanji before even studying the language [34, 35]. This high degree of similarity explains why native Chinese speakers often rely heavily on kanji-based lexical items to comprehend written Japanese texts. As a result, native Chinese speakers learning Japanese are likely to leverage their knowledge of hànzì to understand words in Japanese text.

Japanese differs from Chinese in that it utilizes three scripts instead of only the orthographic (hànzì) script [36]. Firstly, Japanese text incorporates Chinese-originated words (Kango) written in Chinese morphemic hànzì characters (known as kanji in Japanese) which predominantly convey conceptual meanings. Additionally, there are two phonetic scripts: katakana and hiragana. Katakana is primarily used for transcribing alphabetic loanwords, with word etymology differentiated by the use of kanji and katakana. Hiragana, the third script, is commonly employed for grammatical functions, although it has other uses as well (e.g., to write specific native Japanese words, such as: きれい,/kirei/ “beautiful”). Given that lexical and grammatical functions can be visually distinguished to some extent by the three scripts in Japanese, the causal relationship between morphological awareness, lexical knowledge, and lexical inference leading to text comprehension in Chinese [29] may not operate in the same manner for understanding Japanese texts.

The role of grammatical knowledge for text comprehension

Despite some researchers (e.g., [56, 89]) highlighting the significance of grammatical knowledge in text comprehension, it has often been overshadowed by lexical knowledge. Weir [9] reported a strong correlation (r = 0.67) between grammatical knowledge and reading comprehension among non-native English speakers studying English at higher education institutions in the UK. In a study using SEM with 588 native-Japanese speaking university students learning English in Japan, Shiotsu and Weir [7] found a strong causal relationship between grammatical knowledge and reading comprehension, whereas lexical knowledge did not show such a relationship. However, the same study also revealed a comparable contribution of lexical and grammatical knowledge to reading comprehension for 107 native Japanese university students studying in the UK.

These observed differences in the contribution of lexical and grammatical knowledge to reading comprehension among Japanese students in Japan and the UK could be attributed to their distinct learning environments. Students in the UK would have more opportunities to use English in daily life and receive instruction from native English speakers, unlike their counterparts in Japan. Consequently, Japanese students in English-speaking countries would encounter various words and expressions regularly, leading to a simultaneous reliance on both lexical and grammatical knowledge for text comprehension.

On the other hand, Japanese students in Japan have limited exposure to English outside the classroom and might adopt a strategy of understanding English by translating it into Japanese. Given the significant differences in grammatical features between the two languages, grammatical knowledge becomes vital for translation. As a result, these students might depend more on grammatical knowledge for text comprehension. The contrasting English learning environments in Japan and the UK could therefore explain the diverging outcomes in the studies.

For native Japanese speakers learning English, Yamashita [37] found that while both lexical and grammatical knowledge had a significant impact on reading comprehension, lexical knowledge seemed to contribute more. Similarly, Bosser [38] found that both lexical and grammatical knowledge were important predictors for Dutch reading comprehension among native Turkish speakers learning Dutch as a second language. However, Bosser [38] also observed that lexical knowledge was a stronger predictor than grammatical knowledge. Therefore, although grammatical knowledge does contribute to reading comprehension, its influence appears to be less significant than that of lexical knowledge.

Yamato, Tamaoka, and Chu [39] studied the impact of lexical and grammatical knowledge on the phrase-by-phrase processing of Japanese texts (i.e., self-paced reading) by native Chinese speakers learning Japanese. To compare reading speeds, participants were divided into three groups based on their Japanese vocabulary and grammar test scores. They found that lexical knowledge had a more significant influence on reading speed than grammatical knowledge, and it contributed to reading speed for both single phrases and continuous sequences of phrases. However, grammatical knowledge’s impact was limited to certain phrases with complex structures, suggesting that grammar still may play a role in resolving structural difficulties.

Yamato and Tamaoka [40] compared 20 matched pairs of native Chinese and Korean speakers, who were equally proficient in lexical and grammar knowledge. The study found that native Chinese speakers processed words in Japanese kanji characters more quickly than Korean speakers. Due to the similarity between Chinese and Japanese kanji script [41], Chinese speakers were able to process kanji-presented words faster. This suggests that Japanese text comprehension may be partially achieved through morphological/lexical knowledge. As Chinese does not have verb and adjective inflections [42, 43], native Chinese speakers might primarily focus on Japanese kanji characters for text comprehension, similar to their approach in Chinese.

Bridging between lexical/grammatical knowledge and text comprehension

Lexical and grammatical knowledge are essential for understanding text, but there is a distinction between this knowledge and overall text comprehension [44]. We typically use our life experiences to deduce the meaning of partially understood texts. In reading and listening comprehension, we make inferences based on cues provided within sentences. This information is not explicitly stated, so we depend on our experiential knowledge to decipher the text’s meaning. Zhang et al. [29] labeled this process as lexical inference, which involves logical reasoning within individual sentences.

To fully comprehend a text’s message, a broader form of inference, called sentential inference, is needed to connect different sentences. Kintsch [15] and van Dijk & Kintsch [16] introduced two textual structures: microstructure and macrostructure. Microstructure concerns lexical and morphosyntax relations within a sentence, while macrostructure deals with inter-sentential relations. Lo et al. [45] found that cognitive-linguistic skills contribute differently to microstructure and macrostructure understanding in Chinese elementary school children. Intra-sentence microstructural comprehension helps in understanding single sentences (lexical inference), while inter-sentence macrostructure comprehension aids in understanding larger texts (sentential inference).

Long et al. [13] discovered that skilled readers outperformed less-skilled readers in identifying theme-related inferences in specific passages. Murray and Burke [14] noted that high-skilled readers automatically recalled related lexical items, while low-skilled readers did not. These first-language understanding studies suggest that the ability to draw lexical and sentential inferences is crucial for reading comprehension.

The current study focused on three aspects of sentential inference, namely: intentional expressions, co-referential resolution, and causal relations. First, understanding indirect messages in a series of sentences, such as intentions not explicitly stated, is necessary to comprehend the actual message. For example, a listener can infer the unspoken intention of a speaker’s comment about a cold room and act accordingly, like closing a nearby window. This ability to grasp indirect speech is important for overall text comprehension.

The second aspect of sentential inference is co-referential resolution, which occurs when multiple expressions refer to the same person or thing. For example, in “John went to the store, and he bought some cheese," the words "John" and "he" are co-referential because they both refer to the same person. Co-references help link sentences and can refer to a whole sentence’s context, such as a Japanese deictic word like kore ‘this’, sore ‘that’, or are ‘that’ (distal). Understanding these references is what is denoted as co-referential resolution [46].

The last aspect is causal relation, which deals with cause and effect in sentences. Causal relations develop a text’s message and are often indicated by conjunctions like ‘because’, ‘so’, ‘hence’, ‘thus’, or adverbs like ‘accordingly’ and ‘consequently’. In Japanese, causal relations are indicated by conjunctions and subordinate conjunctive particles. Learners of Japanese as a foreign language may face difficulties with conjunctive particles like -kara and -node due to their similarities, possibly leading to pragmalinguistic failure [47]. Understanding a series of causal relations is vital for grasping a text’s overall message, as it involves processing multiple sentences and their logical reasoning steps.

Two existing models of text understanding

The current study utilizes structural equation modeling (SEM) to examine the relationship between lexical/grammatical knowledge, sentential inference, and text comprehension, specifically focusing on native Chinese speakers learning Japanese. The SEM analysis encompasses four primary variables: lexical knowledge, grammatical knowledge, sentential inference, and text comprehension.

Two proposed models outline the key factors in text comprehension. The first, the "partially-mediated" model, posits that lexical/grammatical knowledge and sentential inference work in tandem to facilitate text understanding (see Fig 1). In this model, second language learners employ lexical and grammatical knowledge to comprehend sentence meaning, while both lexical/grammatical knowledge and sentential inference operate concurrently to aid in understanding the entire text. This model anticipates that word meaning, grammatical features, and their integration with sentential relations all play a role in text comprehension.

thumbnail
Fig 1. The partially-mediated model of text comprehension.

https://doi.org/10.1371/journal.pone.0284331.g001

The second model, the "whole-mediated" model, suggests that lexical and grammatical knowledge offer information to understand individual sentences (see Fig 2). However, comprehending a single sentence does not necessarily convey a text’s overall message. To understand the sequence of messages, sentences must be semantically connected to preceding and following sentences. Consequently, sentential inference may function independently to merge the meanings of multiple sentences, each initially based on lexical/grammatical knowledge. In this model, the final message of a text is understood through the integration of a series of sentential semantic inputs.

Our study tested these two models using a large dataset of native Chinese speakers learning Japanese at Xi’an International Studies University, China. By comparing the models, we investigated the causal relationships between lexical/grammatical knowledge and sentential inference in achieving text comprehension. Notably, since Dixon and Marchman [48] encountered difficulty in determining whether lexical or grammatical knowledge precedes the other, the present SEM analysis incorporated a correlation between them.

Methodology

Participants

A total of 281 native Chinese-speaking university students learning Japanese at Xi’an International Studies University participated in this study. They had no prior experience studying Japanese before joining the university. At the time of testing, the students had either one year (N = 146; 124 female) or two years (N = 135; 116 female) of academic Japanese training, and none of them had lived in Japan. Their ages ranged from 16.1 to 26.7 years, with an average age of 19.7 ± 1.0 years for those with one-year of learning experience and 20.8 ± 1.1 years for those with two years of experience. All data collected was securely stored, and participants were given numerical pseudonyms for privacy. Before beginning the study, participants were informed about the study objectives, procedures, risks, and benefits, and verbal informed consent was obtained from each participant to ensure ethical research conduct. The study received approval from the ethics committee of the School of Japanese Culture and Economy at Xi’an International Studies University.

Measurements

The study investigated four main variables, namely: lexical knowledge, grammatical knowledge, sentential inference, and text comprehension through multiple-choice questions (one correct answer out of four choices), lasting 180 minutes in total (four 45-minute classes).

Lexical knowledge refers to understanding vocabulary in a language, including individual word meanings, spelling, connotations, and how words form phrases and sentences. It is a critical component of language proficiency and communication and is measured in nearly all influential Japanese language proficiency tests (e.g., the Japanese Language Proficiency Test or JLPT). In this study, lexical knowledge was measured through four tests based on word categories: Chinese-origin words (Kango), Japanese-origin words (Wago), alphabetic loanwords (Gairaigo), and function words, which serve a grammatical function rather than conveying lexical meaning. In Japanese, function words are crucial for constructing coherent and meaningful sentences, as they indicate the grammatical relationships between various sentence elements. All words in the test were taken from the JLPT [49]. The Chinese-origin words were selected from two-kanji compound words such as guti (‘complaint’), hukyô (‘recession’), syumi (‘hobby’), yûbô-da (‘promising’), kengaku-suru (‘to visit’) and tyûmon-suru (‘to order’). Examples of Japanese-origin words are arasuzi (‘story’), sakasama (‘upside-down’), yayakosii (‘complicated’), detarame-na (‘nonsense’), hakado-ru (‘to make progress’), and unazu-ku (‘to nod and agree’). All loanwords were taken from alphabetic languages, such as saizu (‘size’), kyaria (‘career’), dorai-da (‘unsentimental’), rûzu-da (‘loose’), massâzi-suru (‘to massage’) and suttopu-suru (‘to stop’). Function words were defined as grammatical words consisting of more than two morphemes with no inflections such as ga-hayai-ka (‘no sooner … than …’), ta-tokoro-de (‘even if…’), itaru-made (‘until…’ or ‘up to…’), kawa-kiri-ni (‘start by…’), and yogi-naku-sa-reru (‘be obliged to…’). Twelve words in each lexical category, that is: Chinese-origin, Japanese-origin, and alphabetic loanwords consisted of four nouns, four adjectives and four verbs. In addition, all twelve words were selected from levels 1 and 2 in the vocabulary list of Japanese Language Proficiency Test (JLPT), cross-matched by JLPT levels 1–2 over three word categories of Kango, Wago and Gairaigo. As shown in Table 1, Cronbach’s alpha, calculated across 281 participants with 48 test items, was relatively high at 0.739.

thumbnail
Table 1. Full scores, means, standard deviations, and correlations for 12 language tests (observed variables).

https://doi.org/10.1371/journal.pone.0284331.t001

Grammatical knowledge

Grammatical knowledge refers to knowledge of the rules and principles that regulate the structure and usage of language, encompassing an awareness of parts of speech, sentence structure, and the interconnectedness of words and phrases. As depicted in Fig 3, grammatical knowledge was measured through three subtests: morphological inflections, local-dependency, and complex structure. The subtest for morphological inflections consisted of questions about changes within a single lexical unit. For example, Japanese verbs and adjectives have complex inflections, and a predicate presented as a single unit in Japanese can be formed by adding various morphemic units indicating negation, past tense, voice, aspect, and mood. For instance, the verb stem seme-ru ‘to blame’ can be compounded with the negation–nai and the past tense–ta, resulting in the predicate of seme-nakat-ta ‘did not blame’. In the test, participants were asked to fill in an empty sentence bracket, such as the following example: Ayamatte kabin-o kowasita watasi-o, titi-wa () ‘My father (did not blame) me, who mistakenly broke the flower vase’. Out of four choices: seme-nakat-ta, seme-nai-dat-ta, semeru-nakat-ta, and seme-naku-te-dat-ta, seme-nakat-ta was the correct answer for ‘did not blame’.

thumbnail
Fig 3. Different types of (local) dependencies in Japanese sentence comprehension.

https://doi.org/10.1371/journal.pone.0284331.g003

As depicted in Fig 3, the subtest for local dependency required participants to determine whether two neighboring units formed a correct expression. For example, the verb tukur-u ‘to cook’ can be modified by the adverb zyôzu-ni ‘very well’ to mean ‘to cook very well.’ However, when used as an adjective to modify a noun, the particle -na must be used, as in zyôzu-na e ‘a well-drawn picture.’ The difference between these two words is that -ni should be used for the adverb and -na for the adjective. In this subtest, the correct answer for the sentence Kanozyo-wa itumo tamagoyaki-o () tukuru ‘She always cooks omelets (very well)’ could not be determined by lexical knowledge alone. The correct choice among zyôzu-ni, zyôzu-de, zyôzu-no, and zyôzu-na could only be determined by referencing the verb tukuru (to cook) at the end of the sentence.

The third subtest for complex structure required participants to refer to an entire sentence to determine the correct answer, as illustrated in Fig 3 as a long-distance relation. In the example sentence Donnani kanozyo-ga (), ano daigaku-niwa gôkaku sinai darô ‘(No matter) how hard she (tries), she cannot pass the entrance examination of that university,’ the four choices were ganbattemo, ganbatte, ganbarunoni, and ganbaruga. Each of these four expressions could be used to form a correct sentence; however, since the final phrase of the sentence ended with a negative following the prior phrase donnani … temo [no matter how …], the correct choice could only be ganbattemo. Cronbach’s coefficient alpha, calculated across 281 participants with 36 test items, was relatively high at 0.793.

Sentential inference.

Sentential inference was measured using three subtests: intentional expressions, co-referential resolution, and causal relations. A total of eighteen questions were administered, six in each of the three categories. All questions in the three subtests consisted of more than two sentences. An example of an intentional expression is ‘Shall I fix you a coffee?’ with the response ‘If I drink coffee now, I won’t be able to sleep.’ Participants were asked to choose the best follow-up response from the first speaker: (a) ‘Then, only I will drink coffee.’, (b) ‘How much sugar would you like to have?’, (c) ‘If you take this pill, you can sleep well.’, or (d) ‘I also like coffee very much.’ The appropriate response is (a). The only cue for choosing the correct response was recognizing the intention of the listener’s statement ‘I cannot sleep.’

Coreferential resolution involves deictic words such as kore (this), sore (that), and are (distal ‘that’), which refer to a thing or idea from a previous sentence. For instance, in the sentence "Actually, I am afraid of cats. Please do not tell anyone about this," the phrase "this thing" (kono koto in Japanese) broadly refers to the speaker’s fear of cats rather than a specific word from the previous sentence.

Causal relations questions were created using multiple sentences that followed a logical progression. For example, "Last night, I read a book until 2 a.m. () it was really interesting, so I couldn’t stop reading." The correct answer was "datte" (because) rather than the other three choices—"dakara" (therefore), "dakedo" (even though), and "sorede" (then). Based on the meaning of the previous sentence, the causal expression "datte" explains why the speaker read a book until 2 a.m., stating that the book was captivating. These two sentences have a causal relationship, and this question evaluates understanding of causal relations. Cronbach’s coefficient alpha, calculated for 281 participants with 18 test items, was 0.638 (acceptable).

Text comprehension.

Text comprehension is defined as the semantic understanding of a whole text’s theme. It was measured using reading and listening comprehension tests with relatively long, visually-presented (reading) or auditory-presented (listening) texts, each consisting of one or two paragraphs. There were eight texts for reading comprehension and eight for listening comprehension. Each correct answer was worth one point. To ensure that the total study workload was manageable, we included only one question per text. This approach was considered reasonable, as high-quality questions probing deeper thinking, connections, and inference generation were used, making them suitable for measuring text understanding. Cronbach’s coefficient alpha, calculated for 281 participants with 16 test items, was 0.613 (acceptable).

Procedure

A series of tests was conducted on university students in two ninety-minute classes (180 minutes total) in quiet classrooms. Each Japanese language class had 30 students. Tests were administered to second-year students in five classes and third-year students in five classes (10 classes total) at the beginning of the academic year (September). Second-year students were considered to have one year of Japanese learning experience, while third-year students had two years. One or more of the paper’s authors and the students’ Chinese-speaking class instructors supervised the two test sessions in the 10 classes.

Results

For the 281 participants (146 second-year students and 135 third-year students), the full scores, means, standard deviations, correlations for the twelve tests, and Cronbach’s coefficient alphas for the four latent variables are shown in Table 1.

Model comparisons

Using AMOS [50] added on to SPSS Statistics Version 28, a SEM analysis was conducted with a covariance matrix on 281 Chinese students learning Japanese in China. The goodness-of-fit index (see Table 2) showed that the Chi-squares for both the partially-mediated model [χ2(48) = 72.25, p < .05] and the wholly-mediated model [χ2(50) = 72.88, p < .05] were significant, indicating that neither model ideally fit the data [51]. However, Chi-square tests are inherently dependent on sample size (e.g., [5255]). Thus, Jöreskog and Sörbom [54] proposed the χ2/df index for larger sample sizes, which indicates a good fit for values less than 2.00. Since the current sample size (N = 281) was relatively large, χ2/df was calculated as shown in Table 2. This resulted in both models fitting well, but the wholly-mediated model (1.46) scored slightly lower, demonstrating a better fit than the partially-mediated model (1.51).

thumbnail
Table 2. Comparison indices of the partially-mediated model and the wholly-mediated model.

https://doi.org/10.1371/journal.pone.0284331.t002

Table 2 summarizes four model comparison measures [55], and the best-fitting model for the present data was chosen using these criteria, where a lower value indicates a better fit. As indicated by all four criteria in Table 2, the wholly-mediated model was a better fit than the partially-mediated model across all values: AIC [56], CAIC [57], ECVI [58], and BIC [59]. Consequently, the wholly-mediated model was adopted for the present study.

The wholly-mediated model—Fit indicators

We evaluated the fit of the wholly-mediated model to the data obtained in the present study (see Fig 4) using six criteria (for details, see [53, 55]). All six indicators, namely: χ2/df [54], RMSEA [58], GFI [60], AGFI [55], NFI [61], and CFI [55] demonstrated that the model fit the data very well. Therefore, the wholly-mediated model depicted in Fig 4 was further investigated using SEM analysis.

thumbnail
Fig 4. SEM results of the wholly-mediated model.

Note: RMSEA: Root Mean Square Error of Approximation. GFI = Goodness of Fit Index. AGFI = Adjusted GFI. NFI = Normal Fit Index. CFI = Comparative Fit Index.

https://doi.org/10.1371/journal.pone.0284331.g004

The wholly-mediated model–SEM results

The SEM revealed the component structure for the latent variables within the wholly-mediated model, which was constructed with four latent variables as shown by the ellipses in Fig 4. The SEM causal structure was established such that grammatical and lexical knowledge connected to text comprehension via sentential inference. These four latent variables were estimated by twelve observed variables depicted by rectangles in Fig 4. For the latent variable of lexical knowledge, all path coefficients indicated that the four observed variables contributed significantly; Japanese-origin words (beta = 0.51), Chinese-origin words (beta = 0.70), alphabetic loanwords (beta = 0.64), and function words (beta = 0.61) were all important. The latent variable of grammar knowledge was measured by three observed variables. Very high path coefficients were found for morphological inflections (beta = 0.86), and for structural complexity (beta = 0.79), but relatively lower for local dependency (beta = 0.40). Morphological inflections and, to a lesser degree, structural complexity seemed to greatly contribute to grammar knowledge.

Similarly, the latent variable of sentential inference was measured by three observed variables, which showed relatively high contributions: causal relations (beta = 0.68), intentional expressions (beta = 0.60), and co-referential resolution (beta = 0.57). The fourth latent variable of text comprehension was measured by reading comprehension (beta = 0.75) and listening comprehension (beta = 0.62), which showed a large contribution.

The wholly-mediated model was applied to investigate the influence of sentential inference on bridging the gap between lexical/grammatical knowledge and text comprehension. Since the correlation between lexical knowledge and grammatical knowledge was very high (r = 0.89, p < .001), both factors seemed to be acquired in a mutually-related manner. Native Chinese speakers displayed a strong causal relation between lexical knowledge and sentential inference (beta = 0.85, p < .001), but they showed no significant causal relation between grammatical knowledge and sentential inference (beta = 0.15, ns). Moreover, a strong significant causal relation was found between sentential inference and text comprehension (beta = 0.94, p < .001). These SEM results indicate that Chinese speakers learning Japanese rely heavily on their lexical knowledge to support their sentential inference, which in turn contributed greatly to text comprehension. In sum, a clear sequential relation from lexical knowledge via sentential inference to text comprehension was observed in the present study.

The indirect effect from lexical knowledge to text comprehension was significant (beta = 0.72, p < .01) while the indirect effect from grammatical knowledge to text comprehension was not significant (beta = -0.198, ns). The significant probabilities (p-value) of the indirect effects were calculated 500 times by bootstrapping. The indirect effects also provided evidence for the causal pathway from lexical knowledge, through sentential inference, to text comprehension in the mediated model.

Discussion

The primary goal of reading or listening to a text is to grasp the intended message. While lexical knowledge and basic grammatical understanding may be sufficient for comprehending individual sentences, it often falls short when trying to understand a series of sentences in Japanese or other languages. This study aimed to explore the role of sentential inference in bridging the gap between lexical/grammatical knowledge and overall text comprehension. Four tests (i.e., lexical knowledge, grammatical knowledge, sentential inference, and text comprehension) were conducted on 281 native Chinese-speaking students learning Japanese at a Chinese university to examine the causal relationships among these abilities. Two distinct models were considered: the partially-mediated model, which views sentential inference as just one of many factors contributing to text comprehension on par with lexical/grammatical knowledge; and the wholly-mediated model, which posits sentential inference as separate from lexical/grammatical knowledge and serving as a bridge to broader text comprehension (e.g., [1014]). In this model, sentential inference is crucial for integrating sentence-level information to understand the entire message of a text. Our SEM analysis revealed that the wholly-mediated model better fits the data than the partially-mediated model, as further demonstrated in Fig 4. This finding suggests a sequential causal relationship that starts with lexical knowledge (but not grammatical knowledge), moves through sentential inference, and ultimately leads to text comprehension. The current study highlights the importance of sentential inference as a central mediator of lexical knowledge in facilitating holistic text understanding in Japanese.

In this study, we analyzed participants who had completed one and two years of Japanese language learning together. However, considering that increasing proficiency could potentially affect the variables being investigated, we re-evaluated the data using the wholly-mediated model for Grade 1 (N = 146) and Grade 2 (N = 135) separately. This analysis produced results like those obtained when combining all participants (N = 281). Nevertheless, the goodness-of-fit indices were lower than those for the full sample. Consequently, we opted to retain the single-group structural equation modeling (SEM) analysis with a sample size of N = 281.

The core finding of this study is that sentential inference is essential for bridging the gap between lexical knowledge and comprehensive understanding of Japanese texts. In a study on learning Chinese as a foreign language, Zhang et al. [29] discovered that lexical inference served as a connecting tool between morphological awareness/lexical knowledge and Chinese reading comprehension for native English speakers. As Chinese texts are entirely written in hànzì, English speakers must first acquire knowledge of the morphological units of hànzì characters to read Chinese. Lexical inference is automatically employed as a mediating factor for reading comprehension in Chinese. Consequently, differences may exist in how Japanese and Chinese text comprehension occurs. Specifically, sentential inference might act as a bridge between lexical knowledge and Japanese text comprehension, while lexical inference appears more involved in understanding Chinese texts. It should be noted that it is also possible that sentential inference connects Chinese vocabulary knowledge and text comprehension, with the only difference being that lexical inference is required prior to sentential inference (note: we like to thank an anonymous reviewer for pointing this out).

Although not examined in our current paper, it is conceivable that individual differences in working memory capacity also contribute to complete text understanding. For instance, successful inferences must be stored in memory, particularly for long and dense texts, as subsequent interpretation relies on them. Future research is needed to determine if this factor is significant for our empirical setup.

It is important to note that our findings are currently interpreted in the context of successful and complete text comprehension. Our tests assessed whether whole text understanding was achieved, and our model was built on this data. This does not imply that every reading situation must follow this exact pattern (e.g., not all text comprehension results in complete understanding). Undoubtedly, scenarios may exist where, for example, a low-proficiency L2-learner could still benefit from lexical information aiding text comprehension despite not fully achieving sentential inference. Other cases might involve native speakers confronted with complex legal texts, which typically demand significant inferencing skills from readers. However, failure to achieve sentential inference often leads to incomplete overall comprehension, causing nuances or even essential points to be misunderstood.

Considering the findings from the current study, several implications for L2 acquisition and teaching could be drawn. First, the results underscore the importance of emphasizing lexical knowledge development in language learning, as it appears to play a more significant role in sentential inference and text comprehension than grammatical knowledge. Consequently, L2-educators might consider prioritizing the development of students’ vocabulary, focusing on both the breadth and depth of their lexical knowledge. Second, the study highlights the need to focus on sentential inference training in L2-instruction. Educators might therefore incorporate activities and strategies that enhance students’ abilities to make inferences at the sentence level, such as tasks that involve identifying context clues, drawing connections between words, and predicting the meaning of unfamiliar words or phrases. Lastly, our findings potentially suggest a reevaluation of the role of grammar instruction in L2-acquisition and teaching. Given that grammatical knowledge seems not to significantly contribute to text comprehension through sentential inference, L2-educators may need to place greater emphasis on communicative competence and contextual understanding, though further research, admittingly, needs to explore the role of grammar in different aspects of language learning and proficiency.

References

  1. 1. Beck I. L., Perfetti C. A., & McKeown M. G. (1982). Effects of long-term vocabulary instruction on lexical access and reading comprehension. Journal of Educational Psychology, 74, 506–521.
  2. 2. LaBerge D., & Samuels S. J. (1974). Toward a theory of automatic information processing in reading. Cognitive Psychology, 6, 293–323.
  3. 3. Takahashi N. (1996). Zidôki no kodomo no yomi nôryoku no kitei yôin nituite—Konponensyaru apurôti-ni yoru bunsekiteki enkyû [Reading ability of elementary school children: A componential approach]. Sinrigaku Kenkyû [Japanese Journal of Psychology], 67, 186–194.
  4. 4. Tamaoka K., Leong C. K., & Hatta T. (1992). Effects of vocal interference on identifying kanji, hiragana and katakana words by skilled and less skilled Japanese readers in Grades 4–6. Psychologia, 35, 33–41.
  5. 5. Anderson J. C. (2000). Assessing reading. Cambridge: Cambridge University Press.
  6. 6. Bernhardt E. B. (1991). Reading development in a second language: Theoretical, empirical, and classroom perspectives. Norwood, NJ: Ablex.
  7. 7. Shiotsu T. & Weir C. J. (2007). The relative significance of syntactic knowledge and vocabulary breadth in the prediction of reading comprehension test performance. Language Testing, 24, 99–128.
  8. 8. Urquhart A. H., & Weir C. J. (1998). Reading in a second language: Process, product, and practice. New York: Longman.
  9. 9. Weir C. J. (1983). Identifying the language needs of overseas students in tertiary education in the United Kingdom. Ph.D. dissertation, London University, UK.
  10. 10. Binder K. S., Chace K. H., & Manning M. C. (2007). Sentential and discourse context effects: adults who are learning to read compared with skilled readers. Journal of Research in Reading, 30, 360–378.
  11. 11. Graesser A. C., Singer M., & Trabasso T. (1994). Constructing inferences during narrative text comprehension. Psychological Review, 101, 371–395. pmid:7938337
  12. 12. Hannon B., & Daneman M. (1998). Facilitating knowledge-based inferences in less-skilled readers. Contemporary Educational Psychology, 23, 149–172. pmid:9576839
  13. 13. Long D. L., Oppy B. J., & Seely M. R. (1994). Individual differences in the time course of inferential processing. Journal of Experimental Psychology: Learning, Memory, and Cognition, 20, 1456–1470.
  14. 14. Murray J. D., & Burke K. A. (2003). Activation and encoding of forward inferences: The role of reading skill. Discourse Processes, 35, 81–102.
  15. 15. Kintsch W. (1998). Comprehension: A paradigm for cognition. Cambridge, UK: Cambridge University Press.
  16. 16. van Dijk T. A., & Kintsch W. (1983). Strategies of discourse comprehension. New York, NY: Academic Press.
  17. 17. Gough P. B., & Tunmer W. E. (1986). Decoding, reading, and reading disability. Remedial & Special Education, 7, 6–10.
  18. 18. Juel C., Griffith P. L., & Gough P. B. (1986). Acquisition of literacy: A longitudinal study of children in first and second grade. Journal of Educational Psychology, 78, 243–255.
  19. 19. Tunmer W. E., & Hoover A. (1992). Cognitive and linguistic factors in learning to read. In Treiman R., Ehri C., & Gough P. B. (Eds.), Reading acquisition (pp. 175–214). Hillsdale, NJ: Erlbaum.
  20. 20. Leong C. K., & Ho M. K. (2008). The role of lexical knowledge and related linguistic components in typical and poor language comprehenders of Chinese. Reading and Writing: An Interdisciplinary Journal, 21, 559–586.
  21. 21. Leong C. K., & Tamaoka K. (1995). Use of phonological information in processing kanji and katakana by skilled and less skilled Japanese readers. Reading and Writing, 7, 377–393.
  22. 22. Takahashi N. (2001). Gakudôki-ni okeru dokkai nôryoku no hattatu katei—1–5 nensei no zyûdantekina bunseki [Developmental changes in reading ability: A longitudinal analysis of Japanese children from first to fifth grade]. Kyôiku Sinrigaku Kenkyû [Japanese Journal of Educational Psychology], 49, 1–10.
  23. 23. Zhang D. (2012). Vocabulary and grammar knowledge in second language reading comprehension: A structural equation modelling study. The Modern Language Journal, 96, 558–575.
  24. 24. Hu M. H-C., & Nation I. S. P. (2000). Unknown vocabulary density and reading comprehension. Reading in a foreign language, 13, 403–430.
  25. 25. Nation I. S. P. (2001). Learning vocabulary in another language. Cambridge: Cambridge University Press.
  26. 26. Stahl S. A., & Nagy W. E. (2006). Teaching word meanings. Mahwah, NJ: Lawrence Erlbaum Associates.
  27. 27. Komori K., Mikuni J., & Kondo A. (2004). Bunsyô rikai-o sokusin suru goi tisiki no ryôteki sokumen—Kitigo ritu no ikiti tansaku no kokoromi [What percentage of known words in a text facilities reading comprehension?—A case study for exploration of the threshold of known words coverage]. Nihongo Kyôiku [Journal of Japanese Language Teaching], 120, 83–92.
  28. 28. Mikuni J., Komori K, & Kondo A. (2005). Tyôkai-ni okeru goi tisiki no ryôteki sokumen-ga naiyô rikai-ni oyobosu eikyô - Dokkai tono hikaku kara [The effects of text coverage of known words on listening comprehension: A comparison with reading comprehension]. Nihongo Kyôiku [Journal of Japanese Language Teaching] 125. 76–85.
  29. 29. Zhang H., Zhang X., Li M. & Zhang Y. (2021). Morphological awareness in L2 Chinese reading comprehension: Testing of mediating routes, Frontiers in Psychology, 12:736933. pmid:34721208
  30. 30. Packard J. L. (2000). The morphology of Chinese: A linguistic and cognitive approach. New York, NY: Cambridge University Press.
  31. 31. Wang M., Cheng C., and Chen S. W. (2006). Contribution of morphological awareness to Chinese-English biliteracy acquisition. Journal of Educational Psychology, 98, 542–553.
  32. 32. Koda K. (2000). Cross-linguistic variations in L2 morphological awareness. Applied Psycholinguistics, 21, 297–320.
  33. 33. Chen Y-M. (2002). Nihongo nizi kanzi goi to sore-ni taiôsuru tyûgokugo nizi kanzi goi-wa onazi ka–Taiwan oyobi Tyûgoku no tyûgokugo tono hikaku [Are two-kanji compounds in Japanese the same as those in Chinese–A comparison with the Chinese language in Taiwan and the mainland China]. Gengo Bunka to Nihongo Kyôiku [Language Culture and Japanese Education], 24. 40–53.
  34. 34. Hishinuma T. (1983). Nihongo to tyûgokugo no zyôyô zitai [The commonly-used kanji orthography in Japanese and Chinese]. Tyûgoku Kenkyû Geppô [Monthly-journal of Chinese Language Studies], 428, 1–20.
  35. 35. Hishinuma T. (1984). Tyûgokugo no hyôzyun zitai to nihon no zyôyô zitai [The standard orthography in Chinese and the common orthography in Japanese]. Nihongogaku [The Study of the Japanese Language] 3, 32–40.
  36. 36. Tamaoka K. (1991). Psycholinguistic nature of the Japanese orthography. Gengo Bunka Kenkyû [Studies of Language and Culture], 11(1), 49–82.
  37. 37. Yamashita J. (1999). Reading in a First and a Foreign Language: A Study of Reading Comprehension in Japanese (the L1) and English (the L2). Ph.D. Dissertation, Lancaster University, UK.
  38. 38. Bosser B. (1992). Reading in two languages: A study of reading comprehension in Dutch as a second language and in Turkish as a first language. Rotterdam, The Netherlands: Drukkerij Van Driel.
  39. 39. Yamato Y., Tamaoka K., & Chu X. (2013). Tyûgokuzin nihongo gakusyûsya no tekisuto no on-rain yomi-ni okeru goi to bunpô nôryoku no eikyô [Effects of lexical and grammatical ability for on-line text processing by native Chinese speakers learning Japanese]. Nihon Kyôka Kyôiku Gakkaisi [The Bulletin of Japanese Curriculum Research and Development], 36(1), 33–43.
  40. 40. Yamato Y., & Tamaoka K. (2013). Tyûgokugo bogo wasya to kankokugo bogo wasya no nihongo tekisuto no yomi syori-ni okeru bogo tono gengoteki ruizisei no eikyô [Effects of linguistic similarity on Japanese text processing by native Chinese and Korean speakers]. Koide Kinen Nihongo Kyôiku Kenkyûkai Ronbunsyû [Koide-memorial Journal of Japanese Language Education] 21, 229–268.
  41. 41. Tamaoka K. (2014). The Japanese writing system and lexical understanding. Japanese Language and Literature (The American Association of Teachers of Japanese, AATJ), 48, 431–471.
  42. 42. Maruo M. (2010). Kiso kara hatten made yoku wakaru tyûgokugo bunpô [Understanding Chinese grammar well from basic to advanced]. Tokyo: ASK Publisher.
  43. 43. Sun C. (2006). Chinese: A Linguistic Introduction. New York, NY: Cambridge University Press.
  44. 44. Lambrecht K. (1994). Information structure and sentence form: Topic, focus, and the mental representations of discourse referents. Cambridge, UK: Cambridge University Press.
  45. 45. Lo L-Y., Ho C. S-H., Wong Y-K., Chan D. W-O. & Chung K. K-H. (2016). Understanding the Microstructure and Macrostructure of Passages Among Chinese Elementary School Children. Journal of Psycholinguistic Research, 45, 1287–1300. pmid:26573861
  46. 46. Reinhart T. (1983). Anaphora and semantic interpretation. London UK: Croom Helm.
  47. 47. Thomas J. (1983). Cross-cultural pragmatic failure. Applied Linguistics, 4, 91–112.
  48. 48. Dixon J. A. & Marchman V. A. (2007). Grammar and the lexicon: Developmental ordering in language acquisition. Child Development, 78, 190–212. pmid:17328700
  49. 49. Japan Foundation [Kokusai Kôryû Kikin] and Japan Educational Exchange and Services [Nihon Kokusai Kyôiku Sien Kyôkai] (2002). Nihongo nôryoku siken syutudai kizyun—Kaiteiban [The criteria of Japanese proficiency tests–Revised edition]. Tokyo: Japan Foundation and Japan Educational Exchange and Services.
  50. 50. Arbuckle J. L. (2009). AMOS 18.0 User’s Guide. Chicago, IL: SPSS Inc.
  51. 51. Barrett P. (2007). Structural equation modeling: Adjudging model fit. Personality and Individual Differences, 42, 815–824.
  52. 52. Bentler P. M., & Bonnet D. C. (1980). Significance tests and goodness of fit in the analysis of covariance structures. Psychological Bulletin, 88, 588–606.
  53. 53. Hooper D., Coughlan J., & Mullen M. (2008). Structural equation modeling: Guidelines for determining model fit. Electronic Journal of Business Research Models, 6, 53–60.
  54. 54. Jöreskog K., & Sörbom D. (1993). LISREL 8: Structural equation modeling with the SIMPLIS command language. Chicago, IL: Scientific Software International.
  55. 55. Schermelleh-Engel K., Moosbrugger H., & Müller H., (2003). Evaluating the fit of structural equation models: Tests of significance and descriptive goodness-of-fit measures. Methods of Psychological Research Online, 8, 23–74.
  56. 56. Akaike H. (1987). Factor analysis and AIC. Psychometrika, 52, 317–332.
  57. 57. Bozdogan H. (1987). Model selection and Akaike’s information criterion (AIC): The general theory and its analytical extensions. Psychometrika, 52, 345–370.
  58. 58. Browne M. W., & Cudeck R. (1993). Alternative ways of assessing model fit. In Bollen K. A. & Long J. S. (Eds.), Testing structural equation models (pp. 136–162). Newbury Park, CA: Sage.
  59. 59. Schwarz G. (1978). Estimating the dimension of a model. The Annals of Statistics, 6, 461–464.
  60. 60. Marsh H. W., & Grayson D. (1995). Latent variable models of multitrait-multimethod data. In Hoyle R. (Ed.), Structural equation modeling: Concepts, issues and applications (pp. 177–198). Thousand Oaks, CA: Sage.
  61. 61. Kaplan D. (2000). Structural equation modeling: Foundation and extensions. Thousand Oaks, CA: Sage.