Lexical sophistication measures and writing proficiency: The case of Indonesian learners of Japanese (Published)
The present study tests two measures of lexical sophistication in writing proficiency (moving-average morphological richness and moving-average mean size of paradigm for testing the lexical diversity) and mean word length for testing writing form (plain, humble, honorification). The findings suggest that the three metrics work reliably. Regarding lexical diversity, moving-average of morphological richness (MAMR) and moving-average mean size of paradigm (MAMSP) of Indonesian Japanese learners-written texts are close to native Japanese-written data. Lexical complexity measured by word length by Indonesian Japanese learners is characterised by slightly less richness than native Japanese data but remains very close. Word length-frequency relationship in the Indonesian-written data presents outstanding fitting results to nine models, including the Poisson Model families and Binomial Model families, with 0.9918 as the lowest and 0.9987 as the highest determination coefficient R2. It is hoped that this study’s outcome may help develop an automatic evaluation of the writing proficiency of agglutinative languages with diverse writing forms.
Word length distribution of Japanese dialects (Published)
This study applies a mathematical linguistic approach to explore word length distribution of Japanese dialects to cluster dialects at a lexical level. Data were extracted from spoken recordings of native speakers from 47 areas. The findings revealed that the further south the area was, the longer the mean word length (MWL) became. In majority of dialects, MWL ranges from one to nine. The Saga dialect has the longest MWL (3.26). Further analysis of the MWL-frequency relationship via the Altmann-fitter reveals that MWL-frequency of all dialects fit more than 30 distribution models, including the binomial and Poisson families.
Citation: Wenchao Li (2022) Word length distribution of Japanese dialects, International Journal of Quantitative and Qualitative Research Methods, Vol.10, No.3, pp.7-16
Syntactic complexity in Japanese dialects (Published)
This study attempted the clustering of Japanese dialects at a syntactic level in light of dependency grammar. it is Kagoshima, a southwestern area of Japan, that bears the longest mean dependency distance (1.7851) and Akita of the north-eastern prefecture that shows the shortest mean dependency distance (1.1805). A test of Euclidean distances and clustering based on the MDD brings to light that, the north-eastern and north Kinki areas tend to bear a shorter MDD, indicating a less complex syntax in spoken dialect; the Kantoo and Chuubu, south Kinki, and Shikoku areas tend to present a medium syntactic complexity; the south areas tend to present long MDDs. In a broader sense, the more northward the region, the deeper the syntactic complexity of spoken dialect. The longest DD (3.4) is detected in Gifu prefecture. This has to do with its location, between eastern and western Japan, connected to seven prefectures: Toyama, Ishikawa, Fukui, Shiga, Aichi, Mie and Nagano.
Citation: Wenchao Li (2022) Syntactic complexity in Japanese dialects, International Journal of Mathematics and Statistics Studies, Vol.10, No.4, pp.1-14