Research and Publications

Raxmatullayeva Malika Mo'min qizi

Ataboyev Nozimjon Bobojonovich

10.5281/zenodo.20284280

Abstract

Cultural lacunae are lexical or conceptual gaps that arise when a culture-bound term has no direct equivalent in another language. Traditional identification methods are subjective and do not scale. This study proposes a corpus-driven pipeline that detects cultural lacunae automatically using corpus tools and cross-lingual embeddings. Two comparable corpora, one of American English and one of Uzbek, were constructed at 5 million words each. The pipeline detected 147 candidate lacunae, with a strict precision of 72% and a lenient precision of 88.5%. The food, social ritual, and legal-administrative domains showed the highest lacuna density. Building on Ataboev’s (2019a, 2019b, 2020, 2024a, 2024b) corpus linguistics research, the study extends automatic corpus-based detection to the identification of cultural gaps.
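The abstract does not spell out how the embedding step works, so the following is only a minimal sketch of one plausible approach, not the authors' implementation. It assumes that source-language and target-language words have already been mapped into a shared cross-lingual vector space, and it flags a source term as a candidate lacuna when its nearest target-language neighbour falls below a cosine-similarity threshold. The function names, the threshold of 0.5, and the toy three-dimensional vectors are all hypothetical and are not taken from the study.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def candidate_lacunae(source_vecs, target_vecs, threshold=0.5):
    """Return (term, nearest_target_word, similarity) for every source term
    whose best match in the target language is weaker than `threshold`.

    The threshold is an illustrative assumption, not a value from the study.
    """
    candidates = []
    for term, vec in source_vecs.items():
        best_word, best_sim = None, -1.0
        for word, tvec in target_vecs.items():
            sim = cosine(vec, tvec)
            if sim > best_sim:
                best_word, best_sim = word, sim
        if best_sim < threshold:
            candidates.append((term, best_word, best_sim))
    # Weakest matches first: these are the strongest lacuna candidates.
    return sorted(candidates, key=lambda c: c[2])


# Toy vectors, invented purely for illustration (not data from the corpora).
uzbek = {"non": [0.9, 0.1, 0.0], "sumalak": [0.0, 0.1, 0.95]}
english = {"bread": [0.88, 0.15, 0.05], "law": [0.1, 0.9, 0.1]}

result = candidate_lacunae(uzbek, english)
for term, nearest, sim in result:
    print(f"{term}: nearest target word '{nearest}' (cosine {sim:.2f})")
```

In this toy setup only the term with no close neighbour in the target space is returned. In practice, candidates produced this way would still need manual review, which is presumably where the strict and lenient precision figures come from.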

Keywords:

cultural lacunae, corpus linguistics, automatic detection, comparable corpora, Uzbek corpus.

References

1. Ataboev, N. B. (2019a). ICT in linguistic studies: Application of electronic language corpus and corpus-based analysis. Test Engineering and Management, *81*, 4170-4176.
2. Ataboev, N. B. (2019b). Problematic issues of corpus analysis and its shortcomings. ISJ Theoretical & Applied Science, *10*(78), 170-173.
3. Ataboev, N. B. (2020). Corpus-based research on the language features of corpus linguistics: In the example of ECOCL. Language, *3*(2139), 950.
4. Ataboev, N. (2024a). Analysis of the media texts corpus in the prism of existing English diachronic corpora [Media matnlar korpusining mavjud ingliz tili diaxron korpuslari prizmasida tahlili]. Acta NUUz, *1*(1.2.1), 288-291. https://doi.org/10.69617/nuuz.v1i1.2.1.1242
5. Ataboev, N. (2024b). Diachronic corpora: The role of corpus linguistics methodology in the studies on language development [Diaxronik korpuslar: Til rivoji tadqiqida korpus lingvistikasi metodologiyasining o‘rni]. Acta NUUz, *1*(1.3), 272-276. https://doi.org/10.69617/nuuz.v1i1.3.1385
6. Ataboev, N. (2024c). Media texts as the main resource of language social expression and language enrichment. Foreign Languages in Uzbekistan, *2024*(1), 60-75.
