Core vocabulary for language learning at Tokyo University of Foreign Studies,
linked to the Open English WordNet (OEWN) via the ILI across 23 languages.
Arabic, Assamese, German, English, Spanish, French, Indonesian, Japanese, Khmer, Korean, Lao, Mongolian, Malay, Burmese, Portuguese (Brazil), Portuguese, Russian, Thai, Filipino, Turkish, Urdu, Vietnamese, Chinese.
The TUFS Basic Vocabulary is the core vocabulary used for language teaching at Tokyo University of Foreign Studies. This project converts it into per-language wordnets in Global WordNet LMF 1.4 format and links each concept to the Open English WordNet via the Collaborative Interlingual Index (ILI).
Around 644 concepts are ILI-linked to the Open English WordNet (OEWN 2025). A further ~2,274 TUFS-internal synsets carry definitions extracted from the original TUFS commentary. Each entry may include lemmas, variant forms, morphological tags, pronunciation (hiragana, pinyin, IPA, audio), thematic domain labels, and example sentences with Japanese glosses.
Source SQL dumps and LMF XML are available on GitHub. Raw TUFS data originates from TUFS Open Language Resources.
If you use this resource, please cite:
Francis Bond, Hiroki Nomoto, Luis Morgado da Costa and Arthur Bond (2020). Linking the TUFS Basic Vocabulary to the Open Multilingual Wordnet. In Proceedings of the 12th Language Resources and Evaluation Conference (LREC 2020), pp. 3171–3177. https://aclanthology.org/2020.lrec-1.389/