RuRussian as a Public Dictionary: A Systems-Level Perspective

via Dev.to Webdev · Henri Wang

0. Framing the Problem

If you model a “dictionary” as a function:

    f(word) → meaning

then most traditional dictionaries are just key–value stores with light annotations.

RuRussian breaks this abstraction. Instead, it behaves more like:

    f(word_form) → structured linguistic state space

where the output is not a scalar (a translation) but a rich object graph encoding morphology, syntax, semantics, and usage.

This is the key mental shift: RuRussian is not a lookup table; it is a runtime over a linguistic knowledge graph.

1. System Overview

At a high level, rurussian.com is a hybrid system combining a lexical database, a curated corpus, a grammar engine, and a human + AI annotation layer.

You can think of it as a read-optimized OLAP system for language, where queries are exploratory rather than transactional.

2. Core Architecture

2.1 Morphology as the Primary Index

In most systems, the primary key is the lemma. In RuRussian, the effective key is closer to:

    (word_form, stress_pattern, aspect)

The syst
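
To make the composite key from section 2.1 concrete, here is a minimal sketch in Python. It is an illustration only, not RuRussian's actual schema: the names FormKey, Entry, INDEX, and lookup are hypothetical, and encoding stress_pattern as the 0-based character index of the stressed vowel is an assumption made for the example. It shows why a plain f(word) → meaning table falls short: the same spelling can map to several entries that differ only in stress.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass(frozen=True)
class FormKey:
    """Hypothetical composite key: (word_form, stress_pattern, aspect)."""
    word_form: str
    stress_pattern: int        # assumption: 0-based index of the stressed vowel
    aspect: Optional[str]      # "imperfective" / "perfective"; None for non-verbs


@dataclass
class Entry:
    """Hypothetical structured result: an object, not a single translation."""
    lemma: str
    morphology: dict[str, str]
    translations: list[str]
    examples: list[str] = field(default_factory=list)


# Tiny illustrative index. "замок" is a real homograph in Russian:
# stressed on the first vowel it means "castle", on the second "lock".
INDEX: dict[FormKey, Entry] = {
    FormKey("замок", 1, None): Entry(
        lemma="замок",
        morphology={"pos": "noun", "case": "nominative", "number": "singular"},
        translations=["castle"],
    ),
    FormKey("замок", 3, None): Entry(
        lemma="замок",
        morphology={"pos": "noun", "case": "nominative", "number": "singular"},
        translations=["lock"],
    ),
    FormKey("делать", 1, "imperfective"): Entry(
        lemma="делать",
        morphology={"pos": "verb", "form": "infinitive"},
        translations=["to do", "to make"],
    ),
}


def lookup(word_form: str) -> list[Entry]:
    """Return every entry whose surface form matches, across stress and aspect."""
    return [entry for key, entry in INDEX.items() if key.word_form == word_form]
```

Calling lookup("замок") returns two entries, which a lemma-keyed table would have to collapse into one. A real system would presumably use a secondary index on word_form rather than the linear scan shown here.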
