
RuRussian as a Public Dictionary: A Systems-Level Perspective
0. Framing the Problem

If you model a “dictionary” as a function:

    f(word) → meaning

then most traditional dictionaries are just key–value stores with light annotations. RuRussian breaks this abstraction. Instead, it behaves more like:

    f(word_form) → structured linguistic state space

where the output is not a scalar (translation), but a rich object graph encoding morphology, syntax, semantics, and usage. This is the key mental shift: RuRussian is not a lookup table; it is a runtime over a linguistic knowledge graph.

1. System Overview

At a high level, rurussian.com is a hybrid system combining a lexical database, a curated corpus, a grammar engine, and a human + AI annotation layer. You can think of it as a read-optimized OLAP system for language, where queries are exploratory rather than transactional.

2. Core Architecture

2.1 Morphology as the Primary Index

In most systems, the primary key is the lemma. In RuRussian, the effective key is closer to:

    (word_form, stress_pattern, aspect)

The syst
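
To make the composite key from 2.1 concrete, here is a minimal Python sketch of an index keyed on (word_form, stress_pattern, aspect) that returns structured entries rather than a single translation. It is an illustration under stated assumptions, not RuRussian's actual schema: the names `FormKey`, `Entry`, and `FormIndex` are hypothetical, stress is assumed to be encoded as the 1-based index of the stressed syllable, and the sample entries are hand-written for the demo.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple


@dataclass(frozen=True)
class FormKey:
    """Hypothetical composite key: (word_form, stress_pattern, aspect)."""
    word_form: str
    stress_pattern: int            # assumption: 1-based index of the stressed syllable
    aspect: Optional[str] = None   # "perfective" / "imperfective"; None for non-verbs


@dataclass
class Entry:
    """Hypothetical structured result: more than one translation string."""
    lemma: str
    morphology: Dict[str, str]
    glosses: List[str] = field(default_factory=list)


class FormIndex:
    """Maps a surface form to every analysis that shares its spelling."""

    def __init__(self) -> None:
        self._entries: Dict[FormKey, Entry] = {}
        self._keys_by_form: Dict[str, List[FormKey]] = {}

    def add(self, key: FormKey, entry: Entry) -> None:
        # Register the key under its spelling only once, then store the entry.
        if key not in self._entries:
            self._keys_by_form.setdefault(key.word_form, []).append(key)
        self._entries[key] = entry

    def lookup(self, word_form: str) -> List[Tuple[FormKey, Entry]]:
        # One spelling can resolve to several keys (different stress or aspect).
        keys = self._keys_by_form.get(word_form, [])
        return [(k, self._entries[k]) for k in keys]


# Illustrative data only, written by hand for this sketch.
noun = {"pos": "noun", "case": "nominative", "number": "singular"}
verb = {"pos": "verb", "form": "infinitive"}

index = FormIndex()
index.add(FormKey("замок", 1), Entry("замок", dict(noun), ["castle"]))
index.add(FormKey("замок", 2), Entry("замок", dict(noun), ["lock"]))
index.add(FormKey("засыпать", 2, "perfective"),
          Entry("засыпать", dict(verb), ["to fill up"]))
index.add(FormKey("засыпать", 3, "imperfective"),
          Entry("засыпать", dict(verb), ["to fall asleep"]))

for key, entry in index.lookup("замок"):
    print(key.stress_pattern, key.aspect, entry.glosses)
```

The point of the sketch is that a lemma-only key would collapse both readings of "замок" into one record, while a key that includes stress (and, for verbs, aspect) keeps them as separate entries that a single spelling lookup can fan out to.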


