Privacy-Preserving Active Learning for heritage language revitalization programs across multilingual stakeholder groups
Privacy-Preserving Active Learning for heritage language revitalization programs across multilingual stakeholder groups Introduction: A Personal Discovery in Language Preservation While exploring the intersection of federated learning and natural language processing for my research on low-resource languages, I stumbled upon a fascinating challenge that would consume my next six months of experimentation. I was working with a community organization attempting to document a critically endangered heritage language spoken by fewer than 200 elderly speakers scattered across three countries. The ethical dilemma was immediate: how could we build machine learning models to help preserve their language without compromising their privacy or cultural sovereignty? During my investigation of differential privacy techniques, I realized that standard approaches failed to address the unique constraints of heritage language revitalization. These programs involve multiple stakeholder groups—elders who a
Continue reading on Dev.to
Opens in a new tab




