Linguistics Software Research Intern

Computational Linguistics & Language Learning Platform

What we're building

We're developing Memmerse, a semantic web-based language learning platform that's also research infrastructure for computational linguistics. Think "if linguists designed Duolingo and LingQ with graph databases and AI."

This is a startup building a real product, not a research project. We approach development with an applied science mindset - using rigorous methods and research insights to build something people actually want to use.

What you'll work on

  • Semantic knowledge graphs for language learning (SPARQL, RDF, AWS Neptune)
  • AI-powered features for vocabulary acquisition and cultural immersion
  • Modern web development (NextJS, React, FastAPI, TypeScript)
  • Real linguistic problems: polysemy resolution, context-aware learning, cross-linguistic analysis
  • Chinese/Japanese language materials: Many features focus on character-based languages, so comfort with Chinese characters is essential for day-to-day work

What we care about

  • Shipping working code (we'll teach you the tools, help with AI coding assistance)
  • Understanding systems - how the pieces fit together, not memorizing algorithms
  • Domain curiosity - language learning, lexicography, or willingness to dive deep
  • Research mindset - this isn't just an app, it's infrastructure for studying language acquisition

What we don't care about

  • Leetcode performance or CS theory competitions
  • Years of experience (we prefer curiosity + learning ability)
  • Perfect knowledge of our tech stack (we'll help you learn)

Ideal background

  • Student in linguistics, cognitive science, CS, or a related field
  • Or self-taught developer with a strong interest in language learning
  • Experience with any of: graph databases, semantic web, NLP, or language learning tools
  • Required: Ability to work with Chinese characters (or kanji) – you should be comfortable typing and reading them using tools like Pleco, even if you're still learning (think: A2–B1 level or higher). Many of our current features focus on East Asian languages.
  • Bonus: Fluency in any language other than English
  • Extra credit: Understanding of lexicography, corpus linguistics, or educational technology

Requirements

  • Professional fluency in English
  • US citizens only (eventually we may have opportunities for candidates from Taiwan and Japan)
  • This is a remote work opportunity

The opportunity

15-20 hours/week, flexible around your schedule. Work under direct mentorship of the company founder, an AI research engineer who has co-authored papers with Microsoft Research and OpenAI. Learn cutting-edge AI coding techniques while building something that advances both research and real language education.

The entire platform architecture is already rigorously implemented and tested - you'll be extending a production-quality system, not building from scratch. Jump straight into feature development and focus on the intellectually interesting problems.

How to apply

Send an email to contact@celetris.com with:

  • Brief introduction and what interests you about computational linguistics
  • Any relevant experience (academic projects, side projects, coursework)
  • Links to code samples if available (GitHub, personal projects, etc.)

We're looking for curiosity and potential, not perfect credentials. If you're excited about the intersection of language, technology, and research, we'd love to hear from you.