Companies Oddin Research Scientist, Text-to-Speech

About the role

Oddin
 
About Valka.ai
 
Valka, a visionary spin-off from the Realms Group (the parent company of Oddin.gg), is on a mission to revolutionize the way people create and experience digital content.
 
Our team believes that content shouldn’t just be consumed; it should be co-created in real time, blurring the lines between imagination and reality. By harnessing the power of cutting-edge AI, we aim to build an interactive human-digital platform where virtual characters respond dynamically to each user’s voice, text, gestures, and more.
 
This is your chance to join a diverse group of innovators who are driven to redefine what’s possible in generative content. Together, we’re changing the paradigm from passive viewing to active participation, unlocking new creative frontiers across gaming, entertainment, education, and beyond.

What you will be doing

  • Research and train fast and quality SOTA TTS models for realistic and emotional voice generation for entertainment and education applications.
  • You will be experimenting with different architectures / data to improve the quality and speed of the TTS model(s) and put the best results to production.
  • Staying up to date with current research and coming up with new ideas / what to improve is very important for us!
  • You will be in immediate collaboration with a team of 3 researchers specializing in TTS, and the product is supported by engineering and hardware stuff to ensure deployment
  • Skills you need

  • Experience with training some text-to-speech / voice cloning models
  • Solid knowledge of transformers, diffusion models, GANs
  • Understanding of human speech and audio processing (sampling, spectrograms, vocoders)
  • Proficiency in Python and key libraries (e.g., PyTorch, Hugging Face Transformers).
  • Ability to keep up to date with research, understand papers, implement approaches; strong ML fundamentals and critical thinking
  •  
    Nice-to-have:
     
  • Familiarity with modern speech synthesis models (GPT-based, flow matching… such as Vevo, StyleTTS, IndexTTS, Maskgct etc.)
  • Contributions to open-source AI tools or research publications in Speech processing field
  • Familiarity with AWS / similar clusters
  • Ready to apply to Oddin?
    Apply to Oddin

    Similar jobs

    Sign up for suggestions tailored to the jobs you open and the searches you save.

    Apply now
    🤖

    Whoa — hold up

    JobsRadar was built for real people having a rough time in their job search — not for automated requests. You're clicking way too fast and you're now temporarily blocked.

    Come back later. If you're genuinely job hunting, we've got your back — just act like a human.

    Catch your next role the second it’s posted.

    Create a free account and we’ll watch the boards for you — the instant a job matches your search, it lands in your inbox or Telegram. No digging, no refreshing.

    Create free account

    Free forever · takes 30 seconds · already have one?

    Get the worldwide-remote edge.

    Join our Telegram channel for the stuff that helps you land the role — salary benchmarks, the weekly market pulse, and new-feature drops. No spam, just signal.

    Join the channel — it's free