B
Machine Learning Researcher Engineer
BoldVoice (S21)
New York, NY, USFull-timemachine learning
About this role
About the role
Skills: Torch/PyTorch, Python, TensorFlow
đȘđŁ About BoldVoice
BoldVoice helps the 1 billion global non native English speakers speak English with clarity and confidence, so they can advance their careers and lives.
The app gives users instant pronunciation feedback from speech AI, and then teaches them how to improve with video lessons and training exercises, developed by Hollywood accent coaches.
Today, BoldVoice is one of the top Education apps on the App Store and serves non-native speakers of 100+ different language backgrounds all over the world.
đ» About the Role
As a Machine Learning Engineer / Researcher at BoldVoice, youâll play a critical role in driving the development and optimization of our AI systems. Your work will directly enhance the user experience by creating new machine learning-enabled capabilities, and improve the accuracy and efficacy of our existing machine learning systems. Specifically, youâll work on:
Model Development and Deployment
Designing, training, and fine-tuning machine learning models for AI coaching, pronunciation feedback, and accent detection. This will include working on LLMs, speech models like Wav2Wec2.0, and multi-modal models like speech to speech models.
Deploying these models into production environments for real-time and batch inference.
Pipeline Development and Optimization
Building reusable and organized data preprocessing pipelines for various data, including audio data, text data and more.
Setting up automated evaluation systems to monitor model performance.
Optimizing training workflows to reduce time-to-deployment.
You will be joining a top-notch machine learning team, who are striving to push forward whatâs possible in speech and audio AI, but also care about creating practical uses for their work. An example of our teamâs research can be found here: Accents in Latent Spaces. Our team is also behind the viral hit: BoldVoice Accent Oracle, which has been tried more than 50m times, by users
Requirements
You have at least 5 years of experience working on machine learning models in production environments, specifically training, fine-tuning, evaluating and directly implementing machine learning models, in the fields of Speech, NLP, and/or Vision
Experience in Automatic Speech Recognition (ASR) will be particularly useful, as will knowledge of phonetics and the ability to discern sounds and accents
Proficiency in Python and frameworks like TensorFlow, PyTorch, or similar.
Up to date with latest developments in using LLM tools like Claude Code, Cursor, Codex or similar to rapidly prototype, and ship code quickly
đ What we offer
You will be compensated in salary and generous stock options -- we want you to feel like an integral part of the success and growth of the company
Skills
Torch/PyTorchPythonTensorFlow