OPEN POSITION
OPEN POSITION
MLE (Speech Tech, TTS)
MLE (Speech Tech, TTS)
With several major projects already in progress, we are looking for a ML Engineer (Speech Tech, TTS).
With several major projects already in progress, we are looking for a ML Engineer (Speech Tech, TTS).
Apply now
About us
About us
Aiphoria is an innovative startup specializing in machine learning and AI solutions.
We are developing a cutting-edge platform that enables virtual employees for medium and large enterprises across various industries. Our solution leverages advanced conversational AI, large language models (LLMs), and natural language and speech processing technologies.
Our team has extensive experience in generative AI (GenAI) and specialized expertise in machine learning and speech technologies. We have previously developed voice assistants like Alexa and Siri for local markets and created our own GPT technology prior to the release of ChatGPT.
With several major projects already in progress, we are looking to expand our team. Currently, we are seeking talented candidates for the position of ML Engineer (Speech Tech, TTS).
Aiphoria is an innovative startup specializing in machine learning and AI solutions.
We are developing a cutting-edge platform that enables virtual employees for medium and large enterprises across various industries. Our solution leverages advanced conversational AI, large language models (LLMs), and natural language and speech processing technologies.
Our team has extensive experience in generative AI (GenAI) and specialized expertise in machine learning and speech technologies. We have previously developed voice assistants like Alexa and Siri for local markets and created our own GPT technology prior to the release of ChatGPT.
With several major projects already in progress, we are looking to expand our team. Currently, we are seeking talented candidates for the position of ML Engineer (Speech Tech, TTS).
Responsibilities
Responsibilities
Design and optimize TTS models to ensure our voice assistant sounds as natural and accurate as possible.
Collaborate closely with product managers and engineers to integrate TTS tech, making it seamless and intuitive for users.
Partner with data teams to build efficient audio data pipelines, from preprocessing to model training.
Design and optimize TTS models to ensure our voice assistant sounds as natural and accurate as possible.
Collaborate closely with product managers and engineers to integrate TTS tech, making it seamless and intuitive for users.
Partner with data teams to build efficient audio data pipelines, from preprocessing to model training.
Regularly update and refine TTS models to adapt to various accents, dialects, and speech styles, enhancing user satisfaction and responsiveness.
Keep up-to-date with the latest TTS advancements, bringing in innovative techniques and tools to keep us at the forefront of voice-assisted banking.
Rigorously test and validate models to meet strict standards.
Regularly update and refine TTS models to adapt to various accents, dialects, and speech styles, enhancing user satisfaction and responsiveness.
Keep up-to-date with the latest TTS advancements, bringing in innovative techniques and tools to keep us at the forefront of voice-assisted banking.
Rigorously test and validate models to meet strict standards.
Expectations
Expectations
Proficiency in Python and deep learning frameworks (especially, PyTorch).
Strong understanding of speech synthesis processing techniques.
Experience with Fast Attention-Based Models: (FastPitch, FastSpeech 2) and modern variative approaches: (e.g., VITS, Glow-TTS).
Strong understanding of techniques to control prosody, rhythm, and emotional tone for expressive speech synthesis.
Familiarity with alignment methods such as MFA, MAS, or similar.
Knowledge of normalization techniques and FSTs.
Familiarity with TTS evaluation techniques, including MOS and A/B testing.
Familiarity with vocoder models (e.g. Vocos, HiFi-GAN) for audio generation with different sampling rates.
Proficiency in Python and deep learning frameworks (especially, PyTorch).
Strong understanding of speech synthesis processing techniques.
Experience with Fast Attention-Based Models: (FastPitch, FastSpeech 2) and modern variative approaches: (e.g., VITS, Glow-TTS).
Strong understanding of techniques to control prosody, rhythm, and emotional tone for expressive speech synthesis.
Familiarity with alignment methods such as MFA, MAS, or similar.
Knowledge of normalization techniques and FSTs.
Familiarity with TTS evaluation techniques, including MOS and A/B testing.
Familiarity with vocoder models (e.g. Vocos, HiFi-GAN) for audio generation with different sampling rates.
Knowledge of signal processing, statistical modeling, and language structure.
Experience working with low-resource languages, including dataset creation, augmentation techniques, and model adaptation for low-data scenarios
MLOps experience, including MLFlow or similar experiment tracking systems.
Familiarity with low-latency audio streaming and optimization techniques for deploying efficient real-time processing systems.
Proficiency in model inference, deploying APIs, and building ML backends, with experience using Triton for efficient model serving.
Nice to have: experience with classic autoregressive models (e.g., Tacotron, Tacotron 2).
Knowledge of signal processing, statistical modeling, and language structure.
Experience working with low-resource languages, including dataset creation, augmentation techniques, and model adaptation for low-data scenarios
MLOps experience, including MLFlow or similar experiment tracking systems.
Familiarity with low-latency audio streaming and optimization techniques for deploying efficient real-time processing systems.
Proficiency in model inference, deploying APIs, and building ML backends, with experience using Triton for efficient model serving.
Nice to have: experience with classic autoregressive models (e.g., Tacotron, Tacotron 2).
What we offer
What we offer
Experienced team, Aiphoria is formed by a team of enthusiastic professionals who created award-winning devices, voice assistants and other AI-driven products for BigTech corporations.
Cutting-edge technologies, we build a technology using our areas of expertise including Computer Vision, Speech Technologies, Natural Language Understanding, Generative AI incl. LLM and Diffusion models.
Rapid career progression, facilitated by our team of seasoned senior professionals who hail from prestigious, industry-leading companies.
Remote work opportunities from anywhere globally or support for relocation to Cyprus.
Company has prominent clients with an opportunity for you to work on different projects and/or to be involved in developing our proprietary own products.
Competitive compensation in EUR/USD, surpassing market standards.
A company with entrepreneurial spirit. We offer a unique mix of a secure workspace thanks to the big clients raised along with a true start-up culture!
Experienced team, Aiphoria is formed by a team of enthusiastic professionals who created award-winning devices, voice assistants and other AI-driven products for BigTech corporations.
Cutting-edge technologies, we build a technology using our areas of expertise including Computer Vision, Speech Technologies, Natural Language Understanding, Generative AI incl. LLM and Diffusion models.
Rapid career progression, facilitated by our team of seasoned senior professionals who hail from prestigious, industry-leading companies.
Remote work opportunities from anywhere globally or support for relocation to Cyprus.
Company has prominent clients with an opportunity for you to work on different projects and/or to be involved in developing our proprietary own products.
Competitive compensation in EUR/USD, surpassing market standards.
A company with entrepreneurial spirit. We offer a unique mix of a secure workspace thanks to the big clients raised along with a true start-up culture!