Speech Algorithm Engineer (Top-tier AI research institute/AI/ML Optimization)
Speech Algorithm Engineer (Top-tier AI research institute/AI/ML Optimization)
语音算法工程师
Job Responsibilities
1. Research, develop, and deploy cutting-edge algorithms for speech recognition (ASR) and speech synthesis (TTS);
2. Optimize end-to-end voice interaction systems to achieve industry-leading performance in naturalness, fluency, and user experience;
3. Build model fine-tuning toolchains tailored for healthcare scenarios to enhance domain-specific adaptability of general-purpose models;
4. Stay updated with the latest advancements in speech technology and ensure deployed models remain state-of-the-art.
Requirements
1. Master's degree in Computer Science, Electronics, Automation, or a related field, with 2+ years of hands-on experience in speech algorithm development;
2. Proficient in applying and fine-tuning mainstream ASR/TTS models (e.g., Whisper, SenseVoice, CosyVoice);
3. Hands-on experience in developing offline and streaming voice interaction services, with a proven track record of real-world deployment;
4. Strong programming skills in Python, PyTorch, and C++, with expertise in deep learning frameworks;
5. Publication experience in top-tier conferences/journals (e.g., ICASSP, Interspeech) is a strong plus;
6. Excellent communication skills and a collaborative spirit, with the ability to drive projects proactively.
Preferred Qualifications
1. Experience in healthcare or domain-specific speech applications;
2. Knowledge of low-latency, real-time speech processing techniques;
3. Familiarity with multilingual or accented speech recognition challenges;
4. Contributions to open-source speech projects or relevant patents.
Application Method
Please send your resume to hr02@cair-cas.org.hk. The subject of the email should be marked as Application for [Speech Algorithm Engineer]-[Name].
岗位职责
1. 负责语音识别、语音合成等算法的研发与落地;
2. 优化语音交互全链条体验,自然度和流畅度等指标追平业界先进水平;
3. 针对医疗场景,构建模型微调工具链,提升通用模型在垂直场景的适应性;
4. 持续跟进前沿技术发展趋势,保持线上模型的先进性。
职位要求
1. 计算机、电子、自动化等相关专业硕士,具备2年以上语音相关开发经验;
2. 熟悉当前主流ASR、TTS等模型的应用和微调,如:Whisper、SenseVoice、CosyVoice等;
3. 具备离线和流式语音交互服务开发能力,有相关项目落地经验;
4. 熟练掌握Python、PyTorch、C++等编程语言和深度学习框架;
5. 在相关国际会议或期刊上发表论文着优先,如:ICASSP、InterSpeech等;
6. 具备良好的沟通能力和团队协作精神,能主动推动项目落地。
优先考虑
1. 拥有医疗健康领域或特定领域语音应用的相关经验;
2. 了解低延迟、实时语音处理技术;
3. 熟悉多语言或带口音的语音识别问题;
4. 曾为开源语音项目做出贡献或拥有相关专利。
申请方式
请将简历发送至 hr02@cair-cas.org.hk。 邮件主题请注明应聘 [语音算法工程师]-[姓名]。