Current Vacancies

Current Vacancies

Speech Algorithm Engineer (Top-tier AI research institute/AI/ML Optimization)

语音算法工程师


Job Responsibilities

1. Research, develop, and deploy cutting-edge algorithms for speech recognition (ASR) and speech synthesis (TTS);

2. Optimize end-to-end voice interaction systems to achieve industry-leading performance in naturalness, fluency, and user experience;

3. Build model fine-tuning toolchains tailored for healthcare scenarios to enhance domain-specific adaptability of general-purpose models;

4. Stay updated with the latest advancements in speech technology and ensure deployed models remain state-of-the-art.


Requirements

1. Master's degree in Computer Science, Electronics, Automation, or a related field, with 2+ years of hands-on experience in speech algorithm development;

2. Proficient in applying and fine-tuning mainstream ASR/TTS models (e.g., Whisper, SenseVoice, CosyVoice);

3. Hands-on experience in developing offline and streaming voice interaction services, with a proven track record of real-world deployment;

4. Strong programming skills in Python, PyTorch, and C++, with expertise in deep learning frameworks;

5. Publication experience in top-tier conferences/journals (e.g., ICASSP, Interspeech) is a strong plus;

6. Excellent communication skills and a collaborative spirit, with the ability to drive projects proactively.


Preferred Qualifications

1. Experience in healthcare or domain-specific speech applications;

2. Knowledge of low-latency, real-time speech processing techniques;

3. Familiarity with multilingual or accented speech recognition challenges;

4. Contributions to open-source speech projects or relevant patents.


Application Method 

Please send your resume to hr02@cair-cas.org.hk. The subject of the email should be marked as Application for [Speech Algorithm Engineer]-[Name].


岗位职责


1. 负责语音识别、语音合成等算法的研发与落地;

2. 优化语音交互全链条体验,自然度和流畅度等指标追平业界先进水平;

3. 针对医疗场景,构建模型微调工具链,提升通用模型在垂直场景的适应性;

4. 持续跟进前沿技术发展趋势,保持线上模型的先进性。


职位要求


1. 计算机、电子、自动化等相关专业硕士,具备2年以上语音相关开发经验;

2. 熟悉当前主流ASR、TTS等模型的应用和微调,如:Whisper、SenseVoice、CosyVoice等;

3. 具备离线和流式语音交互服务开发能力,有相关项目落地经验;

4. 熟练掌握Python、PyTorch、C++等编程语言和深度学习框架;

5. 在相关国际会议或期刊上发表论文着优先,如:ICASSP、InterSpeech等;

6. 具备良好的沟通能力和团队协作精神,能主动推动项目落地。


优先考虑


1. 拥有医疗健康领域或特定领域语音应用的相关经验;

2. 了解低延迟、实时语音处理技术;

3. 熟悉多语言或带口音的语音识别问题;

4. 曾为开源语音项目做出贡献或拥有相关专利。


申请方式


请将简历发送至 hr02@cair-cas.org.hk 邮件主题请注明应聘 [语音算法工程师]-[姓名]。