Working hours: Full-time – Monday to Friday, with alternate Saturdays
Work arrangement: Onsite at Vinhomes Ocean Park (Gia Lâm) for the first 2 weeks, then remote
We are seeking a Mid-level AI Engineer to contribute to the development of the AI Agent brain for the next-generation robotic assistants. This role focuses on the end-to-end development lifecycle of speech-to-text and text-to-speech (STT) systems and conversational agent integration, ensuring high-quality bilingual human–robot interaction in real-world applications
I. Key responsibilities
- Fine-tune Vietnamese and English STT/TTS models to support bilingual conversational capabilities.
- Develop and enrich audio datasets to improve speech recognition accuracy.
- Implement and optimize RAG systems for real-time information retrieval (News, Websearch, Weather) with LLM reasoning.
- Optimize model inference and 0.5s response latency target to ensure smooth, natural interactions on edge devices.
II. Requirements
- Bachelor’s degree in Computer Science, AI, or related technical fields.
- At least 3+ years of experience in full-stack development with a focus on developing AI-driven applications and features.
- Proven track record in speech processing, including audio data collection, enrichment, and filtering in large-scale technology ecosystems.
- Hands-on experience building and deploying LLM-based features and architectures.
- Proficiency in Python and modern frameworks such as FastAPI, Django, and Flask.
- Experience with containerization and orchestration tools, including Docker and Kubernetes.
- Preferred Qualifications – Nice to Have: Familiarity with edge deployment and performance tuning for production-grade AI applications.
- Proactive, responsible, and committed to high-quality AI solutions.
- Strong collaborative approach and clear communication to work with cross-functional AI and engineering teams
III. Benefit
- Salary range: 32-38M
- Support from PM and teammates
— Kindly submit your application using this form: