Job Description
xAI is seeking an AI Engineer to work directly with enterprise customers, focusing on Vision or Video Understanding integrations. The ideal candidate will act as a specialized AI startup CTO, leading high-stakes projects and delivering measurable impact in the Vision domain. This role requires a combination of deep technical expertise and customer-focused innovation.
Responsibilities: - Designing and building end-to-end AI solutions, from understanding customer pain points to deploying VLM-powered vision interfaces.
- Benchmarking vision models and analyzing performance to identify weaknesses in image recognition, object detection, or visual understanding.
- Improving model performance through system prompt tuning and fine-tuning VLMs.
- Working with multimodal teams to generate data for research efforts.
- Building internal tools to automate VLM workflows, such as image processing pipelines or real-time visual analysis.
- Defining critical benchmarks for Vision or Video Understanding performance.
- Initiating human data collection.
- Driving Vision model integration with enterprise partners.
Requirements: - Strong engineering background.
- Experience interfacing between technical and customer-facing teams.
- Excellent verbal and written communication skills in English.
- Ability to translate business and vision-specific product needs into technical solutions.
- Proven experience implementing VLM or machine learning products, including APIs, back-end, and front-end vision interfaces.
- Strong proficiency in Python and/or TypeScript.
- Solid understanding of HTTP protocol and real-time communication protocols (e.g., WebRTC for video streaming).
Benefits: - Competitive cash-based compensation
- xAI equity
- Private health and dental insurance
- Unlimited time off subject to prior approval