AI Engineer (Computer Vision - Stable Diffusion)
About the Job
Job Title: AI Engineer (Computer Vision - Stable Diffusion)
Location: Gurgaon, India (Hybrid Model)
Company: Leading USA-based AI SaaS Product Company
Development Centers: USA (HQ), Canada, and Gurgaon, India
About the Role:
Are you a Computer Vision Engineer passionate about shaping the future of AI-driven solutions? Join a trailblazing team that is transforming the automotive and virtual showroom industries with AI-powered, real-time innovations. In this role, you’ll work with a global team of experts and leverage cutting-edge technology to solve complex problems and deliver exceptional, high-performance applications.
Key Responsibilities:
- Generative AI Models: Design, fine-tune, and deploy state-of-the-art models, focusing on Stable Diffusion, GANs, and img2img solutions for applications including data generation, image inpainting, and floor reflection creation.
- 3D Virtual Showroom Development: Collaborate on a 3D virtual car showroom project, using Gaussian Splatting, PlayCanvas (JavaScript), and CGAL (C++) for optimized 3D rendering and meshing.
- Computer Vision & Image Processing: Implement advanced computer vision techniques using OpenCV and custom algorithms for image and video analysis, pattern recognition, and real-time object detection.
- Machine Learning & Optimization: Utilize ML models for multimodal customer interaction platforms, specifically for the automotive industry, with a focus on conversation-driven customer conversion.
- Performance Engineering: Optimize the 3D Gaussian Splatting pipeline with custom Triton kernels to bring end-to-end processing time under 20 minutes.
- API & System Integration: Develop and optimize APIs for end-to-end image processing workflows built on AWS SQS, Celery, FastAPI, and OneFlow, targeting 3-second request times for complex inpainting tasks.
- Prototyping & POC Development: Lead proofs of concept (POCs) for client needs, focusing on CLI tools, custom scraping techniques, and data handling for high-efficiency processes.
Key Skills & Qualifications:
- Experience: 3+ years in computer vision, image processing, and ML-based application development.
- Computer Vision Expertise: Proficient in OpenCV and other vision libraries; experience with real-time image and video processing is essential.
- Generative AI: Hands-on experience with generative models like Stable Diffusion, GANs, pix2pix, and SMAPGAN; ability to fine-tune models for specific tasks and deploy them efficiently.
- Programming Skills: Advanced proficiency in Python and C++ is required; familiarity with Triton, FastAPI, AWS SQS, Celery, and OneFlow frameworks is advantageous.
- 3D & Virtual Environments: Familiar with 3D rendering technologies like Gaussian Splatting and meshing tools like CGAL; experience with PlayCanvas (JavaScript) for virtual showroom development is a plus.
- Performance Optimization: Demonstrated experience optimizing processing pipelines and reducing runtimes significantly; ability to write custom kernels and optimize GPU performance.
Why Join Us?
- Innovative Culture: Work at the forefront of AI technology with some of the brightest minds in the field, contributing to products that are transforming the automotive and virtual experience sectors.
- Global Exposure: Collaborate with teams across the USA and Canada in a hybrid model that allows you to enjoy both remote flexibility and in-office synergy.
- Career Growth: Thrive in a fast-growing, AI-focused environment with ample opportunities for skill enhancement, project leadership, and career progression.
About the company
Industry: Human Resources Services
Company Size: 2-10 Employees
Headquarters: Noida