Job responsibilities:
- Responsibility for implementation and deployment of Agentic/ Gen AI frameworks at scale
- Strong in programming - Python and C++
- Previous experience of working on Computer Vision projects and VLM /VLAM models.
- Practical experience of working with Transformer Arch. and End to End Deep neural networks
- Full stack AI / ML development experience
- Design, build & maintain efficient and reliable Agentic / Generative AI code leveraging pipelines
- Hosting and deployment knowledge in GCP along with advanced engineering concepts to build user friendly UI interface for easy adoption.
Requirements:
- 8+ overall years of experience (Agentic AI, VLM, VLAM and LLM) with significant exposure in Development, Design, Architecture, scaling and hosting in cloud.
- Must Have –
- Solutioning experience with Python and FAST API, Agentic Ai frameworks, VLMs, VLAMs, Open source LLM’s and Code based LLM models at scale with - Langchain / Ollama, vector embeddings, Memory Management etc.,
- Practical experience in implementing Explainable and ethical AI models
- Practical experience in frameworks like RAG/ CAG/ Agentic RAG etc.,
- Experience in cloud hosting either AWS or Azure or GCP.
- Experience in ML-OPS - Implement a feedback mechanism to continually improve the model over time through feedback loop and monitoring end KPI’s in production.
- Experience with Quantization and Kubernetes or docker
- Good to have
-
-
- gRPC implementation to expose the API’s on a server for easy usage and good user interface
- Streamlit front end creation
- Experience with SAFe framework deliveries.
-
-
Candidate education background and experience:
- Degree in Computer science or AI/ML Engineering
- Strong in Programming concepts, frameworks, deployment, and customization of Agentic models, Vision models and Vision language action Models