Experience: 3–10+ Years
Employment Type: Full Time / Contract
Job Description:
We are seeking a skilled Generative AI Engineer to design, build, and deploy GenAI solutions using Large Language Models (LLMs). The ideal candidate will work on AI-powered applications such as chatbots, copilots, document intelligence, and automation tools, collaborating closely with data science, product, and engineering teams.
Key Responsibilities:
- Design and develop GenAI applications using LLMs
- Design and optimize prompts and build RAG (Retrieval-Augmented Generation) pipelines
- Integrate GenAI models into web and enterprise applications
- Fine-tune and evaluate LLMs for performance and accuracy
- Work with structured and unstructured data (PDFs, documents, APIs)
- Implement AI safety, monitoring, and cost optimization strategies
- Collaborate with cross-functional teams to deliver AI solutions to production
Required Skills:
- Strong programming experience in Python (mandatory)
- Hands-on experience with LLMs (OpenAI, Azure OpenAI, Anthropic, Gemini, LLaMA, etc.)
- Experience with LangChain, LlamaIndex, or similar frameworks
- Knowledge of Prompt Engineering and RAG architectures
- Experience with Vector Databases (Pinecone, FAISS, Weaviate, Chroma, Milvus)
- Familiarity with REST APIs and microservices
- Experience deploying models on AWS / Azure / GCP
Nice to Have:
- Fine-tuning using LoRA / PEFT
- Experience with MLOps tools (MLflow, Kubeflow, CI/CD)
- Knowledge of NLP, embeddings, transformers
- Experience with Docker, Kubernetes
- Exposure to AI governance, security, and compliance
Education: