Role: –Data Science
Bill Rate: $78/hour C2C
Location: Atlanta, GA
Duration: 12+ months/ long-term
Interview Criteria: Telephonic + Zoom
Direct Client Requirement
Role: Data Science
Job description:
We’re seeking a Junior Data Scientist with hands-on experience in agentic AI systems, large language models (LLMs), and transformer-based architectures. As a member of our Digital Solutions team, you’ll contribute to building and optimizing intelligent systems that reason, adapt, and act autonomously. This is a dynamic role suited for candidates who are eager to innovate with state-of-the-art language models and vector-based search technologies.
Key Responsibilities
- Design and optimize prompts for task-specific performance across Claude, GPT, LLaMA, and open-source LLMs.
- Work on retrieval-augmented generation (RAG) pipelines leveraging vector search (e.g., Pinecone, Weaviate, FAISS, Chroma, pgvector etc).
- Convert unstructured data (e.g., PDFs, scanned docs, images) into structured formats using OCR, document parsing, document classification, and document-centric Vision LLMs.
- Build and maintain agentic AI workflows using frameworks like LangChain, LangGraph, or AutoGen.
- Develop autonomous agent systems capable of multi-step reasoning and execution.
- Fine-tune foundation models (e.g., LLaMA, BERT, GPT, Mistral) using Hugging Face Transformers, OpenAI APIs, or LangChain.
- Apply transformer architectures and embedding techniques to domain-specific problems.
- Collaborate with cross-functional teams including senior ML engineers and product managers to deliver scalable GenAI solutions.
- Stay up-to-date with advancements in LLM fine-tuning, RAG strategies, and autonomous agent research.
Required Qualifications
- Bachelor’s or Master’s degree in Computer Science, Data Science, Machine Learning, or a related technical field.
- 2–5 years of experience in AI/ML Applications, GenAI development, or LLM-focused roles.
- Hands-on experience with prompt engineering and LLM-based task chaining.
- Experience extracting structured data from unstructured documents using popular Python libraries such as pytesseract, EasyOCR, Doctr, vision-parse, LayoutLM, Donut.
- Proficiency in Python, including ML libraries like PyTorch, Hugging Face Transformers, Pandas, Scikit-learn.
- Working knowledge of agentic AI frameworks (LangChain, AutoGen, LangGraph or CrewAI).
- Experience querying and integrating vector databases (e.g., Pinecone, Weaviate) for semantic search and RAG.
- Strong foundation in data analysis and ability to extract useful insights to guide from LLMs.
Nice to Have
- Experience working with Palantir Foundry or AIP platforms.
- Understanding of prompt engineering and instruction optimization.
- Experience integrating LLMs into production pipelines.
- Exposure to reinforcement learning or self-improving AI agents.
- Contributions to open-source LLM/AI projects.
Note: If you are interested, please share your updated resume and suggest the best number & time to connect with you. If your resume is shortlisted, one of our IT Recruiter from my team will contact you as soon as possible
Srinivasa Reddy Kandi
Client Delivery Manager
Valiant Technologies LLC
Equal Opportunity Employer:
We are an equal opportunity employer. All aspects of employment including the decision to hire, promote, discipline, or discharge, will be based on merit, competence, performance, and business needs. We do not discriminate based on race, color, religion, marital status, age, national origin, ancestry, physical or mental disability, medical condition, pregnancy, genetic information, gender, sexual orientation, gender identity or expression, national origin, citizenship/ immigration status, veteran status, or any other status protected under federal, state, or local law