Job Description :

AI Architect

Location: Plano, TX (Onsite)

Duration: 12+ Months Contract
 
Candidates who can work independently are more preferred
 
Job Description:
Looking for an AI Architect with strong AWS experience to design cloud-native AI/ML architectures, build end-to-end machine learning pipelines, and lead PoCs for emerging AI capabilities such as GenAI and agentic AI. The architect will establish reference architectures, drive cloud best practices, implement monitoring frameworks, and support model tuning efforts such as RLHF and fine-tuning.
Onsite role in Plano, TX.
 
Key Responsibilities:
  • Define cloud-native design principles and reference architectures for AI workloads
  • Architect secure, scalable, and cost-effective AWS-based AI solutions
  • Lead design and implementation of ML pipelines (data ingestion deployment)
  • Run PoCs for emerging AI capabilities (GenAI, Agentic AI, etc.)
  • Implement model tuning/evaluation (fine-tuning, RLHF)
  • Build monitoring & observability frameworks (Prometheus, Grafana, CloudWatch)
Must-Have Skills:
10+ years in software/solution architecture (3+ years in AI/ML)
Strong AWS experience (SageMaker, Bedrock, Lambda, EKS, Glue)
Expertise in Python/Java and ML frameworks (PyTorch, TensorFlow, HuggingFace)
Hands-on with Generative AI, LLMs, RAG architectures, vector databases (Pinecone, Cosmos, OpenSearch)
Experience designing ML pipelines end-to-end
Experience with AI/ML Ops, CI/CD, containerization (Docker, K8s)
Strong cloud-native architecture design
Knowledge of model tuning/evaluation (Fine-tuning, RLHF)
Experience setting up observability: CloudWatch, Prometheus, Grafana, Open Telemetry
Top Skills:
AI Architect, AWS SageMaker, AWS Bedrock, Machine Learning Pipelines, Generative AI, LLM, RAG, RLHF, Fine-Tuning, TensorFlow, PyTorch, Hugging Face, Vector Databases, Pinecone, EKS, Kubernetes, Cloud-Native Architecture, ML Ops, CI/CD, AWS Lambda, AI Workloads, Prometheus, Grafana, Open Telemetry.
             

Similar Jobs you may be interested in ..