Job Description:
- Have managed production infrastructure sites for front-end and back-end services
- Good knowledge of Linux internals and administration
- Experience in systems software development (Java, Go, Python, Bash, ...)
- Deep knowledge of infrastructure-as-code principles; knowledge of Terraform is a must
- Deep experience with AWS (Cloud Computing: EC2, S3, RDS, VPC, Security Groups, ELB, ElastiCache, Beanstalk, Redshift)
- Knowledge of SQL and NoSQL database administration
- Able to define actionable monitoring and alerting for systems
- On-call experience dealing with production incident management and resolution
- Cloud Expert: Well versed in AWS services for monitoring, logging, metrics, high availability, and automation
- Operationally Focused: Passionate about monitoring, resiliency, uptime, performance, and automation
- Effective Communication: Excellent listener; proven collaborator with superiors, peers, and staff
- Automation Driver: Constantly looks for automation opportunities
- Curious: Hands-on, "roll up your sleeves" collaborative style of working
- Passionate: Brings energy and enthusiasm to the job and organization
- Achiever: Consistently attains or exceeds individual and team goals
- Multitasker: Able to juggle multiple work items
- Enjoys problem solving: Able to find creative and reliable solutions to complex problems
- Defines Service Level Objectives (SLOs) and performs the work required to ensure we meet them
- Knowledge of networking and monitoring
- Strong communication skills, with an ability to relay incident details expeditiously, concisely, and accurately
- Proficient in leading remote online collaborative meetings, adhering to project management principles and documentation
- Strong organizational skills with an extremely high level of attention to detail
- Highly motivated, quality-conscious self-starter who requires little to no supervision and is able to own tasks from start to finish
- Customer focused: Investigates and resolves customer issues and inquiries (emergency and non-emergency)
- Identify, receive, triage, and act upon events and incidents coming from various SaaS services
- Consistently meets or exceeds established Command Center key performance indicators (KPIs)
- Work per escalation, notification, and incident practices
- Monitor the availability of the CI/CD environments
- Able to work under pressure in production environments running production customer workloads and services
- Previous knowledge of, or a strong desire to learn about, crisis management issues
- Able to work with geographically dispersed teams as part of a worldwide operations team

Minimum Requirement:
- 5+ years of AWS Cloud experience and 8+ years in IT
- Excellent written and verbal communication skills
- Experience implementing fault detection and automating fixes
- Experience designing scalable services
- Experience designing distributed, fault-tolerant systems
- Experience with microservices architecture
- Experience managing services in AWS
- At least one AWS certification (Associate level)

Regards,
Imran Ashraf Khan | Account Manager
KK Associates LLC.
8751 Collin McKinney Pkwy, # 1302, McKinney, TX 75070
555 Metro Place North, Suite # 100, Dublin, OH 43017
Direct: Email -