Job Description:
Senior Python Developer
3 months, likely extension thereafter
McLean VA

As a Principal Site Reliability Engineer on the Cyber ML team, you will be tasked with building and operating petabyte-scale, distributed, fault-tolerant systems that are essential to Capital One's cyber defense capabilities. You will also be the driving force behind building an SRE culture of analytical problem solving, continuous process improvement, and openness. You will be part of an agile, dedicated SRE team that will be responsible for ensuring that customers and employees have fast, reliable access to production applications.

Who You Are
You have a solid background in operations and software engineering.
You are interested in working on challenging problems involving scalability and performance.
You can effectively collaborate with other teams to work on high-profile initiatives.
You enjoy learning new technologies and picking up new skills.
You are interested in and proficient at automating tasks, deployments, monitoring, and testing.

What The Role Is
Participate in architecture design and review, capacity planning, launch planning, and other activities prior to an application going live.
Maintain applications after they launch to production by monitoring availability, latency, and application health.
Scale up applications and modify application architecture to meet the evolving needs of the customer.
Conduct blameless postmortems and retrospectives as part of continuous process improvement.

Basic Qualifications
B.S. in Computer Science or related technical discipline.
3+ years of professional programming experience in Java, Scala, Python, C++, or Golang.
3+ years of professional programming experience in scripting languages such as Shell, Python, or Perl.
3+ years of professional experience working with automation frameworks (Ansible, Puppet, SaltStack, CloudFormation, Terraform).
3+ years of experience working with Linux-based OSes (Red Hat preferred).
3+ years of experience working within cloud environments (AWS preferred).
Experience working with monitoring applications (ELK stack, TICK stack, Prometheus, Graphite, Grafana).
Experience working with CI/CD tools (Jenkins and Artifactory preferred).

Nice to Have
M.S. or Ph.D. in Computer Science or related technical discipline.
Experience working with Elasticsearch and Lucene-based search.
Experience working with Snowflake data warehouse.
Experience working with Spark or Flink.
Experience working with container runtimes (Docker, rkt, cri-o, etc.).
Experience working with container frameworks (Kubernetes, Mesosphere, etc.).
Experience with building Machine Learning (ML) applications or implementing ML algorithms.


Client: AT&T

             
