Job Description:

We have an opening for a Senior Systems Engineer (ELK). Let me know if you are interested.

Location - Remote
Duration - 4 Months
Rate - $35-45/hr W2

Responsibilities and Day to Day Duties

Build and operate large-scale ELK clusters deployed across multiple cloud providers
Provide guidance to others on creating new and operating existing ELK and Kubernetes clusters
Drive initiatives to evolve our current platform, increasing efficiency and keeping it in line with current standards and best practices
Manage and onboard users to Elastic APM, OpenTelemetry, and OpenTracing
Roll out GitOps to manage many Elasticsearch clusters, ensuring zero downtime and high availability during upgrades and maintenance activities
Publish automated reports on cluster operations to leadership
Perform incident/alert troubleshooting and problem analysis, and provide high-quality solutions to technical issues
Provide on-call support for issues in the production environment
Work with a geographically dispersed team
Mentor junior engineers on technical, architectural, design, and related issues
Drive successful POCs involving the latest cloud-native technologies

You are an ideal candidate if you:

Have experience with logging and telemetry services, specifically ELK, Jaeger, Zipkin, Grafana, Prometheus
Have experience with at least one of the cloud providers AWS, Azure, or GCP
Are able to code to a good standard in a programming language such as Python, Go, or Ruby
Have experience writing infrastructure as code using tools such as Terraform or CloudFormation
Have a solid understanding of configuration management principles and tools such as Chef
Are comfortable supporting and operating high-availability cloud services and providing on-call support for Sev1 incidents on production and critical development/QA cloud environments
Understand CI/CD principles, Linux fundamentals, networking concepts, and IP protocols
Are self-motivated with strong problem-solving and troubleshooting skills


BS degree in Computer Science or equivalent
5+ years of systems administration, enterprise software development, or operations
3+ years of experience managing cloud operations environments at scale in production
3+ years of experience in the full implementation, building, and management of Elasticsearch and Logstash clusters
2+ years of experience working with real-time data streaming tools such as Kafka or Kinesis
Experience in building and operating Kubernetes clusters
Experience working with container runtimes such as Docker, containerd, or CRI-O
Experience building dashboards in Kibana and Grafana
Experience in instrumenting Distributed Tracing with Jaeger, Zipkin or Elastic APM
Experience with production-level monitoring and alerting using tools such as Prometheus and Grafana
Strong knowledge of a scripting language such as Python, Shell, or Perl
Experience with automated cloud deployment tools such as Chef, Ansible, Spinnaker, and Puppet is a plus
Prior experience in performing or participating in compliance and security audits is a plus
Prior experience defining, configuring, and implementing disaster recovery processes
Ability to perform data-related benchmarking, performance analysis, and tuning
Strong interpersonal and team communication skills
Experience with Agile practices and with project management and workflow tools such as Jira and ServiceDesk