Job Description :
  • Automate or streamline manual tasks and redundancies within the infrastructure organization
  • Identify creative ways to break the system, uncover and report nonfunctional defects, as well as validate systems/solutions are operating as intended.
  • Leads end-to-end availability, security and performance of mission-critical applications and services that are part of the Digital systems
  • Analyze technical issues and identify the root cause and provide fix in production environment. (Never solve the same problem twice)
  • Partners with multiple internal teams to groom the nonfunctional requirements and work on implementations
  • Implement best SRE practices to ensure availability/reliability and fault tolerance and wherever applicable
  • Be the SRE ambassador on an Agile software development team.
  • Drive product reliability improvements through monitoring, alerting, and application of software development best practices.
  • Perform proof of concepts to proof new technologies and integrations
  • Able to work fast and reliably under pressure
  • A strong critical thinker who identifies problems before they happen
  • Troubleshoot performance and stability issues using a wide variety of tools
  • Evaluate and manage application and environment security
  • Share off hours on call with team for any production issues



  • Familiarity with object-oriented programming languages and concepts and hands on experience in Java applications (spring boot services)
  • Hands on experience with any cloud service concepts, preferably AWS
  • Hands on experience with SRE practices and writing, running Chaos engineering experiments
  • Knowledge on HCL commerce and IBM sterling platforms is advantageous
  • A strong critical thinker who identifies problems before they happen
  • Strong written and oral communication skills with a high degree of comfort speaking with engineering management, developers, and leadership
  • Demonstrated ability to adapt to new technologies and learn quickly

Must have ???
??         JAVA experience ( preferrable with microservice)
??         Shell script experience
??         Observability experience using any APM tools
??         Experience in analyzing production issues using various APM tools, enabling logging, understanding the application technical flow etc
??         Cloud experience
Good to have ???
??         Retail ecommerce domain experience
??         AWS experience
??         Previous support experience with Java background

             

Similar Jobs you may be interested in ..