Job Description :
Job Title :Lead Site reliability Engineer

Category :Functional



Client Company :MasterCard

Zip Code :63368

Duration : 12 months

City :Ofallon

State :MO

Mode of Interview : Skype for Non Local and F2F for locals


Job Description :


· What is the name of your group and what is your team's main responsibility?

Biz Ops - Network of the Future - Primary replacing old tech with new tech (spring boot, PCF based applications, cloud, Microsoft azure.

· What is the culture of your team?

Looking for someone who can do production support - Only during STL hours, occasionally may have to call TRT (when there's an outage

· What are your top 3 required technical skills?

Lead - Site Reliability Engineer - Production support capability with production support

Spring Boot

Dynatrace/Splunk

Senior - Dev Ops Engineer - Responsible for Jenkins pipeline, dev ops role.

· What will be your interview process? (Phone, skype, on site)

1st Round - 1hour phone call w/ tech leads and architect

2nd Round - If local (Preference In person, if remote (want to do Skype)

The Network of the Future BizOps team is looking for a Site Reliability Engineer who can help us solve problems, build our CI/CD pipelines and lead Mastercard in DevOps automation and best practices.

· Are you a born problem solver who loves to figure out how something works?

· Are you a CI/CD geek who loves all things automation?

· Do you have a low tolerance for manual work and look to automate everything you can?


Business Operations is leading the DevOps transformation at Mastercard through our tooling and by being an advocate for change & standards throughout the development, quality, release, and product organizations. We need team members with an appetite for change and pushing the boundaries of what can be done with automation. Experience in working across development, operations, and product teams to prioritize needs and to build relationships is a must.

The role of business operations is to be the production readiness steward for the platform. This is accomplished by closely partnering with developers to design, build, implement, and support technology services. A business operations engineer will ensure operational criteria like system availability, capacity, performance, monitoring, self-healing, and deployment automation are implemented throughout the delivery process. Business Operations plays a key role in leading the DevOps transformation at Mastercard through our tooling and by being an advocate for change and standards throughout the development, quality, release, and product organizations.

We accomplish this transformation through supporting daily operations with a hyper focus on triage and then root cause by understanding the business impact of our products. The goal of every biz ops team is to shift left to be more proactive and upfront in the development process, and to proactively manage production and change activities to maximize customer experience, and increase the overall value of supported applications. Biz Ops teams also focus on risk management by tying all our activities together with an overarching responsibility for compliance and risk mitigation across all our environments. A biz ops focus is also on streamlining and standardizing traditional application specific support activities and centralizing points of interaction for both internal and external partners by communicating effectively with all key stakeholders.

Ultimately, the role of biz ops is to align Product and Customer Focused priorities with Operational needs. We regularly review our run state not only from an internal perspective, but also understanding and providing the feedback loop to our development partners on how we can improve the customer experience of our applications.

Engage in and improve the whole lifecycle of services-from inception and design, through deployment, operation and refinement.

· Support deployments of code into multiple lower environments. Supporting current processes needed with an emphasis on automating everything as soon as possible.

· Design and implement a Git based code management strategy that will support multiple environment deployments in parallel. Experience with automation for branch management, code promotions, and version management is a plus.

· Analyze ITSM activities of the platform and provide feedback loop to development teams on operational gaps or resiliency concerns

· Support services before they go live through activities such as system design consulting, capacity planning and launch reviews.

· Maintain services once they are live by measuring and monitoring availability, latency and overall system health.

· Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.

· Practice sustainable incident response and blameless postmortems.

· Take a holistic approach to problem solving, by connecting the dots during a production event thru the various technology stack that makes up the platform, to optimize mean time to recover

· Work with a global team spread across tech hubs in multiple geographies and time zones

· Share knowledge and mentor junior resources

· Able to perform on-call duties on a rotational basis.

· Occasional off hours work required.




· BS degree in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent practical experience.

· Experience with Java, J2EE, WebServices (SOAP/REST), Spring/Spring Boot and Hibernate skills.

· Experience with algorithms, data structures, scripting, pipeline management, and software design.

· Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.

· Ability to debug and optimize code and automate routine tasks.

· Interest in designing, analyzing and troubleshooting large-scale distributed systems.

· Must have experience supporting a web application in a Linux / JBoss / Apache Environment

· Experience managing and supporting a batch application system, including ability to analyze complex and layered outcomes.

· Ability to write complex SQL queries to effectively utilize available resources.

· Experience with Web Servers such as Jboss or Tomcat preferred.

· We support many different stakeholders. Experience in dealing with difficult situations and making decisions with a sense of urgency is needed.

· Interest in designing, analyzing and troubleshooting large-scale distributed systems.

· Ability to debug and optimize code and automate routine tasks.

· Experience in Agile Development Environment.

· Experience in Splunk, Dyntrace, PCF (Pivotal cloud foundry) is a plus.

· Experience with public cloud (Azure, AWS) is a plus.

Must be self driven, with a passion for discovering, capturing, and training others on new information and establishing new procedures