Position: Site Reliability Engineer
Location: Redmond WA
Duration: Full Time
Responsibilities:
· Serve as a primary point responsible for the overall health, performance, and capacity of one or more of our services
· Gain deep knowledge of both our complex internally developed applications and enterprise-class services.
· Assist in the roll-out and deployment of new product features and installations to facilitate our rapid iteration and constant growth.
· Develop tools to improve our ability to rapidly deploy and effectively monitor custom applications in a large-scale Linux and Windows environment.
· Work closely with development teams to ensure that platforms are designed with "operability" in mind.
· Function well in a fast-paced, rapidly-changing environment.
· Participate in a 12x7 rotation for second-tier escalations.
Basic Qualifications:
· B.S. or higher in Computer Science or other technical discipline, or related practical experience.ma
· 4+ years experience with Unix/Linux
· 4+ years experience in Programming languages (Python, Perl, Ruby, Java/Scala, or C)
· Experience with Developing large scale projects
Preferred Qualifications:
· 8+ years in a UNIX-based large-scale web operations role.
· Experience with web-based Java/J2EE architectures and JVM configuration.
· Python experience, specifically for systems automation.
· Previous experience working with geographically-distributed coworkers.
· Strong interpersonal communication skills (including listening, speaking, and writing) and ability to work well in a diverse, team-focused environment with other SREs, Engineers, Product Managers, etc.
· Knowledge of most of these: data structures, relational and non-relational databases, networking, Linux internals, file systems, web architecture, and related topics
· Experience developing, deploying, and managing Azure PaaS component based services
· Knowledge of InfoSec best practices and their application to service design
,
Nitesh Kumar