Job Description:
Senior Data Engineer

Role:
Robert Bosch is a world-class engineering and electronics company with over 200 plants and thousands of assembly lines worldwide. Our products impact hundreds of millions of people every day, many in safety-critical systems. We rely on data for every aspect of our operations, and we collect a lot of it.

Our team is responsible for streaming Bosch data to centralized analytics platforms and building data-based services for a wide variety of Bosch engineering and research teams.

We are looking for a talented engineer who is passionate about building fault-tolerant data services and analytics tools. Your work will be used by hundreds of Bosch engineers and have global impact by improving the quality and value of Bosch products.

Primary Responsibilities:
Design and implement fault-tolerant data pipelines to integrate large amounts of data from many diverse storage systems.
Promote a culture of self-serve data analytics by minimizing technical barriers to data access and understanding.
Execute complex data engineering projects that have a significant impact on Bosch's global business.
Share knowledge by clearly articulating results and ideas to customers, managers, and key decision makers.
Stay current with the latest research and technology and communicate your knowledge throughout the enterprise.
Take responsibility for preparing data for analysis and provide critical feedback on issues of data integrity.
Up to 10% travel may be required.

Job Requirements:
Degree Level: BS or MS in Computer Science or other engineering discipline
3+ years industry experience building and operating distributed data systems in production
Very strong programming skills in Scala or Java
Strong understanding of tuning and performance optimization of Apache Spark jobs
Experience with integration of data from multiple data sources
Experience with various messaging systems, such as Kafka or RabbitMQ
Ability to manage and solve ongoing issues with a Spark/Hadoop cluster

Desired:
Familiarity with distributed machine learning frameworks such as Spark MLlib
General understanding of machine learning / deep learning methods