Job Description:
Strong programming proficiency and working experience (10 years) in a subset of Python, R, Scala, Java (Python preferred)
Strong proficiency and experience (10 years) with data science and machine learning software stacks, e.g. NumPy, pandas, and scikit-learn for Python
Programming proficiency in Spark and MLlib (3 years)
Working experience with and understanding of large-scale data analysis systems, e.g. Hadoop or MPP databases (5 years)
Proficiency and experience with SQL (10 years)
Significant experience (10 years) implementing machine learning and data science models, including taking them through to production
Strong theoretical and practical knowledge of machine learning models and algorithms (unsupervised and supervised), their use in applications, and their advantages and disadvantages
Strong knowledge of code-based data visualization tools (e.g. matplotlib) for data exploration and to present models and results to internal and external stakeholders
Experience with cloud-based systems and toolsets (3 years)
Experience with and understanding of experiment design and evaluation
Knowledge of big data engineering toolsets is a plus
Superior data preparation skills; ability to access, transform, and manipulate Railinc and external customer data in its base form
Superior problem-solving skills