Job Description :

Job Title: Data Scientist

Location-Remote

No OPT

Job Description:

The Data and Analytics Services Platform
The Data and Analytic Services Platform is an internal service offering to Aon colleagues that covers
the entire value chain from data ingestion to analytics distribution. Through the capabilities within the
platform, teams can create data pipelines that connect to global data sources, ingest data in batch or
near real-time, stage it in the Raw zone of the Governed Data Lake before processing it and exposing
as structured data in its near-natural form in the Discovery zone of the lake for further analysis.
Teams then go on to combine, enrich and aggregate data using corporate reference data, through
scalable data processing jobs, deriving new data assets for the Refined zone. Machine Learning
models may also serve to enrich data assets with new and interesting insights. These refined data
assets serve as reporting layers for wider teams or form the basis of new data-driven solutions which
are delivered to colleagues or direct to clients through dashboards or custom web applications or
embedded into existing solution line applications.
About the Role & Responsibilities
Aon is looking for a short to medium term engagement with a data professional to assist with critical
up-coming projects and tasks such as an imminent evaluation of Databricks and AWS Athena and
their suitability to Aon's Data & Analytics Platform (DASP). These key initiatives will shape the
development of Aon's DASP and will have an impact on the next iteration of data processing
technologies in the platform.
The engagement will require the professional:

  • To understand Aon's current analytic environment and platform
  • To Independently test and document solutions for the Databricks Evaluation based on:
  • Spark workloads
  • Python workloads
  • R workloads
  • ML engineering from the perspective of data preparation, training and tuning,
    deployment and monitoring

To independently test and document solutions for the AWS Athena Evaluation in respect:

  • Usability and connectivity
  • Performance benchmarking
  •  Integration with AWS and non-AWS services and platforms
  •  Security mechanisms
  • Comparison with similar offerings (Starburst / Azure Synapse / …)

Required Skillset
The ideal candidate will have good knowledge and experience in the following areas:

  • Coding in Scala, Python and R.
  • ML Engineering processes and deployments
  • Hadoop distributions (particularly Cloudera)
  • Cloud Solution Architecture (AWS)
  • Databricks and similar data processing platforms
  • Notebook development environments (Jupyter / Zeppelin)
  • Query Federation / Semantic Layer Technologies (Starburst/Trino, Presto, AWS Athena, Azure
    Synapse)
  • SQL & No-SQL Database / Data Warehousing technologies (Redshift, SnowFlake, Impala,)
  • Data Storage formats and technologies (S3, Azure Data Lake/Blob Storage, Avro, Parquet,
    DeltaLake)
  • Excellent listening, presentation, and interpersonal skills.
  • Ability to communicate ideas in both technical and user-friendly language.
  • Excellent analytical, problem solving and decision-making skills.
  • Ability to prioritize and execute tasks in a high-pressure environment.
  • Experience working in a team-oriented, collaborative environment.
             

Similar Jobs you may be interested in ..