Job Description:

Position: GCP Data Engineer

Location: Dearborn, MI

JD:

Design and build production data engineering solutions to deliver data pipeline patterns using the following Google Cloud Platform (GCP) services:

  • In-depth understanding of Google's product technology and underlying architectures
  • BigQuery – Warehouse/data marts – Thorough understanding of BigQuery internals to write efficient queries for ELT needs, create views/materialized views, build reusable stored procedures, etc.
  • Dataflow (Apache Beam) – reusable Flex templates/data processing frameworks using Java for both batch and streaming needs (an illustrative sketch follows this list).
  • Pub/Sub, Kafka, Confluent Kafka – real-time streaming of database changes or events.
  • Experience designing, building, and deploying production-level data pipelines using Kafka; strong experience working on event-driven architecture
  • Strong knowledge of the Kafka Connect framework, with experience using several connector types (HTTP REST proxy, JMS, File, SFTP, JDBC, etc.)
  • Experience in handling huge volumes of streaming messages from Kafka
  • Cloud Composer (Apache Airflow) – to build, monitor, and orchestrate data pipelines
  • Knowledge of Bigtable
  • Cloud SQL, Compute Engine, Cloud Functions, Cloud Run, App Engine, and Cloud Storage
  • Experience with open-source distributed storage and processing utilities in the Apache Hadoop family.
  • Extensive knowledge of processing various file formats (ORC, Avro, CSV, JSON, XML, etc.).
  • Knowledge of/experience with ETL tools such as DataStage or Informatica – ability to understand existing on-premises ETL workflows and redesign them in GCP.
  • Experience and expertise with Terraform to deploy GCP resources through CI/CD.
  • Knowledge of/experience with connecting to on-prem APIs from Google Cloud.
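
For illustration only (not part of the client's requirements): a minimal sketch of the kind of Dataflow (Apache Beam) streaming pipeline in Java the bullets above describe, reading change events from Pub/Sub and appending them to an existing BigQuery table. The project, topic, dataset, and table names are placeholders, and the single "payload" column is an assumption; a production pipeline would parse each event into a proper schema.

    import com.google.api.services.bigquery.model.TableRow;
    import org.apache.beam.sdk.Pipeline;
    import org.apache.beam.sdk.io.gcp.bigquery.BigQueryIO;
    import org.apache.beam.sdk.io.gcp.pubsub.PubsubIO;
    import org.apache.beam.sdk.options.PipelineOptionsFactory;
    import org.apache.beam.sdk.options.StreamingOptions;
    import org.apache.beam.sdk.transforms.MapElements;
    import org.apache.beam.sdk.values.TypeDescriptor;

    public class PubSubToBigQuerySketch {
      public static void main(String[] args) {
        // Standard Beam options; streaming mode is enabled explicitly.
        StreamingOptions options =
            PipelineOptionsFactory.fromArgs(args).withValidation().as(StreamingOptions.class);
        options.setStreaming(true);

        Pipeline pipeline = Pipeline.create(options);

        pipeline
            // Read raw message payloads from a Pub/Sub topic (placeholder name).
            .apply("ReadFromPubSub",
                PubsubIO.readStrings().fromTopic("projects/my-project/topics/change-events"))
            // Wrap each payload in a one-column TableRow; a real pipeline would
            // parse the event (JSON, Avro, etc.) into a proper schema here.
            .apply("ToTableRow",
                MapElements.into(TypeDescriptor.of(TableRow.class))
                    .via(payload -> new TableRow().set("payload", payload)))
            // Append the rows to an existing BigQuery table (placeholder name).
            .apply("WriteToBigQuery",
                BigQueryIO.writeTableRows()
                    .to("my-project:my_dataset.raw_events")
                    .withCreateDisposition(BigQueryIO.Write.CreateDisposition.CREATE_NEVER)
                    .withWriteDisposition(BigQueryIO.Write.WriteDisposition.WRITE_APPEND));

        pipeline.run();
      }
    }

Run with the DirectRunner for local testing, or pass --runner=DataflowRunner with the usual project/region/temp-location flags to execute on Dataflow; packaging the same pipeline as a reusable Flex Template adds a container image and a template spec file around this code.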


Client: TECH MAHINDRA

             
