Big data Architect with strong Spark

Peapack, NJ Peapack NJ 07977

Date : Oct-10-18

Big data Architect with strong Spark

Peapack, NJ

Oct-10-18

Work Authorization

US Citizen
GC
H1B
OPT EAD, GC EAD, L2 EAD, H4 EAD, TN EAD

Preferred Employment

Corp-Corp
W2-Permanent
W2-Contract
1099-Contract
Contract to Hire

Job Details

Experience

:

Architect

Rate/Salary ($)

:

Market

Duration

:

LONG TERM

Sp. Area

:

Data Warehousing/ETL

Sp. Skills

:

Data Architect

Consulting / Contract

Required Skills :

linux, java, python, hbase, linux, spark, kafka

Preferred Skills :

Domain :

IT/Software

Work Authorization

US Citizen
GC
OPT EAD, GC EAD, L2 EAD, H4 EAD, TN EAD
H1B

Preferred Employment

Corp-Corp
W2-Permanent
W2-Contract
1099-Contract
Contract to Hire

Job Details

Experience

:

Architect

Rate/Salary ($)

:

Market

Duration

:

LONG TERM

Sp. Area

:

Data Warehousing/ETL

Sp. Skills

:

Data Architect

Consulting / Contract

Required Skills :

linux, java, python, hbase, linux, spark, kafka

Preferred Skills :

Domain : IT/Software

Apply Now

Nityo Infotech Corporation
Pittsburgh, PA
Post Resume to
View Contact Details &
Apply for Job

Job Description :

10+ Years of experience in building platform, linux administration/database administration or Strong programming experience with Java / Python
3+ Years with Hadoop Ecosystem including Spark, Hbase, Kafka, Sentry, Sqoop, flume, oozie, Jupyter Notbook, Zeppelin
Experience of taking Spark to production and running production workloads is a must
Any production experience of running Spark on containers
Expertize in productionizing spark based applications in big data, Hadoop and cloud environments
Expertize in using Spark with big data processing and analytics use cases in production
Performance tuning of spark clusters and optimizing the configuration
Integration of Jupyter Notbook, Zeppelin etc., with Spark and other environments
Experience with Interoperability of various components in the Ecosystem
Experience with architecture and enterprise deployments for Big data/Spark based distributed environments
Experience with Containers and Kubernetes. Especially Spark setup with AWS EC2 and Kubernetes containers.
Expertize with Linux OS / RHEL
Batch Processing using Apache Spark, EMR, MapR and ability to recommend the right pattern for use case
Stream processing - Spark streaming, Apache Storm, Flink ,Kafka etc.,
Python / Unix Shell scripting
Ability to do capacity sizing with Hadoop and Spark based Cluster

Hot Job

Developer, New York, NY

Oct-15-25

Sage IT INC

($) : USD 70

Job Description:, ALL CAPS, NO SPACES B/T UNDERSCORES, , PTN_US_GBAMSREQID_CandidateBeelineID, i.e. PTN_US_9999999_SKIPJOHNSON0413, , Bill Rate: $65-70/hr, , MSP Owner: Kelly Gosciminski, Location: New York, NY - hybrid onsite, Duration: 6 months, GBaMS ReqID: 10272565, , - Experience 10 years of experience in data engineering or a related role., - Python Proficiency Strong proficiency in Python programming, including experience with data manipulation libraries such as Pandas and NumPy., - Data

Hot Job

Developer, New York, NY

Oct-15-25

Sage IT INC

($) : USD 80

Job Description:, ALL CAPS, NO SPACES BETWEEN UNDERSCORES, , Bill Rate:, , PTN_US_GBAMSREQID_CandidateBeelineID, Example: PTN_US_9999999_SKIPJOHNSON0413, , MSP Owner: Michelle Lee, Location: Bentonville, AR, Duration: 6 months, GBaMS ReqID: 10279411, , , Ideal candidates should be:, , *Well versed with Hadoop, Spark, Cloud, PythonScala and Java, Streaming, Kafka, Backend, J2EE. You evangelize an extremely high standard of code quality, system reliability, and performance., *You have a proven tra

Hot Job

Developer, New York, NY

Oct-15-25

Sage IT INC

($) : USD 80

Job Description:, ALL CAPS, NO SPACES BETWEEN UNDERSCORES, , Bill Rate:, , PTN_US_GBAMSREQID_CandidateBeelineID, Example: PTN_US_9999999_SKIPJOHNSON0413, , MSP Owner: Michelle Lee, Location: Bentonville, AR, Duration: 6 months, GBaMS ReqID: 10279412, , , Ideal candidates should be:, , *Well versed with Hadoop, Spark, Cloud, PythonScala and Java, Streaming, Kafka, Backend, J2EE. You evangelize an extremely high standard of code quality, system reliability, and performance., *You have a proven tra

Hot Job

Apache Spark - L1 Support, New York, NY

Oct-15-25

Sage IT INC

($) : $60k - $130k/year

Job DetailsJob Description Position: Apache Spark L1 SupportJob DescriptionApache Spark + Kubernetes - Must have very good experiencePySpark / Python - Must have some experience Hadoop - Good to haveApache Spark JD - L1 SupportGood with spark and KubernetesHas potential to learn and adapt to new processes He was able to provide context on his recent contributions with PythonHas theoretical understanding of Hadoop, since existing project stack is on S3, but willing to learn.Support tasks: rec

Java Solution Architect, Jersey City, NJ

Oct-04-25

Robotics technology LLC

($) : $80

Mandatory Skills: Java, Microservices, AWS, Kafka. Strong Application Development work experience - Agile environment preferred. Solid application design, coding, testing, maintenance and debugging skills Lead the design and architecture of scalable, enterprise-grade Java applications, ensuring alignment with business goals and technical best practices. Collaborate with cross-functional teams and stakeholders to translate requirements into effective, maintainable solutions. Prov

Hot Job

Network Data Architect, New York, NY

Oct-15-25

Sage IT INC

($) : USD 80.9000000000000

Role - Network Data Architect Domain - Telecom with OSS/BSS is a must Detailed Job Description: Primary skill - Cloud, Big data, data warehouse, data modeling, Microservices, K8s/Docker In the role of Data Architect, you will be a senior-level strategic professional responsible for designing, building, and managing the organization's data architecture, strategies, and solutions. This role ensures the integrity, security, availability, and usability of data across the enterprise to support b

Data Architect : Local NY/NJ, New York, NY

Sep-19-25

Everest Consulting Group

($) : 70

We are seeking a Data Architect with hands-on experience in modern data architecture, analytics engineering, and cloud-native data platforms. You will help shape and deliver high-quality, scalable data products using dbt Cloud, Databricks, and AWS, supporting analytics and business initiatives across the enterprise. This role is ideal for an experienced data professional who understands how to bridge data modeling, transformation, and platform operations with a product mindset. You will work

[Already Applied]

ETL AB initio Developer, Jersey City, NJ

Sep-25-25

Robotics technology LLC

($) : $70

Role: ETL AB initio Developer Location: Jersey City, NJ Mandatory Skills: Python, Java, Ansi SQL, Abinitio, Spark Experience in building data integration solutions using the AB initio ETL . Creating efficient and scalable data processing pipelines and applications using Apache spark Proficient in understanding distributed computing principles. Creating ,optimizing, and maintaining directed acyclic graphs(DAGs) in Python to define and orchestrate data pipelines and automated tasks.

Hot Job

Sr. Azure Data Architect (Databricks & S, New York City, NY

Oct-15-25

SoftPath Technologies LLC

($) : $60k - $130k/year

Role: Sr. Azure Data Architect (Databricks & Strong Banking Domain) Location : New York City, NY Duration: Long term contract Interview: Video + Client In-Person interview Domain: Only with Strong Banking domain experience which is mandate Job Overview: We are seeking an experienced Azure Data Architect to design and implement scalable, secure, and efficient data solutions on Azure Cloud for our financial services client. The architect will lead the development of data platforms usi

Sr. full stack engineer – React/Python/P, New York, NY

Oct-09-25

MND Systems

($) : 1

Responsibilities: Contribute to the design, development, and deployment of the firm’s central data marketplace platform, ensuring scalability, performance, reliability, and security to serve enterprise-wide business needs.Architect and build modern, user-friendly, and highly responsive full-stack applications that enable seamless discovery, access, and governance of data assets across the organization.Architect and build event-driven processes such as data subscriptions and distribut