Job Description:
Job Title: Hadoop Architect
Location: McLean, VA
Duration: Long Term


Responsibilities:
Responsible for delivery in big data engineering, data science, and machine learning, including technology implementations and algorithm development.
Develop scalable and reliable data solutions to move data across systems from multiple sources in both real-time and batch modes.
Construct data staging layers and fast real-time systems to feed BI applications and machine learning algorithms.
Review and independently test the effectiveness and accuracy of image analytics, NLP, and machine learning models.
Apply expertise in models that leverage the newest data sources, technologies, and tools, such as machine learning, Python, Hadoop, Spark, and Azure/AWS, as well as other cutting-edge big data tools and applications.
Investigate the impact of new technologies, applications, and data sources on the future secondary mortgage business.
Quickly learn new tools and paradigms to deploy cutting-edge solutions.
Develop both deployment architecture and scripts for automated system deployment in Azure/AWS.
Create large-scale deployments using newly researched methodologies.
Work in an Agile environment.
Mentor junior engineers.

Basic Qualifications
Bachelor’s degree in Mathematics, Statistics, or Computer Science
Solid experience with Hadoop, including Hive, HDFS, MapReduce, and Spark
Comprehensive knowledge of modern statistical learning methods
At least 3 years’ experience in Python (NumPy, SciPy, scikit-learn, pandas) and other open source programming languages for large-scale data analysis
At least 3 years’ experience in Java (Spring Boot)
At least 3 years’ experience with machine learning and natural language processing
At least 10 years’ experience with relational databases

Preferred Qualifications
Master’s Degree in Computer Science
3+ years of experience working with AWS/Azure
2+ years of experience working with financial data
Familiarity with one or more streaming technologies, such as Kafka or NiFi
Experience with NoSQL databases
5+ years of experience in Python (including NLP) for large-scale data analysis
10+ years of experience with SQL
Strong communication skills, with the ability to work both independently and in project teams