Job Description:
Designing and developing machine learning capabilities and tools for complex, volumetric datasets
Selecting and engineering features for predictive models
Creating optimized predictive models using machine learning algorithms
Enhancing data collection procedures to include information that is relevant for building analytic systems
Processing, cleansing, and verifying the integrity of data used for analysis
Performing exploratory data analysis (EDA)
Performing ad hoc analyses and presenting results clearly
Skills and Qualifications:
Master's or PhD in Math/Stats/CS/Physics or a comparable quantitative discipline
Excellent understanding of machine learning techniques
Strong understanding of machine learning algorithms, such as those in scikit-learn and MLlib
Experience with deep learning frameworks (preferably TensorFlow)
Solid experience in Python and PySpark
Experience with Python data libraries in the Anaconda distribution, such as NumPy, pandas, and Matplotlib
Experience with Hadoop and Spark
Experience with Stata
Experience with R or SAS is a plus
Great communication skills
Proficiency in query languages such as SQL, HiveQL, and Pig Latin
Experience with NoSQL databases such as MongoDB, Cassandra, and HBase is desired
Good applied statistics skills, including distributions, statistical testing, and regression