Title: Data Scientist with Stata and Quant
Location: Houston, TX

To design and develop machine learning capabilities and tools for complex, volumetric datasets.


· Selecting and engineering features for predictive models

· Creating optimized predictive models using machine learning algorithms

· Enhancing data collection procedures to include information that is relevant for building analytic systems

· Processing, cleansing, and verifying the integrity of data used for analysis

· Performing exploratory data analysis (EDA)

· Performing ad hoc analysis and presenting results clearly

Skills and Qualifications

· Master's or PhD in Math/Stats/CS/Physics or a comparable quantitative discipline

· Excellent understanding of machine learning techniques

· Strong understanding of machine learning algorithms, such as those in scikit-learn and MLlib

· Experience with deep learning frameworks (preferably TensorFlow)

· Solid experience in Python and PySpark

· Experience with Python libraries included in the Anaconda distribution, such as NumPy, pandas, and Matplotlib

· Experience with Hadoop and Spark

· Experience in Stata

· Experience in R or SAS is a plus

· Great communication skills

· Proficiency in query languages such as SQL, HiveQL, and Pig Latin

· Experience with NoSQL databases such as MongoDB, Cassandra, and HBase is desired

· Good applied statistics skills, including distributions, statistical testing, and regression

