You are a seasoned ML / NLP Data Scientist who’s itching to solve large and real-world business problems by applying AI, ML and NLP techniques on a variety of structured Enterprise data and some unstructured texts. You’ll work alongside a stellar team of engineers, data scientists, UX designers, and product managers to build really cool I.P. using open-source technologies. |
Deep knowledge on Apache Spark SQL, Spark Streaming, MLib, Kafka (or equivalent), TenserFlow GraphX on enterprise projects |
Understanding of how to size a RDBMS and NOSQL databases based on product requirements Must have worked with various ML techniques like Classifications, Regressions, Clustering, knowledge graphs, and recommender systems and or NLP products involving knowledge extraction, knowledge graphs, and recommender systems Hands-on experience with tokenizers, normalizers, stemming, lemmas, entity extractions, POS tagging, synonyms and quasi-synonyms, search, and general classifier models – mid to advanced NLP techniques over documents, emails Proficiency in a Python dev environment, experience with SQL & relational databases Knowledge of any of these packages will be highly preferred ; Stanford NL API package, Google Cloud Natural Language API, Spark ML models Strong written and verbal communication skills |