Job Description:
Role: Data Analytics Scientist
Location: South San Francisco, CA

Description:
We are currently recruiting for a motivated Data Scientist. This is a unique opportunity to see your ideas rapidly translate into the clinic and impact patient lives. The Data Scientist will be instrumental in developing innovative models and data solutions that address specific business needs, and in bringing cutting-edge research into production.

Responsibilities:
- Work with cross-functional research and operational departments to implement production-quality, cutting-edge data management, analysis, and visualization tools
- Help build production-quality, scalable bioinformatics and scientific data pipelines, and take responsibility for managing and integrating functional genomics and immuno-oncology data
- Build and maintain relationships with external collaborators to obtain and manage complex longitudinal patient data
- Communicate data and scientific results to both internal and external audiences, and contribute to open-source and collaborative projects
- Collaborate with other data scientists to solve problems by building extensible software, replicable analyses, and robust, well-tested data pipelines

Qualifications:
- B.S. degree in Computer Science or a related field, with 5+ years of prior experience in an operations-focused Data Scientist role
- 3+ years of hands-on experience analyzing data, drawing conclusions, defining recommended actions, building predictive models, and visualizing results
- 3+ years of SQL development experience writing complex queries
- 2+ years of experience with Python, R, Java, C++, Julia, or another widely used open-source programming language
- 2+ years of experience deploying machine learning algorithms in a production environment
- 3+ years of experience with data visualization tools (Spotfire, R Shiny, Looker, Tableau, or similar)
- 2+ years of experience working with SQL and NoSQL databases
- Experience working with cloud environments: AWS, Azure, and/or GCP
- Experience working with Linux/UNIX shell virtual machines
- Experience with big data processing tools such as Hadoop, MapReduce, Pig, and Hive, with considerable experience working with Spark
- Familiarity with distributed systems (Docker, Kubernetes, Kafka, Spark, etc.)