Data Scientist II - Cell and Gene Therapy

Philadelphia, PA, US, 19104

​Location: LOC_HUB-Hub for Clinical Collaboration 

Req ID: 192012

Shift: Days

Employment Status: Regular - Full Time 

​About Us
We’re seeking breakthrough makers! Children’s Hospital of Philadelphia was built on the belief that we can change lives. Today, in every role throughout our hospital, research institute and care network, the 22,000 members of our workforce are finding new ways – big and small – to make a difference for the patients and families we serve. 

If you are ready to challenge yourself, be inspired and grow – no matter what your role – you just may be the kind of breakthrough maker who will thrive at CHOP.

​CHOP is proud to share that we are ranked No. 1 on Forbes' 2022 list of America's Best Large Employers!

Job Summary

This is a full-time position located in Philadelphia, PA.  We are open to remote-hybrid or 100% remote work arrangements for U.S. based employees. 


You will join a cross-functional agile team that integrates clinical data sources to build dashboards, predictive models, and web apps. We work with Dr. Stephan Grupp’s group to improve the care of children who receive bone marrow transplants and other Cell and Gene Therapies. Your work will directly impact how we treat children with leukemias and other devastating diseases!


About Us: 


US News and World Report has ranked CHOP the #1 Children’s Hospital in the Mid-Atlantic region and its Pediatric Cancer Center the #1 in the country. In addition, Forbes has ranked CHOP the #1 large employer in the country. Check out this 4-minute video to get a sense of what it’s like to work here:


CHOP invests massively in analytics. We have a large and mature enterprise data warehouse and a complete technology stack for developing and deploying custom built applications and ETL processes. You will work alongside developers, database architects, data analysts, and data scientists, all aligned to provide the best possible care for children.


CHOP is committed to a culture of diversity in all its forms. We offer a competitive salary, health benefits, and other perks. We take mentorship and career growth seriously. You will get credit for your contributions to clinical research, and we will support your attendance at conferences and your efforts at achieving specialty certifications. 


About You:


You are a great communicator who is humble, kind, curious, and people smart. You should be familiar with the R programming language and ecosystem, including RStudio and RStudio Connect; relational databases and SQL; and good software engineering methodologies such as unit testing, version control with git/GitHub, code reviews, and continuous integration. It’s a plus if you have contributed to open source projects and/or are familiar with any of the following: the tidyverse dialect of R; the tidymodels framework for predictive modeling; basic familiarity with JavaScript; REDCap; agile project management; AWS (or another public cloud provider), kubernetes. 

Job Responsibilities

  • Implement computational algorithms and experiments for test and evaluation; interprets data to assess algorithm performance.
  • Make significant contributions to the formulation of analysis plans and associated documentation of methods that meet stringent criteria for reproducibility and measures of significance.
  • Develop high-quality code implementing models and algorithms as application programming interfaces or other service-oriented software implementations.
  • Participate in communication of research methods, implementation, and results to varied audience of clinicians, scientists, analysts, and programmers.
  • Work closely with applications research group to translate models and algorithms into engineered production applications.
  • Contribute to manuscript writing for results publication, authors abstracts, and presents at professional conferences.

Required Education and Experience

Required Education: Bachelors Degree


Required Experience: At least three (3) years of experience with progressively more complex data science, applied statistics, machine learning, or mathematical modeling projects.

Preferred Education, Experience & Cert/Lic

Preferred Education: Masters in Analytics, Data Science, Statistics, Mathematics, Computer Science or a related field.

Preferred Certificate in Analytics, Statistics, or Data Science.


Preferred Experience: At least four (4) years of experience with progressively more complex data science, applied statistics, machine learning, or mathematical modeling projects.

Additional Technical Requirements

  • Experience and demonstrated ability acquiring new technical/analytic skills and domain knowledge to support successful contribution to research and development projects is required.
  • Experience formulating or contributing to the formulation of analysis plans and selection of appropriate methods.
  • Experience using existing machine learning and analytic tools such as ScikitLearn, Weka, R, and Mathematica in either applied educational or professional projects is required.
  • Experience writing code in either applied educational or professional projects using one or more of the following languages: Python, Scala, Java is required.
  • Familiarity with relational databases (e.g. Postgres, MySQL) strongly preferred.
  • Familiarity RESTful web services application programming interfaces preferred.
  • Strong verbal and written communications skills with the demonstrated ability to explain complex technical concepts to a lay audience.
  • Applied statistics or mathematical modeling experience preferred.
  • Natural language processing experience particularly in the biological and medical domains preferred.
  • Experience using distributed computing technologies (e.g. Akka, MapReduce, Cuda) preferred.
  • Familiarity with graph, key value, and document data stores (e.g. Neo4j, Hadoop, MongoDB) preferred.
  • Experience creating informative visualizations for complex, high dimensional data preferred.
  • Experience with probabilistic graphical models, time series predictive models, Markov models preferred.

To carry out its mission, it is of critical importance for the Children’s Hospital of Philadelphia (CHOP) to keep our patients, families and workforce safe and healthy and to support the health of our global community. In keeping with this, CHOP has mandated all workforce members on site at any CHOP location for any portion of their time be vaccinated for COVID-19 as a condition of employment.  This mandate also applies to workforce members performing work for CHOP at non-CHOP locations. Additionally, all workforce members based in or regularly scheduled to work at any New Jersey location are mandated to be both vaccinated and boosted for COVID-19, with booster timing consistent with applicable guidelines. The CHOP COVID-19 vaccine mandate is in alignment with applicable local, state and federal mandates. CHOP also requires all workforce members who work in patient care buildings or who provide patient care to receive an annual influenza vaccine. Employees may request exemption consideration for CHOP vaccine requirements for valid religious and medical reasons.  Please note start dates may be delayed until candidates are fully immunized or valid exemption requests are reviewed. In addition, candidates other than those in positions with regularly scheduled hours in New Jersey, must attest to not using tobacco products.
EEO / VEVRAA Federal Contractor


Nearest Major Market: Philadelphia

Job Segment: Cloud, Testing, Patient Care, Clinical Research, Medical Research, Technology, Healthcare