Sr. Data Science Application Engineer

Visit the Partners careers page to apply online. Job ID is 3119363 

Through investments in big data and analytics, Partners Healthcare Systems (PHS) is innovating and transforming the delivery of health care, research and discovery. PHS’ Enterprise Research IS (ERIS) team is tasked with providing the enterprise platform for raw data and self-service analytics necessary to derive insights and applications through the development of machine learning (ML) and artificial intelligence (AI) capabilities.

ERIS is immediately seeking an experienced engineer with demonstrated experience in implementing containerized ML platforms to deliver running models to production. This position will be part of the multi-disciplinary engineering team that architects, builds, maintains and provides the Data Enclave platform for Partners HealthCare.

The Data Enclave is a project that will allow the analysis of multiple types of healthcare data in a secure environment. It will enable data science teams to develop algorithms, leveraging different types of data coming from diverse data sources, such as wearable physiological sensors, EHR systems, etc., to transform patient care and medical research.

Using your knowledge of containers, big data technologies, Linux, and data science projects, you will specialize in evaluating, testing and installing data science tools and frameworks for our biomedical research, medical innovation, personalized medicine and healthcare analytics.

Candidates are not expected to be deeply knowledgeable in all areas but must demonstrate the ability and desire to learn and to support a large and diverse environment. Ideal candidates thrive on variety and innovation in their daily work, on interaction with customers who are world-renowned leaders in their scientific field, and on working with a wide range of technologies in an academic environment.

ERIS enables and provides technology and solutions to the highly successful and innovative research and clinical programs of the top teaching hospitals in the world: Massachusetts General (MGH #1 healthcare research organization), Brigham and Women’s (BWH #2), McLean (#23) and Spaulding Rehabilitation (#24).

Principal Duties and Responsibilities:

  • Work closely with data scientists and researchers to build and optimize the technology and frameworks, such as Docker/Kubernetes, Spark, Python/R and Hadoop, by leveraging your understanding of these technologies.
  • Build Docker images or VM images pre-installed with ML and deep learning tools such as NLTK, TensorFlow, etc., to make it easy for data science teams to train and deploy models quickly.
  • Evaluate, select and deploy tools and/or cloud solutions for data science within the Data Enclave, such as Apache Spark, Kubernetes, etc.
  • Develop, publish and maintain knowledge base articles and documentation on system features, best practices and usage how-tos, as well as training and reference materials for the community, using knowledge management tools.
  • Analyze and resolve customer and technical problems; troubleshoot platform problems when issues arise.
  • Analyze the results of server monitoring and implement changes to improve performance, processing and utilization. Propose, maintain and enforce policies, practices and security procedures.
  • Perform other duties as assigned or required by the situation and circumstances.

Qualifications:

  • BA/BS/engineering degree required or equivalent combination of skills/experience. Advanced degree in engineering or related scientific discipline preferred.
  • Minimum of 8 years of experience working with systems, databases or programming environments. Expert Linux user experience required.
  • 4+ years of experience with Python.
  • Minimum of 2 years of experience working with Docker or Kubernetes.
  • First-hand experience with or direct exposure to Data Science and ML projects.
  • Highly desired: 
    • More than 1 year of experience with Apache Spark.
    • More than 1 year of experience with at least one additional database technology (SQL or NoSQL, e.g. HBase, Hive, Pig, Cassandra, MongoDB).
    • A combination of education and experience may be substituted for requirements.

Skills/Abilities/Competencies Required:

  • Significant experience with Linux. Experience using open source tools and integrating open source packages.
  • Strong knowledge of and experience with the GNU toolchain and the GCC compiler
  • Demonstrated experience with software development processes, including compilation, installation and development of automated builds
  • Proficiency in one or more programming languages (Python and R preferred)
  • Understanding of both relational and modern databases and their corresponding query languages, such as SQL.
  • Experience with code version control, specifically Git and GitHub 
  • Experience with or deep understanding of Data Science and ML projects (AI, NLP, deep learning)
  • Experience with Docker containers and related technologies (Kubernetes, Platform9, etc.)
  • Strong verbal and written communication skills, including the ability to write clear technical documentation.
  • High level of initiative and eagerness to learn new technologies.
  • Highly desired: 
    • Experience with Hadoop-related technologies (Hive, Spark, HDFS)
    • Familiarity with information technology security and data privacy considerations applicable to a healthcare environment is advantageous. 
    • Experience providing support to research investigators with diverse computing needs is a strong plus.

Working Conditions:

  • Standard office environment with travel to hospital locations in the Boston metro area, including the data centers.
  • As projects and priorities dictate, flexible work and off-hours are required, including evening, night and weekend hours, to cover events, roll-outs and special projects.
  • Occasionally lift and carry supplies and equipment weighing up to 25 pounds.

EEO Statement 
Partners HealthCare is an Equal Opportunity Employer. By embracing diverse skills, perspectives and ideas, we choose to lead. All qualified applicants will receive consideration for employment without regard to race, color, religious creed, national origin, sex, age, gender identity, disability, sexual orientation, military service, genetic information, and/or other status protected under law.

Primary Location: MA-Somerville-PHS Assembly Row
Work Locations: PHS Assembly Row, 399 Revolution Drive, Somerville 02145
Job: Business and Systems Analyst
Organization: Partners HealthCare (PHS)
Schedule: Full-time
Standard Hours: 40 
Shift: Day Job
Employee Status: Regular
Recruiting Department: PHS Enterprise Data & Digital Health

Job Posting: Feb 7, 2020
