Computational Resources

ERIS provides a range of computational resources, platforms and scientific computing support for research and innovation at Mass General Brigham hospitals. Our high-performance analysis servers, compute clusters and storage are relied upon daily for data processing and analysis by research groups across the organization. Clinical computational workflows such as genome sequencing and radiation dosimetry are also supported.

Service Model - What is the cost?

Shared resources are available to all users at no cost. This includes a basic storage quota, shared computational resources and assistance from the ERIS Support teams. Additional storage and compute capacity can be acquired using research funds.

Selecting a Computational Resource

When choosing a computational resource from our list of services below, consider

Your preferred computing platform (Microsoft Windows, Linux or Hadoop)
What the software application requirements are - what platforms does the software run on?
If you will work interactively with applications and data, or submit many jobs together for batch-processing
How large is the data you will be working with, what are the storage requirements?
How much memory will the application require?

‼ For details on the HPC Remediation plan see the section below

ShowHide

HPC Remediation Plan

Modernized, more reliable version of the ERISTwo HPC cluster that rebuilds the existing infrastructure using current-generation tools

MGB Digital is currently in the process of improving the ERISTwo Linux Cluster as part of our HPC remediation plan. The remediation project leverages the existing ERISTwo cluster hardware while implementing new operational and system-level processes designed to enhance performance, stability, and usability, building a more modern and sustainable HPC platform. This upgraded version of ERISTwo is what we’re calling ERIS Nucleus.

Approach

We are taking the existing ERISTwo and ERISXdl infrastructure and rebuilding it using modern HPC cluster management tools and methodologies. The project follows a phased approach:

Phase 1: Establish foundational infrastructure and services
Phase 2: Deploy ERISTwo Nucleus (a subset of existing cluster nodes) as a modern HPC cluster
Phase 3: Integrate the remaining cluster nodes into ERISTwo Nucleus

This is a major upgrade that includes a new RHEL 9 operating system, an updated software toolbox, and an upgraded 100 GbE network fabric. These changes will require many existing workflows and scientific applications to be thoroughly tested, so your participation in testing is essential.

Why are we doing this?

The ERISTwo HPC cluster has been experiencing performance, consistency, and stability issues. Additionally, the ERISXdl GPU cluster’s Slurm/Kubernetes setup has proven complex for many AI/ML and other workloads.

What are some of the underlying reasons for these issues?

The existing 40GbE network is underperforming and poorly configured.
The ERISTwo cluster still relies on legacy ERISOne services and processes, making day-to-day operations fragile.
The cluster is manually managed and configured, increasing the risk of configuration drift and instability.
Cluster services lack resilience, creating additional risks of instability.

ShowHide

ERIS Linux Cluster

High Performance Computing System with a job scheduler for batch jobs, storage and remote desktops with GPUs for graphical applications

The ERIS Linux Cluster is an ecosystem of scientific computing resources centered around a cluster of remote-desktop and compute nodes connected to very high speed storage. A large selection of popular scientific applications are installed and you can request additional software packages to be added. The cluster runs a Linux operating system and requires some familiarity with Linux for efficient use.

This platform is ideal for workflows that run many jobs in parallel, and for those that read and write many files or require very high speed access to data files. A job scheduling system queues jobs for dispatch to the compute nodes, allowing submission of many jobs at once. Linux remote-desktop nodes allow graphical applications for data visualization to interacting with data stored on the cluster, as well as software development and application testing. Some research groups also dispatch analysis pipeline jobs to the cluster through custom web-portals.

Typical Uses

Medical image processing
Genome sequencing
Monte-carlo modeling
MPI parallel workloads
Very large memory jobs
AI/ML and Deep Learning using NVIDIA GPUs

Supported methods of connecting to the cluster

SSH command line terminal for job submission
NoMachine Linux remote desktops for graphical applications
Network file share (SMB/CIFS) for data transfer
Web portals and applications (R Studio, Jupyter, etc)

Getting an account?

Use the registration link

ShowHide

ERISXdl Linux GPU Platform

ERISXdl (ERIS Extreme Deep Learning) platform provides efficient, multi-GPU performance designed for Deep Learning applications

ERIS Scientific Computing has implemented a new deep learning GPU Cluster, ERISXdl (ERIS Extreme Deep Learning). This system is built with NVIDIA DGX-1, an integrated system that includes high-performance GPU interconnection, delivering industry-leading performance for AI and deep learning. ERISXdl platform provides efficient, high-bandwidth streaming of training data, multi-GPU performance designed for HPC and Deep Learning applications. The platform supports Docker containerized environments to easily emulate the entire software workflow and maintain portability and reproducibility, as well as Jupyter notebooks for rapid development, integration with Github, and HPC scheduler Slurm to distribute the workload across the system. We provide access to high-bandwidth, low-latency Briefcase storage for the data required for analytics.

ERISXdl has a free basic tier for test and quick debugging of jobs. The main available partitions are available via a chargeback model at a rate of $0.01 min GPU. Users can sign up at rcservices.partners.org. More details about this platform can be found here.

Typical Uses

Deep Learning development for fast Image Neutral Network model development
TensorFlow applications for highly efficient GPU-enabled workloads.
Deployment of Docker containerized applications with GPU requirements.

Supported methods of connecting to the cluster

SSH command line terminal.
JupyterLab webportal for fast development.

ShowHide

Analytics Enclave Hub

A highly secure, privacy-aware, data ecosystem equipped with self-service AI, machine learning, and research data tools.

Launched in 2020, the Analytics Enclave, "Enclave Platform", is a highly secure, privacy-aware, data ecosystem equipped with self-service AI, machine learning, and research data tools that facilitate on-demand data analysis at the project-, program-, or institution-level.

The controlled and protected environment offers authorized users a dedicated, centralized, scalable, and customizable workspace for collaboration on highly confidential or sensitive data with predictable timeframe for completing planned or exploratory analyses.

The Enclave Platform supports cross-institution collaboration (for example: MGB research community and external clinical industry partners) on MGB data or curated data marts. At present, the Enclave platform has several COVID-19 data marts and project-specific databases.

Learn more about Analytics Enclave Resources, Accessing the Enclave, and Enclave Support.

Analytics Enclave Toolkit

This is a compilation of resources for end-users and the Analytics Enclave team aimed at optimizing the data analytics solutions and overall experience with the Analytics Enclave.

Access Request Form

Analytics Enclave FAQs & Tips

Analytics Enclave Resources

Enclave Best Practices & Guidelines

My.AnalyticsEnclave

Project Offboarding/Closing Out

Complete PDSR Curated Data Set

COVID-19 Tools

Analytics Enclave Office Hours

Contact Us

ShowHide

Windows Analysis Servers

Powerful servers with scientific applications installed and remote desktop capability, running on a Microsoft Windows OS

The Windows Analysis Servers are large-memory computers with a variety of free and commercial scientific applications installed. Running your data analysis in a remote desktop on these servers is easy, using Microsoft Remote Desktop. Some files may be stored locally on the server and for additional storage space they can connect to other network file shares at Partners. LINK TO STORAGE, APP

Typical Uses

Statistical analysis, spreadsheets and charting
Data visualization

Supported methods of connecting to the servers

Microsoft Remote Desktop Connection
Network file share (SMB/CIFS) for data transfer

Getting an account?

Use the registration form

Get Help

Service Model - What is the cost?

Selecting a Computational Resource

‼ For details on the HPC Remediation plan see the section below

Typical Uses

Supported methods of connecting to the cluster

ERISTwo Linux Cluster

Application Availability on ERISTwo

Service Model & Cost

Typical Uses

Supported methods of connecting to the cluster

ERISXdl GPU Linux Platform

Analytics Enclave Toolkit

Typical Uses

Supported methods of connecting to the servers

Getting an account?

Windows Analysis Servers

Service Model & Cost