GCP Cloud-DevOps Engineer Job at Merican Inc., Canada

SmFiUWhvOFltNWxMK0svNGxGWHpMOXdBU2c9PQ==
  • Merican Inc.
  • Canada

Job Description

Job Description

Job Description

GCP DevOps HPC Engineer - 

Location: Canada(Remote)

 

 

GCP DevOps HPC Engineer - Onyx 

About the Role 

As a Senior DevOps-HPC Engineer, you will join a dynamic Engineering team in a high-energy and collaborative environment. This role is ideal for a seasoned HPC engineer with deep expertise in SLURM, Linux, and cloud migration expertise in SLURM, Linux, and cloud migrations, who thrives on leading complex projects, designing robust architectures, and implementing high-performance solutions in Google Cloud. 

Responsibilities: 

  • Lead the migration of on-premises SLURM-based HPC clusters to Google Cloud Platform. 
  • Design, implement, and manage scalable and secure HPC infrastructure solutions on GCP. 
  • Optimize SLURM configurations and workflows to ensure efficient use of cloud resources. 
  • Manage and optimize HPC environments, focusing on workload scheduling, job efficiency, and scaling SLURM clusters. 
  • Automate cluster deployment, configuration, and maintenance tasks using scripting languages (Python, Bash) and automation tools (Ansible, Terraform). 
  • Integrate HPC software stack using tools like Spack for dependency management and easy installation of HPC libraries and applications. 
  • Deploy, manage, and troubleshoot applications using MPI, OpenMP, and other parallel computing frameworks on GCP instances. 
  • Collaborate with engineering, support teams, and stakeholders to ensure smooth migration and ongoing operation of HPC workloads. 
  • Provide expert-level support for performance tuning, job scheduling, and cluster resource optimization. 
  • Stay current with emerging HPC technologies and GCP services to continually improve HPC cluster performance and cost efficiency.

Requirements: 

Basics: 

  • Minimum 5 years of experience with HPC environments, including SLURM workload manager, MPI, and other HPC-related software. 
  • Extensive hands-on experience managing Linux-based systems, including performance tuning and troubleshooting in an HPC context. 
  • Proven experience migrating and managing SLURM clusters in cloud environments, preferably GCP. 
  • Proficiency with automation tools such as Ansible and Terraform for cluster deployment and management. 
  • Experience with Spack for managing and deploying HPC software stacks. 
  • Strong scripting skills in Python, Bash, or similar languages for automating cluster operations. 
  • In-depth knowledge of GCP services relevant to HPC, such as Compute Engine (GCE), Cloud Storage, and VPC networking. 
  • Strong problem-solving skills with a focus on optimizing HPC workloads and resource utilization. 

Recommended: 

  • Google Cloud Professional DevOps Engineer or similar GCP certifications. 
  • Familiarity with GCP’s HPC-specific offerings, such as Preemptible VMs, HPC VM images, and other cost-optimization strategies. 
  • Experience with performance profiling and debugging tools for HPC applications. 
  • Advanced knowledge of HPC data management strategies, including parallel file systems and data transfer tools. 
  • Understanding of container technologies (e.g., Singularity, Docker) specifically within HPC contexts. 
  • Experience with Spark or other big data tools in an HPC environment is a plus. 

Job Tags

Remote work,

Similar Jobs

Compass Health Consultants

Entry Level Sales Consultant Job at Compass Health Consultants

 ...right fit for us, you will be passionate about sales and account management! This position is about being a healthcare agent and showing the client their options for coverage. As a Entry Level Sales Consultant, you will be meeting with new and existing clients... 

Assured Nursing

Travel Telephone Triage Nurse Job at Assured Nursing

 ...Job Description Assured Nursing is seeking a travel nurse RN Clinic ED - Emergency Department for a travel nursing job in State College...  .... This is a 7a-3p, 07:00:00-15:00:00, 8.00-5 position in the Triage / Fast Track, Clinic . The ideal candidate will possess a... 

Clearance Jobs

Leadership Development Coach Job at Clearance Jobs

 ...Leadership Development Coach Greystones Group has an opportunity for a Leadership Development Coach to support the Navy's Integrated Project Team Development (IPTD) program. This initiative is focused on enhancing the professional development of personnel and ensuring... 

BayCare Health System

Biomedical Engineer Tech (Tampa) Job at BayCare Health System

&##128205; Tampa, FL | &##128343; Full Time | On-site BayCare Health System is seeking a Biomedical Equipment Technician to join our team in Tampa! In this role, youll perform repair, calibration, and maintenance services on medical treatment and diagnostic equipment... 

Frye Regional Medical Center

Registered Nurse (RN), Vascular Access PRN Job at Frye Regional Medical Center

 ...Registered Nurse (RN), Vascular Access PRN at Frye Regional Medical Center summary: Registered Nurse specializing in vascular access providing expert assessment, insertion, and management of vascular access devices such as PICC lines, midlines, and ultrasound-guided...