HPC Engineer

Remote · USA · Full-time

About Us

RCH is an established and rapidly growing global provider of computational, research, and data science expertise to Life Sciences and Healthcare. At RCH, our team rallies around a culture crafted for learning and achieving. We are demanding of ourselves in our pursuit of innovation and in delivering a ground-breaking computing experience for our clients, so that they can deliver life-saving science to humanity.

Core Values

At RCH, our Core Values are more than just words; they represent the threads that weave together the fabric of our culture. Used as a guide when interviewing new team members, as a barometer when evaluating our performance as individuals and teams, and even when deciding which customers to work with, RCH's Values embody the behaviors upon which we measure our success and create a foundation for our growth as people and professionals.

Our Core Values:

Embrace Excellence: We strive for best-in-class delivery of innovation and service.
Be Accountable: Ownership and accountability are non-negotiable.
Adventure Together: We are committed to fostering a culture that embraces continuous improvement.
Succeed as a Team: We believe harnessing the power of a team drives outcomes not achievable by individuals.
Boundaries and Balance: Work-life balance is a core facet of our culture.

If you share our core values, we encourage you to continue reading this posting, as you may have found a great home for your career.

Job Description

RCH is seeking an HPC Engineer to work closely with customer stakeholders, scientists, and IT professionals to deliver Compute at Scale and support our customers' scientific initiatives.
The objectives for this role center on developing, evolving, and administering HPC platforms, along with supporting scientific applications, workflows, and other infrastructure, both on-prem and cloud hosted. Our ideal candidate also has hands-on experience with Linux system administration as well as solution architecting and engineering (on-prem and cloud based) and will be instrumental in transforming how IT computing services are leveraged to support our clients' growth. This role will involve driving architecture, roadmaps, and execution of projects to establish and operate IT infrastructure best practices for customers.

Responsibilities include full-stack support: design and evolution of platforms, application administration, supporting customer workflows, profiling and performance tuning, monitoring and maintenance of scoped systems, platform and systems administration, troubleshooting hardware, software, and networking issues, solution architecting and hands-on engineering (on-prem and cloud), and documentation. You will use your experience in these technologies to provide top-of-the-line consulting services and recommendations to clients. This work is performed as part of customer Research or Analytics initiatives as well as in a consultative, advisory, or customer support capacity.

Specific focuses and responsibilities include:

Collaborating with cross-discipline team members and customers to deliver HPC and peripheral Compute at Scale services.
Maintaining a thorough understanding of industry best practices.
Supporting internal and customer Architecture and Design efforts.
Supporting customers with their workflow pipelines (advisory and hands-on).
Comprehensively documenting new and existing computational assets.
Maintaining the flexibility to pivot as engagement scopes evolve.
Supporting AWS and GCP cloud applications, migrations, and modernization.
CloudOps / IaC for ongoing platform management.
Setting up and configuring AWS and GCP cloud infrastructure for new platform builds.
Ensuring system compliance with company standards and applicable regulatory requirements.
Providing transition support for modernized services to operational teams.
Providing engineering-level troubleshooting and service restoration for operational issues as they arise on supported platforms.
Providing training and mentorship for junior-level team members.
Serving as an escalation point on multiple engagements to ensure resolution.

Essential Qualifications

A bachelor's or master's degree in Computer Science or a related field.
5+ years of experience administering HPC clusters and systems. Experience with SLURM and Grid Engine scheduling software preferred.
5+ years of professional experience in Solution Architecture or Cloud Infrastructure deployment and support.
5+ years of professional experience developing or administering compute solutions for Scientific / Research IT domains, Life Sciences being preferred.
Experience with Posit products (Package Manager, Connect, Workbench) in either an end-user or administrator capacity.
Experience developing scientific workflows on HPC systems using Nextflow.
Extensive command-line system administration experience, including:
  User and group management.
  Advanced knowledge of Active Directory, DNS, DHCP, LDAP, NFS, and SMB.
  Building applications from source code, and installing, maintaining, and troubleshooting application-level Linux and scientific software in line with industry best practices.
  Installation and fine tuning of the Linux operating system.
  Familiarity with leveraging and maintaining Linux package management systems.
  Intermediate OS-level networking knowledge.
Experience using scripting tools, automation tools, and configuration management tools. Ansible, Terraform, and CloudFormation experience preferred.
Experience administering and integrating Scientific / Research applications.
Strong time-management skills: able to complete projects in a timely manner and to plan and prioritize tasks while keeping leadership and stakeholders regularly updated on status.
Excellent communication skills, including preparation of written documentation for IT colleagues and end users.
Proactive thinking skills to identify potential issues and solution options before incidents occur.
Extreme attention to detail, needed to work with multiple clients simultaneously.
Ability to understand and analyze complex technical problems and situations.
Candidates must be passionate engineers with a strong vision and a desire to stay on top of trends in the Scientific Computing sector.
Ability to work independently or with a team.
Ability to take a project from start to finish with minimal supervision.

Preferred Qualifications

RCH provides services and solutions for the unique challenges of Life Sciences advanced computing, and leverages teams with cross-functional IT skills to meet these challenges. The ideal candidates for this role will have experience working with cross-functional IT (Public Cloud skills being a plus) and sciences skillsets.

Experience with Python, R, or other data science programming languages.
Experience working with and/or supporting databases.
Experience managing large amounts of data effectively.
Experience working with AI/ML technologies.
Experience containerizing compute workloads with tools such as Singularity.
Experience with NVIDIA DGX systems.

Additional information

Great talent should benefit from a great work environment. If you join RCH, you'll have access to:

A competitive salary and bonus package based on experience
Comprehensive health and wellness benefits, including Medical, Dental, and Vision Insurance
Company-provided Life and Long-Term Disability Insurance
Company-sponsored 401(k) Plan
Company-provided continuing education benefit
Team-focused culture and unlimited opportunity for advancement

This is a fully remote position, and the candidate will be required to work on an East Coast (US) schedule.
Role is only open to applicants not needing sponsorship now or in the future. No third parties, please.

**********

The secret code is Donkey

**********
