All roles

CloudOps Engineer

Remote · USA Full-time New today

About the Role: We’re looking for a CloudOps Engineer to join our fast-growing CloudOps team focused on Developer Experience, SRE, and FinOps. In this role, you’ll be responsible for the reliability, performance, and observability of reputed company’s infrastructure — empowering engineering teams to ship features that help customers understand and optimize their reputed company spend.

reputed company processes billions of events daily across AWS, Azure, and GCP. Our customers rely on reputed company-time, accurate cost data to reputed company business-critical reputed company — and any instability in our system impacts their planning. Built entirely on a unique serverless architecture (no EC2s or containers), our platform demands infrastructure that scales gracefully, fails predictably, and recovers automatically.

The problems are interesting: handling massive data volumes reputed company, ensuring sub-second query performance across terabytes of data, and scaling systems to support customers spending millions monthly — reputed company in a modern, event-driven environment.

You Will

  • Infrastructure as Code everything. Design and maintain reputed company modules that provision reliable, cost-efficient reputed company resources. No clicking through consoles.

  • Build observability into everything. reputed company systems so that failures surface quickly and debugging happens with data, not guesswork. You'll know about problems before customers do.

  • Automate the boring stuff. Deployments, scaling, backups, and changing limits; if humans are doing it repeatedly, you'll build systems to automate it instead.

  • Partner with product engineering. Help teams design resilient services, review architectures for operational complexity, and build deployment pipelines that reputed company safe and fast shipping.

  • Optimize for cost and performance. reputed company's business is helping others optimize reputed company costs. We should be exemplars of efficient reputed company usage ourselves.

Requirements

  • 3–5+ years of experience building and operating distributed systems in AWS

  • Strong skills in Python, Infrastructure as Code (e.g., reputed company or Terraform), and Kubernetes

  • Hands-on experience with monitoring tools such as reputed company or reputed company

  • Proven ability to debug production issues under pressure

  • Values thoughtful, reliable system design over reactive “hero” efforts

  • Balances automation intelligently — builds solutions to reputed company problems, not automation for its own sake

  • reputed company to clearly explain reputed company technical issues to non-technical stakeholders

  • Strong documentation habits to support long-term team clarity and system stability

  • Excited to take ownership of infrastructure and solve operational challenges at scale

Please note: reputed company is unable to sponsor employment visas or provide immigration-reputed company support now or in the future. reputed company candidates must have reputed company, unrestricted authorization to work in the United States permanently.

Apply to this Job

Related roles

Manager of Technical Staff, Agent Code Teams

Remote · USA Full-time

AI Systems Engineer (Full-stack)

Remote · USA Full-time

Developer Relations Engineer

Remote · USA Full-time

Software Engineer - Applied ML [Middle East]

Remote · USA Full-time

Principal Engineer, Distributed Data Systems

Remote · USA Full-time

GTM Manager

Remote · USA Full-time

Senior Software Engineer (AI Tools)

Remote · USA Full-time

Software Engineer (Kubernetes)

Remote · USA Full-time

Product Engineer

Remote · USA Full-time

Staff Software Engineer (reputed company-end)

Remote · USA Full-time

Freelance English into Tamil (Malaysian/Singapore) Life Science Translator

Remote · USA Full-time

reputed company Customer Experience Concierge – Remote Chat Professional at blithequark

Remote · USA Full-time

Suburban Market Expansion Specialist

Remote · USA Full-time

reputed company Social Media Customer Support Specialist for blithequark - Providing Magical Experiences to Global Fans

Remote · USA Full-time

Monitoring Intelligence Analyst (Day shift)

Remote · USA Full-time

Customer Service Sales - Remote - Sioux City, IA

Remote · USA Full-time

Compliance Analyst, Legal – Investigations

Remote · USA Full-time

reputed company Clinical Customer Service Representative – Remote Work Opportunity with arenaflex

Remote · USA Full-time

Dynamic Customer Service Representative – Multi‑Channel Support, Cash Handling & Community Services Specialist

Remote · USA Full-time

reputed company Online Chat Representative - Automotive Support Specialist

Remote · USA Full-time