Senior Platform Engineer / SRE - Tech Lead Critical Infrastructure Transformation

CloudLinux
Prague, Prague, CZ
Posted 4 days ago
Job description

Build the internal platform that powers our engineering teams, delivering mission-critical software to 4,000+ cloud hosting providers worldwide.

CloudLinux powers 4,000+ hosting providers managing millions of websites globally. Our infrastructure team is at a critical inflection point – moving from 8+ years of technical debt to building a modern platform. This isn't a typical SRE role; it's a chance to architect the future of infrastructure that cannot fail.

Where we are: Legacy systems, reactive operations, bus factor = 1. OpenNebula bottlenecks blocking releases. 70% of time spent on firefighting.

Where we're going: Self-service platform, Infrastructure as Code, proactive engineering. You'll be one of 2-3 senior engineers leading this transformation alongside a new Infrastructure Director with full B-level support.

What You'll Actually Do

Stabilize & Assess:

  • Deep dive into OpenNebula issues with the existing team
  • Map critical dependencies and single points of failure
  • Implement quick wins (automated VM cleanup, monitoring gaps)
  • Begin documenting undocumented systems

Build Foundation:

  • Lead the design and development of an internal developer platform (IDP)
  • Implement GitOps for critical workflows
  • Establish SLIs/SLOs for core services
  • Create runbooks for top incidents

Transform Platform:

  • Architect self-service Internal Developer Platform
  • Drive Infrastructure as Code to 60%+ coverage
  • Eliminate single points of failure
  • Drive development and implementation of complex architectural decisions

Technical Stack You'll Transform

Current:

  • Virtualization: OpenNebula (main bottleneck), oVirt/OpenStack/CloudStack, KVM
  • Storage: Ceph (recently stabilized), Cephadm, Rook
  • Network: Juniper
  • Bare metal (3 datacenters) + AWS + Google Cloud + Azure
  • Automation: ~5% Terraform coverage, manual operations dominant
  • CI/CD: GitLab, Jenkins, Gerrit, GitHub

Your Tools for Transformation:

  • Kubernetes & KubeVirt, and/or whatever else proves necessary
  • Terraform/Terragrunt + Ansible
  • GitOps (ArgoCD/Flux)
  • Python/Go for custom tooling
  • Modern observability stack

Requirements

To thrive in this role, you will need:

  • Experience migrating legacy systems to modern platforms at scale
  • Strong Kubernetes production experience (multi-tenant, federation)
  • Infrastructure as Code expertise (Terraform/Ansible in production)
  • Experience with Linux at scale (RHEL/CentOS/AlmaLinux, 1,000+ servers)
  • Network fundamentals across underlay and overlay (EVPN, BGP, VXLAN, DNS, network architecture & segmentation, native pod networking at scale)
  • Proven ability to work independently with minimal documentation
  • Experience building self-service platforms
  • English B2+ and excellent documentation skills

Critical Mindset:

  • Comfortable with ambiguity and technical debt
  • Pragmatic: you know when to fix vs. replace vs. work around
  • Can balance firefighting with strategic improvements
  • Strong opinions, loosely held
  • Teaching mentality – you'll help upskill the team

What Makes You Successful Here:

  • You'll have significant technical decision-making power and direct impact
  • New Infrastructure Director + B-level backing for transformation
  • Approved investment in people and technology
  • Full authority to simplify and modernize
  • Protected time for strategic work, not just operations

The Opportunity

This isn't about maintaining the status quo. You'll:

  • Define infrastructure strategy affecting 4,000+ companies
  • Build an internal developer platform
  • Lead technical transformation with real budget and support
  • Become the principal architect of a modern platform
  • Work directly with the Infrastructure Director
  • Shape how critical infrastructure software gets delivered globally

Benefits

What's in it for you?

  • Competitive senior-level compensation.
  • A focus on professional development.
  • Interesting and challenging projects.
  • Fully remote work with flexible working hours, which allows you to schedule your day and work from any location worldwide.
  • 24 days of paid vacation per year, 10 days of national holidays, and unlimited sick leave.
  • Compensation for private medical insurance.
  • Co-working and gym/sports reimbursement.
  • Budget for education.
  • The opportunity to receive a reward for the most innovative idea that the company can patent.

Apply If You:

  • Thrive in high-impact, high-autonomy environments
  • Want to transform, not just maintain
  • Can see through chaos to architectural solutions
  • Are excited by the challenge, not scared by the current state
  • Believe infrastructure should be invisible when working, invaluable when measured

We're specifically looking for someone who has successfully navigated similar transformations. If you've only worked in already-stable environments, this role will be challenging. But if you've turned chaos into platform excellence before – let's talk.

By applying for this position, you consent to the processing of your personal data as described in our Privacy Policy (https://cloudlinux.com/candidate-privacy-notice), which provides detailed information on how we maintain and handle your data.
