Build the internal platform that powers our engineering teams, delivering mission-critical software to 4,000+ cloud hosting providers worldwide.
CloudLinux powers 4,000+ hosting providers managing millions of websites globally. Our infrastructure team is at a critical inflection point – moving from 8+ years of technical debt to building a modern platform. This isn't a typical SRE role; it's a chance to architect the future of infrastructure that cannot fail.
Where we are : Legacy systems, reactive operations, bus factor = 1. OpenNebula bottlenecks blocking releases. 70% time on firefighting.
Where we're going : Self-service platform, Infrastructure as Code, proactive engineering. You'll be one of 2-3 senior engineers leading this transformation alongside a new Infrastructure Director with full B-level support.
What You'll Actually Do
Stabilize & Assess :
- Deep dive into OpenNebula issues with the existing team
- Map critical dependencies and single points of failure
- Implement quick wins (automated VM cleanup, monitoring gaps)
- Begin documenting undocumented systems
Build Foundation :
Leading the design and development of an internal development platform (IDP)Implement GitOps for critical workflowsEstablish SLIs / SLOs for core servicesCreate runbooks for top incidentsTransform Platform :
Architect self-service Internal Developer PlatformDrive Infrastructure as Code to 60%+ coverageEliminate single points of failureDrive development and implementation of complex architectural decisionsTechnical Stack You'll Transform
Current :
Virtualization : OpenNebula (main bottleneck), oVirt / OpenStack / CloudStack, KVMStorage : Ceph (recently stabilized), Cephadm, RookNetwork : JuniperBare metal (3 Datacenters) + AWS + Google Cloud + AzureAutomation : ~5% Terraform coverage, manual operations dominantCI / CD : Gitlab, Jenkins, Gerrit, GithubYour Tools for Transformation :
Kubernetes & KubeVirt and / or all necessaryTerraform / Terragrunt + AnsibleGitOps (ArgoCD / Flux)Python / Go for custom toolingModern observability stackRequirements
To thrive in this role, we are looking for someone who has :
Migrated legacy systems to modern platforms at scaleStrong Kubernetes production experience (multi-tenant, federation)Infrastructure as Code expertise (Terraform / Ansible in production)Linux at scale (RHEL / CentOS / AlmaLinux, 1000+ servers)Network fundamentals, underlay, overlay, (EVPN, BGP, VXLAN, DNS, network architecture & segmentation, native pod networking at scale)Proven ability to work independently with minimal documentationExperience building self-service platformsEnglish B2+ and excellent documentation skillsCritical Mindset :
Comfortable with ambiguity and technical debtPragmatic : know when to fix vs. replace vs. work aroundCan balance firefighting with strategic improvementsStrong opinions, loosely heldTeaching mentality – you'll help upskill the teamWhat Makes You Successful Here :
You'll have significant technical decision-making power and direct impactNew Infrastructure Director + B-level backing for transformationApproved investment in people and technologyFull authority to simplify and modernizeProtected time for strategic work, not just operationsThe Opportunity
This isn't about maintaining the status quo. You'll :
Define infrastructure strategy affecting 4,000+ companiesBuild an internal development platformLead technical transformation with real budget and supportBecome the principal architect of a modern platformWork directly with the Infrastructure DirectorShape how critical infrastructure software gets delivered globallyBenefits
What's in it for you?
Competitive senior-level compensation.A focus on professional development.Interesting and challenging projects.Fully remote work with flexible working hours, which allows you to schedule your day and work from any location worldwide.Paid 24 days of vacation per year, 10 days of national holidays, and unlimited sick leaves.Compensation for private medical insurance.Co-working and gym / sports reimbursement.Budget for education.The opportunity to receive a reward for the most innovative idea that the company can patent.Apply If You :
Thrive in high-impact, high-autonomy environmentsWant to transform, not just maintainCan see through chaos to architectural solutionsAre excited by the challenge, not scared by the current stateBelieve infrastructure should be invisible when working, invaluable when measuredWe're specifically looking for someone who has successfully navigated similar transformations. If you've only worked in already-stable environments, this role will be challenging. But if you've turned chaos into platform excellence before – let's talk.
By applying for this position, you consent to the processing of your personal data as described in our Privacy Policy ( https : / / cloudlinux.com / candidate-privacy-notice ), which provides detailed information on how we maintain and handle your data.