We’re a fast-growing startup in the cloud computing space. We believe that while cloud platforms are functionally great and quite powerful, they are built on legacy software and are irritably inefficient (and expensive!). Based on award-winning research and open-source tech, we have built Unikraft Cloud, a next generation cloud platform that allows for order-of-magnitude better efficiency, performance and security.
To support our growing infrastructure and ensure seamless performance, we’re looking for a Site Reliability Engineer (SRE) who’s eager to learn, grow, and contribute to building reliable, scalable systems. If you have a solid technical foundation, enjoy solving infrastructure challenges, and want to work on modern cloud-native technologies, we’d love to hear from you!
What You’ll Do
We’re a small multidisciplinary team and you’d have a wide variety of responsibilities, including:
- Production Post-Deployment — Take responsibility to keep customer on-prem and cloud prem deployments of our platform operating and troubleshoot technical issues.
- Software Packaging and Update Roll-outs — Take responsibility of packaging and rolling out of our latest product features to our premises and to our customers. This includes update strategy planning, implementation, and pre-rollout testing.
- Quality Assurance — Collaborate with the engineering team to test and validate deployments and ensure a high-quality product delivery.
- Monitor & Troubleshoot — Set up monitoring tools, proactively identify issues, and troubleshoot production incidents.
- Automate Everything — Write scripts and tools to automate deployments, infrastructure management, CI/CD workflows, and repetitive tasks.
- Kubernetes Management — Deploy, manage, and troubleshoot Kubernetes clusters to ensure reliability and scalability of our infrastructure.
- Learn & Grow — Collaborate with senior engineers to gain hands-on experience in infrastructure, cloud systems, and incident management.
- Document Processes — Write clear, concise documentation for processes, systems, and tools to help the team operate effectively.
Who You Are
Experience
- Proven experience in Linux system administration, software packaging, and software delivery. This knowledge is also required to assist with setups and debugging.
- Linux networking expertise, including firewalls, DNS, proxies and best practices.
- Proven experience managing and troubleshooting Kubernetes clusters in production environments.
- Good understanding of the CNCF landscape and associated tools.
- Familiarity with observability tools (ideally Prometheus and Grafana).
- Basic scripting skills in languages like Bash, Python, or similar.
- Familiarity with cloud platforms (ideally AWS).
- Interest in automation tools (ideally Terraform, or similar).
- Exposure to CI/CD pipelines (ideally GitHub Actions).
- Familiarity with microservice architectures, serverless, and DevOps best practices.
- Familiarity with virtualization solutions, like QEMU/KVM. Micro-VMMs like Cloud-Hypervisor or Firecracker are a plus.
Mindset
- Eagerness to learn and take on new challenges.
- Strong problem-solving skills and a curious, analytical mindset.
- Enthusiasm for building reliable, high-performance systems.
- Team player with good communication skills.
Nice-to-Have
- Experience with the Go programming language.
- Experience contributing to or working with open-source projects in the CNCF ecosystem.
Why Work with Us
- Help define the future of cloud compute runtime while embracing continuously-evolving modern technologies.
- Work alongside a high-energy, entrepreneurial team.
- Make meaningful contributions in a rapidly growing company.
- Work from wherever you want. We collaborate in real-time every day but we all work from the comfort of our own homes.
Pay and Benefits
- Competitive salary and opportunities for career development.
- Six weeks of total time off for you to use through the year.
- A generous equipment budget to spend on anything you need to do your best work.
- Fun-focused annual team retreats where we get together in person to recharge and build better relationships.
Interested?
Send your resume and a short note on why you’re excited about this role using the link below.