Senior Infrastructure Engineer (Storage & Kubernetes)

Type: Full-Time

Location: Remote or Onsite (UK/Europe working hours preferred)

Salary: $150k - $200k

About Immich

Immich is on a mission to provide a secure and private home for your most precious memories through our high-performance, self-hostable photo and video backup solution. We are now taking the next step in our journey by building out our own cloud platform. Our initial cloud offering will be a fully managed backup service, which will be secured with mandatory end-to-end encryption to ensure our users retain absolute privacy over their data. In the long term, we will also be looking to build our own full cloud-hosted Immich solution.

About the Role

We are seeking a seasoned and versatile Senior Infrastructure Engineer to design, build, and manage the foundational infrastructure for our new E2E encrypted cloud backup system. This is a unique opportunity for an individual with a deep understanding of distributed storage systems and bare-metal Kubernetes to shape the future of our cloud services. You will play a key role in investigating, selecting, and implementing the ideal storage solution for our needs, and you will have the autonomy to build our infrastructure from the ground up on our own hardware clusters.

The Team

You’ll collaborate with the two infrastructure-focused members of the Immich team on building out our cloud services. While most of the team focuses on the core Immich application, you’ll be a driving force and technical expert for our cloud infrastructure. We operate as a fully remote team distributed across the USA, Europe, and Asia, with a collaborative culture built on async communication. We keep meetings to a minimum (one weekly team sync) and favor written communication, allowing everyone the flexibility to work when they’re most productive. We’re looking for someone who thrives with this level of responsibility and can confidently chart the technical direction. As we grow our cloud offerings beyond the initial E2E encrypted backup service, you’ll be central to scaling both our infrastructure and the team that maintains it.

What You’ll Do

  • Lead the research and evaluation of open-source distributed storage solutions (such as Ceph, GlusterFS, and MinIO) to determine the most suitable platform for our E2E encrypted backup service, considering factors like performance, scalability, reliability, and security.
  • Design, deploy, and manage a highly available and scalable storage cluster on bare-metal servers rented from partners like Hetzner.
  • Architect and implement a robust, production-grade Kubernetes environment on our bare-metal infrastructure to host our cloud services.
  • Develop and automate infrastructure provisioning, configuration management, and deployment processes using Infrastructure as Code (IaC) principles.
  • Ensure the security and integrity of our storage and container platforms, with a strong focus on implementing and managing end-to-end encryption.
  • Monitor, troubleshoot, and optimize the performance of our storage and Kubernetes clusters to ensure high levels of availability and reliability.
  • Collaborate closely with the development team to understand application requirements and provide the necessary infrastructure and support.
  • Document our infrastructure architecture, processes, and best practices.

What We’re Looking For

  • A team player and strong communicator who values others’ input.
  • Proven experience in designing, building, and managing large-scale distributed storage systems.
  • Hands-on expertise with one or more open-source distributed storage technologies such as Ceph, GlusterFS, or MinIO.
  • Strong practical experience in deploying, managing, and scaling Kubernetes clusters on bare-metal servers.
  • In-depth knowledge of Kubernetes architecture, including networking (CNI), storage (CSI), and security best practices.
  • Proficiency with infrastructure automation tools (we use Terraform).
  • Solid understanding of Linux/UNIX operating systems and networking fundamentals.
  • A strong understanding of security principles, including encryption, key management, and access control.
  • A proactive and investigative mindset, with the ability to independently research and recommend the best technical solutions.
  • Excellent problem-solving skills and the ability to work effectively in a collaborative and remote team environment.
  • Experience with backup tooling; we are currently evaluating Restic for performing the backups.
  • Familiarity with the operational aspects of running services on hardware providers like Hetzner.

Location:

FUTO is based in Austin, Texas. For US applicants, relocation assistance to Austin will be provided for certain candidates.

Benefits:

  • Medical and dental insurance
  • 401k matching
  • Bonuses based on company performance

To apply, send an e-mail with your resume to: [email protected]