
Infrastructure-as-Code for Latency-Critical Bare Metal Systems

About the Speaker

Leanne Fok is a Senior Infrastructure Engineer in Amsterdam with 15+ years at Optiver. She previously led the Infrastructure Platform team responsible for our internal Infrastructure-as-Code stack and is now leading the Infrastructure Kubernetes project.

Infrastructure-as-Code is straightforward in the cloud. But what about on bare metal?

In her PyLadies talk, “Infrastructure-as-Code in a Latency-Critical Bare Metal World,” Senior Infrastructure Engineer Leanne Fok shares how we brought declarative IaC principles to physical, latency-sensitive trading infrastructure where nanoseconds matter and every cable is intentional.

Managing thousands of bare metal servers across global data centers introduces a different class of problems. There is no managed control plane. No provider APIs to rely on. Just hardware, strict performance constraints, and very little tolerance for configuration drift.

So how do you:

  • Define infrastructure intent clearly enough to automate it
  • Enforce standards across physical devices, ports, power, and networking
  • Detect when real-world state quietly diverges from what you declared
  • Build trust in automation while migrating from legacy processes

In this talk, Leanne walks through the architectural patterns we adopted, including declarative intent modeling, reconciliation loops, truth collectors, and a custom Terraform provider that translates infrastructure definitions into enforceable standards.
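To make the idea of comparing declared intent with collected truth concrete, here is a minimal Python sketch of the drift-detection step inside a reconciliation loop. It is an illustration only, not the implementation described in the talk: the `PortIntent` schema, the `detect_drift` function, and the device name `sw-ams-01` are all hypothetical, and the "observed" state is hard-coded where a real truth collector would query the hardware.

```python
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class PortIntent:
    """Hypothetical schema: the desired configuration of one switch port."""
    device: str
    port: str
    vlan: int
    enabled: bool = True


def index(ports):
    """Key each port record by (device, port) so two states can be compared."""
    return {(p.device, p.port): p for p in ports}


def detect_drift(declared, observed):
    """Return a list of (key, field, wanted, actual) differences.

    'missing' means the port was declared but not observed;
    'undeclared' means it was observed but never declared.
    """
    drift = []
    for key in sorted(declared.keys() | observed.keys()):
        want, have = declared.get(key), observed.get(key)
        if want is None:
            drift.append((key, "undeclared", None, have))
        elif have is None:
            drift.append((key, "missing", want, None))
        else:
            # Compare field by field so the report names what diverged.
            for name, wanted in asdict(want).items():
                actual = getattr(have, name)
                if actual != wanted:
                    drift.append((key, name, wanted, actual))
    return drift


# Declared intent (what the definitions say should exist).
DECLARED = index([
    PortIntent("sw-ams-01", "eth1", vlan=100),
    PortIntent("sw-ams-01", "eth2", vlan=200),
])

# Stand-in for truth-collector output (hard-coded for illustration).
OBSERVED = index([
    PortIntent("sw-ams-01", "eth1", vlan=100),
    PortIntent("sw-ams-01", "eth2", vlan=300),  # VLAN differs from intent
    PortIntent("sw-ams-01", "eth3", vlan=100),  # exists but was never declared
])

if __name__ == "__main__":
    for item in detect_drift(DECLARED, OBSERVED):
        print(item)
```

In a full reconciliation loop, a report like this would be produced on a schedule and then either surfaced for review or fed to an enforcement step. Reporting before enforcing is one plausible way to build trust in automation during a migration from legacy processes.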

Watch the full 15-minute talk

Interested in latency-critical systems?

Explore open engineering roles and learn more about the infrastructure challenges behind them.

