The "Root" of All Evil: Container Breakouts and How to Stop Them
Let's be honest for a second. Most developers treat Docker containers like lightweight Virtual Machines. They stick an SSH server inside, run processes as root, and map the host's storage without a second thought. I recall auditing a fintech setup in Oslo last month where the CI/CD pipeline mapped /var/run/docker.sock into a Jenkins build agent. That is not a vulnerability; that is a digitally signed invitation for a takeover. Within 12 minutes of a simulated breach, we had root access to the host node and, by extension, their entire payment gateway.
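To make that concrete, here is roughly what an attacker does once they control any process in such an agent, assuming the docker CLI is present in the image (it almost always is when the socket is mounted). The alpine image is just an example; any image with a shell will do:

```bash
# From inside the compromised build container, talk to the HOST's Docker
# daemon through the mounted socket and start a fully privileged container
# with the host's root filesystem bind-mounted at /host.
docker run --rm -it --privileged -v /:/host alpine chroot /host /bin/sh

# This shell runs as root over the host's entire filesystem: enough to plant
# SSH keys or cron jobs and take over the node outright.
```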
It is September 2023. The "it works on my machine" excuse doesn't fly when you are handling Norwegian citizen data under GDPR. If your container security strategy relies solely on a firewall, you are already compromised. Real security happens at the kernel level, the image layer, and the infrastructure substrate.
1. The Supply Chain: Stop Using latest
The first line of defense is what you actually deploy. Supply chain attacks are the trend of the year (remember SolarWinds? Log4j?). If you are pulling node:latest or a full ubuntu:22.04 image for a production microservice, you are importing hundreds of packages you don't need, and every CVE that ships with them.
The Fix: Use minimal base images. Distroless images are the gold standard here. They contain only your application and its runtime dependencies. No shell, no package manager, no text editors. If an attacker gets in, they can't even run ls.
Here is a comparison of attack surfaces:
| Base Image | Size | Vulnerabilities (Avg) | Shell Access? |
|---|---|---|---|
| ubuntu:22.04 | ~70MB | 20+ | Yes |
| alpine:3.18 | ~7MB | 0-5 | Yes |
| gcr.io/distroless/static | ~2MB | 0 | No |
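Don't take those numbers on faith: scan the images yourself. A quick sketch using the open-source scanner Trivy (exact findings will vary by the day you run it):

```bash
# Compare a fat base image against a distroless one.
trivy image --severity HIGH,CRITICAL ubuntu:22.04
trivy image --severity HIGH,CRITICAL gcr.io/distroless/static-debian11

# Fail the CI job if anything HIGH or CRITICAL sneaks into your own image.
trivy image --exit-code 1 --severity HIGH,CRITICAL my-registry/app:v1.0.4
```

Wire that last command into your pipeline so a new critical finding fails the build instead of shipping on a Friday afternoon.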
Here is how a proper multi-stage build looks in 2023. Notice we build in a fat image, but deploy in a skeleton.
```dockerfile
# Build Stage
FROM golang:1.21-alpine AS builder
WORKDIR /app
COPY . .
# CGO off so the binary is fully static and runs on distroless/static (no libc)
RUN CGO_ENABLED=0 go build -ldflags="-w -s" -o myapp

# Production Stage
FROM gcr.io/distroless/static-debian11
COPY --from=builder /app/myapp /
USER nonroot:nonroot
ENTRYPOINT ["/myapp"]
```

2. Runtime Hardening: Capabilities and Read-Only Roots
By default, Docker allows a container to retain too many Linux capabilities. Does your Nginx server really need CAP_NET_ADMIN (network configuration) or CAP_SYS_ADMIN (basically root)? No.
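If you are running plain Docker rather than Kubernetes, the same hardening can be expressed as docker run flags. A minimal sketch, with the image name standing in for your own:

```bash
# Drop every capability, forbid privilege escalation via setuid binaries,
# make the root filesystem read-only, and run as an unprivileged UID.
docker run -d \
  --cap-drop=ALL \
  --security-opt no-new-privileges:true \
  --read-only \
  --tmpfs /tmp \
  --user 10001:10001 \
  my-registry/app:v1.0.4

# Add back individual capabilities only if the process genuinely needs them,
# e.g. --cap-add=NET_BIND_SERVICE for binding ports below 1024.
```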
In Kubernetes v1.28, we use Pod Security Standards (PSS) to enforce this. The securityContext is where the magic happens. We drop ALL capabilities and only add back the specific ones needed (usually NET_BIND_SERVICE if you are binding port 80, though you should be using a Service/Ingress for that anyway).
The "Paranoid" Configuration
This is the snippet I force every dev team to use in their Helm charts. It forces the container to run as a non-root user and makes the filesystem immutable.
```yaml
apiVersion: v1
kind: Pod
metadata:
  name: secure-service
spec:
  securityContext:
    runAsNonRoot: true
    runAsUser: 10001
    runAsGroup: 10001
    fsGroup: 10001
  containers:
    - name: my-app
      image: my-registry/app:v1.0.4
      securityContext:
        allowPrivilegeEscalation: false
        readOnlyRootFilesystem: true
        capabilities:
          drop:
            - ALL
      volumeMounts:
        - name: tmp
          mountPath: /tmp
  volumes:
    - name: tmp
      emptyDir: {}
```

Pro Tip: When you enable `readOnlyRootFilesystem: true`, your app will crash if it tries to write logs or temp files to disk. Mount an `emptyDir` volume at `/tmp` and configure your logger to write to stdout (standard out) so Fluentd or Promtail can pick it up. This prevents an attacker from downloading a crypto-miner binary to your disk.
3. Local Context: Data Residency and Latency
In Norway, we have the Datatilsynet breathing down our necks regarding Schrems II. Data transfers to US-owned clouds are legally tricky. Hosting containers on US hyperscalers might expose you to the CLOUD Act. This is where sovereignty matters.
Hosting on CoolVDS servers located in Oslo/Europe ensures your data stays within the EEA. Furthermore, for those of you running high-frequency trading bots or real-time gaming backends, the speed of light is a hard constraint. Ping times from Oslo to Frankfurt are okay (~15-20ms), but Oslo to Oslo is effectively instant (<2ms). Low latency isn't just a luxury: every Raft consensus round in CockroachDB or quorum write in ScyllaDB pays that round-trip several times over.
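If you want to verify what latency your users actually see rather than trusting marketing numbers, two stock tools are enough. The hostname below is a placeholder:

```bash
# Plain ICMP round trips from your office or monitoring host to the VPS.
ping -c 20 vps.example.no

# mtr shows latency and loss per hop, which exposes a bad peering route
# long before your database starts timing out.
mtr --report --report-cycles 50 vps.example.no
```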
4. The Infrastructure Layer: Noisy Neighbors and Kernel Panic
You can have the most secure Dockerfile in the world, but if the underlying kernel is shared with 500 other tenants on an oversold OpenVZ node, you are at risk. Container escapes rely on kernel vulnerabilities. If the kernel crashes, everyone goes down.
This is why we architect CoolVDS differently. We use KVM (Kernel-based Virtual Machine) hardware virtualization. Your NVMe storage is passed through, and your RAM is dedicated. You get your own kernel.
If a "noisy neighbor" on a different slice gets DDoS'd or tries to exploit a kernel bug, the hypervisor isolates them completely. In a shared container environment (like standard shared hosting), their problem becomes your downtime.
Checking Your IOPS
Security also means availability. If your database container is starved of I/O, your service is effectively dead. Verify your disk throughput right now:
```bash
fio --name=random-write --ioengine=posixaio --rw=randwrite --bs=4k --size=1g --numjobs=1 --iodepth=1 --runtime=60 --time_based --end_fsync=1
```

On a standard SATA VPS, you might see 500-1000 IOPS. On CoolVDS NVMe instances, we regularly clock 20,000+ IOPS for random writes. Speed is security when you are trying to write audit logs during a traffic spike.
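The write test above mirrors audit logging. If the container in question is a database, a mixed random read/write run is closer to reality; a sketch, with job count and queue depth you should tune to your own workload:

```bash
# 70/30 random read/write mix at 4k blocks, closer to an OLTP database
# pattern than a pure write test.
fio --name=db-mix --ioengine=libaio --direct=1 --rw=randrw --rwmixread=70 \
    --bs=4k --size=1g --numjobs=4 --iodepth=32 --runtime=60 \
    --time_based --group_reporting
```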
5. Network Policies: The Forgotten Firewall
By default, in Kubernetes, all pods can talk to all other pods. Your frontend can talk to your database. Your database can talk to the billing service. This is a flat network, and it is dangerous.
Use NetworkPolicies to lock this down. Deny all traffic by default, then whitelist.
```yaml
apiVersion: networking.k8s.io/v1
kind: NetworkPolicy
metadata:
  name: default-deny-all
spec:
  podSelector: {}
  policyTypes:
    - Ingress
    - Egress
```

This applies to every pod in the namespace. Once applied, nothing works. Then you explicitly allow your frontend to reach the backend on port 8080 only, as sketched below. It's tedious, but it stops lateral movement dead in its tracks.
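For completeness, here is the shape of the allow rule you layer on top of the deny-all policy; the labels and port are illustrative, not gospel:

```bash
# Allow only pods labelled app=frontend to reach pods labelled app=backend
# on TCP 8080; everything else stays blocked by the default-deny policy above.
kubectl apply -f - <<'EOF'
apiVersion: networking.k8s.io/v1
kind: NetworkPolicy
metadata:
  name: allow-frontend-to-backend
spec:
  podSelector:
    matchLabels:
      app: backend
  policyTypes:
    - Ingress
  ingress:
    - from:
        - podSelector:
            matchLabels:
              app: frontend
      ports:
        - protocol: TCP
          port: 8080
EOF
```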
Summary
Security is not a product; it's a process of reducing risk. By stripping your images, locking down runtime privileges, and ensuring your underlying infrastructure provides true hardware isolation, you build a fortress.
Do not wait for a breach to take this seriously. Spin up a secure, KVM-isolated instance on CoolVDS today and test your hardening scripts on a platform built for professionals.