Bleeding Cash on Cloud? A CTO’s Guide to Taming Infrastructure Costs in 2020
Let’s look at the uncomfortable truth facing Norwegian businesses right now. With the NOK historically weak against the Dollar and Euro, your AWS or Azure invoice has likely jumped 20% in the last few months without you adding a single user. We are in an economic climate where efficiency isn't just a metric; it is a survival strategy.
I’ve audited three infrastructure setups this month. In every single case, the companies were paying a premium for "elasticity" they never used, effectively lighting money on fire to keep idle vCPUs warm. The promise of the public cloud—pay only for what you use—has morphed into "pay for what you provisioned and forgot about."
Real cost optimization isn't about switching off servers at night. It is about architectural discipline, understanding the physics of your hardware (yes, even virtual hardware), and realizing that data gravity is real. Here is how we stabilize TCO (Total Cost of Ownership) while keeping latency to Oslo under 5 milliseconds.
1. The "vCPU" Trap and the Reality of Steal Time
Most cloud providers sell you a "vCPU." In 2020, that term is dangerously vague. On a standard public cloud instance, a vCPU is often a timeslice of a hyperthread on a heavily oversubscribed host. You might be paying for 4 vCPUs, but if your "noisy neighbor" decides to mine cryptocurrency or re-index a massive Elasticsearch cluster, your performance tanks.
You end up provisioning larger instances just to get a guaranteed baseline of performance. That is the definition of waste.
The Fix: Monitor your %st (steal time). If you see this number creeping above 0.5% consistently, you are paying for resources you aren't getting.
# Install sysstat for the sar utility (set ENABLED="true" in /etc/default/sysstat if you also want historical data)
sudo apt-get install -y sysstat
# Sample CPU statistics every second, five times
sar -u 1 5
If you see output like this:
08:00:01      CPU     %user     %nice   %system   %iowait    %steal     %idle
08:00:02      all     15.20      0.00      4.50      2.10      8.40     69.80
That 8.40% steal is unacceptable. You are losing nearly 10% of the CPU cycles you paid for.
Pro Tip: This is why we enforce strict KVM virtualization at CoolVDS. Unlike container-based VPS solutions (OpenVZ/LXC), KVM provides hardware-level isolation. You get the cycles you pay for, eliminating the need to over-provision just to buffer against neighbors.
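Not sure what you are actually renting? You can check from inside the guest. A minimal sketch using systemd-detect-virt (present on any systemd-based distro); container platforms report lxc or openvz instead of kvm.
# What is this "VPS" actually running on?
# "kvm" means full hardware virtualization; "lxc" or "openvz" means a container-based platform.
systemd-detect-virt
# On distros without systemd, the virt-what package answers the same question:
# sudo apt-get install -y virt-what && sudo virt-what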
2. Storage I/O: The Bottleneck That Kills Concurrency
In 2020, spinning rust (HDD) should only be used for cold backups. Yet, many "budget" VPS providers still push SATA SSDs with capped IOPS (Input/Output Operations Per Second). When your database gets hit with complex joins, CPU isn't the bottleneck—disk I/O is.
When I/O waits spike, your worker processes (PHP-FPM, Gunicorn, etc.) get stuck waiting for data. This consumes RAM and forces you to scale up. Moving to NVMe storage is often cheaper than upgrading RAM/CPU because it clears the bottleneck, allowing the same compute resources to handle 3x the traffic.
Here is how to check if your current disk is choking your database using iotop:
# Show only processes doing I/O (-o), as processes rather than threads (-P), with accumulated totals (-a); requires root
sudo iotop -oPa
If you see your MySQL or PostgreSQL process consistently at the top with high DISK READ, you need faster storage, not more CPU. We standardized on NVMe for all CoolVDS production tiers precisely to prevent this "I/O Tax."
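Before upgrading the instance, benchmark the disk you already have. Here is a minimal sketch with fio (sudo apt-get install -y fio); the test file path and 1G size are arbitrary values for illustration, not anything your stack depends on.
# 4K random reads: the access pattern that hurts databases most
fio --name=randread-test --filename=/tmp/fio-testfile --size=1G \
    --rw=randread --bs=4k --ioengine=libaio --direct=1 --iodepth=32 \
    --runtime=30 --time_based --group_reporting
# Remove the test file afterwards
rm /tmp/fio-testfile
As a rough yardstick, IOPS-capped SATA volumes tend to land in the low thousands on this test, while NVMe is usually an order of magnitude or more ahead.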
3. Optimize the Stack Before Scaling the Server
Before you upgrade your plan, tune your software. The default configurations for Nginx and MySQL in Ubuntu 20.04 (released just last month) are designed for compatibility, not frugality.
Tuning Nginx for Low Resource Usage
Stop serving static assets through your application backend. It sounds obvious, but it is the #1 cause of bloated bills. Let Nginx handle the heavy lifting with efficient caching and keep-alive connections.
# /etc/nginx/nginx.conf
http {
    # ... other settings ...

    # Optimization 1: Efficient file transfer
    sendfile on;
    tcp_nopush on;
    tcp_nodelay on;

    # Optimization 2: Keep-alive to reduce handshake overhead
    keepalive_timeout 65;

    # Optimization 3: Aggressive gzip compression
    gzip on;
    gzip_comp_level 5;
    gzip_min_length 256;
    gzip_proxied any;
    gzip_types
        application/javascript
        application/json
        application/xml
        text/css
        text/plain;
}
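To actually take static assets off the application backend, add a location block for them in your vhost. This is a sketch assuming the usual setup where static files sit under the site root and dynamic requests are proxied elsewhere in the same server block; adjust the extensions and expiry to your asset pipeline.
# Inside your server {} block (e.g. /etc/nginx/sites-available/example.com)
location ~* \.(css|js|jpg|jpeg|png|gif|svg|ico|woff2?)$ {
    expires 30d;
    access_log off;
}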
MySQL 8.0 Memory Discipline
MySQL 8.0 is fantastic, but it can be memory-hungry compared to 5.7. If you are running a 4GB RAM VPS, you cannot leave the defaults. The innodb_buffer_pool_size should generally be 60-70% of available RAM, assuming the DB is on a dedicated node. If it's a shared web/DB server, dial it back to 40%.
# /etc/mysql/mysql.conf.d/mysqld.cnf
[mysqld]
# Disabling performance_schema saves ~100MB-400MB RAM immediately
performance_schema = 0
# Roughly 40% of RAM on a shared 4GB web/DB box; 60-70% if the database has the node to itself
innodb_buffer_pool_size = 1G
# Limit connection overhead
max_connections = 100
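Once the box has been serving real traffic for a while, verify that the buffer pool is actually large enough. A quick sketch using the standard InnoDB status counters; Innodb_buffer_pool_reads counts the requests that had to go all the way to disk.
# How often does InnoDB miss the buffer pool and hit disk?
mysql -u root -p -e "SHOW GLOBAL STATUS LIKE 'Innodb_buffer_pool_read%';"
# Rule of thumb: Innodb_buffer_pool_reads should stay well under 1% of
# Innodb_buffer_pool_read_requests; if not, the pool is too small for your working set.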
4. Data Sovereignty and the Bandwidth Bill
Here is a factor often ignored in TCO calculations: Compliance Risk. With the status of the Privacy Shield framework looking increasingly shaky this year, storing customer data outside the EEA is becoming a legal liability. If you are hosting Norwegian user data in a US-owned cloud region in Frankfurt, you are still navigating a legal minefield regarding the US CLOUD Act.
Furthermore, bandwidth pricing on hyperscalers is designed to trap you. Ingress is free; egress (data leaving the server) costs a fortune. If you run a media-heavy site or a backup server, that egress fee can exceed the instance cost.
| Feature | Hyperscalers (AWS/GCP) | CoolVDS (Local) |
|---|---|---|
| Egress Bandwidth | $0.09 - $0.12 per GB | Generous TB allowances included |
| Latency to Oslo | 20-35ms (via Frankfurt/London) | 2-5ms (Local Peering) |
| Data Sovereignty | Complex (US CLOUD Act applies) | Strict Norwegian/EEA Jurisdiction |
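Before comparing providers, measure what you actually push out each month. A minimal sketch using vnstat; eth0 is an assumption, substitute whatever interface ip link shows on your server.
# Lightweight traffic accounting (the daemon needs to run for a while before it has data)
sudo apt-get install -y vnstat
# Monthly totals per direction; "tx" is the egress the hyperscalers bill you for
vnstat -i eth0 -m
# Rough check: 2 TB of egress at $0.09/GB is roughly $180/month before you pay for a single vCPU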
Hosting locally isn't just about patriotism; it's about latency and legal safety. By keeping traffic within the NIX (Norwegian Internet Exchange), you ensure the lowest possible RTT (Round Trip Time) for your local customers. For an e-commerce store, a 100ms delay can drop conversion rates by 7%.
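The latency column above is easy to verify yourself from an office or server in Oslo; example.no below is a placeholder for your own endpoints.
# Average RTT over 20 packets; run once against a Frankfurt-hosted endpoint and once against a NIX-peered one
ping -c 20 example.no
# mtr shows where the milliseconds go, hop by hop
mtr --report --report-cycles 20 example.no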
Conclusion: Pragmatism Wins
There is a time for Kubernetes clusters and auto-scaling groups spanning three continents. But for 95% of businesses in 2020, that is over-engineering. The path to cost efficiency lies in right-sizing your instances, ensuring you are on high-IOPS NVMe storage so your CPU doesn't wait on disk, and hosting closer to your users to kill latency.
Don't let the exchange rate dictate your IT budget. Take control of your stack.
Ready to stop paying for "steal time"? Deploy a high-performance, NVMe-backed instance on CoolVDS today and experience the difference of raw, unthrottled KVM power in the heart of Norway.