Linux Disk I/O Optimization Guide

Disk I/O is a critical bottleneck in Linux system performance, especially for workloads involving heavy read/write operations. Below is a structured approach to optimizing disk I/O, covering identification, software configuration, and hardware enhancements.

1. Identify I/O Bottlenecks

Before optimizing, use tools to pinpoint bottlenecks:

System-wide I/O stats: iostat -x 1 (key metrics: %util [close to 100% = saturation], await [average wait time], svctm [service time]).
Process-level I/O: iotop (identifies top I/O-consuming processes).
Virtual memory I/O: vmstat 1 (check si/so for swap activity, indicating memory pressure).

2. Filesystem Optimization

Choose the Right Filesystem

XFS: Best for large files and high-concurrency workloads (e.g., databases, media servers).
EXT4: General-purpose, stable, and widely supported (suitable for most desktops/servers).
Btrfs: Ideal for advanced features (snapshots, compression) but less mature than EXT4/XFS.

Mount Options

Adjust mount options in /etc/fstab to reduce unnecessary I/O:

noatime,nodiratime: Disables updating file/directory access times (reduces writes).
discard: Enables TRIM for SSDs (maintains performance by notifying the drive of unused blocks).
data=writeback: For SSDs/HDDs with battery-backed RAID (improves write performance by deferring metadata updates).
barrier=0: Disables write barriers (use only if data integrity is not critical, e.g., SSDs with capacitors).

Filesystem Block Size

Format filesystems with an optimal block size (e.g., 4k for most workloads) using mkfs:

mkfs.ext4 -b 4096 /dev/sdX  # Example for EXT4 with 4k block size

3. I/O Scheduler Optimization

The I/O scheduler manages request queues to minimize latency. Choose based on storage type:

SSD/NVMe: Use none (disable scheduling, as SSDs handle internal scheduling efficiently) or kyber (optimizes for latency).
HDD: Use mq-deadline (ensures timely processing of requests, balances throughput and latency).
Check current scheduler: cat /sys/block/sdX/queue/scheduler (shows available options).
Set scheduler: echo kyber > /sys/block/sdX/queue/scheduler (persistent changes require adding to bootloader/kernel parameters).

4. Kernel Parameter Tuning

Adjust kernel parameters in /etc/sysctl.conf to balance memory and I/O:

Dirty Page Control:
- vm.dirty_background_ratio = 5 (background writeback starts at 5% of memory).
- vm.dirty_ratio = 10 (writeback forced at 10% of memory).
  Reduces sudden I/O spikes by spreading writeback over time.
Swappiness:
- vm.swappiness = 1 (minimizes swap usage, prefers keeping data in RAM).
  Critical for SSDs (reduces wear) and high-performance workloads.
VFS Cache Pressure:
- vm.vfs_cache_pressure = 50 (controls reclaim of cached directory/file objects; higher values free memory faster).

5. Hardware-Level Optimization

Use SSDs

Replace HDDs with SSDs for dramatic improvements in IOPS (Input/Output Operations Per Second) and latency. Ensure TRIM is enabled (discard mount option or fstrim -av for periodic trimming).

RAID Configuration

Combine multiple disks via RAID to enhance performance/redundancy:

RAID 10: Best for performance + redundancy (striping + mirroring).
RAID 5/6: Suitable for capacity-focused workloads (parity-based, but write performance is lower due to parity calculations).
Use mdadm to create/manage RAID arrays (e.g., mdadm --create /dev/md0 --level=10 --raid-devices=4 /dev/sda /dev/sdb /dev/sdc /dev/sdd).

SSD Caching

For HDD-based systems, use SSDs as cache (e.g., bcache, dm-cache) to accelerate frequently accessed data. This is a cost-effective way to boost performance without full SSD migration.

6. Monitoring and Maintenance

Regularly monitor I/O performance to catch regressions:

Long-term monitoring: Use tools like sar (from sysstat package) to track historical I/O trends.
Fragmentation: Defragment HDDs periodically (e4defrag /dev/sdX for EXT4) to maintain performance.
Log Analysis: Check system/application logs for I/O-related errors (e.g., disk timeouts, ENOSPC).

By following these steps—identifying bottlenecks, optimizing filesystems/schedulers, tuning kernel parameters, and upgrading hardware—you can significantly improve Linux disk I/O performance and reduce dropped requests caused by I/O saturation.

Linux dropped磁盘I/O优化