Ubuntu Hadoop Performance Testing: A Hands-On Guide
1. Test Preparation and Baseline
hadoop fs -mkdir -p /user/test && hadoop fs -chown $USER:$USER /user/test
hdfs dfsadmin -setSpaceQuota 500g /user/test
hdfs dfsadmin -safemode leave
hadoop fs -ls -R / > /dev/null
2. Core Tests and Commands
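TeraGen takes a row count rather than a byte size, and each row it writes is 100 bytes, so the 100000000 rows used by the teragen command in this section come to roughly 10 GB (decimal). The sketch below converts a target size into a row count; the function name `teragen_rows` is hypothetical and only serves the illustration.

```shell
# Convert a target data size in decimal gigabytes into a TeraGen row count.
# TeraGen writes fixed 100-byte rows, so rows = bytes / 100.
# teragen_rows is a hypothetical helper name, not part of Hadoop.
teragen_rows() {
    echo $(( $1 * 1000 * 1000 * 1000 / 100 ))
}

teragen_rows 10   # prints 100000000
```

The result can be passed straight to teragen as its first argument when a different data volume is wanted.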
yarn jar $HADOOP_HOME/share/hadoop/mapreduce/hadoop-mapreduce-client-jobclient-*.tests.jar TestDFSIO -write -nrFiles 10 -size 1GB -resFile result.log
yarn jar $HADOOP_HOME/share/hadoop/mapreduce/hadoop-mapreduce-client-jobclient-*.tests.jar TestDFSIO -read -nrFiles 10 -size 1GB -resFile result.log
yarn jar $HADOOP_HOME/share/hadoop/mapreduce/hadoop-mapreduce-client-jobclient-*.tests.jar TestDFSIO -clean
yarn jar $HADOOP_HOME/share/hadoop/mapreduce/hadoop-mapreduce-client-jobclient-*.tests.jar mrbench -numRuns 10
yarn jar $HADOOP_HOME/share/hadoop/mapreduce/hadoop-mapreduce-client-jobclient-*.tests.jar nnbench -operation create_write -maps 10 -reduces 1 -blockSize 134217728 -numberOfFiles 1000 -bytesToWrite 0
yarn jar $HADOOP_HOME/share/hadoop/mapreduce/hadoop-mapreduce-examples-*.jar teragen 100000000 /user/test/teragen
yarn jar $HADOOP_HOME/share/hadoop/mapreduce/hadoop-mapreduce-examples-*.jar terasort /user/test/teragen /user/test/terasort
yarn jar $HADOOP_HOME/share/hadoop/mapreduce/hadoop-mapreduce-examples-*.jar teravalidate /user/test/terasort /user/test/teravalidate
./bin/run_all.sh).
3. Recording and Interpreting Results
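To keep results comparable across runs, each benchmark can be wrapped so that its wall-clock time and exit status land in one CSV file. This is a minimal sketch: `record_run` and `RESULTS_CSV` are hypothetical names, and `sleep 1` stands in for a real benchmark command.

```shell
# File that collects one CSV row per benchmark run (hypothetical default name).
RESULTS_CSV="${RESULTS_CSV:-bench_results.csv}"

# record_run LABEL COMMAND [ARGS...]
# Runs the command, measures elapsed wall-clock seconds, and appends
# "UTC timestamp,label,elapsed seconds,exit status" to $RESULTS_CSV.
record_run() {
    label="$1"; shift
    start=$(date +%s)
    rc=0
    "$@" || rc=$?
    end=$(date +%s)
    printf '%s,%s,%s,%s\n' \
        "$(date -u +%Y-%m-%dT%H:%M:%SZ)" "$label" "$((end - start))" "$rc" \
        >> "$RESULTS_CSV"
    return "$rc"
}

# Stand-in command; replace `sleep 1` with one of the yarn jar invocations.
record_run demo sleep 1
cat "$RESULTS_CSV"
```

Elapsed time here has one-second resolution, which is usually enough for jobs that run for tens of seconds or longer.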
----- TestDFSIO ----- : write
Date & time: Thu Dec 14 13:21:57 CST 2023
Number of files: 10
Total MBytes processed: 1000
Throughput mb/sec: 49.72
Average IO rate mb/sec: 190.11
IO rate std deviation: 75.6
Test exec time sec: 27.01
----- TestDFSIO ----- : read
Date & time: Thu Dec 14 13:36:56 CST 2023
Number of files: 10
Total MBytes processed: 1000
Throughput mb/sec: 631.31
Average IO rate mb/sec: 655.3
IO rate std deviation: 127
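When several runs accumulate in the same result file, pulling out just the throughput figures makes comparison easier. The following is a sketch that assumes the file keeps the layout shown above; `dfsio_throughput` is a hypothetical helper name, and the sample file reuses the two throughput values from the log above.

```shell
# Print "<mode> <throughput>" for every TestDFSIO section in a result file.
dfsio_throughput() {
    awk -F': *' '
        /----- TestDFSIO -----/ { mode = $NF }        # "write" or "read"
        /Throughput mb\/sec/    { print mode, $2 }
    ' "$1"
}

# Small sample file built from the values in the log above.
sample=$(mktemp)
cat > "$sample" <<'EOF'
----- TestDFSIO ----- : write
Throughput mb/sec: 49.72
----- TestDFSIO ----- : read
Throughput mb/sec: 631.31
EOF

dfsio_throughput "$sample"
# prints:
# write 49.72
# read 631.31
```

In practice the function would be pointed at the file named by -resFile, for example `dfsio_throughput result.log`.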
4. Network and System-Level Verification
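For the ping check in this section, the two summary lines carry the numbers that matter: packet loss and average RTT. The sketch below extracts them, assuming the summary format printed by the iputils `ping` shipped with Ubuntu; `ping_summary` is a hypothetical helper name and the sample input is made up for illustration, not a measurement.

```shell
# Read ping output on stdin and print packet loss and average RTT.
# Assumes iputils-style summary lines, e.g.
#   "100 packets transmitted, 100 received, 0% packet loss, time 99123ms"
#   "rtt min/avg/max/mdev = 0.321/0.402/0.611/0.051 ms"
ping_summary() {
    awk '
        /packet loss/ {
            for (i = 1; i <= NF; i++) if ($i ~ /%$/) loss = $i
        }
        /^rtt / {
            split($4, rtt, "/")      # field 4 holds min/avg/max/mdev values
            avg = rtt[2]
        }
        END { print "loss=" loss, "avg_rtt_ms=" avg }
    '
}

# Illustrative input only (invented numbers).
printf '%s\n' \
    '100 packets transmitted, 100 received, 0% packet loss, time 99123ms' \
    'rtt min/avg/max/mdev = 0.321/0.402/0.611/0.051 ms' | ping_summary
# prints: loss=0% avg_rtt_ms=0.402
```

Against a live host the same function reads from a pipe, for example `ping -c 100 192.168.1.10 | ping_summary`.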
iperf3 -s
iperf3 -c 192.168.1.10 -t 30 -P 4
ping -c 100 <target-IP>, then examine the RTT distribution and the packet loss rate.
fio --filename=/data/test.log --direct=1 --iodepth=1 --thread --rw=write --ioengine=psync --bs=1M --size=10G --numjobs=4 --runtime=60 --group_reporting --name=seq_write
fio --filename=/data/test.log --direct=1 --iodepth=1 --thread --rw=read --ioengine=psync --bs=1M --size=10G --numjobs=4 --runtime=60 --group_reporting --name=seq_read
5. Test Procedure and Notes
TestDFSIO -clean), to avoid cache interference; run the tests during off-peak hours whenever possible.
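One way to make the cleanup step hard to forget is to bundle write, read, and clean into a single round. The sketch below does that and adds a dry-run switch so the sequence can be reviewed before it touches the cluster; `run`, `dfsio_round`, and `DRY_RUN` are hypothetical names, and `tests.jar` stands in for the jobclient tests jar path.

```shell
# Print the command when DRY_RUN=1, otherwise execute it.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# dfsio_round JAR NR_FILES SIZE
# One TestDFSIO round: write, read, then clean up the test data.
dfsio_round() {
    jar="$1"; files="$2"; size="$3"
    run yarn jar "$jar" TestDFSIO -write -nrFiles "$files" -size "$size" -resFile result.log
    run yarn jar "$jar" TestDFSIO -read -nrFiles "$files" -size "$size" -resFile result.log
    run yarn jar "$jar" TestDFSIO -clean
}

# Dry run: only prints the three commands, nothing is submitted.
DRY_RUN=1
dfsio_round tests.jar 10 1GB
```

Setting `DRY_RUN=0` and passing the real jar path runs the same three steps against the cluster.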