以下是在Ubuntu上安装配置Hadoop的步骤:
sudo adduser hadoop、sudo passwd hadoop设置密码,再输入sudo usermod -aG sudo hadoop将用户添加到管理员组。sudo apt-get install openssh-server,安装后用ssh localhost登录本机,按提示操作,然后通过ssh-keygen -t rsa生成密钥,cat ~/.ssh/id_rsa.pub >> ~/.ssh/authorized_keys设置免密登录。sudo apt-get install openjdk-8-jdk安装JDK,通过java -version查看是否安装成功。sudo tar -zxf hadoop-*.tar.gz -C /usr/local解压到/usr/local目录,再通过sudo mv hadoop-* /usr/local/hadoop重命名文件夹,最后用sudo chown -R hadoop /usr/local/hadoop修改文件权限。~/.bashrc文件,添加export JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64、export HADOOP_HOME=/usr/local/hadoop、export PATH=$PATH:$HADOOP_HOME/bin:$HADOOP_HOME/sbin,然后执行source ~/.bashrc使配置生效。/usr/local/hadoop/etc/hadoop/目录下,修改core-site.xml,添加<property><name>hadoop.tmp.dir</name><value>file:/usr/local/hadoop/tmp</value></property>和<property><name>fs.defaultFS</name><value>hdfs://localhost:9000</value></property>。修改hdfs-site.xml,添加<property><name>dfs.replication</name><value>1</value></property>、<property><name>dfs.namenode.name.dir</name><value>file:/usr/local/hadoop/tmp/dfs/name</value></property>和<property><name>dfs.datanode.data.dir</name><value>file:/usr/local/hadoop/tmp/dfs/data</value></property>。/usr/local/hadoop目录下执行./bin/hdfs namenode -format。./sbin/start-dfs.sh启动HDFS,可通过jps查看进程,若有NameNode、DataNode等进程,说明启动成功。