Two test VMs running RHEL 5.3 x64, each with the latest JDK installed and passwordless SSH login configured between them.
Server 1: 192.168.56.101 dev1
Server 2: 192.168.56.102 dev2
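For reference, a minimal sketch of the host-name mapping and passwordless SSH setup assumed above (run as root on dev1; adjust user and paths to your environment):
# echo "192.168.56.101 dev1" >> /etc/hosts
# echo "192.168.56.102 dev2" >> /etc/hosts
# ssh-keygen -t rsa -P "" -f ~/.ssh/id_rsa
# cat ~/.ssh/id_rsa.pub >> ~/.ssh/authorized_keys
# scp ~/.ssh/id_rsa.pub dev2:/root/
# ssh dev2 "mkdir -p ~/.ssh && cat /root/id_rsa.pub >> ~/.ssh/authorized_keys && chmod 600 ~/.ssh/authorized_keys"
# ssh dev2 hostname
The last command should print dev2 without prompting for a password.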
Download hadoop-0.20.1.tar.gz from http://apache.freelamp.com/hadoop/core/hadoop-0.20.1/ and copy it to the /usr/software/hadoop directory on dev1. Log in to dev1 and run the following commands:
# cd /usr/software/hadoop
# tar zxvf hadoop-0.20.1.tar.gz
# cp -a hadoop-0.20.1 /usr/hadoop
# cd /usr/hadoop/conf
Edit the Hadoop environment configuration file hadoop-env.sh:
# vi hadoop-env.sh
Add the following line:
export JAVA_HOME=/usr/java/jdk1.6.0_16
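(The JDK path above is this article's install location; substitute your own.) To confirm that Hadoop picks up the environment, you can run:
# cd /usr/hadoop
# bin/hadoop version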
Edit Hadoop's main configuration file core-site.xml:
# vi core-site.xml
Add the following properties inside the <configuration> element (adjust the values to your needs):
<property>
  <name>fs.default.name</name>
  <value>hdfs://dev1</value>
  <description>The name of the default file system. Either the literal string "local" or a host:port for DFS.</description>
</property>
<property>
  <name>hadoop.tmp.dir</name>
  <value>/usr/hadoop/tmp</value>
  <description>A base for other temporary directories.</description>
</property>
<property>
  <name>dfs.name.dir</name>
  <value>/usr/hadoop/filesystem/name</value>
  <description>Determines where on the local filesystem the DFS name node should store the name table. If this is a comma-delimited list of directories then the name table is replicated in all of the directories, for redundancy.</description>
</property>
<property>
  <name>dfs.data.dir</name>
  <value>/usr/hadoop/filesystem/data</value>
  <description>Determines where on the local filesystem a DFS data node should store its blocks. If this is a comma-delimited list of directories, then data will be stored in all named directories, typically on different devices. Directories that do not exist are ignored.</description>
</property>
<property>
  <name>dfs.replication</name>
  <value>1</value>
  <description>Default block replication. The actual number of replications can be specified when the file is created. The default is used if replication is not specified in create time.</description>
</property>
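Because data directories that do not exist are ignored (per the dfs.data.dir description above), it is prudent to create the configured directories up front; a short sketch using this article's paths:
# mkdir -p /usr/hadoop/tmp /usr/hadoop/filesystem/name /usr/hadoop/filesystem/data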
Edit Hadoop's mapred-site.xml file:
# vi mapred-site.xml
Add the following property:
<property>
  <name>mapred.job.tracker</name>
  <value>dev1:9001</value>
  <description>The host and port that the MapReduce job tracker runs at. If "local", then jobs are run in-process as a single map and reduce task.</description>
</property>
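With the configuration in place, the usual next step on dev1 is to format HDFS once and start the daemons (a minimal sketch; distributing the configuration to dev2 and setting up the masters/slaves files are not covered here):
# cd /usr/hadoop
# bin/hadoop namenode -format
# bin/start-all.sh
# jps
Depending on the masters/slaves setup, jps should then list processes such as NameNode, SecondaryNameNode, DataNode, JobTracker and TaskTracker.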