hadoop2.2.0版本伪分布模式安装

we23213 · 发表于 2014-11-17 13:55:29

说明：HADOOP 解压到了/app目录
      1、解压文件
            tar -zxvf hadoop-2.2.0.tar.gz
      2、对解压的文件重命名
            mv  hadoop-2.2.0  hadoop
      3、首先修改HADOOP 运行环境变量在这里面HADOOP1与2的版本差异比较大hadoop1 版本直接在hadoop目录下的conf目录中而2周日在$HADOOP_HOME/etc/hadoop目录中。
         1）、cd /app/hadoop/etc/hadoop  会发现里面有好多文件不过里面文件好多都是和hadoop1的文件相同的。。
         2）、vi hadoop-env.sh修改里面export JAVA_HOME=${JAVA_HOME} 为export       JAVA_HOME=/app/jdk (这里面的我的JDK 安装在了/app/目录并命名为了jdk)
         3）、：wq!保存文件
      4、配置core-site.xml。其中这个配置代表hadoop 的核心配置文件.我的建议是到源码包找core-default.xml这个文件。打开会发现里面有好多配置。
      这里面我们暂时需要配置的有两项一个是格式化数据存储的临时目录另一个HDFS的访问路径,找到core-site.xml中的hadoop.tmp.dir 和fs.defaultFS拷贝并修改它们的value中的内容
      <configuration>
         <property>
            <name>hadoop.tmp.dir</name>
            <value>/app/hadoop/tmpdata</value>
            <description>A base for other temporary directories.</description>
         </property>

         <property>
            <name>fs.defaultFS</name>
            <value>hdfs://master:49000</value>
            <description>The name of the default file system.  A URI whose
            scheme and authority determine the FileSystem implementation.  The
            uri's scheme determines the config property (fs.SCHEME.impl) naming
            the FileSystem implementation class.  The uri's authority is used to
            determine the host, port, etc. for a filesystem.</description>
         </property>
      <configuration>
      最后保存就可以了.
      5、配置 hdfs-site.xml。这是HADOOP两大核心HDFS 的配置文件.同样需要去解压HDFS中的源码文件.里面会有一个hdfs-defalut.xml文件。
      这里面我们需要配置有几项目主要hdfs 数据存储目录和文件备份数据.

   <property>
      <name>dfs.namenode.name.dir</name>
      <value>file:///app/hadoop/dfs/name</value>
      <description>Determines where on the local filesystem the DFS name node
            should store the name table(fsimage).  If this is a comma-delimited list
            of directories then the name table is replicated in all of the
            directories, for redundancy. </description>
      </property>

      <property>
      <name>dfs.datanode.data.dir</name>
      <value>file:///app/hadoop/dfs/data</value>
      <description>Determines where on the local filesystem an DFS data node
      should store its blocks.  If this is a comma-delimited
      list of directories, then data will be stored in all named
      directories, typically on different devices.
      Directories that do not exist are ignored.
      </description>
      </property>

      <property>
      <name>dfs.permissions.enabled</name>
      <value>false</value>
      <description>
         If "true", enable permission checking in HDFS.
         If "false", permission checking is turned off,
         but all other behavior is unchanged.
         Switching from one parameter value to the other does not change the mode,
         owner or group of files or directories.
      </description>
      </property>
      <property>
      <name>dfs.replication</name>
      <value>1</value>
      <description>Default block replication.
      The actual number of replications can be specified when the file is created.
      The default is used if replication is not specified in create time.
      </description>
      </property>
      最后保存文件.
      5、配置mapred-site.xml.这个文件与HADOOP1 差别很大.
   <configuration>
         <property>
            <name>mapreduce.framework.name</name>
            <value>yarn</value>
            <description>The runtime framework for executing MapReduce jobs.
            Can be one of local, classic or yarn.
            </description>
         </property>
      </configuration>
      最后保存文件.
      6、配置yarn-site.xml
   <configuration>

         <property>
            <description>The hostname of the RM.</description>
            <name>yarn.resourcemanager.hostname</name>
            <value>master</value>
            </property>
         <property>
            <name>yarn.resourcemanager.resource-tracker.address</name>
            <value>master:49100</value>
            </property>

            <property>
            <description>The address of the scheduler interface.</description>
            <name>yarn.resourcemanager.scheduler.address</name>
            <value>master:49200</value>
            </property>

            <property>
            <description>The class to use as the resource scheduler.</description>
            <name>yarn.resourcemanager.scheduler.class</name>
            <value>org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacityScheduler</value>
            </property>

            <property>
            <description>The address of the applications manager interface in the RM.</description>
            <name>yarn.resourcemanager.address</name>
            <value>master:49300</value>
            </property>

            <property>
            <description>List of directories to store localized files in. An
               application's localized file directory will be found in:
               ${yarn.nodemanager.local-dirs}/usercache/${user}/appcache/application_${appid}.
               Individual containers' work directories, called container_${contid}, will
               be subdirectories of this.
            </description>
            <name>yarn.nodemanager.local-dirs</name>
            <value></value>
            </property>


            <property>
            <description>The address of the container manager in the NM.</description>
            <name>yarn.nodemanager.address</name>
            <value>master:49400</value>
            </property>

            <property>
            <description>Amount of physical memory, in MB, that can be allocated
            for containers.</description>
            <name>yarn.nodemanager.resource.memory-mb</name>
            <value>10000</value>
            </property>

            <property>
            <description>Where to aggregate logs to.</description>
            <name>yarn.nodemanager.remote-app-log-dir</name>
            <value>/app/hadoop/logs</value>
            </property>

            <property>
            <description>
               Where to store container logs. An application's localized log directory
               will be found in ${yarn.nodemanager.log-dirs}/application_${appid}.
               Individual containers' log directories will be below this, in directories
               named container_{$contid}. Each container directory will contain the files
               stderr, stdin, and syslog generated by that container.
            </description>
            <name>yarn.nodemanager.log-dirs</name>
            <value>/app/hadoop/logs/userlogs</value>
            </property>

            <property>
            <description>the valid service name should only contain a-zA-Z0-9_ and can not start with numbers</description>
            <name>yarn.nodemanager.aux-services</name>
            <value>mapreduce_shuffle</value>
            
            </property>
      </configuration>
      保存文件

      7、修改集群文件slaves将localhost修改为master保存文件

      8、将HADOOP_HOME 目录改变为hadoop 用户
            chown -R hadoop:hadoops hadoop
      9、格式化namenode
            hadoop namenode -format
      9、启动
            start-all.sh
      10、通过浏览器访问http://192.168.1.106:50070/dfshealth.jsp
      其中在Cluster Summary里面有两项目Live Nodes 和Dead Nodes
      Live Nodes >=1 表示DATANODE 启动正确
      可以通过http://192.168.1.106:8088地址查看所有的应用

账号		自动登录	找回密码
密码			立即注册

VMware vcenter+vSphere 6.5 U2共享

【跟谁学】韩宇极简英语课-技术人员不得不

用Zabbix通过JMX方式监控weblogic

winhex数据恢复教程（非常巨大，内容丰富）

Symantec Backup Exec 2015 2016/2012 BE20

NetScaler VPX部署之：NetScaler Gateway调

zabbix3.4.1安装部署+微信推送信息+大屏显

[经验分享] hadoop2.2.0版本伪分布模式安装

浏览过的版块

扫码加入运维网微信交流群