设为首页 收藏本站
查看: 999|回复: 0

[经验分享] Install hadoop with Cloudera Manager 5.2 using Parcel on CentOS 6.5

[复制链接]

尚未签到

发表于 2018-10-30 13:08:57 | 显示全部楼层 |阅读模式
  主机分配考虑:
  master:
  HDFS NameNode 1 + HDFS NameNode 2
  YARN ResourceManager 1 + YARN ResourceManager 2
  slave: (these roles co-deployed)
  HDFS DataNode + YARN NodeManager + HBase RegionServer + Impala Daemon
  others:
  HBase Master: multiple, on dedicated nodes
  ZooKeeper + Jounal Node: >3, odd number, recommended on dedicated nodes, not with flume agent service together
  
  Hive + Hue + Impala + Oozie + Solr + Sqoop2
  Spark (like MapReduce)
  Cloudera Management Service
  分区考虑,不要使用LVM
  root -- >50G
  var -- >100G
  opt -- >50G
  /tmp -- >100G (run job失败的话请查看此目录空间)
  swap -- 2倍系统内存
  RAM -- >8GB
  Master node:
  RAID 10, dual Ethernet cards, dual power supplies, etc.
  Slave node:
  1. RAID is not necessary
  2. HDFS分区, not using LVM
  /etc/fstab -- ext3/ext4    defaults,noatime
  挂载到/data/N/, for N=0,1,2... (one partition per disk)
  挂载到/data/N/, for N=0,1,2... (one partition per disk)
  Cloudera CDH repository:
  http://archive.cloudera.com/cdh5
  http://archive-primary.cloudera.com/cm5
  http://archive.cloudera.com/gplextras5
  Cloudera parcel repository:
  http://archive.cloudera.com/cdh5/parcels/
  http://archive.cloudera.com/gplextras5/parcels/
  http://archive.cloudera.com/sqoop-connectors/parcels/
  http://archive.cloudera.com/accumulo-c5/parcels/
  http://archive.cloudera.com/kafka/parcels/
  Cloudera Labs repository:
  http://archive-primary.cloudera.com/cloudera-labs/
  on cloudera manager and all cluster nodes (including master + slave nodes):
  at least 3 Servers for ZooKeeper, 3 DataNodes for HDFS.
  1.disable selinux and iptables
  service iptables stop
  chkconfig iptables off; chkconfig ip6tables off
  setenforce 0
  sed -i 's,SELINUX=enforcing,SELINUX=disabled,g' /etc/selinux/config
  2. disable ipv6 and kernel parameters tuning
  echo "net.ipv6.conf.all.disable_ipv6 = 1" >> /etc/sysctl.conf
  echo "vm.swappiness = 0" >> /etc/sysctl.conf
  echo 'net.ipv4.tcp_retries2 = 2' >> /etc/sysctl.conf
  echo 'vm.overcommit_memory = 1' >> /etc/sysctl.conf
  echo "fs.file-max = 6815744" >> /etc/sysctl.conf
  echo "fs.aio-max-nr = 1048576" >> /etc/sysctl.conf
  echo "net.core.rmem_default = 262144" >> /etc/sysctl.conf
  echo "net.core.wmem_default = 262144" >> /etc/sysctl.conf
  echo "net.core.rmem_max = 16777216" >> /etc/sysctl.conf
  echo "net.core.wmem_max = 16777216" >> /etc/sysctl.conf
  echo "net.ipv4.tcp_rmem = 4096 262144 16777216" >> /etc/sysctl.conf
  echo "net.ipv4.tcp_wmem = 4096 262144 16777216" >> /etc/sysctl.conf
  only on ResourceManager and JobHistory Server
  echo "net.core.somaxconn = 1000" >> /etc/sysctl.conf
  sysctl -p
  echo "echo never > /sys/kernel/mm/redhat_transparent_hugepage/enabled" >> /etc/rc.local
  echo "echo never > /sys/kernel/mm/redhat_transparent_hugepage/defrag" >> /etc/rc.local
  echo "echo no > /sys/kernel/mm/redhat_transparent_hugepage/khugepaged/defrag" >> /etc/rc.local
  3. vi /etc/hosts to add all hosts FQDN, like below:
  192.168.1.19    cm5.local cm5 archive.cloudera.com
  192.168.1.20    master1.local master1  # HDFS NameNode
  192.168.1.21    master2.local master2  # YARN ResourceManager
  192.168.1.22    slave1.local slave1
  192.168.1.23    slave2.local slave2
  192.168.1.24    slave3.local slave3
  vi /etc/sysconfig/network to set FQDN
  yum -y install ntp openssh-clients lzo
  service ntpd start; chkconfig ntpd on
  cat  /etc/yum.repos.d/iso.repo
  [iso]
  name=iso
  baseurl=http://mirrors.aliyun.com/centos/6.5/os/x86_64
  enable=1
  gpgcheck=0
  EOF
  vi /etc/security/limits.conf
  *         soft    core         unlimited
  *         hard    core         unlimited
  *         soft    nofile       65536
  *         hard    nofile       65536
  *         soft    nproc        unlimited
  *         hard    nproc        unlimited
  *         soft    memlock      unlimited
  *         hard    memlock      unlimited
  vi /etc/grub.conf
  add "elevator=deadline"(no quotes) at the end of kernel line
  reboot to take effect
  4. On cloudera manager, we will install mysql 5.6 and apache
  rpm -e --nodeps mysql-libs
  yum -y install libaio perl
  rpm -ivh MySQL-shared-compat-5.6.20-1.el6.x86_64.rpm
  rpm -ivh MySQL-shared-5.6.20-1.el6.x86_64.rpm
  rpm -ivh MySQL-server-5.6.20-1.el6.x86_64.rpm
  rpm -ivh MySQL-client-5.6.20-1.el6.x86_64.rpm
  vi /etc/my.cnf
  [mysqld]
  transaction-isolation=READ-COMMITTED
  symbolic-links=0
  key_buffer = 16M
  key_buffer_size = 32M
  max_allowed_packet = 32M
  thread_stack = 256K
  thread_cache_size = 64
  query_cache_limit = 8M
  query_cache_size = 64M
  query_cache_type = 1
  # Allow 100 maximum connections for each database and then add 50 extra connections
  max_connections = 550
  log-bin=mysql-bin
  binlog_format=mixed
  expire_logs_days=10
  max_binlog_size=100M
  read_buffer_size = 2M
  read_rnd_buffer_size = 16M
  sort_buffer_size = 8M
  join_buffer_size = 8M
  # InnoDB settings
  innodb_file_per_table = 1
  innodb_flush_log_at_trx_commit = 2
  innodb_log_buffer_size = 64M
  innodb_buffer_pool_size = 4G
  innodb_thread_concurrency = 8
  innodb_flush_method = O_DIRECT
  innodb_log_file_size = 512M
  service mysql start; chkconfig mysql on
  cat ~/.mysql_secret
  mysqladmin -uroot -p'oldpassword' password newpassword
  mysql_secure_installation
  mysql -u root -p
  # for Activity Monitor
  create database amon DEFAULT CHARACTER SET utf8;

  grant all on amon.* TO 'amon'@'%'>
  grant all on amon.* TO 'amon'@'localhost'>  # for Reports Manager
  create database rman DEFAULT CHARACTER SET utf8;

  grant all on rman.* TO 'rman'@'%'>
  grant all on rman.* TO 'rman'@'localhost'>  # for Hive Metastore Server
  create database metastore DEFAULT CHARACTER SET utf8;

  grant all on metastore.* TO 'hive'@'%'>
  grant all on metastore.* TO 'hive'@'localhost'>  # for Sentry Server
  create database sentry DEFAULT CHARACTER SET utf8;

  grant all on sentry.* TO 'sentry'@'%'>
  grant all on sentry.* TO 'sentry'@'localhost'>  # for Cloudera Navigator Audit Server
  create database nav DEFAULT CHARACTER SET utf8;

  grant all on nav.* TO 'nav'@'%'>
  grant all on nav.* TO 'nav'@'localhost'>  flush privileges;
  yum -y install httpd
  service httpd start; chkconfig httpd on
  mkdir /var/www/html/cm520
  mkdir /var/www/html/parcel520
  mount -o loop cm520.iso /var/www/html/cm520
  mount -o loop parcel520.iso /var/www/html/parcel520
  cat  /etc/yum.repos.d/cm520.repo
  [cm520]
  name=cm520
  baseurl=http://192.168.1.19/cm520
  enable=1
  gpgcheck=0
  EOF
  yum -y install oracle-j2sdk1.7 cloudera-manager-daemons cloudera-manager-server
  ln -s  /usr/java/jdk1.7.0_67-cloudera /usr/java/default
  echo 'export JAVA_HOME=/usr/java/default' >> /etc/profile
  echo 'export PATH=$JAVA_HOME/bin:$PATH' >> /etc/profile
  source /etc/profile
  Install mysql jdbc connector:
  tar zxf mysql-connector-java-5.1.33.tar.gz
  mkdir /usr/share/java
  cp mysql-connector-java-5.1.33/mysql-connector-java-5.1.33-bin.jar /usr/share/java/mysql-connector-java.jar
  /usr/share/cmf/schema/scm_prepare_database.sh mysql -uroot -ppassword cm5 cm5 cm5
  (Running the script when MySQL is installed on another host
  on mysql server:

  grant all on *.* to 'temp'@'%'>  flush privileges;
  on cloudera manager server:
  /usr/share/cmf/schema/scm_prepare_database.sh mysql -h mysql-server-ip -utemp -ptemp --scm-host scm-host-ip cm5 cm5 cm5)
  service cloudera-scm-server start
  wait several minutes, then open http://192.168.1.19:7180
  username/password: admin/admin
  if it's ok
  yum -y install cloudera-manager-agent
  service cloudera-scm-agent start
  5. on all cluster nodes
  cat  /etc/yum.repos.d/cm520.repo
  [cm520]
  name=cm520
  baseurl=http://192.168.1.19/cm520
  enable=1
  gpgcheck=0
  EOF
  yum -y install oracle-j2sdk1.7 cloudera-manager-agent cloudera-manager-daemons
  ln -s  /usr/java/jdk1.7.0_67-cloudera /usr/java/default
  echo 'export JAVA_HOME=/usr/java/default' >> /etc/profile
  echo 'export PATH=$JAVA_HOME/bin:$PATH' >> /etc/profile
  source /etc/profile
  vi /etc/cloudera-scm-agent/config.ini
  server_host=cm5.local
  server_port=7182
  service cloudera-scm-agent start
DSC0000.jpg

DSC0001.jpg

DSC0002.jpg

DSC0003.jpg

DSC0004.jpg

DSC0005.jpg

DSC0006.jpg

DSC0007.jpg

DSC0008.jpg

  dfs.datanode.failed.volumes.tolerated
  If you have > three or four disks, you might want to set this to 1 or if you have many disks, two or more.
DSC0009.jpg

DSC00010.jpg

DSC00011.jpg

DSC00012.jpg

DSC00013.jpg

  关机的正确步骤:
  1. stop Cluster and cloudera management service
  2. poweroff hosts
  ok


运维网声明 1、欢迎大家加入本站运维交流群:群②:261659950 群⑤:202807635 群⑦870801961 群⑧679858003
2、本站所有主题由该帖子作者发表,该帖子作者与运维网享有帖子相关版权
3、所有作品的著作权均归原作者享有,请您和我们一样尊重他人的著作权等合法权益。如果您对作品感到满意,请购买正版
4、禁止制作、复制、发布和传播具有反动、淫秽、色情、暴力、凶杀等内容的信息,一经发现立即删除。若您因此触犯法律,一切后果自负,我们对此不承担任何责任
5、所有资源均系网友上传或者通过网络收集,我们仅提供一个展示、介绍、观摩学习的平台,我们不对其内容的准确性、可靠性、正当性、安全性、合法性等负责,亦不承担任何法律责任
6、所有作品仅供您个人学习、研究或欣赏,不得用于商业或者其他用途,否则,一切后果均由您自己承担,我们对此不承担任何法律责任
7、如涉及侵犯版权等问题,请您及时通知我们,我们将立即采取措施予以解决
8、联系人Email:admin@iyunv.com 网址:www.yunweiku.com

所有资源均系网友上传或者通过网络收集,我们仅提供一个展示、介绍、观摩学习的平台,我们不对其承担任何法律责任,如涉及侵犯版权等问题,请您及时通知我们,我们将立即处理,联系人Email:kefu@iyunv.com,QQ:1061981298 本贴地址:https://www.yunweiku.com/thread-628536-1-1.html 上篇帖子: Tachyon基本使用08-----Running Hadoop MapReduce on Tachyon 下篇帖子: 源码编译hadoop-2.5.1成功案例
您需要登录后才可以回帖 登录 | 立即注册

本版积分规则

扫码加入运维网微信交流群X

扫码加入运维网微信交流群

扫描二维码加入运维网微信交流群,最新一手资源尽在官方微信交流群!快快加入我们吧...

扫描微信二维码查看详情

客服E-mail:kefu@iyunv.com 客服QQ:1061981298


QQ群⑦:运维网交流群⑦ QQ群⑧:运维网交流群⑧ k8s群:运维网kubernetes交流群


提醒:禁止发布任何违反国家法律、法规的言论与图片等内容;本站内容均来自个人观点与网络等信息,非本站认同之观点.


本站大部分资源是网友从网上搜集分享而来,其版权均归原作者及其网站所有,我们尊重他人的合法权益,如有内容侵犯您的合法权益,请及时与我们联系进行核实删除!



合作伙伴: 青云cloud

快速回复 返回顶部 返回列表