6. About roles

This project has many kinds of roles. This section gives a short description of each role.

6.1. Roles to configure basic environments

Role name         Use for
common            Basic configuration of the OS, basic services, and so on
prompt            Configuration of the console prompt
screen            Configuration of the screen command
user              Configuration of users
epel              Configuration of the EPEL repository
jdk               Configuration of Oracle JDK
scala             Configuration of Scala on the Hadoop client node
sbt               Configuration of sbt
activator_mini    Configuration of Activator mini
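
As a rough illustration of how the roles listed above might be applied, the following playbook sketch assigns some of them to a group of hosts. The group name all_nodes and the become setting are assumptions made for this example; only the role names come from this section. The playbooks shipped with this project may be organized differently.

```yaml
# Sketch only: "all_nodes" is a hypothetical inventory group.
- hosts: all_nodes
  become: true        # assumption: the roles need root privileges
  roles:
    - common          # basic OS and service configuration
    - user            # user accounts
    - epel            # EPEL repository
    - jdk             # Oracle JDK
```

Roles in the list run in the order given, so a role that other roles rely on should appear first.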

6.2. Roles to configure Ansible

Role name         Use for
ansible           Configuration of the nodes where you execute the ansible command
ansible_remote    Configuration of the nodes that are configured by Ansible

6.3. Roles to boot EC2 instances for Hadoop cluster

Role name         Use for
ec2_hadoop        Boot EC2 instances for the Hadoop cluster

6.4. Roles to configure CDH5 Hadoop

Role name         Use for
cdh5_base         Basic configuration of Hadoop
cdh5_jn           Configuration of JournalNode
cdh5_nn           Configuration of NameNode
cdh5_ot           Configuration of HistoryServer and YARN Proxy
cdh5_rm           Configuration of ResourceManager
cdh5_sl           Configuration of DataNode and NodeManager
zookeeper_server  Configuration of the ZooKeeper server
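
Because each of these roles targets a different kind of daemon, one plausible way to use them is one play per host group. The sketch below shows that idea. All group names (hadoop_all, hadoop_journalnode, and so on) are hypothetical, and the split into plays is an assumption rather than a description of this project's actual playbooks.

```yaml
# Sketch only: every "hosts" value below is a hypothetical inventory group.
- hosts: hadoop_all
  become: true
  roles:
    - cdh5_base           # settings shared by all Hadoop nodes

- hosts: hadoop_zookeeper
  become: true
  roles:
    - zookeeper_server

- hosts: hadoop_journalnode
  become: true
  roles:
    - cdh5_jn

- hosts: hadoop_namenode
  become: true
  roles:
    - cdh5_nn

- hosts: hadoop_resourcemanager
  become: true
  roles:
    - cdh5_rm

- hosts: hadoop_slave
  become: true
  roles:
    - cdh5_sl             # DataNode and NodeManager
```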

6.5. Roles to configure CDH5 pseudo Hadoop

Role name         Use for
cdh5_pseudo       Basic configuration of the Hadoop pseudo-distributed environment

6.6. Roles to configure Spark core on client node

Role name         Use for
cdh5_spark        Configuration of Spark core on the Hadoop client node

6.7. Roles to configure Ganglia

Role name         Use for
ganglia_master    Configuration of the Ganglia master and web frontend
ganglia_slave     Configuration of the Ganglia slave

6.8. Roles to configure InfluxDB and Grafana

Role name         Use for
influxdb          Configuration of InfluxDB
grafana           Configuration of Grafana

Roles to configure Spark community edition

Role name         Use for
spark_comm        Configuration of Spark community edition

6.9. Roles to configure Zeppelin

Role name         Use for
zeppelin          Configuration of Zeppelin community edition

6.10. Roles to configure fluentd or td-agent

Role name         Use for
fluentd           Configuration of fluentd (community edition)
td_agent          Configuration of td-agent

6.11. Roles to configure Kafka

Role name         Use for
kafka             Configuration of the Kafka cluster

6.12. Roles to configure Confluent

Role name         Use for
confluent_kafka   Configuration of Confluent packages

6.13. Roles to configure Ambari

Role name         Use for
ambari_server     Configuration of the Ambari server
ambari_agent      Configuration of the Ambari agent

6.14. Roles to configure CI environment

Role name         Use for
jenkins           Configuration of Jenkins

6.15. Roles to configure Anaconda CE

Role name         Use for
jenkins           Configuration of Anaconda CE

6.16. Roles to configure PostgreSQL

Role name         Use for
postgresql        Configuration of PostgreSQL, which is used as the metastore_db of Hive

6.17. Roles to configure Hive

Role name         Use for
cdh5_hive         Configuration of Hive with metastore

6.18. Roles to configure Alluxio

Role name         Use for
alluxio_conf      Deploy Alluxio configuration files to the slave nodes
alluxio_yarn      Configuration of Alluxio on YARN

6.19. Roles to configure TPC-DS

Role name         Use for
tpc_ds            Download the TPC-DS packages and build the binaries. You must provide the download URL yourself.