6. About roles

This project provides many kinds of roles. This section gives a short description of each role.

6.1. Roles to configure basic environments

Role name	Use for
common	Basic configuration about OS, basic services, and so on
prompt	Configuration of console prompt
screen	Configuration of screen command
user	Configuration of users
epel	Configuration of EPEL Repository
jdk	Configuration of Oracle JDK
scala	Configuration of Scala on Hadoop client node
sbt	Configuration of sbt
activator_mini	Configuration of Activator mini
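
As a minimal sketch of how roles like the ones above might be applied, a playbook can list them under `roles:`. The host group `hadoop_client` is a hypothetical name used only for illustration, and the selection and order of roles are assumptions; the playbooks shipped with this project may be organized differently.

```yaml
# Hypothetical playbook: apply some of the basic-environment roles to one host group.
# "hadoop_client" is an assumed inventory group name, not one defined by this document.
- hosts: hadoop_client
  become: true
  roles:
    - common
    - user
    - epel
    - jdk
```

A playbook of this shape would be run with `ansible-playbook -i <inventory> <playbook>.yml`, where both file names are placeholders.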

6.2. Roles to configure Ansible

Role name	Use for
ansible	Configuration of nodes where you execute the ansible command
ansible_remote	Configuration of nodes that are configured by Ansible

6.3. Roles to boot EC2 instances for Hadoop cluster

Role name	Use for
ec2_hadoop	Boot EC2 instances for Hadoop cluster

6.4. Roles to configure CDH5 Hadoop

Role name	Use for
cdh5_base	Basic configuration of Hadoop
cdh5_jn	Configuration of JournalNode
cdh5_nn	Configuration of NameNode
cdh5_ot	Configuration of HistoryServer and YARN Proxy
cdh5_rm	Configuration of ResourceManager
cdh5_sl	Configuration of DataNode and NodeManager
zookeeper_server	Configuration of ZooKeeper server

6.5. Roles to configure CDH5 pseudo Hadoop

Role name	Use for
cdh5_pseudo	Basic configuration of the Hadoop pseudo-distributed environment

6.6. Roles to configure Spark core on client node

Role name	Use for
cdh5_spark	Configuration of Spark core on Hadoop client node

6.7. Roles to configure Ganglia

Role name	Use for
ganglia_master	Configuration of Ganglia Master and Web frontend
ganglia_slave	Configuration of Ganglia Slave

6.8. Roles to configure InfluxDB and Grafana

Role name	Use for
influxdb	Configuration of InfluxDB
grafana	Configuration of Grafana

Roles to configure Spark community edition

Role name	Use for
spark_comm	Configuration of Spark community edition

6.9. Roles to configure Zeppelin

Role name	Use for
zeppelin	Configuration of Zeppelin community edition

6.10. Roles to configure fluentd or td-agent

Role name	Use for
fluentd	Configuration of fluentd (community edition)
td_agent	Configuration of td-agent

6.11. Roles to configure Kafka

Role name	Use for
kafka	Configuration of Kafka cluster

6.12. Roles to configure Confluent

Role name	Use for
confluent_kafka	Configuration of Confluent packages

6.13. Roles to configure Ambari

Role name	Use for
ambari_server	Configuration of Ambari server
ambari_agent	Configuration of Ambari agent

6.14. Roles to configure CI environment

Role name	Use for
jenkins	Configuration of Jenkins

6.15. Roles to configure Anaconda CE

Role name	Use for
jenkins	Configuration of Anaconda CE

6.16. Roles to configure PostgreSQL

Role name	Use for
postgresql	Configuration of PostgreSQL, which is used as the metastore database (metastore_db) of Hive

6.17. Roles to configure Hive

Role name	Use for
cdh5_hive	Configuration of Hive with metastore

6.18. Roles to configure Alluxio

Role name	Use for
alluxio_conf	Deploy Alluxio configuration files to slave nodes
alluxio_yarn	Configuration of Alluxio on YARN

6.19. Roles to configure TPC-DS

Role name	Use for
tpc_ds	Download the TPC-DS packages and build the binaries. You must provide the download URL yourself.