MANAGING HADOOP CLUSTERS WITH ANSIBLE AND APACHE AMBARI

 
Videos  /  AnsibleFest SF 2016  /  STACKSPACE

Modern data products use a mix of distributed compute paradigms. Tools like Kafka, Storm, Spark, and YARN allow users to build data products tuned for a particular problem. Increasingly, these systems are deployed to cloud environments, where mapping hosts to services presents deployment challenges. Learn how Stackspace used Ansible to develop a custom, open source module for Ambari, significantly reducing the playbook logic and template files in their codebase and easing the installation and configuration of Hadoop clusters.

Learn:
  • How Stackspace used Ansible to develop an open source module for Ambari
 

Presenter:

Mark Bittmann, Lead Data Scientist, Stackspace
