Login | Register   
RSS Feed
Download our iPhone app
Browse DevX
Sign up for e-mail newsletters from DevX

By submitting your information, you agree that devx.com may send you DevX offers via email, phone and text message, as well as email offers about other products and services that DevX believes may be of interest to you. DevX will process your information in accordance with the Quinstreet Privacy Policy.


Automate Your Infrastructure with Ansible

Learn more about why Ansible is a valuable tool that provides a coherent model for safely provisioning, configuring and orchestrating multiple remote machines.




Application Security Testing: An Integral Part of DevOps

Ansible is a tool that allows you to control remote servers from the comfort of your laptop. It works over SSH and doesn't require any special software or agent to be installed on the remote machine. That doesn't sound like much. Any beginning sys admin knows how to run commands or complete shell scripts on remote servers. The power of Ansible comes from its sophisticated conceptual model and from its plugin-based module system. The combination of the two allows you to manage, configure and efficiently orchestrate hundreds, and even thousands, of servers. To top it all off, Ansible is implemented in Python which is my favorite language.

Playing with Ansible on Vagrant

Vagrant is a tool that lets you manage an army of virtual machines right from your laptop. See my First Steps with Vagrant article. It is the perfect companion for Ansible development and testing. In this article, I'll provide examples using a cluster of three Vagrant virtual machines. If you want to follow along, install VirtualBox and Vagrant and use this Vagrantfile:

# -*- mode: ruby -*-
# vi: set ft=ruby :
hosts = {
  "athos" => "",
  "porthos" => "",
  "aramis" => ""
Vagrant.configure("2") do |config|
  config.vm.box = "precise64"
  config.vm.box_url = "http://files.vagrantup.com/precise64.box"
 hosts.each do |name, ip|
    config.vm.define name do |machine| 
      machine.vm.network :private_network, ip: ip
      machine.vm.provider "virtualbox" do |v|
        v.name = name

To make the cluster easily accessible through SSH type:

vagrant ssh-config >> ~/.ssh-config

This will take care of all the gory details of SSH key-pair generation and user names. Now you should be able to just type:

ssh athos

This will land you in a live SSH session on the athos VM.

Quick Ad-Hoc Commands

Let's start by verifying that all the nodes in our cluster are up and get some information. The first step is to tell Ansible about the machines in our cluster with the inventory file. Ansible uses primarily YAML files, but also supports JSON files. Inventory files are a special format and use the .INI format where the sections are host groups and the items in each group are host names or IP addresses. Here is our inventory file called 'hosts':



To check if all the nodes are up use the 'ping' module:

ansible all -i hosts -m ping

You should see the following: http://i.imgur.com/1SWoMdz.png

The 'all' argument is a host pattern and means run the command on all the hosts. You may also specify individual hosts or host groups. The -i <filename> tells Ansible which inventory file to use and the -m is the module to run. There could be many other arguments. In particular, the remote user to run the command and the credentials, which could be a password or a private key file. In this case, Ansible picked up this information from the ~/.ssh/config file.

Here is another example to collect a lot of information using the 'setup' module. Let's run it only on group2:

ansible group2 -i hosts -m setup 

The output is too large, so I created a gist you can explore here.


Ad-hoc commands are all fun and games, but if you are serious then you would want to execute a different series of commands on different host groups. This is where playbooks come in. Playbooks are a list of plays. Each play is a list of tasks to be executed on a group of hosts (could be just one).

The following playbook contains a single play that Installs IPython on every host and then creates a user named gigi and copies the SSH public key to the target host.

-- -
- hosts: all
     - name: Install ipython
      apt: name=ipython state=installed
     - name: Create a user named gigi
     user: name=gigi
     - name: Add public key
     authorized_key: user=gigi key="{{ lookup('file', '~/.ssh/id_rsa.pub') }}" 

Here is how you run the playbook with the ansible-playbook command:

ansible-playbook -i hosts playbook.yml -- sudo 

It is pretty similar to the ansible command, but instead of specifying a module you just provide the name of a playbook. The -- sudo flag executes the playbook commands as sudo, which is often required.

The tasks themselves are actually modules. You can execute each task as an ad-hoc command using the Ansible tool, but it would be very tedious and error-prone. Whenever you have a logical sequence of commands that you want to execute at the same time, consider putting them in a playbook.

The syntax is pretty simple and there are good online reference for the available options for each module here.

Sometimes, you can use more sophisticated approaches with templates and dynamically discovering information during execution. For example, the last task: authroized_key uses the lookup plugin to fetch the content of the local user's id_rsa.pub file and write it to the remote authorized_keys file.


Playbooks are great and when you combine them with inventories you can execute the same playbooks with different inventory files. You will often need slight variations. For example, in our playbook I created a user called 'gigi' on all the hosts, but maybe I would like to create a different user for some of the hosts and the name of the user should not be hard-coded in the playbook. This is where vars (as in variables) come in.


Roles are a high-level abstraction that I use very often. I like to think of roles as the atomic unit of deployment. Roles group tasks together, but are not tied to particular hosts. Moreover, they can declare dependencies on other roles. This great because it allows the formation of hierarchies. For example, you may have many types of web applications and services that require all kinds of complicated provisioning and configuration. But, all of them may require a standard nginx installation. You can have a webserver role that other roles depend on and then in your playbook you just specify the top-level role. If at some point you want to change something in your nginx configuration, you just need to change your nginx role. All the other roles that depend on it will automatically get the changes.

Roles have their own directory structure that I will not get into right now, but they provide a lot of fine-grained control and flexibility. Here is an example of two roles: 'common' and 'webserver' where the webserver role depends on the common role:


Each role has its own directory under the roles parent dir. Then there are multiple directories for each one. The most important directory is 'tasks'. The 'tasks' directory will normally contain a file called main.yml with a list of tasks just like in the playbook. The 'meta' directory contains dependencies. For example the webserver's 'meta' directory will contain a file called main.yml with the following content:

-- -
- { role: common } 

This indicates that before the webserver's tasks are executed the common role will be executed first.

A playbook for installing the webserver role for a host group called webservers would be:

-- -
- hosts: webservers
 - webserver 

The Ansible Vault

Ansible often needs to transfer sensitive data or credentials to remote machines. While the communication is done over SSH, and is thus secure, the Ansible playbooks, roles, and in particular, var files must contain the secret information. These Ansible entities are often stored in source control systems. To protect sensitive data, Ansible provides the vault. It is a mechanism to store encrypted data that is decrypted on the fly and only when needed. The vault password should only be known to a small number of trusted individuals.

Tasks, roles and playbooks can refer to information stored in the vault and it will be available during execution. Remember that when running a playbook that requires vault access you need to provide the vault password.


Ansible is a structured and flexible remote execution framework that provides a coherent model for provisioning, configuring and orchestrating multiple remote machines in a secure way. It is based on abstractions like playbooks, roles, tasks and vars and is extensible via modules/plugins.

Gigi Sayfan is the chief platform architect of VRVIU, a start-up developing cutting-edge hardware + software technology in the virtual reality space. Gigi has been developing software professionally for 21 years in domains as diverse as instant messaging, morphing, chip fabrication process control, embedded multi-media application for game consoles, brain-inspired machine learning, custom browser development, web services for 3D distributed game platform, IoT/sensors and most recently virtual reality. He has written production code every day in many programming languages such as C, C++, C#, Python, Java, Delphi, Javascript and even Cobol and PowerBuilder for operating systems such as Windows (3.11 through 7), Linux, Mac OSX, Lynx (embedded) and Sony Playstation. His technical expertise includes databases, low-level networking, distributed systems, unorthodox user interfaces and general software development life cycle.
Comment and Contribute






(Maximum characters: 1200). You have 1200 characters left.



We have made updates to our Privacy Policy to reflect the implementation of the General Data Protection Regulation.
Thanks for your registration, follow us on our social networks to keep up-to-date