My first exposure to Terraform was when I started to work with data in the AWS cloud. As a Cloud Data Engineer I was expected to cover a wide range of infrastructure provisioning so I found myself having to learn rather more about cloud infrastructure than I had expected.
Understanding the problem
We want repeatability when deploying infrastructure across environments. A web-based Cloud/SaaS console remains useful for visualising your infrastructure, but for actual maintenance we must have infrastructure-as-code (IAC). This is just as we have SQL scripts to develop and maintain our databases.
When I started using AWS, it had three methods of spinning up infrastructure:
From the web console
Using the aws command line tool (CLI)
Writing CloudFormation templates.
Nowadays we also have the AWS CDK, which allows us to maintain infrastructure in a programming language in which we might already have skills:
Python
Java
JavaScript/TypeScript
C#
Go
The AWS CLI has its uses, though for IAC I would keep it as a lump hammer for when all else has failed; it isn't really intended for scripting up and maintaining a large infrastructure estate.
CloudFormation was fine for basic activities, but as our needs became more sophisticated it became a bit of a nightmare. We also had to consider other SaaS products such as GitHub and working with other cloud providers. Maintaining several products is especially hard when each product and cloud has its own approach to IAC.
Terraform removed a lot of the stress by providing a common approach across all products. A look at the HashiCorp Providers page shows the breadth of providers available.
What is Terraform?
Terraform is a command line tool that allows you to define your infrastructure in the HashiCorp Configuration Language (HCL). HCL acts as an abstraction layer for the cloud APIs and libraries. Just like SQL, it is a declarative language: you tell it what you want the end result to be and it works out how it is going to achieve it. HCL also provides capabilities such as the following.
Populating placemarkers in a template file with appropriate values
Reading file formats such as YAML and JSON into Terraform data structures
Extracting attributes from data structures and building other structures
Look at the last point: HCL does not edit data; it takes an immutable input and provides an immutable output. HCL is a declarative, functional programming language.
What are the main components of Terraform?
There are three types of component in Terraform
Plug-in
Provider (which is a type of Plugin). It will be written in Go and talks directly to Terraform over RPC.
Module
A unit of code you write in Terraform is called a module, and your modules can be made up of other modules. An example would be a module that defines the infrastructure for pulling data (extraction) from outside your organisation. Sub-modules could include the following infrastructure, as sketched after this list.
Retrieving data using an API
Taking a Docker image and running it as a container to extract data from an external SQL database
Pulling data from an SFTP site
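As a rough sketch of what that composition might look like (the module paths and inputs here are hypothetical), the extraction module could call its sub-modules like this:
# extraction/main.tf (hypothetical layout)
module "api_retrieval" {
  source       = "./modules/api_retrieval"     # hypothetical sub-module path
  api_base_url = "https://api.example.com"     # illustrative value
}

module "sftp_pull" {
  source    = "./modules/sftp_pull"            # hypothetical sub-module path
  sftp_host = "sftp.example.com"               # illustrative value
}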
A Terraform provider is a wrapper for an API, and within that API its endpoints are represented by the following:
Resource - Creates or updates infrastructure. In SQL terms this is akin to CREATE or ALTER.
Data - Retrieves information, or transforms an input document into a different format. Think of it as being a DESCRIBE or SHOW query, or a SELECT against INFORMATION_SCHEMA objects.
Terraform resource and data objects are also modules; however, when we talk about Terraform modules we tend to mean something we write ourselves, as represented by the graphic below.
General view of Terraform components
Let's look at these in a bit more detail.
Providers
Let us suppose that I want to use the GitHub provider. I would set up a requirements.tf file as follows.
I am telling Terraform that I wish to use a minimum of Terraform version 1.9.0 and that the GitHub provider (plugin) must be at least version 6.2. I have configured my GitHub provider so that the owner is my company, the DJP Corporation.
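A minimal sketch of such a file, assuming the integrations/github provider source and an illustrative organisation slug for the owner:
terraform {
  required_version = ">= 1.9.0"

  required_providers {
    github = {
      source  = "integrations/github"
      version = ">= 6.2"
    }
  }
}

provider "github" {
  owner = "djp-corporation"   # illustrative organisation slug
}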
Provider authentication
For authentication Terraform does read environment variables, and for GitHub the default is to look for an environment variable called GITHUB_TOKEN. This will contain a GitHub PAT (Personal Access Token) that I set up in GitHub and configure to expire every 30 days.
For AWS I use the aws sso (AWS Single Sign-On) command line and a simple shell script that assumes the relevant AWS role and makes the temporary AWS credentials available for my session as environment variables.
One of the challenges with Terraform authentication is granting just the permissions you need to deploy the infrastructure you want. The way my company approaches this is that, in the development environment, the data engineers have close to admin privileges; deployment to higher environments can only take place using the CICD pipeline.
Variables
Again, think of a module as being a function or stored procedure. It must have its input parameters defined and variables are how we define those input parameters. The convention (but not an enforced rule) is to store those variables in a file called variables.tf.
Here are two examples of variable declaration.
variable "api_concurrency" {
description = "The maximum number of concurrent api calls that are allowed to run at one time."
type = number
default = 16
}
variable "log_level" {
description = "Valid values are NONE, ERROR, WARN, INFO, DEBUG"
type = string
validation {
condition = contains(["NONE", "ERROR", "WARN", "INFO", "DEBUG"], var.log_level)
error_message = "log levels must be one of NONE, ERROR, WARN, INFO, DEBUG"
}
}
So variables are declared using a variable {} block, but they are referenced in Terraform code using the var prefix, for example var.log_level.
The minimum declaration would be the variable type; however, my company standard is also to provide a description that is meaningful to the maintainer.
The second declaration illustrates that you can also choose to validate the input for your variables.
Your variable's type can be one of the following.
Primitives, which are number, string and bool
Collection types such as list, set or map
Structural types such as object
These can be as complex or as simple as you need. For example, if we wanted a variable that held a list of columns we might have something similar to the code below.
variable "column_list" {
type = list(object({
name = string
alias = optional(string, null)
type = string
required = optional(bool, true)
unique = optional(bool, false)
exclude_column = optional(bool, false)
}))
}
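Calling code only needs to supply the attributes it cares about; the optional ones fall back to their defaults. A rough sketch of passing a value for this variable (the module path and column definitions are illustrative):
module "extract_orders" {
  source = "./modules/extraction"   # hypothetical module path

  column_list = [
    { name = "order_id", type = "int", unique = true },
    { name = "order_ref", type = "string", alias = "reference" }
  ]
}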
Outputs
The convention is to name the file outputs.tf.
Just as variables represent the contract for the inputs to a module, outputs describe what the module will return. Using the example of our extraction module, we may also have an ingestion module that needs to know the values of some of the extraction module's outputs.
An output can be a primitive type such as a string, number or boolean, or the complex output from a module.
output "rds_instance" {
value = module.postgres_rds_instance
description = "The full set of exposed properties for the Postgres RDS instance created within the module"
}
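To show how that contract gets used, here is a rough sketch of a root module wiring the extraction module's output into an ingestion module; the module paths and the ingestion variable name are assumptions:
module "extraction" {
  source = "./modules/extraction"                 # hypothetical path
}

module "ingestion" {
  source       = "./modules/ingestion"            # hypothetical path
  rds_instance = module.extraction.rds_instance   # the output declared above, assuming ingestion declares a matching variable
}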
The minimum definition is the value, however a description meaningful to the users of the module is a sensible standard to adopt, particularly if you write modules that are to be shared as common components.
By default Terraform will echo all outputs to the terminal. If you have some outputs that you don't want visible then adding sensitive = true to the output block will stop that happening.
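For example, a sketch of a sensitive output, assuming the module exposes a password attribute:
output "rds_master_password" {
  description = "Master password generated for the Postgres RDS instance"
  value       = module.postgres_rds_instance.master_password   # assumed attribute name
  sensitive   = true
}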
Locals
The convention is to name the file locals.tf.
This is where we can use HCL's built-in functions to read data structures and write out other data structures. I'll illustrate this with an example.
# This would normally be in the module variables.tf file.
variable "name" {
  description = "The list of strings that will make up resource names"
  type        = list(string)
}

# This would be in the module locals.tf file
locals {
  name = concat(var.name, ["api", "data"])

  data_content = templatefile("${path.module}/templates/api-data-retrieval.json",
    {
      lambda_function_arn   = module.api_data_lambda.aws_lambda_function_arn
      lambda_name           = join("_", local.name)
      support_email_address = var.support_email_address
      hostname              = var.api_base_url
    }
  )
}
So we have our extraction module that calls our api_data module.
var.name is passed into our api_data module from our extraction module and contains a list defining the name of our overall application, say ["external", "extract"]
The concat function will join these lists together to produce ["external", "extract", "api", "data"]
templatefile is a built-in HCL function that allows us to read a file containing place markers and replace those place markers with the values we assign to them. In this case we are submitting the following.
The output from the api_data_lambda module
external_extract_api_data which we get by joining our local.name elements with an underscore character.
Variables declared in the module passed in using the var prefix.
The utility module
Almost all the modules I have seen are for spinning up infrastructure. At a minimum they comprise the following.
At least one provider
Variables
At least one resource module
Possibly a data module
Optionally but usually outputs
Optionally locals
In older versions of Terraform the capabilities for making infrastructure more data driven were limited. Today we might have a JSON or YAML file that describes the properties we wish to use to generate GitHub repositories. Our locals file extracts and builds the sets of data required to generate infrastructure in a loop. This has allowed us to come up with modules that don't spin up infrastructure; they just carry out the data transformation necessary to provide data for other modules to use.
Such a module will consist of the following
Variables for input
Locals to transform that input
Output to expose the transformed input.
We simply call this utility module from whatever module needs it.
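As a rough sketch of such a utility module (the YAML layout and file names are assumptions on my part), the three parts might look like this:
# variables.tf - the input contract
variable "config_file" {
  description = "Path to a YAML file describing the GitHub repositories to create"
  type        = string
}

# locals.tf - transform the input
locals {
  # Assumes the YAML file has a top-level "repositories" list,
  # each entry having at least a "name" attribute.
  repositories = { for repo in yamldecode(file(var.config_file))["repositories"] : repo.name => repo }
}

# outputs.tf - expose the transformed data
output "repositories" {
  description = "Map of repository name to its properties, ready for a for_each loop"
  value       = local.repositories
}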
Resources
The convention is for the entry point for a module to be called main.tf.
A resource is what creates or alters infrastructure. It has two arguments, plus the parameters we wish to pass within the {} braces. The arguments are as follows.
The function name/API endpoint for the object we wish to create
A label that we can use to refer to the resource outputs. Again, remember a resource is also a module.
Earlier my provider example used GitHub, so we can now create a GitHub repository. The code in our main.tf file would contain the following.
resource "github_repository" "blog_content" {
name = "sql_server_central_content"
description = "Keeps graphics, example code and documents for SSC articles"
}
Only the name property is mandatory, but where a resource provides a property to supply a description I strongly recommend you use it.
Once our GitHub repository has been created we can create other objects either in, or related to, that object. These could include repository topics, as sketched below.
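For example, topics could be attached with the GitHub provider's github_repository_topics resource; the topic values here are illustrative:
resource "github_repository_topics" "blog_content" {
  # Referencing the resource, rather than hard-coding the name, lets
  # Terraform infer the dependency between the two objects.
  repository = github_repository.blog_content.name
  topics     = ["sql-server", "terraform", "articles"]   # illustrative topics
}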
So why don't we just put name = "sql_server_central_content" in the code to create repository topics? Because Terraform is a declarative language, the dependency between the github_repository resource and the github_repository_topics resource means that the repository will be created first. If you hard-code the name then Terraform won't know that there is a relationship and might try to create the topics before there is a repository in which to create them. This would fail.
For GitHub, resources use the name of the repository, which we provide. For other Terraform providers such as AWS, resources have an identifier known as an ARN (Amazon Resource Name), which isn't known until the object is created.
Data
As mentioned earlier, data modules are akin to SELECT statements. They have the same two arguments that a resource does but their parameters tend to be those items that, in database terms, would be indexed columns we would use in our WHERE clause.
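For example, a sketch that uses the GitHub provider's github_repository data source to look up an existing repository (the full_name value is illustrative):
data "github_repository" "existing" {
  full_name = "djp-corporation/sql_server_central_content"   # illustrative owner/repo
}

output "existing_repository_url" {
  description = "Browser URL of the repository we looked up"
  value       = data.github_repository.existing.html_url
}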
The Terraform State file
So far I have not mentioned the most important Terraform component: the Terraform state file, which is a big JSON file that keeps track of the infrastructure that your Terraform code has deployed.
As AWS users we keep our state files in an S3 bucket with versioning switched on so that we can retrieve previous versions of that file.
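A sketch of that kind of backend configuration, with an illustrative bucket name, key and region:
terraform {
  backend "s3" {
    bucket = "djp-terraform-state"            # illustrative bucket name
    key    = "extraction/terraform.tfstate"   # illustrative key within the bucket
    region = "eu-west-2"                      # illustrative region
  }
}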
The relationship between your code, state file and infrastructure can be summed up in the table below.
Your code files - What you want the infrastructure to be when Terraform carries out a deployment.
Terraform state file - Terraform's view of your infrastructure as of the last deployment.
Infrastructure - What your infrastructure actually is: both infrastructure deployed by Terraform and other infrastructure deployed by other means.
The diagram below shows what happens when we tell Terraform to produce an execution plan or to deploy infrastructure.
Understanding the relationship between Terraform code, the contents of the state file and infrastructure
Take the contents of the State file and update it from the real infrastructure
Work out what needs to be deployed (created), updated or removed
Build an execution plan that takes into account the various dependencies
Deploy infrastructure based on the execution plan
How Terraform reacts to infrastructure that has changed outside of your code depends on the scenario.
Created outside of Terraform - No action. Terraform has no awareness or knowledge of this infrastructure.
Created by Terraform, updated outside of Terraform - Terraform will see the difference and update that item to match your code.
As a general rule don't mix and match your infrastructure creation methods. If you create something without using Terraform and later try and create it with Terraform then you will get an error.
If you are looking to adopt Terraform and have existing infrastructure then from Terraform 1.5.0 there is an import statement that lets Terraform become aware of infrastructure and place the appropriate entries in its state file.
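For example, a sketch of an import block for the repository created earlier; for GitHub repositories the import ID is, as far as I am aware, simply the repository name:
import {
  to = github_repository.blog_content
  id = "sql_server_central_content"   # existing repository to bring under Terraform management
}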
It is also possible to tell Terraform to forget that it manages a particular piece of infrastructure.
Deploying your code
The basic commands you need to know are as follows.
terraform init - Downloads all relevant providers and plugins.
terraform plan - Refreshes the state file then produces an execution plan but does not make any infrastructure changes. You can store the execution plan for later use with the -out parameter.
terraform apply - Deploys your infrastructure, pausing to ask you to confirm deployment after displaying the execution plan. If you used the -out parameter with terraform plan then you can tell Terraform to apply the previously generated plan.
terraform fmt - Formats your Terraform code.
terraform destroy - Destroys any infrastructure for which a state file reference exists, or at least attempts to.
As a general comment, I found Terraform relatively easy to learn and its concepts easy to pick up. For me there were some pain points when learning to use Terraform for real infrastructure.
Infrastructure knowledge
This is a big one. You have to gain a thorough understanding of the infrastructure you are proposing to manage. If your infrastructure is complex then the scope of what you have to understand can be huge. I found it relatively easy to understand the basics of the AWS services we use but struggled with the security policies and roles and the plumbing to connect the components together securely.
No dedicated IDE
If you are used to programming in one of the JVM or .NET languages then debugging Terraform code can be frustrating due to the absence of things I took for granted in the IDEs used for traditional languages.
In something like PyCharm I can click on a Python function and a menu option or key combination will tell me everywhere that function is used. As far as I am aware there is nothing like that for Terraform. There is no REPL (Read Evaluate Print Loop) capability as there is in most IDEs. There are plug-ins for VS Code, but I would class these as "better than nothing".
You can't really step through your code and there are no breakpoints. I found that in many cases the values I needed to debug the locals.tf file were marked as "only available after apply".
Performance
Under the hood Terraform is calling APIs and it can make a huge number of calls to those APIs. It feels like Jeff Moden's RBAR (Row By Agonising Row). A terraform plan or terraform apply can be slow, especially when you are used to traditional languages.
When using the GitHub provider the underlying API appears to be rate limited. That is, when you send a lot of traffic to it the rate limiter deliberately slows down the responses. If you manage a significant number of GitHub resources this deliberate slow-down can be very frustrating.
Provider upgrades
These are double-edged swords. The plus side is that the providers, and Terraform itself, are making worthwhile improvements to their functionality at a dramatic rate.
The down side is that the pace of change can be punishing with more deprecations than you might expect.
Infrastructure destruction
For our reusable Terraform modules our CICD pipeline performs various apply/destroy cycles to test those modules. Although there has been a marked improvement in Terraform, there are still cases where the execution plan gets the order of destruction wrong, which causes failures. Terraform does have a depends_on attribute, which is akin to a query hint, though this is fallible.
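For completeness, here is a sketch of the depends_on meta-argument with illustrative repository names, covering the case where nothing in the second resource's arguments refers to the first:
resource "github_repository" "docs" {
  name = "team-docs"                        # illustrative
}

resource "github_repository" "docs_archive" {
  name = "team-docs-archive"                # illustrative

  # Hidden dependency: ensure this repository is created after the one
  # above (and destroyed before it), even though no argument here refers to it.
  depends_on = [github_repository.docs]
}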
As a beginner it takes time to work out whether a refusal to destroy is down to legitimate provider safeguards rather than Terraform execution plan hiccups. When deploying cloud infrastructure you must check for left behind infrastructure if you want to avoid unexpected bills.
Hints and tips
If an infrastructure or Terraform object has a description property then use it and make its contents meaningful.
Devise and apply a carefully thought out naming convention. Given that IDEs offer limited help in navigating Terraform projects, a good naming convention will help enormously.
Use terraform fmt and also tflint. If you are working with AWS there is a specific plugin called tflint-ruleset-aws.
If you need to run different versions of Terraform then, on Mac and Linux, consider the tfenv utility, which allows you to switch between versions.
Being able to iterate over a data structure that is easy to maintain in order to generate infrastructure is extremely useful. However, there is a balance to be struck between keeping your Terraform resource code terse and easy to understand, and the complexity that may appear in any locals {} block.
There are some things that Terraform can do that are not its core strength, and HashiCorp themselves warn against using it as an infrastructure post-provisioning configuration tool. Another example is where a database provider allows the creation of schemas, tables and views. For platforms like Snowflake there are cloud and Snowflake objects that need to be created with awareness of each other, but I wouldn't use this as THE way of deploying all DB objects.