Link

Retain Confidentiality in Open Source Infrastructure

Infrastructure as a Code principles implies that your have ability to tear up/down entire setup at any point of time. If you cannot make it then you have some code, duct tape and WD-40. The usage of commodity cloud platforms makes unnecessary to maintain an infrastructure code in proprietary repositories. Anyway, comparable design patterns are used over and over again. The Open Source Infrastructure is next logical evolution, tools like AWS CDK makes infrastructure code sharable, composable and reusable in various use-cases.

Going to Open Source does not mean that you have to disclosure private configurations with entire community. How to leverage confidentiality and openness same time? I am asking this question every time when committing IaaC features to open source repositories. The Key Vault is a right tool to address confidentiality challenge. I’ve began to use AWS Secret Manager in my projects. Let me elaborate here…

Often, Open Source Infrastructure uses existing cloud resources or config such as Hosted Zone Id, S3 Bucket Names, Internal Domain Names, not mentioned a secret keys. No one benefits if this information is exposed. It would not allow to re-use high-order infrastructure component. Its have to be configurable via side-channel. AWS CDK allows to use either stack parameters or environment variables. However, this does not sound as an ultimate solution to retain confidentiality. You advance a problem to next level. Your CI/CD gets a responsibility to parametrize infrastructure deployment with confidential configuration. A circle is closed - how to protect information at CI/CD? CI/CD advertises the usage of encryption methods or key vaults.

Key Vault

It is possible to use AWS Secret Manager directly from infrastructure code and making it to be a central place to keep sensitive configuration. As a side node, the secret manager uses AWS KMS inside.

The secret store becomes a config repository for your infrastructure. You can use aws command line tool to manage config as JSON objects

cat > config.json << EOF
{
  "confidential": "secret"
}
EOF

aws secretsmanager create-secret \
  --name MyConfig \
  --secret-string file://config.json

Once the storage is defined, use aws command line to retrieve and update values.

aws secretsmanager get-secret-value \
  --secret-id arn:aws:secretsmanager:eu-west-1:000000000000:secret:MyConfig-xxxxxx \
| jq '.SecretString | fromjson' > config.json

aws secretsmanager update-secret \
  --secret-id  arn:aws:secretsmanager:eu-west-1:000000000000:secret:MyConfig-xxxxxx \
  --secret-string file://config.json

Key Vault usage with AWS CDK

Once, the config is defined you are able to use confidential data inside open source infrastructure. Let me show a two examples for you.

The first examples uses class hierarchy of AWS CDK, please see the official guideline about it.

import * as secret from '@aws-cdk/aws-secretsmanager'
import * as dns from '@aws-cdk/aws-route53'

// fetch secret
const config = secret.Secret.fromSecretAttributes(stack, 'Secret', {secretArn: 'MyConfig'})
const secret = config.secretValueFromJson('confidential').toString()

// use secret
const zone = dns.HostedZone.fromHostedZoneAttributes(parent, 'HostedZone', {hostedZoneId: secret})

High Order Component: config

Let’s shift a focus from category of class hierarchy to category of pure functions. It is based on purely functional AWS CDK extension. The library aws-cdk-pure-hoc implements config component to fetch data from AWS Secret manager. The component provides a single function to read string values

function String(key: string, bucket?: string ): IPure<string>

If bucket is not defined then AWS_IAAC_CONFIG environment variable is used.

Here is a full example of config HoC usage

import * as config from 'aws-cdk-pure-hoc/config'

// fetch secret
config.String('confidential', 'MyConfig').flatMap(MyHostedZone)

// use secret
function MyHostedZone(secret: string) { ... }

Afterwords

The twelve-factor application principles advices environment variables to store the config.

Env vars are easy to change between deploys without changing any code; unlike config files, there is little chance of them being checked into the code repo accidentally; and .. they are a language- and OS-agnostic standard.

The discussed technique gives you a secure approach to retain confidentiality of your configuration. It is an environment variables managed by Key Vault service.