This Helm chart requires a Kubernetes secret named `braintrust-secrets` to exist in the namespace where the chart is installed. For Azure users, secrets are automatically synced from Azure Key Vault into Kubernetes (see below for details). AWS and Google users must manually create and manage the `braintrust-secrets` Kubernetes secret.
The `braintrust-secrets` secret must contain the following keys:
| Secret Key | Description | Format |
|---|---|---|
| `REDIS_URL` | Redis connection URL | `redis://<host>:<port>` |
| `PG_URL` | PostgreSQL connection URL | `postgres://<username>:<password>@<host>:<port>/<database>` (append `?sslmode=require` if using TLS) |
| `BRAINSTORE_LICENSE_KEY` | Brainstore license key | Valid Brainstore license key from the Braintrust Data Plane settings page |
| `FUNCTION_SECRET_KEY` | Random string for encrypting function secrets | Random string |
| `AZURE_STORAGE_CONNECTION_STRING` | Azure storage connection string | Valid Azure storage connection string (only required if `cloud` is `azure`) |
| `GCS_ACCESS_KEY_ID` | Google HMAC access key ID | Valid S3-style access key ID (only required if `cloud` is `google` and `enableGcsAuth` is `false`) |
| `GCS_SECRET_ACCESS_KEY` | Google HMAC secret key | Valid S3-style secret access key (only required if `cloud` is `google` and `enableGcsAuth` is `false`) |
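For example, AWS and Google users can create the secret from a manifest like the following (a minimal sketch; the values and namespace are placeholders to replace with your own):

```yaml
apiVersion: v1
kind: Secret
metadata:
  name: braintrust-secrets
  namespace: braintrust # placeholder: use the namespace where the chart is installed
type: Opaque
stringData:
  REDIS_URL: "redis://redis.example.internal:6379"
  PG_URL: "postgres://braintrust:changeme@pg.example.internal:5432/braintrust?sslmode=require"
  BRAINSTORE_LICENSE_KEY: "<license key from the Braintrust Data Plane settings page>"
  FUNCTION_SECRET_KEY: "<random string>"
```

Apply it with `kubectl apply -f` before installing the chart.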
If you're using Azure, the Azure Key Vault CSI driver is enabled by default and will automatically sync secrets from Azure Key Vault into Kubernetes. This eliminates the need to manually create and manage the `braintrust-secrets` Kubernetes secret.
To enable this feature:
1. Configure your Key Vault details:

   ```yaml
   azure:
     keyVaultName: "your-keyvault-name"
     keyVaultCSIclientID: "your-client-id" # This should come from the terraform module
     tenantId: "your-tenant-id"
   ```

2. Optionally, map your Key Vault secret names to the required Kubernetes secret keys. This is only required if you aren't using our terraform module; the defaults assume you are using the Braintrust terraform module to deploy the base infrastructure.

   ```yaml
   azureKeyVaultDriver:
     secrets:
       - keyVaultSecretName: "your-redis-secret-name"
         kubernetesSecretKey: "REDIS_URL"
         keyVaultSecretType: "secret"
       # ... other secret mappings
   ```
The CSI driver will:
- Mount the secrets from Key Vault into your pods
- Automatically sync them to the `braintrust-secrets` Kubernetes secret
- Keep the secrets in sync as they change in Key Vault
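For reference, a fuller mapping covering the required keys might look like the following (a sketch; the Key Vault secret names are illustrative and depend on how your vault is populated):

```yaml
azureKeyVaultDriver:
  secrets:
    - keyVaultSecretName: "redis-url" # illustrative name
      kubernetesSecretKey: "REDIS_URL"
      keyVaultSecretType: "secret"
    - keyVaultSecretName: "pg-url" # illustrative name
      kubernetesSecretKey: "PG_URL"
      keyVaultSecretType: "secret"
    - keyVaultSecretName: "brainstore-license-key" # illustrative name
      kubernetesSecretKey: "BRAINSTORE_LICENSE_KEY"
      keyVaultSecretType: "secret"
    - keyVaultSecretName: "function-secret-key" # illustrative name
      kubernetesSecretKey: "FUNCTION_SECRET_KEY"
      keyVaultSecretType: "secret"
    - keyVaultSecretName: "azure-storage-connection-string" # illustrative name
      kubernetesSecretKey: "AZURE_STORAGE_CONNECTION_STRING"
      keyVaultSecretType: "secret"
```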
Braintrust requires local SSDs for maximum disk performance. Configuration varies depending on whether you're using GKE Autopilot or Standard mode.
For Autopilot clusters, simply set the mode and the chart will automatically configure local SSDs:
cloud: "google"
google:
mode: "autopilot"
autopilotMachineFamily: "c4" # Machine family that supports local SSDs
brainstore:
reader:
volume:
size: "375Gi" # Local SSDs come in 375Gi increments (375, 750, 1125, etc.)
resources:
requests:
cpu: "8"
memory: "16Gi"
writer:
volume:
size: "375Gi"
resources:
requests:
cpu: "32"
memory: "64Gi"What happens:
- Autopilot automatically provisions C4 nodes with local SSDs
- Node selectors are added automatically (including
compute-class: Performancefor dedicated nodes) - Ephemeral-storage requests ensure proper SSD allocation
- Each brainstore pod gets its own dedicated node with full access to local SSDs
Supported machine families: `c4`, `c4d`
For Standard mode clusters, first create node pools with local SSDs, then configure the Helm chart:
cloud: "google"
google:
mode: "standard"
brainstore:
reader:
nodeSelector:
cloud.google.com/gke-nodepool: "brainstore" # Target your node pool
resources:
requests:
cpu: "44"
memory: "160Gi"
# Prevent readers and writers from sharing nodes
affinity:
podAntiAffinity:
requiredDuringSchedulingIgnoredDuringExecution:
- labelSelector:
matchExpressions:
- key: app
operator: In
values:
- brainstore-reader
- brainstore-writer
topologyKey: kubernetes.io/hostname
writer:
nodeSelector:
cloud.google.com/gke-nodepool: "brainstore"
resources:
requests:
cpu: "44"
memory: "160Gi"
# Prevent readers and writers from sharing nodes
affinity:
podAntiAffinity:
requiredDuringSchedulingIgnoredDuringExecution:
- labelSelector:
matchExpressions:
- key: app
operator: In
values:
- brainstore-reader
- brainstore-writer
topologyKey: kubernetes.io/hostnameWhat happens:
- Pods are scheduled on your pre-configured node pools
- Local SSDs are automatically available via emptyDir volumes
- Pod anti-affinity ensures readers and writers don't share nodes (each pod gets dedicated node access)
This Helm chart includes comprehensive automated unit tests.
```bash
# Run all tests
./test.sh
```

With version 2 of this Helm chart, the Brainstore pods are split into readers and writers, which improves performance and allows read and write capacity to be scaled independently. Existing customers who have deployed via our Helm chart or by other means on Kubernetes should update their override values file or deployment to match this change. No data will be lost, but there will be a brief downtime while the existing Brainstore pods are removed and the new reader and writer pods are launched.
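If your existing overrides set Brainstore values at the top level, they will likely need to move under the new `reader` and `writer` keys. A sketch of the change (the v1-style structure shown is assumed for illustration; adapt it to your actual overrides):

```yaml
# Before (v1-style override; exact structure assumed for illustration)
brainstore:
  resources:
    requests:
      cpu: "8"
      memory: "16Gi"

# After (v2: readers and writers are configured independently)
brainstore:
  reader:
    resources:
      requests:
        cpu: "8"
        memory: "16Gi"
  writer:
    resources:
      requests:
        cpu: "8"
        memory: "16Gi"
```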
This release contains a breaking change for Azure customers only: it introduces the Azure Container Storage CSI driver.
This version of the Helm chart prepares for version 2.0.0 of the Braintrust self-hosted data plane. Starting with 1.1.32, Brainstore needs to reach the API service, which it previously did not communicate with. In the Helm chart, this traffic goes over the internal Kubernetes endpoint. If you have additional security restrictions or limit traffic between services, this traffic must be allowed before upgrading to 2.0.0 of the data plane.
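For example, if you enforce NetworkPolicies between services, you would need a rule along these lines before upgrading (a sketch; the pod labels and namespace are illustrative and should match what is actually deployed in your cluster):

```yaml
apiVersion: networking.k8s.io/v1
kind: NetworkPolicy
metadata:
  name: allow-brainstore-to-api
  namespace: braintrust # placeholder: use the namespace where the chart is installed
spec:
  # Applies to the API pods; label is illustrative
  podSelector:
    matchLabels:
      app: braintrust-api
  policyTypes:
    - Ingress
  ingress:
    # Allow traffic from the Brainstore pods; label is illustrative
    - from:
        - podSelector:
            matchLabels:
              app: brainstore
```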
We are also increasing the default sizing of our deployments; please ensure your node pools have enough capacity for these increased defaults.
This release adds new Brainstore fast readers and enables them by default. Fast readers are isolated Brainstore nodes that handle the common, known-safe queries that power the Braintrust UI. This isolates resource-intensive ad hoc queries to the standard Brainstore reader nodes, which helps keep the UI responsive. You may need to adjust your Helm values.yaml overrides if you have changed any defaults for the standard Brainstore reader nodes. We recommend sizing your fast readers the same as your existing readers and starting with only two nodes.
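Mirroring the standard reader sizing with two fast reader nodes might look like this (a sketch; the `fastReader` key and `replicas` field are assumed for illustration, so check the chart's values.yaml for the actual structure):

```yaml
brainstore:
  fastReader: # key name assumed for illustration; check the chart's values.yaml
    replicas: 2 # start with only two nodes
    resources:
      requests:
        cpu: "44" # match your existing reader sizing
        memory: "160Gi"
```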
Additionally, if you have custom readiness checks, please unset those customizations and use our new default readiness checks. There is a bug in the data plane where the endpoint we previously used for readiness checks would never recover after a failure.
Example values files for different cloud providers and configurations are located in the `examples/` folder.