RBE Setup
Getting started with Remote Build Execution (RBE) is less daunting than it may seem. We've put together a guide that not only helps you get started with BuildBuddy RBE, but also helps you understand what is going on under the hood.
This guide assumes you're using BuildBuddy Cloud or BuildBuddy Enterprise on-prem.
The basics
The very simplest Bazel command needed to enable RBE is the following:
bazel build //... --remote_executor=grpcs://remote.buildbuddy.io
This points Bazel at BuildBuddy Cloud as a remote executor. A simple repo that has no C/C++/CGO or Java dependencies will build just fine like this. Most interesting repos have some dependencies on C/C++/CGO or Java - so we'll need to tell our remote executors where to find tools like gcc or the JRE. We do this with platforms and toolchains.
Configuring your workspace
There are several options for configuring your platforms and toolchains, the most fully features of which being bazel-toolchains. It comes with an rbe_autoconfig
rule that works nicely with BuildBuddy.
Unfortunately, bazel-toolchains has a dependency on Docker and can take quite some time to start up in a clean workspace, so we provide a simple and easy-to-use BuildBuddy toolchain that enables you to get up and running quickly, and works for most use cases.
To get started with the BuildBuddy Toolchain, add the following lines to your WORKSPACE
file:
http_archive(
name = "io_buildbuddy_buildbuddy_toolchain",
sha256 = "747dbf28cb8b8d27b2d909aa05e00691fe6d9d8a28026e359cc4943261687592",
strip_prefix = "buildbuddy-toolchain-702567fd8a561ec94a0e8e7fd8aa00bb15d87b4f",
urls = ["https://github.com/buildbuddy-io/buildbuddy-toolchain/archive/702567fd8a561ec94a0e8e7fd8aa00bb15d87b4f.tar.gz"],
)
load("@io_buildbuddy_buildbuddy_toolchain//:deps.bzl", "buildbuddy_deps")
buildbuddy_deps()
load("@io_buildbuddy_buildbuddy_toolchain//:rules.bzl", "buildbuddy")
buildbuddy(name = "buildbuddy_toolchain")
Platforms
The first thing you'll want to do is tell BuildBuddy RBE in what environment you'll want to run your build actions. This is tools can be found in different locations on different platforms. This is done with the --host_platform
, --platforms
, and --extra_execution_platforms
flags.
BuildBuddy's default platform is Ubuntu 16.04 with Java 8 installed. We can specify this platform with the --host_platform
, --platforms
, and --extra_execution_platforms
flags:
--host_platform=@buildbuddy_toolchain//:platform
--platforms=@buildbuddy_toolchain//:platform
--extra_execution_platforms=@buildbuddy_toolchain//:platform
If you want to use a different environment, you can specify a custom Docker container image to use. More information on how to do this can be found in our platforms documentation.
Toolchains
Toolchains sound complicated (and they can be) - but the concept is simple. We're telling our remote executors where to find tools that are needed to build our code.
C toolchain
The first toolchain you'll likely run into the need for is a C/C++ compiler. Even if your code isn't written in one of these languages, it's likely that one of your dependencies is - or calls some C code with something like cgo.
You'll know you need a C toolchain when you see an error for a missing gcc or clang that looks like:
exec: "/usr/bin/gcc": stat /usr/bin/gcc: no such file or directory
To use BuildBuddy's default C toolchain, we can use the --crosstool_top
and --extra_toolchains
flag:
--crosstool_top=@buildbuddy_toolchain//:toolchain
--extra_toolchains=@buildbuddy_toolchain//:cc_toolchain
If you're looking for an llvm based toolchain instead, take a look at this project.
Java toolchain
If your project depends on Java code, you'll need to set the following flags:
--java_language_version=11
--tool_java_language_version=11
--java_runtime_version=remotejdk_11
--tool_java_runtime_version=remotejdk_11
Available versions are listed in Bazel's User Manual.
If you need a custom Java toolchain, see Bazel's docs on Java toolchain configuration.
Java toolchain for older Bazel versions
If your project is using a Bazel version before 6.0.0, you will need the following 4 flags instead. They will tell the executors where to look for Java tools.
Setting both the old flags and the new flags will result in an error and may result in incorrect toolchain selection.
See https://github.com/bazelbuild/bazel/issues/7849 for more information.
Using BuildBuddy's default Java 8 config:
--javabase=@buildbuddy_toolchain//:javabase_jdk8
--host_javabase=@buildbuddy_toolchain//:javabase_jdk8
--java_toolchain=@buildbuddy_toolchain//:toolchain_jdk8
--host_java_toolchain=@buildbuddy_toolchain//:toolchain_jdk8
If you need a different version of Java, we recommend using bazel-toolchains for now.
Attributes
Some tools like Bazel's zipper (@bazel_tools//tools/zip:zipper) use an attribute to determine whether or not they're being run remotely or not. For tools like these to work properly, you'll need to define an attribute called EXECUTOR
and set it to the value remote
.
--define=EXECUTOR=remote
Putting it all together
This can be a lot of flags to tack onto each bazel build, so instead you can move these to your .bazelrc
file under the remote
config block:
build:remote --remote_executor=grpcs://remote.buildbuddy.io
build:remote --host_platform=@buildbuddy_toolchain//:platform
build:remote --platforms=@buildbuddy_toolchain//:platform
build:remote --extra_execution_platforms=@buildbuddy_toolchain//:platform
build:remote --crosstool_top=@buildbuddy_toolchain//:toolchain
build:remote --extra_toolchains=@buildbuddy_toolchain//:cc_toolchain
build:remote --java_language_version=11
build:remote --tool_java_language_version=11
build:remote --java_runtime_version=remotejdk_11
build:remote --tool_java_runtime_version=remotejdk_11
build:remote --define=EXECUTOR=remote
And running:
bazel build //... --config=remote
Authentication
You'll want to authenticate your RBE builds with either API key or certificate based auth. For more info on how to set this up, see our authentication guide.
Configuration options
--jobs
This determines the number of parallel actions Bazel will remotely execute at once. If this flag is not set, Bazel will use a heuristic based on the number of cores on your local machine. Your builds & tests can likely be parallelized much more aggressively when executing remotely. We recommend starting with 50
and working your way up.
--jobs=50
--remote_timeout
This determines the maximum time Bazel will spend on any single remote call, including cache writes. The default value is 60s. We recommend setting this high to avoid timeouts when uploading large cache artifacts.
--remote_timeout=600
--remote_download_minimal
By default, bazel will download intermediate results of remote executions - so in case an artifact isn't found in the remote cache, it can be re-uploaded. This can slow down builds in networks constrained environments.
This can be turned off with the flag:
--remote_download_minimal
While this flag can speed up your build, it makes them more sensitive to caching issues - and likely shouldn't be used in production yet.
--remote_instance_name
If you'd like separate remote caches, whether it's for CI builds vs local builds or other reasons, you can use the remote_instance_name
flag to namespace your cache artifacts:
--remote_instance_name=buildbuddy-io/buildbuddy/ci
--disk_cache
While setting a local disk cache can speed up your builds, when used in conjunction with remote execution - your local and remote state has the opportunity to get out of sync. If you suspect you're running into this problem, you can disable your local disk cache by setting this to an empty value.
--disk_cache=
--incompatible_strict_action_env
Some rules (like protobuf) are particularly sensitive to changes in environment variables and will frequently be rebuilt due to resulting cache misses. To mitigate this, you can use the incompatible_strict_action_env
which sets a static value for PATH
.
--incompatible_strict_action_env
--action_env, --test_env
You can set environment variables that are available to actions with the --action_env
flag.
If you want to set environment variables that are only available to test actions, you can use the --test_env
flag.
This is commonly used to set environment variables such as GO_TEST_WRAP_TESTV=1
, which modifies how the go_test
rule handles the XML reports.
--test_env=GO_TEST_WRAP_TESTV=1
--action_env
Bazel docs
--test_env
Bazel docs
--repo_env
This flag sets additional environment variables that are available to repository rules executions. It's commonly used to set environment variables that are used to augment how toolchains and external repositories are prepared.
# Tell Bazel to not detect the CC toolchain automatically from the host machine to improve hermeticity.
# This requires you to declare and register a CC toolchain explicitly.
--repo_env=BAZEL_DO_NOT_DETECT_CPP_TOOLCHAIN=1
# Tell Gazelle's go_repository rule to use the path set in GOMODCACHE (or GOPATH) as a local cache directory.
# This can be useful to speed up Go modules downloads.
--repo_env=GO_REPOSITORY_USE_HOST_CACHE=1
--repo_env=GOMODCACHE=/some-path/my-go-mod-cache
--define
Define allows you to assign build variables. This is commonly use to set EXECUTOR
to compile singlejar and ijar from source.
--define=EXECUTOR=remote
--spawn_strategy
Sets the list of strategies in priority order from highest to lowest. Each action picks the highest priority strategy that it can execute. The default value is remote,worker,sandboxed,local
.
--strategy=remote,local
--strategy
Explicitly setting strategies should no longer be needed for Bazel versions post 0.27.0. It can be used to force certain bazel mnemonics to be build remotely.
--strategy=Scalac=remote
--remote_execution_priority
Sets the priority of remotely executed actions within a build, relative to other builds within your BuildBuddy organization.
The default value is 0, and BuildBuddy accepts values between -1000 and 1000 (inclusive). Lower priority values cause actions to be executed before actions with larger priority values.
Examples:
# Run a build that is deprioritized relative to other builds in the org:
bazel test //experimental/... --remote_execution_priority=1000
# Run a build that is prioritized over all other builds in the org:
bazel run //emergency_deployment --remote_execution_priority=-1000
Note: remote execution priority is applied on a best-effort basis. Setting this flag doesn't provide a strong guarantee of execution ordering.
--experimental_inmemory_dotd_files
If enabled, C++ .d files will be passed through in memory directly from the remote build nodes instead of being written to disk. This flag is automatically set when using --remote_download_minimal
.
--experimental_inmemory_dotd_files
--experimental_inmemory_jdeps_files
If enabled, .jdeps files generated from Java compilations will be passed through in memory directly from the remote build nodes instead of being written to disk. This flag is automatically set when using --remote_download_minimal
.
--experimental_inmemory_jdeps_files
Examples
- buildbuddy-io/buildbuddy .bazelrc --config=remote
- graknlabs/grakn .bazelrc --config=rbe
- wix/exodus .bazlerc.remote
Advanced configuration
If you need a more advanced configuration than provided by the basic BuildBuddy toolchain, we recommend exploring Bazel's bazel-toolchains repo. Its rbe_autoconfig
rule is highly configurable and works nicely with BuildBuddy.
Here's a quick snippet you can add to your WORKSPACE
file if using bazel 3.6.0:
load("@bazel_tools//tools/build_defs/repo:http.bzl", "http_archive")
http_archive(
name = "bazel_toolchains",
sha256 = "4fb3ceea08101ec41208e3df9e56ec72b69f3d11c56629d6477c0ff88d711cf7",
strip_prefix = "bazel-toolchains-3.6.0",
urls = [
"https://github.com/bazelbuild/bazel-toolchains/releases/download/3.6.0/bazel-toolchains-3.6.0.tar.gz",
"https://mirror.bazel.build/github.com/bazelbuild/bazel-toolchains/releases/download/3.6.0/bazel-toolchains-3.6.0.tar.gz",
],
)
load("@bazel_toolchains//rules:rbe_repo.bzl", "rbe_autoconfig")
# Creates a default toolchain config for RBE.
# Use this as is if you are using the rbe_ubuntu16_04 container,
# otherwise refer to RBE docs.
rbe_autoconfig(name = "rbe_default")
And to your .bazelrc
:
# Depending on how many machines are in the remote execution instance, setting
# this higher can make builds faster by allowing more jobs to run in parallel.
# Setting it too high can result in jobs that timeout, however, while waiting
# for a remote machine to execute them.
build:remote --jobs=50
# Set several flags related to specifying the platform, toolchain and java
# properties.
# These flags should only be used as is for the rbe-ubuntu16-04 container
# and need to be adapted to work with other toolchain containers.
build:remote --java_language_version=11
build:remote --tool_java_language_version=11
build:remote --java_runtime_version=remotejdk_11
build:remote --tool_java_runtime_version=remotejdk_11
build:remote --crosstool_top=@rbe_default//cc:toolchain
build:remote --repo_env=BAZEL_DO_NOT_DETECT_CPP_TOOLCHAIN=1
# Platform flags:
# The toolchain container used for execution is defined in the target indicated
# by "extra_execution_platforms", "host_platform" and "platforms".
# More about platforms: https://docs.bazel.build/versions/master/platforms.html
build:remote --extra_toolchains=@rbe_default//config:cc-toolchain
build:remote --extra_execution_platforms=@rbe_default//config:platform
build:remote --host_platform=@rbe_default//config:platform
build:remote --platforms=@rbe_default//config:platform
# Starting with Bazel 0.27.0 strategies do not need to be explicitly
# defined. See https://github.com/bazelbuild/bazel/issues/7480
build:remote --define=EXECUTOR=remote
# Enable remote execution so actions are performed on the remote systems.
build:remote --remote_executor=grpcs://remote.buildbuddy.io
# Enforce stricter environment rules, which eliminates some non-hermetic
# behavior and therefore improves both the remote cache hit rate and the
# correctness and repeatability of the build.
build:remote --incompatible_strict_action_env=true
# Set a higher timeout value, just in case.
build:remote --remote_timeout=3600
And then run:
bazel build //... --config=remote