A command line tool for running Airbyte sources & destinations locally with ease.
π‘ NOTE: The previous Bash-based version of this CLI has been deprecated.
For migration details, see Migration Guide.
For deprecated CLI usage, see legacy README.
Table of Contents
π¦ Getting Started
β Prerequisites
- Docker
- Faros API Key
π» Supported Systems
- Linux (x86_64)
- macOS (Apple Silicon - arm64)
- Windows (x86_64)
π Step 1. Install
MacOS (Arch: Apple Silicon)
Here is the steps of downloading the CLI on MacOS. Linux should have very similar steps.
wget -O airbyte-local.zip https://github.com/faros-ai/airbyte-local-cli/releases/download/latest/airbyte-local-macos-arm64.zip unzip -o airbyte-local.zip ./airbyte-local
Linux
wget -O airbyte-local.zip https://github.com/faros-ai/airbyte-local-cli/releases/download/latest/airbyte-local-linux-x64.zip unzip -o airbyte-local.zip ./airbyte-local
Windows (Powershell)
Invoke-WebRequest -Uri "https://github.com/faros-ai/airbyte-local-cli/releases/download/latest/airbyte-local-win-x64.zip" -OutFile "airbyte-local-win-x64.zip" Expand-Archive -Path "airbyte-local-win-x64.zip" -DestinationPath . -Force .\airbyte-local
π§© Step 2. Create an Airbyte Config File
Before you can run your sync, you need to tell the CLI:
- Where to pull data from (e.g., GitHub, Jira)
- Where to push data to (usually Faros)
You have two options to create this configuration:
β¨ Option A: Auto-Generate (Recommended for new users)
Use this if youβre new or want to get started quickly.
Run:
./airbyte-local generate-config <source> [destination]
Examples:
# Pull from GitHub β Push to Faros ./airbyte-local generate-config github # Pull from Jira β Push to Faros ./airbyte-local generate-config jira ./airbyte-local generate-config -s jira # Use your own custom or non Faros-managed images ./airbyte-local generate-config --image farosai/airbyte-github-custom-source ./airbyte-local generate-config --image farosai/airbyte-github-custom-source farosai/airbyte-custom-destination # For help ./airbyte-local generate-config --help
This command will:
- Print source/destination config tables in your terminal.
- Create a
faros_airbyte_cli_config.jsonfile with required fields only. - Show you πΉ Next Steps in the terminal β follow those to continue!
Tips & Tricks:
- Source and destination names are case-insensitive and tolerate typos.
- Destination defaults to Faros unless specified
- Add
-sto skip printing the config tables if you just want the file. - Use
--imageto specify your custom images. This doesn't tolerate typos, and sets the default destination to Faros, i.e.farosai/airbyte-faros-destination.
π§ Option B: Write Your Own Config (For Advanced Users)
If you already know your way around Airbyte and want full control, you can craft your own config file.
{
"connectionName": "<YOUR_CONNECTION_NAME>",
"src": {
"image": "<YOUR_SOURCE_IMAGE_NAME>",
"config": {...YOUR_SOURCE_CONFIG...}
},
"dst": {
"image": "<YOUR_DESTINATION_IMAGE_NAME>",
"config": {...YOUR_DESTINATION_CONFIG...}
}
}Common image names:
- GitHub:
farosai/airbyte-github-source - Jira:
farosai/airbyte-jira-source - Faros:
farosai/airbyte-faros-destination
π Full image list: Docker Hub - farosai
Example
Assuming you want to pull data from Github org my-org by using GitHub PAT and push data to Faros default workspace. This is what the JSON would look like
{
"src": {
"image": "farosai/airbyte-github-source",
"config": {
"api_key": "<YOUR_FAROS_API_KEY>", <-- Faros API key
"graph": "default" <-- Faros workspace
"authentication": {
"type": "token",
"personal_access_token": "<YOUR_GITHUB_PAT_TOKEN>"
},
"organizations": ["my-org"],
}
},
"dst": {
"image": "farosai/airbyte-faros-destination",
"config": {
"edition_configs": {
"tenant_id": "<YOUR_TENANT_ID>", <-- Faros tenant Id
"api_key": "<YOUR_FAROS_API_KEY>", <-- Faros API key
"graph": "default" <-- Faros workspace
}
}
}
}Save as faros_airbyte_cli_config.json.
In most cases, you always have to provide Faros API key and workspace under src.config and dst.config.edition_configs.
More resources you can find it in Faros Documentation, e.g. instructions to create GitHub PAT and what permission you need for the PAT, etc.
π Step 3. Run it!
./airbyte-local --config-file faros_airbyte_cli_config.json ./airbyte-local -c faros_airbyte_cli_config.json
Starting from v0.2.0, when running with a Faros destination, the CLI will prompt you to confirm before writing data:
=========================================================== ATTENTION: You are about to write data into: Tenant: Acme Workspace: default =========================================================== Would you like to proceed? (yes/No):This is a safety check to prevent accidentally writing data to the wrong tenant or workspace.
Running programmatically (CI/CD)? Use the
--assume-yesflag to suppress the prompt β but only after you have verified your config file targets the correct tenant and workspace:./airbyte-local --config-file βfaros_airbyte_cli_config.jsonβ --assume-yes
Youβll see logs and sync progress in the terminal and in the Faros App (Data Control > Sources).
βοΈ Advanced Settings
In some cases, you might want to customized the data sync more.
We provide some more CLI optional arguments and optional fields in the Airbyte configuration file.
CLI Arguments
| Option | Required | Description |
|---|---|---|
-c, --config-file <path> |
Yes | Airbyte source and destination connector config JSON file path |
-h, --help |
Display usage information | |
-v, --version |
Output the current version | |
--full-refresh |
Force full_refresh and overwrite mode. This overrides the mode in provided config file | |
--state-file <path> |
Override state file path for incremental sync | |
--no-src-pull |
Skip pulling Airbyte source image | |
--no-dst-pull |
Skip pulling Airbyte destination image | |
--src-only |
Only run the Airbyte source and write output in stdout. Use '--src-output-file' instead to write to a file | |
--src-output-file <path> |
Write source output as a file (requires a destination) | |
--src-check-connection |
Validate the Airbyte source connection | |
--dst-only <file> |
Use a file for destination input instead of a source | |
--dst-use-host-network |
Use the host network when running the Airbyte destination | |
--log-level <level> |
Set level of source and destination loggers (default: "info") | |
--raw-messages |
Output raw Airbyte messages | |
--connection-name <name> |
Sets the origin for data in Faros (used to namespace, query, and delete data by source) and the default state file name (overridden by --state-file). Must be unique per source instance to prevent data overwrites or conflicts. |
|
--keep-containers |
Do not remove source and destination containers after they exit | |
--assume-yes |
Skip confirmation prompts (e.g., tenant verification). Use only after verifying your config is correct | |
--debug |
Enable debug logging | |
--src <image> |
[Deprecated] Airbyte source Docker image | |
--dst <image> |
[Deprecated] Airbyte destination Docker image | |
--src.<key> <value> |
[Deprecated] Add "key": "value" into the source config | |
--dst.<key> <value> |
[Deprecated] Add "key": "value" into the destination config |
Example Usage
Linux/MacOS
# Turn on debug logs ./airbyte-local --config-file 'faros_airbyte_cli_config.json' --debug # Run source sync only ./airbyte-local \ --config-file 'faros_airbyte_cli_config.json' \ --src-only # Check source connection ./airbyte-local \ --config-file 'faros_airbyte_cli_config.json' \ --src-check-connection # Enforce full refreash ./airbyte-local \ --config-file 'faros_airbyte_cli_config.json' \ --full-refresh # Use customized connection name ./airbyte-local \ --config-file 'faros_airbyte_cli_config.json' \ --connection-name 'test-connection'
Windows (Powershell)
# Turn on debug logs ./airbyte-local --config-file 'faros_airbyte_cli_config.json' --debug # Run source sync only ./airbyte-local ` --config-file 'faros_airbyte_cli_config.json' ` --src-only # Check source connection ./airbyte-local ` --config-file 'faros_airbyte_cli_config.json' ` --src-check-connection # Enforce full refreash ./airbyte-local ` --config-file 'faros_airbyte_cli_config.json' ` --full-refresh # Use customized connection name ./airbyte-local ` --config-file 'faros_airbyte_cli_config.json' ` --connection-name 'test-connection'
Connection Name
The connection name identifies the origin of your data in Faros. It's used to namespace, query, and delete data by source. It also determines the default state file name for incremental syncs.
You can set the connection name in the config file:
{
"connectionName": "my-github-prod",
"src": { ... },
"dst": { ... }
}Or via CLI argument:
./airbyte-local --config-file config.json --connection-name my-github-prod
Precedence
The connection name is resolved in the following order (highest to lowest priority):
| Priority | Method | Example |
|---|---|---|
| 1 | CLI --connection-name |
--connection-name cli-conn |
| 2 | Config file connectionName |
"connectionName": "cfg-conn" |
| 3 | Auto-generated from image | mygithubsrc (from farosai/airbyte-github-source) |
If both CLI and config file specify a connection name, a warning is logged and the CLI value is used.
Note: The hidden
--dst-stream-prefixoption overrides the stream prefix directly. If used together withconnectionName, records will have their origin derived from the stream prefix, not the connection name. A warning will be shown in this case.
Environment Variables in Config
To safely version control your config files without leaking secrets, you can reference environment variables using ${VAR_NAME} syntax. Secrets like API keys and tokens are injected at runtime, keeping your config files clean and secure.
{
"src": {
"image": "farosai/airbyte-github-source",
"config": {
"authentication": {
"personal_access_token": "${GITHUB_TOKEN}"
},
"organizations": ["my-org"]
}
},
"dst": {
"image": "farosai/airbyte-faros-destination",
"config": {
"edition_configs": {
"api_key": "${FAROS_API_KEY}",
"graph": "default"
}
}
}
}Then run with the environment variables set:
export GITHUB_TOKEN="ghp_xxxx" export FAROS_API_KEY="sk_xxxx" ./airbyte-local --config-file config.json
Notes:
- Environment variable substitution only works for string values in
src.configanddst.config - Resolved values are always strings (e.g.,
PORT=8080becomes"8080", not the number8080) - There is no type checking against the connector spec β ensure substituted values are used for string fields only
- The config file keeps the
${VAR}placeholders (safe to commit to version control) - Resolved values are written to temporary files that are deleted after the sync completes
- If an environment variable is not set or is empty, the CLI will exit with an error
- Variable names must follow POSIX conventions (letters, numbers, underscores; cannot start with a number)
Airbyte Configuration - Override Airbyte Catalog
You can override the default Airbyte catalog in the Airbyte configuration file that passed via --config-file.
It should be defined under catalog and src and dst has the same schema.
You will have to know the stream name and sync mode you would like to run.
{
"src": {
"image": ...,
"config": ...,
"catalog": { <-- define your catalog
"streams":[
{
"stream":{"name":"<STREAM_NAME>"}, <-- stream name
"sync_mode":"full_refresh" <-- sync mode: "full_refresh" or "incremental"
}
]
}
},
"dst": {
"image": ...,
"config": ...,
"catalog": {
...
}
}
}Airbyte Configuration - Customize Docker Settings
To customize the Docker settings for Airbyte connectors, you can configure optional fields such as CPU, memory, and log file size in the Airbyte configuration file. These settings allow you to fine-tune resource allocation for your connectors.
Additionally, the additionalOptions field enables you to specify advanced Docker options beyond CPU, memory, and log size. The schema for these options follows the Docker API specification. While these options are rarely needed, they can be useful for debugging or handling specific source use cases.
{
"src": {
"image": "<SOURCE_IMAGE>",
"dockerOptions": {
"maxCpus": 2, <-- unit: CPU (type: number)
"maxMemory": 256, <-- unit: MB (type: number)
"maxLogSize": "10m", <-- unit: k/m/g (type: string)
"additionalOptions": { <-- Additinoal Docker options
...
}
}
},
"dst": {
...
"dockerOptions": { <-- Same `dockerOptions` schema as above
...
}
}
}Volume Mount Example
To bind a volume mount, you can use the Binds option under HostConfig in additionalOptions:
{
"src": {
"image": "<SOURCE_IMAGE>",
"config": {...},
"dockerOptions": {
"maxMemory": 2048,
"maxCpus": 2,
"additionalOptions": {
"HostConfig": {
"Binds": [
"/path/to/tasks.xlsx:/tasks.xlsx"
]
}
}
}
...Environment Variable Example
To define environment variables for the container, use the Env option in additionalOptions.
{
"src": {
"image": "<SOURCE_IMAGE>",
"config": {...},
"dockerOptions": {
"maxMemory": 6144,
"additionalOptions": {
"Env": ["NODE_OPTIONS=--max_old_space_size=6000"]
}
}
...π FAQ
- If you have customized your docker socket, please exports the docker socket in env var
DOCKER_HOST. - We only support reading Airbyte configuration file in encoding:
utf-8,utf-16le,utf-16be.
π§ͺ Migration Guide
As some users might come from our previous Airbyte CLI Bash version.
Here is some guide for you to upgrade to the new one.
Old CLI v.s. New CLI
- Update the CLI from Bash script to binary - open up to users that don't use Bash by default
- Move Airbyte configuration into one JSON file - avoid syntax issues caused by running on different systems
Example Usage
# Older version ./airbyte-local.sh \ --src 'farosai/airbyte-faros-graphql-source' \ --src.api_url $FAROS_API_URL \ --src.api_key $FAROS_API_KEY \ --src.graph 'faros' \ --src.result_model 'Flat' \ --src.models_filter '["org_Team"]' \ --dst 'farosai/airbyte-faros-destination' \ --dst.edition_configs.graph 'default' \ --dst.edition_configs.api_url $FAROS_API_URL \ --dst.edition_configs.api_key $FAROS_API_KEY # Newer version ./airbyte-local --config-file graph_copy.json
New/Renamed Arguments
| Old Argument | New Argument | Replacement/Notes |
|---|---|---|
--config-file |
New arugment to take Airbyte configuration | |
--check-connection |
--src-check-connection |
For naming consistency |
--state <file> |
--state-file <file> |
For naming consistency |
Unsupported Arguments
The following arguments are droppeed in the new CLI. Please update your command according.
For arguments --src ... and --dst ..., they are still supported for user convenience. We strongly encourage users to use the new arugment --config-file to pass in Airbyte configuration in favor of the deprecated ones. Also, you can find a file named faros_airbyte_cli_config.json be automatically generated after running the CLI. It should covert your Airybte configuration to the new schema and next time you can just pass in this file with arugment --config-file and stop using the deprecated arguments!
| Argument | Status | Replacement/Notes |
|---|---|---|
--src <image> |
Deprecated | Image name is now defined in Aribyte configuration file |
--dst <image> |
Deprecated | Image name is now defined in Aribyte configuration file |
--src.<key> <value> |
Deprecated | Airbyte config is now defined in Aribyte configuration file |
--dst.<key> <value> |
Deprecated | Airbyte config is now defined in Aribyte configuration file |
--src-catalog-overrides <json> |
Unsupported | Airbyte catalog config is now defined in Aribyte configuration file |
--src-catalog-file <path> |
Unsupported | Airbyte catalog config is now defined in Aribyte configuration file |
--src-catalog-json <json> |
Unsupported | Airbyte catalog config is now defined in Aribyte configuration file |
--dst-catalog-file <path> |
Unsupported | Airbyte catalog config is now defined in Aribyte configuration file |
--dst-catalog-json <json> |
Unsupported | Airbyte catalog config is now defined in Aribyte configuration file |
--src-wizard |
Unsupported | Use generate-config instead |
--dst-wizard |
Unsupported | Use generate-config instead |
--max-log-size <size> |
Unsupported | Docker settings are now defined in Aribyte configuration file |
--max-mem <mem> |
Unsupported | Docker settings are now defined in Aribyte configuration file |
--max-cpus <cpus> |
Unsupported | Docker settings are now defined in Aribyte configuration file |
--src-docker-options "<string> |
Unsupported | Docker settings are now defined in Aribyte configuration file |
--dst-docker-options "<string> |
Unsupported | Docker settings are now defined in Aribyte configuration file |
--k8s-deployment |
Unsupported | Stop surporting running on local kubernetes cluster |
--dst-stream-prefix <prefix> |
Unsupported | Use --connection-name instead |