Skip to content

Releases: airbytehq/airbyte-platform

Airbyte 1.4.0

17 Jan 22:42
Compare
Choose a tag to compare

Happy new year! Airbyte version 1.4.0 was released on January 17, 2025. We’re excited to share new improvements and changes to the Airbyte platform.

🚀 Platform Changes

Platform changes improve Airbyte for everyone with a self-managed instance.

Configure the schema refresh rate

At the start of a sync, Airbyte occasionally re-runs schema discovery to detect changes in your source, such as new fields or a change in column type. But running schema discovery has a performance cost, and may not always be necessary before every sync.

You may now use a new toggle, DISCOVER_REFRESH_WINDOW_MINUTES, to set the minimum number of minutes Airbyte will wait to refresh the schema for your sources. By setting a larger number, you run automatic schema detection less frequently, which can result in gains in sync performance. The default configuration in Airbyte Open Source is 1440, which refreshes schemas every 24 hours. The lowest interval you can set is 1, which refreshes schemas before every sync.

worker:
  env_vars:
    DISCOVER_REFRESH_WINDOW_MINUTES: 1440 # Airbyte automatically refreshes schemas no more than once per day (1440 minutes).

Set this to 0 to disable automatic schema refreshes. All schema refreshes will need to be done manually via the Airbyte UI or API.

worker:
  env_vars:
    DISCOVER_REFRESH_WINDOW_MINUTES: 0 # Airbyte does not automatically detect schema changes.

Connectors support custom image registries

Connectors can now use custom image registries rather than Airbyte’s public Docker registry. If you configure Airbyte to use a custom image registry, it now automatically uses that registry for connector images as well as platform images. Previously, only platform images supported this. In this example, we set Airbyte’s values.yaml file to pull all images from GitHub.

global:
  image:
    registry: ghcr.io/NAMESPACE

You must ensure copies of platform and connector images are available in your custom image registry. Learn how to set up custom image registries.

Custom Docker connectors in your workspace that specify an image using a fully qualified domain name (for example, example.com/airbyte/your-custom-source) ignore your configured custom image registry and pull images from the domain specified by that connector.

🐛 Bug fixes

  • Reduced resource consumption: Instances of Airbyte running multiple connections at a time now consume fewer resources, thanks to optimizations to the airbyte-worker pod.

  • Canceled syncs do not rerun until the next scheduled sync: If you previously cancelled an in-progress sync, and the next sync as configured in the connection frequency was behind schedule, Airbyte would immediately start a new sync. In practice, many users had to choose to ‘Cancel Sync’ twice in a row to stop moving data. Airbyte now automatically waits until the next scheduled sync to move data.

  • Reduce rate limit errors from the airbyte-cron service: We fixed an issue reported by the community that caused excessive rate limit errors on the airbyte-cron pod when users scheduled connections using the Airbyte CRON capability.

  • Autorecovery for hanging connections: Airbyte now packages a service to detect connections blocked by the unlikely event that a sync becomes stuck, and remains in a perpetual ‘in-progress’ state without moving data. This heartbeat service will detect syncs that are hanging, and automatically create a new job attempt.

Airbyte 1.3.0

09 Dec 22:10
Compare
Choose a tag to compare

It's already December! We are excited to release the following set of improvements and changes.

🚀 Platform Changes

  • Improved Log Viewer and Filtering: You can now quickly filter to error or warning logs when needed, as well as by log source - such as the source connector, destination connector, or platform. Together, this allows you to quickly diagnose issues with Airbyte connections if they arise, reducing time to resolution on issues.

Improved Log Viewer

  • OpenShift Support: Starting today, Airbyte supports deployments to OpenShift. This is the culmination of a journey over much of the past year: we've updated our platform and connectors to be rootless, we've improved pod to pod communication within our platform, and made a wide number of security improvements. To succeed, the user running helm deployment commands needs to be a cluster-admin user. Your Kubernetes namespace will also require the following annotations to ensure that all of the Airbyte resources run as UID 1000 and GID 1000. If you run into any edge cases or issues deploying to OpenShift, as always, please let us know by opening a GitHub issue.
openshift.io/sa.scc.supplemental-groups: 1000/1
openshift.io/sa.scc.uid-range: 1000/1

🚀 Self-Managed Enterprise Changes

  • Compliance Mappers in Airbyte API: Airbyte Self-Managed Enterprise now supports field hashing, field encryption (with self-managed encryption keys), field renaming and row filtering for connections from a common interface available in the Airbyte API. This functionality is coming soon to the Airbyte UI. To get started, see our API documentation.

Mappers API Example

Know bugs

Cron schedules cannot be configured to run more frequently than once per hour

This bug affects v1.3.0 and v1.3.1. A fix has been merged and is pending release in v1.3.2.

Airbyte 1.2.0

07 Nov 16:49
Compare
Choose a tag to compare

Happy Halloween! We are excited to release the following set of improvements and changes.

🚀 Platform Changes

  • File Transfers: Airbyte now supports transferring unstructured text data, non-text data and compressed files. This capability is in early access, and currently only available for moving data from the "SFTP Bulk" source to the "S3" destination. Support in the "S3" source is coming soon. File transfers allow you to copy raw files in Airbyte without parsing their contents. Bits are copied into the destination exactly as they appeared in the source. This is recommended for use with unstructured text data, non-text and compressed files. We currently support copying files up to 1GB in size. This is an entirely new way of moving data through Airbyte, as all pre-existing data movement methods involved parsing individual records.

file-transfers-sftp-bulk-source

  • Custom Image Registry Support: Introduces support for specifying a custom Docker registry from which all platform images are pulled. This should simplify helm chart configurations for users who require images to be pulled from internally approved image repositories. Configuring the following will prefix all docker images with my-registry.foo.com. The value provided in the registry field must have a trailing slash:
global:
  image:
    registry: my-registry.foo.com/
  • XML Support in the Connector Builder: Airbyte now supports custom API connectors built via the Airbyte Connector Builder which return responses as XML format.

xml-builder

🚀 Self-Managed Enterprise Changes

  • Runner RBAC Role: Users provisioned with the 'Runner' role are able to start or stop syncs, and run backfills for individual connections (in the workspaces where they've been granted Runner permissions). Outside of these explicit tasks, users with the 'Runner' role have an entirely read-only experience (Self-Managed Enterprise only).

rbac-1.2

  • Self-Managed Diagnostics: Diagnostics have been expanded to include deployment statistics. Diagnostics can be downloaded through the Organization Settings page. This exports a .zip file which can be easily shared with Airbyte Support (Self-Managed Enterprise only).
  • Removing SSO Users: You may now remove SSO users from your Airbyte Organization. This removes them from the list of Organization Members.

Airbyte 1.1.0

03 Oct 18:25
Compare
Choose a tag to compare

🚀 Features

The 1.1.0 release includes the following enhancements and bug fixes.

Platform Releases

  • Adds field hashing for Self-Managed Enterprise

  • Adds the Connection Timeline, replacing the Job History for a connection. For assistance with the migration of existing jobs to the new timeline events, follow our guide.

  • Redesigned the schema tab to improve the selection & deselection of streams and fields. Sync modes, cursor fields, and primary keys are also selected here.

Schema Tab

  • Custom Docker-based connectors can now be renamed and deleted

Support Experience

  • Diagnostics can be downloaded through the Organization Settings page, which will export a JSON file to facilitate sharing with Support. This JSON file includes relevant infrastructure and connection information. (Self-Managed Enterprise only)

Billing & Licenses

  • Self-Managed Enterprise users can now view directly in the UI when the license is expiring.

Changelog

See the Full Changelog

Airbyte 1.0

23 Sep 22:01
Compare
Choose a tag to compare

airbyte v1.0.0

Official launch page: https://airbyte.com/v1

This page includes new features and improvements to the Airbyte products we're excited to share as a part of the 1.0 release. Airbyte v1.0 contains many improvements and additions to enhance the reliability, scalability, and uses of Airbyte.

Moving forward, Airbyte will release official new platform versions on a monthly cadence, with the associated changelog.

Ease of deployment

  • abctl is the easiest, quickest way to get started with Airbyte Self-managed. See our quickstart docs for more details.

  • Native authentication by email and password is available so that any instance of Airbyte is secure by default.

Proven Reliability

  • Automatic detection of dropped records ensures data is passed without fail through the Airbyte platform.

  • Enhanced our notification suite to enable webhook integrations and added more contextual information about sync failures or schema changes.

  • Connections now offer rate limited messaging, improved error handling, and live monitoring of ongoing syncs.

Status Page

  • Many sources, in particular S3, have become faster. With CDK 2.0, the protocol now uses Pydantic V2 and removes the serialization of each record. We also improved Databricks and S3 destinations to enable faster sync speeds, checkpointing, and modernization to our Destinations V2 framework.

Resiliency at Scale

  • Refreshes bring an improved experience to resyncing all of your data again. This enables data to never be deleted from final tables during a historical resyncing of data.

  • Resumable Full Refresh allows for large streams syncing in Full Refresh to sync without failures.

  • Our database sources added resumability to reduce failures and stay resilient when incoming data is inconsistent. We gracefully handle record-level error in the destination field _airbyte_meta.errrors for large records or mistyped data. Read more in our typing & deduping documentation.

  • Large initial CDC syncs are gracefully handled by consuming WAL and acknowledges logs periodically to ensure disk space is freed in a timely manner.

  • Workloads provide a more scalable and reliable architecture to run sync jobs by separating scheduling and orchestration from data movement tasks. This improvement unlocks more automated management of workloads by managing job spikes and enables horizonal scaling (for Cloud and Enterprise users).

Tackling the long-tail of connectors

  • We launched our connector Marketplace for our community-maintained connectors. Automated testing ensures our connectors stay high quality, and most connectors have been migrated to low/no-code.

  • The Connector Builder seamlessly switches betweem UI and YAML modes, supports custom components, and contains extensive testing and error handling.

  • AI Assist now builds connectors from scratch for you with just a link to the API docs.

  • Anyone can now contribute new connectors directly from the builder to add a new connector directly to Airbyte's Marketplace. Additionally, connectors can be forked directly from the UI for faster edits, ensuring our connectors stay up-to-date.

  • Airbyte supports writing to several vector store destinations, including pgVector, Pinecone, and Snowflake Cortex.

Self-Managed Enterprise

Self-Managed Enterprise extends on Airbyte 1.0 by introducing new classes of functionality: multitenancy and access management, enterprise source connectors , sensitive data masking and enterprise support – all while data never leaves your infrastructure.

  • Support for Okta and OpenID Connect (OIDC) SSO
  • Availability of Role-based Access Control and multitenancy
  • Columns can now be hashed to protect sensitive data

Announcements

  • To ensure adherence to security best practices, Airbyte is migrating all connectors to non-root versions. It is highly recommended that you upgrade your platform version to v0.63.9 or later before October 2024 to ensure your syncs continue to succeed.

  • As we prepare to deprecate Docker Compose, we published a migration guide for those migrating from Docker Compose to abctl.

v0.64.7

18 Sep 14:25
Compare
Choose a tag to compare

55f8144 fix: restore secret persistence config (#14015)
da74ee2 chore: change wording on link to invoice (#14012)
4b2a524 chore: [SyncCatalog V2] remove debug code from catalog (#13994)
ab5be73 fix: [SyncCatalog V2] wrong "breaking changes message" position (#13993)
f9e7607 feat: add banners for manual and auto recharge to billing page (#14011)
1129cef chore: delete deprecated worker v1 helm and infra code. (#13984)
4070642 chore: mappers also rename source default cursor and source defined pk. (#14002)
559dc25 chore: delete remaining worker v1 config code (#13989)
342f72e refactor: remove connector rollout cli shortcut for actor IDs (#14001)
30eaac3 chore: deploy connector rollout worker on oss (#13887)
a1b5395 Bump helm chart version reference to 0.620.2
f5b500b chore: refactor workload acceptance tests to use dynamic feature flags (#13492)
0993ad0 Bump helm chart version reference to 0.620.0
43565b5 fix: parsing docker image when the full image name contains registry port (#13990)

v0.64.6

17 Sep 20:23
Compare
Choose a tag to compare

d1c36c4 feat: implement correct license statuses (#13894)
0665001 fix: properly generate the airbyte_type (#13998)
df7d84e chore: use separate message for empty exit code file. (#13997)
43f964a docs: add learn more link about recovering credentials in oss (#13951)
737e8c5 chore: delete all non-config worker v1 code. (#13988)
ca7ff83 chore(ai-assist): log assist service response codes (#13995)
4e6b440 feat: add organization balance API (#13964)

v0.64.5

17 Sep 17:58
Compare
Choose a tag to compare

Caution

This release contains the activation of a feature which mutates the type TIMESTAMP WITHOUT TIMEZONE in TIMESTAMP WITH TIMEZONE. This was unintentional and was fixed in the next release.
Existing Airbyte users should skip this version and directly upgrade to 0.64.6.

eb28d9b feat: introduce diagnostic tool api (#13845)
f264427 fix: uses SHA-256 not SHA-1 (#13970)
f9c23bd feat: virtualize all connections page (#13548)
9717c1c chore: add data-testid on hashing input for fullstory sessions (#13992)
dd882ff feat: connector progressive rollout CLI (#13888)
483f470 chore: rename the <PKCell /> and <CursorCell /> components to match the level in the catalog (#13991)
2a75339 chore: change cursor from checkbox to radio button (#13965)
74800c2 chore: add tooltip to source defined pk and cursor combobox (#13974)
e733b86 refactor: add useCurrentConnection hook (#13922)
2b4ba10 chore: add analytics tracking functions to new sync catalog (#13976)
512e493 feat: worker for connector progressive rollouts (#13886)
d74ce0d chore: ensure launcher fails on certain errors (#13987)
d2114e5 feat: update connector builder project contributionInfo after contributing (#13897)
d8af275 chore(ai-assist): add alpha badge (#13973)
b9cf0a9 feat: fork in builder buttons (#13787)
50b9b34 fix: remove typecasting and centering from running jobs in timeline ui (#13972)
26d3236 feat: add details field to known exception (#13930)
f812e04 feat: new & updated API endpoints and schema for connector rollouts (#13864)
5d8b230 refactor: rename commons logging (#13925)
53e3e4c chore: set the mapper flag to true (#13945)
3454fb5 refactor: move storage client to shared library (#13916)
bc2aefd feat: enable renaming of cursor and pk in mappers (#13959)
10af3de chore(frontend): update format command to match ci requirements (#13848)
d8a8a58 fix: identify streams with changed hashedFields as needing refresh (#13943)
181a2e9 fix: fix not applied height in case of empty table state (#13975)
bd254f0 chore: don't send known invalid events to datadog for timeline (#13952)
8190bb0 chore: make resource error as external error message (#13971)
54849f2 design: use standard empty state for dbt cloud transformations tab (#13944)
bc351cc fix: handle both null and empty namespaces for stream status completion (#13967)
8189ffd chore: default hashing config to set SHA-256 instead of SHA-1 (#13947)
311be67 feat: hashing UI feedback (#13931)
8476ebf fix: align schema refreshing message properly (#13963)
f249bee chore: update to micronaut 4.6.2 (#13940)
06196c9 fix: autoescape regex characters in sync catalog search (#13962)
5cf62e0 fix: don't display usage data if it doesn't match user's current timezone (#13961)
69b33f6 chore: upgrade base image to alpine3.20 (#13956)
9418f09 fix: add the connection string to all the application.yml files that need it (#13955)
4a5327f fix: catalog generation related to mappers (#13950)
5120384 chore: add tracking for contribute form focus (#13912)
dec1795 feat: deterministic workflow Id (#13948)
4174406 chore: hashing mapper populate meta.changes on errors (#13919)
06af5db chore: cleanup rolled out flags (#13906)
b739c1b Bump helm chart version reference to 0.594.0

v0.64.4

13 Sep 18:05
Compare
Choose a tag to compare

b32d1d2 feat: ensure keycloak master realm is properly configured on startup (#13821)
074558c feat: custom sync catalog search and expansion (#13924)
59e9bdd fix: a few wording adjustments (#13939)
db1a531 chore: add dd tracking for missing timeline event item (#13932)
8944018 fix: update "enable/disable all streams in namespace" implementation (#13937)
c0debfc feat: azure key vault persistence (#13791)
d7bc738 chore: change error message (#13914)
865e444 fix: add aws and vault env vars to init container app yml. (#13921)
ddde2b0 chore: remove all worker v1 / workload branching. (#13861)
6ad3f9d build: gradle 8.10.1 (#13926)
11be428 fix: apply mapper to the destination catalog in monopod (#13918)
a4d7371 fix: revert fix for modal scrollbars for now (#13927)
9c970c5 fix: show settings change events in timeline again (#13917)
b886328 feat(billing): add workspace usage graph and connection table (#13586)
12319bb feat: show base connector and contribution info in Builder UI (#13724)
b172a40 feat: allow forking from manifest-only connectors in Builder Fork page (#13671)
7b59c7b feat: return base actor definition version info and contribution info from connector builder project endpoints (#13829)
10eb8ca fix: conditionally create local-secrets-bean. (#13920)
fde00dd feat: store contribution info on project update (#13832)
597bb8b feat: show {enabled|disabled} count / {totalCount} in namespace row when tab filter is active (#13907)
fd94d28 chore: print container and pod info for monopod. (#13915)
cfdad4e chore: fix hashing mapper test (#13913)
671a36f chore: remove condition on setting the workload (#13911)
6c6b657 feat: schema propagation events in timeline UI (#13892)
9b701d5 chore: improve error message for timeout exceptions (#13883)
845eda4 design: reword notification settings for connection updates (#13901)
19f894f chore: add data and caching to AI assist (#13885)
0e63273 chore: hashing mapper returns hex instead of base64 (#13898)
832ead0 design: show tooltip for relative time toggle on sync status page (#13902)
b91926c chore: [SyncCatalog V2] always show expand/collapse control (#13884)
c9ff4d8 chore: remove connector name in use backend check (#13895)
6b35241 chore: fallback to true if getUseWorkloadApi null. (#13896)
0c98541 fix: handle loading state flicker in generate (#13857)
c4b1a9a fix: handle invalid mapper configs (#13825)
03cce50 refactor: remove dead instance_config options (#13891)
3cf3aa5 feat: return actor definition id from /contribute/generate (#13665)
568839c fix: remove old init sql (#13877)
823e7ed feat: endpoint to create a forked connector builder project (#13664)
6b5f842 feat: add endpoint to get connector builder project by definiton ID (#13784)
3171f4e fix: handle jobs with 0 attempts in the ui (#13875)
163f4d9 feat: add azure support for storage (#13608)
013c398 feat: hash mapper UI v0 (#13805)
bed7719 fix: prevent submitting sync catalog form from search input (#13881)
e47a59b feat: report new usage_category with Orb events (#13764)
ec83760 revert: chore: add data and caching to AI assist (#13880)
c87691a fix: add back failureOrigin enum values for normalization and dbt (#13830)
3893558 feat: defaults monopod to enabled for builds. (#13873)
b7f1ed0 revert: "revert: "feat: add internal api to support field hashing"" (#13865)
88de307 chore: add data and caching to AI assist (#13750)
9d5a39e feat: add new license info endpoint (#13711)
c769251 fix: support the unexpected json schema (#13871)
9a95174 feat(billing): new workspace usage endpoint (#13811)
1c62ddb chore: set monopod flags to true for accpetance tests. (#13856)
2565d7b fix: isTerminal(Pod) logic works with Monopod (#13854)
3824ae2 chore: clean up some permission logic (#13849)
5849764 fix: do not log duplicate connection disabled events (#13752)
98992ba fix(test): use local chart version for temporal-ui (#13866)
d6833d5 fix: don't allow invalid filter selections on timeline (#13855)
6cca28f revert: "feat: add internal api to support field hashing" (#13859)
bf799dd feat: add internal api to support field hashing (#13798)
aff5a15 Bump helm chart version reference to 0.551.0

v0.64.3

06 Sep 18:22
Compare
Choose a tag to compare

e6ade26 fix: include refreshes in early syncs query (#13852)
d378e75 fix(helm): remove pre-upgrade hook from service account (#13729)
a353a6a build: micronaut 4.6.1 (#13847)
d295dcd chore: enable node selector override (#13843)
84d599f feat(ai-assist): animated loading state (#13841)
c16e5be chore: [SyncCatalog V2] imrove collapse/expand row perfomace (#13840)
8a529d9 chore: removes unnecessary print statement that clutters logs. (#13844)
2531253 chore(helm): add logging and storage templates (#13807)
0c753eb chore: add basic mapper performance metrics to the sync summary (#13842)
ed7bf3e feat: progressive rollouts initial connector_rollout table & API (#13675)
6db66c0 feat: progressive rollouts table migration (#13819)
910f012 feat: support unknown type in the jsonSchema catalog (#13818)
efd0adf feat: split org members access management onto its own page (#13834)
4d99e61 fix: remove duplicate jobId on failures in "advanced view" of timeline (#13827)
cbfbfe8 chore: introduce AirbyteRecord and AirbyteJsonRecordAdapter (#13774)
d9e3dd5 fix: missing local string (#13831)