How does Airbyte handle load massive volume of data?

datayoshi · April 27, 2022, 11:51am

How does Airbyte handle extracting data in high load massive volume?
Does it auto-scale up/down?
How to control it?

I read from the docs:

Airbyte allows scaling sync workloads horizontally using Kubernetes.

Is that related?

harshith · April 27, 2022, 12:11pm

Hey, we do have a k8s solution in beta which can scale both the workers and sync pods based on the load. Is this is same one you are looking for?

datayoshi · April 27, 2022, 12:18pm

Thanks @harshith ,
to understand if that is what I’m looking for - can you please elaborate about:

Are those workers include the components that do the data extraction?
Is it auto-scale both up & down?
Does the auto scale can be triggered by data volume?

harshith · April 27, 2022, 12:22pm

@datayoshi you can go through this doc for more understanding of the worker

Otherwise

Sync pods are created to fetch data and are deleted once the data sync is done
What do you mean by data volume?

We have the basic k8s deployment module in our repo but if you are looking to scale up/down by some metrics you can also explore GKE custom metrics which can be added to the k8s charts.

datayoshi · April 27, 2022, 1:03pm

Thanks for the reference, I’ve reviewed it.

What is the trigger that scales an Airbyte worker?

harshith · April 27, 2022, 1:13pm

@datayoshi we don’t have it already configured but you can refer to this to choose the method Scaling Airbyte | Airbyte Documentation

Topic		Replies	Views
Autoscale on airbyte worker in k8s Platform, Deploy & Infra Issues kubernetes , deploy	0	387	May 31, 2023
Airbyte deployment not handling large data size after adding configurations Platform Questions kubernetes , platform , helm , question , airbyte-deployment	0	35	June 8, 2024
Scaling Airbyte on Kubernetes for Faster Data Processing Platform Questions kubernetes , platform , question , scaling , helm-charts	0	83	May 14, 2024
Scaling airbyte-worker pods for resource optimization Platform Questions platform , question , scaling , airbyte-worker , resource-optimization	0	77	June 13, 2024
Sync job not respecting CPU/Memory limits after Airbyte upgrade on Kubernetes Platform Questions kubernetes , platform , sync-job , bug , environment-variables	1	32	August 2, 2024

How does Airbyte handle load massive volume of data?

Related topics