Replicating 500 GB PostgreSQL to BigQuery with Fresh Data and Cost Concerns

slack-user-airbyte · May 14, 2024, 6:13pm

Summary

The user is looking to replicate a 500 GB PostgreSQL database to BigQuery with relatively fresh data every 15 minutes. They are concerned about the cost and are seeking information on how the staging table is merged into the destination table and what precautions are taken to minimize costs and ensure proper partition pruning.

Question

Hello,

I tried the ask-ai, but it didn’t have an answer for me . I’d like to replicate a 500 GB postgresql to BigQuery. I’d like the data to be relatively fresh (15 minutes), but I’m worried about the cost. If I understand the steps correctly, the postgresql data is first copied to GCS, which is then imported to a staging table using a load job. How is the staging table merged into the destination table? What precautions are used to minimize the cost/make sure proper partition pruning is applied?

Thanks!

This topic has been created from a Slack thread to give it more visibility.
It will be on Read-Only mode here. Click here if you want to access the original thread.

Join the conversation on Slack

_{["replicate", "postgresql", "bigquery", "data-freshness", "cost-concerns", "staging-table", "destination-table", "partition-pruning"]}

Topic		Replies	Views
High costs due to data load into BigQuery Connector Questions & Issues destination-bigquery	4	568	July 14, 2022
Source BigQuery copies schema but no records Connector Questions & Issues destination-postgres , data-loading , source-bigquery	6	400	July 14, 2022
Recommended way to sync PG tables of varying sizes to BigQuery in Airbyte Connector Questions performance , multiple-connections , connector , bigquery , postgresql	7	11	September 23, 2024
BigQuery SQL optimization Connector Questions & Issues normalization , data-loading	6	1314	July 14, 2022
Destination BigQuery - Deduped + history generates too much processing costs Connector Questions & Issues connectors	1	395	September 1, 2022

Replicating 500 GB PostgreSQL to BigQuery with Fresh Data and Cost Concerns

Summary

Question

Related topics