Summary
Airbyte BigQuery raw table stores _airbyte_data field as string instead of json, impacting data size and cost
Question
Why does the raw table that Airbyte generates in BigQuery use the STRING data type for _airbyte_data instead of JSON, even though the stored data is JSON?
I also checked, and running the JSON functions against a JSON column processes less data than running the same query against a STRING column:
STRING _airbyte_data: around 1 GB processed
JSON _airbyte_data: around 900 MB processed
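To put the two figures side by side, here is a rough sketch of the arithmetic. It assumes decimal units (1 GB = 1000 MB) and that both numbers come from the same query over the same rows; the variable and function names are only illustrative.

```python
# Figures quoted above; decimal units (1 GB = 1000 MB) are an assumption.
string_mb = 1000.0  # STRING column: "around 1 GB"
json_mb = 900.0     # JSON column: "around 900 MB"


def relative_reduction(before: float, after: float) -> float:
    """Return the fraction by which `after` is smaller than `before`."""
    return (before - after) / before


reduction = relative_reduction(string_mb, json_mb)
print(f"JSON column processed about {reduction:.0%} less data")
```

So on these approximate numbers the JSON column processes roughly a tenth less data per query; the exact saving will depend on the query and the shape of the payload.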
This topic has been created from a Slack thread to give it more visibility.