Airbyte BigQuery raw table data type for _airbyte_data field

Summary

Airbyte BigQuery raw table stores _airbyte_data field as string instead of json, impacting data size and cost


Question

why does the raw table in bigquery which is generated by Airbyte have _airbyte_data data type as string instead of json even though the data stored is of json format
And i also checked that the data while using json_functions consumed is less compared to doing the same thing on a string column
String __airbyte_data costed around 1GB
Json __airbyte_data costed around 900 MB



This topic has been created from a Slack thread to give it more visibility.
It will be on Read-Only mode here. Click here if you want to access the original thread.

Join the conversation on Slack

["airbyte", "bigquery", "raw-table", "_airbyte_data", "json", "data-type", "data-size", "cost"]