Data not fully loading from HubSpot connector to Databricks

Summary

Data from the HubSpot connector is not fully loading into Databricks, causing discrepancies in row counts. This is difficult to test for during incremental runs. Seeking fixes beyond rerunning the connector.


Question

We’ve got an issue where the data from our HubSpot connector is not fully loading into Databricks. For example, the HubSpot connector status says 432 rows were loaded for CMQLs, but the table in Databricks has no data. This can also be quite tricky to test for on the Databricks side when we do incremental runs. We use volume anomaly testing to try to get an idea, but it’s not foolproof.
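To make the kind of check we are after more concrete, here is a minimal sketch of comparing what the connector reports against what actually landed in the table. Everything in it is illustrative: `SyncCheck` and `classify` are hypothetical names, and the row counts are assumed to be collected separately (for example with `spark.table(name).count()` before and after the sync). The `append_only` flag is an assumption about the sync mode. With a deduplicating mode, fewer new rows than reported can be legitimate, so only the "nothing landed" case is flagged.

```python
from dataclasses import dataclass


@dataclass
class SyncCheck:
    """Hypothetical record of one stream's sync, for illustration only."""
    stream: str
    reported: int      # rows the connector status says it loaded
    rows_before: int   # table row count before the sync (collected separately)
    rows_after: int    # table row count after the sync (collected separately)

    @property
    def landed(self) -> int:
        # Net change in table size across the sync.
        return self.rows_after - self.rows_before


def classify(check: SyncCheck, append_only: bool = False) -> str:
    """Return 'missing', 'partial', or 'ok' for a single sync."""
    # Connector claims rows were loaded, but the table did not grow at all.
    if check.reported > 0 and check.landed <= 0:
        return "missing"
    # Only meaningful when every reported row should become a new table row.
    if append_only and check.landed < check.reported:
        return "partial"
    return "ok"


if __name__ == "__main__":
    # The case from this post: 432 rows reported, table has no data.
    print(classify(SyncCheck("cmqls", reported=432, rows_before=0, rows_after=0)))
```

This only detects the gap after the fact and does not fix the underlying load problem, but it is stricter than a volume anomaly test because it compares against the connector's own reported count rather than a historical baseline.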

Has anyone else had this issue, and are there any fixes apart from rerunning the connector?



This topic has been created from a Slack thread to give it more visibility.

["hubspot-connector", "databricks", "data-loading", "incremental-runs", "volume-anomaly-testing"]

We’ve raised this as a GitHub issue, as we find that at least once a week our HubSpot contacts table is missing users.

https://github.com/airbytehq/airbyte/issues/40591