Summary
The configuration check fails when setting up a GCS source because the “Globs” pattern does not match any files in the specified bucket. The target is a compressed CSV file named ‘salesreport_202401.zip’ in the subfolder ‘sales’. The pattern ‘sales/salesreport_*.zip’ was tried without success.
Question
Configuration check failed while setting up the GCS source bucket-pubsite_prod_rev_XXXXXXXXXXXXXXXXXXXX. I can’t get the “Globs” pattern to find any files in the bucket.
This is for a compressed .CSV file in a GCS bucket, in the subfolder sales: salesreport_202401.zip. I tried sales/salesreport_*.zip, but that doesn’t work.
```
Traceback (most recent call last):
  File "/usr/local/lib/python3.9/site-packages/airbyte_cdk/sources/file_based/availability_strategy/default_file_based_availability_strategy.py", line 81, in _check_list_files
    file = next(iter(stream.get_files()))
StopIteration

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/usr/local/lib/python3.9/site-packages/airbyte_cdk/sources/file_based/availability_strategy/default_file_based_availability_strategy.py", line 62, in check_availability_and_parsability
    file = self._check_list_files(stream)
  File "/usr/local/lib/python3.9/site-packages/airbyte_cdk/sources/file_based/availability_strategy/default_file_based_availability_strategy.py", line 83, in _check_list_files
    raise CheckAvailabilityError(FileBasedSourceError.EMPTY_STREAM, stream=stream.name)
airbyte_cdk.sources.file_based.exceptions.CheckAvailabilityError: No files were identified in the stream. This may be because there are no files in the specified container, or because your glob patterns did not match any files. Please verify that your source contains files last modified after the start_date and that your glob patterns are not overly strict. Contact Support if you need assistance.
stream=salesreport
```
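
The error names two possible causes: a glob pattern that matches nothing, or files older than the start_date. One way to rule out the first is to test the pattern locally against the exact object keys. The following is a minimal sketch, assuming Python's standard-library `fnmatch` as a rough stand-in for the connector's glob matching. The connector's own matcher may behave differently, for example in how `*` treats `/`. The helper `matching_keys` and the list of keys are hypothetical and only serve as illustration.

```python
from fnmatch import fnmatchcase


def matching_keys(keys, pattern):
    """Return the object keys that the glob pattern matches (case-sensitive)."""
    return [key for key in keys if fnmatchcase(key, pattern)]


# Hypothetical object keys, written relative to the bucket root.
keys = [
    "sales/salesreport_202401.zip",
    "Sales/salesreport_202401.zip",          # different case in the folder name
    "reports/sales/salesreport_202401.zip",  # extra leading folder
    "sales/salesreport_202401.csv",          # different extension
]

pattern = "sales/salesreport_*.zip"
print(matching_keys(keys, pattern))
```

If the real object key appears in the printed list, the pattern itself is probably fine, and the start_date setting or the exact key prefix in the bucket may be worth checking instead.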
---
This topic has been created from a Slack thread to give it more visibility.
It is in read-only mode here. [Click here](https://airbytehq.slack.com/archives/C027KKE4BCZ/p1706554495650149) if you want to access the original thread.
<sub>
["configuration-check", "gcs-source", "globs-pattern", "compressed-csv", "sales-subfolder", "salesreport-202401.zip"]
</sub>