Error reading Parquet file from S3 as a source

Summary

Error occurs when trying to read a Parquet file from S3 as a source in Airbyte. The error message indicates a ‘NoneType’ object is not iterable.


Question

Hey team,
I’m having error when reading parquet from S3 as a source.

The parquet file is actually coming from Airbyte it self from other destination connection, but somehow can’t read it back as a souce
File "/usr/local/lib/python3.9/site-packages/airbyte_cdk/sources/file_based/file_types/parquet_parser.py", line 75, in <dictcomp>\n column: ParquetParser._to_output_value(batch.column(column)[row], parquet_format)\n File "/usr/local/lib/python3.9/site-packages/airbyte_cdk/sources/file_based/file_types/parquet_parser.py", line 122, in _to_output_value\n return {k: v for k, v in parquet_value.as_py()}\nTypeError: \'NoneType\' object is not iterable\n\nThe above exception was the direct cause of the following exception:\n\nTraceback (most recent call last)



This topic has been created from a Slack thread to give it more visibility.
It will be on Read-Only mode here. Click here if you want to access the original thread.

Join the conversation on Slack

["error", "reading-parquet", "s3-source", "airbyte"]