- Is this your first time deploying Airbyte?: Yes
- OS Version / Instance: Linux
- Memory / Disk: 16GB
- Deployment: Docker
- Airbyte Version: What version are you using now? 0.40.22
- Source name/version: -
- Destination name/version: N/A
- Step: Adding Source, Connection with File connector 0.2.31
- Description:
I have a bunch of CSVs I’ve added as a Source for the File-Connector.
I’m specifying the dtypes in the “Reader Options” like this:
{"dtype": {"Open": "float64", "High": "float64", "Low": "float64", "Close": "float64", "Adj Close": "float64", "Volume": "int64"}}
However when I have a connection set-up with a Postgres destination all columns are CAST into texts.
It seems to be specific to Yahoo Finance (Example URL here) where I’m getting the data from. Other sources providing CSVs work correctly.
RAW-JSON from the File-Source looks like this:
{"Low": 287.119995, "Date": "1987-03-09", "High": 290.660004, "Open": 290.660004, "Close": 288.299988, "Volume": 165400000, "Adj Close": 288.299988}
In the logs there is this:
i.a.w.g.DefaultReplicationWorker(lambda$readFromSrcAndWriteToDstRunnable$5):336 - Schema validation errors found for stream _YF_^GSPC_1D. Error messages: [$.Low is of an incorrect type. Expected it to be [string, null], $.Close is of an incorrect type. Expected it to be [string, null], $.Adj Close is of an incorrect type. Expected it to be [string, null], $.Volume is of an incorrect type. Expected it to be [string, null], $.High is of an incorrect type. Expected it to be [string, null], $.Open is of an incorrect type. Expected it to be [string, null]]
Any idea why this like this and how this can be changed?