Optimizing Connector Performance for S3 to Teradata

Summary

User is developing a connector in Java and Kotlin to transfer 36 GB of data (162M records) from an S3 bucket to Teradata, and is experiencing performance issues: the transfer takes over 7 hours. They seek advice on modifying the batch size and other optimizations.
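As a rough sanity check on the figures above (a sketch only; the 36 GB, 162M-record, and 7-hour numbers are taken from the question), the implied sustained throughput can be computed directly:

```java
public class ThroughputEstimate {
    // Average throughput in MB/s for a given byte count and duration in hours.
    public static double mbPerSecond(long bytes, double hours) {
        return bytes / (1024.0 * 1024.0) / (hours * 3600.0);
    }

    // Average rows per second for a given row count and duration in hours.
    public static double rowsPerSecond(long rows, double hours) {
        return rows / (hours * 3600.0);
    }

    public static void main(String[] args) {
        long bytes = 36L * 1024 * 1024 * 1024; // 36 GB, as reported
        long rows = 162_000_000L;              // 162M records, as reported
        System.out.printf("%.2f MB/s, %.0f rows/s%n",
                mbPerSecond(bytes, 7.0), rowsPerSecond(rows, 7.0));
    }
}
```

At roughly 1.5 MB/s and ~6,400 rows/s, the bottleneck is far below what either S3 reads or Teradata bulk loads can sustain, which is why batch size and load protocol are the natural first things to tune.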


Question

hello, I am coding my own connector in Java and Kotlin, fetching data from an S3 bucket (AWS) into Teradata. I have about 36 GB of data to load; each record contains an id, a JSON object, and a date (162M records in total). This is taking too much time: 7+ hours. I want to optimize that. How can I modify the batch_size? The default value seems to be 25 MB.
Can you suggest additional tweaks to optimize the performance, please?

Info: I am using the com.teradata.jdbc:terajdbc4:17.20.00.12 driver.
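In plain JDBC the batch size is under your control: you choose how many rows to accumulate before calling `executeBatch()`. A minimal sketch, assuming a simple three-column target table; the host, credentials, table name, and the 50,000-row batch size are placeholders to tune, and the `TYPE=FASTLOAD` URL parameter asks the Teradata JDBC driver to use its FastLoad protocol (which requires the target table to be empty):

```java
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.PreparedStatement;

public class TeradataBatchLoader {
    // Hypothetical connection details; TYPE=FASTLOAD enables the driver's
    // bulk-load protocol, SESSIONS controls parallel FastLoad sessions.
    private static final String URL =
            "jdbc:teradata://tdhost/TYPE=FASTLOAD,SESSIONS=8";
    private static final int BATCH_SIZE = 50_000; // rows per executeBatch(); tune this

    public record Record(long id, String json, java.sql.Timestamp date) {}

    public static void load(Iterable<Record> records) throws Exception {
        try (Connection conn = DriverManager.getConnection(URL, "user", "pass")) {
            conn.setAutoCommit(false); // batches are sent as blocks, committed once
            String sql = "INSERT INTO mydb.target (id, payload, created_at) VALUES (?, ?, ?)";
            try (PreparedStatement ps = conn.prepareStatement(sql)) {
                int pending = 0;
                for (Record r : records) {
                    ps.setLong(1, r.id());
                    ps.setString(2, r.json());
                    ps.setTimestamp(3, r.date());
                    ps.addBatch();
                    if (++pending == BATCH_SIZE) { // flush a full batch
                        ps.executeBatch();
                        pending = 0;
                    }
                }
                if (pending > 0) ps.executeBatch(); // flush the final partial batch
            }
            conn.commit();
        }
    }

    // Number of executeBatch() calls needed for a given row count (ceiling division).
    public static long batchCount(long rows, int batchSize) {
        return (rows + batchSize - 1) / batchSize;
    }
}
```

With large row batches the per-round-trip overhead is amortized far better than with small ones; 162M records at 50,000 rows per batch is 3,240 `executeBatch()` calls. Reading multiple S3 objects concurrently and feeding several loader threads is a common complementary tweak, though with FastLoad the empty-table restriction usually means loading into a staging table first.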



This topic has been created from a Slack thread to give it more visibility.
It is read-only here; the original thread remains available on Slack.


Tags: s3, teradata, batch-size, performance-optimization, java, kotlin, jdbc-driver