Question: How to optimize load_data-Operation

I want to copy my MySQL data (>200 Mio. rows) to BigQuery. Therefore I created a python script, which uses this library. At the moment it streams 1000 rows with one request and it generates about 1,1 requests/second. This is not really fast and it would take me days to transfer the whole dataset. I am sure that this can be optimized, but I don't know how. Would you have some suggestions? You can find my source code [here](https://github.com/inkrement/MySQLbq/blob/e8f484a5aac93dc77f687457424995a72ad4460b/run.py)

I thought about the following points:

 * Each request contains 1000 rows, should I choose a bigger number?
 * Does this library use gzip per default?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question: How to optimize load_data-Operation #2960

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Question: How to optimize load_data-Operation #2960

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions