perf: benchmarking #10

aljazerzen · 2024-02-22T07:46:55Z

ATM I have no idea how performant this connector is. Surly it is slower than using plain connections to the database and not converting to Arrow at all.

But how does it compare to:

pandas.read_sql,
polars.read_database,
connectorx,
ADBC,
dyplr?

Ideally, I would reuse the benchmarks from the connector-x project, but I'm not sure how portable they are.

The text was updated successfully, but these errors were encountered:

aljazerzen · 2024-02-22T08:20:42Z

This is also needed because right now, it doesn't make sense making any performance optimizations, since I have no way of verifying that they are even any faster.

aljazerzen · 2024-03-19T08:16:37Z

There is something similar here: https://github.com/pola-rs/tpch

Ideally, I would be able to benchmark "getting X amount of data from a data store Y into memory", for a few different amounts of data X and for all supported data stores Y.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf: benchmarking #10

perf: benchmarking #10

aljazerzen commented Feb 22, 2024

aljazerzen commented Feb 22, 2024

aljazerzen commented Mar 19, 2024

perf: benchmarking #10

perf: benchmarking #10

Comments

aljazerzen commented Feb 22, 2024

aljazerzen commented Feb 22, 2024

aljazerzen commented Mar 19, 2024