Replies: 2 comments
-
We send a lot of CSV results through to duckdb-wasm in the browser, and it is always pretty much a similar size or better (however we're not dealing with 150mb files). We wrote our own csv-to-arrow implementation which dictionary encodes and chooses the smallest datatype when possible (eg Int8 vs BigInt64). |
Beta Was this translation helpful? Give feedback.
-
Arrow Flight is not supported. @sgilligan & @fadmaa: I would give it a try here to duckdb's own storage format, basically having server side a duckdb instance (natively or via python / nodejs clients) parse the CSV and exporting a duckdb database, to then be send to duckdb-wasm and read natively. Otherwise Parquet or Arrow should also be good to go (potentially also converted by DuckDB!) |
Beta Was this translation helpful? Give feedback.
-
Hello,
I am considering my options for getting data remotely from a service and loading it into duckdb-wasm in the browser.
The remote service has the data in CSV and will convert it to a format more suitable for Duckdb Wasm... Arrow Flight would probably be ideal but as far as I can tell, it is not supported. Am I right?
Also, when I experimented with Arrow IPC, I found it extremely verbose (a 150MB CSV file resulted in about 2GB in Arrow IPC format)... is that something other people experienced or am I doing something wrong?
Thanks!
Beta Was this translation helpful? Give feedback.
All reactions