@@ -159,8 +159,15 @@ without prior discussion infeasible. If you are interested in contributing, plea
159
159
160
160
This repo uses submodules for non-Rust dependencies (e.g., for the zig fastlanez repo). Before building make sure to run
161
161
162
- * ` git submodule update --init --recursive `
163
- * ` ./zigup ` (this will install the zig version required by fastlanez)
162
+ ``` bash
163
+ git submodule update --init --recursive
164
+
165
+ # Install the zig version required by fastlanez
166
+ ./zigup
167
+
168
+ # Install Rye from https://rye-up.com, and setup the virtualenv
169
+ rye sync
170
+ ```
164
171
165
172
## License
166
173
@@ -172,24 +179,31 @@ This project is inspired by and--in some cases--directly based upon the existing
172
179
and OSS developers.
173
180
174
181
In particular, the following academic papers greatly influenced the development:
175
- * Maximilian Kuschewski, David Sauerwein, Adnan Alhomssi, and Viktor Leis. 2023. [ BtrBlocks: Efficient Columnar Compression
176
- for Data Lakes] ( https://www.cs.cit.tum.de/fileadmin/w00cfj/dis/papers/btrblocks.pdf ) . Proc. ACM Manag. Data 1, 2,
177
- Article 118 (June 2023), 14 pages. https://doi.org/10.1145/3589263
178
- * Azim Afroozeh and Peter Boncz. [ The FastLanes Compression Layout: Decoding >100 Billion Integers per Second with Scalar
179
- Code] ( https://www.vldb.org/pvldb/vol16/p2132-afroozeh.pdf ) . PVLDB, 16(9): 2132 - 2144, 2023.
180
- * Peter Boncz, Thomas Neumann, and Viktor Leis. [ FSST: Fast Random Access String
181
- Compression] ( https://www.vldb.org/pvldb/vol13/p2649-boncz.pdf ) .
182
- PVLDB, 13(11): 2649-2661, 2020.
183
- * Azim Afroozeh, Leonardo X. Kuffo, and Peter Boncz. 2023. [ ALP: Adaptive Lossless floating-Point
184
- Compression] ( https://ir.cwi.nl/pub/33334/33334.pdf ) . Proc. ACM
185
- Manag. Data 1, 4 (SIGMOD), Article 230 (December 2023), 26 pages. https://doi.org/10.1145/3626717
182
+
183
+ * Maximilian Kuschewski, David Sauerwein, Adnan Alhomssi, and Viktor Leis.
184
+ 2023 . [ BtrBlocks: Efficient Columnar Compression
185
+ for Data Lakes] ( https://www.cs.cit.tum.de/fileadmin/w00cfj/dis/papers/btrblocks.pdf ) . Proc. ACM Manag. Data 1,
186
+ 2,
187
+ Article 118 (June 2023), 14 pages. https://doi.org/10.1145/3589263
188
+ * Azim Afroozeh and Peter
189
+ Boncz. [ The FastLanes Compression Layout: Decoding >100 Billion Integers per Second with Scalar
190
+ Code] ( https://www.vldb.org/pvldb/vol16/p2132-afroozeh.pdf ) . PVLDB, 16(9): 2132 - 2144, 2023.
191
+ * Peter Boncz, Thomas Neumann, and Viktor Leis. [ FSST: Fast Random Access String
192
+ Compression] ( https://www.vldb.org/pvldb/vol13/p2649-boncz.pdf ) .
193
+ PVLDB, 13(11): 2649-2661, 2020.
194
+ * Azim Afroozeh, Leonardo X. Kuffo, and Peter Boncz. 2023. [ ALP: Adaptive Lossless floating-Point
195
+ Compression] ( https://ir.cwi.nl/pub/33334/33334.pdf ) . Proc. ACM
196
+ Manag. Data 1, 4 (SIGMOD), Article 230 (December 2023), 26 pages. https://doi.org/10.1145/3626717
186
197
187
198
Additionally, we benefited greatly from:
188
- * the collected OSS work of [ Daniel Lemire] ( https://github.com/lemire ) , such as [ FastPFor] ( https://github.com/lemire/FastPFor ) ,
189
- and [ StreamVByte] ( https://github.com/lemire/streamvbyte ) .
190
- * the [ parquet2] ( https://github.com/jorgecarleitao/parquet2 ) project by [ Jorge Leitao] ( https://github.com/jorgecarleitao ) .
199
+
200
+ * the collected OSS work of [ Daniel Lemire] ( https://github.com/lemire ) , such
201
+ as [ FastPFor] ( https://github.com/lemire/FastPFor ) ,
202
+ and [ StreamVByte] ( https://github.com/lemire/streamvbyte ) .
203
+ * the [ parquet2] ( https://github.com/jorgecarleitao/parquet2 ) project
204
+ by [ Jorge Leitao] ( https://github.com/jorgecarleitao ) .
191
205
* the public discussions around choices of compression codecs, as well as the C++ implementations thereof,
192
- from [ duckdb] ( https://github.com/duckdb/duckdb ) .
206
+ from [ duckdb] ( https://github.com/duckdb/duckdb ) .
193
207
* the existence, ideas, & implementation of the [ Apache Arrow] ( https://arrow.apache.org ) project.
194
208
* the [ Velox] ( https://github.com/facebookincubator/velox ) project and discussions with its maintainers.
195
209
0 commit comments