Commit 856597b
authored
feat(writer): Add clustered and fanout writer (apache#1735)
## Which issue does this PR close?
- Closes apache#1572 apache#1573
## What changes are included in this PR?
New:
- Added new `partitioning` module with `PartitioningWriter` trait
- `ClusteredWriter`: Optimized for pre-sorted data, requires writing in
partition order
- `FanoutWriter`: Flexible writer that can handle data from any
partition at any time
Modification:
- (BREAKING) Modified `DataFileWriterBuilder` to support dynamic
partition assignment
- Updated DataFusion integration to use the new writer API
## Are these changes tested?
Added unit tests1 parent 273991e commit 856597b
File tree
12 files changed
+1024
-84
lines changed- crates
- iceberg/src/writer
- base_writer
- file_writer
- partitioning
- integrations/datafusion/src/physical_plan
12 files changed
+1024
-84
lines changedLines changed: 9 additions & 17 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
30 | 30 | | |
31 | 31 | | |
32 | 32 | | |
33 | | - | |
34 | 33 | | |
35 | 34 | | |
36 | 35 | | |
| |||
40 | 39 | | |
41 | 40 | | |
42 | 41 | | |
43 | | - | |
44 | | - | |
45 | | - | |
46 | | - | |
47 | | - | |
48 | | - | |
49 | | - | |
50 | | - | |
| 42 | + | |
| 43 | + | |
51 | 44 | | |
52 | 45 | | |
53 | 46 | | |
| |||
60 | 53 | | |
61 | 54 | | |
62 | 55 | | |
63 | | - | |
| 56 | + | |
64 | 57 | | |
65 | 58 | | |
66 | | - | |
| 59 | + | |
67 | 60 | | |
68 | 61 | | |
69 | 62 | | |
| |||
194 | 187 | | |
195 | 188 | | |
196 | 189 | | |
197 | | - | |
198 | | - | |
| 190 | + | |
| 191 | + | |
199 | 192 | | |
200 | 193 | | |
201 | 194 | | |
| |||
280 | 273 | | |
281 | 274 | | |
282 | 275 | | |
283 | | - | |
284 | | - | |
285 | | - | |
286 | | - | |
| 276 | + | |
| 277 | + | |
| 278 | + | |
287 | 279 | | |
288 | 280 | | |
289 | 281 | | |
| |||
Lines changed: 18 additions & 24 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
66 | 66 | | |
67 | 67 | | |
68 | 68 | | |
69 | | - | |
70 | 69 | | |
71 | 70 | | |
72 | 71 | | |
73 | 72 | | |
74 | | - | |
75 | | - | |
76 | | - | |
77 | | - | |
78 | | - | |
| 73 | + | |
79 | 74 | | |
80 | 75 | | |
81 | 76 | | |
| |||
110 | 105 | | |
111 | 106 | | |
112 | 107 | | |
113 | | - | |
114 | 108 | | |
115 | 109 | | |
116 | 110 | | |
| |||
129 | 123 | | |
130 | 124 | | |
131 | 125 | | |
132 | | - | |
| 126 | + | |
133 | 127 | | |
134 | | - | |
| 128 | + | |
135 | 129 | | |
136 | 130 | | |
137 | | - | |
| 131 | + | |
138 | 132 | | |
139 | 133 | | |
140 | 134 | | |
| |||
428 | 422 | | |
429 | 423 | | |
430 | 424 | | |
431 | | - | |
| 425 | + | |
432 | 426 | | |
433 | 427 | | |
434 | 428 | | |
| |||
444 | 438 | | |
445 | 439 | | |
446 | 440 | | |
447 | | - | |
| 441 | + | |
448 | 442 | | |
449 | 443 | | |
450 | 444 | | |
| |||
531 | 525 | | |
532 | 526 | | |
533 | 527 | | |
534 | | - | |
535 | | - | |
| 528 | + | |
| 529 | + | |
536 | 530 | | |
537 | | - | |
| 531 | + | |
538 | 532 | | |
539 | | - | |
| 533 | + | |
540 | 534 | | |
541 | | - | |
542 | | - | |
543 | | - | |
| 535 | + | |
| 536 | + | |
| 537 | + | |
544 | 538 | | |
545 | | - | |
546 | | - | |
| 539 | + | |
| 540 | + | |
547 | 541 | | |
548 | 542 | | |
549 | 543 | | |
| |||
597 | 591 | | |
598 | 592 | | |
599 | 593 | | |
600 | | - | |
| 594 | + | |
601 | 595 | | |
602 | 596 | | |
603 | 597 | | |
| |||
611 | 605 | | |
612 | 606 | | |
613 | 607 | | |
614 | | - | |
| 608 | + | |
615 | 609 | | |
616 | 610 | | |
617 | 611 | | |
| |||
795 | 789 | | |
796 | 790 | | |
797 | 791 | | |
798 | | - | |
| 792 | + | |
799 | 793 | | |
800 | 794 | | |
801 | 795 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
329 | 329 | | |
330 | 330 | | |
331 | 331 | | |
332 | | - | |
333 | | - | |
| 332 | + | |
334 | 333 | | |
335 | 334 | | |
336 | | - | |
| 335 | + | |
337 | 336 | | |
338 | 337 | | |
339 | 338 | | |
| |||
388 | 387 | | |
389 | 388 | | |
390 | 389 | | |
391 | | - | |
| 390 | + | |
392 | 391 | | |
393 | 392 | | |
394 | | - | |
| 393 | + | |
395 | 394 | | |
396 | 395 | | |
397 | 396 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
100 | 100 | | |
101 | 101 | | |
102 | 102 | | |
103 | | - | |
104 | | - | |
| 103 | + | |
105 | 104 | | |
106 | | - | |
| 105 | + | |
107 | 106 | | |
108 | 107 | | |
109 | 108 | | |
| |||
122 | 121 | | |
123 | 122 | | |
124 | 123 | | |
125 | | - | |
| 124 | + | |
126 | 125 | | |
127 | 126 | | |
128 | 127 | | |
| |||
149 | 148 | | |
150 | 149 | | |
151 | 150 | | |
152 | | - | |
| 151 | + | |
153 | 152 | | |
154 | | - | |
| 153 | + | |
155 | 154 | | |
156 | 155 | | |
157 | 156 | | |
| |||
231 | 230 | | |
232 | 231 | | |
233 | 232 | | |
234 | | - | |
235 | | - | |
| 233 | + | |
236 | 234 | | |
237 | 235 | | |
238 | 236 | | |
239 | | - | |
| 237 | + | |
| 238 | + | |
| 239 | + | |
| 240 | + | |
240 | 241 | | |
241 | 242 | | |
242 | 243 | | |
243 | 244 | | |
244 | 245 | | |
245 | 246 | | |
246 | 247 | | |
| 248 | + | |
| 249 | + | |
| 250 | + | |
247 | 251 | | |
248 | 252 | | |
249 | 253 | | |
250 | 254 | | |
251 | | - | |
| 255 | + | |
252 | 256 | | |
253 | 257 | | |
254 | 258 | | |
| |||
260 | 264 | | |
261 | 265 | | |
262 | 266 | | |
263 | | - | |
264 | | - | |
| 267 | + | |
| 268 | + | |
265 | 269 | | |
266 | 270 | | |
267 | 271 | | |
| |||
0 commit comments