Commit f4da19e
authored
Add Avro compression (#1976)
# Rationale for this change
PyIceberg did not compress the Avro. This will make gzip/deflate the
same as in Java.
# Are these changes tested?
Existing round-trip tests with FastAvro and Spark. Some tests are
extended to both write compressed and uncompressed data.
# Are there any user-facing changes?
Smaller and faster manifest files :)
<!-- In the case of user-facing changes, please add the changelog label.
-->1 parent ea57cbb commit f4da19e
File tree
8 files changed
+167
-29
lines changed- pyiceberg
- avro
- codecs
- table
- update
- tests
- integration
- test_writes
- utils
8 files changed
+167
-29
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
26 | 26 | | |
27 | 27 | | |
28 | 28 | | |
29 | | - | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
30 | 32 | | |
31 | 33 | | |
32 | 34 | | |
33 | 35 | | |
34 | 36 | | |
35 | 37 | | |
36 | 38 | | |
37 | | - | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
| 43 | + | |
38 | 44 | | |
39 | 45 | | |
40 | 46 | | |
41 | 47 | | |
42 | 48 | | |
43 | 49 | | |
| 50 | + | |
| 51 | + | |
| 52 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
35 | 35 | | |
36 | 36 | | |
37 | 37 | | |
38 | | - | |
| 38 | + | |
39 | 39 | | |
40 | 40 | | |
41 | 41 | | |
| |||
69 | 69 | | |
70 | 70 | | |
71 | 71 | | |
72 | | - | |
73 | 72 | | |
74 | 73 | | |
75 | 74 | | |
| |||
92 | 91 | | |
93 | 92 | | |
94 | 93 | | |
95 | | - | |
| 94 | + | |
| 95 | + | |
| 96 | + | |
96 | 97 | | |
97 | 98 | | |
98 | 99 | | |
99 | | - | |
| 100 | + | |
100 | 101 | | |
101 | 102 | | |
102 | 103 | | |
| |||
276 | 277 | | |
277 | 278 | | |
278 | 279 | | |
| 280 | + | |
| 281 | + | |
| 282 | + | |
| 283 | + | |
| 284 | + | |
| 285 | + | |
279 | 286 | | |
280 | | - | |
| 287 | + | |
| 288 | + | |
281 | 289 | | |
282 | 290 | | |
283 | 291 | | |
| 292 | + | |
| 293 | + | |
| 294 | + | |
| 295 | + | |
| 296 | + | |
| 297 | + | |
| 298 | + | |
| 299 | + | |
| 300 | + | |
| 301 | + | |
| 302 | + | |
| 303 | + | |
| 304 | + | |
| 305 | + | |
| 306 | + | |
| 307 | + | |
| 308 | + | |
| 309 | + | |
284 | 310 | | |
285 | 311 | | |
286 | 312 | | |
| |||
289 | 315 | | |
290 | 316 | | |
291 | 317 | | |
292 | | - | |
293 | | - | |
| 318 | + | |
| 319 | + | |
| 320 | + | |
| 321 | + | |
| 322 | + | |
| 323 | + | |
| 324 | + | |
| 325 | + | |
| 326 | + | |
294 | 327 | | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
37 | 37 | | |
38 | 38 | | |
39 | 39 | | |
| 40 | + | |
40 | 41 | | |
41 | 42 | | |
42 | 43 | | |
| |||
946 | 947 | | |
947 | 948 | | |
948 | 949 | | |
949 | | - | |
| 950 | + | |
950 | 951 | | |
951 | | - | |
| 952 | + | |
| 953 | + | |
| 954 | + | |
| 955 | + | |
| 956 | + | |
| 957 | + | |
| 958 | + | |
| 959 | + | |
952 | 960 | | |
953 | 961 | | |
954 | 962 | | |
| |||
963 | 971 | | |
964 | 972 | | |
965 | 973 | | |
| 974 | + | |
966 | 975 | | |
967 | 976 | | |
968 | 977 | | |
| |||
998 | 1007 | | |
999 | 1008 | | |
1000 | 1009 | | |
| 1010 | + | |
1001 | 1011 | | |
1002 | 1012 | | |
1003 | 1013 | | |
| |||
1109 | 1119 | | |
1110 | 1120 | | |
1111 | 1121 | | |
1112 | | - | |
1113 | | - | |
1114 | | - | |
1115 | | - | |
1116 | | - | |
1117 | | - | |
1118 | | - | |
| 1122 | + | |
| 1123 | + | |
| 1124 | + | |
| 1125 | + | |
| 1126 | + | |
| 1127 | + | |
| 1128 | + | |
| 1129 | + | |
| 1130 | + | |
1119 | 1131 | | |
1120 | 1132 | | |
1121 | 1133 | | |
| |||
1129 | 1141 | | |
1130 | 1142 | | |
1131 | 1143 | | |
1132 | | - | |
1133 | | - | |
| 1144 | + | |
| 1145 | + | |
| 1146 | + | |
| 1147 | + | |
| 1148 | + | |
| 1149 | + | |
| 1150 | + | |
| 1151 | + | |
| 1152 | + | |
1134 | 1153 | | |
1135 | 1154 | | |
1136 | 1155 | | |
| |||
1156 | 1175 | | |
1157 | 1176 | | |
1158 | 1177 | | |
1159 | | - | |
| 1178 | + | |
| 1179 | + | |
| 1180 | + | |
| 1181 | + | |
| 1182 | + | |
| 1183 | + | |
1160 | 1184 | | |
1161 | 1185 | | |
1162 | | - | |
| 1186 | + | |
1163 | 1187 | | |
1164 | | - | |
| 1188 | + | |
1165 | 1189 | | |
1166 | 1190 | | |
1167 | 1191 | | |
| |||
1211 | 1235 | | |
1212 | 1236 | | |
1213 | 1237 | | |
1214 | | - | |
| 1238 | + | |
| 1239 | + | |
| 1240 | + | |
| 1241 | + | |
| 1242 | + | |
| 1243 | + | |
| 1244 | + | |
1215 | 1245 | | |
1216 | 1246 | | |
1217 | 1247 | | |
1218 | 1248 | | |
1219 | 1249 | | |
1220 | 1250 | | |
1221 | 1251 | | |
| 1252 | + | |
1222 | 1253 | | |
1223 | 1254 | | |
1224 | 1255 | | |
| |||
1232 | 1263 | | |
1233 | 1264 | | |
1234 | 1265 | | |
1235 | | - | |
| 1266 | + | |
| 1267 | + | |
| 1268 | + | |
| 1269 | + | |
| 1270 | + | |
| 1271 | + | |
| 1272 | + | |
| 1273 | + | |
1236 | 1274 | | |
1237 | 1275 | | |
1238 | 1276 | | |
| |||
1241 | 1279 | | |
1242 | 1280 | | |
1243 | 1281 | | |
| 1282 | + | |
1244 | 1283 | | |
1245 | 1284 | | |
1246 | 1285 | | |
| |||
1275 | 1314 | | |
1276 | 1315 | | |
1277 | 1316 | | |
| 1317 | + | |
1278 | 1318 | | |
1279 | 1319 | | |
1280 | | - | |
| 1320 | + | |
1281 | 1321 | | |
1282 | 1322 | | |
1283 | 1323 | | |
1284 | | - | |
| 1324 | + | |
1285 | 1325 | | |
1286 | 1326 | | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
192 | 192 | | |
193 | 193 | | |
194 | 194 | | |
| 195 | + | |
| 196 | + | |
| 197 | + | |
195 | 198 | | |
196 | 199 | | |
197 | 200 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
27 | 27 | | |
28 | 28 | | |
29 | 29 | | |
| 30 | + | |
30 | 31 | | |
31 | 32 | | |
32 | 33 | | |
| |||
104 | 105 | | |
105 | 106 | | |
106 | 107 | | |
| 108 | + | |
107 | 109 | | |
108 | 110 | | |
109 | 111 | | |
| |||
126 | 128 | | |
127 | 129 | | |
128 | 130 | | |
| 131 | + | |
| 132 | + | |
| 133 | + | |
| 134 | + | |
| 135 | + | |
129 | 136 | | |
130 | 137 | | |
131 | 138 | | |
| |||
154 | 161 | | |
155 | 162 | | |
156 | 163 | | |
| 164 | + | |
157 | 165 | | |
158 | 166 | | |
159 | 167 | | |
| |||
184 | 192 | | |
185 | 193 | | |
186 | 194 | | |
| 195 | + | |
187 | 196 | | |
188 | 197 | | |
189 | 198 | | |
| |||
249 | 258 | | |
250 | 259 | | |
251 | 260 | | |
| 261 | + | |
252 | 262 | | |
253 | 263 | | |
254 | 264 | | |
255 | 265 | | |
256 | 266 | | |
257 | 267 | | |
| 268 | + | |
258 | 269 | | |
259 | 270 | | |
260 | 271 | | |
| |||
291 | 302 | | |
292 | 303 | | |
293 | 304 | | |
| 305 | + | |
294 | 306 | | |
295 | 307 | | |
296 | 308 | | |
| |||
416 | 428 | | |
417 | 429 | | |
418 | 430 | | |
| 431 | + | |
419 | 432 | | |
420 | 433 | | |
421 | 434 | | |
| |||
550 | 563 | | |
551 | 564 | | |
552 | 565 | | |
| 566 | + | |
553 | 567 | | |
554 | 568 | | |
555 | 569 | | |
| |||
0 commit comments