Commit 53f958e
Preserve formatting in concatenated IterableDataset (#7522)
* Preserve formatting in concatenated iterable dataset when the inputs have consistent formatting
* style
* If `dset._formatting` is None for any of the datasets, set the concatenated dataset format to None.
Add log line for inputs with inconsistent format.
* fix incorrect grouping
* Reset output formatting if any of the inputs has formatting not set
* log unset format also in case `formatting` is set, but `format_type` is None
* Update src/datasets/iterable_dataset.py
* Update src/datasets/iterable_dataset.py
* Update src/datasets/iterable_dataset.py
* Update iterable_dataset.py
---------
Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com>1 parent e38e6c2 commit 53f958e
File tree
2 files changed
+38
-1
lines changed- src/datasets
- tests
2 files changed
+38
-1
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
3430 | 3430 | | |
3431 | 3431 | | |
3432 | 3432 | | |
| 3433 | + | |
| 3434 | + | |
| 3435 | + | |
| 3436 | + | |
| 3437 | + | |
| 3438 | + | |
| 3439 | + | |
| 3440 | + | |
| 3441 | + | |
| 3442 | + | |
| 3443 | + | |
| 3444 | + | |
| 3445 | + | |
| 3446 | + | |
| 3447 | + | |
| 3448 | + | |
| 3449 | + | |
| 3450 | + | |
| 3451 | + | |
3433 | 3452 | | |
3434 | 3453 | | |
3435 | 3454 | | |
| |||
3451 | 3470 | | |
3452 | 3471 | | |
3453 | 3472 | | |
3454 | | - | |
| 3473 | + | |
| 3474 | + | |
| 3475 | + | |
| 3476 | + | |
| 3477 | + | |
| 3478 | + | |
| 3479 | + | |
3455 | 3480 | | |
3456 | 3481 | | |
3457 | 3482 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
2128 | 2128 | | |
2129 | 2129 | | |
2130 | 2130 | | |
| 2131 | + | |
| 2132 | + | |
| 2133 | + | |
| 2134 | + | |
| 2135 | + | |
| 2136 | + | |
| 2137 | + | |
| 2138 | + | |
| 2139 | + | |
| 2140 | + | |
| 2141 | + | |
| 2142 | + | |
2131 | 2143 | | |
2132 | 2144 | | |
2133 | 2145 | | |
| |||
0 commit comments