Handle nesting for ConvertDType, ToArray, adapt concatenate dispatch #503

vpratz · 2025-06-01T10:58:57Z

This PR generalizes the transforms from the Approximator.create_default method, so that they can be used with nested inputs as well. For ConvertDType and ToArray, it supplies a generalization that works on dictionaries.
Edit: For Concatenate, it adds a minor change in the Adapter.concatenate dispatch function to detect the special case of only one key being present also if it is nested in a sequence of length one.

The recursive structure of nested inputs requires some extra utility functions, which I put into bayesflow.utils.tree.

Concatenate can be equal to rename if only one key is supplied. By not calling concatenate in that case, we can accept arbitrary inputs in the transform, as long as only one is supplied. This simplifies things e.g. in the `BasicWorkflow`, where the user passes the `summary_variables` to concatenate, which may be a single dict, which does not need to be concatenated.

codecov · 2025-06-01T11:11:52Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Files with missing lines	Coverage Δ
bayesflow/adapters/adapter.py	`86.34% <100.00%> (+0.51%)`	⬆️
bayesflow/adapters/transforms/convert_dtype.py	`100.00% <100.00%> (ø)`
bayesflow/adapters/transforms/to_array.py	`82.75% <100.00%> (+3.59%)`	⬆️
bayesflow/utils/tree.py	`100.00% <100.00%> (ø)`

... and 11 files with indirect coverage changes

LarsKue · 2025-06-05T18:18:15Z

@vpratz Isn't the concat issue already addressed by this if-else in the adapter dispatch?

bayesflow/bayesflow/adapters/adapter.py

Lines 485 to 486 in 449a79a

    
           if isinstance(keys, str): 
        
               transform = Rename(keys, to_key=into)

vpratz · 2025-06-05T20:32:53Z

Thanks for the response. I didn't see that, I encountered an error in the transform, probably because I passed a list of length one instead of a string. With that additional context, I think it's better to just add that case in the dispatch as well, and revert the changes in the transform.

Moves the fix from the Concatenate transform to the concatenate method of the adapter.

LarsKue

Thank you for the PR. I think we should discuss these changes a bit more before we move on with an implementation. See comments.

bayesflow/adapters/adapter.py

bayesflow/adapters/transforms/convert_dtype.py

bayesflow/adapters/transforms/to_array.py

LarsKue · 2025-06-14T17:12:18Z

bayesflow/adapters/transforms/to_array.py

I think the added complexity to these transforms is not worth it. Why do we need to support sampling nested dictionaries?

LarsKue · 2025-06-14T17:13:33Z

bayesflow/utils/tree.py

Utilities like these are very general, and need to be extensively documented and tested. I find it somewhat opaque for now what these are trying to achieve, and why this functionality isn't already present in optree or keras.tree.

vpratz · 2025-06-14T18:03:59Z

Thanks a lot for the review! I'm not sure if the motivation behind the changes is totally clear yet. I'll try to describe it once more and we can exchange ideas on how (and if) we want to proceed.
The scenario I would like to improve is the following:

a user wants to do inference on multi-modal data using a FusionNetwork, and in their simulator directly specifies summary_variables as a dictionary that will work with the FusionNetwork.
The user then wants to use the BasicWorkflow and sets summary_variables=["summary_variables"].
This results in errors, as the transforms in the default adapter cannot handle the dictionary that was provided as summary_variables.

We already have ways to work around this using group and ungroup, but this requires manually specifying the adapter and knowing how to achieve this. The main motivation for the changes at hand is to make the defaults general enough that users do not encounter errors, and that the question what is and isn't possible with the Adapter does not get in their way if they only use the defaults.

@stefanradev93 mentioned that in the future we might see more summary networks that require dictionaries as inputs, so I figured this might become a more common scenario and worth resolving.

Most of the complexity is due to trying to keep ToArray invertible. If we do not consider this important, this could be simplified.
Other avenues (like adapting the creation of the default adapter) would be possible as well.

What do you think? Is this a change we want, and if so, what would be the best way to get there?

stefanradev93 · 2025-06-14T18:06:35Z

Yes, dictionary inputs and composite summary networks will be increasingly important.

LarsKue · 2025-06-14T18:34:58Z

I understand the motivation, but I don't think the Adapter should handle the increased complexity. Instead, I think we should look moreso into the Approximator and how we can facilitate having multiple summary networks there.

vpratz · 2025-06-14T19:44:41Z

@LarsKue This is not only about multiple summary networks, but any summary network that requires multiple inputs that do not fit in one tensor (see Stefan's comment above). Do you have any ideas regarding that?

- simplify map_dict to only a single structure, as we probably will not require the more general behavior. Add test and docstring. - remove tree functions that were required for restoring original types - minor cleanups to account for review comments

vpratz · 2025-06-15T09:19:49Z

I have reduced the complexity a lot by dropping the inverse in ToArray, and reduced the tree utility to the minimum necessary. I think with those changes the code would be manageable.
I'm not yet convinced there is a better place to handle this. Manually handling dictionaries everywhere they might come up will probably be more complex and prone to inconsistencies than supporting them here.

stefanradev93 · 2025-06-18T08:42:10Z

Yes, mappings and sequences should be handled by the adapter. Downstream networks should only consume what the adapter provides.

vpratz requested review from LarsKue and stefanradev93 June 2, 2025 11:53

vpratz added 3 commits June 5, 2025 22:33

Merge remote-tracking branch 'upstream/dev' into feat-adapter-nested

d4cb1ac

use Rename instead of Concatenate if only one key is supplied

a47fb3d

Moves the fix from the Concatenate transform to the concatenate method of the adapter.

add test for concatenate to rename conversion

c145fe3

vpratz changed the title ~~Handle nesting for ConvertDType, ToArray. Relax conditions for Concatenate if only one key is present~~ Handle nesting for ConvertDType, ToArray, adapt concatenate dispatch Jun 7, 2025

vpratz mentioned this pull request Jun 14, 2025

Release 2.0.4 #510

Merged

Merge remote-tracking branch 'upstream/dev' into feat-adapter-nested

c421f92

LarsKue requested changes Jun 14, 2025

View reviewed changes

vpratz added 4 commits June 14, 2025 21:34

Merge remote-tracking branch 'upstream/dev' into feat-adapter-nested

2f78f65

add and adapt type hints

608a6f4

Merge remote-tracking branch 'upstream/dev' into feat-adapter-nested

5e274f7

vpratz mentioned this pull request Jun 18, 2025

Design Discussion: Incorporating multi-input networks #517

Open

vpratz marked this pull request as draft June 30, 2025 17:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Handle nesting for ConvertDType, ToArray, adapt concatenate dispatch #503

Handle nesting for ConvertDType, ToArray, adapt concatenate dispatch #503

Uh oh!

vpratz commented Jun 1, 2025 •

edited

Loading

Uh oh!

codecov bot commented Jun 1, 2025 •

edited

Loading

Uh oh!

LarsKue commented Jun 5, 2025

Uh oh!

vpratz commented Jun 5, 2025

Uh oh!

LarsKue left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

LarsKue Jun 14, 2025

Uh oh!

LarsKue Jun 14, 2025

Uh oh!

vpratz commented Jun 14, 2025

Uh oh!

stefanradev93 commented Jun 14, 2025

Uh oh!

LarsKue commented Jun 14, 2025

Uh oh!

vpratz commented Jun 14, 2025

Uh oh!

vpratz commented Jun 15, 2025

Uh oh!

stefanradev93 commented Jun 18, 2025

Uh oh!

Uh oh!

Handle nesting for ConvertDType, ToArray, adapt concatenate dispatch #503

Are you sure you want to change the base?

Handle nesting for ConvertDType, ToArray, adapt concatenate dispatch #503

Uh oh!

Conversation

vpratz commented Jun 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Jun 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

LarsKue commented Jun 5, 2025

Uh oh!

vpratz commented Jun 5, 2025

Uh oh!

LarsKue left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

LarsKue Jun 14, 2025

Choose a reason for hiding this comment

Uh oh!

LarsKue Jun 14, 2025

Choose a reason for hiding this comment

Uh oh!

vpratz commented Jun 14, 2025

Uh oh!

stefanradev93 commented Jun 14, 2025

Uh oh!

LarsKue commented Jun 14, 2025

Uh oh!

vpratz commented Jun 14, 2025

Uh oh!

vpratz commented Jun 15, 2025

Uh oh!

stefanradev93 commented Jun 18, 2025

Uh oh!

Uh oh!

vpratz commented Jun 1, 2025 •

edited

Loading

codecov bot commented Jun 1, 2025 •

edited

Loading