Skip to content

Conversation

greole
Copy link
Collaborator

@greole greole commented Feb 13, 2024

For a good overall performance, typically a decomposition into 2n_GPUs up to 4n_GPUs subdomains performs best. This PR enables efficient on device repartitioning, such that decompositions up to nCPUs subdomains work without affecting performance negatively.

@greole
Copy link
Collaborator Author

greole commented May 14, 2024

Close in favour of #115

@greole greole closed this May 14, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant