Fix: Downsample before embedding in multi-resolution mode and fix fps… #32

maliozer · 2025-06-15T14:15:37Z

Ensure correct order of FPS downsampling and embedding for multi-resolution inputs; fix import location of fps

Summary

This PR fixes the following issues in the PerceiverCrossAttentionEncoder._forward() method:

Correct import location for fps:
The function fps from torch_cluster is now imported directly before it is used in the multi-resolution downsampling block, preventing NameError.

Correct order of downsampling and embedding:
Multi-resolution FPS downsampling of point clouds and associated features (pc, feats, sharp_pc, sharp_feat) is now performed before any embedding or projection. This ensures that only the downsampled tensors are passed through the embedder and input projections, so shapes are always aligned. No redundant recomputation is performed.

Changes

Moved from torch_cluster import fps to immediately before usage in the if self.use_multi_reso: block.
Downsample input tensors first (if use_multi_reso is enabled), and then perform embedding/projection.
Prevents mismatches in shape/dimension and avoids unnecessary recomputation.
All logic paths now consistently process only the current (possibly downsampled) batch.

Related Issue: #31

… import location

Fix: Downsample before embedding in multi-resolution mode and fix fps…

292887e

… import location

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix: Downsample before embedding in multi-resolution mode and fix fps… #32

Fix: Downsample before embedding in multi-resolution mode and fix fps… #32

Uh oh!

maliozer commented Jun 15, 2025

Uh oh!

Uh oh!

Fix: Downsample before embedding in multi-resolution mode and fix fps… #32

Are you sure you want to change the base?

Fix: Downsample before embedding in multi-resolution mode and fix fps… #32

Uh oh!

Conversation

maliozer commented Jun 15, 2025

Summary

Changes

Uh oh!

Uh oh!