Refactor `BasePlot` not to create a dataframe representation of the data

### What kind of feature would you like to request?

Additional function parameters / changed functionality / changed defaults?

### Please describe your wishes

See  #3717 for what prompted me to look at this code.

Currently `BasePlot` creates a in-memory copy as a dataframe of the main data of interest (obsm, X, layers etc.): https://github.com/scverse/scanpy/blob/0b82c934edeb640095df14e60725ed1fad6eebb1/src/scanpy/plotting/_baseplot_class.py#L148-L157

I believe this to be unnecessary as this dataframe is only ever used for `groupby` operations, for which we have a zero-copy solution in https://scanpy.readthedocs.io/en/latest/generated/scanpy.get.aggregate.html

Thus we should

- [ ] Refactor `Baseplot` not to create a copy
- [ ] Use the  https://scanpy.readthedocs.io/en/latest/generated/scanpy.get.aggregate.html for aggregation
- [ ] Ensure this doesn't affect performance
- [ ] Integrate #3700

	self.categories, self.obs_tidy = _prepare_dataframe(
	adata,
	self.var_names,
	groupby,
	use_raw=use_raw,
	log=log,
	num_categories=num_categories,
	layer=layer,
	gene_symbols=gene_symbols,
	)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor `BasePlot` not to create a dataframe representation of the data #3718

What kind of feature would you like to request?

Please describe your wishes

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Refactor BasePlot not to create a dataframe representation of the data #3718

Description

What kind of feature would you like to request?

Please describe your wishes

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

Refactor `BasePlot` not to create a dataframe representation of the data #3718