Expose OpenMP backends to more analysis methods

## Is your feature request related to a problem? ##
Some analysis tools rely on underlying libraries that have both OpenMP and serial implementations, but only ever allow the serial implementation to run. InterRdf is a good example of this. In the main loop:
* The tool calls [pairs.capped_distance](https://github.com/MDAnalysis/mdanalysis/blob/develop/package/MDAnalysis/analysis/rdf.py#L355)
* pairs.capped_distance dispatches either to an neighbor search grid (serial) or a [brute force method](https://github.com/MDAnalysis/mdanalysis/blob/149eb504e35670568201af28306fa04462f6c100/package/MDAnalysis/lib/distances.py#L481) 
* The brute force implementation calls [distance_array](https://github.com/MDAnalysis/mdanalysis/blob/149eb504e35670568201af28306fa04462f6c100/package/MDAnalysis/lib/distances.py#L171). distance_array has an input kwarg to determine serial or parallel execution, but this method is not invoked with that kwarg in this call sequence, and so always defaults to serial.

## Describe the solution you'd like ##
Allow users to accelerate RDF and other routines with existing parallel implementations. A demo implementation (not ready for submisison) can be found on my fork [here](https://github.com/scal444/mdanalysis/commit/8673b530d2a7bd0f5f7b3da84b8b5c4301858d4d). Some local benchmarks on my Ryzen 5:


![perf_comparison](https://user-images.githubusercontent.com/18430915/137157771-c044434e-7ac1-4569-8878-a5b90b9e824c.png)

I tuned the brute force thread count with the OMP_NUM_THREADS env variable while running asv. Isolating a benchmark with 2000 atoms, we get linear scaling of performance per openmp thread. Also of interest is the very poor scaling of the nsgrid implementation, but that's another issue (and in smaller benchmarks, nsgrid outperforms brute force for shorter cutoffs).

## Describe alternatives you've considered ##

There are two questions here -

1. How should the ability to choose a backend be exposed to users?

This could be done test-by-test (see hacky example [here](https://github.com/scal444/mdanalysis/commit/8673b530d2a7bd0f5f7b3da84b8b5c4301858d4d)), but it may be worth looking into something more standard.

2. Should mdanalysis try to dispatch to OpenMP by default if it exists?

OpenMP support is easily detected, and IIRC other MDAnalysis dependencies like Numpy already implement transparent multithreading for some routines.

## Additional context ##

Looking for feedback / input.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Expose OpenMP backends to more analysis methods #3435

Is your feature request related to a problem?

Describe the solution you'd like

Describe alternatives you've considered

Additional context

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Expose OpenMP backends to more analysis methods #3435

Description

Is your feature request related to a problem?

Describe the solution you'd like

Describe alternatives you've considered

Additional context

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions