Add methods to form thermometer codes #93

denkle · 2022-11-11T15:48:04Z

No description provided.

mikeheddes

Thank you for the PR, it's looking good! I just have a couple comments about some details.

mikeheddes · 2022-11-11T18:31:37Z

torchhd/functional.py

+        device=rand_hv.device,
+    )
+
+    for i in range(num_vectors):


Could you run a quick timing benchmark of your implementation against something like torch.triu_indices? That would let you set all elements in parallel, so I suspect it would be quite a bit faster, especially as the number of dimensions increases, but I haven't tried it.

Great point! I agree it is a no-brainer that indexing will make the implementation much faster, so I modified the code accordingly.
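To make the comparison concrete, here is a minimal sketch of the two approaches, not the code from this PR. It assumes a bipolar (+1/-1) code with dimensions + 1 rows, and the function names thermometer_loop and thermometer_indexed are hypothetical. It uses torch.tril_indices, the lower-triangular counterpart of the torch.triu_indices suggested above.

```python
import timeit

import torch


def thermometer_loop(dimensions: int) -> torch.Tensor:
    """Row i has its first i elements set to +1 and the rest to -1, one row at a time."""
    hv = torch.full((dimensions + 1, dimensions), -1.0)
    for i in range(dimensions + 1):
        hv[i, :i] = 1.0
    return hv


def thermometer_indexed(dimensions: int) -> torch.Tensor:
    """Same result, but every +1 is written by a single indexing operation."""
    hv = torch.full((dimensions + 1, dimensions), -1.0)
    # offset=-1 selects the positions (i, j) with j < i, strictly below the diagonal
    rows, cols = torch.tril_indices(dimensions + 1, dimensions, offset=-1)
    hv[rows, cols] = 1.0
    return hv


if __name__ == "__main__":
    # Rough timing only; absolute numbers depend on hardware and PyTorch version.
    for d in (100, 1000):
        t_loop = timeit.timeit(lambda: thermometer_loop(d), number=5)
        t_idx = timeit.timeit(lambda: thermometer_indexed(d), number=5)
        print(f"dimensions={d}: loop {t_loop:.4f}s, indexed {t_idx:.4f}s")
```

Both functions return the same tensor, so the benchmark measures only how the +1 entries are written.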

mikeheddes · 2022-11-11T18:35:00Z

torchhd/functional.py

+    )
+
+    for i in range(num_vectors):
+        if (model == BSC) | (model == FHRR):


I am wondering whether FHRR shouldn't also use bipolar elements, since it is a generalization of the bipolar hypervectors. This would also be in line with the model-agnostic version I mentioned in another comment, which uses the identity and negative identity.

mikeheddes · 2022-11-11T18:38:30Z

torchhd/functional.py

+    requires_grad=False,
+    **kwargs,
+) -> VSA_Model:
+    """Creates a thermometer code for given dimensionality.


Maybe another way to look at the thermometer codes is to take the first hypervector as the identity hypervector of the model and the last hypervector as the negative of the identity, and then interpolate between them. This could make the function agnostic to the model used.

We need to save this point somewhere! I think the current implementation using torch.tril_indices is pretty compact, but as we expand the number of supported HD/VSA models we might want to switch to this idea of interpolation.
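The interpolation idea could look roughly like the sketch below. It is an illustration under stated assumptions, not torchhd code: the function name thermometer_by_interpolation is hypothetical, and the "negative" of the identity is assumed to be elementwise negation, which fits bipolar elements but would need a model-specific replacement for a binary model such as BSC.

```python
import torch


def thermometer_by_interpolation(identity: torch.Tensor, num_vectors: int) -> torch.Tensor:
    """Interpolate from `identity` (first row) to its negative (last row)."""
    # Assumption: the model's "negative identity" is plain elementwise negation.
    negated = -identity
    dimensions = identity.size(-1)
    # Number of leading elements taken from `negated` in each row: 0, ..., dimensions
    counts = torch.linspace(0, dimensions, num_vectors).round().long()
    # mask[i, j] is True where row i should already hold the negated value
    mask = torch.arange(dimensions).unsqueeze(0) < counts.unsqueeze(1)
    return torch.where(mask, negated, identity)


# Usage: the identity of a bipolar model is the all-ones hypervector.
codes = thermometer_by_interpolation(torch.ones(4), num_vectors=5)
```

Because only the identity vector is model-specific here, the same function could in principle serve any model whose negation is expressible elementwise.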

mikeheddes · 2022-11-11T18:43:04Z

torchhd/functional.py

+) -> VSA_Model:
+    """Creates a thermometer code for given dimensionality.
+
+    Implements similarity-preserving hypervectors as described in "Sparse Binary Distributed Encoding of Scalars" <https://doi.org/10.1615/J Automat Inf Scien.v37.i6.20>.


Could you add thermometer_hv and Thermometer to the documentation under docs/torchhd.rst and docs/embeddings.rst? Please double-check, using the Sphinx documentation generation command in the README, whether the link appears correctly; there might be an error in your syntax for the reference in the function description.

Thanks. I had not thought about the documentation at first. I have now updated the corresponding .rst files. There was indeed an error in the link formatting; I fixed that as well, so it works in the generated .html.

denkle · 2022-11-14T09:54:08Z

torchhd/embeddings.py

+            input, self.low_value, self.high_value, self.num_embeddings
+        ).clamp(0, self.num_embeddings - 1)
+
+        return super(Thermometer, self).forward(indices).as_subclass(MAP)


@mikeheddes a question here. I followed the structure used in Level(). However, it looks odd that MAP is used at the end in .as_subclass(). It seems we should instead accept a "model" parameter as input and return the corresponding model.

Yes, you are right. It is currently this way to be compatible with the way we implemented it before v4. But we should allow the user to specify the model of the various embedding classes. I will make a new issue for this.

mikeheddes · 2022-11-14T20:37:17Z

I'm wondering whether it would be better to keep the same API as for the other hypervector creation functions, where you can specify the number of vectors and their dimensions. For 10000 dimensions this will create a very large matrix, while you might only need 6 hypervectors or so.

denkle · 2022-11-19T10:56:20Z

Will work now on revising the code to address this.
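One way such an API might be shaped is sketched below. This is an assumption-laden illustration rather than the revised code from this PR: the name thermometer_sketch, the bipolar output, and the integer step rule are all hypothetical choices. It allocates only num_vectors rows instead of dimensions + 1.

```python
import torch


def thermometer_sketch(num_vectors: int, dimensions: int, device=None) -> torch.Tensor:
    """Return a (num_vectors, dimensions) bipolar thermometer code."""
    if num_vectors < 2:
        raise ValueError("need at least two hypervectors to span the range")
    if num_vectors > dimensions + 1:
        raise ValueError("at most dimensions + 1 distinct thermometer codes exist")
    # Each successive hypervector turns on `step` more leading elements.
    step = dimensions // (num_vectors - 1)
    counts = torch.arange(num_vectors, device=device) * step
    positions = torch.arange(dimensions, device=device)
    # mask[i, j] is True for the first counts[i] positions of row i
    mask = positions.unsqueeze(0) < counts.unsqueeze(1)
    # Map True/False to +1/-1
    return mask.to(torch.float32) * 2 - 1


# Usage: 6 hypervectors of 10000 dimensions need a 6 x 10000 tensor,
# not the 10001 x 10000 matrix of the full code.
hv = thermometer_sketch(6, 10000)
```

With integer division, the last row is fully +1 only when num_vectors - 1 divides dimensions; a real implementation would need to decide how to handle the remainder.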

mikeheddes

@denkle thank you for revising the code, and congrats on merging your first PR!

I removed the VSA model argument from the embedding class so that all the embeddings have a consistent API. I think it's better if we change all the embedding classes to accept a VSA model argument in a separate PR.

denkle · 2022-11-22T15:24:13Z

Thanks, @mikeheddes! Agreed on using a separate PR to change the embeddings' API for all classes. Just let me know whether you would like to start it or whether I should make the first attempt. It might at least be worth checking whether the code for Thermometer was reasonable.

mikeheddes · 2022-11-23T04:34:19Z

@denkle you can start working on that feature if you want. What you had before was in the right direction, I think; it's a similar API to the functional ones. The only thing still missing is making sure that the dtype of the created tensors matches (or is compatible with) the provided VSA_Model.

Add methods to form thermometer codes

2317b1f

mikeheddes reviewed Nov 11, 2022

denkle commented Nov 14, 2022

Add revisions after review

e81fda0

denkle requested a review from mikeheddes November 14, 2022 11:59

mikeheddes added 2 commits November 14, 2022 12:26

Merge branch 'main' into Thermometer

a6c5fac

Formatting and micro optimizations

4ae7a86

denkle and others added 5 commits November 19, 2022 12:59

Modify code to allow flexible number of hypervectors

d07f8d0

Modify code to allow flexible number of hypervectors

37282b2

[github-action] formatting fixes

d4c3591

Modify Thermometer embedding to accept HD/VSA model type

e33b6c6

Remove model option from embeddings

86ad9b1

mikeheddes approved these changes Nov 20, 2022

mikeheddes merged commit 1fd1344 into main Nov 20, 2022

mikeheddes deleted the Thermometer branch November 20, 2022 00:45
