Hi @DanFu09, hope you're well!
I was reading the source code and the config files, and I noticed that `use_positional_encodings` is set to `True` (link). So the M2-BERT model adds absolute positional embeddings (link) to the tokens before feeding them to the Hyena operators.
I checked the original Hyena and HyenaDNA source code, and neither uses any positional embeddings.
My question is: why did you use positional embeddings here? Have you tried training without them? Did removing them hurt performance?
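For context, my understanding of what `use_positional_encodings=True` implies is sketched below. This is not the M2-BERT source, just a minimal NumPy illustration with made-up table sizes: a learned absolute positional table is added elementwise to the token embeddings before the sequence-mixer (Hyena) layers, so the same token at different positions gets different input vectors.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes, standing in for the model's embedding tables.
vocab_size, max_len, d_model = 1000, 512, 64
tok_table = rng.normal(size=(vocab_size, d_model))  # token embedding table
pos_table = rng.normal(size=(max_len, d_model))     # absolute positional table

def embed(input_ids):
    # Token embeddings plus absolute positional embeddings, summed
    # elementwise before any sequence-mixing layers.
    seq_len = len(input_ids)
    return tok_table[input_ids] + pos_table[np.arange(seq_len)]

x = np.array([5, 7, 7, 9])
out = embed(x)
print(out.shape)  # (4, 64)
# The repeated token 7 at positions 1 and 2 gets distinct vectors:
print(np.allclose(out[1], out[2]))  # False
```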