Closed
Description
@theabhirath @darsnack do you think the multi head attention layer is now general enough we can move it to Flux?
Metadata
Metadata
Assignees
Labels
No labels
@theabhirath @darsnack do you think the multi head attention layer is now general enough we can move it to Flux?