Skip to content

Commit cdff714

Browse files
quic-amitrajabukhoy
authored andcommitted
Features upgrade of Embedding model (quic#424)
1. Added `Pooling` support to Embedding Model 2. Added multiple seq_len support for the embedding model using `QEffAutoModel` 4. Added test for pooling and multiple seq_len --------- Signed-off-by: Amit Raj <quic_amitraj@quicinc.com> Signed-off-by: Abukhoyer Shaik <abukhoye@qti.qualcomm.com> Co-authored-by: Abukhoyer Shaik <abukhoye@qti.qualcomm.com>
1 parent 2ec0917 commit cdff714

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

QEfficient/transformers/models/pytorch_transforms.py

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -144,6 +144,7 @@
144144
from QEfficient.customop import CustomRMSNormAIC, GemmaCustomRMSNormAIC
145145
from QEfficient.transformers.embeddings.embedding_utils import POOLING_MAP, PooledModel, validate_user_pooling_function
146146
from QEfficient.transformers.cache_utils import QEffDynamicCache
147+
from QEfficient.transformers.embeddings.embedding_utils import POOLING_MAP, PooledModel, validate_user_pooling_function
147148
from QEfficient.transformers.models.codegen.modeling_codegen import (
148149
QEffCodeGenAttention,
149150
QeffCodeGenBlock,

0 commit comments

Comments
 (0)