Clarify limitations of LanguageModelFeaturizer in docs #10616

mleimeister · 2022-01-03T10:23:03Z

This docs update clarifies which HuggingFace models can be used with LanguageModelFeaturizer. Although the supported architectures are listed in our docs, users often try others and report errors in the forum. Furthermore, not all pretrained weights on the HF model hub for the supported architectures can be used (e.g. because TF weights are missing or a non-standard tokenizer is used), which results in error messages that are hard to interpret.

A related ticket with links to forum issues can be found here.

Proposed changes:

Clarify the limitations of the current implementation of LanguageModelFeaturizer w.r.t. model architectures and weights from HuggingFace
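To make the two failure causes from the description concrete, here is a minimal sketch of a pre-flight check a user could run before pointing LanguageModelFeaturizer at a checkpoint. It is not part of Rasa: `find_problems` is a hypothetical helper, the architecture list is my reading of the documented `model_name` options, and the `tf_model.h5` filename is the conventional name for TensorFlow weights in HuggingFace repos. Tokenizer compatibility is not checked here.

```python
from typing import Iterable, List

# Assumption: architecture names accepted by `model_name`, as listed in the Rasa docs.
SUPPORTED_ARCHITECTURES = {"bert", "gpt", "gpt2", "xlnet", "distilbert", "roberta"}

# Assumption: HuggingFace repos conventionally store TensorFlow weights under this name.
TF_WEIGHTS_FILE = "tf_model.h5"


def find_problems(model_name: str, repo_files: Iterable[str]) -> List[str]:
    """Return human-readable reasons why a checkpoint is likely unusable.

    `repo_files` is the list of filenames in the model repository,
    e.g. copied from the "Files and versions" tab of the model page.
    """
    problems = []
    if model_name not in SUPPORTED_ARCHITECTURES:
        problems.append(f"architecture '{model_name}' is not in the supported list")
    if TF_WEIGHTS_FILE not in set(repo_files):
        problems.append(
            f"no '{TF_WEIGHTS_FILE}' found, so TensorFlow weights appear to be missing"
        )
    return problems


if __name__ == "__main__":
    # A PyTorch-only checkpoint of an unsupported architecture trips both checks.
    for problem in find_problems("albert", ["config.json", "pytorch_model.bin"]):
        print(problem)
```

An empty result only means that these two checks passed; a checkpoint can still fail for other reasons, such as a non-standard tokenizer.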

Status (please check what you already did):

added some tests for the functionality
updated the documentation
updated the changelog (please check changelog for instructions)
reformat files using black (please check Readme for instructions)

github-actions · 2022-01-03T10:31:44Z

🚀 A preview of the docs has been deployed at the following URL: https://10616--rasahq-docs-rasa-v2.netlify.app/docs/rasa

mleimeister · 2022-01-03T10:42:11Z

@koernerfelicia @dakshvar22 Hi, I just wanted to check what you think and whether this would improve the LM featurizer docs, since we've had frequent questions about this on the user forum recently. I'm not sure if I'm missing any requirements for model weights to be usable with our implementation, but the ones listed are those I have seen cause errors.

docs/docs/components.mdx

koaning

+1 for picking this one up @mleimeister! I've added a few comments.

koaning

This looks grand!

Clarify limitations of LanguageModelFeaturizer in docs

5797d92

mleimeister changed the base branch from main to 3.0.x January 3, 2022 10:23

mleimeister requested review from koernerfelicia, koaning and dakshvar22 January 3, 2022 10:39

koaning reviewed Jan 3, 2022

docs/docs/components.mdx (outdated, resolved)

koaning reviewed Jan 3, 2022

docs/docs/components.mdx (outdated, resolved)

koaning reviewed Jan 3, 2022

docs/docs/components.mdx (resolved)

koaning suggested changes Jan 3, 2022

mleimeister added 2 commits January 3, 2022 17:16

More details on checking model requirements. Note on LaBSE model.

aba39a9

Add link to whiteboard video

26c1afc

koaning approved these changes Jan 4, 2022


Add changelog

7af558e

mleimeister marked this pull request as ready for review January 5, 2022 09:26

mleimeister mentioned this pull request Jan 5, 2022

Improve Docs for LanguageModelFeaturizer #10385

Closed

mleimeister added 2 commits January 5, 2022 12:15

retrigger checks

f6d0d6b

Fix typo in changelog

e5a9156

mleimeister enabled auto-merge (squash) January 5, 2022 11:25

mleimeister merged commit fa93af4 into 3.0.x Jan 5, 2022

mleimeister deleted the document-lmfeaturizer-limitations branch January 5, 2022 11:44
