You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Pre-made tokenizers, CNN- and transformer-style encoders + training, …
…BLOSUM-based augmentation, and flatten_swiss, which extracts tabular data for NLP pretraining
Add is_padded(), includes_bos(), includes_eos(), alphabet_size(), pad…
…(), eos(), bos() functions to Tokenizers; add prebuilt Tokenizers, and a bioseq.make_embedding utility to handled padding and torch.nn.Embedding creation.