You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Update the Transformers converter to support more architectures:
Falcon-40B
XLM-RoBERTa
Add the generation option sampling_topp to enable top-p (nucleus) sampling
Save vocabulary files in the JSON format to better support tokens containing newlines or carriage returns
Fixes and improvements
Fix the application of min_length and max_length when using include_prompt_in_result=False and a batch input with variable lengths: the length constraint should only apply to the sequence after the prompt