This version contains several features and bug fixes.
Training
support BertEncoder #116
support torch amp and apex amp #100
Inference
support big models like gpt2-large and bart-large #82
Fixes
fix adam bug when param size < 1024 #98
fix training compiling fail in cuda < 11 #80