integrating_external_data
2020-JMLR-Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer.pdf
2021-ERNIE 3.0- LARGE-SCALE KNOWLEDGE ENHANCED PRE-TRAINING FOR LANGUAGE UNDERSTANDING AND GENERATION.pdf
2021-ICLR-DEBERTA DECODING-ENHANCED BERT WITH DISENTANGLED ATTENTION.pdf
2021-Scaling Language Models- Methods, Analysis & Insights from Training Gopher.pdf
2022-ACL-GLM- General Language Model Pretraining with Autoregressive Blank Infilling.pdf
2022-Gpt-neox-20b- An open-source autoregressive language model.pdf
2022-ICML-GLaM- Efficient Scaling of Language Models with Mixture-of-Experts.pdf
2022-LaMDA- Language Models for Dialog Applications.pdf
2022-OPT- Open Pre-trained Transformer Language Models.pdf
2022-PaLM- Scaling Language Modeling with Pathways.pdf
2022-Training Compute-Optimal Large Language Models.pdf
2022-WeLM- A Well-Read Pre-trained Language Model for Chinese.pdf
2023-BLOOM- A 176B-Parameter Open-Access Multilingual Language Model.pdf
2023-BloombergGPT- A Large Language Model for Finance.pdf
2023-ICLR-GLM-130B- An Open Bilingual Pre-trained Model.pdf
2023-LLaMA- Open and Efficient Foundation Language Models.pdf
2023-Llama 2 Open Foundation and Fine-Tuned Chat Models.pdf
2023-Panda LLM- Training Data and Evaluation for Open-Sourced Chinese Instruction-Following Large Language Models.pdf
2023-Qwen Technical Report.pdf
2024-Chameleon Mixed-Modal Early-Fusion Foundation Models.pdf
2024-Mixtral of Experts.pdf
2023-ChatDoctor- A Medical Chat Model Fine-tuned on LLaMA Model using Medical Domain Knowledge.pdf
2023-OpenAssistant Conversations - Democratizing Large Language Model Alignment.pdf
2023-Video-LLaMA- An Instruction-tuned Audio-Visual Language Model for Video Understanding.pdf
2023-Visual Instruction Tuning.pdf
You can’t perform that action at this time.