🚀🚀 [Large Language Model] Train a 26M-parameter GPT completely from scratch in just 2 hours! 🌏
Updated Feb 19, 2025 - Python
Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).
KAG is a logical form-guided reasoning and retrieval framework based on the OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge bases, and it aims to overcome the shortcomings of the traditional RAG vector similarity calculation model.
Enchanted is an iOS and macOS app for chatting with private, self-hosted language models such as Llama2, Mistral, or Vicuna using Ollama.
OpenCompass is an LLM evaluation platform supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
⚙️🦀 Build portable, modular & lightweight Fullstack Agents
Implementation for MatMul-free LM.
[CCS'24] A dataset consisting of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Building AI agents, atomically
Awesome papers about unifying LLMs and KGs
⚡ Build your chatbot within minutes on your favorite device; offers SOTA compression techniques for LLMs; runs LLMs efficiently on Intel platforms ⚡
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
NestJS Helper + AI Chatbot Development
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
Awesome resources for in-context learning and prompt engineering: mastery of LLMs such as ChatGPT, GPT-3, and FlanT5, with up-to-date and cutting-edge updates.
🚀🚀🚀 A collection of some awesome public YOLO object detection series projects and the related object detection datasets.
DATAGEN: AI-driven multi-agent research assistant automating hypothesis generation, data analysis, and report writing. Now expanding into crypto market intelligence. Learn more: https://datagen.digital/.
This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥
Overview of Japanese LLMs