    Repositories list

    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      5.4k  0  0  0  Updated Nov 3, 2024
    • The Triton Inference Server provides an optimized cloud and edge inferencing solution.
      Python
      BSD 3-Clause "New" or "Revised" License
      1.5k  0  0  0  Updated Nov 2, 2024
    • Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
      Python
      MIT License
      2.6k  0  0  0  Updated Oct 30, 2024