Pinned

  1. Intro_to_ML_Safety Public

    67 19

  2. trojan-dc-2023 Public

    JavaScript 1

Repositories

Showing 10 of 20 repositories
  • cerberus-cluster Public

    HPC cluster code and configurations for running on OCI

    Python 4 UPL-1.0 0 70 0 Updated Feb 15, 2025
  • hle Public

    Humanity's Last Exam

    Python 392 MIT 16 3 0 Updated Feb 14, 2025
  • AISES Public
    CSS 0 1 0 0 Updated Feb 13, 2025
  • emergent-values Public

    Code for "Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs"

    35 MIT 1 0 0 Updated Feb 11, 2025
  • cluster-docs Public
    CSS 0 MIT 2 4 0 Updated Jan 27, 2025
  • safetywashing Public

    Measuring correlations between safety benchmarks and general AI capabilities benchmarks.

    Python 6 MIT 0 0 0 Updated Oct 2, 2024
  • course.mlsafety.org Public
    HTML 3 MIT 0 0 0 Updated Sep 20, 2024
  • forecasting Public

    Forecasting.

    TypeScript 32 11 1 0 Updated Sep 11, 2024
  • HarmBench Public

    HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal

    Jupyter Notebook 527 MIT 72 21 5 Updated Aug 16, 2024
  • tdc2023-starter-kit Public

    This is the starter kit for the Trojan Detection Challenge 2023 (LLM Edition), a NeurIPS 2023 competition.

    Python 84 MIT 28 0 0 Updated May 19, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.