Harry Mayne (HarryMayne)

SV_interpretability Public

Code for the paper "Can sparse autoencoders be used to decompose and interpret steering vectors?"

2
ICU-patient-subgroups Public

Unsupervised Learning Approaches for Identifying ICU Patient Subgroups: Do Results Generalise?

Jupyter Notebook 1
bookmark2email Public

Python 1
SAELensPlus Public

Forked from jbloomAus/SAELens

Training Sparse Autoencoders on Language Models

HTML
TransformerLens Public

Forked from TransformerLensOrg/TransformerLens

A library for mechanistic interpretability of GPT-style language models

Python
representation-engineering Public

Forked from andyzoujm/representation-engineering

Representation Engineering: A Top-Down Approach to AI Transparency

Jupyter Notebook