Skip to content

elenagaz/An-Investigation-of-Depression-on-a-Language-Moderation-Model-Using-Concept-Activation-Vectors

Repository files navigation

An Investigation of Depression on a Language Moderation Model Using Concept Activation Vectors

Getting Started

To get started with this project, follow the steps below:

Prerequisites

You will need an Anaconda environment for a faster setup.

Installation

  1. Download and Unzip Environment

  2. Clone the Repository

    • Clone the project and checkout the main branch
  3. Add Conda Environment

    • Add the conda environment to the project as the interpreter
  4. Install Dependencies

Running the Demo

To run the demo I have created, execute the main method of the specified file. Navigate to the path: my_model_moderation and run the demo

To reproduce the TCAV scores for the depression concept, run the demo and only choose the first 148 examples labeled as OK (image 1) and add them as a slice (image 2), then navigate to the TCAV tab (image 3) and run TCAV with the selected classes (image 4) - this takes a while as there are predictions that must be rerun.

image 1: image

image 2: image

image 3: image

image 4: image

Moreover, for each of the symptoms, each symptom file from the directory must be added as the file path on line and the demo must be rerun.

All results

All results with the corrected p-value of 0.00256 can be found in this directory

However, the results with the initial p-value of 0.05 can be found in this directory

Additional Information regarding the paper

All protocols of data generation for depression and random data can be found here

All files used for the generation of TCAV scores can be found in this directory

Additionally, all files for validation, including the files used as training data can be found in these directories KoalaAI_Text-Moderation-v2-smal and moderation_api_release

Note:

Some edits have been made to ensure compatibility with Windows.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published