Skip to content

Commit

Permalink
GPTQ add Arkiv link, move file location (#1100)
Browse files Browse the repository at this point in the history
## Purpose ##
* Better docstring for GPTQ
* Reduce unnecessary file hierarchy

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
  • Loading branch information
kylesayrs authored Jan 29, 2025
1 parent a76563a commit ba8563c
Show file tree
Hide file tree
Showing 3 changed files with 4 additions and 5 deletions.
6 changes: 4 additions & 2 deletions src/llmcompressor/modifiers/quantization/gptq/base.py
Original file line number Diff line number Diff line change
Expand Up @@ -16,7 +16,7 @@
from llmcompressor.core import State
from llmcompressor.modifiers import Modifier, ModifierFactory
from llmcompressor.modifiers.quantization.calibration import freeze_module_quantization
from llmcompressor.modifiers.quantization.gptq.utils.gptq_quantize import (
from llmcompressor.modifiers.quantization.gptq.gptq_quantize import (
accumulate_hessian,
make_empty_hessian,
quantize_weight,
Expand All @@ -36,7 +36,9 @@

class GPTQModifier(Modifier, HooksMixin):
"""
Modifier for applying the one-shot OBCQ algorithm to a model
Implements the GPTQ algorithm from https://arxiv.org/abs/2210.17323. This modifier
uses activations to calibrate a hessian matrix, which is then used to determine
optimal quantizion values and orderings for the model weights.
| Sample yaml:
| test_stage:
Expand Down

This file was deleted.

0 comments on commit ba8563c

Please sign in to comment.