Skip to content

Commit

Permalink
ignore list (#171)
Browse files Browse the repository at this point in the history
  • Loading branch information
horheynm authored Sep 30, 2024
1 parent f3d9ec2 commit 7351fdb
Showing 1 changed file with 7 additions and 0 deletions.
7 changes: 7 additions & 0 deletions src/compressed_tensors/quantization/quant_config.py
Original file line number Diff line number Diff line change
Expand Up @@ -201,6 +201,13 @@ def from_pretrained(
if len(quant_scheme_to_layers) == 0: # No quantized layers
return None

# kv-cache only, no weight/activation quantization
if (
len(quantization_type_names) == 1
and "attention" in list(quantization_type_names)[0].lower()
):
quantization_type_names.add("Linear")

# clean up ignore list, we can leave out layers types if none of the
# instances are quantized
consolidated_ignore = []
Expand Down

0 comments on commit 7351fdb

Please sign in to comment.