Fix the logic for perplexity evaluation (`Not enough kv_cache capacity to run generation. Please use a larger sequence_length or a shorter prompt`) #1633

dbogunowicz · 2024-03-07T12:58:28Z

No description provided.

bfineran

Looks like this may be out of date or not ready to merge, @dbogunowicz?

bfineran · 2024-03-11T14:39:58Z

src/deepsparse/evaluation/integrations/perplexity.py

@@ -165,6 +166,7 @@ def run_perplexity(
                    return_input_tokens=True,
                )
            else:
+                print(len(pipeline.tokenizer(batch[0]).input_ids))


bfineran · 2024-03-11T14:40:06Z

src/deepsparse/evaluation/integrations/perplexity.py

+                pipeline.sequence_length - pipeline.prompt_sequence_length - 1
+            )
+            # account for potential additional BOS token
+            breakpoint()
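
The hunk above computes a token budget as `pipeline.sequence_length - pipeline.prompt_sequence_length - 1`. As a minimal sketch of that arithmetic only, and not the PR's actual implementation, the snippet below reproduces the expression and truncates a list of token ids to the resulting budget. The helper names `max_input_tokens` and `truncate_to_budget` are hypothetical, and the thread does not state what the trailing `- 1` reserves, so the sketch does not assume a reason for it.

```python
from typing import List, Sequence


def max_input_tokens(sequence_length: int, prompt_sequence_length: int) -> int:
    """Hypothetical helper: same arithmetic as the expression in the hunk."""
    budget = sequence_length - prompt_sequence_length - 1
    if budget <= 0:
        # Assumption: a non-positive budget means the input cannot fit at all.
        raise ValueError(
            f"No room for input tokens: sequence_length={sequence_length}, "
            f"prompt_sequence_length={prompt_sequence_length}"
        )
    return budget


def truncate_to_budget(
    token_ids: Sequence[int], sequence_length: int, prompt_sequence_length: int
) -> List[int]:
    """Hypothetical helper: keep only as many leading tokens as the budget allows."""
    budget = max_input_tokens(sequence_length, prompt_sequence_length)
    return list(token_ids[:budget])


if __name__ == "__main__":
    ids = list(range(600))
    print(len(truncate_to_budget(ids, sequence_length=512, prompt_sequence_length=16)))
```

Truncating before the input reaches the pipeline is one way to avoid a capacity error like the one quoted in the PR title; whether the PR took this approach is not confirmed by the thread.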


dbogunowicz · 2024-04-22T11:29:03Z

Closing, due to inactivity.

dbogunowicz and others added 8 commits February 26, 2024 22:23

initial commit

021fb13

fix tests

b12d326

Merge branch 'main' into feature/damian/seq_len

9216199

Merge branch 'main' into feature/damian/seq_len

0596c7d

Merge branch 'main' into feature/damian/seq_len

83d8ac6

Update src/deepsparse/evaluation/utils.py

ab18e23

quality

5154d62

initial commit

4d58eb6

bfineran suggested changes Mar 11, 2024

dbogunowicz closed this Apr 22, 2024
