Skip to content

Commit

Permalink
Update linear.py
Browse files Browse the repository at this point in the history
Fix bias handling with tgemm
  • Loading branch information
gshtras authored Jun 3, 2024
1 parent f3e1926 commit 34e010c
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions vllm/model_executor/layers/linear.py
Original file line number Diff line number Diff line change
Expand Up @@ -104,6 +104,8 @@ def apply_weights(self,
if bias is not None:
return F.linear(x, weight) + bias
return F.linear(x, weight)
elif bias is not None:
return F.linear(x, weight, bias)
return tgemm.mm(x, weight)


Expand Down

0 comments on commit 34e010c

Please sign in to comment.