Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
uaydonat committed Jul 14, 2024
1 parent 63e06d3 commit f1dc594
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -62,7 +62,7 @@

| Model | Technique | Gen. Token [3] | Batch | End-to-end throughput [1] | Device throughput [2] | Target |
|-----------------------------------------------------------|--------------------|---------------------|-----------------------|------------------------------|------------------------------|-----------------|
| [Falcon7B-decode](./models/demos/t3000/falcon7b) | Data Parallel | 129th | 256 | 4.9 t/s/u - 1247 t/s | 15.5 t/s/u - 3968 t/s | 26 t/s/u |
| [Falcon7B-decode](./models/demos/t3000/falcon7b) | Data Parallel | 129th | 256 | 7.4 t/s/u - 1901 t/s | 15.5 t/s/u - 3968 t/s | 26 t/s/u |
| [LLaMA-2-70B-decode](./models/demos/t3000/llama2_70b) | Tensor Parallel | 129th | 32 | 10.4 t/s/u - 333 t/s | 16.6 t/s/u - 532 t/s | 20 t/s/u |
| [LLaMA-3-70B-decode](./models/demos/t3000/llama3_70b) | Tensor Parallel | 129th | 32 | 10.4 t/s/u - 333 t/s | 15.8 t/s/u - 506 t/s | 20 t/s/u |
| [Falcon40B-decode](./models/demos/t3000/falcon40b) | Tensor Parallel | 129th | 32 | work-in-progress | 10.0 t/s/u - 320 t/s | 36 t/s/u |
Expand Down

0 comments on commit f1dc594

Please sign in to comment.