Skip to content

BurstGPT v1.0

Compare
Choose a tag to compare
@lzzmm lzzmm released this 28 Apr 08:48
· 24 commits to main since this release

As we continue to update and improve our dataset in the future, this release marks the initial version of our dataset. Thank you for your continued support and feedback as we work to improve our dataset.

Main characteristics

  • Duration: 61 consecutive days in 2 consecutive months.
  • Dataset size: 1.4m lines, ~50MB.

Schema

  • Timestamp: request submission time, seconds from 0:00:00 on the first day.
  • Model: called models, including ChatGPT and GPT-4.
  • Request tokens: Request tokens length.
  • Response tokens: Response tokens length.
  • Total tokens: Request tokens length plus response tokens length.
  • Log Type: the way users call the model, in conversation mode or using API, including Conversation log and API log.