Skip to content

v0.1.23

Compare
Choose a tag to compare
@nsthorat nsthorat released this 07 Dec 16:25
· 210 commits to main since this release

High-level

Lilac is now moving towards editing data directly in the tool. The first vehicle for this is Dataset.map.

New blog post on curating data with the new Dataset.map feature:
https://docs.lilacml.com/blog/curate-coding-dataset.html

Documentation on Dataset.map:
https://docs.lilacml.com/datasets/dataset_edit.html

Features

  • Add dataset.map support for limit/filter by @brilee in #933
  • Add support for arbitrary value type v in map<k, v> in parquet by @dsmilkov in #935
  • Add batch size support and collapse transform impl by @brilee in #934

Improvements

  • Improve the UI for repeated values. by @nsthorat in #904
  • Small ergonomic fixes while writing the "code formatting" blog post by @dsmilkov in #909
  • Merge multiple shards of the same task into the same progress bar. by @nsthorat in #910
  • Add threaded task execution. by @nsthorat in #920
  • Fix css style for markdown tables by @dsmilkov in #931
  • Fix tqdm progress bars by separating report_progress from show_progress. by @nsthorat in #929
  • Make parquet the default source by @dsmilkov in #941

Bug fixes

  • Fix keyword search to work with apostrophe ' by @dsmilkov in #907
  • Make sure the results of dataset.map() always returns an iterable. by @nsthorat in #925
  • Remove position= in tqdm. by @nsthorat in #913

Docs

Other

  • Refactor dataset/signal endpoints into separate module by @brilee in #900
  • Add memray dep and instructions by @brilee in #917
  • Add spec for select options by @brilee in #918
  • Simplify helper methods to closer align to API for select options by @brilee in #919
  • Start writing the query options compiler by @brilee in #924

Coming soon

Full Changelog: v0.1.22...v0.1.23