v0.1.23
High-level
Lilac is now moving towards editing data directly in the tool. The first vehicle for this is Dataset.map.
New blog post on curating data with the new Dataset.map feature:
https://docs.lilacml.com/blog/curate-coding-dataset.html
Documentation on Dataset.map:
https://docs.lilacml.com/datasets/dataset_edit.html
Features
- Add dataset.map support for limit/filter by @brilee in #933
- Add support for arbitrary value type v in map<k, v> in parquet by @dsmilkov in #935
- Add batch size support and collapse transform impl by @brilee in #934
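To give a rough idea of what mapping with a limit, a filter, and a batch size means, here is a plain-Python sketch. It does not use Lilac: map_rows and all of its parameters are hypothetical names made up for this illustration, and the order of operations (filter, then limit, then batch) is an assumption. The actual Dataset.map signature is described in the documentation linked above.

```python
from itertools import islice
from typing import Any, Callable, Dict, Iterable, Iterator, List, Optional

Row = Dict[str, Any]


def map_rows(
    rows: Iterable[Row],
    fn: Callable[[List[Row]], List[Any]],
    output_column: str,
    row_filter: Optional[Callable[[Row], bool]] = None,
    limit: Optional[int] = None,
    batch_size: int = 2,
) -> Iterator[Row]:
    """Hypothetical stand-in (NOT Lilac's API): filter, limit, then map in batches."""
    # Keep only the rows that pass the filter (all rows if no filter is given).
    selected = (r for r in rows if row_filter is None or row_filter(r))
    # Cap the number of rows that get mapped; limit=None means no cap.
    selected = islice(selected, limit)
    while True:
        # Pull the next batch and hand the whole batch to the map function.
        batch = list(islice(selected, batch_size))
        if not batch:
            return
        for row, value in zip(batch, fn(batch)):
            # Emit the original row plus the newly computed column.
            yield {**row, output_column: value}


rows = [{"text": "a"}, {"text": "bb"}, {"text": ""}, {"text": "cccc"}]
result = list(
    map_rows(
        rows,
        fn=lambda batch: [len(r["text"]) for r in batch],
        output_column="text_len",
        row_filter=lambda r: r["text"] != "",
        limit=2,
    )
)
print(result)
```

The sketch is a generator, so the caller always gets an iterable back, and the map function receives a list of rows per call rather than one row at a time.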
Improvements
- Improve the UI for repeated values. by @nsthorat in #904
- Small ergonomic fixes while writing the "code formatting" blog post by @dsmilkov in #909
- Merge multiple shards of the same task into the same progress bar. by @nsthorat in #910
- Add threaded task execution. by @nsthorat in #920
- Fix css style for markdown tables by @dsmilkov in #931
- Fix tqdm progress bars by separating report_progress from show_progress. by @nsthorat in #929
- Make parquet the default source by @dsmilkov in #941
Bug fixes
- Fix keyword search to work with apostrophe ' by @dsmilkov in #907
- Make sure the results of dataset.map() always returns an iterable. by @nsthorat in #925
- Remove position= in tqdm. by @nsthorat in #913
Docs
- Add a guide for iterating on dataset by @dsmilkov in #923
- Add blog post for diffing and dataset.map by @dsmilkov in #912
- Redo the docs.lilacml.com landing page by @dsmilkov in #932
- Small tweaks to improve the glaive dataset blog post. by @nsthorat in #938
- Rename the guide to edit a dataset by @dsmilkov in #930
- Revamp welcome/intro pages by @brilee in #908
Other
- Refactor dataset/signal endpoints into separate module by @brilee in #900
- Add memray dep and instructions by @brilee in #917
- Add spec for select options by @brilee in #918
- Simplify helper methods to closer align to API for select options by @brilee in #919
- Start writing the query options compiler by @brilee in #924
Coming soon
- Add server-side RAG python code. by @nsthorat in #911
- Migrate the UI to the server-side python RAG. by @nsthorat in #914
- Improve the RAG UI by @nsthorat in #916
Full Changelog: v0.1.22...v0.1.23