v0.1.20
This release refactors Sycamore’s dependencies to use extras in order to conditionally pull in dependencies for connectors and local inference (e.g. creating vector embeddings). For example, if you want to use the OpenSearch connector, you will need to: pip install sycamore-ai[opensearch]. Or, if you want to run a local vector embedding model, you will need to: pip install sycamore-ai[local-inference]. To do both, you will need to: pip install sycamore-ai[opensearch,local-inference]
Also, this release includes performance and stability improvements.
What's Changed
- Dependencies 1/n: Remove need to restart colab runtime by @bsowell in #728
- Don't require installing neo4j unless it's used. by @eric-anderson in #733
- Handle None cases for element.table = <> by @baitsguy in #735
- Fix materialize + S3 not working. by @eric-anderson in #734
- Fixed neo4j relationship property loading + added support for loading lists and dictionaries as properties by @RitxmSaha in #736
- Handle non-hashable data types in opensearch schema extractor by @baitsguy in #737
- docs: update README.md by @eltociear in #739
- Support concurrent libreoffice executions, fix bug to support s3 source paths in file_format_tools by @baitsguy in #741
- Fix calls to structured outputs so that they can be cached by @RitxmSaha in #738
- fix 'SycamorePartitioner' error message by @HenryL27 in #748
- Fix context test by @eric-anderson in #749
- Enforce the constraint that each cell is only in one spanning cell. by @bsowell in #754
- Add context_params decorator to read args from Context by @baitsguy in #747
- Remove unnecessary tracing code. by @mdwelsh in #752
- Dependencies 2/3: Move connectors to extras. by @bsowell in #740
- Allow any pinecone error on create index by @HenryL27 in #750
- Allow all Exceptions while creating Connector Targets by @karanataryn in #753
- Adding new ETL tutorial by @jonfritz in #751
- Add materialize to the ntsb loader for luna by @eric-anderson in #742
- Add Weaviate notebook by @karanataryn in #757
- Update get_started.rst by @jonfritz in #759
- Update pinecone.md by @jonfritz in #758
- added new document structure + tests by @RitxmSaha in #746
- Dependencies 3/3: Add partitioning extras. by @bsowell in #755
- Dependencies: Remove need to restart colab session for aryn-sdk by @bsowell in #756
- Default llm in transforms by @baitsguy in #760
- Improve materialize by @eric-anderson in #762
- adding neo4j s3 proxy for aura db + split_calls flag for entity and relationship extractor. by @RitxmSaha in #761
- Fix show_pages in Google Colab. by @bsowell in #763
- Jonfritz patch 3 tutorial by @jonfritz in #764
- Fix materialize to work even if it is re-executed on the same documents. by @eric-anderson in #765
- add clear_materialize(path=) by @eric-anderson in #767
- Jonfritz patch 3 consoledocs by @jonfritz in #768
- Update docs with more info on dependencies. by @bsowell in #769
- bump sycamore version to 0.1.20 by @HenryL27 in #770
New Contributors
- @eltociear made their first contribution in #739
Full Changelog: v0.1.19...v0.1.20