Releases: quiltdata/quilt
Package Delete
This release adds a new command to delete a package including all versions and history from the registry.
Fast Builds for Large Packages
Building a package from a directory of input files now skips generating a build file. That speeds up the build process and makes it easier to change the package contents and rebuild.
Push Public Packages
This release includes support for paid plans on quiltdata.com and is recommended for all individual and business-plan users. It adds a shortcut to push packages and make them public in a single command and improves documentation.
Parquet, editable packages, de-duplication
Version 2.5 includes the following:
- Ability to edit packages and build packages from in-memory DataFrames or from source files
- The Quilt store now serializes all structured data to Parquet (instead of HDF5). Parquet opens the door to high-performance querying over Quilt packages with tools like Spark and AWS Athena.
- Preliminary Spark support for Quilt packages through pyspark
- Content-aware file de-duplication: all upload and download fragments are de-duplicated in the registry as well as in the local quilt_packages directory. Data fragments that already exist on the server or on local disk are skipped, saving you time, bandwidth, and storage.
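The de-duplication idea can be illustrated with a toy content-addressed store. This is only a sketch of the general technique, not Quilt's implementation: the `FragmentStore` class, the `push` helper, and the choice of SHA-256 as the content hash are all assumptions made for illustration.

```python
import hashlib


class FragmentStore:
    """Toy content-addressed store (hypothetical, not Quilt's code).

    Fragments are keyed by the SHA-256 digest of their bytes, so identical
    content is stored only once.
    """

    def __init__(self):
        self._objects = {}  # hex digest -> bytes

    def has(self, digest):
        return digest in self._objects

    def put(self, data):
        """Store data unless an identical fragment already exists.

        Returns (digest, was_stored).
        """
        digest = hashlib.sha256(data).hexdigest()
        if digest in self._objects:
            return digest, False  # already present: skip the transfer
        self._objects[digest] = data
        return digest, True


def push(fragments, store):
    """Send only the fragments the store lacks; return bytes transferred."""
    sent = 0
    for data in fragments:
        _, stored = store.put(data)
        if stored:
            sent += len(data)
    return sent


store = FragmentStore()
first = push([b"abc", b"abc", b"de"], store)   # the duplicate b"abc" is skipped
second = push([b"abc", b"xyz"], store)         # only b"xyz" is new
```

Because the key depends only on the bytes, the same check works on the sending and the receiving side, which is what lets both uploads and downloads skip fragments that are already present.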
Compatibility with Pandas 0.20
This release includes bug fixes and changes for compatibility with Pandas 0.20 and Arrow 0.3.
It also includes preliminary support for Parquet (requires pyarrow and format: PARQUET).
Portable Notebooks Post
Bug fixes and code cleanup. The package tree has been improved and simplified. Table and DataFrame datasets can now be imported in Python without materializing them. Building packages is easier and more flexible. This release is synced to the publication of "Data Packages for Fast Reproducible Python Analysis."
Fast package browsing
Minor bug fixes to make browsing packages faster.
Update build.yml format
Supports a new syntax for arbitrary keys on leaf nodes and pass-through options for transform.
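One way to picture leaf nodes that carry pass-through options is a nested dictionary walked recursively. The sketch below is illustrative only: the tree is written as a Python dict rather than parsed from a build.yml, the `iter_leaves` helper is hypothetical, and the `file`, `sep`, and `header` key names are assumptions rather than the documented schema.

```python
# Hypothetical reserved keys; any other key on a leaf is treated as a
# pass-through option for the transform.
RESERVED_KEYS = {"file", "transform"}


def iter_leaves(tree, path=()):
    """Yield (node_path, file, transform, options) for each leaf.

    A node is treated as a leaf if it has a "file" key (an assumption of
    this sketch); every other node is treated as a group of child nodes.
    """
    for name, node in tree.items():
        if "file" in node:
            options = {k: v for k, v in node.items() if k not in RESERVED_KEYS}
            yield path + (name,), node["file"], node.get("transform"), options
        else:
            yield from iter_leaves(node, path + (name,))


# A single tree mixing a raw file and a tabular file.
build_tree = {
    "raw": {"readme": {"file": "README.md"}},
    "tables": {
        "sales": {"file": "sales.tsv", "transform": "tsv", "sep": "\t", "header": 0},
    },
}

leaves = list(iter_leaves(build_tree))
```

In this sketch the extra keys on the `sales` leaf would be handed unchanged to whatever reader the transform selects, so the build format would not need to know about each reader option in advance.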
Simplify build.yml
Simplifies the build-file format to use a single tree that mixes files and DataFrames.
Documentation update
2.2.1-beta Update version to 2.2.1 (#45)