
feat(examples): Support S3 for all StableDiffusionPipeline components #76

Conversation

@Eta0 Eta0 (Contributor) commented Feb 1, 2024

Updated diffusers support in hf_serialization.py

More fixes and improvements for #73.

This change:

  • Adds S3 upload capability for all StableDiffusionPipeline components
    • text_encoder, vae (updated), unet (updated), scheduler (new), tokenizer (new)
    • The scheduler and tokenizer are saved as .zip files containing the directory written by their .save_pretrained() methods
    • Requires nothing from HuggingFace Hub at deserialization time
  • Adds S3 upload capability for transformers tokenizers
  • Adjusts serialization to pass validation checks by using include_non_persistent_buffers=False
  • Merges in the latest changes from main to support using include_non_persistent_buffers=False correctly
  • Avoids re-initializing models from HF for no reason
  • Cleans up misspelled/outdated CLI args and help
  • Adds parameter weight validation for diffusers models
  • Refactors serialize_model substantially
  • Adds logging level command line arguments, and
  • Shifts more outputs to use a logger

(Outdated): I left in the code that generates a test image through diffusers because it is good example code for this repository, showing how to re-assemble the components of an SD model, and it was useful for verifying that these changes work. It could be commented out or deleted later, or gated behind a flag.
(Update): The test image generation code for diffusers is now commented out.
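The zip-based approach described above (archiving the directory written by `.save_pretrained()` so it can be stored as a single S3 object) can be sketched as follows. This is a minimal stdlib-only illustration, not the PR's actual `serialize_pretrained` implementation; the function names `zip_pretrained` and `unzip_pretrained` are hypothetical:

```python
import io
import tempfile
import zipfile
from pathlib import Path

def zip_pretrained(component) -> bytes:
    """Write a component's .save_pretrained() output into an in-memory
    zip archive, suitable for a single S3 upload."""
    buf = io.BytesIO()
    with tempfile.TemporaryDirectory() as tmp:
        component.save_pretrained(tmp)
        with zipfile.ZipFile(buf, "w", zipfile.ZIP_DEFLATED) as zf:
            for path in sorted(Path(tmp).rglob("*")):
                if path.is_file():
                    zf.write(path, path.relative_to(tmp))
    return buf.getvalue()

def unzip_pretrained(data: bytes, cls, dest: str):
    """Extract the archive and rebuild the component via .from_pretrained()."""
    with zipfile.ZipFile(io.BytesIO(data)) as zf:
        zf.extractall(dest)
    return cls.from_pretrained(dest)
```

Because the archive is built in memory, nothing from HuggingFace Hub is needed at deserialization time: the bytes can come straight from S3.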

Eta0 and others added 6 commits January 31, 2024 16:36
fix(serialization): Don't drop parameters with non-persistent buffers
Previously, setting include_non_persistent_buffers=False would only
include persistent buffers, removing both non-persistent buffers and
parameters. Parameters were never supposed to be affected by this
flag, so this fix changes it to remove only non-persistent buffers.
This additionally adds parameter weight validation for diffusers
models, refactors `serialize_model` substantially, adds logging level
command line arguments, and shifts more outputs to use a logger.
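The selection rule behind that buffer fix can be sketched with plain dictionaries standing in for a module's named tensors. This is an illustrative sketch, not the repository's code; `select_weights` is a hypothetical name:

```python
def select_weights(weights, non_persistent_buffers,
                   include_non_persistent_buffers=True):
    """Choose which named tensors of a module to serialize.

    `weights` maps names to tensors (parameters plus buffers), and
    `non_persistent_buffers` is the set of buffer names registered with
    persistent=False. The corrected rule: when non-persistent buffers
    are excluded, drop only those names. The pre-fix bug kept only
    persistent buffers, which dropped parameters as well.
    """
    if include_non_persistent_buffers:
        return dict(weights)
    return {
        name: tensor
        for name, tensor in weights.items()
        if name not in non_persistent_buffers
    }
```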
@Eta0 Eta0 added the enhancement New feature or request label Feb 1, 2024
@Eta0 Eta0 requested review from harubaru and sangstar February 1, 2024 01:46
@Eta0 Eta0 self-assigned this Feb 1, 2024
@sangstar sangstar (Contributor) left a comment

Dramatically improved! LGTM. One or two questions on your thought process.

Comment on lines +319 to +324
serialize_pretrained(
    pipeline.tokenizer, output_prefix, "tokenizer", force=args.force
)
serialize_pretrained(
    pipeline.scheduler, output_prefix, "scheduler", force=args.force
)
So the purpose of serialize_pretrained is specifically to support saving artifacts like SD's scheduler and tokenizer? I assume this is cleaner as they're neither configs nor modules.

@Eta0 Eta0 (Author) replied:
Yes, pretty much. The scheduler can actually be saved as a single JSON file (around 300 to 400 bytes), so it could have its own special code to avoid a zip file and temporary directory, but it seemed unnecessary at this time.
In summary:

  • text_encoder / vae / unet — tensors & config.json
    • We use tensorizer to save these
    • Potentially very, very large, so the optimizations in tensorizer are important
  • scheduler — config.json only
    • Supports .save_pretrained(dir) and .from_pretrained(dir)
    • Around 400 bytes, so downloader optimizations aren't a huge deal
  • tokenizer — entire directory of files
    • Supports .save_pretrained(dir) and .from_pretrained(dir)
    • Around 1.5 MB for SD 1.5, and it compresses well, so zips work nicely
    • For models that use "fast tokenizers," these can be saved as a single tokenizer.json file instead, but not all models support fast tokenizers
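The single-JSON option mentioned for the scheduler (a few hundred bytes, no zip or temporary directory needed) can be sketched like this. `ToyScheduler` is a stand-in for illustration; real diffusers schedulers expose a `.config` mapping and a `.from_config()` classmethod through ConfigMixin, but none of the helper names below are from this repository:

```python
import json

class ToyScheduler:
    """Stand-in for a diffusers scheduler, mimicking the ConfigMixin
    interface: a .config mapping plus a .from_config() classmethod."""
    def __init__(self, num_train_timesteps=1000, beta_start=0.00085):
        self.config = {
            "num_train_timesteps": num_train_timesteps,
            "beta_start": beta_start,
        }

    @classmethod
    def from_config(cls, config):
        return cls(**config)

def scheduler_to_json(scheduler) -> bytes:
    # A few hundred bytes: small enough for a single S3 PUT, no zip needed.
    return json.dumps(scheduler.config).encode("utf-8")

def scheduler_from_json(cls, data: bytes):
    return cls.from_config(json.loads(data.decode("utf-8")))
```

This avoids the temporary directory entirely, at the cost of special-casing one component type; the zip path keeps all non-tensor components uniform.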

@Eta0 Eta0 requested a review from harubaru February 1, 2024 18:38
@sangstar sangstar merged commit 184d7a6 into sangstar/update-serialization-cl-script-for-container Feb 1, 2024
6 of 7 checks passed
@sangstar sangstar deleted the eta/update-hf-serialization branch February 1, 2024 18:44