We propose an enhanced approach to real-world image super-resolution by integrating Quave preprocessing into the pipeline, enabling richer feature embeddings to feed into the time-aware encoder of the StableSR framework.
This work combines the strengths of StableSR, a state-of-the-art super-resolution framework, and QUAVE, a quaternion wavelet-based preprocessing tool, to achieve superior generalization and performance for image analysis tasks.
This work builds on the StableSR model, which leverages diffusion priors for real-world image super-resolution.
- Reference: StableSR Paper
- Code Repository: StableSR GitHub
- Quave Preprocessing: Extracts advanced embeddings to enhance input quality.
- Time-Aware Encoder: Integrates temporal and feature-rich embeddings for improved image fidelity.
- Real-World Applications: Targets arbitrary upscaling with minimal artifacts.
- Input Preprocessing: Quave processes the low-resolution (LR) image, extracting salient sub-band features.
- Diffusion Prior: StableSR’s encoder-decoder generates latent codes.
- Time-Aware Encoding: Combines Quave embeddings with temporal features for enhanced decoding.
The Quaternion Wavelet Network (QUAVE) is a novel framework designed to generalize image representations. It enhances neural model performance by extracting and selecting frequency sub-bands to provide approximation and fine-grained features, offering a more complete input representation for image processing tasks.
- Reference: Quave Paper
- Authors: Luigi Sigillo, Eleonora Grassucci, Aurelio Uncini, Danilo Comminiello
- Code Repository: Quave GitHub
- Pytorch: 1.12.1
- CUDA: 11.7
- Quave
- Other: See
environment.yaml
Run the training pipeline:
python main.py --train --base configs/stableSRNew/v2-finetune_text_T_quave.yaml --gpus 0 --name "SuperRes_Quave" --scale_lr False
Test real-world super-resolution performance:
python scripts/sr_test_quave.py --config configs/stableSRNew/v2-test_quave.yaml --ckpt ./models/quave_stablesr.ckpt --input ./inputs/test_image.png --output ./outputs/
- Enhanced detail retention with Quave embeddings.
- Improved temporal coherence in time-aware sequences.
This project is currently under active development. Some features and components may not be finalized yet. We are also in the process of preparing a research paper that documents our findings and contributions in detail.
Stay tuned for updates!
@article{wang2024exploiting,
author = {Wang, Jianyi and Yue, Zongsheng and Zhou, Shangchen and Chan, Kelvin C.K. and Loy, Chen Change},
title = {Exploiting Diffusion Prior for Real-World Image Super-Resolution},
journal = {International Journal of Computer Vision},
year = {2024}
}
@misc{sigillo2024generalizing,
title={Generalizing Medical Image Representations via Quaternion Wavelet Networks},
author={Luigi Sigillo and Eleonora Grassucci and Aurelio Uncini and Danilo Comminiello},
year={2024},
eprint={2310.10224},
archivePrefix={arXiv},
primaryClass={eess.IV}
}
This project is based on the StableSR framework developed by researchers at Nanyang Technological University and the QUAVE framework developed at Sapienza University. Their combined capabilities offer a powerful approach to image super-resolution.