We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CV

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computer Vision and Pattern Recognition

Title: Burst Super-Resolution with Diffusion Models for Improving Perceptual Quality

Abstract: While burst LR images are useful for improving the SR image quality compared with a single LR image, prior SR networks accepting the burst LR images are trained in a deterministic manner, which is known to produce a blurry SR image. In addition, it is difficult to perfectly align the burst LR images, making the SR image more blurry. Since such blurry images are perceptually degraded, we aim to reconstruct the sharp high-fidelity boundaries. Such high-fidelity images can be reconstructed by diffusion models. However, prior SR methods using the diffusion model are not properly optimized for the burst SR task. Specifically, the reverse process starting from a random sample is not optimized for image enhancement and restoration methods, including burst SR. In our proposed method, on the other hand, burst LR features are used to reconstruct the initial burst SR image that is fed into an intermediate step in the diffusion model. This reverse process from the intermediate step 1) skips diffusion steps for reconstructing the global structure of the image and 2) focuses on steps for refining detailed textures. Our experimental results demonstrate that our method can improve the scores of the perceptual quality metrics. Code: this https URL
Comments: Accepted to IJCNN 2024 (International Joint Conference on Neural Networks)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2403.19428 [cs.CV]
  (or arXiv:2403.19428v3 [cs.CV] for this version)

Submission history

From: Norimichi Ukita [view email]
[v1] Thu, 28 Mar 2024 13:58:05 GMT (6290kb,D)
[v2] Wed, 3 Apr 2024 02:59:24 GMT (6290kb,D)
[v3] Mon, 8 Apr 2024 08:18:33 GMT (6290kb,D)

Link back to: arXiv, form interface, contact.