Graduate student: 鄭子文
Tzu-Wen Cheng
Thesis title: 基於條件擴散模型之炫光去除
FlareDiffusion: Conditional Diffusion Model for Flare Removal
Advisor: 林昌鴻
Chang-Hong Lin
Oral defense committee: 林昌鴻
Chang-Hong Lin
Wei-Mei Chen
Chin-Hsien Wu
Jenq-Shiou Leu
Degree: Master's
Department: College of Electrical Engineering and Computer Science - Department of Electronic Engineering
Department of Electronic and Computer Engineering
Year of publication: 2024
Academic year of graduation: 112 (ROC calendar)
Language: English
Number of pages: 68
Keywords (Chinese): 炫光去除, 影像恢復, 影像處理, 擴散模型, 卷積神經網路 (CNN), 深度學習, 半監督式學習
Keywords (English): Flare Removal, Image Restoration, Image Processing, Diffusion Network, Convolution Neural Network (CNN), Deep Learning, Semi-supervised Learning
In photography, dirt on the lens or imperfections in the lens itself can cause light to scatter or reflect inside the lens, producing unwanted artifacts such as lens flare, glare, and halos that degrade image quality. Pointing a camera directly at a strong light source can produce similar defects, especially at night. The goal of flare removal is therefore to eliminate these artifacts and restore the corrupted regions naturally while preserving image detail.
In this thesis, we present FlareDiffusion, a novel conditional diffusion model designed for the flare removal task. Our approach leverages the strengths of diffusion models and incorporates diverse flare patterns during training to improve the model's generalization. By conditioning the model on the input image and integrating a specially designed loss function, FlareDiffusion effectively removes flares while preserving light sources, ensuring high-quality image restoration.
Quantitative comparisons on the Flare7K test dataset show that our method achieves better results than state-of-the-art methods, demonstrating its effectiveness in the flare removal task. Moreover, our method produces more natural and clearer images in visual comparisons, demonstrating its robustness and generalization to various types of flares.
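The abstract states that the model is conditioned on the input image but does not spell out the mechanism. The following is a minimal sketch of how such conditioning is commonly set up in a DDPM-style training step; it is not the thesis implementation. The linear noise schedule, the channel-wise concatenation, the image size, and all function names (`make_alpha_bar`, `forward_noise`, `build_denoiser_input`) are assumptions made for illustration.

```python
import numpy as np

def make_alpha_bar(num_steps=1000, beta_start=1e-4, beta_end=2e-2):
    """Cumulative product of (1 - beta_t) for a linear schedule (assumed schedule)."""
    betas = np.linspace(beta_start, beta_end, num_steps)
    return np.cumprod(1.0 - betas)

def forward_noise(clean, t, alpha_bar, rng):
    """Sample x_t from q(x_t | x_0) = N(sqrt(a_bar_t) * x_0, (1 - a_bar_t) * I)."""
    noise = rng.standard_normal(clean.shape)
    noisy = np.sqrt(alpha_bar[t]) * clean + np.sqrt(1.0 - alpha_bar[t]) * noise
    return noisy, noise

def build_denoiser_input(noisy_clean, flare_image):
    """Condition the denoiser by stacking channels (assumed conditioning mechanism)."""
    return np.concatenate([noisy_clean, flare_image], axis=0)

rng = np.random.default_rng(0)
alpha_bar = make_alpha_bar()

# Random stand-ins for a flare-free target and its flare-corrupted input, shape (C, H, W).
clean = rng.uniform(-1.0, 1.0, size=(3, 64, 64))
flare = rng.uniform(-1.0, 1.0, size=(3, 64, 64))

t = 500
x_t, eps = forward_noise(clean, t, alpha_bar, rng)
model_input = build_denoiser_input(x_t, flare)  # 6 channels: noisy target + condition
```

In a setup like this, a denoising network would receive `model_input` together with the timestep `t` and be trained to predict `eps`. The loss functions the thesis designs for this purpose are not reproduced here.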

Abstract (Chinese)
ABSTRACT
Acknowledgements
LIST OF CONTENTS
LIST OF FIGURES
LIST OF TABLES
CHAPTER 1 INTRODUCTIONS
  1.1 Motivation
  1.2 Contributions
  1.3 Thesis Organization
CHAPTER 2 RELATED WORKS
  2.1 Flare Removal
  2.2 Diffusion-based Networks for Similar Tasks
CHAPTER 3 PROPOSED METHODS
  3.1 Data Preprocessing
    3.1.1 Light Source Detection
    3.1.2 Light and Flare Synthesis
      Inverse Gamma Correction
      Intensity and Noise Adjustments
      Geometric Transformations
      Brightness and Blur Adjustment
  3.2 Diffusion Network
    3.2.1 Denoising Diffusion Probabilistic Models [12]
    3.2.2 Conditional Denoising Diffusion Models
    3.2.3 Model Architecture
      Denoising U-Net
      Time Embedding Block
      Residual Block
      Attention Block
      Downsampling Block
      Upsampling Block
  3.3 Loss Functions
    3.3.1 Reconstruction Loss
    3.3.2 Flare Loss
  3.4 Sampling Method
CHAPTER 4 EXPERIMENTAL RESULTS
  4.1 Training Details
  4.2 Flare7K Dataset [2]
  4.3 Evaluation Metrics
    4.3.1 Structural Similarity Index (SSIM) [41]
    4.3.2 Peak Signal-to-Noise Ratio (PSNR) [42]
    4.3.3 Learned Perceptual Image Patch Similarity (LPIPS) [43]
  4.4 Comparisons with State-of-the-art Methods
    4.4.1 Quantitative Comparisons
    4.4.2 Qualitative Comparisons
  4.5 Ablation Studies
    4.5.1 Comparison of Flare Synthesis Methods
    4.5.2 Impact of Loss Functions
CHAPTER 5 CONCLUSIONS AND FUTURE WORKS
  5.1 Conclusions
  5.2 Future Works
REFERENCES

