Awesome-CVPR2023-Low-Level-Vision
A collection of papers and code from CVPR 2023 related to low-level vision
Related collections for low-level vision
- Awesome-CVPR2022-Low-Level-Vision
- Awesome-ECCV2022-Low-Level-Vision
- Awesome-AAAI2022-Low-Level-Vision
- Awesome-NeurIPS2021-Low-Level-Vision
- Awesome-ICCV2021-Low-Level-Vision
- Awesome-CVPR2021/CVPR2020-Low-Level-Vision
- Awesome-ECCV2020-Low-Level-Vision
Catalogue
Image Restoration
Efficient and Explicit Modelling of Image Hierarchies for Image Restoration
- Paper: https://arxiv.org/abs/2303.00748
- Code: https://github.com/ofsoundof/GRL-Image-Restoration
- Tags: Transformer
Learning Distortion Invariant Representation for Image Restoration from A Causality Perspective
Generative Diffusion Prior for Unified Image Restoration and Enhancement
DR2: Diffusion-based Robust Degradation Remover for Blind Face Restoration
Bitstream-Corrupted JPEG Images are Restorable: Two-stage Compensation and Alignment Framework for Image Restoration
Contrastive Semi-supervised Learning for Underwater Image Restoration via Reliable Bank
- Paper: https://arxiv.org/abs/2303.09101
- Code: https://github.com/Huang-ShiRui/Semi-UIR
- Tags: Underwater Image Restoration
Nighttime Smartphone Reflective Flare Removal Using Optical Center Symmetry Prior
- Paper: https://arxiv.org/abs/2303.15046
- Code: https://github.com/ykdai/BracketFlare
- Tags: Reflective Flare Removal
Generating Aligned Pseudo-Supervision from Non-Aligned Data for Image Restoration in Under-Display Camera
GamutMLP - A Lightweight MLP for Color Loss Recovery
- Paper: https://arxiv.org/abs/2304.11743
- Code: https://github.com/hminle/gamut-mlp
- Tags: restore wide-gamut color values
ABCD: Arbitrary Bitwise Coefficient for De-quantization
- Paper: https://ipl.dgist.ac.kr/ABCD_cvpr23.pdf
- Code: https://github.com/WooKyoungHan/ABCD
- Tags: De-quantization/Bit depth expansion
Image Reconstruction
Raw Image Reconstruction with Learned Compact Metadata
High-resolution image reconstruction with latent diffusion models from human brain activity
- Paper: https://www.biorxiv.org/content/10.1101/2022.11.18.517004v2
- Code: https://github.com/yu-takagi/StableDiffusionReconstruction
Burst Restoration
Burstormer: Burst Image Restoration and Enhancement Transformer
Gated Multi-Resolution Transfer Network for Burst Restoration and Enhancement
Video Restoration
Blind Video Deflickering by Neural Filtering with a Flawed Atlas
- Paper: https://arxiv.org/abs/2303.08120
- Code: https://github.com/ChenyangLEI/All-In-One-Deflicker
- Tags: Deflickering
Super Resolution
Image Super Resolution
Activating More Pixels in Image Super-Resolution Transformer
- Paper: https://arxiv.org/abs/2205.04437
- Code: https://github.com/XPixelGroup/HAT
- Tags: Transformer
N-Gram in Swin Transformers for Efficient Lightweight Image Super-Resolution
Omni Aggregation Networks for Lightweight Image Super-Resolution
OPE-SR: Orthogonal Position Encoding for Designing a Parameter-free Upsampling Module in Arbitrary-scale Image Super-Resolution
Local Implicit Normalizing Flow for Arbitrary-Scale Image Super-Resolution
Super-Resolution Neural Operator
- Paper: https://arxiv.org/abs/2303.02584
- Code: https://github.com/2y7c3/Super-Resolution-Neural-Operator
Human Guided Ground-truth Generation for Realistic Image Super-resolution
Better "CMOS" Produces Clearer Images: Learning Space-Variant Blur Estimation for Blind Image Super-Resolution
- Paper: https://arxiv.org/abs/2304.03542
- Tags: Blind
Implicit Diffusion Models for Continuous Super-Resolution
Zero-Shot Dual-Lens Super-Resolution
- Paper:
- Code: https://github.com/XrKang/ZeDuSR
CABM: Content-Aware Bit Mapping for Single Image Super-Resolution Network with Large Input
Learning Generative Structure Prior for Blind Text Image Super-resolution
- Paper: https://arxiv.org/abs/2303.14726
- Code: https://github.com/csxmli2016/MARCONet
- Tags: Text SR
Guided Depth Super-Resolution by Deep Anisotropic Diffusion
- Paper: https://arxiv.org/abs/2211.11592
- Code: https://github.com/prs-eth/Diffusion-Super-Resolution
- Tags: Guided Depth SR
Quantum Annealing for Single Image Super-Resolution
- Paper: https://arxiv.org/abs/2304.08924
- Tags: [Workshop]
Video Super Resolution
Towards High-Quality and Efficient Video Super-Resolution via Spatial-Temporal Data Overfitting
Structured Sparsity Learning for Efficient Video Super-Resolution
Image Rescaling
HyperThumbnail: Real-time 6K Image Rescaling with Rate-distortion Optimization
Denoising
Image Denoising
Masked Image Training for Generalizable Deep Image Denoising
Spatially Adaptive Self-Supervised Learning for Real-World Image Denoising
- Paper: https://arxiv.org/abs/2303.14934
- Code: https://github.com/nagejacob/SpatiallyAdaptiveSSID
- Tags: Self-Supervised
LG-BPN: Local and Global Blind-Patch Network for Self-Supervised Real-World Denoising
- Paper: https://arxiv.org/abs/2304.00534
- Code: https://github.com/Wang-XIaoDingdd/LGBPN
- Tags: Self-Supervised
Real-time Controllable Denoising for Image and Video
Deblurring
Image Deblurring
Structured Kernel Estimation for Photon-Limited Deconvolution
- Paper: https://arxiv.org/abs/2303.03472
- Code: https://github.com/sanghviyashiitb/structured-kernel-cvpr23
Blur Interpolation Transformer for Real-World Motion from Blur
Neumann Network with Recursive Kernels for Single Image Defocus Deblurring
- Paper:
- Code: https://github.com/csZcWu/NRKNet
Efficient Frequency Domain-based Transformers for High-Quality Image Deblurring
Deraining
Learning A Sparse Transformer Network for Effective Image Deraining
Dehazing
RIDCP: Revitalizing Real Image Dehazing via High-Quality Codebook Priors
Curricular Contrastive Regularization for Physics-aware Single Image Dehazing
Video Dehazing via a Multi-Range Temporal Alignment Network with Physical Prior
SCANet: Self-Paced Semi-Curricular Attention Network for Non-Homogeneous Image Dehazing
- Paper: https://arxiv.org/abs/2304.08444
- Code: https://github.com/gy65896/SCANet
- Tags: [Workshop]
HDR Imaging / Multi-Exposure Image Fusion
Learning a Practical SDR-to-HDRTV Up-conversion using New Dataset and Degradation Models
SMAE: Few-shot Learning for HDR Deghosting with Saturation-Aware Masked Autoencoders
A Unified HDR Imaging Method with Pixel and Patch Level
Inverting the Imaging Process by Learning an Implicit Camera Model
- Paper: https://arxiv.org/abs/2304.12748
- Code: https://github.com/xhuangcv/neucam
- Tags: generating all-in-focus photos & HDR imaging
Frame Interpolation
Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation
A Unified Pyramid Recurrent Network for Video Frame Interpolation
BiFormer: Learning Bilateral Motion Estimation via Bilateral Transformer for 4K Video Frame Interpolation
AMT: All-Pairs Multi-Field Transforms for Efficient Frame Interpolation
Event-based Video Frame Interpolation with Cross-Modal Asymmetric Bidirectional Motion Fields
- Paper:
- Code: https://github.com/intelpro/CBMNet
- Tags: Event-based
Event-based Blurry Frame Interpolation under Blind Exposure
- Paper:
- Code: https://github.com/WarranWeng/EBFI-BE
- Tags: Event-based
Joint Video Multi-Frame Interpolation and Deblurring under Unknown Exposure Time
- Paper: https://arxiv.org/abs/2303.15043
- Code: https://github.com/shangwei5/VIDUE
- Tags: Frame Interpolation and Deblurring
Image Enhancement
Low-Light Image Enhancement
Learning Semantic-Aware Knowledge Guidance for Low-Light Image Enhancement
- Paper: https://arxiv.org/abs/2304.07039
- Code: https://github.com/langmanbusi/Semantic-Aware-Low-Light-Image-Enhancement
Visibility Constrained Wide-band Illumination Spectrum Design for Seeing-in-the-Dark
- Paper: https://arxiv.org/abs/2303.11642
- Code: https://github.com/MyNiuuu/VCSD
- Tags: NIR2RGB
Image Matting
Referring Image Matting
Adaptive Human Matting for Dynamic Videos
Shadow Removal
ShadowDiffusion: When Degradation Prior Meets Diffusion Model for Shadow Removal
Image Compression
Backdoor Attacks Against Deep Image Compression via Adaptive Frequency Trigger
Context-based Trit-Plane Coding for Progressive Image Compression
Learned Image Compression with Mixed Transformer-CNN Architectures
Video Compression
Neural Video Compression with Diverse Contexts
Image Quality Assessment
Quality-aware Pre-trained Models for Blind Image Quality Assessment
Blind Image Quality Assessment via Vision-Language Correspondence: A Multitask Learning Perspective
Towards Artistic Image Aesthetics Assessment: a Large-scale Dataset and a New Method
Re-IQA: Unsupervised Learning for Image Quality Assessment in the Wild
Style Transfer
Fix the Noise: Disentangling Source Feature for Controllable Domain Translation
Neural Preset for Color Style Transfer
CAP-VSTNet: Content Affinity Preserved Versatile Style Transfer
StyleGAN Salon: Multi-View Latent Optimization for Pose-Invariant Hairstyle Transfer
- Paper: https://arxiv.org/abs/2304.02744
- Project: https://stylegan-salon.github.io/
Modernizing Old Photos Using Multiple References via Photorealistic Style Transfer
- Paper: https://arxiv.org/abs/2304.04461
- Project: https://kaist-viclab.github.io/old-photo-modernization/
QuantArt: Quantizing Image Style Transfer Towards High Visual Fidelity
Master: Meta Style Transformer for Controllable Zero-Shot and Few-Shot Artistic Style Transfer
Image Editing
Imagic: Text-Based Real Image Editing with Diffusion Models
SINE: SINgle Image Editing with Text-to-Image Diffusion Models
CoralStyleCLIP: Co-optimized Region and Layer Selection for Image Editing
DeltaEdit: Exploring Text-free Training for Text-Driven Image Manipulation
SIEDOB: Semantic Image Editing by Disentangling Object and Background
DiffusionRig: Learning Personalized Priors for Facial Appearance Editing
Image Generation/Synthesis / Image-to-Image Translation
Text-to-Image / Text Guided / Multi-Modal
Multi-Concept Customization of Text-to-Image Diffusion
GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis
Scaling up GANs for Text-to-Image Synthesis
MAGVLT: Masked Generative Vision-and-Language Transformer
Freestyle Layout-to-Image Synthesis
Variational Distribution Learning for Unsupervised Text-to-Image Generation
Sound to Visual Scene Generation by Audio-to-Visual Latent Alignment
- Paper: https://arxiv.org/abs/2303.17490
- Project: https://sound2scene.github.io/
Toward Verifiable and Reproducible Human Evaluation for Text-to-Image Generation
Shifted Diffusion for Text-to-image Generation
Collaborative Diffusion for Multi-Modal Face Generation and Editing
Image-to-Image / Image Guided
LANIT: Language-Driven Image-to-Image Translation for Unlabeled Data
Person Image Synthesis via Denoising Diffusion Model
Picture that Sketch: Photorealistic Image Generation from Abstract Sketches
Fine-Grained Face Swapping via Regional GAN Inversion
Masked and Adaptive Transformer for Exemplar Based Image Translation
Zero-shot Generative Model Adaptation via Image-specific Prompt Learning
- Paper: https://arxiv.org/abs/2304.03119
- Code: https://github.com/Picsart-AI-Research/IPL-Zero-Shot-Generative-Model-Adaptation
Others for image generation
AdaptiveMix: Robust Feature Representation via Shrinking Feature Space
MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis
Regularized Vector Quantization for Tokenized Image Synthesis
Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization
Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image Generation
Exploring Incompatible Knowledge Transfer in Few-shot Image Generation
- Paper:
- Code: https://github.com/yunqing-me/RICK
Post-training Quantization on Diffusion Models
LayoutDiffusion: Controllable Diffusion Model for Layout-to-image Generation
DiffCollage: Parallel Generation of Large Content with Diffusion Models
Few-shot Semantic Image Synthesis with Class Affinity Transfer
NoisyTwins: Class-Consistent and Diverse Image Generation through StyleGANs
DCFace: Synthetic Face Generation with Dual Condition Diffusion Model
Video Generation
Conditional Image-to-Video Generation with Latent Flow Diffusion Models
Video Probabilistic Diffusion Models in Projected Latent Space
DPE: Disentanglement of Pose and Expression for General Video Portrait Editing
Decomposed Diffusion Models for High-Quality Video Generation
Diffusion Video Autoencoders: Toward Temporally Consistent Face Video Editing via Disentangled Video Encoding
- Paper: https://arxiv.org/abs/2212.02802
- Code: https://github.com/man805/Diffusion-Video-Autoencoders
MoStGAN: Video Generation with Temporal Motion Styles
Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models
Others
Perspective Fields for Single Image Camera Calibration
DC2: Dual-Camera Defocus Control by Learning to Refocus
- Paper: https://arxiv.org/abs/2304.03285
- Project: https://defocus-control.github.io/
Images Speak in Images: A Generalist Painter for In-Context Visual Learning
Make-A-Story: Visual Memory Conditioned Consistent Story Generation
Cross-GAN Auditing: Unsupervised Identification of Attribute Level Similarities and Differences between Pretrained Generative Models
LightPainter: Interactive Portrait Relighting with Freehand Scribble
- Paper: https://arxiv.org/abs/2303.12950
- Tags: Portrait Relighting
Neural Texture Synthesis with Guided Correspondence
- Paper:
- Code: https://github.com/EliotChenKJ/Guided-Correspondence-Loss
- Tags: Texture Synthesis
Uncurated Image-Text Datasets: Shedding Light on Demographic Bias
Large-capacity and Flexible Video Steganography via Invertible Neural Network
- Paper: https://arxiv.org/abs/2304.12300
- Code: https://github.com/MC-E/LF-VSN
- Tags: Steganography
Talking Head Generation
Seeing What You Said: Talking Face Generation Guided by a Lip Reading Expert
High-Fidelity and Freely Controllable Talking Head Video Generation
MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation
Virtual Try-on
GP-VTON: Towards General Purpose Virtual Try-on via Collaborative Local-Flow Global-Parsing Learning
Handwriting/Font Generation
CF-Font: Content Fusion for Few-shot Font Generation
- Paper: https://arxiv.org/abs/2303.14017
- Code: https://github.com/wangchi95/CF-Font
- Tags: Font Generation
DeepVecFont-v2: Exploiting Transformers to Synthesize Vector Fonts with Higher Quality
Handwritten Text Generation from Visual Archetypes
- Paper: https://arxiv.org/abs/2303.15269
- Tags: Handwriting Generation
Disentangling Writer and Character Styles for Handwriting Generation
- Paper: https://arxiv.org/abs/2303.14736
- Code: https://github.com/dailenson/SDT
- Tags: Handwriting Generation
Layout Generation
Unifying Layout Generation with a Decoupled Diffusion Model
Unsupervised Domain Adaption with Pixel-level Discriminator for Image-aware Layout Generation
PosterLayout: A New Benchmark and Approach for Content-aware Visual-Textual Presentation Layout
- Paper: https://arxiv.org/abs/2303.15937
- Code: https://github.com/PKU-ICST-MIPL/PosterLayout-CVPR2023
LayoutDM: Discrete Diffusion Model for Controllable Layout Generation