wrong.wang

分享一些我曾阅读过的论文，笔记比较简略

2024/10/27 OmniGen -- Unified Image Generation
2024/01/27 Scaling Up to Excellence - Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild
2024/01/06 StyleDrop - Text-to-Image Generation in Any Style
2023/12/09 Paragraph-to-Image Generation with Information-Enriched Diffusion Model
2023/11/04 De-Diffusion Makes Text a Strong Cross-Modal Interface
2023/10/20 Improving Image Generation with Better Captions
2023/10/10 Kosmos-G - Generating Images in Context with Multimodal Large Language Models
2023/10/09 DiffBlender - Scalable and Composable Multimodal Text-to-Image Diffusion Models
2023/10/03 PixArt-alpha - Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
2023/09/28 Emu - Enhancing Image Generation Models Using Photogenic Needles in a Haystack
2023/08/02 Common Diffusion Noise Schedules and Sample Steps are Flawed
2023/02/09 BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models