分享一些我曾阅读过的论文,笔记比较简略
- 2024/10/27 OmniGen -- Unified Image Generation
- 2024/01/27 Scaling Up to Excellence - Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild
- 2024/01/06 StyleDrop - Text-to-Image Generation in Any Style
- 2023/12/09 Paragraph-to-Image Generation with Information-Enriched Diffusion Model
- 2023/11/04 De-Diffusion Makes Text a Strong Cross-Modal Interface
- 2023/10/20 Improving Image Generation with Better Captions
- 2023/10/10 Kosmos-G - Generating Images in Context with Multimodal Large Language Models
- 2023/10/09 DiffBlender - Scalable and Composable Multimodal Text-to-Image Diffusion Models
- 2023/10/03 PixArt-alpha - Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
- 2023/09/28 Emu - Enhancing Image Generation Models Using Photogenic Needles in a Haystack
- 2023/08/02 Common Diffusion Noise Schedules and Sample Steps are Flawed
- 2023/02/09 BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models