1. Rombach, R., Blattmann, A., Lorenz, D., Esser, P. & Ommer, B. High-resolution image synthesis with latent diffusion models. in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 10684–10695 (2022).
2. Midjourney Inc. Midjourney (2022).
3. Betker, J. et al. Improving image generation with better captions. Computer Science. https://cdn.openai.com/papers/dall-e-3.pdf 2, p8 (2023).
4. Yang, Z. et al. The dawn of LMMs: Preliminary explorations with GPT-4V(ision). arXiv preprint (2023).
5. Zhang, L., Rao, A. & Agrawala, M. Adding conditional control to text-to-image diffusion models. in Proceedings of the IEEE/CVF International Conference on Computer Vision. 3836–3847 (2023).