Anima is a 2 billion parameter text-to-image AI model created via a collaboration between CircleStone Labs and Comfy Org, built on the NVIDIA Cosmos architecture. It is focused mainly on anime concepts, characters, and styles, and was trained on several million anime images alongside roughly 800k non-anime artistic images.
Notably, Anima features extremely good prompt adherence and artist style adherence. The model is trained on a mix of standard Danbooru-style tags and natural language captions, allowing users to combine both methods seamlessly. It is intentionally designed for illustrations and artistic images, and does not excel at realism or lengthy text rendering.
