Great stories often rely on storytellers. Often, a storyteller narrows the meaning to a single expression, which constrains perspectives and ultimately creativity. This project aims to widen that lens through AI (a Stable Diffusion model) by creating multiple interpretations / visualizations from a single text input. These will then be compiled smoothly into a GIF.
I’ve been working on this a bit; here are some preliminary results.
- Prompt-engineer the story/text for the model.
- Create an interpolation module to traverse and generate across multiple perspectives in the model’s latent space.
- Create a function to smooth transitions between two consecutive frames.
- Compile it into a GIF / video.
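
To make the interpolation and smoothing steps above more concrete, here is a minimal sketch. It assumes spherical linear interpolation (slerp) for moving between two latent vectors and a simple linear cross-fade for in-between frames. Neither choice is fixed by this proposal, and the names `slerp`, `latent_path`, and `cross_fade` are hypothetical. It is written in plain Python so it runs without dependencies; in practice the same arithmetic would be applied to the model's latent tensors and decoded images.

```python
import math


def slerp(v0, v1, t):
    """Spherical linear interpolation between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(v0, v1))
    norm0 = math.sqrt(sum(a * a for a in v0))
    norm1 = math.sqrt(sum(b * b for b in v1))
    # Cosine of the angle between the vectors, clamped against rounding error.
    cos_omega = max(-1.0, min(1.0, dot / (norm0 * norm1)))
    omega = math.acos(cos_omega)
    sin_omega = math.sin(omega)
    if sin_omega < 1e-6:
        # Nearly parallel (or anti-parallel) vectors: fall back to linear interpolation.
        return [(1.0 - t) * a + t * b for a, b in zip(v0, v1)]
    w0 = math.sin((1.0 - t) * omega) / sin_omega
    w1 = math.sin(t * omega) / sin_omega
    return [w0 * a + w1 * b for a, b in zip(v0, v1)]


def latent_path(v0, v1, steps):
    """Return `steps` vectors from v0 to v1 inclusive (assumes steps >= 2)."""
    return [slerp(v0, v1, i / (steps - 1)) for i in range(steps)]


def cross_fade(frame_a, frame_b, n_between):
    """Blend two frames (flat lists of pixel values) into n_between intermediate frames."""
    blended = []
    for k in range(1, n_between + 1):
        alpha = k / (n_between + 1)  # blend weight of frame_b, strictly between 0 and 1
        blended.append([(1.0 - alpha) * a + alpha * b for a, b in zip(frame_a, frame_b)])
    return blended


if __name__ == "__main__":
    # Toy 2-D "latents" standing in for the model's much larger latent tensors.
    for point in latent_path([1.0, 0.0], [0.0, 1.0], 5):
        print([round(x, 3) for x in point])
    # Two toy "frames" with two pixel values each.
    print(cross_fade([0.0, 0.0], [4.0, 8.0], 3))
```

Each vector returned by `latent_path` would be decoded by the model into one frame, and `cross_fade` would fill the gaps between consecutive decoded frames before everything is written out as a GIF or video with an imaging library.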
Creating an open-source Colab notebook walkthrough for generating perspectives with Stable Diffusion.
Amit Singh (metamyth)
R&D, Paperplane Technology
Researcher, Active Inference Lab
Discord: metamyth #8558
USDC Wallet Address: 0x0159af752e0220ed3eef439bef36f982cc0a6fbf