I used to spend days rotoscoping people in videos. Generative infill for background painting and automatic rotoscoping have saved probably a year of my life at this point. Image generation relies on CLIP, which needs a language model for conditioning.
I used to spend days rotoscoping people in videos. Generative infill for background painting and automatic rotoscoping have saved probably a year of my life at this point. Image generation relies on CLIP, which needs a language model for conditioning.