Looks like too much work to recreate easily.
this post was submitted on 17 Nov 2023
1 points (100.0% liked)
Machine Learning
1 readers
1 users here now
Community Rules:
- Be nice. No offensive behavior, insults or attacks: we encourage a diverse community in which members feel safe and have a voice.
- Make your post clear and comprehensive: posts that lack insight or effort will be removed. (ex: questions which are easily googled)
- Beginner or career related questions go elsewhere. This community is focused in discussion of research and new projects that advance the state-of-the-art.
- Limit self-promotion. Comments and posts should be first and foremost about topics of interest to ML observers and practitioners. Limited self-promotion is tolerated, but the sub is not here as merely a source for free advertisement. Such posts will be removed at the discretion of the mods.
founded 1 year ago
MODERATORS
I was just talking to a friend yesterday about how AI images won't take off unless tweaks can be done using natural language. If the paper's claims are true, this is going to be revolutionary.
We'll have a finer definition on what an edit is.
Currently from flipping the image vertically, to swapping out sub regions, to truly semantic edits like "make the person stand up". They're all lumped together and called "edits".
Something like different tiers of autonomous driving will be needed. Tier1 edits, all the way to tier5.
The proposed method is like tier2. Capable of swapping out sub regions via style transfer, but cannot meaningfully change the structure of the scene, ie "make the man stand up".