Blended Diffusion for Text-driven Editing of Natural Images
We introduce a solution for performing local (region-based) edits in generic natural images, based on a natural language description along with an ROI mask.
I am a Computer Science PhD student at the Hebrew University of Jerusalem, under the supervision of Prof. Dani Lischinski and Dr. Ohad Fried.
My research interests include machine learning, computer vision, and generative models. More specifically, I am interested in developing new tools for content synthesis and editing.
We introduce a solution for performing local (region-based) edits in generic natural images, based on a natural language description along with an ROI mask.
We tackle the problem of model merging, given two constraints that often come up in the real world: (1) no access to the original training data, and (2) without increasing the size of the neural network.