Omri Avrahami

I am a Computer Science Ph.D. student at the School of Computer Science and Engineering at the Hebrew University of Jerusalem, under the joint supervision of Prof. Dani Lischinski and Prof. Ohad Fried.

In addition, I am a Researcher at Runway AI working on generative video models. Previously, I had the privilege of working as a Researcher at Pika Labs, Snap Research, NVIDIA Research, Google Research, Facebook AI Research (FAIR), and Lightricks.

My research interests include machine learning, computer vision, and generative models. More specifically, I am interested in developing new tools for content synthesis and editing --- known popularly as Generative AI.

Email Github Scholar Twitter LinkedIn YouTube

Research Projects

Aleph: A new way to edit, transform and generate video

Runway, 2025

Runway Aleph is a state-of-the-art in-context video model, setting a new frontier for multi-task visual generation, with the ability to perform a wide range of edits on an input video such as adding, removing, and transforming objects, generating any angle of a scene, and modifying style and lighting, among many other tasks.

Project Page

Academic Publications

Story2Board: A Training-Free Approach for Expressive Storyboard Generation

arXiv, 2025

David Dinkevich, Matan Levy, Omri Avrahami, Dvir Samuel, Dani Lischinski

A training-free storyboard generation that balances identity consistency with cinematic layout diversity.

Project Page arXiv Code

Stable Flow: Vital Layers for Training-Free Image Editing

CVPR, 2025

Omri Avrahami, Or Patashnik, Ohad Fried, Egor Nemchinov, Kfir Aberman, Dani Lischinski, Daniel Cohen-Or

A training-free editing method that is able to perform various types of image editing operations, including non-rigid editing, object addition, object removal, and global scene editing.

Project Page arXiv Code

Click2Mask: Local Editing with Dynamic Mask Generation

AAAI, 2025

Omer Regev, Omri Avrahami, Dani Lischinski

An image editing method that that given a click and a prompt, infers the desired area to edit.

Project Page arXiv Code

DiffUHaul: A Training-Free Method for Object Dragging in Images

SIGGRAPH Asia, 2024

Omri Avrahami, Rinon Gal, Gal Chechik, Ohad Fried, Dani Lischinski, Arash Vahdat*, Weili Nie*

Given an image with an object, our method can seamlessly relocate it within the scene.

Project Page arXiv

PALP: Prompt Aligned Personalization of Text-to-Image Models

SIGGRAPH Asia, 2024

Moab Arar, Andrey Voynov, Amir Hertz, Omri Avrahami, Shlomi Fruchter, Yael Pritch, Daniel Cohen-Or, Ariel Shamir

Prompt aligned personalization allow rich and complex scene generation, including all elements of a condition prompt.

Project Page arXiv

The Chosen One: Consistent Characters in Text-to-Image Diffusion Models

SIGGRAPH, 2024

Omri Avrahami, Amir Hertz, Yael Vinker, Moab Arar, Shlomi Fruchter, Ohad Fried, Daniel Cohen-Or, Dani Lischinski

Given a text prompt describing a character, our method distills a representation that enables consistent depiction of the same character in novel contexts.

Project Page arXiv Code

Break-A-Scene: Extracting Multiple Concepts from a Single Image

SIGGRAPH Asia, 2023

Omri Avrahami, Kfir Aberman, Ohad Fried, Daniel Cohen-Or, Dani Lischinski

Given a single image with multiple concepts, annotated by loose segmentation masks, our method can learn a distinct token for each concept, and use natural language guidance to re-synthesize the individual concepts or combinations of them in various contexts.

Project Page arXiv Code

Blended-NeRF: Zero-Shot Object Generation and Blending in Existing Neural Radiance Fields

ICCVW, 2023

Ori Gordon, Omri Avrahami, Dani Lischinski

Given a NeRF scene, our pipeline trains a NeRF generator model guided by a similarity loss defined by a language-image model such as CLIP, to synthesize a new object inside a user-specified ROI.

Project Page arXiv Code

SpaText: Spatio-Textual Representation for Controllable Image Generation

CVPR, 2023

Omri Avrahami, Thomas Hayes, Oran Gafni, Sonal Gupta, Yaniv Taigman, Devi Parikh, Dani Lischinski, Ohad Fried, Xi Yin

We suggest a new method for text-to-image generation using open-vocabulary scene control.

Project Page arXiv

Blended Latent Diffusion

SIGGRAPH, 2023

Omri Avrahami, Ohad Fried, Dani Lischinski

We present an accelerated solution to the task of local text-driven editing of generic images, where the desired edits are confined to a user-provided mask.

Project Page arXiv Code

Blended Diffusion for Text-driven Editing of Natural Images

CVPR, 2022

Omri Avrahami, Dani Lischinski, Ohad Fried

We introduce a solution for performing local (region-based) edits in generic natural images, based on a natural language description along with an ROI mask.

Project Page arXiv Code

GAN Cocktail: mixing GANs without dataset access

ECCV, 2022

Omri Avrahami, Dani Lischinski, Ohad Fried

We tackle the problem of model merging, given two constraints that often come up in the real world: (1) no access to the original training data, and (2) without increasing the size of the neural network.

Project Page arXiv Code