Google announces Lumiere, an AI model capable of generating realistic videos

Researchers from Google, the Weizmann Institute of Science, and Tel Aviv University have published a paper announcing Lumiere, a “space-time diffusion model” capable of generating realistic and stylized short videos and editing them on command.

In the paper, the researchers say that this model takes a different approach from existing ones (Pika, for example) by synthesizing videos that portray realistic, diverse, and consistent motion – a challenge they describe as “fundamental” in video generation. For now, the paper detailing the technology is freely available, but no model has been released to test.

To use Lumiere, users provide text input describing what they want in natural language. The model then generates a video that matches the prompt. Users can also upload a static image and add a prompt to turn it into a dynamic video.
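Since no model or public API has been released, the snippet below is purely a hypothetical sketch of those two input modes; LumiereModel, Video, and both method names are invented for illustration and deliberately left unimplemented.

```python
# Hypothetical sketch only: Lumiere has no public API, so every name
# here is invented to illustrate the two input modes described above.
from dataclasses import dataclass

@dataclass
class Video:
    frames: int = 80   # per the paper: 80 frames...
    fps: int = 16      # ...at 16 fps, i.e. a five-second clip

class LumiereModel:
    def generate(self, prompt: str) -> Video:
        """Text-to-video: synthesize a clip from a natural-language prompt."""
        raise NotImplementedError("no public Lumiere model exists to call")

    def image_to_video(self, image_path: str, prompt: str) -> Video:
        """Image-to-video: animate a static image, guided by a prompt."""
        raise NotImplementedError("no public Lumiere model exists to call")

model = LumiereModel()
# model.generate("a bear cub splashing in a mountain stream")
# model.image_to_video("bear.png", "the cub shakes water off its fur")
```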

The model also supports additional features, including inpainting, which edits specific objects in a video following text instructions; cinemagraphs, which add movement to a specific part of an otherwise static scene; and stylized generation, which uses a reference image’s style to guide the creation of the video.
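The paper does not describe a public editing interface either, so the runnable sketch below only illustrates the kind of binary mask a cinemagraph feature implies: marking which region of the frame should move while the rest stays frozen. The coordinates and region are invented.

```python
# A cinemagraph animates only a masked region of an otherwise static
# frame. This sketch just builds such a mask; the actual conditioning
# mechanism used by Lumiere is not public.
import numpy as np

height, width = 128, 128               # base-model resolution from the paper
mask = np.zeros((height, width), dtype=np.uint8)

# Hypothetical: animate a 40x40 patch (say, a waterfall) and freeze the rest.
mask[60:100, 40:80] = 1

print(f"animated pixels: {mask.sum()} of {mask.size}")
```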

While these features are not new to the industry, Lumiere uses an architecture called “Space-Time U-Net” to generate the entire temporal duration of a video in a single pass, leading to more realistic and consistent motion. According to the researchers, this differs from existing video models, which first synthesize distant keyframes and then apply temporal super-resolution (TSR) models to generate the missing intermediate frames.
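As a rough illustration of that idea, the toy PyTorch module below downsamples a clip in height, width, and time inside a single encoder-decoder, so the whole duration is processed in one pass rather than keyframe by keyframe. It is a minimal sketch of the principle, not Lumiere's actual architecture.

```python
# Toy sketch of the space-time idea: 3D convolutions shrink the clip in
# height, width, AND time, so the network reasons over the full duration
# at a coarse scale. This is an illustration, not Lumiere's architecture.
import torch
import torch.nn as nn

class ToySpaceTimeUNet(nn.Module):
    def __init__(self, ch: int = 16):
        super().__init__()
        # Encoder: each Conv3d halves time, height, and width.
        self.down = nn.Sequential(
            nn.Conv3d(3, ch, kernel_size=3, stride=2, padding=1),
            nn.SiLU(),
            nn.Conv3d(ch, ch * 2, kernel_size=3, stride=2, padding=1),
            nn.SiLU(),
        )
        # Decoder: transposed convolutions restore the full space-time volume.
        self.up = nn.Sequential(
            nn.ConvTranspose3d(ch * 2, ch, kernel_size=4, stride=2, padding=1),
            nn.SiLU(),
            nn.ConvTranspose3d(ch, 3, kernel_size=4, stride=2, padding=1),
        )

    def forward(self, clip: torch.Tensor) -> torch.Tensor:
        # clip: (batch, channels, time, height, width); the entire video is
        # processed at once instead of keyframes plus temporal
        # super-resolution.
        return self.up(self.down(clip))

net = ToySpaceTimeUNet()
clip = torch.randn(1, 3, 16, 64, 64)  # 16 frames of 64x64 RGB noise
print(net(clip).shape)                # torch.Size([1, 3, 16, 64, 64])
```

A real diffusion U-Net would also take a noise-level embedding and text conditioning; those are omitted here to keep the space-time downsampling point visible.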

Lumiere’s video model was trained on a dataset of 30 million videos, along with their text captions, and generates 80 frames at 16 fps. The base model works at a low 128×128 resolution; in the paper, the researchers say the full pipeline produces 1024×1024 pixel videos that are five seconds long. The source of the training data, however, remains unclear at this early stage.
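Those figures are consistent with the quoted clip length: 80 frames at 16 fps is exactly five seconds, as the quick check below confirms.

```python
# Sanity check of the reported numbers: 80 frames at 16 fps = 5 seconds.
frames, fps = 80, 16
print(f"{frames} frames / {fps} fps = {frames / fps:g} s")  # -> 5 s
```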

Despite these limitations, the researchers conducted a user study and report that Lumiere’s results were preferred over those of existing AI video synthesis models. Lumiere also cannot yet generate videos that consist of multiple shots or involve transitions between scenes, an open challenge left for future research.
