Tag: I see 3.1

Dustin Tran, a core member of Gemini and a senior researcher at Google DeepMind, joins xAI

Google Gemini imports presentation generation function, which can automatically create Google Slides slides through prompt words or upload files

Google recently introduced a useful new feature for its free interactive workspace "Canvas," built into its AI assistant Gemini. This new feature allows users to automatically generate presentation slides directly using AI, helping students and office workers quickly create presentation content. Launched in March of this year, Canvas was initially designed to allow users to easily paste longer text or code into Gemini for editing, debugging, or suggestions, while also visualizing project prompts for apps, web designs, and more. The addition of presentation generation further expands Canvas's application scenarios. Users can trigger presentation generation in two ways: • By directly entering a prompt: For example, entering "Create a presentation about a trip to the UK." • By uploading source files: If the presentation is based on specific content, users can upload documents, spreadsheets, or research papers, and then request Gemini to generate the presentation based on those files. Automatic Background and Image Application, Export to Google Slides Editing: Gemini-generated presentations not only include text content but also automatically apply background themes and add relevant images. Even more conveniently, after initial generation, users can directly export the entire presentation from the Gemini app to Google Slides. After exporting, users can then further edit, adjust formatting, add content, or share and collaborate with team members in Google Slides. Rollout to Personal and Workspace Accounts Starting Today: Google stated that this presentation generation feature will be rolled out to personal Google accounts and Google Workspace (enterprise/education) accounts starting today. More Gemini App Updates: In addition to updating the Canvas functionality, Google has added the following updates to the new version of the Gemini app: • Veo 3.1 Integration: Now you can create videos with more realistic textures, easier camera control, and including dialogue and sound effects through Gemini. • ...

Google launches Veo 3.1 image generation model, enhancing image-to-video conversion capabilities

Google launches Veo 3.1 image generation model, enhancing image-to-video conversion capabilities

Google announced an update to its AI video generation model, Veo, to version 3.1, boasting improved performance in following prompts and converting images into video. Veo 3.1 is currently available for trial via Google's Gemini API and is integrated into Google's Flow video editing tool. Veo 3.1's technical upgrades build upon Veo 3, unveiled at this year's Google I/O conference. According to Google, the new model performs better in following prompts, more easily creating videos from user-uploaded image "materials" combined with text prompts. Furthermore, Veo 3.1 adds the ability to simultaneously convert images into video and generate audio, a feature not present in Veo 3. Enhanced Flow editor functionality: In the Flow video editor, Veo 3.1 supports a new "Scene-to-Video" feature, allowing users more precise control over the generated video. Users can upload start and end frames, and the AI ​​will automatically generate the intermediate video content. While Adobe's Firefly offers similar functionality, Flow's unique feature is its ability to generate audio simultaneously. This audio generation capability also applies to the editor's video extension and object insertion functions. Regarding the current state of the technology and its application prospects, based on the samples shared by Google, videos generated through Veo 3.1 still have a slightly unrealistic feel, and the effect varies greatly depending on the prompts and themes. Although it may not yet be as realistic as OpenAI's Sora 2, Google is trying to make Veo more practical for professionals actually working on videos, rather than just a source of social media spam. With the rapid development of AI video generation technology, competition among tech giants in this field is becoming increasingly fierce, and Google, through continuous updates to the Veo model, demonstrates its determination to remain competitive in the creative tools market.

Welcome back!

Login to your account below

Retrieve your password

Hãy nhập tên người dùng hoặc địa chỉ email để mở mật khẩu