Google and OpenAI compete: Veo 2, Imagen 3, and free ChatGPT Search

Welcome, Growth Pioneers! 🚀

Google introduces Veo 2 (creating realistic 4K video, integrating with YouTube Shorts in 2025) and Imagen 3 (upgrading images, surpassing Midjourney). OpenAI opens free ChatGPT Search functionality, improves voice support, and mobile experience.

DiffSensei transforms text into Manga comics, automating character creation, background layout, and dialogue, enabling seamless and personalized storytelling.This Email Newsletter will cover:

  • Google and OpenAI compete: Veo 2, Imagen 3, and free ChatGPT Search

  • ChatGPT Search is now free for everyone

  • DiffSensei: Turning Text into Manga!

VEO 2
GOOGLE AND OPENAI COMPETE: VEO 2, IMAGEN 3, AND FREE CHATGPT SEARCH

Source: Veo 2

 

Google just announced the release of Veo 2, an advanced video generation model that produces high-resolution outputs with incredible realism and detail — along with Imagen 3, an upgraded image model that also delivers cutting-edge quality.

Veo 2:

  • Veo 2 can create 8-second clips at 4K resolution (720p at launch), and it has received significant upgrades in cinematic control quality.

  • The model also shows major improvements in physics simulation and hallucination reduction, leading to more realistic movement and detail.

  • Veo 2 has outperformed all competitors in head-to-head human evaluations and prompt adherence, including the recently released Sora by OpenAI.

  • The model is being gradually rolled out through the VideoFX waitlist, with plans to integrate with YouTube Shorts in 2025.

Imagen 3:

  • The upgraded model offers enhanced vibrancy and composition across art styles, with better handling of details, textures, and fine text rendering.

  • New capabilities include more accurate prompt interpretation and better rendering of complex scenes in line with user intent.

  • Imagen 3 has surpassed all models, including Midjourney, Flux, and Ideogram, in human evaluations of preference, image quality, and prompt adherence.

  • The model is currently available through Google Labs’ ImageFX and is being rolled out in over 100 countries.

Google is having a tremendously big end to 2024 — first Gemini 2.0 and now Veo 2 and Imagen 3. These models seem to raise the bar in both categories, giving Google cutting-edge performance in almost every area of AI. OpenAI may have the hype this holiday season, but Google is showing results.

CHATGPT SEARCH
CHATGPT SEARCH IS NOW FREE FOR EVERYONE

Source: ChatGPT

 

OpenAI just announced a significant expansion of its ChatGPT Search feature, making it free for all users, along with voice search capabilities and improved mobile features.

Previously available only to paid users, the search feature is now extended to all logged-in users, with faster response speeds and accessible through the globe icon on the platform.

Search has also been added to Enhanced Voice Mode for paid users, allowing them to conduct searches through natural speech commands.

The mobile Search experience has been improved, with enhanced visual layouts for local businesses and built-in integration with Google and Apple Maps.

Users can also set ChatGPT Search as their default search engine, with results displaying relevant links before ChatGPT’s text responses for quicker access.

ChatGPT’s web accessibility and updated information are a significant step towards a proactive future, especially with the Advanced Voice Mode—transforming this tool into a smarter, more powerful version of Siri (and potentially powering it later). Search is about to change in a big way in the age of AI.

DIFFSENSEI
DIFFSENSEI: TURNING TEXT INTO MANGA!

Source: Diffsensei

 

DiffSensei helps create characters and expressions based on descriptions and automatically arranges dialogue boxes and backgrounds to tell a seamless and engaging story.

Highlights:

  • Customize character appearance, status, and actions from text content.

  • Flexible layout: expressions, dialogue box positions, backgrounds.

  • Uses MangaZero data with 48 series, 43,264 pages, 427,147 frames with detailed annotations.

📊 Superior Performance:

  • Automatic evaluation: FID, CLIP, DINO metrics confirm high image quality, closely adhering to the content.

  • User research: High scores for character consistency, image-content matching, and story quality.

  • Qualitative comparison: Expressive frames, logical layout, appropriate dialogue.

💡 Potential Applications:

  • Automatic comic and animation creation.

  • Intelligent education.

  • Marketing and personalized content.

PROMPT OF THE DAY
Creating Christmas greeting cards for the company to send to partners with AI📝

“Create an elegant and festive digital greeting card for a company to send to its partners, celebrating Christmas. The card should feature a modern design with a professional yet warm tone. Include elements like a Christmas tree, ornaments, and snowflakes, along with a message that says ‘Merry Christmas & Happy New Year!’ in elegant typography. Add the company’s logo at the bottom corner and ensure the colors are in harmony, such as gold, green, and red, to evoke the holiday spirit. The layout should have space for the company name and a personal touch to make partners feel appreciated.”

TAF

Thank you for listening!

See you next time.

The AI Growth Team 😄 😄 ❤️

​The AI First

Rate this post

Để lại một bình luận

Email của bạn sẽ không được hiển thị công khai. Các trường bắt buộc được đánh dấu *

Để lại một bình luận

Email của bạn sẽ không được hiển thị công khai. Các trường bắt buộc được đánh dấu *