The World This Week in AI (16th May 2024)

The World This Week in AI (16th May 2024)

google text-to-video, veo

Share This Article

Now, You Can Use ChatGPT 4 For Free, and It’s Improving In How It Communicates.✨

OpenAI just dropped the mic with GPT-4o (“o” stands for “Omni”) and it’s available for free with a limit on messages. So, what’s the scoop on its latest improvement?

  • It can understand vision and voice in 232 milliseconds, with an average response time of 320 milliseconds, which is similar to human response time in a conversation.
ChatGPT-4o (Omni) Human Interaction
  • Provide instant voice translations
  • Uploads files or images for help with summarization, writing, or analysis.
  • Explore and leverage GPTs and the GPT Store.
  • Analyze data (opens in a new window) and generate charts.
  • Chat about photos you take

Amazing Text-to-Video AI Model by Google, Veo

Source: Google DeepMind

Google released a competitor to OpenAI’s Sora, called Veo. It can generate videos from text and even make adjustments to the results. Veo is only available in some countries and is still on a waitlist. Some capabilities of Veo include:

  • Generate text-to-video over 60 seconds in 1080p resolution.

Example:

Prompts:

A fast-tracking shot through a bustling dystopian sprawl with bright neon signs, flying cars and mist, night, lens flare, volumetric lighting.

A fast-tracking shot through a futuristic dystopian sprawl with bright neon signs, starships in the sky, night, volumetric lighting.

A neon hologram of a car driving at top speed, speed of light, cinematic, incredible details, volumetric lighting.

The cars leave the tunnel, back into the real world city Hong Kong

The result:

  • Adjust the results, such as adding objects.
  • Consistency across video frames

Generating AI Images Is Easier Now, Thanks to ImageFX.

Google released ImageFX, powered by Imagen 2, Google DeepMind’s latest text-to-image model that delivers the highest-quality images. 

Google knows that when creating a prompt to generate an image, users try many prompts to get the image they want. Therefore, in ImageFX, Google provides prompt suggestions to help users get the image they want faster.

Source: Google DeepMind

Google’s Improved Text-to-Music AI Model, MusicFX

Source: Google DeepMind

People can now create tunes up to 70 seconds long and music loops, explore prompts with expressive chips, and download or share their creations with friends.

Source

Top executives of OpenAI, Jan Leike (Co-Leader) and Ilya Sutskever (Chief Scientist), Have Left OpenAI.

OpenAI faces a leadership challenge as two of its leaders, Jan Leike and Ilya Sutskever, have left the company.

Jan Leike made significant contributions in leading the team to align AI with human interests. In September, he was named one of Time 100’s most influential people in AI.

Sutskever, the co-founder and chief scientist of OpenAI, announced that he has quit and is excited for his next journey.

Get Free AI E-Book

Do you want to learn more about AI? Get this FREE AI E-Book to guide you in using AI effectively. Download it here.

Signup to our Newsletter

icon

Check out These Related Blog Posts

icon

Check out These Related Blog Posts

Ready For G

Ready For Your Giant Leap?

Enhance your marketing's effectiveness, cost-efficiency, and results with AI now! Contact us today and thank us later!

Ready For G

Ready For Your Giant Leap?

Enhance your marketing's effectiveness, cost-efficiency, and results with AI now! Contact us today and thank us later!

This website uses cookies to improve your experience. We'll assume you're ok with this, but you can opt-out if you wish. Read More

Decline