Google Gemini can now listen to audio files

Technology is developing at a rapid pace, and AI models can now interpret several types of media. Google just announced that its new AI model, Gemini 1.5 Pro, understands audio. The news comes soon after Google announced Gemini in Android Studio.

For AI models to learn, they have to be fed a ton of data. At first, they were trained mostly on text, which matters most for chatbots. Over time, they gained the ability to process image data as well. Several chatbots now let you upload your own images to be reworked or analyzed.

Gemini 1.5 Pro can understand audio files

When Google first introduced Gemini to the public, the company said that it would eventually be able to interpret multiple forms of media, such as images, audio, and video. It’s been able to interpret images for a while, and the company has just checked off another one. Gemini 1.5 Pro is the company’s newest AI model, and it’s currently in testing. What’s neat about this model is that it’s even more powerful than Gemini Ultra. So, the company is outdoing itself.

This latest update gives it the ability to analyze and process audio files. So, if you want a summary of a long keynote, conversation, earnings call, etc., you will be able to upload the audio directly to Gemini. There are already tools that can summarize conversations (some are even available on smartphones), but this implementation is different. Current tools transcribe the speech into text and then summarize the conversation based on that text. Gemini 1.5 Pro cuts out the middleman and listens to the audio directly. This could improve accuracy, since nothing can get lost in a separate transcription step.
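To make the difference concrete, here is a rough Python sketch of the two approaches. It is only an illustration: `transcribe`, `text_model`, and `audio_model` are hypothetical stand-ins passed in by the caller, not functions from Google’s SDKs, and the prompts are made up.

```python
from typing import Callable

# All of these types and names are hypothetical placeholders,
# not part of any Google SDK.
Transcriber = Callable[[bytes], str]      # raw audio -> transcript text
TextModel = Callable[[str], str]          # text prompt -> summary
AudioModel = Callable[[str, bytes], str]  # prompt + raw audio -> summary


def summarize_via_transcript(
    audio: bytes, transcribe: Transcriber, text_model: TextModel
) -> str:
    """Two-step approach: speech-to-text first, then summarize the text."""
    transcript = transcribe(audio)
    return text_model("Summarize this conversation:\n" + transcript)


def summarize_direct(audio: bytes, audio_model: AudioModel) -> str:
    """One-step approach: the model receives the audio itself."""
    return audio_model("Summarize this recording.", audio)
```

In the first function, the summarizer only ever sees whatever the transcriber wrote down, so any transcription mistake carries into the summary. In the second, there is no intermediate text at all.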

If you want to use this functionality, there is some disappointing news. For now, it’s only available through Google’s development platform, Vertex AI, and through AI Studio. So, if you’re waiting for a public release, you will just have to be patient.
