After ChatGPT took the world by storm late last year, it was refreshing to see last week’s announcement of the imminent arrival of the chatbot’s next iteration received relatively modest coverage.
German publication Heise quoted senior members of Microsoft’s in-country leadership last week as saying GPT-4 would arrive this week.
Microsoft Germany CTO Andreas Braun said: “We will introduce GPT-4 next week, there we will have multimodal models that will offer completely different possibilities – for example videos,” Braun said. ChatGPT (or GPT-3.5) launched in December 2022, while GPT-3 launched in 2020.
Both Braun and Microsoft Germany CEO Marianne Janik stressed that while generative AI was a game changer, it wouldn’t be replacing human jobs. The big takeaway from the GPT-4 announcement was the fact that it will be “multimodal”, meaning it will be able to generate text, audio, images and videos.
This may sound like a big deal, but in reality it’s simply a consolidation of pre-existing AI technologies — including OpenAi’s own DALL-e image generator. In fact, GPT-4 sounds like it will tread on the toes of a range of third-party software, including MidJourney, ElevenLabels and D-ID, which together can create an AI animated avatar with voice overs.





