I had a conversation with the AI twin.

I had a conversation with the AI twin.

Synthesia has announced a new feature called Personal Avatar. This is a digital version of yourself that can be created within minutes and looks so much like you that it can fool your friends and perhaps even your mother if you are not paying close attention.

Generative AI video startups have long had the option of using digital avatars of varying degrees of realism within PowerPoint-style presentation software. The avatar acts like a virtual presenter reading a script alongside the text and graphics on the slides.

Personalized avatars could also be created, but unless one took the time to visit Synthesia's office and record a digital version of oneself, they were flat and not particularly expressive. With this latest update, however, personalization has been taken to a whole new level, making it much easier to create realistic twin avatars.

To create a digital version of yourself, sit in front of a webcam, read 60 seconds of text, and wait a day for the avatar to be generated. It is important to note that this is only available with a paid Synthesia subscription, and you can only access your virtual likeness through your Synthesia account.

With the rise of video-based social media platforms such as TikTok and Instagram Reels, it has become increasingly important to express oneself visually.

Not everyone can shine in front of a camera or even has the desire to do so. Now you can type and the digital version of the words you type can be read by you - no need to speak.

But the primary use is in an office environment where it breathes new life into PowerPoint presentations.

Teachers can easily use it to create custom lessons for different children, and it can be used to deliver presentations and TikTok in multiple languages, even if you personally only speak English.

The most exciting part of personal avatar technology is the amazingly accurate voice cloning, which can capture a mixture of inflection, emotion, and style. Animation and lip-sync are then added to photos taken with a webcam during setup.

It does not create a "true" avatar that can be moved to different environments, dressed differently, or changed on a whim, but what is clever is the voice and seamless looping.

Personal avatars use "auto-alignment" for loop playback. This is a form of AI that understands the difference between moments of silence and moments of speech. It can then make appropriate adjustments to the body movements along the script to make them more natural.

When I sat in front of the webcam and recorded a few minutes of fairly rambling text, I expected a slightly awkward, slightly off kilter version of myself. I had seen the demo, but I didn't think it could match that realism. I was wrong.

It took about a day for the system to finish training the model and create my avatar. Then, with just a few lines of text, I was ready to be used within the Synthesia system. I could literally put words in my mouth and it took less than 10 minutes to create the video.

If you look closely or watch carefully on a large monitor, it is obvious that the avatar is not real. The way they speak is very impressive, but they tend to look in one direction, their movements are a bit off compared to what would be expected naturally, and their eyes have been corrected. I have had amblyopia since childhood, a condition in which my left eye and my brain do not communicate particularly well, so I have minimal vision in that eye and always look to the left. In my avatar, both eyes look in the expected direction.

It was shocking to see this slightly corrected version of myself. Even though I am used to using AI every day, it was a reminder of how far technology has come and how accurately it can replicate humans. I suggest you check my guide for signs that it may be a deepfake.

According to Synthesia, personal avatars are available for Starter, Creator, and Enterprise plans. The free plan offers built-in avatars, but the first plan, which is customizable, costs $22/month, about the same price as ChatGPT and Claude.

"At Synthesia, we believe that AI avatars will revolutionize the way we interact in the digital space," a spokesperson explained, and how users can utilize their personal avatars to "express themselves, connect with others, and push the limits of digital communication," He added that he would like to see how users "express themselves, connect with others, and push the limits of digital communication.

Admittedly, a very accurate recreation of oneself can be a bit creepy, but I think Synthesia took the right approach. Locking into an authenticated user account does not mean that anyone can put words in your virtual mouth.

I suspect the next logical evolution of this is not a digital twin that can be used anywhere, but a digital twin once created. There are other companies building technology in this area, such as Captions, which offers an app-based digital twin that allows you to change the background and add customization, but nothing comes close to the realism of Synthesia Personal Avatars.

Captions' approach is more flexible and mobile-friendly, but Synthesia captures my voice even more accurately than my ElevenLabs clone, which fooled my kids.

.

Categories