Jun 212025
 

Here is an AI project that I could build right now, probably in a matter of hours, not days.

I am not going to do it, because it would be a waste of time, as it is simply a proof-of-concept, nothing more. A concept that I wish would remain unproven but it won’t, not for long.

The project is a Web app. Very simple. An app that has permission to use your camera, and it starts by taking a snapshot of you every second. The app shows an exercise video and you are instructed to follow suit. Better yet, it shows a real-time, AI-generated avatar doing exercise.

Combining twelve webcam images into a collage to show a time series, the app then sends the resulting image, through the RESTful API of OpenAI, to GPT4.1, utilizing its ability to analyze images with human-level comprehension. The image will be accompanied by a simple question: “Does this person appear to be engaged in vigorous exercise? If the answer is yes, respond with the word ‘yes’. If the answer is no, assume the role of a drill instructor in charge of unruly civilians (think recruits or prisoners), scold the person and order him to do better. The person’s name is 6079 Smith W, and he is a member of a squad that you monitor. Phrase your answer accordingly.”

The prompt may need to be tweaked a little, to make sure that the AI’s response remains consistent. And then, a bit of post-processing: If the AI response is not ‘yes’, perhaps after a bit of post-processing and elementary sanity checks, I send its crafted response to another API that offers a real-time speaking avatar. Heygen, maybe? I’d have to do a bit of research as to which API works best. Or maybe I’d just use a static image and a text-to-speech service like Amazon’s Polly.

Either way, the result will speak for itself, when your computer screams are you in a shrill female voice:

Smith! 6079 Smith W.! Yes, YOU! Bend lower, please! You can do better than that. You’re not trying. Lower, please! THAT’S better, comrade. Now stand at ease, the whole squad, and watch me.

Yes, this technology is here, today. A tad over four decades late, I guess, but welcome to the future, comrades.

 Posted by at 1:58 pm