{"id":13538,"date":"2025-06-21T13:58:57","date_gmt":"2025-06-21T17:58:57","guid":{"rendered":"https:\/\/spinor.info\/weblog\/?p=13538"},"modified":"2025-06-21T13:58:57","modified_gmt":"2025-06-21T17:58:57","slug":"an-ai-thought-experiment","status":"publish","type":"post","link":"https:\/\/spinor.info\/weblog\/?p=13538","title":{"rendered":"An AI thought experiment"},"content":{"rendered":"<p>Here is an AI project that I could build right now, probably in a matter of hours, not days.<\/p>\n<p>I am not going to do it, because it would be a waste of time, as it is simply a proof-of-concept, nothing more. A concept that I wish would remain unproven but it won&#8217;t, not for long.<\/p>\n<p>The project is a Web app. Very simple. An app that has permission to use your camera, and it starts by taking a snapshot of you every second. The app shows an exercise video and you are instructed to follow suit. Better yet, it shows a real-time, AI-generated avatar doing exercise.<\/p>\n<p>Combining twelve webcam images into a collage to show a time series, the app then sends the resulting image, through the RESTful API of OpenAI, to GPT4.1, utilizing its ability to analyze images with human-level comprehension. The image will be accompanied by a simple question: &#8220;Does this person appear to be engaged in vigorous exercise? If the answer is yes, respond with the word &#8216;yes&#8217;. If the answer is no, assume the role of a drill instructor in charge of unruly civilians (think recruits or prisoners), scold the person and order him to do better. The person&#8217;s name is 6079 Smith W, and he is a member of a squad that you monitor. Phrase your answer accordingly.&#8221;<\/p>\n<p>The prompt may need to be tweaked a little, to make sure that the AI&#8217;s response remains consistent. And then, a bit of post-processing: If the AI response is not &#8216;yes&#8217;, perhaps after a bit of post-processing and elementary sanity checks, I send its crafted response to another API that offers a real-time speaking avatar. Heygen, maybe? I&#8217;d have to do a bit of research as to which API works best. Or maybe I&#8217;d just use a static image and a text-to-speech service like Amazon&#8217;s Polly.<\/p>\n<p>Either way, the result will speak for itself, when your computer screams are you in a shrill female voice:<\/p>\n<blockquote><p><strong>Smith! 6079 Smith W.! Yes, YOU! Bend lower, please! You can do better than that. You&#8217;re not trying. Lower, please! THAT&#8217;S better, comrade. Now stand at ease, the whole squad, and watch me.<\/strong><\/p><\/blockquote>\n<div style='text-align: center; margin-bottom: 0.5em;'>\n<video width=\"715\" height=\"540\" autoplay=\"autoplay\" loop=\"true\"><source src=\"\/weblog\/wp-content\/uploads\/2025\/06\/ws.mp4\" type=\"video\/mp4\" \/><\/video><\/div>\n<p>Yes, this technology is here, today. A tad over four decades late, I guess, but welcome to the future, comrades.<\/p>\n<fb:like href='https:\/\/spinor.info\/weblog\/?p=13538' send='true' layout='standard' show_faces='true' width='450' height='65' action='like' colorscheme='light' font='lucida grande'><\/fb:like>","protected":false},"excerpt":{"rendered":"<p>Here is an AI project that I could build right now, probably in a matter of hours, not days. I am not going to do it, because it would be a waste of time, as it is simply a proof-of-concept, nothing more. A concept that I wish would remain unproven but it won&#8217;t, not for <a href='https:\/\/spinor.info\/weblog\/?p=13538' class='excerpt-more'>[&#8230;]<\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[9,58,5,18],"tags":[],"class_list":["post-13538","post","type-post","status-publish","format-standard","hentry","category-books","category-cybernetics","category-politics","category-society","category-9-id","category-58-id","category-5-id","category-18-id","post-seq-1","post-parity-odd","meta-position-corners","fix"],"_links":{"self":[{"href":"https:\/\/spinor.info\/weblog\/index.php?rest_route=\/wp\/v2\/posts\/13538","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/spinor.info\/weblog\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/spinor.info\/weblog\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/spinor.info\/weblog\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/spinor.info\/weblog\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=13538"}],"version-history":[{"count":10,"href":"https:\/\/spinor.info\/weblog\/index.php?rest_route=\/wp\/v2\/posts\/13538\/revisions"}],"predecessor-version":[{"id":13549,"href":"https:\/\/spinor.info\/weblog\/index.php?rest_route=\/wp\/v2\/posts\/13538\/revisions\/13549"}],"wp:attachment":[{"href":"https:\/\/spinor.info\/weblog\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=13538"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/spinor.info\/weblog\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=13538"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/spinor.info\/weblog\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=13538"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}