LLM Responses compared
I thought a nice exercise would be to take a relatively simple prompt and assess how the Closed and Open models currently available compare. This is the prompt that I used:
<PROMPT>
I’m taking my daughter to an oral surgeon today to discuss removing her 3 present wisdom teeth. In overwhelming (?) circumstances wisdom teeth are removed as there is not room in the mouth for them. Is there ever going to be a chance mother nature starts creating humans without any wisdom teeth?
</PROMPT>
First, all the responses essentially touched on the same items. Some had more detail and more flourish in their delivery. Consider how I’m using this data; it has informed me and enlightened me on the question at hand. I have no intention of repurposing this copy on any site except for illustrative purposes, which is Chrisspeak for not caring as much about how “pretty” the prose of the LLMs are. Here’s a slideshow of the responses.
Here are a couple opinions in no particular order:
1. the Meta.ai response is sad, and they are the only Open source US model in the list.
2. Deepseek R1 and Qwen both have impressive results.
3. I like the Grok response more that I anticipated. But then again, I was predisposed not to like it (shakes fist at Elon) but I keep an open mind.
Musings – Prompting, productivity, and context
Prompting, Productivity and Context Finish the following sentence: "Blogging is so ..." and yet here I am. Prompting I've been trying to engage people close to me as to their AI experiences and uses, either professionally or personnally. I find myself reminding...
“Accumulation of Cognitive Debt”
There's an article published at MIT that studied "Your Brain on ChatGPT: Accumulation of Cognitive Debt when Using an AI Assistant for Essay Writing Task https://arxiv.org/abs/2506.08872 This blog post was pitched as a rebuttal of sorts to the MIT study - definitely...
The upside of AI
Positive thoughts? I came across two older items last week that really made me feel good about Artificial Intelligence and some of the good aspects and possibilities. Like most people grounded in reality, the very impact of AI's disruption on the job front and...
That “aha!” moment – a simple image
I finally committed to the "Plus" ($20 a month) version of ChatGPT and have been finding more and more things to use it for, and trying my best not to use it as lazy google. Hum de dum and I'm looking at making an AI Policy for TQuist, as well as provide information...
Anthropic’s AI Fluency course
Here's the link to Anthropic's AI Fluency course: https://www.anthropic.com/ai-fluency
Alexa, cook me some eggs!
NY Times / AMZN deal Here's an article from The Verge about The New York Times recent deal with Amazon do deliver its “editorial content to a variety of Amazon customer experiences,” I shudder to think how little The Gray Lady is getting paid. However, I hope the AI...
Oh the Humanity!’s Last Exam!
Humanity's Last Exam Benchmarks are interesting. Here's the deep thought - at what point in the overall benchmark process will AI inject bias into the benchmark test? And to what end? Maybe not so deep a thought. Humanity's Last Exam has been bantered about...