LLM Responses compared
I thought a nice exercise would be to take a relatively simple prompt and assess how the Closed and Open models currently available compare. This is the prompt that I used:
<PROMPT>
I’m taking my daughter to an oral surgeon today to discuss removing her 3 present wisdom teeth. In overwhelming (?) circumstances wisdom teeth are removed as there is not room in the mouth for them. Is there ever going to be a chance mother nature starts creating humans without any wisdom teeth?
</PROMPT>
First, all the responses essentially touched on the same items. Some had more detail and more flourish in their delivery. Consider how I’m using this data; it has informed me and enlightened me on the question at hand. I have no intention of repurposing this copy on any site except for illustrative purposes, which is Chrisspeak for not caring as much about how “pretty” the prose of the LLMs are. Here’s a slideshow of the responses.
Here are a couple opinions in no particular order:
1. the Meta.ai response is sad, and they are the only Open source US model in the list.
2. Deepseek R1 and Qwen both have impressive results.
3. I like the Grok response more that I anticipated. But then again, I was predisposed not to like it (shakes fist at Elon) but I keep an open mind.
Subscribed to One Useful Thing
One Useful Thing is the name of Ethan Mollick's substack newsletter. Ethan is the Co-Director of the Wharton Generative AI Labs. Wharton Generative AI Labs has lots of good information including a prompt library: https://gail.wharton.upenn.edu/prompt-library/ Check...
CMS sites revisited
Recent work research includes CMS review. I haven't looked into what is out there in a long time. The quick search hits showed lots of familiar faces and a couple new ones. I found this post informative:...
Congratulations Impact Makers
Impact Makers was awarded "Best for the World" by B Lab. Here's a Richmond Time's Dispatch mentioning of the award. I met Michael Pirron shortly after moving to Richmond in 2004, and have watched him methodically and conscientiously build Impact Makers into a...
DropBox Security
Oh, the line between security and convenience is harsh. While reading TechRepublic I found this interesting article on DropBox security. I love the convenience of cloud technologies, and use DropBox like lots of people, including the article author (Michael Kassner)....
GOOG perspective on links
Analytics is an area that I will be focusing quite a bit in the coming months. I read this article about linking and Penguin 2.0 changes on a website called "Search Engine Watch." There are many pieces to the content puzzle that websites have to face, and...
Google Apps v. Office 365
The TechRepublic newsletter is worth a scan every day. I found this gem recently and it's worth a read if you use, or might use in the future, either Google Apps or Microsoft Office. Here it is: Google Apps v. Office 365: Head-to-head comparison of features...
UltraEdit
I first used UltraEdit sometime in the late 90s. I loved it back then. When I went to work for TQuist in 2003 I purchased another copy. I just now retired my last XP system, and so I've decided to get the latest copy. I hope it's aged well. They've changed...