I Tried 31 Different AI Models. These Are the Ones That Work.

The Nerdy Novelist

18 Oct 202328:15

Summary

TLDRIn this video, the host explores various AI language models, focusing on open-source options available on the cloud platform Open Router. The host compares models like GPT-3.5, GPT-4, and others through creative writing prompts, assessing their performance in generating urban fantasy novel ideas, social media headlines, and a dark fantasy book chapter. The results reveal strengths in different models, with GPT models and Mistol showing promise, and the video concludes with a discussion on the affordability of using these AI models for creative writing.

Takeaways

🤖 The video discusses comparing various AI language models, including both well-known and open-source options.
🔍 The host introduces a tool called 'open router' that allows access to multiple AI models in the cloud, which is user-friendly for non-technical individuals.
💰 Open router operates on a pay-as-you-go model, with the ability to use crypto for transactions, and the host demonstrates the pricing structure during the video.
📝 The host tests the AI models by giving them a brainstorming prompt about an urban fantasy novel, evaluating their responses for creativity and relevance.
📊 The AI models perform variedly, with GPT 4 and mistol showing promise for brainstorming tasks, and the host shares detailed feedback on each model's output.
🎯 The video also tests the models on creating social media headlines using a dark fantasy book concept, with llama and Claude delivering the most engaging results.
📖 A writing prompt for a dark fantasy chapter is used to assess the models' prose quality, with Claude and mistol standing out among the rest.
🚫 The host mentions that while some models do not generate safe-for-work content, others like mistol and myax do not have such restrictions.
💡 The importance of experimenting with different models for various tasks, such as marketing or prose writing, is emphasized.
📉 The host concludes that GPT models are good for following instructions, while Claude offers creative output, and mistol shows potential for certain tasks.
💸 The cost of using open router for the tests was minimal, making it an affordable option for AI model experimentation.

Q & A

What is the main focus of the video?
-The main focus of the video is to compare different AI language models, including open-source models, and test their capabilities in various tasks such as brainstorming, writing headlines, and creating content for a dark fantasy novel.
Which tool is introduced in the video for accessing multiple AI models?
-The tool introduced in the video is called Open Router, which allows users to access and test various AI models in the cloud.
How much did the user spend from their $5 server credit during the testing?
-The user spent approximately 11 cents from their $5 server credit during the testing of the AI models.
What type of content did the user test the AI models with?
-The user tested the AI models with content related to brainstorming ideas for an urban fantasy novel, writing marketing headlines for a dark fantasy book concept, and creating a 600-word chapter for a dark fantasy story.
Which AI model stood out for its performance in writing marketing headlines?
-Llama, Meta's AI model, stood out for its performance in writing marketing headlines, providing creative and attention-grabbing options.
What was the user's overall verdict on the GPT models?
-The user found the GPT models to be good at staying on task but not as creative or engaging as some of the other models like Claude and Llama.
Which AI model did the user find to be the best for brainstorming?
-The user gave the edge to both GPT models (3.5 and 4) and Mistol for brainstorming, as they provided a mix of creative ideas and followed the user's instructions well.
What was the user's opinion on the performance of the Weaver model?
-The user encountered issues with the Weaver model, as it was unable to generate responses during the testing, so they were unable to evaluate its performance.
Which AI models did not run not safe for work content?
-The GPT models, Claud models, and the ones from Meta and Google did not run not safe for work content.
What is the user's recommendation for writing not safe for work content?
-The user recommends using Mistol for writing not safe for work content, as it is one of the open-source models that allow such content, and suggests censoring the explicit parts in the story beats.
How does the user feel about the pay-as-you-go model of Open Router compared to subscriptions?
-The user prefers subscriptions that offer unlimited words like Chat GPT and Claude, but acknowledges that the pay-as-you-go model of Open Router is inexpensive and allows for continuous use as long as one is willing to pay the small per-chat cost.