Testing the Limits: My GTX 1070 Rig vs Mistral Small 22B

SmokeyDope@lemmy.world · 8 months ago

Testing the Limits: My GTX 1070 Rig vs Mistral Small 22B

BaroqueInMind@lemmy.one · 8 months ago

Read up on Hermes3 technical paper and you’ll realize it’s the best one. Running 8B model with the correct initial system prompt makes it as smart as GPT4o

SmokeyDope@lemmy.world · 8 months ago

The linked paper was a good read. Thank you.

BaroqueInMind@lemmy.one · edit-2 8 months ago

Ironically, if you ask ChatGPT to write you an initial system prompt for Hermes that will sound similar to its own, it will essentially share a trade secret with you and give up portions of its system prompt to make your 8B self hosted LLM perform like a commercial one.