0-10 subscribers
Go offline with the Player FM app!
Decoding LLM Quality: From Unit Testing to User Feedback
Manage episode 379415542 series 3519364
Dive deep into the nuances of measuring the quality of Large Language Model (LLM) prompts, as we explore past methodologies, and evaluate both qualitative assessments and large-scale testing techniques. Join us as we discuss the challenges of traditional metrics, the role of user feedback, and brainstorm new ways to gauge generative model performance.
—
Continue listening to The Prompt Desk Podcast for everything LLM & GPT, Prompt Engineering, Generative AI, and LLM Security.
Check out PromptDesk.ai for an open-source prompt management tool.
Check out Brads AI Consultancy at bradleyarsenault.me.
Add Justin Macorin and Bradley Arsenault on LinkedIn.
Please fill out our listener survey here to help us create a better podcast: https://docs.google.com/forms/d/e/1FAIpQLSfNjWlWyg8zROYmGX745a56AtagX_7cS16jyhjV2u_ebgc-tw/viewform?usp=sf_link
Hosted by Ausha. See ausha.co/privacy-policy for more information.
52 episodes
Manage episode 379415542 series 3519364
Dive deep into the nuances of measuring the quality of Large Language Model (LLM) prompts, as we explore past methodologies, and evaluate both qualitative assessments and large-scale testing techniques. Join us as we discuss the challenges of traditional metrics, the role of user feedback, and brainstorm new ways to gauge generative model performance.
—
Continue listening to The Prompt Desk Podcast for everything LLM & GPT, Prompt Engineering, Generative AI, and LLM Security.
Check out PromptDesk.ai for an open-source prompt management tool.
Check out Brads AI Consultancy at bradleyarsenault.me.
Add Justin Macorin and Bradley Arsenault on LinkedIn.
Please fill out our listener survey here to help us create a better podcast: https://docs.google.com/forms/d/e/1FAIpQLSfNjWlWyg8zROYmGX745a56AtagX_7cS16jyhjV2u_ebgc-tw/viewform?usp=sf_link
Hosted by Ausha. See ausha.co/privacy-policy for more information.
52 episodes
All episodes
×


1 Why you can't automate everything with LLMs 18:24

1 Data Preparation Best Practices for Fine Tuning 20:26

1 How to Rescue AI Innovation at Big Companies 19:20



1 Towards an Inter-Agent Communication Standard 21:21



Welcome to Player FM!
Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.