Satisfice is a publication authored by James Bach, a software testing expert and advocate for exploratory testing. Readers can explore articles covering topics such as software testing techniques, test management, and quality assurance practices. Additionally, they can learn about critical thinking in testing, risk-based testing approaches, and improving software quality.

Satisfice

A testing professional shares empirical data from a LARC experiment testing four LLMs at different temperatures and prompt styles on a simple task: extracting apple pie ingredients from unstructured text. The experiment demonstrates the importance of evidence-based evaluation of AI systems before deployment, with raw data stored in MongoDB and analysis reports available online. The work is part of developing a 'Testers and AI' class and aims to establish rigorous testing methodologies for AI systems.

Serious Data From Testing LLMs