The MLCommons AI Safety Working Group has released version 0.5 of the AI Safety Benchmark, which evaluates the safety risks of AI systems using chat-tuned language models. The benchmark covers 13 hazard categories and provides a structured approach to benchmark construction. It is aimed at model providers, model integrators, and AI standards makers and regulators.
•5m read time• From marktechpost.com
Sort: