EPISODE · Sep 18, 2023 · 7 MIN
“ChinAI #237: Safety Benchmarks for Chinese Large Models” by Jeffrey Ding
from ChinAI Newsletter
Subtitle: SuperCLUE-Safety, the first Chinese large-model multi-round adversarial safety benchmark, is released.

Greetings from a world where…for the rest of the college football season, this status update will be devoted to tracking the Iowa Hawkeye offense's march to mediocrity…

As always, the searchable archive of all past issues is here. Please please subscribe here to support ChinAI under a Guardian/Wikipedia-style tipping model (everyone gets the same content but those who can pay support access for all AND compensation for awesome ChinAI contributors).

Feature Translation: SuperCLUE-Safety

Context: Every two months or so, we’ve been checking in with the SuperCLUE rankings, which aim to benchmark large language models from Chinese and international labs along different dimensions. In the previous update to the SuperCLUE benchmark, we saw Baidu's ErnieBot soar up the rankings, on the strength of its performance with Chinese-language particularities (e.g. idioms). This past week, the SuperCLUE team released a safety benchmark (link to [...]

---
First published: September 18th, 2023
Source: https://chinai.substack.com/p/chinai-237-safety-benchmarks-for
---
Narrated by TYPE III AUDIO.