Template:Did you know nominations/Nicholas Carlini
Appearance
- The following is an archived discussion of the DYK nomination of the article below. Please do not modify this page. Subsequent comments should be made on the appropriate discussion page (such as this nomination's talk page, the article's talk page or Wikipedia talk:Did you know), unless there is consensus to re-open the discussion at this page. No further edits should be made to this page.
The result was: promoted by DimensionalFusion talk 13:12, 17 September 2024 (UTC)
DYK toolbox |
---|
Nicholas Carlini
- ... that Nicholas Carlini showed that ChatGPT could leak personal information?
ALT1: ... that, in 2018, Nicholas Carlini's team broke 7 of the 11 AI defenses presented in the ICLR conference that year?Source: https://www.wired.com/story/ai-has-a-hallucination-problem-thats-proving-tough-to-fix/- Reviewed: Template:Did you know nominations/Answers Research Journal
Improved to Good Article status by Sohom Datta (talk) and 2401hz (talk).
Number of QPQs required: 1. Nominator has 10 past nominations.
Sohom (talk) 14:39, 8 September 2024 (UTC).
General: Article is new enough and long enough |
---|
Policy: Article is sourced, neutral, and free of copyright problems |
---|
|
Hook: Hook has been verified by provided inline citation |
---|
|
QPQ: Done. |
Overall: Approved ALT0. ALT1 is rejected, and I'll provide a couple comments on it which you can feel free to ignore if you prefer ALT0. The phrasing could certainly be tighter: you don't need to say "in 2018" and "that year" in the same sentence. "ICLR" is an initialism used without context, so I might pipe that link as "a 2018 conference". Also, the source uses the word "broken" in quotes for a reason: it's not clear exactly what breaking a defense means in this context, and it seems to only be a claim from the team, not a fact that the source is backing. —TechnoSquirrel69 (sigh) 18:24, 8 September 2024 (UTC)
- I think it makes sense to go forward with only ALT0, we could try and get ALT1 to work but trying to explain "defenses" would probably make the hook fail WP:DYKINT since that would require talking about what adversarial examples are. Sohom (talk) 19:11, 8 September 2024 (UTC)
- @TechnoSquirrel69: The DYK bot only picks up the approval if the green tick is the last symbol: the rejection symbol is blocking the hook's approval. If If one of the ALTs is approved, can you add a green tick below, indicating the hooks that are approved? Thanks, Z1720 (talk) 14:42, 9 September 2024 (UTC)
- Thanks Z1720, that's good to know! Hello bot, ALT0 appoved. —TechnoSquirrel69 (sigh) 15:38, 9 September 2024 (UTC)
- @TechnoSquirrel69: The DYK bot only picks up the approval if the green tick is the last symbol: the rejection symbol is blocking the hook's approval. If If one of the ALTs is approved, can you add a green tick below, indicating the hooks that are approved? Thanks, Z1720 (talk) 14:42, 9 September 2024 (UTC)