{"id":40312,"date":"2023-08-02T10:13:31","date_gmt":"2023-08-02T14:13:31","guid":{"rendered":"https:\/\/www.technewsday.com\/?p=40312"},"modified":"2023-08-03T10:39:11","modified_gmt":"2023-08-03T14:39:11","slug":"gpt-4-breaks-ai-guardian-defense-with-natural-language-prompts","status":"publish","type":"post","link":"https:\/\/technewsday.com\/staging\/gpt-4-breaks-ai-guardian-defense-with-natural-language-prompts\/","title":{"rendered":"GPT-4 breaks AI-guardian defense with natural language prompts"},"content":{"rendered":"<p data-ar-index=\"1\">Nicholas Carlini, a Google scientist, has demonstrated how OpenAI&#8217;s GPT-4 big language model may be used to circumvent AI-Guardian, a safeguard against adversarial attacks on machine learning models.<\/p>\n<p data-ar-index=\"2\">Carlini utilized GPT-4 to develop code capable of identifying the mask used by AI-Guardian to detect adversarial samples. This enabled Carlini to create hostile cases that could go around the defense.<\/p>\n<p data-ar-index=\"3\">By directing GPT-4 to create an attack method and explain its workings, Carlini revealed how the chatbot could compromise AI-Guardian&#8217;s detection capabilities. Specifically, GPT-4 produced Python code to manipulate images without triggering AI-Guardian&#8217;s suspicions. This ability to fool classifiers significantly reduced AI-Guardian&#8217;s robustness from 98 percent to a mere 8 percent.<\/p>\n<p data-ar-index=\"4\">The study reveals machine learning algorithms, such as image recognition systems, are vulnerable to adversarial examples\u2014input that misleads the model&#8217;s identification process. Carlini&#8217;s revelation of the mask used to identify adversarial samples contradicted AI-Guardian&#8217;s technique of establishing a backdoor to reject hostile input, allowing the design of effective adversarial assaults.<\/p>\n<p data-ar-index=\"5\">&#8220;This work shows that GPT-4 can be used as a powerful tool for attacking machine learning models,&#8221; said Carlini. &#8220;It also raises concerns about the security of AI-Guardian and other similar defenses.&#8221;<\/p>\n<p data-ar-index=\"6\">The sources for this piece include an article in <a href=\"https:\/\/www.theregister.com\/2023\/08\/01\/google_boffin_breaks_ai_model\/?td=rt-3a\" target=\"_blank\" rel=\"noopener\">TheRegister<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Nicholas Carlini, a Google scientist, has demonstrated how OpenAI&#8217;s GPT-4 big language model may be used to circumvent AI-Guardian, a safeguard against adversarial attacks on machine learning models. Carlini utilized GPT-4 to develop code capable of identifying the mask used by AI-Guardian to detect adversarial samples. This enabled Carlini to create hostile cases that could [&hellip;]<\/p>\n","protected":false},"author":4,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[34],"tags":[525],"class_list":["post-40312","post","type-post","status-publish","format-standard","hentry","category-artificial-intelligence","tag-ai"],"acf":[],"_links":{"self":[{"href":"https:\/\/technewsday.com\/staging\/wp-json\/wp\/v2\/posts\/40312","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/technewsday.com\/staging\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/technewsday.com\/staging\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/technewsday.com\/staging\/wp-json\/wp\/v2\/users\/4"}],"replies":[{"embeddable":true,"href":"https:\/\/technewsday.com\/staging\/wp-json\/wp\/v2\/comments?post=40312"}],"version-history":[{"count":2,"href":"https:\/\/technewsday.com\/staging\/wp-json\/wp\/v2\/posts\/40312\/revisions"}],"predecessor-version":[{"id":40314,"href":"https:\/\/technewsday.com\/staging\/wp-json\/wp\/v2\/posts\/40312\/revisions\/40314"}],"wp:attachment":[{"href":"https:\/\/technewsday.com\/staging\/wp-json\/wp\/v2\/media?parent=40312"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/technewsday.com\/staging\/wp-json\/wp\/v2\/categories?post=40312"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/technewsday.com\/staging\/wp-json\/wp\/v2\/tags?post=40312"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}