As more organizations move toward the adoption of generative AI,Arouse (2025) Google wants us all to be more concerned about security. To that end, on Thursday the tech giant released its Secure AI Framework (SAIF), meant to be a sort of security roadmap, if a somewhat thinly sketched one for the time being.
But if you’re imagining this is a scheme for averting the sort of existential AI peril Elon Musk is always talking about, think smaller and more immediate.
Here’s a summary of the framework’s six “core elements”:
Elements 1 and 2 are about expanding an organization’s existing security framework to include AI threats in the first place.
Element 3 is about integrating AI into your defense against AI threats, which rather disturbingly calls to mind a nuclear arms race, whether that was intentional or not.
Element 4 is about the security benefits of uniformity in your AI-related “control frameworks.”
Elements 5 and 6 are about constantly inspecting, evaluating, and battle-testing your AI applications to make sure they can withstand attacks, and aren’t exposing you to unnecessary risk.
It looks like for now, Google mostly just wants organizations to bring elementary cybersecurity ideas to bear around AI. As Google Cloud’s info security chief Phil Venables told Axios, “Even while people are searching for the more advanced approaches, people should really remember that you've got to have the basics right as well.”
But there are already some new and unique security concerns cropping up in the here-and-now with generative AI applications like ChatGPT.
For instance, security researchers have identified one potential risk: “prompt injections,” a bizarre form of AI exploitation in which a malicious command directed at an unsuspecting AI chatbot plugin lies in wait in some block of text. When the AI scans the prompt injection, it changes the nature of the command given to the AI. It’s sort of like hiding a sinister mind-control spell in the text on Ron Burgundy’s teleprompter. Weird, right?
And prompt injections are just one of the new types of threats Google specifically says it hopes to help curb. Others include:
“Stealing the model,” a possible way of tricking a translation model into giving up its secrets.
“Data poisoning,” in which a bad actor sabotages the training process with intentionally faulty data.
Constructing prompts that can extract the potentially confidential or sensitive verbatim text that was originally used to train a model.
Google’s blog post about SAIF says the framework is being adopted by, well, Google. As for what the release of a “framework” means for the wider world, it could come to basically nothing, but it could also be adopted as a standard. For example, the US government’s National Institute of Standards and Technology (NIST) released a more general framework for cybersecurity in 2014. That was aimed at protecting critical infrastructure from cyberattacks, but it’s also highly influential, and recognized as the gold standard in cybersecurity by the majority of IT professionals surveyed about it.
Google, however, isn’t the US government, which calls into doubt just how authoritative its framework will be in the eyes of Google’s AI rivals, such as OpenAI. But in security, it looks like Google is trying to lead from the front in the AI space, instead of racing to play catch-up. Perhaps earning back some of the clout it lost in the earlier phases of the AI race is what the release of SAIF is really about.
Topics Artificial Intelligence
Beto O'Rourke's being sued over campaign text messagesGoogle News redesigned with a cleaner look, more customization optionsWordle today: Here's the answer, hints for June 18Wordle today: Here's the answer, hints for June 19Prince Harry climbs the Sydney Harbour Bridge because he's basically SpiderThe best time travel movies you can watch right nowInstagram to test video selfies for age verificationSubscription managers and other ways to get rid of unnecessary subscriptionsJulian Assange faces U.S. extradition after UK gives green lightNothing Phone 1 preorder reservations are openVSCO launches Spaces, a collaborative gallery for creators'Pokémon Go' now has a bunch of new creatures and there's 1 runaway favouriteTelegram is now offering a Premium subscriptionThe Smithsonian's 'FUTURES' virtual exhibit imagines the year 2050Sarah Silverman said she let Louis C.K. masturbate in front of her, with consentThis absurd parody proves that all TED Talks really do sound the sameDonald Trump’s election was a 'traumatic experience' for manySarah Silverman said she let Louis C.K. masturbate in front of her, with consentBitcoin continues to plummet, dropping below $20KWordle today: Here's the answer, hints for June 19 Nope, the Samsung Galaxy S8 is not coming in February Syfy's 'The Magicians' does 'some g*ddamn magic' at Brooklyn installation Massive seas of humanity move in these Women's March timelapses If you search 'a*holes' on Twitter, Donald Trump shows up first The trailer for Lifetime's Britney Spears movie will just make you sad At Women's Marches across the globe, dads were out in full force Figuring out what Aussies think about Trump on Twitter is pretty difficult Darren Criss joins DC superhero musical episode Here's where to download those amazing Leia signs at Women's March Half an onion desperately wants to get more Twitter followers than Donald Trump Emma Watson hugging her mom at the Women's March is total sweetness Hermione Granger makes magic and mistakes in new web series Nick Offerman's 'Pussyhat' rallied Reddit around a Photoshop battle Don't believe reports that Trump is pulling the U.S. out of the United Nations Hey Trump, check out these YUGE Women's March crowds across America The Grateful Dead played a beautiful private show ahead of 'Long Strange Trip' premiere The new Star Wars episode title means more than you think Finally, Samsung reveals why the Note 7 exploded The Reddit CEO and I both got LASIK in the event of the apocalypse Apple is reportedly working on new touch screen technology for iPhone
1.7712s , 8223.7578125 kb
Copyright © 2025 Powered by 【Arouse (2025)】,Creation Information Network