Microsoft legal department allegedly silenced an engineer who raised concerns about DALL-E 3

A Microsoft supervisor claims that OpenAI’s DALL-E 3 has a safety flaw that would enable customers to generate violent or specific photos (just like these lately focused in opposition to Taylor Swift). geek line It was reported on Tuesday that the corporate’s authorized staff blocked an try by Microsoft engineering chief Shane Jones to alert the general public concerning the vulnerability. The self-proclaimed whistleblower is now taking his message to Capitol Hill.

“I’ve concluded that DALL E 3 poses a public security danger and must be withdrawn from public use till OpenAI can tackle the dangers related to the mannequin,” Jones wrote to U.S. Sen. Patty Murray (D- WA) and Maria Cantwell. D-WA), Rep. Adam Smith (D-WA ninth District) and Washington State Legal professional Basic Bob Ferguson (D). geek line Jones’ full letter was revealed.

Jones claims he found a vulnerability in early December that would bypass the DALL-E 3 safety guardrail. He stated he reported the problem to his superiors at Microsoft, who instructed him to “personally report this problem on to OpenAI.” After doing so, he claims he discovered that the flaw may produce “violent and disturbing dangerous photos.”

Jones then tried to publicize his trigger in a LinkedIn submit. “On the morning of December 14, 2023, I publicly posted a letter to the OpenAI non-profit board of administrators on LinkedIn urging them to droop the availability of DALL·E 3),” Jones wrote. “As a result of Microsoft is a board observer at OpenAI and I had beforehand shared my issues with my management staff, I instantly made Microsoft conscious of the letter I posted.”

AI-generated image of a teacup with violent waves inside. A storm was brewing from behind the window sill in the back. — *Pattern picture produced by DALL-E 3 (Storm in a Teacup)* (Open Synthetic Intelligence)

Microsoft allegedly responded by asking him to delete his submit. “Shortly after disclosing this letter to my management staff, my supervisor contacted me and knowledgeable me that Microsoft’s authorized division requested me to take away the submit,” he wrote within the letter. “He informed me that Microsoft’s authorized division would quickly observe up through e-mail with the particular causes for his or her deletion request and that I wanted to delete it instantly with out ready for an e-mail from authorized.”

Jones agreed, however stated Microsoft’s authorized staff by no means responded with a extra nuanced response. “I by no means obtained an evidence or justification from them,” he wrote. He stated makes an attempt to get extra data from the corporate’s authorized division had been ignored. “Microsoft’s authorized division has nonetheless not responded or communicated with me instantly,” he wrote.

An OpenAI spokesperson wrote in an e-mail to Engadget: “We instantly investigated the report from a Microsoft worker on December 1 and confirmed that the know-how he shared doesn’t bypass our safety methods. . Security is our high precedence and we take a multi-pronged strategy. Within the underlying DALL-E 3 mannequin, we work to filter out probably the most specific content material, together with graphic sexual and violent content material, from the coaching materials and develop It has a strong picture classifier to information the mannequin to keep away from dangerous photos.

“We now have additionally carried out further protections for our merchandise ChatGPT and DALL-E APIs, together with denying requests for the names of public figures,” the OpenAI spokesperson continued. “We determine and reject messages that violate our insurance policies and filter all generated photos earlier than being proven to customers. We use exterior knowledgeable pink groups to check for abuse and strengthen our safeguards.”

In the meantime, a Microsoft spokesperson wrote to Engadget, “We’re dedicated to addressing any and all issues raised by workers in accordance with our firm insurance policies and respect the efforts our workers make to analysis and take a look at our newest applied sciences to additional enhance their safety. With regards to safety bypasses or points that will have a possible influence on our companies or companions, we’ve sturdy inside reporting pipelines in place to correctly examine and proper any points, and we suggest workers reap the benefits of these pipelines so we are able to appropriately Confirm and take a look at his issues earlier than publicly escalating.”

A Microsoft spokesperson wrote: “As a result of his report concerned OpenAI merchandise, we inspired him to report it by way of OpenAI’s commonplace reporting channels. One among our senior product leaders shared the worker suggestions with OpenAI, and OpenAI instantly investigated the matter. “On the identical time, our staff investigated and confirmed that the reported know-how didn’t bypass safety filters in any of our AI-driven picture era options. Worker suggestions is a crucial a part of our tradition, We’re in touch with the colleague to resolve any remaining questions he might have.”

Microsoft added that its Workplace of Accountable AI has established an inside reporting device for workers to report and escalate issues about AI fashions.

Whistleblowers say the Taylor Swift pornographic deepfake that circulated on X final week is an instance of what comparable vulnerabilities can do if left unchecked. 404 media It was reported on Monday that Microsoft Designer, which makes use of DALL-E 3 as a backend, was a part of the Deepfaker toolset used to create the video. The publication claims that Microsoft patched that particular vulnerability after being notified.

“Microsoft is conscious of those vulnerabilities and the potential for abuse,” Jones concluded. It is unclear whether or not the vulnerability used to create the Swift deepfake is instantly associated to the vulnerability Jones reported in December.

Jones urged his representatives in Washington, D.C., to take motion. He steered that the U.S. authorities create a system to report and monitor particular synthetic intelligence vulnerabilities whereas defending workers like him who communicate out. “We have to maintain firms accountable for the security of their merchandise and the duty to reveal recognized dangers to the general public,” he wrote. “Involved workers like me shouldn’t be threatened into silence.”

Up to date, January 30, 2024, 8:41 pm ET: This story has been up to date so as to add Engadget statements from OpenAI and Microsoft.

Source link

What's Hot

New Doctor Who spin-off series coming to Disney+

Warner Bros. Discovery sues NBA in attempt to block Amazon’s new streaming plan

Apple adopts Biden administration’s AI safeguards

Revolutionize your growth with data-driven ABM

blue screen freeze

How to use data analytics to improve customer experience

Digital Asset Management (DAM): Benefits, Features, Use Cases

Sales Channel Analysis-Ciente

New Doctor Who spin-off series coming to Disney+

Apple adopts Biden administration’s AI safeguards

Sonos admits its latest app update was a huge mistake

Kevin Feige says Marvel’s new Blade movie must be R-rated

Amazon is discontinuing my favorite Echo, the Echo Dot with clock

Mistral Large 2 now available on Amazon Bedrock

Amazon SageMaker launches Cohere Command R fine-tuning model

Secure AccountantAI Chatbot: Lili’s Amazon Bedrock Journey

Visual haystack benchmark! – Berkeley Artificial Intelligence Research Blog

Use the Amazon Bedrock knowledge base to perform metadata filtering on table data

Warner Bros. Discovery sues NBA in attempt to block Amazon’s new streaming plan

Emma Corrin talks fighting Deadpool and Wolverine

Groundbreaking quantum microscope reveals slow-motion movement of electrons

Meta AI will be available on Quest headsets in the United States in August

Warner Bros. Acquired MultiVersus, the developer behind the Brawl game

NFT sales grew 8.5% to $107 million

KnownOrigin gradually shuts down on-chain market: A sign of growing instability in the NFT space? | NFT Culture | NFT News | Web3 Culture

What is the ERC-404 Token Standard on Ethereum (2024)

Reddit Phases Out Polygon NFT’s Animated Collection Expressions

Trump confirms fourth NFT series: ‘Incredible spirit’

Microsoft legal department allegedly silenced an engineer who raised concerns about DALL-E 3

Warner Bros. Discovery sues NBA in attempt to block Amazon’s new streaming plan

Emma Corrin talks fighting Deadpool and Wolverine

Groundbreaking quantum microscope reveals slow-motion movement of electrons

Meta AI will be available on Quest headsets in the United States in August

Leave A Reply Cancel Reply

Subscribe to Updates

What's Hot

Microsoft legal department allegedly silenced an engineer who raised concerns about DALL-E 3

Related Posts

Leave A Reply Cancel Reply