
More than 1,000 images of child sexual abuse found in dataset used to train AI image-generating tools

More than a thousand images of child sexual abuse material were found in a massive public dataset used to train popular AI image-generating models, Stanford Internet Observatory researchers said in a study published earlier this week.


The presence of these images in the training data may make it easier for AI models to create new and realistic AI-generated images of child abuse content, or “deepfake” images of children being exploited.

The findings also raise a slew of new concerns surrounding the opaque nature of the training data that serves as the foundation of a new crop of powerful generative AI tools.

The massive dataset that the Stanford researchers examined, known as LAION 5B, contains billions of images that have been scraped from the internet, including from social media and adult entertainment websites.

Of the more than five billion images in the dataset, the Stanford researchers said they identified at least 1,008 instances of child sexual abuse material.

LAION, the German nonprofit behind the dataset, said in a statement on its website that it has a “zero tolerance policy for illegal content.”

The organization said that it received a copy of the report from Stanford and is in the process of evaluating its findings. It also noted that datasets go through “intensive filtering tools” to ensure they are safe and comply with the law.

“In an abundance of caution we have taken LAION 5B offline,” the organization added, saying that it is working with the UK-based Internet Watch Foundation “to find and remove links that may still point to suspicious, potentially unlawful content on the public web.”

LAION said it plans to complete a full safety review of LAION 5B by the second half of January and to republish the dataset at that time.

The Stanford team, meanwhile, said that removal of the identified images is currently in progress after the researchers reported the image URLs to the National Center for Missing and Exploited Children and the Canadian Centre for Child Protection.

In the report, the researchers said that while developers of LAION 5B did attempt to filter certain explicit content, an earlier version of the popular image-generating model Stable Diffusion was ultimately trained on “a wide array of content, both explicit and otherwise.”

A spokesperson for Stability AI, the London-based startup behind Stable Diffusion, told CNN in a statement that this earlier version, Stable Diffusion 1.5, was released by a separate company and not by Stability AI.

And the Stanford researchers do note that Stable Diffusion 2.0 largely filtered out results that were deemed unsafe, and as a result had little to no explicit material in the training set.

“This report focuses on the LAION-5b dataset as a whole,” the Stability AI spokesperson told CNN in a statement. “Stability AI models were trained on a filtered subset of that dataset. In addition, we subsequently fine-tuned these models to mitigate residual behaviors.”

The spokesperson added that Stability AI only hosts versions of Stable Diffusion that include filters that keep unsafe content from reaching the models.

“By removing that content before it ever reaches the model, we can help to prevent the model from generating unsafe content,” the spokesperson said, adding that the company prohibits use of its products for unlawful activity.

But the Stanford researchers note in the report that Stable Diffusion 1.5, which is still used in some corners of the internet, remains “the most popular model for generating explicit imagery.”

As part of their recommendations, the researchers said that models based on Stable Diffusion 1.5 should be “deprecated and distribution ceased where feasible.”

More broadly, the Stanford report said that massive web-scale datasets are highly problematic, even with attempts at safety filtering, both because they may include child sexual abuse material and because of other privacy and copyright concerns that arise from their use.

The report recommended that such datasets should be restricted to “research settings only” and that only “more curated and well-sourced datasets” should be used for publicly distributed models.

© 2024 Aynorm All Rights Reserved.