July 2024

AI training dataset used by tech giants allegedly created by scraping YouTube videos in violation of terms of service

Non-profit AI research group EleutherAI scraped YouTube subtitles to create a dataset in violation of YouTube’s terms of service, ProofNews said on July 16. The dataset, called the Pile, allegedly […]