Connect with us

Hi, what are you looking for?

Your Profit SpringYour Profit Spring

Tech News

OpenAI transcribed over a million hours of YouTube videos to train GPT-4

Cath Virginia / The Verge | Photos from Getty Images

Earlier this week, The Wall Street Journal reported that AI companies were running into a wall when it comes to gathering high-quality training data. Today, The New York Times detailed some of the ways companies have dealt with this. Unsurprisingly, it involves doing things that fall into the hazy gray area of AI copyright law.

The story opens on OpenAI which, desperate for training data, reportedly developed its Whisper audio transcription model to get over the hump, transcribing over a million hours of YouTube videos to train GPT-4, its most advanced large language model. That’s according to The New York Times, which reports that the company knew this was legally questionable but believed it to be fair use. OpenAI president Greg…

Continue reading…

You May Also Like

Tech News

The new adaptive charging feature could help to save power and preserve the life of controller batteries. | Photo by Amelia Holowaty Krales /...

Tech News

The latest Netflix app update will require Apple devices to run iOS 17 or later. | Illustration by Alex Castro / The Verge The...

World News

In theory, the question should have been easy. Debate moderator Linsey Davis on Tuesday night pointed out to former president Donald Trump that he...

Tech News

Hi, friends! Welcome to Installer No. 32, your guide to the best and Verge-iest stuff in the world. (If you’re new here, welcome, happy...