Nvidia and other giants exposed for illegally using YouTube data to train models involving 170000 videos
六月清晨搅
发表于 2024-7-17 15:00:42
200
0
0
According to media reports, some large tech companies, including Apple, NVIDIA, Salesforce, and Anthropic, have been exposed for using unauthorized data from Google's video website YouTube to train their AI models. These companies used a dataset provided by a third party, which contained a large amount of video subtitle text crawled from YouTube, violating YouTube's ban on unauthorized content crawling from the platform. The report points out that these tech companies all use a dataset called "YouTube Subtitles" when training their AI models, which is 5.7GB in size and contains 489 million words from 173500 videos across over 48000 channels on YouTube. This dataset consists of pure text for video subtitles, including parts uploaded by video bloggers and automatically transcribed text from YouTube. In addition to English, it usually comes with translations for languages such as Japanese, German, and Arabic.
CandyLake.com 系信息发布平台,仅提供信息存储空间服务。
声明:该文观点仅代表作者本人,本文不代表CandyLake.com立场,且不构成建议,请谨慎对待。
声明:该文观点仅代表作者本人,本文不代表CandyLake.com立场,且不构成建议,请谨慎对待。
猜你喜欢
- Thanks to TSMC, Nvidia has good news!
- NVIDIA once again surpasses Apple to become the world's largest market value
- High energy ahead! The "Seven Giants" of the US stock market are about to announce their financial reports, and the shocking data is coming
- SoftBank's Masayoshi Son: Nvidia's stock price 'undervalued', expected to achieve 'super AI' by 2035
- Heavy data released in the United States! Nasdaq fell more than 512 points, and the "seven sisters" of technology fell across the board. Apple fell more than 1.8%. NVIDIA's market value evaporated 1.15 trillion yuan overnight
- Nvidia requests SK Hynix to supply HBM4 chips 6 months in advance
- The third largest public pension fund in the United States reduced its holdings of Apple, Nvidia, and others in the third quarter
- Is the demand for NVIDIA too high? SK Hynix: Huang Renxun requests HBM4 chip to be delivered 6 months in advance!
- Nvidia is considering investing in Musk's xAI, with a valuation of $40 billion
- Top 20 US stock transactions: Trump Media Technology Group's stock price surged by 12%; Nvidia once became the world's most valuable company during trading
-
【英偉達の需要が高すぎる?SKハイニックス:黄仁勲がHBM 4チップの6カ月前納入を要求!】SKハイニックスの崔泰源(チェ・テウォン)会長は月曜日、インビダーの黄仁勲(ファン・インフン)CEOが同社の次世代高帯域 ...
- 琳271
- 前天 17:54
- 支持
- 反对
- 回复
- 收藏
-
ファイザーが前立腺がんを治療する革新薬テゼナ& ;reg;(TALZENNA®,一般名:トルエンスルホン酸タラゾールパーリカプセル)は2024年10月29日に国家薬品監督管理局(NMPA)の承認を得て、HRR遺伝子突然変異 ...
- 什么大师特
- 昨天 17:41
- 支持
- 反对
- 回复
- 收藏
-
南方財経は11月5日、中央テレビのニュースによると、現地時間11月5日、米ボーイング社のストライキ労働者が59%の投票結果で新たな賃金協定を受け入れ、7週間にわたるストライキを終えた。ストライキ労働者は11月12 ...
- Dubssgshbsbdhd
- 昨天 16:27
- 支持
- 反对
- 回复
- 收藏
-
【マスクはテスラが携帯電話を作ることに応えた:作れるが作らないアップルとグーグルが悪さをしない限り】現地時間11月5日、有名ポッドキャストのジョローガン氏のインタビューに応じ、「携帯電話を作るのは私たち ...
- 波大老师
- 昨天 14:41
- 支持
- 反对
- 回复
- 收藏