Tech
Microsoft’s Mustafa Suleyman says everything on the internet can be used for free to train AI models
At a time when various publications and organisations are suing AI companies, Mustafa Suleyman, the CEO of Microsoft’s AI division says its okay to scrape content available on the open web to train AI models.
Mustafa Suleyman, the CEO of Microsoft’s new AI division, recently said in an interview with CNBC’s Andrew Ross that anything you publish on the internet becomes ‘freeware’ and that it can be copied and used to train AI models.
When asked if “AI companies have effectively stolen the world’s IP”, the Google Deepmind co-founder said, “With respect to content that is already on the open web, the social contract of that content since the 90s has been that it is fair use. Anyone can copy it, recreate with it, reproduce with it. That has been freeware, if you like. That’s been the understanding.”
You have exhausted your
monthly limit of free stories.
Read more stories for free
with an Express account.
Invest in democracy. Full access to Express at just Rs 999/year
This premium article is free for now.
Register to read more free stories and access offers from partners.
Invest in democracy. Full access to Express at just Rs 999/year
This content is exclusive for our subscribers.
Subscribe now to get unlimited access to The Indian Express exclusive and premium stories.
He went on to say that unless a publisher or a news organisation explicitly asks not to scrape or crawl their content for anything other than indexing to make content visible to other people, it can be freely used to train AI models. This might suggest that Microsoft, alongside other AI companies like Perplexity, Google and OpenAI think it is okay to train their AI models on content available on the web without having to pay the creator.
Currently, one of the biggest controversies concerning AI chatbots like ChatGPT, Gemini and Copilot is that generative AI companies might be scraping copyrighted data and using it to train their upcoming AI models.
In the last few months, several organisations and publications like Forbes, the New York Times and the Recording Industry Association of America have filed lawsuits against the likes of Microsoft, ChatGPT maker OpenAI, Perplexity, Udio and others, saying that these companies have been using their content to train their AI models without permission.
© IE Online Media Services Pvt Ltd
First uploaded on: 29-06-2024 at 12:54 IST