Meta will start using data from EU users to train its AI models

Meta plans to start using data collected from its users in the European Union to train its AI systems, the company announced today. Starting this week, the tech giant will begin notifying Europeans through email and its family of apps of the fact, with the message set to include an explanation of the kind of data it plans to use as part of the training. Additionally, the notification will link out to a form users can complete to opt out of the process. "We have made this objection form easy to find, read, and use, and we’ll honor all objection forms we have already received, as well as newly submitted ones," says Meta. The company notes it will only use data it collects from public posts and Meta AI interactions for training purposes. It won't use private messages in its training sets, nor any interactions, public or otherwise, made by users under the age of 18. As for why the company wants to start using EU data now, it claims the information will allow it to fine tune its future models to better serve Europeans. "We believe we have a responsibility to build AI that’s not just available to Europeans, but is actually built for them. That’s why it’s so important for our generative AI models to be trained on a variety of data so they can understand the incredible and diverse nuances and complexities that make up European communities," Meta states. "That means everything from dialects and colloquialisms, to hyper-local knowledge and the distinct ways different countries use humor and sarcasm on our products. This is particularly important as AI models become more advanced with multi-modal functionality, which spans text, voice, video, and imagery." Meta notes other AI companies, including Google and OpenAI, have similarly used data from European users to fine tune their own systems. Today's announcement follows the initial release of Meta's new Llama 4 models. After some early hype, Meta was accused of gaming LMArena, a website where humans compare the outputs of different AI models to rank them. Researchers noticed the company had provided an experimental version of Llama 4 to the site "optimized for conversationality.”This article originally appeared on Engadget at https://www.engadget.com/ai/meta-will-start-using-data-from-eu-users-to-train-its-ai-models-175307338.html?src=rss

Apr 14, 2025 - 19:00

Meta will start using data from EU users to train its AI models

Meta plans to start using data collected from its users in the European Union to train its AI systems, the company announced today. Starting this week, the tech giant will begin notifying Europeans through email and its family of apps of the fact, with the message set to include an explanation of the kind of data it plans to use as part of the training. Additionally, the notification will link out to a form users can complete to opt out of the process. "We have made this objection form easy to find, read, and use, and we’ll honor all objection forms we have already received, as well as newly submitted ones," says Meta.

The company notes it will only use data it collects from public posts and Meta AI interactions for training purposes. It won't use private messages in its training sets, nor any interactions, public or otherwise, made by users under the age of 18. As for why the company wants to start using EU data now, it claims the information will allow it to fine tune its future models to better serve Europeans.

"We believe we have a responsibility to build AI that’s not just available to Europeans, but is actually built for them. That’s why it’s so important for our generative AI models to be trained on a variety of data so they can understand the incredible and diverse nuances and complexities that make up European communities," Meta states.

"That means everything from dialects and colloquialisms, to hyper-local knowledge and the distinct ways different countries use humor and sarcasm on our products. This is particularly important as AI models become more advanced with multi-modal functionality, which spans text, voice, video, and imagery."

Meta notes other AI companies, including Google and OpenAI, have similarly used data from European users to fine tune their own systems. Today's announcement follows the initial release of Meta's new Llama 4 models. After some early hype, Meta was accused of gaming LMArena, a website where humans compare the outputs of different AI models to rank them. Researchers noticed the company had provided an experimental version of Llama 4 to the site "optimized for conversationality.”This article originally appeared on Engadget at https://www.engadget.com/ai/meta-will-start-using-data-from-eu-users-to-train-its-ai-models-175307338.html?src=rss