Google scrapes public data to train AI

Google has updated its privacy policy to disclose that the company may use publicly available information to train its AI models, including Bard and Cloud AI. This update, implemented on July 1st, 2023, aims to provide more transparency to users regarding the company’s data collection practices.

Google spokesperson Christa Muldoon said that the company has “long been transparent” about its use of publicly available information to train its AI models. “This latest update simply clarifies that newer services like Bard are also included,” she said. “We incorporate privacy principles and safeguards into the development of our AI technologies, in line with our AI Principles.”

The updated policy expands the scope beyond “language models” to encompass broader “AI Models,” granting Google the freedom to train and build systems using public data. However, the policy fails to specify how the company will prevent copyrighted materials from being included in their training data.

The policy has also sparked concerns about the potential for Google to misuse copyrighted materials in its AI training data. Some experts have also raised questions about how Google will ensure that its AI models do not generate harmful or discriminatory content.

The sources for this piece include an article in TheVerge.

Top Stories

Related Articles

June 20, 2024 Target is introducing a new generative artificial intelligence tool aimed at enhancing the efficiency of its store employees more...

June 13, 2024 Generative AI tools are transforming the coding landscape, making both skilled and novice developers more efficient. However, the more...

May 16, 2024 Microsoft's ambitious strides in AI technology are now posing a significant challenge to its own climate goals, as more...

May 15, 2024 Ilya Sutskever, co-founder and chief scientist of OpenAI, has officially announced his departure from the company. This move more...

Jim Love

Jim Is and author and pud cast host with over 40 years in technology.