Microsoft Analytics Tools Identify Trends, Privacy In Mind

September 24, 2021

Synthetic data is useful only if it is accurate, said Microsoft Research Director Darren Edge.

“You can generate synthetic data with perfect privacy but zero utility by sampling random values from random distributions.” Useful synthetic data must match the distribution of the real data set, down to the combinations of individual traits such as age, nationality, location, occupation, and so on.

Using Microsoft’s Synthetic Data Showcase, an open-source tool, the United Nations International Organization for Migration has developed a synthetic human trafficking dataset that has the same structure and statistics as the real data.

Thus, the study yields the same insights into what types of people are exploited where and how – but not enough data to track real individuals – in addition to a Power BI dashboard that can be opened in the cloud or uses the free Power BI Desktop app.

The answer lies in controlling the resolution of the data: making sure that a certain combination of attributes applies to a sufficiently large number of people that it does not act like a fingerprint for a specific person.

Microsoft is able to do this using a technique called k-anonymity, where k is the minimum number of people for each combination. Likewise, password monitoring tools such as Have I Been Pwned, 1Password, and Google’s Password Checkup can tell you if your password has been leaked.

For more information, view the original story from Techrepublic.

Top Stories

Related Articles

June 24, 2025 A new report from Okta shows that despite growing fears about identity theft, most more...

June 23, 2025 Canada’s cybersecurity agency and the U.S. Federal Bureau of Investigation have confirmed that a more...

June 12, 2025 A new vulnerability discovered in Microsoft Copilot has raised urgent concerns about the security more...

May 6, 2025 A coordinated supply chain attack has compromised between 500 and 1,000 e-commerce websites by more...

Jim Love

Jim is and author and podcast host with over 40 years in technology.

Share:
Facebook
Twitter
LinkedIn