Part
01
of two
Part
01
Privacy-Preserving Machine Learning - Key Players
This report provides an overview of key players in the following segments of privacy-preserving machine learning: differential privacy, encrypted deep learning, model integrity, and synthetic data. Microsoft is a key player in differential privacy and recently released a world-first open source platform. Intel is a leader in encrypted deep learning with its recently-released HE Transformer. IBM leads the Natural Language Processing field with Watson. In synthetic data, Mostly AI has gained attention from major investors.
Differential Privacy
- Key player: Microsoft leads the differential privacy field with a score of 144.6, according to emerging technology blog Linknovate. Its nearest competitor is Apple with a score of 101.5. It is significantly involved in the field's development.
- Products and solutions in differential privacy: Microsoft is actively working with its partners (including Harvard) to develop open toolkits "to better enable differential privacy." The first open source platform of its kind was announced in June 2020; it uses Harvard's Open DP initiative.
- Products and solutions in other fields:
- Encrypted deep learning: Microsoft offers Microsoft Simple Encrypted Arithmetic Library (SEAL) to facilitate encrypted computations. This service was established on March 27, 2016.
- Model integrity: Microsoft offers Azure for natural language processing solutions. Azure became available on February 1st, 2010.
- Synthetic data: The primary solution Microsoft offers for data simulation is Azure, which launched on February 1st, 2010.
- Federated learning: Microsoft released research in March 2020 related to its in-developed federated learning framework.
- Format preserving encryption solutions: Microsoft does not offer a first-party format-preserving encryption solution.
- Related Findings: As of 2018, the most active organizations in the field of differential privacy are SMEs and startups with 43 percent of activity. Universities are not far behind with 36 percent of activity in the field.
- Related finding: Of the top 10 organizations involved in differential privacy, the top three are tech corporations based in America. Of the remaining seven, six are American universities (and one is a Chinese university).
Encrypted Deep Learning
- Key player: Intel in engaged in creating Homomorphic Encryption technologies that are more accessible and can scale up. The possibility of widespread application is significant and makes them a key player.
- Products and solutions in encrypted deep learning: Its HE Transformer for nGraph is a backend to Intel's nGraph, a "graph compiler for Artificial Neural Networks." This solution became available in November 2019.
- Products and solutions in other fields:
- Differential privacy : Intel does not have a differential privacy solution but produced a related white paper on AI privacy in 2018.
- Model integrity : Intel offers NLP Architect for natural language processing. This became available in 2018.
- Synthetic data : Intel does not provide a synthetic data solution.
- Federated learning : Intel has partnered with Data Republic to develop and offer federated data solutions to financial institutions. The first trial of Intel's solution was completed in 2018 with Singapore's United Overseas Bank.
- Format preserving encryption solutions: Intel does not have a software-based format-preserving encryption solution.
Model Integrity
- Key player: IBM is a key player in the model integrity or natural language processing (NLP) field. This is mainly due to the prevalence and dominance of its Watson AI.
- Products and solutions in model integrity: IBM's Watson is the primary offering in NLP solutions.
- Products and solutions in other fields:
- Differential privacy : IBM released its open source differential privacy tool, the IBM Differential Privacy Library, in June 2020.
- Encrypted deep learning : IBM completed its first successful field trials of "fully homomorphic encryption" to enable encrypted deep learning in July 2020.
- Synthetic data : IBM InfoSphere Optim Test Data Fabrication enables the production of synthetic data.
- Federated learning : IBM's Federated Learning tool was released in July 2020.
- Format preserving encryption solutions: IBM hosts an FPE Translate callable service for some format preserving encryption purposes. It is not clear how long it has been available.
Synthetic Data
- Key player: Mostly AI is a key player in the synthetic data segment. This is due to the significant interest and investment it has received from major banks and partners since its launch, including one of the top five insurance companies in the U.S.
- Products and solutions in synthetic data: Mostly Generate is Mostly AI's offering in the synthetic data field; it has been available since 2017.
- Products and solutions in other fields:
- Differential privacy : Mostly AI does not have a differential privacy solution but one of its staff wrote a recent blog post on it.
- Encrypted deep learning : Mostly AI does not offer encrypted deep learning but its CEO was recently interviewed and discussed it.
- Model integrity : Mostly AI does not offer model integrity solutions.
- Federated learning : Mostly AI does not offer federated learning solutions, but one of its staff was interviewed recently and discussed its relationship with synthetic data.
- Format-preserving encryption solutions: Mostly AI does not offer a format-preserving encryption solution.
Research Strategy
For each technology segment, key players were identified by a search through industry-related media and blogs.