Is ML model training actually possible on Enigma? If so, how?

Training ML models requires huge datasets, on the order of terabytes.

I saw somewhere on the web that DiDi's cab data is around 3 TB. If I wanted to train a self-driving car model using data from Tesla, GM, Uber, and BMW, how would I process that many terabytes of data on the blockchain?

How would the training take place? Could anyone please explain, step by step, how it would happen?

On the Discord I saw that the current SGX memory limit is 4 GB. How would training happen if the raw data is in terabytes?


Hey there @ezio, welcome!
I think that initially, using pre-trained models to evaluate smaller sections of user data is a feasible approach. Additionally, work in federated learning is promising for training models while doing the computation on edge devices, so that only the modifications to the model (not the raw data) are returned as encrypted tasks.
For ML as a vertical, you are right to identify the size constraint as key. But this is an active field of research; here are some recent research papers that address federated learning on edge devices using TEE hardware, which account for limited computational capacity: https://www.intel.ai/federated-learning-for-medical-imaging/#gs.bw8dg3 and https://eurosys2019.org/wp-content/uploads/2019/03/eurosys19posters-abstract66.pdf
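To make the federated learning idea concrete, here is a minimal sketch of federated averaging (FedAvg): each "edge device" trains on its own private shard and sends back only a weight delta, whose size is the model size rather than the dataset size. All names here are illustrative; this is not an Enigma or SGX API, just the general pattern the papers above build on.

```python
import numpy as np

def local_update(weights, X, y, lr=0.1, epochs=5):
    """One client's local training pass (plain linear regression via
    full-batch gradient descent). Returns only the weight delta --
    the raw data X, y never leaves the device."""
    w = weights.copy()
    for _ in range(epochs):
        grad = 2 * X.T @ (X @ w - y) / len(y)
        w -= lr * grad
    return w - weights  # in the Enigma setting, only this would be shipped (encrypted)

def federated_round(weights, clients):
    """Coordinator step: collect each client's delta, average, apply."""
    deltas = [local_update(weights, X, y) for X, y in clients]
    return weights + np.mean(deltas, axis=0)

rng = np.random.default_rng(0)
true_w = np.array([2.0, -3.0])

# Three simulated edge devices, each holding its own private data shard.
clients = []
for _ in range(3):
    X = rng.normal(size=(50, 2))
    clients.append((X, X @ true_w + 0.01 * rng.normal(size=50)))

w = np.zeros(2)
for _ in range(30):
    w = federated_round(w, clients)
```

Note that what crosses the network per round is one small vector per client (here, 2 floats), regardless of how many terabytes of raw data the clients hold, which is exactly why this pattern sidesteps the enclave memory limit for the bulk data.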
