The Future of Large Language Model Pre-training is Federated

Hackworth@lemmy.world · edit-2 6 months ago

General_Effort@lemmy.world · 6 months ago

As far as I know, federated learning is pretty much dead. The point would be that it allows organizations to create a joint model without sharing data. But it doesn’t look like anyone who doesn’t want to share data wants to share a model.
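For anyone unfamiliar with the "joint model without sharing data" idea, here is a rough sketch of federated averaging in plain Python. It is an illustration only, not the method from the linked papers: the toy linear model, the two clients, the learning rate, and every function name are hypothetical. Each client trains on its own private data, and only the resulting weights are sent back and averaged.

```python
# Minimal federated-averaging sketch (illustrative assumptions throughout).
# Model: y = w * x + b, trained with plain gradient descent on squared error.

def local_update(weights, data, lr=0.05, epochs=5):
    """Train on one client's private data, starting from the global weights."""
    w, b = weights
    n = len(data)
    for _ in range(epochs):
        grad_w = sum(2 * (w * x + b - y) * x for x, y in data) / n
        grad_b = sum(2 * (w * x + b - y) for x, y in data) / n
        w -= lr * grad_w
        b -= lr * grad_b
    return (w, b)

def federated_average(client_weights, client_sizes):
    """Average client weights, weighted by each client's number of samples."""
    total = sum(client_sizes)
    w = sum(cw[0] * n for cw, n in zip(client_weights, client_sizes)) / total
    b = sum(cw[1] * n for cw, n in zip(client_weights, client_sizes)) / total
    return (w, b)

def train(clients, rounds=200):
    """Repeat: broadcast global weights, train locally, average the results."""
    global_weights = (0.0, 0.0)
    for _ in range(rounds):
        # Only weights leave each client; the raw (x, y) pairs never do.
        updates = [local_update(global_weights, data) for data in clients]
        sizes = [len(data) for data in clients]
        global_weights = federated_average(updates, sizes)
    return global_weights

# Two hypothetical organizations, each holding samples of y = 2x + 1.
clients = [
    [(0.0, 1.0), (1.0, 3.0)],
    [(2.0, 5.0), (3.0, 7.0), (4.0, 9.0)],
]
w, b = train(clients)
print(round(w, 3), round(b, 3))
```

The coordinator ends up with a model fitted to everyone's data, which is exactly the thing the participants then have to be willing to share.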

Hackworth@lemmy.world · 6 months ago

Until they can distribute the training load of large models across consumer graphics cards (and do something like SETI@home), it does seem like the benefit of distributed training isn’t enough to overcome the friction.

Audrey0nne@leminal.space · 6 months ago

That’s a lot of words just to say that once advertisers move in on a centralized platform, its value is shot. That was a huge part of the reason I abandoned the last platform I was using and sought a federated alternative.

Hackworth@lemmy.world · 6 months ago

The papers have a ton of practical info about feasibility, implementation, etc.