Congrats on the paper @_AlexanderLong. But your left out @flwrlabs that published a full system (photon) with validated in-the-wild fully decentralized training up to 13B @MLSysConf. Along with a key technique of the decentralized stack (decoupled embeddings) published as an oral @iclr_conf. This was work done along with @CaMLSys at @Cambridge_Uni.
Alexander Long
Alexander Long14.7. klo 08.24
For people not familiar with AI publishing; there are 3 main conferences every year. ICML, ICLR and NeurIPS. These are technical conferences and the equivalent of journals in other disciplines - they are the main publishing venue for AI. The competition to have papers at these conferences is now at a ridiculous level, getting papers accepted is very hard, and there is a lot of concern about the review process which is quite noisey at this point. A strong paper with no flaws has around a 50% chance of being accepted, and typically a paper is submitted with reviewer changes several times until it is accepted. Despite all that, papers in these venues remain the primary stamp of legitimacy in AI world, and are probably still the primary career metrics for ML researchers (although this is weakening imo as so much of the research in the frontier labs is unpublished). Main Track papers are significantly different to workshop papers. The main track has intense, serious peer review. Workshop papers are for preliminary work, that give some indication of an interesting result, but are either not complete or the result is not significant enough for main track. They are only required to be reviewed by the workshop reviewer pool and they don’t appear in proceedings. Many great papers have first shown up in workshops (e.g. grokking) - but workshop and main track papers are fundamentally different things, with a fundamentally different level of impact. The only two companies in decentralised AI that have main track papers this year are @PrimeIntellect and Pluralis.
1,75K