Ultrafast machine learning on FPGAs via Kolmogorov-Arnold Networks
Posted by ag2718 4 hours ago
Comments
Comment by mikeayles 3 hours ago
I've been trying to hit 100,000tokens/s with a 3.28m dumb model, and even this is an order of magnitude too large to benefit.
It appears to be focussed more on latency, than throughput. Happy to be corrected?
Comment by ag2718 2 hours ago
Comment by RantyDave 4 hours ago
Comment by ag2718 3 hours ago
One primary application of this work is in high-energy physics (https://home.cern/smarter-decisions-at-the-speed-of-collisio...). Ultrafast and real-time learning is also very applicable for problems in quantum computing, plasma control, etc. (https://arxiv.org/pdf/2602.02005).
Comment by poly2it 3 hours ago
Comment by tomrod 2 hours ago
Comment by Animats 3 hours ago
Comment by cwmoore 1 hour ago
Comment by babelfish 3 hours ago
Comment by amdeisimncrmnls 2 hours ago
Comment by KAN_LUT 1 hour ago