Loading...
JetSpec: Speculative decoding with 9.64x LLM speedup | Next.js Blog