Lookahead

less than 1 minute read

Last blog about Medusa was going longer than I expected. So I will write separete blogs about lookahead and EAGLE1/2

Lookahead is from LMSYS, and I mainly read from this blog and this post

1 Jacob Iteration Algorithm

This is an algorithm to solve nonlinear equations iteratively. Details here. It breaks down the matrix to a diagonal one (easy to revert it by taking reciprocal of all diagonal elements) and L/U. Alt text

The idea is to change auto-regression decoding into a nonliear equations system and solve all the tokens by iteration Alt text

2 Lookahead decoding

So we can solve all m tokens in k iterations with parellel decoding. If $k«m$ then we can achieve speed up with lookahead decoding. The paper is here Alt text

Tags:

Categories:

Updated: