Skip to content

Runtime scaling of JIT compilation #3478

Discussion options

You must be logged in to vote

XLA TPU lead here, it is important to distinguish between the time it takes to process the the HLO graph and the time it takes to compile each op in the final HLO graph.

For HLO passes, typically the dominant complexity is linear in number of ops. Passes like fusion have quadratic behavior in the number of parameters in a fusion node every time a fusion is created. In theory this could be cubic complexity of everything in a graph fuses and adds an operand.

Many hlo passes produce a ReachabilityMap which is quadratic complexity with the largest computation. In practice this is generally quite fast due to a small constant factor. As a result, hlo passes are definitely quadratic.

That said, …

Replies: 5 comments 1 reply

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Answer selected by nickmcgreivy
Comment options

You must be logged in to vote
1 reply
@jekbradbury
Comment options

Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
6 participants