Compiler Theory and Reactivity

22 February 2024
Forget, React, Compiler

If you're wondering what the React compiler is, I recommend reading our recent update post for some background. This post is for anyone curious about the compiler theory behind it. Don't feel pressured to understand everything in this post in order to use the compiler.

The React compiler implements numerous traditional compiler transformations that are generally hard to approach without some background in compiler theory. In this post, I'll try to give a more accessible explanation, with examples, of one of them: the transformation into Static Single Assignment (SSA) form.

Consider this simple React component:

function Component({ colours }) {
  let styles = { colours };
  return <Item styles={styles} />;
}

We can easily memoize it like this:

function Component(props) {
  const $ = useMemoCache(2);
  const { colours } = props;
  let t0;

  if ($[0] !== colours) {
    t0 = { colours };

    $[0] = colours;
    $[1] = t0;
  } else {
    t0 = $[1];
  }

  const styles = t0;
  return <Item styles={styles} />;
}

The compiler can track the styles object being created and passed down as props.

Don't worry too much about the useMemoCache hook; it's an internal API used by the compiler to cache values. Think of `$` as an array.
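To make the array analogy concrete, here's a minimal sketch in plain JavaScript, outside React. `createCache` and `render` are hypothetical stand-ins, not the compiler's real API. The sketch assumes the cache survives between renders and starts out filled with a sentinel that never equals a real prop value.

```javascript
// Hypothetical stand-in for useMemoCache: an array that survives across
// renders, pre-filled with a sentinel that never equals a real value.
const EMPTY = Symbol("empty");
function createCache(size) {
  return new Array(size).fill(EMPTY);
}

const $ = createCache(2);

// Mirrors the memoized component body above, minus the JSX.
function render(props) {
  const { colours } = props;
  let t0;
  if ($[0] !== colours) {
    t0 = { colours }; // dependency changed: create a new object
    $[0] = colours;
    $[1] = t0;
  } else {
    t0 = $[1]; // dependency unchanged: reuse the cached object
  }
  return t0;
}

const first = render({ colours: "red" });
const second = render({ colours: "red" });
const third = render({ colours: "blue" });
console.log(first === second); // true
console.log(second === third); // false
```

The second call returns the very same object as the first because `colours` didn't change; the third call sees a new `colours` and allocates again.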

The React Compiler can memoize JSX too, but I'm leaving it out in this post for brevity.

Now, let's say you want to refactor the styles based on a condition.

function Component({ colours, hover, hoverColours }) {
  let styles;
  if (!hover) {
    styles = { colours };
  } else {
    styles = { colours: hoverColours };
  }
  return <Item styles={styles} />;
}

Memoizing the styles object becomes a bit more challenging for the compiler because it's no longer a single statement. It's spread across several statements, and there's control flow involved -- styles is created in both the if and else block.

The compiler can still track styles creation across both blocks and memoize it like this:

function Component(props) {
  const $ = useMemoCache(4);
  const { hover, colours, hoverColours } = props;
  let styles;

  if ($[0] !== hover || $[1] !== colours || $[2] !== hoverColours) {
    if (!hover) {
      styles = { colours };
    } else {
      styles = { colours: hoverColours };
    }

    $[0] = hover;
    $[1] = colours;
    $[2] = hoverColours;
    $[3] = styles;
  } else {
    styles = $[3];
  }

  return <Item styles={styles} />;
}

This works, but it's not ideal because we'd invalidate the memoized value whenever any of hover, colours or hoverColours changes, even if the changed value isn't used by the branch that runs. It's too coarse-grained. Can we do better?
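Here's a small sketch of that over-invalidation in plain JavaScript. The cache array, the `renderCoarse` function and the `created` counter are illustrative assumptions standing in for the compiled component, not real compiler output.

```javascript
const EMPTY = Symbol("empty"); // sentinel: never equal to a real prop value
const $ = new Array(4).fill(EMPTY);
let created = 0; // counts how many times the memo block re-runs

function renderCoarse({ hover, colours, hoverColours }) {
  let styles;
  if ($[0] !== hover || $[1] !== colours || $[2] !== hoverColours) {
    if (!hover) {
      styles = { colours };
    } else {
      styles = { colours: hoverColours };
    }
    created++;
    $[0] = hover;
    $[1] = colours;
    $[2] = hoverColours;
    $[3] = styles;
  } else {
    styles = $[3];
  }
  return styles;
}

const a = renderCoarse({ hover: false, colours: "red", hoverColours: "pink" });
// Only hoverColours changes, and it isn't even read while hover is false.
const b = renderCoarse({ hover: false, colours: "red", hoverColours: "orange" });
console.log(a === b); // false
console.log(created); // 2
```

Both results have identical contents, yet the second render allocates a fresh object, so anything downstream that compares by reference sees a change.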

Track values, not variables

One core intuition is that we can memoize the value created in the if block separately from the one created in the else block. They are separate values (separate objects) that just happen to be referenced by the same variable identifier (styles).

Consider our previous example, slightly modified to track the values separately by giving them different identifiers:

let styles;
if (!hover) {
  t0 = { colours };               // <-- separate value
} else {
  t1 = { colours: hoverColours }; // <-- separate value
}
styles = choose(t0 or t1);

Now, it's pretty clear that we can memoize t0 and t1 separately. You may also have noticed that we need to choose between t0 and t1 and assign the result to styles, but let's ignore that for now.

The compiler can memoize the values in their respective blocks:

if (!hover) {
  if ($[0] !== colours) {
    t0 = {
      colours,
    };
    $[0] = colours;
    $[1] = t0;
  } else {
    t0 = $[1];
  }
} else {
  if ($[2] !== hoverColours) {
    t1 = {
      colours: hoverColours,
    };
    $[2] = hoverColours;
    $[3] = t1;
  } else {
    t1 = $[3];
  }
}
styles = choose(t0 or t1);

This is more fine-grained than the previous example: each object is invalidated only when its own input changes.
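A runnable sketch of the same idea in plain JavaScript follows. `renderFine` is a hypothetical stand-in for the compiled component, and the ternary at the end is only my assumption for what "choose" does at runtime: pick the value from whichever branch ran.

```javascript
const EMPTY = Symbol("empty"); // sentinel: never equal to a real prop value
const $ = new Array(4).fill(EMPTY);

function renderFine({ hover, colours, hoverColours }) {
  let t0;
  let t1;
  if (!hover) {
    if ($[0] !== colours) {
      t0 = { colours };
      $[0] = colours;
      $[1] = t0;
    } else {
      t0 = $[1];
    }
  } else {
    if ($[2] !== hoverColours) {
      t1 = { colours: hoverColours };
      $[2] = hoverColours;
      $[3] = t1;
    } else {
      t1 = $[3];
    }
  }
  // Stand-in for choose(t0 or t1): take the value from the branch that ran.
  return hover ? t1 : t0;
}

const a = renderFine({ hover: false, colours: "red", hoverColours: "pink" });
const b = renderFine({ hover: false, colours: "red", hoverColours: "orange" });
const c = renderFine({ hover: true, colours: "red", hoverColours: "orange" });
const d = renderFine({ hover: false, colours: "red", hoverColours: "orange" });
console.log(a === b); // true
console.log(a === d); // true
console.log(c.colours); // orange
```

Changing hoverColours while not hovering no longer creates a new object, and the non-hover object even survives a round trip through the hover state.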

Where's the complexity?

But wait, we're just memoizing a value in the scope where it was created. What's so hard about that?

Well, let's consider another example:

function Component({ colours, hover, hoverColours }) {
  let styles;
  if (!hover) {
    styles = { colours };
  } else {
    styles = { colours: hoverColours };
  }
  styles.height = "large"; // <-- modifying styles object
  return <Item styles={styles} />;
}

In the above example, we modify the styles object after the if-else block by adding a new property named height. It's no longer safe to memoize the values inside the if-block and else-block separately.

We can't modify a value after it's memoized. Not because it's sub-optimal performance-wise, but because it leads to incorrect behavior during re-rendering. Take a minute to think about how this behavior can manifest in practice.
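Here's one way it can manifest, sketched in plain JavaScript. To make the bug visible I'm assuming a hypothetical variant of the component where the height comes from a `height` prop instead of a constant, and I've dropped the hover branch for brevity. `renderUnsafe` deliberately does the wrong thing: it memoizes the object where it's created and mutates it afterwards.

```javascript
const EMPTY = Symbol("empty"); // sentinel: never equal to a real prop value
const $ = new Array(2).fill(EMPTY);

// Unsafe on purpose: memoizes at creation, then mutates after the memo block.
function renderUnsafe({ colours, height }) {
  let t0;
  if ($[0] !== colours) {
    t0 = { colours };
    $[0] = colours;
    $[1] = t0;
  } else {
    t0 = $[1];
  }
  const styles = t0;
  styles.height = height; // mutates the object that is sitting in the cache
  return styles;
}

const firstRender = renderUnsafe({ colours: "red", height: "small" });
const seenByChild = firstRender; // what a child kept from the first render
const secondRender = renderUnsafe({ colours: "red", height: "large" });

console.log(firstRender === secondRender); // true
console.log(seenByChild.height); // large
```

The height changed between renders, but the object identity didn't, so anything that compares props by reference would conclude nothing changed. On top of that, the object handed out by the first render was modified behind its back.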

We need a way to track the values as they flow through the program, not simply memoize them in the scope where they are created.

One could argue that you shouldn't be writing code like this. But local mutations are very natural in JavaScript, and there's plenty of React code written like this that we need to compile efficiently.

Track the flow

Remember the "choose" function we ignored earlier? It lets the compiler track the values as they flow across the if-else blocks!

if (!hover) {
  t0 = { colours };
} else {
  t1 = { colours: hoverColours };
}
styles = choose(t0 or t1); // <-- tracks values after control flow
styles.height = 'large';

Now, the code (or to be precise, the compiler's intermediate representation) tells the compiler that styles is either t0 or t1 and modifying styles is equivalent to modifying the values t0 and t1.
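To illustrate how that inference could work, here's a minimal sketch over a made-up, heavily simplified instruction list. The instruction shapes (`create`, `phi`, `mutate`) and the `findMutatedValues` function are hypothetical and are not the React compiler's real data structures. The sketch also assumes straight-line instruction order, so a single forward pass is enough.

```javascript
// Hypothetical mini-IR for the snippet above.
const instructions = [
  { kind: "create", id: "t0" }, // t0 = { colours }
  { kind: "create", id: "t1" }, // t1 = { colours: hoverColours }
  { kind: "phi", id: "styles", operands: ["t0", "t1"] }, // choose(t0 or t1)
  { kind: "mutate", target: "styles" }, // styles.height = 'large'
];

// Returns the ids of created values that may be mutated later on.
function findMutatedValues(instrs) {
  const sources = new Map(); // id -> Set of created values it may refer to
  const mutated = new Set();
  for (const instr of instrs) {
    if (instr.kind === "create") {
      sources.set(instr.id, new Set([instr.id]));
    } else if (instr.kind === "phi") {
      // A phi may refer to anything its operands may refer to.
      const merged = new Set();
      for (const operand of instr.operands) {
        for (const value of sources.get(operand) ?? []) merged.add(value);
      }
      sources.set(instr.id, merged);
    } else if (instr.kind === "mutate") {
      // Mutating an identifier mutates every value it may refer to.
      for (const value of sources.get(instr.target) ?? []) mutated.add(value);
    }
  }
  return mutated;
}

console.log([...findMutatedValues(instructions)]); // [ 't0', 't1' ]
```

Because the mutation reaches both t0 and t1 through the phi, neither can be safely memoized on its own inside its branch.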

The compiler can now infer that styles can only be memoized at a coarser level, like this:

if ($[0] !== hover || $[1] !== colours || $[2] !== hoverColours) {
  if (!hover) {
    styles = {
      colours,
    };
  } else {
    styles = {
      colours: hoverColours,
    };
  }

  styles.height = "large";
  $[0] = hover;
  $[1] = colours;
  $[2] = hoverColours;
  $[3] = styles;
} else {
  styles = $[3];
}

Compiler theory

To recap, we've talked about tracking values separately with temporary identifiers and tracking the values across control flow with a "choose" function.

Interestingly, a classical compiler transformation called Static single-assignment form (SSA) does exactly this! Tracking new values and re-assignments by creating a new temporary value is the core part of the SSA transform. The "choose" function we talked about earlier is simply the "phi" (Φ) function defined in the SSA form.

The exact SSA transformation that the React compiler uses is from the excellent "Simple and Efficient Construction of Static Single Assignment Form" paper.
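To get a feel for the renaming part, here's a toy sketch. It is not the algorithm from that paper and not what the React compiler does. It handles a hypothetical mini-language with only assignments and if/else, keeps right-hand sides as opaque strings (so reads aren't renamed), flattens the branches in its output, and assumes a variable assigned in one branch is also assigned in the other or defined beforehand.

```javascript
// Gives every assignment a fresh versioned name and adds a phi wherever
// the two branches of an if/else disagree on a variable's current version.
function toSSA(statements) {
  const counters = new Map(); // variable name -> next version number
  const output = [];

  function fresh(name) {
    const n = counters.get(name) ?? 0;
    counters.set(name, n + 1);
    return `${name}_${n}`;
  }

  // env maps each variable name to its current SSA name.
  function visit(stmts, env) {
    for (const stmt of stmts) {
      if (stmt.kind === "assign") {
        const id = fresh(stmt.name);
        output.push(`${id} = ${stmt.value}`);
        env.set(stmt.name, id);
      } else if (stmt.kind === "if") {
        const thenEnv = new Map(env);
        const elseEnv = new Map(env);
        visit(stmt.consequent, thenEnv);
        visit(stmt.alternate, elseEnv);
        const names = new Set([...thenEnv.keys(), ...elseEnv.keys()]);
        for (const name of names) {
          const a = thenEnv.get(name);
          const b = elseEnv.get(name);
          if (a !== b) {
            const id = fresh(name);
            output.push(`${id} = phi(${a}, ${b})`);
            env.set(name, id);
          }
        }
      }
    }
  }

  visit(statements, new Map());
  return output;
}

const program = [
  {
    kind: "if",
    test: "!hover",
    consequent: [{ kind: "assign", name: "styles", value: "{ colours }" }],
    alternate: [
      { kind: "assign", name: "styles", value: "{ colours: hoverColours }" },
    ],
  },
];

console.log(toSSA(program).join("\n"));
// styles_0 = { colours }
// styles_1 = { colours: hoverColours }
// styles_2 = phi(styles_0, styles_1)
```

Here styles_0 and styles_1 play the role of t0 and t1 from earlier, and styles_2 is the result of the "choose".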

If you're curious to read more about compiler theory in the React compiler, take a look at the other tagged posts.
