diff --git a/asai/Asai/Logger/Make/index.html b/asai/Asai/Logger/Make/index.html
index 8c68bf2..f68e90a 100644
--- a/asai/Asai/Logger/Make/index.html
+++ b/asai/Asai/Logger/Make/index.html
@@ -68,7 +68,7 @@
     <span><span>(<span>unit <span class="arrow">&#45;&gt;</span></span> <span class="type-var">'a</span>)</span> <span class="arrow">&#45;&gt;</span></span>
     <span class="type-var">'a</span>)</span> <span class="arrow">&#45;&gt;</span></span>
   <span><span>(<span>unit <span class="arrow">&#45;&gt;</span></span> <span class="type-var">'a</span>)</span> <span class="arrow">&#45;&gt;</span></span>
-  <span class="type-var">'a</span></span></code></div><div class="spec-doc"><p><code>adopt m run f</code> runs the thunk <code>f</code> that uses a different <code>Logger</code> instance, with the help of the runner <code>run</code> from that <code>Logger</code> instance, and then uses <code>m</code> to map the diagnostics generated by <code>f</code> into the ones in the current <code>Logger</code> instance. The backtrace within <code>f</code> will include the backtrace that leads to <code>adopt</code>. The intended use case is to integrate diagnostics from a library into those in the main application.</p><p><code>adopt</code> is a convenience function that can be implemented as follows:</p><pre class="language-ocaml"><code>let adopt m f run =
+  <span class="type-var">'a</span></span></code></div><div class="spec-doc"><p><code>adopt m run f</code> runs the thunk <code>f</code> that uses a <i>different</i> <code>Logger</code> instance. It takes the runner <code>run</code> from that <code>Logger</code> instance as an argument to handle effects, and will use <code>m</code> to transform diagnostics generated by <code>f</code> into ones in the current <code>Logger</code> instance. The backtrace within <code>f</code> will include the backtrace that leads to <code>adopt</code>, and the innermost specified location will be carried over, too. The intended use case is to integrate diagnostics from a library into those in the main application.</p><p><code>adopt</code> is a convenience function that can be implemented as follows:</p><pre class="language-ocaml"><code>let adopt m f run =
   run
     ?init_loc:(get_loc())
     ?init_backtrace:(Some (get_backtrace()))
@@ -77,7 +77,7 @@
     f</code></pre><p>Here shows the intended usage, where <code>Lib</code> is the library to be used in the main application:</p><pre class="language-ocaml"><code>module MainLogger = Logger.Make(Code)
 module LibLogger = Lib.Logger
 
-let _ = MainLogger.adopt (Diagnostic.map code_mapper) LibLogger.run @@ fun () -&gt; ...</code></pre><ul class="at-tags"><li class="parameter"><span class="at-tag">parameter</span> <span class="value">init_backtrace</span> <p>The initial backtrace to start with. The default value is the empty backtrace.</p></li></ul></div></div><div class="odoc-spec"><div class="spec value anchored" id="val-try_with"><a href="#val-try_with" class="anchor"></a><code><span><span class="keyword">val</span> try_with : 
+let _ = MainLogger.adopt (Diagnostic.map code_mapper) LibLogger.run @@ fun () -&gt; ...</code></pre></div></div><div class="odoc-spec"><div class="spec value anchored" id="val-try_with"><a href="#val-try_with" class="anchor"></a><code><span><span class="keyword">val</span> try_with : 
   <span>?emit:<span>(<span><span><a href="argument-1-Code/index.html#type-t">Code.t</a> <a href="../../Diagnostic/index.html#type-t">Diagnostic.t</a></span> <span class="arrow">&#45;&gt;</span></span> unit)</span> <span class="arrow">&#45;&gt;</span></span>
   <span>?fatal:<span>(<span><span><a href="argument-1-Code/index.html#type-t">Code.t</a> <a href="../../Diagnostic/index.html#type-t">Diagnostic.t</a></span> <span class="arrow">&#45;&gt;</span></span> <span class="type-var">'a</span>)</span> <span class="arrow">&#45;&gt;</span></span>
   <span><span>(<span>unit <span class="arrow">&#45;&gt;</span></span> <span class="type-var">'a</span>)</span> <span class="arrow">&#45;&gt;</span></span>
diff --git a/asai/Asai/Logger/module-type-S/index.html b/asai/Asai/Logger/module-type-S/index.html
index c0eeb7c..1fb7f06 100644
--- a/asai/Asai/Logger/module-type-S/index.html
+++ b/asai/Asai/Logger/module-type-S/index.html
@@ -68,7 +68,7 @@
     <span><span>(<span>unit <span class="arrow">&#45;&gt;</span></span> <span class="type-var">'a</span>)</span> <span class="arrow">&#45;&gt;</span></span>
     <span class="type-var">'a</span>)</span> <span class="arrow">&#45;&gt;</span></span>
   <span><span>(<span>unit <span class="arrow">&#45;&gt;</span></span> <span class="type-var">'a</span>)</span> <span class="arrow">&#45;&gt;</span></span>
-  <span class="type-var">'a</span></span></code></div><div class="spec-doc"><p><code>adopt m run f</code> runs the thunk <code>f</code> that uses a different <code>Logger</code> instance, with the help of the runner <code>run</code> from that <code>Logger</code> instance, and then uses <code>m</code> to map the diagnostics generated by <code>f</code> into the ones in the current <code>Logger</code> instance. The backtrace within <code>f</code> will include the backtrace that leads to <code>adopt</code>. The intended use case is to integrate diagnostics from a library into those in the main application.</p><p><code>adopt</code> is a convenience function that can be implemented as follows:</p><pre class="language-ocaml"><code>let adopt m f run =
+  <span class="type-var">'a</span></span></code></div><div class="spec-doc"><p><code>adopt m run f</code> runs the thunk <code>f</code> that uses a <i>different</i> <code>Logger</code> instance. It takes the runner <code>run</code> from that <code>Logger</code> instance as an argument to handle effects, and will use <code>m</code> to transform diagnostics generated by <code>f</code> into ones in the current <code>Logger</code> instance. The backtrace within <code>f</code> will include the backtrace that leads to <code>adopt</code>, and the innermost specified location will be carried over, too. The intended use case is to integrate diagnostics from a library into those in the main application.</p><p><code>adopt</code> is a convenience function that can be implemented as follows:</p><pre class="language-ocaml"><code>let adopt m f run =
   run
     ?init_loc:(get_loc())
     ?init_backtrace:(Some (get_backtrace()))
@@ -77,7 +77,7 @@
     f</code></pre><p>Here shows the intended usage, where <code>Lib</code> is the library to be used in the main application:</p><pre class="language-ocaml"><code>module MainLogger = Logger.Make(Code)
 module LibLogger = Lib.Logger
 
-let _ = MainLogger.adopt (Diagnostic.map code_mapper) LibLogger.run @@ fun () -&gt; ...</code></pre><ul class="at-tags"><li class="parameter"><span class="at-tag">parameter</span> <span class="value">init_backtrace</span> <p>The initial backtrace to start with. The default value is the empty backtrace.</p></li></ul></div></div><div class="odoc-spec"><div class="spec value anchored" id="val-try_with"><a href="#val-try_with" class="anchor"></a><code><span><span class="keyword">val</span> try_with : 
+let _ = MainLogger.adopt (Diagnostic.map code_mapper) LibLogger.run @@ fun () -&gt; ...</code></pre></div></div><div class="odoc-spec"><div class="spec value anchored" id="val-try_with"><a href="#val-try_with" class="anchor"></a><code><span><span class="keyword">val</span> try_with : 
   <span>?emit:<span>(<span><span><a href="Code/index.html#type-t">Code.t</a> <a href="../../Diagnostic/index.html#type-t">Diagnostic.t</a></span> <span class="arrow">&#45;&gt;</span></span> unit)</span> <span class="arrow">&#45;&gt;</span></span>
   <span>?fatal:<span>(<span><span><a href="Code/index.html#type-t">Code.t</a> <a href="../../Diagnostic/index.html#type-t">Diagnostic.t</a></span> <span class="arrow">&#45;&gt;</span></span> <span class="type-var">'a</span>)</span> <span class="arrow">&#45;&gt;</span></span>
   <span><span>(<span>unit <span class="arrow">&#45;&gt;</span></span> <span class="type-var">'a</span>)</span> <span class="arrow">&#45;&gt;</span></span>
diff --git a/asai/design.html b/asai/design.html
index c7fec2a..ec2f7f3 100644
--- a/asai/design.html
+++ b/asai/design.html
@@ -1,2 +1,2 @@
 <!DOCTYPE html>
-<html xmlns="http://www.w3.org/1999/xhtml"><head><title>design (asai.design)</title><link rel="stylesheet" href="../odoc.support/odoc.css"/><meta charset="utf-8"/><meta name="generator" content="odoc 2.2.1"/><meta name="viewport" content="width=device-width,initial-scale=1.0"/><script src="../odoc.support/highlight.pack.js"></script><script>hljs.initHighlightingOnLoad();</script></head><body class="odoc"><nav class="odoc-nav"><a href="index.html">Up</a> – <a href="index.html">asai</a> &#x00BB; design</nav><header class="odoc-preamble"><h1 id="design-principles"><a href="#design-principles" class="anchor"></a>Design Principles</h1></header><nav class="odoc-toc"><ul><li><a href="#four-factors-of-a-diagnostic">Four Factors of a Diagnostic</a></li><li><a href="#stable-unicode-art-must-avoid-column-numbers">Stable Unicode Art Must Avoid Column Numbers</a></li><li><a href="#raw-bytes-as-positions">Raw Bytes as Positions</a></li></ul></nav><div class="odoc-content"><h2 id="four-factors-of-a-diagnostic"><a href="#four-factors-of-a-diagnostic" class="anchor"></a>Four Factors of a Diagnostic</h2><p>In addition to the main message, the API should allow implementers to easily specify the following four factors, and they are somewhat independent.</p><ol><li><b>Whether the program terminate now.</b> This is done by the choice between <a href="Asai/Logger/module-type-S/index.html#val-emit">emit</a> for non-fatal messages and <a href="Asai/Logger/module-type-S/index.html#val-fatal">fatal</a> for fatal ones.</li><li><b>How the user should classify the message.</b> See <a href="Asai/Diagnostic/index.html#type-severity"><code>Asai.Diagnostic.severity</code></a>.</li><li><b>A succinct Google-able message code.</b> While severity should be changeable independently of the message code, often the same code implies the same severity. That is why we have <a href="Asai/Diagnostic/module-type-Code/index.html#val-default_severity"><code>Asai.Diagnostic.Code.default_severity</code></a> to specify the default severity for each code.</li><li><b>The backtrace and locations of other related text.</b> See <a href="Asai/Logger/module-type-S/index.html#val-tracef"><code>Asai.Logger.S.tracef</code></a> and <a href="Asai/Diagnostic/index.html#type-t.additional_messages"><code>Asai.Diagnostic.t.additional_messages</code></a>.</li></ol><h2 id="stable-unicode-art-must-avoid-column-numbers"><a href="#stable-unicode-art-must-avoid-column-numbers" class="anchor"></a>Stable Unicode Art Must Avoid Column Numbers</h2><p>There is a long history of using ASCII printable characters and ANSI escape sequences, and recently also non-ASCII Unicode characters, to draw pictures on terminals. To display compiler diagnostics, this technique has been used to assemble line numbers, code from end users, code highlighting, and other pieces of information in a visually pleasing way. Non-ASCII Unicode characters (from implementers or from end users) greatly expand the vocabulary of ASCII art, and we will call the new art form <i>Unicode art</i> to signify the use of non-ASCII characters. However, these Unicode characters also impose new challenges as their visual widths are unpredictable without knowing the exact terminal (emulator), the exact font, etc. Unicode emoji sequences might be one of the most challenging cases: a pirate flag (🏴‍☠️) may be shown as a single flag on supported platforms but as a sequence with a black flag (🏴) and a skull (☠️) on other platforms. This means the visual width of the pirate flag is unpredictable. (See <a href="https://unicode.org/reports/tr51/#Display">UTS #51 Section 2.2</a>.) The rainbow flag (🏳️‍🌈), skin tones, and many other emoji sequences have the same issue. Other less chaotic but still challenging cases include <a href="https://util.unicode.org/UnicodeJsps/list-unicodeset.jsp?a=[:ea=A:]">characters whose East Asian width is Ambiguous.</a></p><p>It is thus wise for implementers to think twice before using emoji sequences and other tricky characters in Unicode art. To quantify the degree to which a Unicode art can remain visually pleasing on different platforms, we specify the following four levels of stability. Note that if implementers decide to integrate content from end users into their Unicode art, the end users should have the freedom to include arbitrary emoji sequences and tricky characters in their content, and the final Unicode art must remain visually pleasing as defined by the stability levels.</p><ul><li><b>Level 0 (the least stable):</b> Stability under the assumption that every character occupies exactly the same visual width. Thanks to the popularity of Unicode, programs of this level are mostly considered outdated.</li></ul><ul><li><b>Level 1:</b> Stability under the assumption each Unicode string visually occupies a multiple of some fixed width, where the multiplier is determined by heuristics (such as various implementations of <code>wcwidth</code> and <code>wcswidth</code>). These heuristics are created to help programmers handle more characters, in particular CJK characters, without dramatically changing the code. They however do not solve the core problem (that is, visual width is fundamentally ill-defined) and they often could not handle tricky cases such as emoji sequences. Many compilers are at this level.</li></ul><ul><li><b>Level 2a:</b> Stability under very limited assumptions on which characters should have the same widths. For example, if a Unicode art only assumes Unicode box-drawing characters are of the same visual width (which is the case in all conceivable situations), then its stability is at this level. However, the phrase &quot;very limited&quot; is somewhat subjective, and thus we present a more precise version below.</li></ul><ul><li><p><b>Level 2b:</b> Stability under only theses assumptions:</p><ul><li><a href="https://util.unicode.org/UnicodeJsps/list-unicodeset.jsp?a=[:ea=F:]|[:ea=W:]">All the characters whose East Asian width is either Fullwidth or Wide</a> have the same width (as long as they are not used as part of an emoji sequence).</li><li><a href="https://util.unicode.org/UnicodeJsps/list-unicodeset.jsp?a=[:ea=H:]|[:ea=Na:]">All the characters whose East Asian width is either Halfwidth or Narrow</a> have the same width. Note this class includes ASCII printable characters.</li><li><a href="https://util.unicode.org/UnicodeJsps/list-unicodeset.jsp?a=[:Block=Box_Drawing:]">All the box-drawing characters</a> have the same width.</li></ul><p>This is making explicit what Level 2a means; however, we might update the details of Level 2b later to better match our understanding of Level 2a. Collectively, Levels 2a and 2b are called &quot;Level 2&quot;.</p></li></ul><ul><li><b>Level 3 (the most stable):</b> Stability under only one assumption that the same grapheme clusters will have the same width regardless of the context. This means that the Unicode art will remain visually pleasing in almost all situations. It can even be rendered with a variable-width font.</li></ul><p>Unlike most implementations, which are at Level 1, our <a href="Asai/Tty/index.html">terminal backend</a> strives to achieve Level 2. That means we must not make any assumption about the visual width of end users' code and must abandon the idea of <i>column numbers.</i> As a result, our terminal backend <i>never</i> shows column numbers and we consider that as a significant improvement. We believe Level 3 is too restricted for compiler diagnostics because we cannot show line numbers along with the end users' code. (We cannot assume the numbers &quot;10&quot; and &quot;99&quot; will have the same visual width at Level 3.)</p><p>Note: a fixed-width Unicode font is often technically duospaced, not monospaced, because many CJK characters would occupy a double character width. Thus, we do not use the terminology &quot;monospaced&quot;.</p><h2 id="raw-bytes-as-positions"><a href="#raw-bytes-as-positions" class="anchor"></a>Raw Bytes as Positions</h2><p>All positions are <b>byte-oriented.</b> Here are some popular alternatives which we think are worse:</p><ol><li><b>Unicode characters</b> (which may not match user-perceived characters).</li><li><b>Unicode grapheme clusters</b> or user-perceived characters. See the <a href="https://erratique.ch/software/uuseg">uuseg</a> library.</li><li><b>Column numbers,</b> the visual width of a string in display.</li></ol><p>It takes at least linear time to count Unicode characters (except when UTF-32 is in use) or Unicode grapheme clusters from raw bytes. Column numbers are even worse because they are not well-defined, as elaborated in the previous section. The only well-defined unit that also admits an efficient implementation is <i>raw byte</i>.</p><p>Note: Our LSP prototype does not handle <code>positionEncoding</code> yet, and thus an LSP client may be confused about the ranges returned by this library. A proper LSP implementation should negotiate with the client to determine how to represent column positions (and our current prototype does not). On the other hand, it can be tricky to negotiate with the client to use <i>raw bytes</i> because there is not an <a href="https://microsoft.github.io/language-server-protocol/specifications/lsp/3.17/specification/#positionEncodingKind">official predefined encoding scheme</a> for raw bytes yet.</p></div></body></html>
\ No newline at end of file
+<html xmlns="http://www.w3.org/1999/xhtml"><head><title>design (asai.design)</title><link rel="stylesheet" href="../odoc.support/odoc.css"/><meta charset="utf-8"/><meta name="generator" content="odoc 2.2.1"/><meta name="viewport" content="width=device-width,initial-scale=1.0"/><script src="../odoc.support/highlight.pack.js"></script><script>hljs.initHighlightingOnLoad();</script></head><body class="odoc"><nav class="odoc-nav"><a href="index.html">Up</a> – <a href="index.html">asai</a> &#x00BB; design</nav><header class="odoc-preamble"><h1 id="design-principles"><a href="#design-principles" class="anchor"></a>Design Principles</h1></header><nav class="odoc-toc"><ul><li><a href="#five-independent-parameters-of-a-diagnostic">Five Independent Parameters of a Diagnostic</a></li><li><a href="#compositionality:-using-libraries-that-use-asai">Compositionality: Using Libraries that Use <code>asai</code></a></li><li><a href="#stability-of-unicode-art:-no-column-numbers!">Stability of Unicode Art: No Column Numbers!</a></li><li><a href="#raw-bytes-as-positions">Raw Bytes as Positions</a></li></ul></nav><div class="odoc-content"><h2 id="five-independent-parameters-of-a-diagnostic"><a href="#five-independent-parameters-of-a-diagnostic" class="anchor"></a>Five Independent Parameters of a Diagnostic</h2><p>In addition to the main message, the API should allow implementers to easily specify the following five factors of a diagnostic, and it should be possible to specify them independently.</p><ol><li><b>Whether the program terminates after sending the message.</b> This is indicated by the choice between <a href="Asai/Logger/module-type-S/index.html#val-emit">emit</a> (for non-fatal messages) and <a href="Asai/Logger/module-type-S/index.html#val-fatal">fatal</a> (for fatal ones).</li><li><b>A message code with a succinct Google-able representation,</b> for example <code>V0003</code>. A succinct representation is useful for an end user to report a bug or ask for help.</li><li><b>How seriously end users should take the message.</b> Is it a warning, an error, or just a hint? See the type <a href="Asai/Diagnostic/index.html#type-severity">severity</a> for available classifications. In practice, messages with the same message code tend to have the same severity, and thus our API requires an implementer to specify a default severity for each message code. While this seems to violate the independence constraint, our API allows overriding the default severity at each call of <a href="Asai/Logger/module-type-S/index.html#val-emit">emit</a> or <a href="Asai/Logger/module-type-S/index.html#val-fatal">fatal</a>.</li><li><b>A stack backtrace.</b> There should be a straightforward way to push new stack frames. Our implementation is <a href="Asai/Logger/module-type-S/index.html#val-trace">trace</a>.</li><li><b>Additional messages.</b> It should be possible to attach any numbers of additional related messages. Currently, <a href="Asai/Logger/module-type-S/index.html#val-emit">emit</a> and <a href="Asai/Logger/module-type-S/index.html#val-fatal">fatal</a> are taking .</li></ol><h2 id="compositionality:-using-libraries-that-use-asai"><a href="#compositionality:-using-libraries-that-use-asai" class="anchor"></a>Compositionality: Using Libraries that Use <code>asai</code></h2><p>It should be easy for an application to use other libraries who themselves use <code>asai</code>. Our current implementation allows an application to <a href="Asai/Logger/module-type-S/index.html#val-adopt">adopt</a> messages from a library.</p><h2 id="stability-of-unicode-art:-no-column-numbers!"><a href="#stability-of-unicode-art:-no-column-numbers!" class="anchor"></a>Stability of Unicode Art: No Column Numbers!</h2><p>There is a long history of using ASCII printable characters and ANSI escape sequences, and recently also non-ASCII Unicode characters, to draw pictures on terminals. To display compiler diagnostics, this technique has been used to assemble line numbers, code from end users, code highlighting, and other pieces of information in a visually pleasing way. Non-ASCII Unicode characters (from implementers or from end users) greatly expand the vocabulary of ASCII art, and we will call the new art form <i>Unicode art</i> to signify the use of non-ASCII characters.</p><p>These non-ASCII Unicode characters impose new challenges as their visual widths are unpredictable without knowing the exact terminal (or terminal emulator), the exact font, etc. Unicode emoji sequences might be one of the most challenging cases: a pirate flag (🏴‍☠️) may be shown as a single emoji flag on supported platforms but as a sequence with a black flag (🏴) and a skull (☠️) on other platforms. This means the visual width of the pirate flag is unpredictable. (See <a href="https://unicode.org/reports/tr51/#Display">UTS #51 Section 2.2</a>.) The rainbow flag (🏳️‍🌈), skin tones, and many other emoji sequences have the same issue. Other less chaotic but still challenging cases include <a href="https://util.unicode.org/UnicodeJsps/list-unicodeset.jsp?a=[:ea=A:]">characters whose East Asian width is Ambiguous.</a> These challenges bear some similarity with the unpredictability of the visual width of a horizontal tab, but in a much wilder way.</p><p><i>Note: &quot;Unicode characters&quot; are not really defined in the Unicode standard, and here they mean <a href="https://unicode.org/glossary/#unicode_scalar_value">Unicode scalar values</a>, that is, <a href="https://unicode.org/glossary/#code_point">all Unicode code points</a> except the <a href="https://unicode.org/glossary/#surrogate_code_point">surrogate code points</a> for UTF-16 to represent all scalar values. Although the word &quot;character&quot; has many incompatible meanings and usages, we decided to call scalar values &quot;Unicode characters&quot; anyway because (1) most people are not familiar with the official term &quot;scalar values&quot; and (2) scalar values are the <i>only</i> stable primitive unit one can work with in a programming language.</i></p><p>It is thus wise to think twice before using emoji sequences and other tricky characters in Unicode art. To quantify the degree to which a Unicode art can remain visually pleasing on different platforms, we specify the following four levels of stability. Note that if implementers decide to integrate content from end users into their Unicode art, the end users should have the freedom to include arbitrary emoji sequences and tricky characters in their content. The final Unicode art must remain visually pleasing as defined by the stability levels for any reasonable user content.</p><ul><li><b>Level 0 (the least stable):</b> Stability under the assumption that every Unicode character occupies exactly the same visual width. Thankfully, programs meeting only this level are mostly considered outdated.</li></ul><ul><li><b>Level 1:</b> Stability under the assumption each Unicode string visually occupies a multiple of some fixed width, where the multiplier is determined by heuristics (such as various implementations of <code>wcwidth</code> and <code>wcswidth</code>). These heuristics are created to help programmers handle more characters, in particular CJK characters, without dramatically changing the code. They however do not solve the core problem (that is, visual width is fundamentally ill-defined) and they often could not handle tricky cases such as emoji sequences. Many compilers are at this level.</li></ul><ul><li><b>Level 2a:</b> Stability under very limited assumptions on which characters should have the same widths. For example, if a Unicode art only assumes Unicode box-drawing characters are of the same visual width (which is the case in all conceivable situations), then its stability is at this level. However, the phrase &quot;very limited&quot; is somewhat subjective, and thus we present a more precise version below.</li></ul><ul><li><p><b>Level 2b:</b> Stability under only theses assumptions:</p><ul><li><a href="https://util.unicode.org/UnicodeJsps/list-unicodeset.jsp?a=[:ea=H:]|[:ea=Na:]">All characters whose East Asian width is either Halfwidth or Narrow</a> have the same visual width. This class includes all ASCII printable characters and thus an ASCII art very likely satisfies Level 2b.</li><li><a href="https://util.unicode.org/UnicodeJsps/list-unicodeset.jsp?a=[:ea=F:]|[:ea=W:]">All characters whose East Asian width is either Fullwidth or Wide</a> have the same visual width (as long as they are not used as part of an emoji sequence). Note that we do not assume the visual width of these characters is exactly double the visual width of the characters in the previous class.</li><li><a href="https://util.unicode.org/UnicodeJsps/list-unicodeset.jsp?a=[:Block=Box_Drawing:]">All box-drawing characters</a> have the same visual width.</li><li>Equivalent <a href="https://unicode.org/glossary/#extended_grapheme_cluster">(extended) grapheme clusters</a> have the same visual width (regardless of the context). Note that an application can and maybe should customize grapheme clusters, but we believe it is okay to leave out the detail here.</li></ul><p>Level 2b is making explicit what Level 2a means; we might update the details of Level 2b later to better match our understanding of Level 2a. Collectively, Levels 2a and 2b are called &quot;Level 2&quot;.</p></li></ul><ul><li><b>Level 3 (the most stable):</b> Stability under only one assumption that equivalent (extended) grapheme clusters have the same visual width (the last assumption of Level 2b). This means that the Unicode art will remain visually pleasing in almost all situations. It can even be rendered with a variable-width font.</li></ul><p>Unlike most implementations, which are at Level 1, our <a href="Asai/Tty/index.html">terminal backend</a> strives to achieve Level 2. That means we must not make any assumption about the visual width of end users' code and must abandon the idea of <i>column numbers.</i> As a result, our terminal backend <i>never</i> uses column numbers and we consider that as a significant improvement. We believe Level 3 is too restricted for compiler diagnostics because we cannot show line numbers along with the end users' code. (We cannot assume the numbers &quot;10&quot; and &quot;99&quot; will have the same visual width at Level 3.)</p><p><i>Note: a fixed-width font with enough <a href="https://unicode.org/glossary/#glyph">glyphs</a> that covers many Unicode characters is often technically duospaced, not monospaced, because many CJK characters would occupy a double character visual width. Thus, we do not use the terminology &quot;monospaced&quot;.</i></p><h2 id="raw-bytes-as-positions"><a href="#raw-bytes-as-positions" class="anchor"></a>Raw Bytes as Positions</h2><p>All positions should be <b>byte-oriented.</b> We believe other popular alternatives proposals are worse:</p><ol><li><b>Unicode characters</b> (Unicode scalar values): This is a reasonable and technically well-defined choice. The problem is that it may take linear time to count the number of characters from raw bytes without a clever data structure (unless we are using <a href="https://unicode.org/glossary/#UTF_32">UTF-32</a>), and they often do not match what end users perceive as &quot;characters&quot;. In other words, it takes more time to compute and may invite misconceptions about Unicode characters.</li><li><b>Code units used in UTF-16</b>: This is somewhat similar to Unicode characters, but with quirks from UTF-16: a Unicode scalar value above <code>U+FFFF</code> (such as <code>😎</code>) will require two code units to form a <a href="https://unicode.org/glossary/#surrogate_pair">surrogate pair</a>. This scheme was unfortunately <a href="https://microsoft.github.io/language-server-protocol/specifications/lsp/3.17/specification/#textDocuments">chosen by the Language Service Protocol (LSP) as the default unit,</a> and until LSP version 3.17 was the <i>only</i> choice. The developers of the protocol made this choice probably because Visual Studio Code was written in JavaScript (and TypeScript), whose strings use UTF-16 encoding.</li><li><b>Unicode (extended) grapheme clusters</b> or user-perceived characters. The notion of grapheme clusters can help segment a Unicode text for end users to edit or select part of it in an &quot;intuitive&quot; way. It is not trivial to implement the segmentation algorithm (see the OCaml library <a href="https://erratique.ch/software/uuseg/doc/">uuseg</a>) and the default rules can (and maybe should) be overriden for each application. The complexity and external dependency of grapheme clusters make it an unreliable unit for specifying positions. It also takes at least linear time to count the number of grapheme clusters from raw bytes.</li><li><b>Column numbers,</b> the visual width of a string in display. As analyzed in the above section, this is the most ill-defined unit of all, and a heuristic that can give passable results in most cases still takes linear time.</li></ol><p><i><b>Know Bug:</b> Our <a href="Asai/Lsp/index.html">LSP prototype</a> does not handle <code>positionEncoding</code> yet, and because the default unit in LSP is based on UTF-16 (see above), an LSP client may be confused about the byte-oriented ranges returned by this library. A proper LSP implementation should negotiate with the client to determine how to represent column positions (and our current prototype does not). On the other hand, it can be tricky to negotiate with the client to use <i>raw bytes</i> because there is not an <a href="https://microsoft.github.io/language-server-protocol/specifications/lsp/3.17/specification/#positionEncodingKind">official predefined encoding scheme</a> for raw bytes yet.</i></p></div></body></html>
\ No newline at end of file