Add Use Verbose Flag To Get More Diff as a Python TIL

2026-03-06 16:08:44 +00:00 · 2026-03-06 09:32:05 -06:00
parent e2603f1445
commit f2a6fddba8
2 changed files with 163 additions and 1 deletions
--- a/README.md
+++ b/README.md
@@ -10,7 +10,7 @@ working across different projects via [VisualMode](https://www.visualmode.dev/).

 For a steady stream of TILs, [sign up for my newsletter](https://visualmode.kit.com/newsletter).

-_1752 TILs and counting..._
+_1753 TILs and counting..._

 See some of the other learning resources I work on:

@@ -1055,6 +1055,7 @@ If you've learned something here, support my efforts writing daily TILs by
 - [Store And Access Immutable Data In A Tuple](python/store-and-access-immutable-data-in-a-tuple.md)
 - [Test A Function With Pytest](python/test-a-function-with-pytest.md)
 - [Use pipx To Install End User Apps](python/use-pipx-to-install-end-user-apps.md)
+- [Use Verbose Flag To Get More Diff](python/use-verbose-flag-to-get-more-diff.md)

 ### Rails

--- a/python/use-verbose-flag-to-get-more-diff.md
+++ b/python/use-verbose-flag-to-get-more-diff.md
@@ -0,0 +1,161 @@
+# Use Verbose Flag To Get More Diff
+
+Here is the output of running some `pytest` unit tests. A couple of the tests
+pass, which produces little output. But I get a big block of details for the one
+failing test. In this case the failure is an assertion between two lists that
+don't match.
+
+```bash
+❯ uv run pytest
+========================================== test session starts ==========================================
+platform darwin -- Python 3.12.12, pytest-9.0.2, pluggy-1.6.0
+rootdir: /Users/lastword/dev/misc/build-an-llm
+configfile: pyproject.toml
+collected 3 items
+
+tests/chapter_02/test_bpe_tokenizer.py .F.                                                        [100%]
+
+=============================================== FAILURES ================================================
+_____________________________________ test_merge_with_byte_sequence _____________________________________
+
+    def test_merge_with_byte_sequence():
+        token_ids = [1, 2, 3, 4, 5, 2, 3, 1, 2, 3, 4, 1]
+        merged_tokens = BPETokenizer._merge(token_ids, [2, 3, 4], 256)
+        # assert merged_tokens == [1, 256, 5, 2, 3, 1, 256, 1]
+>       assert merged_tokens == [1, 256, 5, 4, 5, 1, 256, 1]
+E       assert [1, 256, 5, 2, 3, 1, ...] == [1, 256, 5, 4, 5, 1, ...]
+E
+E         At index 3 diff: 2 != 4
+E         Use -v to get more diff
+
+tests/chapter_02/test_bpe_tokenizer.py:13: AssertionError
+======================================== short test summary info ========================================
+FAILED tests/chapter_02/test_bpe_tokenizer.py::test_merge_with_byte_sequence - assert [1, 256, 5, 2, 3, 1, ...] == [1, 256, 5, 4, 5, 1, ...]
+====================================== 1 failed, 2 passed in 0.02s ======================================
+```
+
+The lists are too long to fully display in the failure output. `pytest` is able
+to tell us two useful things though. First, it mentions that the first
+discrepancy in the lists is at index `3` where `2 != 4`. Second, it says `Use -v
+to get more diff`.
+
+Let's try rerunning the tests with `-v`.
+
+```bash
+❯ uv run pytest -v
+========================================== test session starts ==========================================
+platform darwin -- Python 3.12.12, pytest-9.0.2, pluggy-1.6.0 -- /Users/lastword/dev/misc/build-an-llm/.venv/bin/python3
+cachedir: .pytest_cache
+rootdir: /Users/lastword/dev/misc/build-an-llm
+configfile: pyproject.toml
+collected 3 items
+
+tests/chapter_02/test_bpe_tokenizer.py::test_merge_with_byte_pair PASSED                          [ 33%]
+tests/chapter_02/test_bpe_tokenizer.py::test_merge_with_byte_sequence FAILED                      [ 66%]
+tests/chapter_02/test_bpe_tokenizer.py::test_subsequence_at_index PASSED                          [100%]
+
+=============================================== FAILURES ================================================
+_____________________________________ test_merge_with_byte_sequence _____________________________________
+
+    def test_merge_with_byte_sequence():
+        token_ids = [1, 2, 3, 4, 5, 2, 3, 1, 2, 3, 4, 1]
+        merged_tokens = BPETokenizer._merge(token_ids, [2, 3, 4], 256)
+        # assert merged_tokens == [1, 256, 5, 2, 3, 1, 256, 1]
+>       assert merged_tokens == [1, 256, 5, 4, 5, 1, 256, 1]
+E       AssertionError: assert [1, 256, 5, 2, 3, 1, ...] == [1, 256, 5, 4, 5, 1, ...]
+E
+E         At index 3 diff: 2 != 4
+E
+E         Full diff:
+E           [
+E               1,
+E               256,...
+E
+E         ...Full output truncated (13 lines hidden), use '-vv' to show
+
+tests/chapter_02/test_bpe_tokenizer.py:13: AssertionError
+======================================== short test summary info ========================================
+FAILED tests/chapter_02/test_bpe_tokenizer.py::test_merge_with_byte_sequence - AssertionError: assert [1, 256, 5, 2, 3, 1, ...] == [1, 256, 5, 4, 5, 1, ...]
+====================================== 1 failed, 2 passed in 0.02s ======================================
+```
+
+That was sort of a tease because it starts to display a "Full diff", but that
+gets quickly truncated. `pytest` then tells us that we can `use '-vv' to show`
+the full diff.
+
+```bash
+❯ uv run pytest -vv
+========================================== test session starts ==========================================
+platform darwin -- Python 3.12.12, pytest-9.0.2, pluggy-1.6.0 -- /Users/lastword/dev/misc/build-an-llm/.venv/bin/python3
+cachedir: .pytest_cache
+rootdir: /Users/lastword/dev/misc/build-an-llm
+configfile: pyproject.toml
+collected 3 items
+
+tests/chapter_02/test_bpe_tokenizer.py::test_merge_with_byte_pair PASSED                          [ 33%]
+tests/chapter_02/test_bpe_tokenizer.py::test_merge_with_byte_sequence FAILED                      [ 66%]
+tests/chapter_02/test_bpe_tokenizer.py::test_subsequence_at_index PASSED                          [100%]
+
+=============================================== FAILURES ================================================
+_____________________________________ test_merge_with_byte_sequence _____________________________________
+
+    def test_merge_with_byte_sequence():
+        token_ids = [1, 2, 3, 4, 5, 2, 3, 1, 2, 3, 4, 1]
+        merged_tokens = BPETokenizer._merge(token_ids, [2, 3, 4], 256)
+        # assert merged_tokens == [1, 256, 5, 2, 3, 1, 256, 1]
+>       assert merged_tokens == [1, 256, 5, 4, 5, 1, 256, 1]
+E       assert [1, 256, 5, 2, 3, 1, 256, 1] == [1, 256, 5, 4, 5, 1, 256, 1]
+E
+E         At index 3 diff: 2 != 4
+E
+E         Full diff:
+E           [
+E               1,
+E               256,
+E               5,
+E         -     4,
+E         ?     ^
+E         +     2,
+E         ?     ^
+E         -     5,
+E         ?     ^
+E         +     3,
+E         ?     ^
+E               1,
+E               256,
+E               1,
+E           ]
+
+tests/chapter_02/test_bpe_tokenizer.py:13: AssertionError
+======================================== short test summary info ========================================
+FAILED tests/chapter_02/test_bpe_tokenizer.py::test_merge_with_byte_sequence - assert [1, 256, 5, 2, 3, 1, 256, 1] == [1, 256, 5, 4, 5, 1, 256, 1]
+
+  At index 3 diff: 2 != 4
+
+  Full diff:
+    [
+        1,
+        256,
+        5,
+  -     4,
+  ?     ^
+  +     2,
+  ?     ^
+  -     5,
+  ?     ^
+  +     3,
+  ?     ^
+        1,
+        256,
+        1,
+    ]
+====================================== 1 failed, 2 passed in 0.02s ======================================
+```
+
+This is a lot more output to look at. What we can perhaps see more clearly now
+is that the lists match up until there is a mismatch between `2` and `4` at the
+third index. And then right after that is another mismatch between `3` and `5`.
+
+This kind of output can only scale so much, so use it when it works and when the
+diff view starts to fall short, rework the assertions to get more readable and
+actionable test output.