From f68479a4666cf0e1325c394898650f641877dd7b Mon Sep 17 00:00:00 2001
From: Flavio Soibelmann Glock <fglock@gmail.com>
Date: Fri, 10 Apr 2026 20:53:49 +0200
Subject: [PATCH 1/2] docs: update documentation to reflect DESTROY/weaken
 support

DESTROY and weaken/isweak/unweaken are now implemented (PR #464 merged).
Update all documentation that previously listed these as unsupported:

- changelog.md: move from WIP to released features
- feature-matrix.md: mark DESTROY as supported, remove from unsupported list
- roadmap.md: mark DESTROY and weak references as completed
- relation-perlito.md: update JVM limitations paragraph
- AGENTS.md: remove "on feature/destroy-weaken branch" qualifier
- blog post README: update Moo results to 841/841

Generated with [Devin](https://cli.devin.ai/docs)

Co-Authored-By: Devin <158243242+devin-ai-integration[bot]@users.noreply.github.com>
---
 AGENTS.md                                            | 4 ++--
 docs/about/changelog.md                              | 3 ++-
 docs/about/relation-perlito.md                       | 2 +-
 docs/about/roadmap.md                                | 4 ++--
 docs/reference/feature-matrix.md                     | 8 ++------
 src/main/java/org/perlonjava/core/Configuration.java | 4 ++--
 6 files changed, 11 insertions(+), 14 deletions(-)

diff --git a/AGENTS.md b/AGENTS.md
index 4a91b0c90..e5b7e513c 100644
--- a/AGENTS.md
+++ b/AGENTS.md
@@ -64,8 +64,8 @@ Example format at the end of a design doc:
 
 | Feature | Status |
 |---------|--------|
-| `weaken` / `isweak` | Implemented on the `feature/destroy-weaken` branch. Uses cooperative reference counting on top of JVM GC. See `dev/architecture/weaken-destroy.md` for details. |
-| `DESTROY` | Implemented on the `feature/destroy-weaken` branch. Fires deterministically for tracked objects (blessed into a class with DESTROY). See `dev/architecture/weaken-destroy.md`. |
+| `weaken` / `isweak` | Implemented. Uses cooperative reference counting on top of JVM GC. See `dev/architecture/weaken-destroy.md` for details. |
+| `DESTROY` | Implemented. Fires deterministically for tracked objects (blessed into a class with DESTROY). See `dev/architecture/weaken-destroy.md`. |
 | `Scalar::Util::readonly` | Works for compile-time constants (`RuntimeScalarReadOnly` instances). Does not yet detect variables made readonly at runtime via `Internals::SvREADONLY` (those copy type/value into a plain `RuntimeScalar` without replacing the object). |
 
 ### Unimplemented Features
diff --git a/docs/about/changelog.md b/docs/about/changelog.md
index 491d31830..398c2ea36 100644
--- a/docs/about/changelog.md
+++ b/docs/about/changelog.md
@@ -36,10 +36,11 @@ Release history of PerlOnJava. See [Roadmap](roadmap.md) for future plans.
 
 - Add `attributes` pragma with `MODIFY_*_ATTRIBUTES`/`FETCH_*_ATTRIBUTES` callbacks for subroutines and variables.
 - Add modules: `Filter::Simple` with `FILTER` and `FILTER_ONLY` support.
+- Add `DESTROY` method support with cooperative reference counting on blessed objects, cascading destruction, closure capture tracking, and global destruction phase.
+- Add `Scalar::Util` functions: `weaken`, `isweak`, `unweaken`.
 
 - Work in Progress
   - [Multiplicity — per-runtime isolation for concurrent Perl interpreters](https://github.com/fglock/PerlOnJava/pull/480): `PerlRuntime` with `ThreadLocal`-based isolation; all mutable state (globals, I/O, regex, caller stack, method caches) moved to per-runtime instances; 122/126 concurrent interpreter tests pass; pending closure/method dispatch optimization
-  - [DESTROY, weaken/isweak/unweaken with refCount tracking](https://github.com/fglock/PerlOnJava/pull/464): explicit reference counting on blessed objects; cascading destruction, closure capture tracking, global destruction, AUTOLOAD-based DESTROY; 196/196 destroy/weaken tests and 841/841 Moo tests pass; pending performance optimization
   - PerlIO
     - `get_layers`
   - Term::ReadLine
diff --git a/docs/about/relation-perlito.md b/docs/about/relation-perlito.md
index cb5d28235..892b3d48e 100644
--- a/docs/about/relation-perlito.md
+++ b/docs/about/relation-perlito.md
@@ -4,5 +4,5 @@ The key difference between PerlOnJava and Perlito (https://github.com/fglock/Per
 
 From an architectural standpoint, PerlOnJava is more mature. However, Perlito is currently more feature-rich due to its longer development history. PerlOnJava, however, doesn't support JavaScript like Perlito does.
 
-Both compilers share certain limitations imposed by the JVM, such as the lack of support for DESTROY, XS modules, and auto-closing filehandles, among others.
+Both compilers share certain limitations imposed by the JVM, such as the lack of support for XS modules and auto-closing filehandles, among others. PerlOnJava implements `DESTROY` via cooperative reference counting.
 
diff --git a/docs/about/roadmap.md b/docs/about/roadmap.md
index a6639ab60..7fedbf878 100644
--- a/docs/about/roadmap.md
+++ b/docs/about/roadmap.md
@@ -75,8 +75,8 @@ Work currently in progress:
 
 ### Core Language Gaps
 
-- **`DESTROY` Support** — Implement scope-based destructors for predictable cleanup at block boundaries, with GC-based fallback for edge cases. Critical for `File::Temp`, `SelectSaver`, tied variable cleanup, and many CPAN modules. See `dev/design/object_lifecycle.md`.
-- **Weak References** — Implement `Scalar::Util::weaken`/`isweak` with minimal memory overhead (external WeakHashMap registry or sentinel value approach). Required by Moo, Moose, and many OO frameworks. See `dev/design/object_lifecycle.md`.
+- ~~**`DESTROY` Support**~~ — Implemented with cooperative reference counting. Supports cascading destruction, closure capture tracking, and global destruction phase.
+- ~~**Weak References**~~ — Implemented: `Scalar::Util::weaken`/`isweak`/`unweaken` with external WeakRefRegistry.
 - **Taint Mode (`-T`)** — Track external data provenance using a `TAINTED` wrapper type (no extra storage for untainted scalars). Required for security-sensitive Perl applications. See `dev/design/TAINT_MODE.md`.
 - **Dynamically-Scoped Regex Variables** — `$1`, `$2`, etc. should be localized per regex match in the dynamic scope.
 
diff --git a/docs/reference/feature-matrix.md b/docs/reference/feature-matrix.md
index 02288baf6..e8e7c819e 100644
--- a/docs/reference/feature-matrix.md
+++ b/docs/reference/feature-matrix.md
@@ -57,7 +57,6 @@ PerlOnJava implements most core Perl features with some key differences:
 ❌ Not Supported:
 - XS modules and C integration
 - Threading
-- DESTROY blocks
 
 ---
 
@@ -245,7 +244,7 @@ my @copy = @{$z};         # ERROR
 - ✅  **Field inheritance**: Parent class fields are inherited.
 - 🟡  **`__CLASS__`**: Compile-time evaluation only, not runtime.
 - 🟡  **Argument validation**: Limited by operator implementation issues.
-- ❌  **`DESTROY`**: Destructor blocks not yet implemented.
+- ✅  **`DESTROY`**: Destructor methods with cooperative reference counting.
 
 ---
 
@@ -784,10 +783,7 @@ The DBI module provides seamless integration with JDBC drivers:
 ## Features Incompatible with JVM
 
 - ❌  **`fork` operator**: `fork` is not implemented. Calling `fork` will always fail and return `undef`.
-- ❌  **`DESTROY`**: Handling of object destruction may be incompatible with JVM garbage collection.
-  - For more details see: `dev/design/object_lifecycle.md`.
-  - Some modules that depend on `DESTROY`: `SelectSaver`, `File::Temp`.
-  - `DESTROY` method in tied variables is also not implemented. DESTROY is called when the variable is `untie`.
+- ✅  **`DESTROY`**: Implemented with cooperative reference counting on top of JVM GC. Supports cascading destruction, closure capture tracking, `weaken`/`isweak`/`unweaken`, and global destruction phase.
 - ❌  **Perl `XS` code**: XS code interfacing with C is not supported on the JVM.
 - ❌  **Auto-close files**: File auto-close depends on handling of object destruction, may be incompatible with JVM garbage collection. All files are closed before the program ends.
 - ❌  **Keywords related to the control flow of the Perl program**: `dump` operator.
diff --git a/src/main/java/org/perlonjava/core/Configuration.java b/src/main/java/org/perlonjava/core/Configuration.java
index 2befd81f4..4c66bcad9 100644
--- a/src/main/java/org/perlonjava/core/Configuration.java
+++ b/src/main/java/org/perlonjava/core/Configuration.java
@@ -33,7 +33,7 @@ public final class Configuration {
      * Automatically populated by Gradle/Maven during build.
      * DO NOT EDIT MANUALLY - this value is replaced at build time.
      */
-    public static final String gitCommitId = "ab863e968";
+    public static final String gitCommitId = "97ec12b8b";
 
     /**
      * Git commit date of the build (ISO format: YYYY-MM-DD).
@@ -48,7 +48,7 @@ public final class Configuration {
      * Parsed by App::perlbrew and other tools via: perl -V | grep "Compiled at"
      * DO NOT EDIT MANUALLY - this value is replaced at build time.
      */
-    public static final String buildTimestamp = "Apr 10 2026 18:40:15";
+    public static final String buildTimestamp = "Apr 10 2026 20:53:01";
 
     // Prevent instantiation
     private Configuration() {

From ffc46612426ef15b02e036f947bcb9beee3312ca Mon Sep 17 00:00:00 2001
From: Flavio Soibelmann Glock <fglock@gmail.com>
Date: Fri, 10 Apr 2026 21:44:42 +0200
Subject: [PATCH 2/2] docs: audit and fix architecture docs against actual
 codebase

- Verify all 6 architecture docs + README against source code
- Fix wrong file paths in control-flow.md (packages reorganized)
- Add missing RETURN control flow type and non-local map/grep return
- Remove all stale TABLESWITCH references (actual: conditional branches)
- Rewrite large-code-refactoring.md: was describing non-existent retry
  architecture; now documents actual two-tier strategy (proactive block
  refactoring + interpreter fallback)
- Fix lexical-pragmas.md: wrong stack types, non-existent StrictOptions
  class, missing warning stacks and CompilerFlagNode fields
- Fix dynamic-scope.md: wrong DeferBlock types, missing implementors,
  missing blessId/reset-to-undef in save
- Fix block-dispatcher-optimization.md: wrong per-site bytecode size,
  missing Dereference.java as implementation file
- Fix weaken-destroy.md: wrong pop/shift deferred decrement claim,
  wrong code ref birth-tracking path, wrong type check order
- Update inline-cache.md and method-call-optimization.md status:
  inline caching IS implemented in RuntimeCode.java
- Move unimplemented design docs to dev/design/
- Add dev/architecture/ to make check-links target
- Fix 2 broken doc links (warnings-scope.md -> lexical-warnings.md)
- Fix 3 stale Java code comments (WeakRefRegistry, RuntimeScalar,
  DestroyDispatch)
- Add RuntimeList, org.perlonjava.app to README overview
- All links clean (368 checked, 0 errors), make passes

Generated with [Devin](https://cli.devin.ai/docs)

Co-Authored-By: Devin <158243242+devin-ai-integration[bot]@users.noreply.github.com>
---
 Makefile                                      |   2 +-
 dev/architecture/README.md                    |  15 +-
 .../block-dispatcher-optimization.md          |  24 +-
 dev/architecture/control-flow.md              | 194 +++----
 dev/architecture/dynamic-scope.md             |  39 +-
 dev/architecture/large-code-refactoring.md    | 228 +++-----
 dev/architecture/lexical-pragmas.md           |  51 +-
 dev/architecture/weaken-destroy.md            | 517 +-----------------
 dev/{architecture => design}/inline-cache.md  |   2 +
 .../method-call-optimization.md               |   2 +
 dev/prompts/architecture-docs-audit.md        | 193 +++++++
 docs/reference/architecture.md                |  13 +-
 .../org/perlonjava/core/Configuration.java    |   4 +-
 .../runtime/runtimetypes/DestroyDispatch.java |   5 -
 .../runtime/runtimetypes/RuntimeScalar.java   |   1 -
 .../runtime/runtimetypes/WeakRefRegistry.java |  23 +-
 16 files changed, 521 insertions(+), 792 deletions(-)
 rename dev/{architecture => design}/inline-cache.md (97%)
 rename dev/{architecture => design}/method-call-optimization.md (97%)
 create mode 100644 dev/prompts/architecture-docs-audit.md

diff --git a/Makefile b/Makefile
index 304456019..8701b1c37 100644
--- a/Makefile
+++ b/Makefile
@@ -196,5 +196,5 @@ sbom-clean:
 check-links:
 	@command -v lychee >/dev/null 2>&1 || { echo "Error: lychee not found. Install with: brew install lychee"; exit 1; }
 	@echo "Checking documentation links..."
-	lychee --offline *.md docs/ dev/design/
+	lychee --offline *.md docs/ dev/design/ dev/architecture/
 
diff --git a/dev/architecture/README.md b/dev/architecture/README.md
index 362c72b32..ba9f0db65 100644
--- a/dev/architecture/README.md
+++ b/dev/architecture/README.md
@@ -16,10 +16,14 @@ PerlOnJava is a Perl 5 implementation that compiles Perl source code to JVM byte
    - Bytecode interpreter: Interprets a subset of operations for eval STRING
 
 3. **Runtime** (`org.perlonjava.runtime`)
-   - Runtime types: RuntimeScalar, RuntimeArray, RuntimeHash, RuntimeCode
+   - Runtime types: RuntimeScalar, RuntimeArray, RuntimeHash, RuntimeList, RuntimeCode, RuntimeGlob, RuntimeIO
    - Operators: Arithmetic, string, comparison, I/O
    - Perl modules: Built-in implementations of core modules
 
+4. **Application** (`org.perlonjava.app`)
+   - CLI entry point (Main, ArgumentParser, CompilerOptions)
+   - JSR-223 ScriptEngine integration
+
 ## Key Architecture Documents
 
 | Document | Description |
@@ -27,6 +31,11 @@ PerlOnJava is a Perl 5 implementation that compiles Perl source code to JVM byte
 | [dynamic-scope.md](dynamic-scope.md) | Dynamic scoping via `local` and DynamicVariableManager |
 | [weaken-destroy.md](weaken-destroy.md) | Cooperative reference counting, DESTROY, and weak references |
 | [lexical-pragmas.md](lexical-pragmas.md) | Lexical warnings, strict, and features |
+| [control-flow.md](control-flow.md) | Control flow implementation (die/eval, last/next/redo, block dispatchers) |
+| [block-dispatcher-optimization.md](block-dispatcher-optimization.md) | Block-level shared dispatchers for control flow |
+| [large-code-refactoring.md](large-code-refactoring.md) | Large code handling: proactive block refactoring and interpreter fallback |
+| [../design/inline-cache.md](../design/inline-cache.md) | Inline caching design (runtime global cache implemented; per-site variant planned) |
+| [../design/method-call-optimization.md](../design/method-call-optimization.md) | Method call optimization plan (Phase 1 done; Phases 2-3 not yet implemented) |
 | [../design/interpreter.md](../design/interpreter.md) | Bytecode interpreter design |
 | [../design/variables_and_values.md](../design/variables_and_values.md) | Runtime value representation |
 
@@ -37,7 +46,7 @@ Perl Source
     │
     ▼
 ┌─────────┐
-│  Lexer  │  Tokenizes source into LexerTokens
+│  Lexer  │  Tokenizes source into LexerToken instances
 └────┬────┘
      │
      ▼
@@ -70,7 +79,7 @@ Perl Source
 ┌────────────────────────────────────────────────┐
 │              Runtime Types                      │
 │  RuntimeScalar, RuntimeArray, RuntimeHash      │
-│  RuntimeCode, RuntimeGlob, RuntimeIO           │
+│  RuntimeList, RuntimeCode, RuntimeGlob, RuntimeIO │
 └────────────────────────────────────────────────┘
                       │
                       ▼
diff --git a/dev/architecture/block-dispatcher-optimization.md b/dev/architecture/block-dispatcher-optimization.md
index d6388fa61..3327ca15a 100644
--- a/dev/architecture/block-dispatcher-optimization.md
+++ b/dev/architecture/block-dispatcher-optimization.md
@@ -48,11 +48,20 @@ notControlFlow:
 ALOAD controlFlowTempSlot       // Not marked, continue
 ```
 
+> **Note:** In addition to the above, each call site also emits an inline tail-call
+> trampoline (~30 instructions) that handles `goto &NAME` without jumping to the
+> block dispatcher. The actual per-call-site bytecode is ~100-150 bytes, not ~20.
+
 **Block dispatcher (emitted once, ~150 bytes):**
 ```java
 blockDispatcher:
   Get control flow type ordinal
   Check if LAST/NEXT/REDO (0/1/2)
+  IF_ICMPGT propagateToCaller    // ordinals > 2 handled below
+  // Higher ordinals (GOTO=3, TAILCALL=4, RETURN=5):
+  // - GOTO and TAILCALL: propagate to caller
+  // - RETURN (ordinal 5): unwrap return value for non-map/grep blocks,
+  //   then jump to returnLabel; otherwise propagate
   Loop through visible loop labels:
     Match label name
     Dispatch by type to appropriate label
@@ -79,6 +88,11 @@ skipDispatcher:
 - New: 20N + 150 + 3 bytes
 - **Savings: 130N - 153 bytes**
 
+> **Note:** These calculations use a simplified ~20 bytes per call site. The actual
+> per-call-site overhead is ~100-150 bytes due to the inline tailcall trampoline.
+> The block dispatcher sharing still provides significant savings, but the
+> absolute numbers differ from this simplified model.
+
 **Examples:**
 | Calls | Old (bytes) | New (bytes) | Savings | Percentage |
 |-------|-------------|-------------|---------|------------|
@@ -106,14 +120,20 @@ skipDispatcher:
 1. **JavaClassInfo.java**
    - Added `blockDispatcherLabels` map to track dispatcher reuse
    - Added `getLoopStateSignature()` method to compute unique signatures
-   - Imports: Added `HashMap` and `Map`
+   - Imports: Uses wildcard `import java.util.*`
 
 2. **EmitSubroutine.java**
    - Modified call-site emission to use block-level dispatchers
    - Added `emitBlockDispatcher()` helper method
    - Simplified call-site code to ~20 bytes (check + GOTO)
 
-3. **CONTROL_FLOW_IMPLEMENTATION.md**
+3. **Dereference.java** (`backend/jvm/Dereference.java`)
+   - Contains a full copy of the block-dispatcher pattern (lines 982-1109)
+   - Includes `getLoopStateSignature()`, `blockDispatcherLabels` lookup,
+     the tailcall trampoline, and calls `EmitSubroutine.emitBlockDispatcher()`
+   - Significant second emission point alongside EmitSubroutine.java
+
+4. **control-flow.md**
    - Documented block-level dispatcher approach
    - Updated performance metrics
    - Explained why method-level centralization doesn't work
diff --git a/dev/architecture/control-flow.md b/dev/architecture/control-flow.md
index a1f4f4772..2d651ac6d 100644
--- a/dev/architecture/control-flow.md
+++ b/dev/architecture/control-flow.md
@@ -71,7 +71,7 @@ Savings: 600 - 233 = 367 bytes (61% reduction!)
 
 ### Why Method-Level Centralization Doesn't Work
 
-We also investigated centralizing to a single TABLESWITCH at the method's `returnLabel`. Here's why it was rejected:
+We also investigated centralizing to a single dispatcher at the method's `returnLabel`. Here's why it was rejected:
 
 **Problems:**
 1. **Frame computation issues**: Jumping from `returnLabel` (outside loop scope) back to loop labels (inside loop scope) causes "Bad local variable type" errors
@@ -113,11 +113,12 @@ We also investigated centralizing to a single TABLESWITCH at the method's `retur
 #### ControlFlowType Enum
 ```java
 public enum ControlFlowType {
-    LAST(0),    // Exit loop
-    NEXT(1),    // Continue to next iteration
-    REDO(2),    // Restart current iteration
-    GOTO(3),    // Jump to label or named goto
-    TAILCALL(4) // Tail call optimization
+    LAST,      // Exit loop (ordinal 0)
+    NEXT,      // Continue to next iteration (ordinal 1)
+    REDO,      // Restart current iteration (ordinal 2)
+    GOTO,      // Jump to label or named goto (ordinal 3)
+    TAILCALL,  // Tail call optimization (ordinal 4)
+    RETURN     // Non-local return from map/grep block (ordinal 5)
 }
 ```
 
@@ -126,6 +127,8 @@ public enum ControlFlowType {
 public class ControlFlowMarker {
     ControlFlowType type;
     String label;              // Loop/block label (may be null)
+    RuntimeScalar codeRef;     // Code reference for TAILCALL
+    RuntimeArray args;         // Arguments for TAILCALL
     String fileName;           // Source location for errors
     int lineNumber;
 }
@@ -135,10 +138,7 @@ public class ControlFlowMarker {
 ```java
 public class RuntimeControlFlowList extends RuntimeList {
     ControlFlowMarker marker;
-
-    // For tail calls:
-    RuntimeScalar tailCallCodeRef;
-    RuntimeArray tailCallArgs;
+    RuntimeBase returnValue;   // For RETURN type (non-local return from map/grep)
 }
 ```
 
@@ -184,6 +184,9 @@ ISTORE controlFlowActionSlot
 ILOAD controlFlowActionSlot
 ICONST_2
 IF_ICMPGT propagateToCaller
+// Higher ordinals (GOTO=3, TAILCALL=4, RETURN=5) are handled separately:
+// GOTO and TAILCALL propagate to caller; RETURN is unwrapped for
+// non-map/grep blocks or propagated for map/grep blocks.
 
 // Loop through visible loop labels
 for each visible loop {
@@ -227,37 +230,56 @@ skipDispatcher:
 - Subsequent calls with same signature reuse the existing dispatcher
 
 #### EmitterMethodCreator.java
-**Tail call trampoline** at `returnLabel`:
+**Return label** — handles normal returns and propagation:
 
 ```java
-// Check if result is marked
+returnLabel:
 ALOAD returnListSlot
 INVOKEVIRTUAL isNonLocalGoto()
 IFEQ normalReturn
 
-// Get control flow type ordinal
-ALOAD returnListSlot
-CHECKCAST RuntimeControlFlowList
-ASTORE controlFlowTempSlot
+// For eval blocks: check if RETURN type — if so, propagate
+// Otherwise: non-local control flow escaped eval, set $@ and return empty
+// For regular subs: propagate marker to caller
+ARETURN
+
+normalReturn:
+ARETURN
+```
+
+**Tail call trampoline** — emitted at each call site in `EmitSubroutine.java` (not at returnLabel):
+
+```java
+// At each call site, after isNonLocalGoto check:
 ALOAD controlFlowTempSlot
+CHECKCAST RuntimeControlFlowList
 INVOKEVIRTUAL getControlFlowType()
 INVOKEVIRTUAL ordinal()
+ICONST_4                          // TAILCALL ordinal
+IF_ICMPNE blockDispatcher         // Not a tail call, go to dispatcher
 
-// Dispatch with TABLESWITCH
-TABLESWITCH (0-4) {
-  case 0: handleLast
-  case 1: handleNext
-  case 2: handleRedo
-  case 3: handleGoto
-  case 4: handleTailcall
-  default: handleError
-}
-```
+tailcallLoop:
+ALOAD controlFlowTempSlot
+CHECKCAST RuntimeControlFlowList
+INVOKEVIRTUAL getTailCallCodeRef()
+ASTORE codeRefSlot
+INVOKEVIRTUAL getTailCallArgs()
+ASTORE argsSlot
+// Re-invoke
+ALOAD codeRefSlot
+ALOAD argsSlot
+INVOKESTATIC RuntimeCode.apply(...)
+ASTORE controlFlowTempSlot
 
-Each case handler:
-- Checks loop labels to find matching target
-- Jumps to appropriate loop label (lastLabel/nextLabel/redoLabel)
-- Or propagates marker to caller if no match found
+// Check if result is another tail call
+ALOAD controlFlowTempSlot
+INVOKEVIRTUAL isNonLocalGoto()
+IFEQ notControlFlow
+// Get type ordinal, check if still TAILCALL
+ICONST_4
+IF_ICMPEQ tailcallLoop           // Loop if still TAILCALL
+GOTO blockDispatcher              // Otherwise dispatch normally
+```
 
 ---
 
@@ -295,29 +317,14 @@ for my $i (1..10) {
 At call site:
 ```
 INVOKESTATIC RuntimeCode.apply(...)  # Call inner()
-ASTORE checkTempSlot                 # Store result
-ALOAD checkTempSlot
-INSTANCEOF RuntimeControlFlowList
-IFEQ notControlFlow
-ALOAD checkTempSlot
-ASTORE returnValueSlot
-GOTO returnLabel                     # Jump to dispatcher
-notControlFlow:
-ALOAD checkTempSlot
-```
-
-At returnLabel (TABLESWITCH dispatcher):
-```
-ALOAD returnValueSlot
+ASTORE controlFlowTempSlot          # Store result
+ALOAD controlFlowTempSlot
 INVOKEVIRTUAL isNonLocalGoto()
-IFEQ normalReturn
-# ... get ordinal ...
-TABLESWITCH -> handleLast
+IFEQ notControlFlow
 
-handleLast:
-# Check if loop label matches
-# If match: GOTO lastLabel
-# Else: propagate to caller
+GOTO blockDispatcher                 # Jump to block-level dispatcher
+notControlFlow:
+ALOAD controlFlowTempSlot
 ```
 
 ### Example 3: Tail Call
@@ -353,9 +360,17 @@ IFEQ normalReturn
 ICONST_4
 IF_ICMPEQ tailcallLoop  # Loop if still TAILCALL
 
-# Not TAILCALL anymore, dispatch via TABLESWITCH
+# Not TAILCALL anymore, dispatch via block dispatcher
 ```
 
+### Non-Local Return from Map/Grep Blocks
+
+When `return` is used inside a `map` or `grep` block, it should return from the enclosing subroutine, not just the block. This uses the `RETURN` control flow type:
+
+- Inside map/grep blocks (`isMapGrepBlock`), `return` creates a `RuntimeControlFlowList` with `ControlFlowType.RETURN` and carries the return value in `returnValue`
+- The block dispatcher recognizes RETURN (ordinal 5) and unwraps the return value if the current context is a normal subroutine (not map/grep)
+- If unwrapping fails, `PerlNonLocalReturnException` is thrown for stack unwinding through Java-level map/grep calls
+
 ---
 
 ## Performance
@@ -406,8 +421,7 @@ IF_ICMPEQ tailcallLoop  # Loop if still TAILCALL
   - One GOTO to shared dispatcher (if marked)
   - Shared dispatcher logic executes once (not per call)
   - O(1) dispatch regardless of loop depth
-  - One TABLESWITCH at dispatcher
-  - O(1) dispatch regardless of loop depth
+  - Conditional branch chain at dispatcher
 - **Tail calls**: Iterative trampoline (constant stack space)
 
 ---
@@ -437,17 +451,17 @@ ALOAD tempSlot                // Stack: [result]
 
 **Key principle:** All control flow paths must arrive at labels with identical stack heights.
 
-### Why Centralized TABLESWITCH?
+### Why Block-Level Shared Dispatchers?
 
 **Old approach:** Check and dispatch at each call site (150 bytes each)
 
 **New approach:**
-1. Call site: Simple check + jump to returnLabel (~20 bytes)
-2. returnLabel: Single TABLESWITCH dispatches all types (100 bytes total)
+1. Call site: Simple check + jump to block dispatcher (~20 bytes)
+2. Block-level dispatcher handles all types using conditional branches (emitted once per unique loop state)
 
 **Benefits:**
 - **Massive bytecode savings** (130 bytes per call)
-- **O(1) dispatch** via TABLESWITCH (hardware-optimized)
+- **Efficient dispatch** using conditional branch chain
 - **Better JIT compilation** (less bytecode to optimize)
 - **Single point of control** (easier to maintain/debug)
 
@@ -465,12 +479,13 @@ The `controlFlowTempSlot` holds the `RuntimeControlFlowList` during dispatch, se
 
 ```java
 // EmitSubroutine.java
-ENABLE_CONTROL_FLOW_CHECKS = true;  // ✅ Call-site checks
+ENABLE_CONTROL_FLOW_CHECKS = true;  // Call-site checks
 
 // EmitterMethodCreator.java
-ENABLE_TAILCALL_TRAMPOLINE = true;  // ✅ Tail call optimization
+ENABLE_TAILCALL_TRAMPOLINE = true;  // Tail call optimization
 
 // EmitControlFlow.java
+ENABLE_TAGGED_RETURNS = true;       // Tagged return values
 DEBUG_CONTROL_FLOW = false;         // Debug output
 ```
 
@@ -499,39 +514,12 @@ DEBUG_CONTROL_FLOW = false;         // Debug output
 
 ---
 
-## Historical Context
-
-### Evolution of the Implementation
-
-**Phase 1: Exception-Based (2024)**
-- Used Java exceptions (LastException, NextException)
-- Problems: VerifyErrors, stack consistency issues, "Method too large"
-- Pass rate: ~70%
-
-**Phase 2: Tagged Returns v1 (2025-11)**
-- Introduced RuntimeControlFlowList
-- Call-site checks with DUP/stack manipulation
-- Problem: ASM frame computation failures
-- Pass rate: 30% (massive regression)
-
-**Phase 3: Runtime Registry (2025-11)**
-- ThreadLocal storage for control flow markers
-- Checks at loop boundaries instead of call sites
-- Success: 100% pass rate
-- Trade-off: Additional checks at every labeled loop
-
-**Phase 4: Optimized Tagged Returns (2026-02) ← CURRENT**
-- Tagged returns with register-only bytecode
-- Centralized TABLESWITCH dispatch
-- **Success: 100% pass rate with minimal bytecode**
-- No stack manipulation, ASM-friendly patterns
-
-### Key Learnings
+## Design Lessons
 
 1. **ASM's COMPUTE_FRAMES is fragile** with stack manipulation after method calls
 2. **Local variable slots are ASM-friendly**, stack operations are not
 3. **Centralized dispatch** is more efficient than per-call-site dispatch
-4. **TABLESWITCH is perfect** for control flow type dispatch
+4. **Conditional branch chains** efficiently dispatch control flow types
 5. **Zero-overhead local flow** is achievable with direct GOTO
 
 ---
@@ -539,14 +527,14 @@ DEBUG_CONTROL_FLOW = false;         // Debug output
 ## Implementation Files
 
 ### Core Implementation
-- `src/main/java/org/perlonjava/runtime/ControlFlowType.java`
-- `src/main/java/org/perlonjava/runtime/ControlFlowMarker.java`
-- `src/main/java/org/perlonjava/runtime/RuntimeControlFlowList.java`
+- `src/main/java/org/perlonjava/runtime/runtimetypes/ControlFlowType.java`
+- `src/main/java/org/perlonjava/runtime/runtimetypes/ControlFlowMarker.java`
+- `src/main/java/org/perlonjava/runtime/runtimetypes/RuntimeControlFlowList.java`
 
 ### Code Generation
-- `src/main/java/org/perlonjava/codegen/EmitControlFlow.java` - Emit control flow operators
-- `src/main/java/org/perlonjava/codegen/EmitSubroutine.java` - Call-site checks
-- `src/main/java/org/perlonjava/codegen/EmitterMethodCreator.java` - TABLESWITCH dispatcher
+- `src/main/java/org/perlonjava/backend/jvm/EmitControlFlow.java` - Emit control flow operators
+- `src/main/java/org/perlonjava/backend/jvm/EmitSubroutine.java` - Call-site checks
+- `src/main/java/org/perlonjava/backend/jvm/EmitterMethodCreator.java` - Return label and eval handling
 
 ### Tests
 - `src/test/resources/unit/control_flow.t`
@@ -613,16 +601,16 @@ if (ctx.javaClassInfo.hasLabeledBlocks) {
 
 ---
 
-## Why Centralized TABLESWITCH Doesn't Work
+## Why Full Centralization Doesn't Work
 
-**Initial idea:** Move all control flow checking to a single TABLESWITCH dispatcher at the method's returnLabel to reduce per-call-site bytecode.
+**Initial idea:** Move all control flow checking to a single centralized dispatcher at the method's returnLabel to reduce per-call-site bytecode.
 
 **Problem:** The centralized dispatcher would need to check ALL loop labels in the entire method, not just the labels visible at each call site. For complex methods with many nested loops:
 
-- **Old approach (distributed):** Each of N calls checks M visible labels = N × M × ~13 bytes
-- **New approach (centralized):** Each of N calls: ~20 bytes + central dispatcher checking ALL L labels = N × 20 + L × 3 × ~13 bytes
+- **Distributed block-level approach:** Each of N calls checks M visible labels = N × M × ~13 bytes
+- **Fully centralized approach:** Each of N calls: ~20 bytes + central dispatcher checking ALL L labels = N × 20 + L × 3 × ~13 bytes
 
-The centralized approach only helps when:
+The fully centralized approach only helps when:
 ```
 N × M × 13 > N × 20 + L × 3 × 13
 N × (M × 13 - 20) > L × 39
@@ -631,9 +619,9 @@ N > L × 39 / (M × 13 - 20)
 
 For typical values (M=5, L=20): N > 20 × 39 / 45 ≈ 17.3
 
-So centralization only helps when there are 18+ call sites AND each call site has fewer visible labels than the method has total labels. In practice, this rarely occurs.
+So full centralization only helps when there are 18+ call sites AND each call site has fewer visible labels than the method has total labels. In practice, this rarely occurs.
 
-**Conclusion:** The distributed approach with fast-path optimization (implemented above) is superior.
+**Conclusion:** The distributed block-level dispatcher approach with fast-path optimization (implemented above) is superior.
 
 ---
 
@@ -678,8 +666,6 @@ The current control flow implementation represents a **mature, production-ready
 
 ## References
 
-- **Implementation branch:** `master` (merged 2026-02-04)
-- **Original design docs:** `dev/design/CONTROL_FLOW_*.md` (archived)
 - **ASM documentation:** https://asm.ow2.io/javadoc/
 - **JVM Spec on stack frames:** https://docs.oracle.com/javase/specs/jvms/se17/html/jvms-4.html#jvms-4.7.4
 - **Perl control flow semantics:** https://perldoc.perl.org/perlsyn#Basic-BLOCKs
diff --git a/dev/architecture/dynamic-scope.md b/dev/architecture/dynamic-scope.md
index c76f558cb..07ac653db 100644
--- a/dev/architecture/dynamic-scope.md
+++ b/dev/architecture/dynamic-scope.md
@@ -39,6 +39,8 @@ Implementations:
 - `RuntimeArray` - array variables  
 - `RuntimeHash` - hash variables
 - `RuntimeGlob` - typeglobs
+- `GlobalRuntimeArray` - global array localization (`local @array`)
+- `GlobalRuntimeHash` - global hash localization (`local %hash`)
 - `DeferBlock` - defer block execution
 - `RegexState` - regex match state ($1, $2, etc.)
 
@@ -50,15 +52,19 @@ Manages a stack of saved states:
 public class DynamicVariableManager {
     private static final Deque<DynamicState> variableStack = new ArrayDeque<>();
     
-    // Save current state and push onto stack
-    public static void pushLocalVariable(DynamicState variable) {
-        variable.dynamicSaveState();
-        variableStack.addLast(variable);
-    }
+    // Save current state and push onto stack (4 overloads)
+    public static RuntimeBase pushLocalVariable(RuntimeBase variable)     // returns variable
+    public static RuntimeScalar pushLocalVariable(RuntimeScalar variable) // returns variable
+    public static RuntimeGlob pushLocalVariable(RuntimeGlob variable)    // special: returns new glob from GlobalVariable.getGlobalIO()
+    public static void pushLocalVariable(DynamicState variable)          // for DeferBlock, RegexState
+    
+    // Each overload calls variable.dynamicSaveState() and variableStack.addLast(variable).
+    // The RuntimeGlob overload has special behavior: it returns the new glob obtained
+    // from GlobalVariable.getGlobalIO(), not the original variable.
     
     // Restore all states back to a saved level
-    public static void popToLocalLevel(int targetLevel) {
-        while (variableStack.size() > targetLevel) {
+    public static void popToLocalLevel(int targetLocalLevel) {
+        while (variableStack.size() > targetLocalLevel) {
             DynamicState variable = variableStack.removeLast();
             variable.dynamicRestoreState();
         }
@@ -85,7 +91,13 @@ public class RuntimeScalar implements DynamicState {
         RuntimeScalar copy = new RuntimeScalar();
         copy.type = this.type;
         copy.value = this.value;
+        copy.blessId = this.blessId;
         dynamicStateStack.push(copy);
+        // Reset to undef — this is the key `local` behavior:
+        // the variable is cleared after saving
+        this.type = UNDEF;
+        this.value = null;
+        this.blessId = 0;
     }
     
     @Override
@@ -93,6 +105,7 @@ public class RuntimeScalar implements DynamicState {
         RuntimeScalar saved = dynamicStateStack.pop();
         this.type = saved.type;
         this.value = saved.value;
+        this.blessId = saved.blessId;
     }
 }
 ```
@@ -150,12 +163,13 @@ The same mechanism is used for several other features:
 Implementation:
 ```java
 public class DeferBlock implements DynamicState {
-    private final RuntimeCode code;
+    private final RuntimeScalar codeRef;
+    private final RuntimeArray capturedArgs;  // captures enclosing subroutine's @_
     
     @Override
     public void dynamicRestoreState() {
-        // Execute the defer block
-        code.apply(new RuntimeArray(), RuntimeContextType.VOID);
+        // Execute the defer block (static call, uses capturedArgs)
+        RuntimeCode.apply(codeRef, capturedArgs, RuntimeContextType.VOID);
     }
 }
 ```
@@ -245,6 +259,9 @@ This ensures:
 | `RuntimeScalar.java` | Scalar save/restore |
 | `RuntimeArray.java` | Array save/restore |
 | `RuntimeHash.java` | Hash save/restore |
+| `GlobalRuntimeArray.java` | Implements DynamicState for global array localization |
+| `GlobalRuntimeHash.java` | Implements DynamicState for global hash localization |
+| `GlobalRuntimeScalar.java` | Contains `makeLocal()`, the primary entry point for `local $scalar` |
 | `DeferBlock.java` | Defer block execution |
 | `RegexState.java` | Regex state save/restore |
 | `Local.java` | Code generation helpers |
@@ -253,4 +270,4 @@ This ensures:
 ## See Also
 
 - [lexical-pragmas.md](lexical-pragmas.md) - How warnings/strict use this mechanism
-- [../design/warnings-scope.md](../design/warnings-scope.md) - Warning scope design
+- [../design/lexical-warnings.md](../design/lexical-warnings.md) - Warning scope design
diff --git a/dev/architecture/large-code-refactoring.md b/dev/architecture/large-code-refactoring.md
index fcda0f612..4bfe3c600 100644
--- a/dev/architecture/large-code-refactoring.md
+++ b/dev/architecture/large-code-refactoring.md
@@ -1,30 +1,27 @@
-# Large Code Refactoring - Automatic Retry Implementation
-
-## Status: ✅ IMPLEMENTED
+# Large Code Handling — JVM 64KB Method Limit
 
 ## Overview
 
-PerlOnJava automatically handles large code blocks that exceed JVM's 65KB method size limit through on-demand refactoring. When compilation fails with "Method too large" error, the compiler automatically retries with refactoring enabled.
+PerlOnJava uses a **two-tier strategy** to handle Perl code that exceeds the JVM's 65,535-byte method size limit:
 
-## Problem Discovery
+1. **Proactive**: During codegen, large blocks are detected and wrapped in closure calls to split them across multiple JVM methods
+2. **Reactive fallback**: If ASM still produces a method that's too large, the code is compiled using the bytecode interpreter backend instead
 
-When attempting to enable proactive refactoring by default, all 151 unit tests failed with errors like:
-```
-Global symbol "%Config" requires explicit package name
-```
+## The Problem
+
+The JVM limits each method to 65,535 bytes of bytecode. PerlOnJava compiles each Perl subroutine (or eval block) into a single JVM method. Large Perl files — such as test suites with thousands of assertions, or modules with large data structures — can exceed this limit.
 
-### Root Cause
+### Closure Scoping Complication
 
-Large-code refactoring wraps chunks of block statements in closures to avoid JVM's 65KB method size limit. However, when `use` or `require` statements are wrapped in closures, their imports happen in the closure's lexical scope instead of the package scope, breaking code that expects those imports to be available.
+The natural fix is to split large blocks into chunks wrapped in anonymous subs: `sub { ...chunk... }->(@_)`. However, this changes lexical scoping. When `use` or `require` statements are wrapped in closures, their imports happen in the closure's scope instead of the package scope:
 
-Example:
 ```perl
 # Original code
 package Foo;
 use Config qw/%Config/;   # Import %Config into Foo package
 my $x = $Config{foo};      # Access imported variable
 
-# After refactoring (BROKEN)
+# After naive refactoring (BROKEN)
 package Foo;
 sub {
     use Config qw/%Config/;   # Import happens in closure scope!
@@ -32,156 +29,109 @@ sub {
 my $x = $Config{foo};      # ERROR: %Config not in scope
 ```
 
-## Solution Evolution
+This is why proactive refactoring skips subroutines, special blocks (BEGIN/END/INIT/CHECK/UNITCHECK), and blocks with unsafe control flow.
 
-### Initial Approach: Proactive Detection
-First considered detecting `use`/`require`/`BEGIN` statements and keeping them in unrefactored prefix sections. However, this approach had several issues:
-- Estimation overhead for all code
-- Complex logic to detect all compile-time statements
-- Still produces false positives
-- Changes semantics even when refactoring not needed
+## Tier 1: Proactive Block Refactoring
 
-### Better Approach: Reactive/On-Demand Refactoring ✅
+### Entry Point
 
-Instead of predicting when refactoring is needed, **react to actual compilation errors**:
+`LargeBlockRefactorer.processBlock()` is called from `EmitBlock.emitBlock()` during bytecode emission for every `BlockNode`.
 
-**Old flow (Proactive):**
-```
-Parse → Estimate size → Refactor if large → Compile
-Problems:
-- Estimation overhead for all code
-- False positives break imports
-- Semantic changes even when not needed
-```
+### Flow
 
-**New flow (Reactive):**
 ```
-Parse → Compile normally
-  ↓ (if Method too large error)
-Catch error → Enable refactoring → Retry compilation
-Benefits:
-- Zero overhead for normal code
-- Only refactor when truly necessary
-- No semantic changes unless required
+EmitBlock.emitBlock(visitor, blockNode)
+  └── LargeBlockRefactorer.processBlock(visitor, blockNode)
+        ├── Skip if: already refactored, is subroutine, is special block, ≤4 elements
+        ├── Estimate bytecode size (capped at 2 × LARGE_BYTECODE_SIZE)
+        ├── If estimated > LARGE_BYTECODE_SIZE (40,000):
+        │     └── tryWholeBlockRefactoring():
+        │           ├── Check for unsafe control flow (unlabeled next/last/redo/goto) → abort if found
+        │           ├── Mark blockAlreadyRefactored = true
+        │           └── Wrap entire block in: sub { <block> }->(@_)
+        └── Return false → normal block emission continues
 ```
 
-## Implementation
-
-### Code Changes
-
-#### LargeBlockRefactorer.java
-Added support for automatic retry that bypasses the `IS_REFACTORING_ENABLED` check:
-
-```java
-public static void forceRefactorForCodegen(BlockNode node, boolean isAutoRetry) {
-    // Only check IS_REFACTORING_ENABLED if NOT auto-retry
-    if (!isAutoRetry && !IS_REFACTORING_ENABLED) {
-        return;
-    }
-    if (node == null) {
-        return;
-    }
-    // Prevent infinite retry loops
-    Object attemptsObj = node.getAnnotation("refactorAttempts");
-    int attempts = attemptsObj instanceof Integer ? (Integer) attemptsObj : 0;
-    if (attempts >= MAX_REFACTOR_ATTEMPTS) {
-        return;
-    }
-    node.setAnnotation("refactorAttempts", attempts + 1);
-    node.setAnnotation("blockAlreadyRefactored", false);
-    trySmartChunking(node, null, 256);
-    processPendingRefactors();
-}
-```
+Wrapping pushes the block's code into a separate JVM method (the anonymous sub body), giving it its own 64KB budget.
 
-#### EmitterMethodCreator.java
-Modified to catch `MethodTooLargeException` and automatically retry with refactoring:
-
-```java
-try {
-    return getBytecodeInternal(ctx, ast, useTryCatch, false);
-} catch (MethodTooLargeException tooLarge) {
-    // Automatic retry with refactoring on "Method too large" error
-    if (ast instanceof BlockNode blockAst) {
-        try {
-            // Notify user that automatic refactoring is happening
-            System.err.println("Note: Method too large, retrying with automatic refactoring.");
-            // Force refactoring with auto-retry flag
-            LargeBlockRefactorer.forceRefactorForCodegen(blockAst, true);
-            // Reset JavaClassInfo to avoid reusing partially-resolved Labels
-            if (ctx != null && ctx.javaClassInfo != null) {
-                String previousName = ctx.javaClassInfo.javaClassName;
-                ctx.javaClassInfo = new JavaClassInfo();
-                ctx.javaClassInfo.javaClassName = previousName;
-                ctx.clearContextCache();
-            }
-            return getBytecodeInternal(ctx, ast, useTryCatch, false);
-        } catch (MethodTooLargeException retryTooLarge) {
-            throw retryTooLarge;
-        } catch (Throwable retryError) {
-            System.err.println("Warning: Automatic refactoring failed: " + retryError.getMessage());
-        }
-    }
-    throw tooLarge;
-}
-```
-
-## Testing Results
+### Thresholds
 
-### Test Case: 30,000 Element Array
+| Constant | Value | File | Purpose |
+|----------|-------|------|---------|
+| `LARGE_BYTECODE_SIZE` | 40,000 bytes | `BlockRefactor.java` | Trigger threshold (below 65,535 for safety margin) |
+| `MIN_CHUNK_SIZE` | 4 elements | `BlockRefactor.java` | Minimum block size to consider refactoring |
 
-```bash
-# Automatic retry (default behavior)
-./jperl /tmp/test_auto_refactor.pl
-# Output: "Note: Method too large, retrying with automatic refactoring."
-#         May still fail if code is extremely large (exceeds limits even after refactoring)
-```
+### Key Classes
 
-### Unit Tests
-- ✅ All 2012 unit tests pass
-- ✅ Normal code has zero overhead (no refactoring unless needed)
-- ✅ Config imports work correctly
-- ✅ op/pack.t: 14656/14726 ok (same as before)
+- **`BlockRefactor`** (`backend/jvm/astrefactor/BlockRefactor.java`) — Utility methods: `createAnonSubCall()` creates `sub { ... }->(@_)` AST nodes, `buildNestedStructure()` builds nested tail-closure chains, `createBlockNode()` with anti-recursion guard
+- **`LargeBlockRefactorer`** (`backend/jvm/astrefactor/LargeBlockRefactorer.java`) — Orchestrates block-level refactoring: size estimation, control flow safety checks, whole-block wrapping
 
-## Benefits
+## Tier 2: Interpreter Fallback
 
-1. **Zero overhead for normal code** - No bytecode estimation unless actually needed
-2. **Semantic correctness** - Imports and `use` statements work normally
-3. **Automatic handling** - Users don't need to set environment variables
-4. **Fail-safe** - Catches extreme cases that exceed limits even after refactoring
+When the proactive refactoring is insufficient (or skipped due to unsafe control flow), ASM may still throw `MethodTooLargeException`. The fallback catches this and compiles the code using the bytecode interpreter instead.
 
-## User Messages
+### Flow
 
-### On automatic retry:
 ```
-Note: Method too large, retrying with automatic refactoring.
+EmitterMethodCreator.createRuntimeCode(ctx, ast, useTryCatch)
+  └── try: createClassWithMethod() → getBytecode() → ASM toByteArray()
+        └── MethodTooLargeException thrown by ASM
+  └── catch (MethodTooLargeException):
+        └── if USE_INTERPRETER_FALLBACK (default: true):
+              └── compileToInterpreter(ast, ctx, useTryCatch)
+                    → Returns InterpretedCode (walks AST at runtime)
 ```
 
-### On failure after retry:
-```
-Hint: If this is a 'Method too large' error after automatic refactoring,
-      the code may be too complex to compile. Consider splitting into smaller methods.
-```
+Both `CompiledCode` and `InterpretedCode` extend `RuntimeCode`, so call sites don't need to know which backend was used.
+
+### Configuration
 
-## Usage Recommendations
+`USE_INTERPRETER_FALLBACK` is enabled by default. It can be disabled with the environment variable `JPERL_DISABLE_INTERPRETER_FALLBACK`.
 
-- **Default behavior (on-demand)**: ✅ Automatic and transparent for all code
-- No configuration needed - refactoring happens automatically when required
+The fallback also handles other compilation failures (`VerifyError`, `ClassFormatError`, certain `PerlCompilerException` and `RuntimeException` cases).
+
+### User Message
+
+When fallback is triggered with `JPERL_SHOW_FALLBACK=1`:
+```
+Note: Method too large after AST splitting, using interpreter backend.
+```
 
 ## Technical Details
 
-### JVM Method Size Limits
+### JVM Constraints
 - Maximum method bytecode size: 65,535 bytes (64KB)
-- Threshold for refactoring check: 40,000 bytes
-- Refactoring splits large blocks into closure-wrapped chunks
+- Proactive refactoring threshold: 40,000 bytes (safety margin)
 
 ### Refactoring Strategy
-1. **Smart chunking**: Groups statements into manageable chunks
-2. **Closure wrapping**: Each chunk becomes `sub { ... }->()`
-3. **Context preservation**: Return values and control flow maintained
-4. **Safe boundaries**: Never splits statements mid-expression
-
-### Retry Logic
-- Maximum retry attempts per block: 1 (prevents infinite loops)
-- Tracks attempts via block annotation `refactorAttempts`
-- Resets `JavaClassInfo` between attempts to clear partial state
+1. **Whole-block wrapping**: The entire block becomes `sub { <block> }->(@_)`
+2. **`@_` passthrough**: Arguments are forwarded so the wrapper is transparent
+3. **Anti-recursion guard**: `BlockRefactor.createBlockNode()` sets a thread-local `skipRefactoring` flag to prevent infinite recursion when the wrapper's BlockNode is constructed
+4. **Safe boundaries**: Blocks with unlabeled control flow (`next`/`last`/`redo`/`goto` outside loops) are not refactored, since these would break when wrapped in a closure
+
+### Dead Code
+
+The codebase contains remnants of a former retry-based approach that was replaced by the interpreter fallback:
+
+| Dead Code | Purpose (unused) |
+|-----------|-----------------|
+| `LargeBlockRefactorer.forceRefactorForCodegen()` | Was meant for reactive retry after MethodTooLargeException |
+| `LargeBlockRefactorer.trySmartChunking()` | Sophisticated chunking algorithm (only called by dead code above) |
+| `DepthFirstLiteralRefactorVisitor` (entire class) | Depth-first literal refactoring (marked OBSOLETE in design docs) |
+| `LargeNodeRefactorer` (entire class) | Element list chunking (only called by dead code above) |
+
+These are candidates for removal.
+
+## Implementation Files
+
+| File | Role |
+|------|------|
+| `backend/jvm/astrefactor/BlockRefactor.java` | Constants, closure-wrapping utilities |
+| `backend/jvm/astrefactor/LargeBlockRefactorer.java` | Block-level proactive refactoring |
+| `backend/jvm/EmitBlock.java` | Calls `processBlock()` during block emission |
+| `backend/jvm/EmitterMethodCreator.java` | Catches `MethodTooLargeException`, triggers interpreter fallback |
+
+## See Also
+
+- [control-flow.md](control-flow.md) — Control flow interacts with refactoring (unsafe control flow prevents block wrapping)
+- [../design/interpreter.md](../design/interpreter.md) — Bytecode interpreter design (the fallback backend)
diff --git a/dev/architecture/lexical-pragmas.md b/dev/architecture/lexical-pragmas.md
index c37fe225c..3328d3c2f 100644
--- a/dev/architecture/lexical-pragmas.md
+++ b/dev/architecture/lexical-pragmas.md
@@ -43,23 +43,32 @@ PerlOnJava tracks both using different mechanisms.
 ```java
 public class ScopedSymbolTable {
     // Warning flags - one BitSet per scope level
-    public final Deque<BitSet> warningFlagsStack = new ArrayDeque<>();
+    public final Stack<BitSet> warningFlagsStack = new Stack<>();
     
     // Feature flags - integer bitmask per scope
-    public final Deque<Integer> featureFlagsStack = new ArrayDeque<>();
+    public final Stack<Integer> featureFlagsStack = new Stack<>();
     
     // Strict options - integer bitmask per scope
-    public final Deque<Integer> strictOptionsStack = new ArrayDeque<>();
+    public final Stack<Integer> strictOptionsStack = new Stack<>();
+    
+    // Warning disabled flags - tracks "no warnings 'category'" per scope
+    public final Stack<BitSet> warningDisabledStack = new Stack<>();
+    
+    // Warning fatal flags - tracks "use warnings FATAL => 'category'" per scope
+    public final Stack<BitSet> warningFatalStack = new Stack<>();
 }
 ```
 
 When entering a new scope (block, subroutine, file):
 ```java
-void enterScope() {
+int enterScope() {
     // Clone current flags for new scope
     warningFlagsStack.push((BitSet) warningFlagsStack.peek().clone());
     featureFlagsStack.push(featureFlagsStack.peek());
     strictOptionsStack.push(strictOptionsStack.peek());
+    warningDisabledStack.push((BitSet) warningDisabledStack.peek().clone());
+    warningFatalStack.push((BitSet) warningFatalStack.peek().clone());
+    // returns current scope depth
 }
 ```
 
@@ -85,27 +94,22 @@ public class WarningFlags {
     }
     
     // Each category maps to a bit position
-    public void enableWarning(String category) {
-        int bit = getCategoryBit(category);
-        currentFlags.set(bit);
-        // Also enable subcategories
-        for (String sub : getSubcategories(category)) {
-            enableWarning(sub);
-        }
-    }
+    // Enabling/disabling warnings delegates to ScopedSymbolTable:
+    //   ScopedSymbolTable.enableWarningCategory(category)
+    // which calls setWarningState() to update the current scope's BitSet.
+    // Subcategories are recursively expanded via the hierarchy map.
 }
 ```
 
 ### Strict Options
 
-Strict has three options encoded as bits:
+Strict has three options encoded as bit constants in `Strict.java`:
 
 ```java
-public class StrictOptions {
-    public static final int STRICT_REFS = 1;  // no symbolic refs
-    public static final int STRICT_VARS = 2;  // must declare variables
-    public static final int STRICT_SUBS = 4;  // no barewords as subs
-}
+// In Strict.java (not a separate StrictOptions class)
+public static final int HINT_STRICT_REFS = 0x00000002;
+public static final int HINT_STRICT_SUBS = 0x00000200;
+public static final int HINT_STRICT_VARS = 0x00000400;
 ```
 
 ### CompilerFlagNode
@@ -118,6 +122,9 @@ public class CompilerFlagNode extends AbstractNode {
     private final int featureFlags;
     private final int strictOptions;
     private final int warningScopeId;  // For runtime propagation
+    private final java.util.BitSet warningFatalFlags;
+    private final java.util.BitSet warningDisabledFlags;
+    private final int hintHashSnapshotId;
 }
 ```
 
@@ -147,7 +154,7 @@ We use a runtime mechanism with dynamic scoping:
    ```java
    public static int registerScopeWarnings(Set<String> categories) {
        int scopeId = scopeIdCounter.incrementAndGet();
-       scopeDisabledWarnings.put(scopeId, expandCategories(categories));
+       scopeDisabledWarnings.put(scopeId, expandCategory(categories));
        return scopeId;
    }
    ```
@@ -169,6 +176,10 @@ We use a runtime mechanism with dynamic scoping:
        // ... emit warning
    }
    ```
+   > **Note:** The actual implementation in `Warnings.java` is significantly more complex—it
+   > uses `caller()[9]` warning bits as the primary mechanism for checking whether a warning
+   > category is enabled at the call site, with `${^WARNING_SCOPE}` as an additional
+   > suppression check. The pseudocode above is a simplified illustration.
 
 4. **Automatic Restore**: When scope exits, `DynamicVariableManager` restores `${^WARNING_SCOPE}` to its previous value.
 
@@ -300,4 +311,4 @@ Warnings.warnIf()
 ## See Also
 
 - [dynamic-scope.md](dynamic-scope.md) - How `local` makes runtime scoping work
-- [../design/warnings-scope.md](../design/warnings-scope.md) - Detailed design for warning scope propagation
+- [../design/lexical-warnings.md](../design/lexical-warnings.md) - Detailed design for warning scope propagation
diff --git a/dev/architecture/weaken-destroy.md b/dev/architecture/weaken-destroy.md
index c0a5d90be..2a78f5ff3 100644
--- a/dev/architecture/weaken-destroy.md
+++ b/dev/architecture/weaken-destroy.md
@@ -2,7 +2,6 @@
 
 **Last Updated:** 2026-04-10
 **Status:** PRODUCTION READY - 841/841 Moo subtests (100%), all unit tests passing
-**Branch:** `feature/destroy-weaken` (rebased on origin/master at `3a3bb3f8e`)
 
 ---
 
@@ -87,19 +86,10 @@ reference is overwritten — even though the CODE ref is still alive in the stas
 This would break Sub::Quote/Sub::Defer (which use `weaken()` for
 back-references) and cascade to break Moo's accessor inlining.
 
-**Note:** The previous blessId==0→WEAKLY_TRACKED transition (for unblessed
-birth-tracked objects with remaining strong refs after `weaken()`) was removed.
-It caused premature clearing of weak refs when ANY strong ref exited scope, even
-though other strong refs still existed. Birth-tracked objects maintain accurate
-refCounts through `setLarge()`/`scopeExitCleanup()` — closure captures are
-birth-tracked (refCountOwned=false) and don't decrement refCount on cleanup.
-
-See "Design History" section for the evolution of this design.
-
 | Value | Meaning |
 |-------|---------|
 | `-1` | **Untracked.** Default state. Object is unblessed or blessed into a class without DESTROY. No refCount bookkeeping occurs. `weaken()` transitions non-CODE objects to WEAKLY_TRACKED (-2) and registers the weak ref in WeakRefRegistry. CODE refs stay at -1. |
-| `0` | **Birth-tracked.** Freshly blessed into a DESTROY class, or anonymous hash/code via `createReferenceWithTrackedElements`. No variable holds a reference yet -- `setLarge()` will increment to 1 on first assignment. |
+| `0` | **Birth-tracked.** Freshly blessed into a DESTROY class, anonymous hash via `createReferenceWithTrackedElements()`, or closures with captures via `RuntimeCode.makeCodeObject()`. No variable holds a reference yet -- `setLarge()` will increment to 1 on first assignment. |
 | `> 0` | **Tracked.** N strong references exist in named variables. Each `setLarge()` assignment increments; each scope exit or reassignment decrements. |
 | `-2` | **WEAKLY_TRACKED.** Entered when `weaken()` is called on an untracked non-CODE object (refCount == -1). A heuristic allowing weak ref clearing when a strong ref is explicitly dropped via `undefine()`. `setLarge()` and `scopeExitCleanup()` do NOT clear weak refs for this state — only explicit `undefine()`. |
 | `MIN_VALUE` | **Destroyed.** DESTROY has been called (or is in progress). Prevents double-destruction. |
@@ -145,7 +135,7 @@ variable to trigger destruction.
 | `RuntimeScalar.java` | `setLarge()` (increment/decrement), `scopeExitCleanup()`, `undefine()`, `incrementRefCountForContainerStore()` |
 | `RuntimeList.java` | `setFromList()` -- list destructuring with materialized copy refcount undo |
 | `RuntimeHash.java` | `createReferenceWithTrackedElements()` (birth-tracking for anonymous hashes), `delete()` with deferred decrement |
-| `RuntimeArray.java` | `createReferenceWithTrackedElements()` (element tracking, NOT birth-tracked -- see Limitations), `pop()`/`shift()` with deferred decrement |
+| `RuntimeArray.java` | `createReferenceWithTrackedElements()` (element tracking, NOT birth-tracked -- see Limitations) |
 | `WeakRefRegistry.java` | Weak reference tracking: forward set + reverse map |
 | `DestroyDispatch.java` | DESTROY method resolution, caching, invocation |
 | `MortalList.java` | Deferred decrements (FREETMPS equivalent) |
@@ -228,8 +218,9 @@ handles steps 5-11.
 5. Resolves DESTROY method via cache or `InheritanceResolver`.
 6. Handles AUTOLOAD: sets `$AUTOLOAD = "ClassName::DESTROY"`.
 7. Saves/restores `$@` around the call (DESTROY must not clobber `$@`).
-8. Builds a `$self` reference with the correct type (`GLOBREFERENCE` for
-   `RuntimeGlob`, then `HASHREFERENCE`/`ARRAYREFERENCE`/etc. -- note:
+8. Builds a `$self` reference with the correct type (`HASHREFERENCE` for
+   `RuntimeHash`, `ARRAYREFERENCE` for `RuntimeArray`, `GLOBREFERENCE` for
+   `RuntimeGlob`, then `SCALAR`/`CODE`/etc. -- note:
    `RuntimeGlob` is checked before `RuntimeScalar` because it is a subclass).
 9. Calls `RuntimeCode.apply(destroyMethod, args, VOID)`.
 10. **Cascading destruction:** After DESTROY returns, walks the destroyed
@@ -247,13 +238,12 @@ Equivalent to Perl 5's `FREETMPS` / mortal stack. Provides deferred refCount
 decrements at statement boundaries so that temporaries survive long enough to
 be used.
 
-**The `active` field:** A `boolean` that is always `true`. It is initialized to
-`true` and never changed. (Historically, it was lazily activated by the first
-`bless()` into a DESTROY class, but this was changed because birth-tracked
-objects like anonymous hashes and closures with captures need balanced refCount
-tracking from the start.) Most operations are guarded by cheap checks
-(`refCount >= 0`, `refCountOwned`, empty pending list) that make the overhead
-negligible for programs that don't use DESTROY.
+**The `active` field:** A `boolean` that is always `true`. Birth-tracked
+objects (anonymous hashes and closures with captures) need balanced refCount
+tracking from the start, so the mortal system cannot be lazily activated.
+Most operations are guarded by cheap checks (`refCount >= 0`,
+`refCountOwned`, empty pending list) that make the overhead negligible for
+programs that don't use DESTROY.
 
 **Pending list:** `ArrayList<RuntimeBase>` of referents awaiting decrement.
 
@@ -477,9 +467,9 @@ after the eval block completes, rather than waiting for GC. (eval STRING uses
 block.) Without this, weak refs inside eval blocks wouldn't be cleared until
 the next GC cycle.
 
-**Note:** `apply()` does NOT call `flush()` at the top of the method (this
-was removed). Flushing happens at statement boundaries via `setLarge()` and
-scoped `popAndFlush()` instead.
+**Note:** `apply()` does NOT call `flush()` at the top of the method.
+Flushing happens at statement boundaries via `setLarge()` and scoped
+`popAndFlush()` instead.
 
 ---
 
@@ -516,9 +506,9 @@ scoped `popAndFlush()` instead.
 
 ```perl
 our $cache;
-$cache = bless {}, 'Cached';    # refCount stays -1 (no DESTROY → untracked)
-weaken($weak = $cache);         # registers in WeakRefRegistry; refCount: -1 → -2 (WEAKLY_TRACKED)
-undef $cache;                   # undefine() sees WEAKLY_TRACKED → callDestroy()
+$cache = bless {}, 'Cached';    # refCount stays -1 (no DESTROY -> untracked)
+weaken($weak = $cache);         # registers in WeakRefRegistry; refCount: -1 -> -2 (WEAKLY_TRACKED)
+undef $cache;                   # undefine() sees WEAKLY_TRACKED -> callDestroy()
                                 # callDestroy() clears weak refs: $weak = undef
                                 # Matches Perl 5 behavior
 ```
@@ -527,9 +517,9 @@ Note: This is a heuristic. If multiple strong refs exist:
 ```perl
 my $a = [1,2,3];               # refCount: -1 (untracked array)
 my $b = $a;                    # refCount: still -1 (not tracked)
-weaken($weak = $a);            # refCount: -1 → -2 (WEAKLY_TRACKED)
-undef $a;                      # WEAKLY_TRACKED → callDestroy() → $weak = undef
-                               # $b still valid but $weak is gone — may be too eager
+weaken($weak = $a);            # refCount: -1 -> -2 (WEAKLY_TRACKED)
+undef $a;                      # WEAKLY_TRACKED -> callDestroy() -> $weak = undef
+                               # $b still valid but $weak is gone -- may be too eager
                                # Perl 5 would keep $weak alive since $b is still strong
 ```
 This over-eager clearing is accepted because unblessed objects have no
@@ -645,461 +635,6 @@ decrement per reference assignment), but this is by design.
 
 ---
 
-## Design History: WEAKLY_TRACKED Evolution
-
-**Date:** 2026-04-09
-**Current Status:** WEAKLY_TRACKED re-enabled for untracked objects (-1 → -2)
-with heuristic clearing via `undefine()`, `setLarge()`, and `scopeExitCleanup()`.
-
-The following sections document the design evolution. The current implementation
-combines elements of the original design, Refined Strategy A, and the heuristic
--1 → -2 transition added to fix `weaken_edge_cases.t` test 15.
-
-### Original Problem (qr-72922.t regression)
-
-The WEAKLY_TRACKED (-2) state causes **premature weak reference clearing**.
-When `undefine()` encounters a WEAKLY_TRACKED object, it unconditionally
-calls `callDestroy()`, clearing ALL weak refs — even when other strong
-references still exist.
-
-**Concrete failure (qr-72922.t):**
-```perl
-my $re = qr/abcdef/;           # R.refCount = -1 (untracked)
-my $re_copy1 = $re;            # still -1 (no tracking)
-my $re_weak_copy = $re;        # still -1
-weaken($re_weak_copy);         # R.refCount: -1 → -2 (WEAKLY_TRACKED)
-undef $re;                     # WEAKLY_TRACKED triggers callDestroy!
-# $re_weak_copy is now undef — WRONG, $re_copy1 is still a strong ref
-```
-
-Perl 5 behavior: `$re_weak_copy` remains valid because `$re_copy1` is
-still alive. The weak ref should only become undef when ALL strong refs
-are gone.
-
-**Root cause:** When `weaken()` transitions -1 → -2, the system loses
-track of how many strong refs exist. The `undefine()` heuristic
-("destroy on any undef") is incorrect when multiple strong refs exist.
-
-### Strategy Analysis
-
-Three strategies are evaluated below. All preserve correct behavior for
-**blessed-with-DESTROY objects** (which use the fully-tracked refCount
->= 0 path and are unaffected by WEAKLY_TRACKED changes).
-
-#### Strategy A: Eliminate WEAKLY_TRACKED entirely
-
-Remove the -2 state. `weaken()` only participates in refCount for
-objects that are already tracked (refCount >= 0).
-
-**Changes:**
-1. `weaken()` on untracked (-1): register in WeakRefRegistry only. No
-   refCount change. No `MortalList.active = true`.
-2. `weaken()` on tracked (>= 0): decrement refCount as today. Remove the
-   `blessId == 0` transition to WEAKLY_TRACKED (lines 79-88); keep the
-   refCount as-is after decrement.
-3. `undefine()`: remove the WEAKLY_TRACKED block (lines 1873-1877).
-4. `callDestroy()`: move `clearWeakRefsTo()` to AFTER the `className`
-   null check — only clear weak refs for blessed objects. For unblessed
-   objects (CODE refs), `releaseCaptures()` still fires but weak refs
-   are not cleared.
-5. `GlobalDestruction`: no change needed (already checks `refCount >= 0`).
-6. Remove or deprecate the `WEAKLY_TRACKED` constant.
-
-**State machine (simplified):**
-```
-    -1 ──────────────────────────────────► 0
- (untracked)    bless into DESTROY class   (birth-tracked)
-                                            │
-                                            │ setLarge()
-                                            ▼
-                                           1+
-                                        (N strong refs)
-                                            │
-                                            │ last strong ref dropped
-                                            ▼
-                                         MIN_VALUE
-                                         (destroyed: DESTROY + clearWeakRefsTo)
-```
-
-**Pros:**
-- Simplest design. Eliminates an entire state and all its special cases.
-- Fixes qr-72922.t (weak refs survive because undefine() doesn't clear
-  them for untracked objects).
-- Zero risk to Moo (841/841) — blessed-with-DESTROY objects are on the
-  refCount >= 0 path, completely unaffected.
-
-**Cons:**
-- Weak refs to non-DESTROY objects (unblessed or blessed-without-DESTROY)
-  are never cleared deterministically. In Perl 5 they become undef when
-  the last strong ref is dropped. In PerlOnJava they persist forever
-  (still valid, still dereferenceable).
-- Risk on Path B removal: unblessed tracked objects (CODE refs from
-  `makeCodeObject` with `MortalList.active`) may see premature clearing
-  if refCount undercounts due to closure captures bypassing `setLarge()`.
-  Mitigated by point 4 (clearWeakRefsTo only for blessed objects).
-
-**Test plan:**
-1. Run `make` — must pass.
-2. Run `perl dev/tools/perl_test_runner.pl perl5_t/t/re/qr-72922.t` —
-   should recover from 5/14 to 10/14 (matching master).
-3. Run Moo full suite — must remain 841/841.
-4. Run `make test-all` — no new regressions.
-5. Run `perl dev/tools/perl_test_runner.pl perl5_t/t/op/die_keeperr.t` —
-   should recover from 6/15 to 15/15 (with the warning format fix).
-
-#### Strategy B: Keep WEAKLY_TRACKED but skip clearing on undef
-
-Keep the -2 state for registry purposes but remove the destruction
-trigger from `undefine()`.
-
-**Changes:**
-1. `undefine()`: remove the WEAKLY_TRACKED block (lines 1873-1877).
-2. `callDestroy()`: move `clearWeakRefsTo()` after className check.
-3. Keep the -2 transition in `weaken()` and the `MortalList.active = true`.
-
-**Pros:**
-- Minimal code change (only 2 sites).
-- Fixes qr-72922.t (undef no longer clears WEAKLY_TRACKED weak refs).
-
-**Cons:**
-- WEAKLY_TRACKED state still exists but is now "dead code" — the only
-  place that acted on it (undefine) no longer does. The state adds
-  complexity without benefit.
-- Still sets `MortalList.active = true` on `weaken()` for untracked
-  objects, adding overhead for programs that use `weaken()` without
-  DESTROY.
-
-**Test plan:** Same as Strategy A.
-
-#### Strategy C: Deferred clearing via Java WeakReference sentinel (future)
-
-Use a Java `WeakReference` + `ReferenceQueue` to detect when the last
-strong Perl reference to an untracked object is dropped.
-
-**Sketch:**
-1. When `weaken()` is called on an untracked object, create a sentinel
-   Java object.
-2. All "strong" Perl scalars that reference this object also hold a
-   strong Java ref to the sentinel.
-3. The Perl "weak" scalars do NOT hold the sentinel.
-4. Register a Java `WeakReference<Sentinel>` on a `ReferenceQueue`.
-5. When all strong Perl scalars drop their ref (via undef, scope exit,
-   reassignment), the sentinel becomes unreachable, the WeakReference
-   is enqueued, and we poll the queue to clear Perl-level weak refs.
-
-**Pros:**
-- Most Perl 5-compliant: weak refs to unblessed objects are cleared
-  when all strong refs are truly gone.
-- Deterministic within one GC cycle (not immediate, but timely).
-
-**Cons:**
-- High implementation complexity. Requires modifying `RuntimeScalar`
-  to hold sentinel refs, `setLarge()` to propagate sentinels.
-- Clearing is NOT immediate (depends on JVM GC timing), which is a
-  semantic difference from Perl 5.
-- Adds per-reference memory overhead (sentinel objects).
-- May interact poorly with JVM GC pauses.
-
-**Test plan:** Same as A/B, plus timing-sensitive tests for sentinel
-clearing (would need `System.gc()` hints in tests).
-
-### Experimental Results: Strategy A (2026-04-09)
-
-Strategy A was implemented on the `feature/eliminate-weakly-tracked` branch
-and tested end-to-end. Results:
-
-#### What worked
-
-- **`make` passes**: All unit tests pass EXCEPT `weaken_edge_cases.t` test 15.
-- **qr-72922.t**: Recovered from 5/14 to **10/14** (matches master). The
-  premature clearing regression is fully fixed.
-- **die_keeperr.t**: 15/15 with the separate warning format fix in
-  DestroyDispatch.java (already committed).
-- **Blessed-with-DESTROY objects**: Completely unaffected. The refCount >= 0
-  path is unchanged by Strategy A.
-
-#### What failed
-
-**`weaken_edge_cases.t` test 15** ("nested weak array element becomes undef"):
-
-```perl
-my $strong = [1, 2, 3];          # unblessed array, refCount = -1
-my @nested;
-$nested[0][0] = $strong;         # refCount still -1 (untracked)
-weaken($nested[0][0]);           # Strategy A: register only, no refCount change
-undef $strong;                   # Strategy A: no action for untracked
-ok(!defined($nested[0][0]), ...);  # FAILS: weak ref still valid
-```
-
-**Root cause: Hash/Array Birth-Tracking Asymmetry.**
-
-`RuntimeHash.createReferenceWithTrackedElements()` sets `refCount = 0`
-for anonymous hashes, making them birth-tracked. This means `weaken()` on
-unblessed hash refs works correctly — the refCount path handles everything.
-
-`RuntimeArray.createReferenceWithTrackedElements()` does **NOT** set
-`refCount = 0`. Arrays stay at -1 (untracked). This means `weaken()` on
-unblessed array refs cannot detect when the last strong ref is dropped.
-
-**Why arrays differ:** Adding `this.refCount = 0` to RuntimeArray was
-tested and caused **54/839 Moo subtest failures** across 7 test files:
-- accessor-coerce, accessor-default, accessor-isa, accessor-trigger,
-  accessor-weaken, overloaded-coderefs, method-generate-accessor
-
-**Root cause of Moo failures:** Sub::Quote closures capture arrays by
-sharing the RuntimeScalar variable (via `captureCount`). This capture
-does NOT go through `setLarge()`, so refCount is never incremented for
-the captured reference. When the original strong ref drops, refCount hits
-0 even though the closure still holds a valid reference → premature
-DESTROY.
-
-Hash refs avoid this problem because Moo's usage patterns don't capture
-hash refs in the same way, or because hash captures coincidentally go
-through setLarge().
-
-#### Strategy A Summary
-
-| Test suite | Result | Notes |
-|------------|--------|-------|
-| `make` (unit tests) | PASS (except 1) | weaken_edge_cases.t #15 |
-| qr-72922.t | 10/14 (matches master) | Regression fixed |
-| die_keeperr.t | 15/15 | With warning format fix |
-| Moo (without array tracking) | Not re-tested | Expected same as master |
-| Moo (WITH array tracking) | 54/839 failures | Array birth-tracking breaks closures |
-| **Moo (pure Strategy A, no blessId==0 safety)** | **54/841 failures** | Removing blessId==0→WEAKLY_TRACKED also breaks Moo |
-
-**Critical finding:** Removing the `blessId == 0 → WEAKLY_TRACKED`
-transition in `weaken()` causes the same 54/841 Moo failures even
-without array birth-tracking. This transition is a safety valve for
-Sub::Quote closures that capture birth-tracked unblessed objects.
-
-### Refined Strategy A (Intermediate Step)
-
-Instead of eliminating WEAKLY_TRACKED entirely, **only remove transition
-#1** (untracked → WEAKLY_TRACKED) while **keeping transition #2**
-(unblessed tracked → WEAKLY_TRACKED):
-
-**Changes from original code (2 lines in weaken() only):**
-
-```java
-// OLD: weaken() on untracked object
-if (base.refCount == -1) {
-    MortalList.active = true;          // REMOVED
-    base.refCount = WEAKLY_TRACKED;    // REMOVED
-}
-// NEW: no action for untracked objects — just register in WeakRefRegistry
-```
-
-The `blessId == 0 → WEAKLY_TRACKED` transition in the `refCount > 0`
-branch is preserved unchanged. The WEAKLY_TRACKED handling in
-`undefine()` is preserved unchanged.
-
-**Result:**
-
-| Test suite | Result |
-|------------|--------|
-| `make` (unit tests) | PASS (except weaken_edge_cases.t #15) |
-| qr-72922.t | 10/14 (matches master) |
-| Moo | **841/841 PASS** |
-
-### Final Implementation: Heuristic -1 → -2 Transition (Current)
-
-Refined Strategy A left `weaken_edge_cases.t` test 15 failing ("nested weak
-array element becomes undef"). The fix: **re-add the -1 → -2 transition**
-but with important differences from the original design:
-
-1. **`MortalList.active` always true**: The mortal system is always on
-   (required for birth-tracked objects). The -1 → -2 transition does not
-   change this.
-2. **Heuristic clearing only in `undefine()`**: Only explicit `undef`
-   triggers WEAKLY_TRACKED clearing. `setLarge()` and `scopeExitCleanup()`
-   do NOT clear WEAKLY_TRACKED objects — clearing on overwrite/scope-exit
-   was too aggressive (broke Sub::Quote/Moo when closures capture copies).
-3. **`refCountOwned = false`**: The weak scalar's `refCountOwned` is cleared
-   so it doesn't trigger spurious decrements.
-4. **CODE refs excluded**: `weaken()` on a CODE ref does NOT transition
-   to WEAKLY_TRACKED (stash refs bypass setLarge, making refcounting
-   unreliable). CODE refs are also skipped in `clearWeakRefsTo()`.
-
-**Changes in `WeakRefRegistry.weaken()`:**
-```java
-} else if (base.refCount == -1 && !(base instanceof RuntimeCode)) {
-    // Heuristic: transition to WEAKLY_TRACKED so that undefine()
-    // can clear weak refs when a strong reference is dropped.
-    // CODE refs excluded: stash refs bypass setLarge().
-    ref.refCountOwned = false;
-    base.refCount = WEAKLY_TRACKED;  // -2
-}
-```
-
-**Changes in `RuntimeScalar.setLarge()`:**
-```java
-// WEAKLY_TRACKED objects: do NOT clear weak refs on overwrite.
-// Overwriting ONE reference doesn't mean no other strong refs exist.
-// Weak refs for WEAKLY_TRACKED objects are cleared only via undefine().
-```
-
-**Changes in `RuntimeScalar.scopeExitCleanup()`:**
-```java
-// WEAKLY_TRACKED objects: do NOT clear weak refs on scope exit.
-// Scope exit of ONE reference doesn't mean no other strong refs exist.
-// Weak refs for WEAKLY_TRACKED objects are cleared only via undefine().
-```
-
-**Changes in `WeakRefRegistry.clearWeakRefsTo()`:**
-```java
-// Skip clearing weak refs to CODE objects. Stash refs bypass setLarge(),
-// causing false refCount==0 via mortal flush.
-if (referent instanceof RuntimeCode) return;
-```
-
-**Result:**
-
-| Test suite | Result |
-|------------|--------|
-| `make` (unit tests) | **ALL PASS** (including weaken_edge_cases.t all 42) |
-| weaken.t | 34/34 PASS |
-| qr-72922.t | 10/14 (matches master) |
-| Moo | **841/841 PASS** |
-
-**Trade-off:** The heuristic may clear weak refs too eagerly when multiple
-strong refs exist to the same untracked object (since we never counted them).
-This is acceptable because unblessed objects have no DESTROY, so the only
-effect is the weak ref becoming `undef` earlier than Perl 5 would.
-
-### Blast Radius Analysis: Java WeakReference Approach
-
-An alternative to refCount-based tracking is to use Java's own
-`WeakReference<RuntimeBase>` for Perl weak refs to untracked objects.
-The JVM GC would detect when no strong Java references remain and clear
-the weak ref automatically.
-
-**The fundamental requirement:** The Perl weak scalar must NOT hold a
-strong Java reference to the referent. Currently, `RuntimeScalar.value`
-is a strong `Object` reference — changing this for weak scalars means
-changing how every dereference site accesses the referent.
-
-**Measured blast radius:**
-
-| Scope | Cast/instanceof sites | Files |
-|-------|-----------------------|-------|
-| RuntimeScalar.java internal | 46 | 1 |
-| External codebase | 303 | 63 |
-| **Total** | **349** | **64** |
-
-Top-impacted files: RuntimeCode.java (36), RuntimeScalar.java (33),
-ModuleOperators.java (32), RuntimeGlob.java (17), ReferenceOperators.java (15).
-
-There are **zero existing accessor methods** (`getReferent()`, `asHash()`,
-etc.) — every consumer casts `scalar.value` directly. This means either:
-
-1. **Option 1:** Modify all 349 sites to check for WeakReference.
-   Extremely high risk, touches most of the runtime.
-2. **Option 2:** Add accessor methods first (separate refactoring), then
-   change the internal representation behind the accessor. Two-phase
-   approach but lower risk per phase.
-3. **Option 3:** Use a side-channel mechanism (e.g., `PhantomReference` +
-   `ReferenceQueue`) that doesn't require changing `value` storage. But
-   this doesn't work because the `value` field still holds a strong ref.
-
-**Conclusion:** Java WeakReference is architecturally clean but requires
-a prerequisite refactoring (accessor methods) before it's feasible. This
-is a future enhancement, not an immediate fix.
-
-### Strategy D: Java WeakReference via Accessor Refactoring (Future)
-
-**Phase 1 prerequisite:** Introduce accessor methods on RuntimeScalar:
-```java
-public RuntimeBase getReferentBase() { ... }
-public RuntimeHash  getHashReferent() { ... }
-public RuntimeArray getArrayReferent() { ... }
-public RuntimeCode  getCodeReferent() { ... }
-```
-Refactor all 349 cast sites to use these accessors. This is a pure
-refactoring with no behavioral change.
-
-**Phase 2:** Inside the accessors, check for a Java WeakReference:
-```java
-public RuntimeBase getReferentBase() {
-    if (javaWeakRef != null) {
-        RuntimeBase ref = javaWeakRef.get();
-        if (ref == null) {
-            // JVM GC collected the referent — clear this weak ref
-            this.type = RuntimeScalarType.UNDEF;
-            this.value = null;
-            this.javaWeakRef = null;
-            return null;
-        }
-        return ref;
-    }
-    return (RuntimeBase) value;
-}
-```
-
-**Phase 3:** In `weaken()`, for untracked objects:
-- Set `value = null` (remove strong Java reference)
-- Set `javaWeakRef = new WeakReference<>(referent)`
-- On dereference, the accessor checks the WeakReference
-
-**Pros:** Handles ALL objects (DESTROY via refCount, non-DESTROY via JVM
-GC). Eliminates WEAKLY_TRACKED entirely. Zero overhead for non-weak refs.
-
-**Cons:** Clearing is GC-dependent (not immediate like Perl 5). Requires
-prerequisite refactoring. Adds 8 bytes (WeakReference field) to every
-RuntimeScalar.
-
-### Strategy E: Fix Array Closure Capture (Targeted)
-
-Instead of Java WeakReference, fix the root cause of the hash/array
-asymmetry: make closure captures properly track refCount for arrays.
-
-**Approach:** When a closure captures a variable that holds a reference,
-increment the referent's refCount (like setLarge does). When
-`releaseCaptures()` fires, decrement it.
-
-**This is narrower than Strategy D** — it only fixes the array case,
-not the general "weak ref to non-DESTROY object" case. But it would:
-- Allow array birth-tracking without breaking Moo closures
-- Make `weaken_edge_cases.t` test 15 pass
-- Keep the simple refCount model without JVM GC dependency
-
-**Risk:** Closure capture paths are in codegen (EmitterVisitor), which
-is a high-risk area. Needs careful testing.
-
-### Revised Recommendation (Updated)
-
-The heuristic -1 → -2 transition (current implementation) resolves both the
-qr-72922.t regression and the weaken_edge_cases.t test 15 failure. The
-previous `blessId == 0 → WEAKLY_TRACKED` safety valve has been removed
-(it caused premature clearing when closures captured copies). WEAKLY_TRACKED
-now only applies to untracked non-CODE objects.
-
-**Accepted trade-off:** Weak refs to untracked objects may be cleared too
-eagerly when one strong ref is undef'd while others exist. This affects only
-unblessed objects (no DESTROY), so the impact is limited to the weak ref
-becoming undef slightly earlier than Perl 5 would.
-
-**Future work (if needed):**
-
-1. **Strategy E** (fix array closure capture) — Would allow precise refCount
-   tracking for arrays, eliminating the need for WEAKLY_TRACKED heuristics.
-2. **Strategy D** (Java WeakReference via accessor refactoring) — Full
-   Perl 5 compliance for all weak ref cases. Higher effort but
-   architecturally clean.
-
-### Regression Classification (2026-04-09)
-
-| Test file | Delta | DESTROY/weaken related? | Strategy A fixes? |
-|-----------|-------|------------------------|-------------------|
-| die_keeperr.t | -9 | Yes (warning format) | Yes (separate fix already applied) |
-| qr-72922.t | -5 | Yes (WEAKLY_TRACKED premature clearing) | Yes |
-| substr_left.t | -1 | Possibly (MortalList.flush timing in tied STORE) | Needs testing |
-| eval.t | -1 | Possibly (TIEARRAY + eval + last interaction) | Needs testing |
-| runlevel.t | -1 | Possibly (bless in tie constructors) | Needs testing |
-| array.t | -8 | No (arylen magic, `$#{@array}` syntax, @_ aliasing) | No — separate investigation needed |
-
----
-
 ## Limitations & Known Issues
 
 1. **Weak refs to non-DESTROY objects: heuristic clearing.**
@@ -1119,11 +654,11 @@ becoming undef slightly earlier than Perl 5 would.
 2. **Hash/Array birth-tracking asymmetry.** Anonymous hashes (`{...}`) are
    birth-tracked (`refCount = 0` in `createReferenceWithTrackedElements`),
    so `weaken()` works precisely for unblessed hash refs via the refCount
-   path. Anonymous arrays (`[...]`) are **not** birth-tracked — they start
+   path. Anonymous arrays (`[...]`) are **not** birth-tracked -- they start
    at -1 and rely on the WEAKLY_TRACKED heuristic (see limitation 1).
    Adding array birth-tracking breaks Moo because Sub::Quote closure
    captures bypass `setLarge()`, causing refCount undercounting and
-   premature destruction. See "Strategy E" for the fix proposal.
+   premature destruction.
 
 3. **Global variables bypass `setLarge()`.** Stash slots are assigned via
    `GlobalVariable` infrastructure, which doesn't always go through the
@@ -1137,10 +672,10 @@ becoming undef slightly earlier than Perl 5 would.
 5. **Single-threaded.** The refCount system is not thread-safe. This matches
    PerlOnJava's current single-threaded execution model.
 
-6. **349 dereference sites access `value` directly.** There are zero accessor
+6. **Dereference sites access `value` directly.** There are zero accessor
    methods for `RuntimeScalar.value` in reference context. This makes it
    infeasible to change how weak references store their referent without a
-   prerequisite refactoring to introduce accessors (see "Strategy D").
+   prerequisite refactoring to introduce accessors.
 
 ---
 
@@ -1153,7 +688,7 @@ Tests are organized in three tiers:
 | `src/test/resources/unit/destroy.t` | 1 file, 14 subtests | Basic DESTROY semantics: scope exit, multiple refs, exceptions, inheritance, re-bless, void-context delete, untie DESTROY (immediate and deferred) |
 | `src/test/resources/unit/weaken.t` | 1 file, 4 subtests | Basic weaken: isweak flag, weak ref access, copy semantics, weaken+DESTROY interaction |
 | `src/test/resources/unit/refcount/` | 8 files | Comprehensive: circular refs, self-refs, tree structures, return values, inheritance chains, edge cases (weaken on non-ref, resurrection, closures, deeply nested structures, multiple simultaneous weak refs) |
-| `src/test/resources/unit/refcount/weaken_edge_cases.t` | 42 subtests | Edge cases: nested weak refs, WEAKLY_TRACKED heuristic, multiple strong refs, scope exit clearing |
+| `src/test/resources/unit/refcount/weaken_edge_cases.t` | 34 subtests | Edge cases: nested weak refs, WEAKLY_TRACKED heuristic, multiple strong refs, scope exit clearing |
 
 Integration coverage via Moo test suite: **841/841 subtests across 71 test files.**
 
@@ -1161,6 +696,6 @@ Integration coverage via Moo test suite: **841/841 subtests across 71 test files
 
 ## See Also
 
-- [dev/design/destroy_weaken_plan.md](../design/destroy_weaken_plan.md) -- Original design document with implementation history
+- [dev/design/destroy_weaken_plan.md](../design/destroy_weaken_plan.md) -- Design document with implementation history, strategy analysis, and evolution of the WEAKLY_TRACKED design
 - [dev/modules/moo.md](../modules/moo.md) -- Moo test tracking and category-by-category fix log
 - [dev/architecture/dynamic-scope.md](dynamic-scope.md) -- Dynamic scoping (related: `local` interacts with refCount via `DynamicVariableManager`)
diff --git a/dev/architecture/inline-cache.md b/dev/design/inline-cache.md
similarity index 97%
rename from dev/architecture/inline-cache.md
rename to dev/design/inline-cache.md
index d7afc54dc..ae0bbbb69 100644
--- a/dev/architecture/inline-cache.md
+++ b/dev/design/inline-cache.md
@@ -1,5 +1,7 @@
 # Inline Caching in PerlOnJava: Technical Implementation Guide
 
+**Status:** PARTIALLY IMPLEMENTED - A runtime global inline cache (4096 entries in `RuntimeCode.java`) is implemented using callsite IDs allocated at compile time. The per-call-site static field approach and INVOKEDYNAMIC variant described below remain unimplemented design alternatives.
+
 ## How It Works
 
 PerlOnJava compiles Perl code to JVM bytecode using the ASM library. When you write:
diff --git a/dev/architecture/method-call-optimization.md b/dev/design/method-call-optimization.md
similarity index 97%
rename from dev/architecture/method-call-optimization.md
rename to dev/design/method-call-optimization.md
index 61a4c0e4f..85782ade4 100644
--- a/dev/architecture/method-call-optimization.md
+++ b/dev/design/method-call-optimization.md
@@ -1,5 +1,7 @@
 # Method Call Performance Optimization Plan
 
+**Status:** PARTIALLY IMPLEMENTED - Phase 1 (inline caching) is implemented as a runtime global cache in `RuntimeCode.java`. Phases 2-3 (fast hash access, method-specific optimizations) remain unimplemented. Some references below (e.g. `SpillSlotManager`, `RuntimeArrayPool`, `bench_method.pl`) are to planned components that were never created.
+
 **Goal**: Achieve >340 iterations/sec on `dev/bench/bench_method.pl` (matching or exceeding native Perl performance)
 
 **Current Status**: 119 iter/sec (2.87x slower than target)
diff --git a/dev/prompts/architecture-docs-audit.md b/dev/prompts/architecture-docs-audit.md
new file mode 100644
index 000000000..176bfea2f
--- /dev/null
+++ b/dev/prompts/architecture-docs-audit.md
@@ -0,0 +1,193 @@
+# Architecture Documentation Audit & Fix Plan
+
+## Context
+Audited all 6 architecture docs + README against the actual codebase. Found discrepancies ranging from minor (parameter names) to major (entire described architecture doesn't exist).
+
+## Priority 1: Factually Wrong Status Labels
+
+### 1a. inline-cache.md — IS implemented, not "NOT YET IMPLEMENTED"
+- `RuntimeCode.java` has a 4096-entry inline cache (`inlineCacheBlessId`, `inlineCacheMethodHash`, `inlineCacheCode` arrays)
+- Cache invalidation via `clearInlineMethodCache()` exists
+- Remove "NOT YET IMPLEMENTED" from `dev/design/inline-cache.md` line 3
+- Update README.md table entry (line 33) to reflect partial implementation
+- Review whether `dev/design/method-call-optimization.md` Phase 1 should also be updated
+
+### 1b. method-call-optimization.md — Phase 1 (inline cache) is implemented
+- At minimum note that Phase 1 is done; Phases 2-3 remain unimplemented
+- The `SpillSlotManager` and `RuntimeArrayPool` references are to non-existent classes — remove or note
+- `bench_method.pl` doesn't exist — remove reference or note
+
+## Priority 2: large-code-refactoring.md — Describes Non-Existent Architecture
+
+The doc says: compile -> fail with MethodTooLargeException -> catch -> enable refactoring -> retry.
+Reality: compile -> fail -> fall back to interpreter.
+
+Specific issues:
+- The "reactive retry" flow (lines 44-66, 97-127) does not exist
+- `forceRefactorForCodegen(BlockNode node, boolean isAutoRetry)` — dead code, wrong signature (no `isAutoRetry` param)
+- `IS_REFACTORING_ENABLED` field doesn't exist
+- `MAX_REFACTOR_ATTEMPTS` is 3, not 1
+- User messages wrong ("retrying with automatic refactoring" doesn't exist)
+- Doc omits actual mechanism: interpreter fallback via `USE_INTERPRETER_FALLBACK`
+- Doc omits `DepthFirstLiteralRefactorVisitor` (proactive literal refactoring)
+- Doc omits `LargeBlockRefactorer.processBlock()` called from `EmitBlock` (proactive size check)
+
+Action: Rewrite to describe what actually exists:
+1. Proactive: `LargeBlockRefactorer.processBlock()` estimates size, wraps large blocks in closures
+2. Proactive: `DepthFirstLiteralRefactorVisitor` splits large list/array/hash literals
+3. Fallback: `MethodTooLargeException` -> interpreter backend
+4. Note `forceRefactorForCodegen` as dead code candidate for removal
+
+## Priority 3: control-flow.md — Wrong Paths, Missing Feature, Wrong Bytecode
+
+### 3a. All 6 file paths wrong
+| Doc Path | Actual Path |
+|----------|-------------|
+| `runtime/ControlFlowType.java` | `runtime/runtimetypes/ControlFlowType.java` |
+| `runtime/ControlFlowMarker.java` | `runtime/runtimetypes/ControlFlowMarker.java` |
+| `runtime/RuntimeControlFlowList.java` | `runtime/runtimetypes/RuntimeControlFlowList.java` |
+| `codegen/EmitControlFlow.java` | `backend/jvm/EmitControlFlow.java` |
+| `codegen/EmitSubroutine.java` | `backend/jvm/EmitSubroutine.java` |
+| `codegen/EmitterMethodCreator.java` | `backend/jvm/EmitterMethodCreator.java` |
+
+### 3b. ControlFlowType enum wrong
+- Enum values don't take ordinal arguments `(0)`, `(1)` — they're plain enum values
+- Missing `RETURN` (6th variant for non-local return from map/grep)
+
+### 3c. ControlFlowMarker fields incomplete
+- Missing `codeRef` (RuntimeScalar) and `args` (RuntimeArray) fields
+
+### 3d. RuntimeControlFlowList fields wrong
+- `tailCallCodeRef`/`tailCallArgs` are NOT direct fields — accessed via `marker.codeRef`/`marker.args`
+- Missing `returnValue` field (carries return value for RETURN type)
+
+### 3e. TABLESWITCH at returnLabel doesn't exist
+- No TABLESWITCH anywhere in control flow dispatch
+- returnLabel just checks `isNonLocalGoto()`, handles eval-specific error, returns
+- Tail call trampoline is at each call site in EmitSubroutine.java, NOT at returnLabel
+
+### 3f. Example 2 bytecode wrong
+- Uses `INVOKEVIRTUAL isNonLocalGoto()`, not `INSTANCEOF RuntimeControlFlowList`
+- Jump target is `blockDispatcher`, not `returnLabel`
+
+### 3g. Missing feature: RETURN / non-local return from map/grep
+- `ControlFlowType.RETURN`
+- `RuntimeControlFlowList.returnValue`
+- `PerlNonLocalReturnException`
+- `JavaClassInfo.isMapGrepBlock`
+- Special RETURN-handling in `emitBlockDispatcher` and `EmitterMethodCreator`
+
+### 3h. Missing feature flag: `ENABLE_TAGGED_RETURNS`
+
+### 3i. "Why Centralized TABLESWITCH" section (lines 440-452) misleading
+- Actual approach uses sequential `IF_ICMPNE`/`IF_ICMPGT` chain, not TABLESWITCH
+
+## Priority 4: block-dispatcher-optimization.md
+
+### 4a. Per-call-site bytecode size wrong
+- Doc says ~20 bytes. Actual is ~100-150 bytes (tailcall trampoline at every call site)
+- All savings calculations in the table are based on wrong per-site size
+
+### 4b. Call-site pseudocode omits tailcall trampoline
+- Between `IFEQ notControlFlow` and `GOTO blockDispatcher` there are ~30 instructions for tailcall handling
+
+### 4c. Missing implementation file
+- `Dereference.java` (lines 982-1109) has a full copy of the block-dispatcher pattern — not listed
+
+### 4d. Wrong doc reference
+- `CONTROL_FLOW_IMPLEMENTATION.md` doesn't exist; actual file is `control-flow.md`
+
+### 4e. Block dispatcher pseudocode omits RETURN handling
+- Higher-ordinal handling (GOTO=3, TAILCALL=4, RETURN=5) not shown
+
+## Priority 5: lexical-pragmas.md
+
+### 5a. Stack types wrong
+- Doc says `Deque<>` / `ArrayDeque<>`. Actual is `Stack<>`
+
+### 5b. Missing stacks
+- `warningDisabledStack` and `warningFatalStack` not documented
+
+### 5c. StrictOptions class doesn't exist
+- Constants are in `Strict.java` with names `HINT_STRICT_REFS/SUBS/VARS` and values `0x2/0x200/0x400`
+
+### 5d. CompilerFlagNode missing fields
+- Missing `warningFatalFlags`, `warningDisabledFlags`, `hintHashSnapshotId`
+
+### 5e. enterScope() return type
+- Returns `int`, not `void`
+
+### 5f. enableWarning pseudocode wrong
+- No `getCategoryBit()` or `currentFlags` field; actual delegates to `ScopedSymbolTable.enableWarningCategory()`
+
+### 5g. warnIf() pseudocode drastically simplified
+- Actual uses caller()[9] bits as primary mechanism, not just `${^WARNING_SCOPE}`
+
+## Priority 6: dynamic-scope.md
+
+### 6a. Missing implementors
+- Add `GlobalRuntimeArray`, `GlobalRuntimeHash` to implementations list
+
+### 6b. pushLocalVariable overloads
+- Doc shows 1 overload; actual has 4 (RuntimeBase, RuntimeScalar, RuntimeGlob with special behavior, DynamicState)
+
+### 6c. RuntimeScalar save missing blessId and reset-to-undef
+- `dynamicSaveState()` also saves `blessId` and clears scalar to undef after
+
+### 6d. DeferBlock wrong
+- Field is `RuntimeScalar codeRef`, not `RuntimeCode code`
+- Missing `capturedArgs` field
+- Method call is `RuntimeCode.apply(codeRef, capturedArgs, VOID)`, not `code.apply(...)`
+
+### 6e. Files table missing entries
+- Add `GlobalRuntimeArray.java`, `GlobalRuntimeHash.java`, `GlobalRuntimeScalar.java`, `RuntimeBase.java`
+
+## Priority 7: weaken-destroy.md
+
+### 7a. RuntimeArray pop/shift claim wrong
+- Doc says pop()/shift() have deferred decrement — they don't
+
+### 7b. Code ref birth-tracking path
+- Should say "closures with captures via `makeCodeObject()`", not "via `createReferenceWithTrackedElements()`"
+
+### 7c. doCallDestroy type check order
+- Actual order: Hash -> Array -> Glob -> Scalar -> Code (not Glob-first as doc implies)
+
+### 7d. Test count
+- `weaken_edge_cases.t` has 34 assertions, not 42
+
+## Priority 8: README.md
+
+### 8a. Package names
+- Add `org.perlonjava.app` (CLI entry point, JSR-223 ScriptEngine)
+- Add `org.perlonjava.core` (Configuration)
+
+### 8b. Runtime types
+- Add `RuntimeList` to the list
+
+### 8c. LexerToken
+- "LexerTokens" -> "LexerToken" (singular class name)
+
+### 8d. Inline cache status
+- Update table entry for inline-cache.md
+
+## Priority 9: Stale Code Comments (Java source)
+
+Found during weaken-destroy.md audit:
+- `WeakRefRegistry.java:25-38` — WEAKLY_TRACKED Javadoc describes superseded Strategy A
+- `RuntimeScalar.java:2210-2212` — stale comment about setLarge clearing WEAKLY_TRACKED weak refs
+- `DestroyDispatch.java:162` — stale comment about apply() calling flush()
+
+## Execution Order
+
+1. Fix inline-cache status labels (Priority 1) — quick
+2. Rewrite large-code-refactoring.md (Priority 2) — needs careful work
+3. Fix control-flow.md (Priority 3) — many targeted edits
+4. Fix block-dispatcher-optimization.md (Priority 4) — moderate edits
+5. Fix lexical-pragmas.md (Priority 5) — moderate edits
+6. Fix dynamic-scope.md (Priority 6) — moderate edits
+7. Fix weaken-destroy.md (Priority 7) — small edits
+8. Fix README.md (Priority 8) — small edits
+9. Fix stale Java comments (Priority 9) — small edits
+10. Run `make check-links` to verify
+11. Run `make` to ensure nothing broken
diff --git a/docs/reference/architecture.md b/docs/reference/architecture.md
index 6b414c2b1..98c86528d 100644
--- a/docs/reference/architecture.md
+++ b/docs/reference/architecture.md
@@ -230,6 +230,13 @@ Tests use Perl's TAP (Test Anything Protocol) format and are executed via JUnit
 
 ## Related Documentation
 
-- `dev/design/interpreter.md` - Interpreter design details
-- `dev/design/shared_ast_transformer.md` - AST normalization
-- `dev/custom_bytecode/` - Bytecode VM documentation
+- `dev/architecture/` - Deep-dive architecture documents for contributors:
+  - [Overview and index](../../dev/architecture/README.md)
+  - [DESTROY and weak references](../../dev/architecture/weaken-destroy.md) - Cooperative reference counting overlay
+  - [Dynamic scoping](../../dev/architecture/dynamic-scope.md) - `local` and DynamicVariableManager
+  - [Lexical pragmas](../../dev/architecture/lexical-pragmas.md) - Warnings, strict, and features
+  - [Control flow](../../dev/architecture/control-flow.md) - die/eval, loop control, exceptions
+- `dev/design/` - Design documents and implementation plans:
+  - [Interpreter design](../../dev/design/interpreter.md)
+  - [Variables and values](../../dev/design/variables_and_values.md)
+  - [Shared AST transformer](../../dev/design/shared_ast_transformer.md)
diff --git a/src/main/java/org/perlonjava/core/Configuration.java b/src/main/java/org/perlonjava/core/Configuration.java
index 4c66bcad9..ff19793f3 100644
--- a/src/main/java/org/perlonjava/core/Configuration.java
+++ b/src/main/java/org/perlonjava/core/Configuration.java
@@ -33,7 +33,7 @@ public final class Configuration {
      * Automatically populated by Gradle/Maven during build.
      * DO NOT EDIT MANUALLY - this value is replaced at build time.
      */
-    public static final String gitCommitId = "97ec12b8b";
+    public static final String gitCommitId = "f68479a46";
 
     /**
      * Git commit date of the build (ISO format: YYYY-MM-DD).
@@ -48,7 +48,7 @@ public final class Configuration {
      * Parsed by App::perlbrew and other tools via: perl -V | grep "Compiled at"
      * DO NOT EDIT MANUALLY - this value is replaced at build time.
      */
-    public static final String buildTimestamp = "Apr 10 2026 20:53:01";
+    public static final String buildTimestamp = "Apr 10 2026 21:43:26";
 
     // Prevent instantiation
     private Configuration() {
diff --git a/src/main/java/org/perlonjava/runtime/runtimetypes/DestroyDispatch.java b/src/main/java/org/perlonjava/runtime/runtimetypes/DestroyDispatch.java
index 3e28e6df9..e80f8f8c5 100644
--- a/src/main/java/org/perlonjava/runtime/runtimetypes/DestroyDispatch.java
+++ b/src/main/java/org/perlonjava/runtime/runtimetypes/DestroyDispatch.java
@@ -158,11 +158,6 @@ private static void doCallDestroy(RuntimeBase referent, String className) {
             // internal fields for any blessed references and defer their refCount
             // decrements. This ensures nested objects (e.g., $self->{inner}) are
             // destroyed when their parent is destroyed.
-            // 
-            // Note: RuntimeCode.apply() calls MortalList.flush() at the top, which
-            // clears all pending entries. So we must walk AFTER apply returns and
-            // process the cascading entries immediately (flush them inline) rather
-            // than relying on the caller's popAndFlush loop to pick them up.
             if (referent instanceof RuntimeHash hash) {
                 MortalList.scopeExitCleanupHash(hash);
                 MortalList.flush();
diff --git a/src/main/java/org/perlonjava/runtime/runtimetypes/RuntimeScalar.java b/src/main/java/org/perlonjava/runtime/runtimetypes/RuntimeScalar.java
index 3516dd51f..88c7ef55a 100644
--- a/src/main/java/org/perlonjava/runtime/runtimetypes/RuntimeScalar.java
+++ b/src/main/java/org/perlonjava/runtime/runtimetypes/RuntimeScalar.java
@@ -2209,7 +2209,6 @@ public static void scopeExitCleanup(RuntimeScalar scalar) {
         // Sub::Quote/Moo constructor inlining.
         // Weak refs for WEAKLY_TRACKED objects are cleared only via:
         //   - explicit undefine() of a strong reference
-        //   - setLarge() overwriting a strong reference
         // Since unblessed objects have no DESTROY, delayed clearing is safe.
     }
 
diff --git a/src/main/java/org/perlonjava/runtime/runtimetypes/WeakRefRegistry.java b/src/main/java/org/perlonjava/runtime/runtimetypes/WeakRefRegistry.java
index 0b3e9f2dc..445d94076 100644
--- a/src/main/java/org/perlonjava/runtime/runtimetypes/WeakRefRegistry.java
+++ b/src/main/java/org/perlonjava/runtime/runtimetypes/WeakRefRegistry.java
@@ -22,20 +22,23 @@ public class WeakRefRegistry {
             new IdentityHashMap<>();
 
     /**
-     * Special refCount value for unblessed birth-tracked objects that have weak
-     * refs but whose strong refs can't be counted accurately. These objects were
-     * born via {@code createReferenceWithTrackedElements} (refCount started at 0)
-     * but have blessId == 0 (unblessed), meaning closure captures and temporary
-     * copies bypass {@code setLarge()}, making refCount unreliable.
+     * Special refCount value for objects that have weak refs but whose strong
+     * refs can't be counted accurately. Used in two cases:
+     * <p>
+     * 1. Unblessed birth-tracked objects (refCount started at 0) with blessId == 0,
+     *    where closure captures and temporary copies bypass {@code setLarge()},
+     *    making refCount unreliable.
+     * <p>
+     * 2. Untracked non-CODE objects (refCount == -1) that acquire weak refs via
+     *    {@code weaken()}. These are transitioned to WEAKLY_TRACKED so that
+     *    {@code undefine()} and {@code scopeExitCleanup()} can clear weak refs
+     *    when a strong reference is dropped. CODE refs are excluded because they
+     *    live in both lexicals and the symbol table, making stash references
+     *    invisible to refcounting.
      * <p>
      * Setting refCount to WEAKLY_TRACKED prevents {@code setLarge()} from
      * incorrectly decrementing to 0 and triggering false destruction.
      * Weak ref clearing happens only via explicit {@code undef} or scope exit.
-     * <p>
-     * Note: untracked objects (refCount == -1) are NOT transitioned to
-     * WEAKLY_TRACKED — they stay at -1 and their weak refs are never cleared
-     * deterministically. This distinction fixes the qr-72922.t regression
-     * where untracked regex objects had weak refs prematurely cleared.
      */
     public static final int WEAKLY_TRACKED = -2;