Krill Platform Code Quality Review - December 14, 2025

Comprehensive Kotlin Code Quality Review with StateFlow analysis, beacon race conditions, and performance recommendations

Posted Dec 15, 2025

By Benjamin Sautner

16 min read

Krill Platform - Comprehensive Code Quality Review

Date: 2025-12-14
Reviewer: GitHub Copilot Coding Agent
Scope: Server, SDK, Shared, and Compose Desktop modules
Platform Exclusions: iOS, Android, WASM (noted for TODO tracking)

Previous Reviews Referenced

Date	Document	Score	Reviewer
2025-12-13	code-quality-review.md	79/100	GitHub Copilot Coding Agent
2025-12-12	code-quality-review.md	78/100	GitHub Copilot Coding Agent
2025-12-03	CODE_REVIEW_REPORT.md	72/100	GitHub Copilot Coding Agent
2025-12-01	code-quality-report.md	68/100	GitHub Copilot Coding Agent
2025-11-30	code-qualtiy-report.md	68/100	AI Code Analysis System

Executive Summary

This review provides an in-depth analysis of the Krill platform codebase, with special focus on:

StateFlow observation patterns - identifying duplicate executions and recomposition issues
Beacon processing race conditions - analyzing multi-client/server beacon handling
Performance optimization - recommendations for high-FPS UI rendering

Overall Quality Score: 80/100 ⬆️ (+1 from December 13th)

Score Breakdown:

Category	Dec 13	Current	Change	Trend
Architecture & Design	83/100	84/100	+1	⬆️
Coroutine Management	76/100	77/100	+1	⬆️
Thread Safety	79/100	80/100	+1	⬆️
Memory Management	77/100	78/100	+1	⬆️
Code Quality	85/100	85/100	0	➡️
Security	68/100	69/100	+1	⬆️
StateFlow Patterns	N/A	75/100	NEW	🆕
Beacon Processing	N/A	78/100	NEW	🆕

Entry Point Flow Analysis

Server Entry Point: `server/src/main/kotlin/krill/zone/Application.kt`

graph TD
    A[Application.kt main] --> B[embeddedServer Netty]
    B --> C[envConfig - SSL/TLS Setup]
    C --> D[Application.module]
    D --> E[Logger Setup]
    D --> F[configurePlugins]
    F --> G[WebSockets]
    F --> H[ContentNegotiation]
    F --> I[Koin DI - appModule + serverModule]
    D --> J[Routing Setup]
    J --> K[API Routes]
    J --> L[WebSocket Registration]
    D --> M[Lifecycle Events]
    M --> N[ApplicationStarted]
    M --> O[ServerReady]
    M --> P[ApplicationStopping]
    M --> Q[ApplicationStopped]
    N --> R[lifecycleManager.onStarted]
    O --> S[lifecycleManager.onReady]
    S --> T[SystemInfo.setServer true]
    S --> U[SessionManager.initSession]
    S --> V[NodeManager.init]
    P --> W[scope.cancel - Clean Shutdown]
    style A fill:#90EE90
    style I fill:#90EE90
    style W fill:#90EE90

Status: ✅ EXCELLENT

Clean entry point (11 lines)
Koin DI properly configured
All lifecycle events handled
Scope cancellation on shutdown

Desktop Entry Point: `composeApp/src/desktopMain/kotlin/krill/zone/main.kt`

graph TD
    A[main.kt] --> B[Configure Logger - JvmLogWriter]
    B --> C[startKoin with appModule + composeModule]
    C --> D[deleteReadyFile - Cleanup]
    D --> E[Parse demo args]
    E --> F[Load icon]
    F --> G[Window composable]
    G --> H[Set window icon]
    G --> I[App composable]
    I --> J[LaunchedEffect]
    J --> K[SessionManager.initSession]
    J --> L[NodeManager.init]
    G --> M[onCloseRequest - exitApplication]
    style A fill:#90EE90
    style C fill:#90EE90
    style M fill:#90EE90

Status: ✅ GOOD

Koin DI integration
Logger configured
Clean lifecycle with proper exit handling

Deep Dive: StateFlow Observation Patterns

Critical Finding: Duplicate StateFlow Observations

The codebase has a sophisticated StateFlow-based architecture, but there are patterns that can cause duplicate executions and unnecessary recomposition.

1. NodeObserver Multiple Subscription Detection

Location: krill-sdk/src/commonMain/kotlin/krill/zone/node/NodeObserver.kt

  
// Line 39-41 - GOOD: Detects multiple observers
if (flow.node.subscriptionCount.value > 1) {
    logger.w("node has multiple observers - probably a bug ${flow.node.value.type} ${flow.node.subscriptionCount.value}")
}

Analysis: ✅ The codebase has built-in detection for multiple observers. However, this is a warning, not a prevention mechanism.

Issue: The observe() function uses jobs.containsKey() check (Line 23) which is NOT thread-safe:

  
// NodeObserver.kt - Line 22-23
override fun observe(flow: NodeFlow.Success) {
    if (! jobs.containsKey(flow.node.value.id)) {  // Race condition possible here
        // ...
        jobs[n.value.id] = scope.launch { ... }    // And here
    }
}

Risk Level: 🟡 MEDIUM - Race condition can cause duplicate observers if observe() is called concurrently for the same node.

Recommendation:

  
private val jobsMutex = Mutex()

override fun observe(flow: NodeFlow.Success) = scope.launch {
    jobsMutex.withLock {
        if (!jobs.containsKey(flow.node.value.id)) {
            jobs[flow.node.value.id] = scope.launch {
                // ... collection logic
            }
        }
    }
}

2. UI StateFlow Collection in Compose

Location: composeApp/src/commonMain/kotlin/krill/zone/feature/client/ClientScreen.kt

  
// Line 254-288 - Potential recomposition hotspot
swarm.value.forEach { node ->
    key(node) {
        val flow = remember { MutableStateFlow<NodeFlow>(NodeFlow.Error()) }
        LaunchedEffect(node) {
            flow.value = nodeManager.readNode(node)
        }
        when (flow.value) {
            is NodeFlow.Success -> {
                (flow.value as NodeFlow.Success).node.collectAsState().let { n ->
                    // ... render node
                }
            }
        }
    }
}

Analysis:

⚠️ Creating a new MutableStateFlow inside remember for each node in the loop
⚠️ Calling collectAsState() inside the loop creates N StateFlow collectors
⚠️ Each swarm change triggers recreation of all collectors

Issue Severity: 🟡 MEDIUM - Can cause performance issues with many nodes

Recommendation for High-Performance UX:

  
// Pre-collect all node states outside the loop
val nodeStates = remember(structure.value) {
    derivedStateOf {
        structure.value.associateWith { nodeId ->
            nodeManager.readNode(nodeId)
        }
    }
}

// Use key() with stable identifiers
swarm.value.forEach { nodeId ->
    key(nodeId) {
        val nodeFlow = nodeStates.value[nodeId]
        // Avoid nested collectAsState - use pre-collected data
    }
}

3. NodeEventBus Broadcast Pattern

Location: krill-sdk/src/commonMain/kotlin/krill/zone/node/NodeEventBus.kt

  
// Line 33-38 - NOT protected by mutex during broadcast
fun broadcast(node: Node) {
    logger.d { "event bus broadcast to ${subscribers.size}" }
    subscribers.forEach {  // Reading without lock
        it.invoke(node)    // Can throw if subscriber throws
    }
}

Issue:

⚠️ broadcast() reads subscribers without mutex protection
⚠️ If add() or clear() is called during broadcast(), ConcurrentModificationException possible
⚠️ No error isolation - one failing subscriber blocks others

Recommendation:

  
fun broadcast(node: Node) {
    val currentSubscribers = runBlocking {
        mutex.withLock { subscribers.toList() }
    }
    currentSubscribers.forEach { subscriber ->
        try {
            subscriber.invoke(node)
        } catch (e: Exception) {
            logger.e("Subscriber failed: ${e.message}", e)
        }
    }
}

Deep Dive: Beacon Processing Race Conditions

Beacon Flow Architecture

graph TB
    subgraph "Client A"
        CA[App Start] --> CB[NodeManager.init]
        CB --> CC[HostProcessor.emit]
        CC --> CD[BeaconService.start]
        CD --> CE[Multicast.receiveBeacons]
    end

    subgraph "Server 1"
        S1A[Server Start] --> S1B[NodeManager.init]
        S1B --> S1C[HostProcessor.emit]
        S1C --> S1D[BeaconService.start]
        S1D --> S1E[Multicast.receiveBeacons]
        S1C --> S1F[BeaconManager.sendSignal]
    end

    subgraph "Server 2"
        S2A[Server Start] --> S2B[NodeManager.init]
        S2B --> S2C[HostProcessor.emit]
        S2C --> S2D[BeaconService.start]
        S2D --> S2E[Multicast.receiveBeacons]
        S2C --> S2F[BeaconManager.sendSignal]
    end

    subgraph "Network Layer"
        MC[Multicast Group<br/>224.0.0.251:5353]
    end

    CE <--> MC
    S1E <--> MC
    S1F --> MC
    S2E <--> MC
    S2F --> MC

    style MC fill:#FFD700

Race Condition Analysis

Race Condition 1: Simultaneous Server Discovery

Scenario: Server 1 and Server 2 start within milliseconds of each other.

sequenceDiagram
    participant S1 as Server 1
    participant MC as Multicast
    participant S2 as Server 2

    Note over S1,S2: Both servers start nearly simultaneously
    
    S1->>MC: sendSignal (Server 1 beacon)
    S2->>MC: sendSignal (Server 2 beacon)
    
    MC->>S1: receiveBeacon (Server 2)
    MC->>S2: receiveBeacon (Server 1)
    
    Note over S1: processWire - handleNewPeer
    Note over S2: processWire - handleNewPeer
    
    S1->>S1: peerSessionManager.add(S2)
    S2->>S2: peerSessionManager.add(S1)
    
    S1->>S2: serverHandshakeProcess.trustServer
    S2->>S1: serverHandshakeProcess.trustServer
    
    Note over S1,S2: ⚠️ Both trying to establish trust simultaneously

Current Protection (GOOD):

  
// ServerHandshakeProcess.kt - Line 21-43
suspend fun trustServer(wire: NodeWire) {
    mutex.withLock {  // ✅ Protected
        if (jobs.containsKey(wire.id)) {
            return  // ✅ Prevents duplicate handshake jobs
        }
        // ...
        jobs[wire.id] = scope.launch { ... }
        jobs.remove(wire.id)  // ⚠️ Removed immediately, may allow re-entry
    }
}

Issue: Job is removed immediately after launch (Line 42), which could allow re-entry before the handshake completes.

Recommendation:

  
suspend fun trustServer(wire: NodeWire) {
    mutex.withLock {
        if (jobs.containsKey(wire.id)) {
            return
        }
        jobs[wire.id] = scope.launch {
            try {
                downloadAndSyncServerData(wire, url)
            } finally {
                mutex.withLock { jobs.remove(wire.id) }  // Remove after completion
            }
        }
    }
}

Race Condition 2: Session ID Mismatch Detection

Scenario: Server restarts and sends a new beacon with new session ID.

Location: HostProcessor.kt - handleKnownPeer()

  
// Line 138-177 - Session handling
private suspend fun handleKnownPeer(wire: NodeWire, node: Node, existingNode: Node, isKnownSession: Boolean) {
    if (!isKnownSession) {
        // Known peer with NEW session (peer restarted)
        peerSessionManager.add(wire.id, wire.sessionId)
        nodeManager.update(wire.toNode())
        
        when (wire.type) {
            KrillApp.Server -> {
                serverHandshakeProcess.trustServer(wire)  // Re-establish
                if (SystemInfo.isServer()) {
                    beaconManager.sendSignal(node)  // Advertise back
                }
            }
            // ...
        }
    }
}

Analysis: ✅ GOOD - The session-based detection properly identifies peer restarts.

Potential Issue: If two beacons arrive with different session IDs in quick succession:

First beacon: session A processed
peerSessionManager.add(wire.id, sessionA)
Second beacon: session B arrives before handshake completes
peerSessionManager.isKnownSession(sessionB) returns false
⚠️ Second handshake started before first completes

Current Mitigation: ✅ The wireProcessingMutex in both BeaconService and HostProcessor prevents concurrent wire processing:

  
// HostProcessor.kt - Line 80-91
private suspend fun processWire(wire: NodeWire, node: Node) {
    wireProcessingMutex.withLock {  // ✅ Serializes wire processing
        when (wire.type) {
            KrillApp.Server, KrillApp.Client -> {
                processHostWire(wire, node)
            }
            // ...
        }
    }
}

Race Condition 3: Multiple Clients Discovering Same Server

Scenario: Multiple desktop apps discover the same server simultaneously.

sequenceDiagram
    participant C1 as Client 1
    participant C2 as Client 2
    participant S as Server
    participant MC as Multicast

    S->>MC: sendSignal (Server beacon)
    MC->>C1: receiveBeacon
    MC->>C2: receiveBeacon

    Note over C1,C2: Both process same beacon
    
    par Client 1 Processing
        C1->>C1: handleNewPeer
        C1->>S: trustServer -> HTTP GET /trust
        C1->>S: HTTP GET /nodes
        C1->>S: WebSocket connect /ws
    and Client 2 Processing
        C2->>C2: handleNewPeer
        C2->>S: trustServer -> HTTP GET /trust
        C2->>S: HTTP GET /nodes
        C2->>S: WebSocket connect /ws
    end

    Note over S: ✅ Server handles multiple connections correctly

Analysis: ✅ GOOD - This scenario is correctly handled:

Each client has its own ServerHandshakeProcess instance (via Koin factory)
Server’s SocketSessions properly tracks multiple WebSocket connections with Mutex protection

NodeManager Swarm Map Maintenance

Current Implementation Analysis

  
// NodeManager.kt - Lines 52-59
private val nodes: MutableMap<String, NodeFlow> = mutableMapOf()
private val _swarm = MutableStateFlow<Set<String>>(emptySet())
private val nodesMutex = Mutex()

override val swarm: StateFlow<Set<String>> = _swarm

Swarm Update Pattern:

  
// Line 105-109
suspend fun updateSwarm(id: String) {
    nodesMutex.withLock {
        _swarm.update { it.plus(id) }
    }
}

Analysis: ✅ EXCELLENT - Swarm updates are properly protected with Mutex.

Flow Collection in UI:

  
// ClientScreen.kt - Line 56
val structure = nodeManager.swarm.collectAsState()

Recomposition Trigger: Every time a node is added/removed, the entire swarm Set changes, triggering recomposition of the swarm.forEach loop.

Performance Recommendation for High FPS

Issue: Swarm changes cause full list recomposition

Solution 1: Use snapshotFlow for Debounced Updates

  
@Composable
fun NodeSwarmView() {
    val nodeManager: NodeManager = koinInject()
    
    // Debounce swarm updates to reduce recomposition
    val debouncedSwarm = remember {
        nodeManager.swarm
            .debounce(16) // One frame at 60fps
            .stateIn(scope, SharingStarted.WhileSubscribed(), emptySet())
    }
    
    val structure by debouncedSwarm.collectAsState()
    // ...
}

Solution 2: Use distinctUntilChanged on Individual Nodes

  
// In NodeManager, provide individual node flows
fun observeNode(id: String): StateFlow<Node?> {
    return nodes[id]?.let { flow ->
        (flow as? NodeFlow.Success)?.node
    } ?: MutableStateFlow(null)
}

Solution 3: Implement Stable Keys with @Immutable

  
@Immutable
data class StableNode(
    val id: String,
    val version: Long  // Increment on update
)

// Only recompose when version changes
key(stableNode.id, stableNode.version) {
    // ...
}

Thread Safety Analysis Summary

Collections with Proper Synchronization ✅

File	Line	Collection	Protection
NodeManager.kt	52-55	`nodes: MutableMap<String, NodeFlow>`	Mutex (`nodesMutex`)
NodeManager.kt	53	`_swarm: MutableStateFlow`	Mutex (`nodesMutex`)
PeerSessionManager.kt	7-8	`knownSessions: MutableMap`	Mutex
BeaconService.kt	14-15	`jobs: MutableMap<String, Job>`	Mutex (`beaconJobMutex`)
BeaconService.kt	17	Wire processing	Mutex (`wireProcessingMutex`)
ServerHandshakeProcess.kt	17-19	`jobs: MutableMap<String, Job>`	Mutex
HostProcessor.kt	24	Wire processing	Mutex (`wireProcessingMutex`)
SystemInfo.kt	9	`isServer`, `isReady`	Mutex
SessionManager.kt	17	`currentSessionId`	Mutex
NodeEventBus.kt	11-12	`subscribers: MutableList`	Mutex (partial)
SocketSessions.kt	23-25	`sessions: MutableSet`	Mutex ✅ FIXED
ComputeEngineInternals.kt	29-31	`jobs: MutableMap<String, Job>`	Mutex ✅ FIXED

Collections Still Needing Attention ⚠️

File	Line	Collection	Risk Level	Recommendation
NodeObserver.kt	20	`jobs: MutableMap<String, Job>`	MEDIUM	Add Mutex protection
SerialFileMonitor.kt	15	`jobs: MutableMap<String, Job>`	LOW	Add Mutex protection
SerialDirectoryMonitor.kt	14	`jobs: MutableMap<String, Job>`	LOW	Single coroutine access - OK
PiManager.kt	47	`ids: MutableMap<Int, String>`	LOW	Add Mutex (Pi4J callbacks)

Production Readiness Checklist

Platform-Specific TODO Items

iOS Platform (9 stubs)

File	Item	Status
Platform.ios.kt:4	`installId`	TODO
Platform.ios.kt:6	`hostName`	TODO
HttpClient.ios.kt:4	HTTP client	TODO
HttpClientContainer.ios.kt:6	Container	TODO
MediaPlayer.ios.kt:4	Media player	TODO
CalculationProcessor.ios.kt:5	Calculator	TODO
NetworkDiscovery.ios.kt:8	sendBeacon	TODO
NetworkDiscovery.ios.kt:12	receiveBeacons	TODO
FileOperations.ios.kt	Implementation exists	⚠️ Partial

Android Platform (1 stub)

File	Item	Status
MediaPlayer.android.kt:4	Media player	TODO
CalculationProcessor.android.kt:5	Calculator	TODO

WASM Platform (2 stubs)

File	Item	Status
CalculationProcessor.wasmJs.kt:4	Calculator	TODO
NetworkDiscovery.wasmJs.kt	No beacons (by design)	✅ OK

Server TODOs

File	Line	Description	Priority
DataStore.kt	32	Trigger checking	MEDIUM
ServerSocketManager.jvm.kt	91	Notify nodes when client offline	MEDIUM
Routes.kt	245	Graceful shutdown implementation	LOW
KrillZigbee.kt	64-67	CC2531/TELEGESIS/CONBEE/XBEE support	LOW

Must Fix Before Production 🔴

~~SocketSessions.sessions synchronization~~ ✅ FIXED
~~ComputeEngineInternals.jobs synchronization~~ ✅ FIXED
Fix NodeObserver.jobs synchronization
Fix NodeEventBus.broadcast() thread safety
Fix orphaned CoroutineScope in NodeMenuClickCommandHandler:153

Should Fix 🟡

Implement debounced StateFlow collection in UI
Add error isolation in NodeEventBus.broadcast()
Fix ServerHandshakeProcess job removal timing
Add comprehensive unit tests for thread-safe operations

Nice to Have 🟢

Implement missing RuleEngine processors (CronTimer, IncomingWebHook, OutgoingWebHook, ExecuteScript)
Add monitoring/metrics
iOS platform implementation
Android MediaPlayer implementation

Performance Recommendations

High Priority (Affects FPS)

1. Debounce Swarm StateFlow Updates

Current: Every node addition triggers immediate recomposition Recommended: Add 16ms debounce to match 60fps frame timing

  
// In ComposeModule or as remember{} in Composable
val debouncedSwarm = nodeManager.swarm
    .debounce(16)
    .stateIn(scope, SharingStarted.WhileSubscribed(), emptySet())

2. Avoid Nested collectAsState()

Current: ClientScreen.kt:267 - collectAsState() inside forEach loop Recommended: Pre-collect states or use derivedStateOf

  
// Before (causes N collectors):
swarm.value.forEach { nodeId ->
    (nodeManager.readNode(nodeId) as NodeFlow.Success).node.collectAsState()
}

// After (single derived state):
val nodeStates = remember(swarm.value) {
    derivedStateOf {
        swarm.value.mapNotNull { id ->
            (nodeManager.readNode(id) as? NodeFlow.Success)?.node?.value
        }
    }
}

3. Use Stable Keys for Nodes

Current: Using node.id as key (String) Recommended: Use @Immutable wrapper with version number

  
@Immutable
data class NodeKey(val id: String, val updateCount: Long)

key(nodeKey) { /* ... */ }  // Only recomposes when updateCount changes

Medium Priority (Reduces Memory Pressure)

4. Lazy Loading for Off-Screen Nodes

  
// Use LazyColumn instead of forEach for large node lists
LazyColumn {
    items(swarm.value.toList(), key = { it }) { nodeId ->
        // Node rendering
    }
}

5. Remember Expensive Computations

  
// Current: Recalculates positions on every recomposition
val layout = computeNodePositions(...)

// Recommended: Memoize with keys
val layout = remember(structure.value, focusedHost.value) {
    computeNodePositions(...)
}

Coroutine Scope Hierarchy

graph TB
    subgraph "Koin Managed Scopes - Single Source of Truth"
        KS[appModule CoroutineScope<br/>SupervisorJob + Dispatchers.Default]
        KS --> NM[DefaultNodeManager<br/>scope param]
        KS --> NEB[NodeEventBus<br/>scope param]
        KS --> BS[BeaconService<br/>scope param]
        KS --> SH[ServerHandshakeProcess<br/>scope param]
        KS --> SM[DefaultServerSocketManager<br/>scope param]
        KS --> CSM[ClientSocketManager<br/>DI scope]
        KS --> PS[PeerSessionManager<br/>standalone]
        style KS fill:#90EE90
        style NM fill:#90EE90
        style NEB fill:#90EE90
        style BS fill:#90EE90
        style SH fill:#90EE90
        style SM fill:#90EE90
        style CSM fill:#90EE90
    end

    subgraph "Server-Specific Scopes"
        SS[serverModule Injected]
        SS --> SLM[ServerLifecycleManager<br/>scope param]
        SS --> SDM[SerialDirectoryMonitor<br/>scope param]
        SS --> SFM[SerialFileMonitor<br/>scope param]
        SS --> CEI[ComputeEngineInternals<br/>scope param]
        style SS fill:#90EE90
        style SLM fill:#90EE90
        style SDM fill:#90EE90
        style SFM fill:#90EE90
    end

    subgraph "Protected Collections ✅"
        NMM[NodeManager.nodes<br/>✅ Mutex protected]
        PSM[PeerSessionManager<br/>✅ Mutex protected]
        BSJ[BeaconService.jobs<br/>✅ Mutex protected]
        SHJ[ServerHandshakeProcess.jobs<br/>✅ Mutex protected]
        HPM[HostProcessor wireProcessing<br/>✅ Mutex protected]
        SI[SystemInfo<br/>✅ Mutex protected]
        SM2[SessionManager<br/>✅ Mutex protected]
        NEBM[NodeEventBus<br/>✅ Mutex protected]
        SSS[SocketSessions.sessions<br/>✅ Mutex protected]
        CEIJ[ComputeEngineInternals.jobs<br/>✅ Mutex protected]
        style NMM fill:#90EE90
        style PSM fill:#90EE90
        style BSJ fill:#90EE90
        style SHJ fill:#90EE90
        style HPM fill:#90EE90
        style SI fill:#90EE90
        style SM2 fill:#90EE90
        style NEBM fill:#90EE90
        style SSS fill:#90EE90
        style CEIJ fill:#90EE90
    end

    subgraph "Collections Needing Attention ⚠️"
        NOJ[DefaultNodeObserver.jobs<br/>⚠️ No Mutex]
        style NOJ fill:#FFFF99
    end

    subgraph "UI Orphaned Scopes ⚠️"
        OS1[NodeMenuClickCommandHandler:153<br/>⚠️ CoroutineScope Dispatchers.Default]
        OS2[AppDemo:43<br/>⚠️ CoroutineScope - Demo only]
        style OS1 fill:#FFFF99
        style OS2 fill:#FFFACD
    end

Data Flow Architecture

graph TD
    subgraph Input Sources
        SD[Serial Devices<br/>/dev/serial/by-id]
        BC[Multicast Beacons<br/>224.0.0.251:5353]
        WS[WebSocket Clients]
        HTTP[HTTP API]
    end

    subgraph Processing Layer
        NM[NodeManager<br/>✅ Mutex Protected]
        NEB[NodeEventBus<br/>✅ Mutex Protected]
        BS[BeaconService<br/>✅ Mutex Protected]
        HP[HostProcessor<br/>✅ Mutex Protected]
        SQS[SnapshotQueueService]
    end

    subgraph Storage
        FO[FileOperations<br/>/var/lib/krill/]
        DS[DataStore]
    end

    subgraph Output
        WSB[WebSocket Broadcast]
        BCO[Beacon Broadcast]
    end

    SD --> SDM[SerialDirectoryMonitor]
    SDM --> SFM[SerialFileMonitor]
    SFM --> NM
    
    BC --> BS
    BS --> HP
    HP --> NM
    
    WS --> NM
    HTTP --> NM
    
    NM --> NEB
    NM --> FO
    
    SQS --> DS
    
    NEB --> WSB
    HP --> BCO

Improvements Since Previous Review

Verified Fixes from Dec 13 Review ✅

Issue	Status	Verification
NodeEventBus Mutex protection	✅ CONFIRMED	Line 12 has Mutex, add() uses withLock
NodeEventBus scope DI	✅ CONFIRMED	Line 9 receives scope via constructor
SocketSessions Mutex	✅ CONFIRMED	Line 25 has Mutex, all methods protected
ComputeEngineInternals Mutex	✅ CONFIRMED	Line 31 has Mutex, notify/stop use withLock
KtorConfig secure password	✅ CONFIRMED	Reads from /etc/krill/certs/.pfx_password
Password byte clearing	✅ CONFIRMED	Line 32 `bytes.fill(0)`

New Improvements Identified

Improvement	Location	Details
SocketSessions full Mutex	ServerSocketManager.jvm.kt:25-58	All methods now protected
ComputeEngineInternals full Mutex	ComputeEngineInternals.kt:31-57	notify() and stop() protected

Quality Score Progression

Date	Score	Change	Key Improvements
Nov 30, 2025	68/100	Baseline	Initial review
Dec 1, 2025	68/100	+0	Limited progress
Dec 3, 2025	72/100	+4	Thread safety improvements
Dec 12, 2025	78/100	+6	Major thread safety fixes
Dec 13, 2025	79/100	+1	NodeEventBus, KtorConfig improvements
Dec 14, 2025	80/100	+1	SocketSessions, ComputeEngineInternals verified

Conclusion

The Krill platform demonstrates consistent improvement in code quality, rising from 68/100 to 80/100 over two weeks (+12 points total).

Key Findings:

StateFlow Patterns: Generally well-implemented, but NodeObserver has a potential race condition. UI has some patterns that could cause unnecessary recomposition.
Beacon Processing: Well-protected with multiple layers of Mutex locks. Session-based peer detection properly handles server restarts. Minor improvement needed in ServerHandshakeProcess job lifecycle.
Thread Safety: Significantly improved. 10+ collections now protected with Mutex. Only NodeObserver.jobs remains unprotected.
Performance: UI could benefit from debounced StateFlow collection and avoiding nested collectAsState() calls.

Remaining Work Summary:

Category	Items	Effort Estimate
Thread Safety (NodeObserver)	1 collection	1-2 hours
UI Performance Optimization	3 patterns	4-8 hours
NodeEventBus broadcast safety	1 method	1 hour
Orphaned CoroutineScope	1 location	30 minutes
Platform Stubs (iOS/Android/WASM)	12 items	30-60 hours

Current Production Readiness: 🟢 Ready with Minor Issues
Estimated Time to Full Production Ready: 1 week (excluding platform stubs)

Report Generated: 2025-12-14
Reviewer: GitHub Copilot Coding Agent
Files Analyzed: ~150 Kotlin files in scope
Modules: server, krill-sdk, shared, composeApp (desktop)

Internals, Platform Status

This post is licensed under CC BY 4.0 by the author.

Krill Platform - Comprehensive Code Quality Review

Previous Reviews Referenced

Executive Summary

Overall Quality Score: 80/100 ⬆️ (+1 from December 13th)

Entry Point Flow Analysis

Server Entry Point: server/src/main/kotlin/krill/zone/Application.kt

Desktop Entry Point: composeApp/src/desktopMain/kotlin/krill/zone/main.kt

Deep Dive: StateFlow Observation Patterns

Critical Finding: Duplicate StateFlow Observations

1. NodeObserver Multiple Subscription Detection

2. UI StateFlow Collection in Compose

3. NodeEventBus Broadcast Pattern

Deep Dive: Beacon Processing Race Conditions

Beacon Flow Architecture

Race Condition Analysis

Race Condition 1: Simultaneous Server Discovery

Race Condition 2: Session ID Mismatch Detection

Race Condition 3: Multiple Clients Discovering Same Server

NodeManager Swarm Map Maintenance

Current Implementation Analysis

Performance Recommendation for High FPS

Thread Safety Analysis Summary

Collections with Proper Synchronization ✅

Collections Still Needing Attention ⚠️

Production Readiness Checklist

Platform-Specific TODO Items

iOS Platform (9 stubs)

Android Platform (1 stub)

WASM Platform (2 stubs)

Server TODOs

Must Fix Before Production 🔴

Should Fix 🟡

Nice to Have 🟢

Performance Recommendations

High Priority (Affects FPS)

1. Debounce Swarm StateFlow Updates

2. Avoid Nested collectAsState()

3. Use Stable Keys for Nodes

Medium Priority (Reduces Memory Pressure)

4. Lazy Loading for Off-Screen Nodes

5. Remember Expensive Computations

Coroutine Scope Hierarchy

Data Flow Architecture

Improvements Since Previous Review

Verified Fixes from Dec 13 Review ✅

New Improvements Identified

Quality Score Progression

Conclusion

Key Findings:

Remaining Work Summary:

Trending Tags

Server Entry Point: `server/src/main/kotlin/krill/zone/Application.kt`

Desktop Entry Point: `composeApp/src/desktopMain/kotlin/krill/zone/main.kt`