Commit 381f181

Restructure Alif E8 learning path with accurate ExecuTorch workflow
- Reorganized content into 9 logical pages following a Docker-based workflow
- Added: tool-installation, hardware-setup, docker-executorch-setup, model-export, build-runtime, alif-cmsis-project, flash-and-run, troubleshooting
- Removed outdated pages: boot-alif, install-setools, software-prerequisites, create-build-environment, mnist-inference
- Fixed all tabpane shortcodes to use the code=true format with language attributes
- Added images for hardware setup and macOS SETOOLS warnings
- Updated _index.md with accurate skills and learning objectives
1 parent 704b524 commit 381f181

15 files changed: 3465 additions & 993 deletions

Lines changed: 147 additions & 28 deletions
---
title: Overview
weight: 1
layout: learningpathall
---

This Learning Path covers deploying PyTorch neural network models on the **Alif Ensemble E8 DevKit** using ExecuTorch with Ethos-U55 NPU acceleration.

## What you'll build

A complete pipeline to:

1. Export PyTorch models to ExecuTorch format (`.pte`)
2. Optimize models for the Ethos-U55 NPU using the Vela compiler
3. Build the ExecuTorch runtime for Cortex-M55
4. Deploy and run inference on Alif E8 hardware

## Hardware Overview - Alif Ensemble E8 Series

Selecting the best hardware for machine learning (ML) models depends on effective tools. You can visualize Arm Ethos-U performance early in the development cycle using Alif's [Ensemble E8 Series Development Kit](https://alifsemi.com/ensemble-e8-series/).

<center>
<iframe src='https://www.youtube.com/embed/jAvi2xKxkE4?si=Wd-E1PUCM4Y49uXM' allowfullscreen frameborder=0 width="800" height="400"></iframe>

*Alif Ensemble Series Overview*
</center>

![Alif Ensemble E8 Board SoC Highlighted alt-text#center](./alif-ensemble-e8-board-soc-highlighted.jpg "Arm Ethos-U NPU location")

### Alif Ensemble E8 DevKit (DK-E8-Alpha)

| Component | Specification |
|-----------|---------------|
| **CPU** | Arm Cortex-M55 (HE core @ 160 MHz) |
| **NPU** | Arm Ethos-U55 (128 MAC configuration) |
| **ITCM** | 256 KB (fast instruction memory) |
| **DTCM** | 256 KB (fast data memory) |
| **SRAM0** | 4 MB (general purpose) |
| **SRAM1** | 4 MB (NPU accessible) |
| **MRAM** | 2-5.5 MB (non-volatile code storage) |

{{% notice Note %}}
The DK-E8-Alpha DevKit may use E7 silicon (AE722F80F55D5AS), which has 5.5 MB MRAM and 13.5 MB SRAM in total. SETOOLS will auto-detect your actual chip variant. Always build for the detected silicon type.
{{% /notice %}}

### Alif's Ensemble E8 Processor Decoded
| Marking | Meaning |
|---------|---------|
| B4ADKA 2508 | • B4ADKA - Assembly site & line identifier<br>• 2508 - year + week of manufacture (Week 08 of 2025) |
| UASA37002.000.03 | • UASA37002 - Identifies the silicon mask set<br>• .000.03 - means revision 3 of that mask |

## Software Stack

```
┌────────────────────────────────────────────────────┐
│ Your Application                                   │
├────────────────────────────────────────────────────┤
│ ExecuTorch Runtime                                 │
│   ├── Program Loader                               │
│   ├── Executor                                     │
│   └── Memory Manager                               │
├────────────────────────────────────────────────────┤
│ Delegates & Kernels                                │
│   ├── Ethos-U Delegate (NPU acceleration)          │
│   ├── Cortex-M Kernels (CPU fallback)              │
│   └── Quantized Kernels (INT8 ops)                 │
├────────────────────────────────────────────────────┤
│ Alif SDK / CMSIS                                   │
│   ├── Device HAL                                   │
│   ├── UART Driver                                  │
│   └── GPIO Driver                                  │
├────────────────────────────────────────────────────┤
│ Hardware: Cortex-M55 + Ethos-U55                   │
└────────────────────────────────────────────────────┘
```

## Prerequisites

### Required hardware

- Alif Ensemble E8 DevKit (DK-E8-Alpha)
- USB-C cable (connect to the **PRG USB** port)
- Optional: USB-to-Serial adapter for UART debugging

### Required software

| Tool | Version | Purpose |
|------|---------|---------|
| Docker | Latest | Development container |
| Arm GCC | 13.x or 14.x | Cross-compiler |
| CMSIS-Toolbox | 2.6.0+ | Build system |
| J-Link | 7.x+ | Programming/debugging |
| SETOOLS | 1.107.x | Alif flashing tools |
| Python | 3.10+ | ExecuTorch export |
## Key concepts

### Model quantization

ExecuTorch uses **INT8 quantization** for Ethos-U55:

- Reduced memory footprint (4x smaller than FP32)
- Faster inference on NPU
- Minimal accuracy loss with proper calibration

### Memory layout

{{% notice Warning %}}
Large tensors and model weights must be placed in **SRAM0** (4 MB), not DTCM (256 KB). Failing to do this causes linker overflow errors.
{{% /notice %}}

Place large buffers in SRAM0 using the section attribute:

```c
/* 512 KB tensor arena; the linker script places .bss.noinit in SRAM0 */
static uint8_t __attribute__((section(".bss.noinit"), aligned(16)))
    tensor_arena[512 * 1024];
```

### SRAM0 power management

{{% notice Important %}}
SRAM0 must be powered on before use via Secure Enclave services. Accessing unpowered SRAM causes HardFault crashes.
{{% /notice %}}

```c
#include "se_services_port.h"
#include "services_lib_api.h"

/* Request that the Secure Enclave power up SRAM0 before any access */
uint32_t mem_error = 0;
SERVICES_power_memory_req(
    se_services_s_handle,
    POWER_MEM_SRAM_0_ENABLE,
    &mem_error);
```

## Example: MNIST digit classification

The included MNIST example demonstrates:

- Loading a quantized CNN model (~100 KB)
- INT8 input preprocessing (28x28 grayscale image)
- NPU-accelerated inference (~10-20 ms)
- Output processing (argmax of 10 classes)

```
Input: 28x28 grayscale image (784 bytes INT8)

┌─────────────────────────────────────────┐
│ Conv2d(1→16) → ReLU → MaxPool           │  NPU
│ Conv2d(16→32) → ReLU → MaxPool          │  accelerated
│ Linear(1568→64) → ReLU                  │
│ Linear(64→10)                           │
└─────────────────────────────────────────┘

Output: 10 class scores (10 bytes INT8)
```

## Benefits and applications

NPUs like Arm's [Ethos-U55](https://www.arm.com/products/silicon-ip-cpu/ethos/ethos-u55) and [Ethos-U85](https://www.arm.com/products/silicon-ip-cpu/ethos/ethos-u85) provide significant advantages for embedded ML applications:

- **Hardware acceleration**: 10-50x faster inference compared to CPU-only execution
- **Power efficiency**: Lower power consumption per inference operation
- **Real-time capable**: Suitable for latency-sensitive applications
- **On-device processing**: No cloud dependency, enhanced privacy
- **Visual feedback**: RGB LED indicators provide immediate status confirmation
- **Debug capabilities**: UART and RTT output for detailed performance analysis

The Alif [Ensemble E8 Series Development Kit](https://alifsemi.com/ensemble-e8-series/) integrates the Ethos-U55 NPU with Cortex-M55 and Cortex-A32 cores, making it ideal for prototyping TinyML applications that require both ML acceleration and general-purpose processing.

content/learning-paths/embedded-and-microcontrollers/observing-ethos-u-on-alif/2-boot-alif.md

Lines changed: 0 additions & 129 deletions
This file was deleted.
