Layer-adaptive pruning produces models where some layers retain all experts (important layers) while others are aggressively pruned. moe-stream reads the per-layer expert count from the GGUF metadata ...