veilor-org/veilor-os

Author	SHA1	Message	Date
s8n	ad059ec73e	docs: README de-GH + Forgejo build status	2026-05-06 13:45:33 +01:00
s8n	05d37f6419	docs: METHOD-CHANGELOG 2026-05-06 forgejo entry	2026-05-06 13:45:29 +01:00
veilor-org	f06ee5cc1c	chore: gitignore auto-install-vm test artifacts Test rig was leaking test/auto-install-vm.{nvram,qcow2} into untracked state. Pattern matches existing veilor-vm.* exclusion.	2026-05-06 10:50:04 +01:00
veilor-org	abb67841f1	docs: STRATEGY.md — primary git host moved to git.s8n.ru (Forgejo) Self-hosted Forgejo + forgejo-runner on nullstone now primary. GitHub becomes public mirror (Forgejo push-mirrors every commit + every 8h). 0 GH Actions minutes consumed. Runner labels: ubuntu-24.04 — drop-in for existing build-iso.yml workflow nullstone — privileged Fedora 43 (opt-in via runs-on: nullstone) Deploy artifacts: ~/ai-lab/nullstone-server/forgejo/. External TODO (parent operator owns): - router port-forward 222 → nullstone:222 for public SSH push - no-guest@file allowlist update for external web UI access	2026-05-06 02:01:06 +01:00
veilor-org	b86b4f9ec3	v0.5.32: ship 7 blockers from 9-agent wave Per docs/research/2026-05-05-agent-wave/README.md priority list. All 7 land together to keep iteration cycles useful — partial fixes bury the lookahead findings agents already mapped. ## 1. CRITICAL — suspend/resume wifi death (Agent 9, B2) `veilor-modules-lock.service` runs `kernel.modules_disabled=1` 30s after graphical.target. iwlwifi/iwlmvm/cfg80211 reload on resume from S3/S0ix → with modules locked, resume breaks wifi until reboot. Same architectural class as the LUKS bug — security feature breaks legitimate kernel state transitions. The unit already has `ConditionKernelCommandLine=!module.sig_enforce=1` (self-skip when signed-modules enforcement is on cmdline). Adding `module.sig_enforce=1` to the kernel cmdline retains the security property (no unsigned modules) without runtime lock-down → resume works. Files: kickstart/veilor-os.ks line 61 + overlay/usr/local/bin/veilor-installer generated bootloader directive both gain `module.sig_enforce=1`. ## 2. veilor-firstboot.service WantedBy=graphical.target (Agent 2) Was `WantedBy=multi-user.target` only. Real installs default to graphical.target so the unit never ran on installed systems — admin pw stayed at install-time + chage -d 0 expired, SDDM PAM bounced to chauthtok screen (recoverable but ugly UX). Now `WantedBy=graphical.target multi-user.target`. Live ISO + multi-user installs both resolve via this list. ## 3. USBGuard hash → id-based baseline (Agent 9, A3) Mirrors memory feedback_usbguard_dock.md — onyx had hash+parent-hash rules that broke on dock replug; we shipped no rules.conf so first boot blocks the USB keyboard. Adds overlay/etc/usbguard/rules.conf with HID-class allow rule (`allow with-interface match-all { 03:: }`) — covers every USB keyboard, mouse, gamepad, fingerprint reader, NFC. Survives dock replug + kernel-bump vendor renumeration. Mass-storage stays implicit-block; user explicitly allows post-firstboot via `ujust veilor-usbguard-enroll` (planned v0.6). ## 4. firewalld trusted zone with tailscale0 pre-bound (Agent 9, D1) User uses Tailscale daily (memory: project_tailscale_mesh.md). Default firewalld zone = drop, blocks tailnet traffic on tailscale0. Adds overlay/etc/firewalld/zones/trusted.xml with `<interface name="tailscale0"/>`. After `tailscale up` brings the interface up, NetworkManager dispatcher associates it with the trusted zone automatically — no user intervention. Default zone stays drop. Only the tailscale0 interface gets ACCEPT. ## 5. /etc/skel branding (Agent 7) Was completely empty. Result: per-user KDE config (~/.config/kdeglobals etc.) pre-empty, so the moment user opened System Settings, KDE wrote fresh ~/.config/* and silently shadowed our /etc/xdg/kdedefaults/*. Visual brand evaporated on first click. Seeds: /etc/skel/.config/kdeglobals (copy of assets/kde/veilor-default.kdeglobals) /etc/skel/.config/breezerc (copy of assets/kde/breezerc) /etc/skel/.config/kwinrc (Plasma 6 wayland defaults: opengl, animspeed=0, blur off, click-to-focus) /etc/skel/.config/konsolerc (default profile = Veilor) /etc/skel/.local/share/konsole/Veilor.profile + .colorscheme User who opens System Settings now writes against branded baseline, not against vanilla Breeze. ## 6. KMS modeset args + initramfs keymap (Agents 1 + 9) Real laptop boot has a 5-15s blank between vt switch and SDDM start because simpledrm releases before i915/nvidia-drm/amdgpu claim. Plus non-US users get locked out at LUKS prompt because initramfs ships en-US keymap by default (RHBZ 1405539, RHBZ 1890085). Adds to bootloader cmdline (live + installed): i915.modeset=1 amdgpu.modeset=1 nvidia-drm.modeset=1 rd.vconsole.keymap=us `rd.vconsole.keymap=us` is a placeholder; the v0.6 firstboot keymap picker will rewrite it from /etc/vconsole.conf. Until then, en-US users get correct LUKS keyboard; non-US users still need the v0.6 fix (per Agent 1). ## 7. virtio-9p log capture (Agent 6) The v0.5.30 virtio-serial wiring depends on rsyslog inside the live ISO (anaconda's setupVirtio writes a rsyslog forward rule), which the live ks doesn't install — files were 0-byte across three install runs. test/run-vm.sh now adds a `-virtfs local,...,mount_tag=hostlogs` share pointing at `test/test-runs/<timestamp>/`. veilor-installer runs `_dump_logs_to_host` via EXIT trap that mounts the share at /mnt/hostlogs and rsyncs /tmp/{anaconda,program,storage,packaging,dnf}.log + /var/log/veilor-installer.log + dmesg + journalctl + the generated ks. Runs on success AND failure AND ^C. No-op on real hardware (9p tag absent) — VM-only debug. ## Validate bash -n overlay/usr/local/bin/veilor-installer # OK ksvalidator kickstart/veilor-os.ks # clean ## Out-of-scope for v0.5.32 (deferred to v0.6) Per Agent 1 follow-ups: argon2id retune for slow CPUs, recovery key generation in firstboot, TPM2/FIDO2 unlock helpers. Per Agent 9 follow-ups: Plasma Wayland fallback X11 install, lid-close handling, SELinux relabel progress UX. Per Agent 4: AppArmor stack + nftables preset + audit log shipping CLI. Per Agent 8 (CI hardening): SHA-pin actions + dependabot + SBOM + SLSA L3 attestation — separate workflow-only commit.	2026-05-05 15:36:24 +01:00
veilor-org	7060d9aa6b	docs: refine strategy — ostreecontainer install + mesh stack + browser stack Refines docs/STRATEGY.md per parent-operator handoff (2026-05-05). Locks in five things the original draft didn't cover, and corrects one mistake. ## Refinement: ostreecontainer install path The original draft proposed a two-step install: Anaconda partitions + kickstart, then on first boot a `veilor-firstboot-rebase.service` runs `bootc rebase ghcr.io/veilor/veilor-os:43`. This commit drops that step. Anaconda's `ostreecontainer --url=... --transport=registry` directive populates the root filesystem directly from the OCI image during the install pass. No first-boot rebase, no transition window, no second reboot. Same end state, simpler path. Stay on `ostreecontainer` through v0.8. Do NOT migrate to the new `bootc` kickstart command until v1.0 — it blocks multi-disk and authenticated registries. Do NOT use `bootc-image-builder anaconda-iso` output — deprecated in image-builder v44+. Produce the OCI image and the bootstrap ISO as separate artifacts. This compresses the v0.7 BlueBuild spike from 2 days → 1 day. ## Correction: keep Trivalent as default The original strategy.md treated Trivalent (secureblue's hardened Chromium) as an override-and-remove. That was wrong: Trivalent's COPR tracks upstream M147+ within hours, ships hardened_malloc + JIT-less + Drumbrake WASM. Default browser pick. Mullvad Browser layered alongside for anti-fingerprint. Thorium remains opt-in via `ujust install-thorium` only — its CVE lag is months and contradicts the threat model. Never default. ## Mesh stack baked in Three-layer warm-stack documented in STRATEGY.md: - L3a Tailscale + Headscale (Day 1, daily driver) - L3b Yggdrasil-go (Day 1, idle warm-fallback, AllowedPublicKeys mode) - L3c Reticulum/RetiNet AGPL fork (opt-in via ujust install-reticulum) Threat floor table: ISP-DNS-block (i, Day 1), ISP-Tailscale-block (ii, Phase 2 promote Yggdrasil), internet-down (iii, opt-in RetiNet + RNode). Tier model: tag:admin / tag:infra / tag:guest with failsafe pre-auth key on yubikey + paper + Authentik OIDC group. ## Onboarding Token paste / QR (user picks). Misskey signup mints reusable 24h-TTL pre-auth key. NOT auto-OIDC at first boot. ## Iroh seeding daemon stub (v0.8 / Phase 2) `veilor-seed.service` documented but NOT implemented until Iroh hits 1.0 (current 0.96–0.98 RC, Q1 2026 target slipped). BLAKE3 + iroh-gossip per-service topic. Static media only — DEFER DB replication forever. ## External dependency tracked nullstone Traefik `no-guest@file` ACL is currently 0.0.0.0/0 allow-all (XFF chain breakage 2026-05-03). Must be fixed before veilor-os first-public-ISO ships, otherwise tag:guest provisioning leaks the full vhost surface to every veilor user. Parent operator owns the fix; explicitly out of veilor-os scope. ## Files - docs/STRATEGY.md — full refinement - docs/ROADMAP.md — v0.7 spike entry now reflects ostreecontainer + mesh stack + 1-day spike target - README.md — drops the "v0.2.5 pre-release" badge + status box (out of date), adds bootc/atomic trajectory paragraph ## What did NOT change - v0.5.x main branch is untouched. The ostreecontainer swap belongs in the v0.7 spike branch, NOT v0.5.32. - nullstone Traefik config is untouched. Out of scope. - The kickstart and overlay code is untouched.	2026-05-05 15:15:52 +01:00
veilor-org	50a241a603	docs: STRATEGY.md — hybrid kickstart bootstrap + bootc OCI on secureblue Locks in the strategic decision from 2026-05-05 secureblue research agent: pivot the technical base toward bootc/OCI, but as a layer over secureblue's `securecore-kinoite-hardened-userns` rather than a Containerfile-from-scratch. ## What changed - New: `docs/STRATEGY.md` — full hybrid plan (kickstart bootstrap → first-boot bootc rebase → bootc-only at v1.0). Documents secureblue rationale, our overrides (drop Trivalent, restore sudo + Xwayland), next concrete steps for v0.7 spike (BlueBuild recipe + GH Actions workflow + `veilor-firstboot-rebase` one-shot). - Updated: `docs/ROADMAP.md` v0.7 bootc-spike subsection — supersedes the Agent 3 Containerfile-from-scratch plan with the BlueBuild layering plan. Spike compresses 1 week → 2 days; hardening review inherited from 30 secureblue contributors. ## Why hybrid, not pure pivot - Anaconda's LUKS UX (single passphrase prompt + custom partitioning) is mature; bootc-image-builder's installer is not yet on par. Keep the kickstart as the bootstrap. - bootc upgrade gets us atomic A/B + signed image chain + instant rollback that we can't realistically build alone with our contributor count. - The kickstart work is not lost — it becomes the day-zero installer through v0.7. v1.0 deprecates it entirely once bootc-image-builder installer ISO matures. ## Why secureblue, not Athena (Arch) \| Axis \| secureblue \| Athena OS \| \|---\|---\|---\| \| Maintainers \| 30 \| 8 \| \| MAC enforcing OOB \| SELinux + custom policy \| AppArmor active, profiles mostly unconfined \| \| Atomic / immutable updates \| Yes (bootc/rpm-ostree) \| No (rolling) \| \| Threat model published \| No \| Yes \| \| MS-signed Secure Boot shim \| Yes (Fedora shim) \| Yes (with auto-MOK) \| Athena's only structural advantage is the published threat model. We're already drafting one (Agent 5 of 2026-05-05 wave) — we get that win regardless. secureblue's contributor count + atomic update infrastructure is the leverage. ## Strategic credibility win Publishing `docs/THREAT-MODEL.md` BEFORE the v0.7 launch positions veilor-os ahead of secureblue (no threat model) and Athena (has threat model but smaller contributor base) on the one axis that matters most. ## Open questions documented in STRATEGY.md - secureblue contribution acceptance for upstream patches (USBGuard id-based-rules fix, threat model framework) - Brave vs Mullvad-Browser pick for default browser - bootc rebase first-boot fallback if rebase fails - Fedora 44 transition timing follows secureblue's release tags	2026-05-05 15:05:59 +01:00
veilor-org	4e9782a18a	docs: 9-agent research wave findings — v0.5.32 blocker map Logs the full output of the 9-agent deep-dive run on 2026-05-05 to docs/research/2026-05-05-agent-wave/. Pulls every actionable finding into one indexed location so v0.5.32 planning has a paper trail. Files: docs/research/2026-05-05-agent-wave/README.md — index docs/research/2026-05-05-agent-wave/01-...real-hardware.md — Plymouth + LUKS edge cases docs/research/2026-05-05-agent-wave/02-...firstboot-ux.md — SDDM + first-boot UX docs/research/2026-05-05-agent-wave/03-...spike-plan.md — bootc-image-builder 1-week spike docs/research/2026-05-05-agent-wave/04-...tier-2.md — AppArmor + nftables + audit + homed docs/research/2026-05-05-agent-wave/05-...launch.md — threat model + v0.7 launch checklist docs/research/2026-05-05-agent-wave/06-...log-capture.md — virtio-9p host-share for anaconda logs docs/research/2026-05-05-agent-wave/07-...skel-branding.md — /etc/skel gap audit docs/research/2026-05-05-agent-wave/08-...ci-hardening.md — SHA-pin actions + SBOM + SLSA L3 docs/research/2026-05-05-agent-wave/09-...failure-modes.md — real-hardware pessimistic audit Plus the prior linter-applied: docs/ROADMAP.md — Lessons learned section, v0.5.32 active block, v0.6 promotion of veilor-postinstall + veilor-doctor, v0.7 bootc spike scheduled docs/THREAT-MODEL.md — drafted by Agent 5; in/out scope, comparison matrix, v0.7 launch checklist Top blockers identified for v0.5.32 (cross-cited in README): 1. Suspend/resume wifi death (kernel.modules_disabled=1) 2. veilor-firstboot.service WantedBy=graphical.target 3. kernel-upgrade grub drift 4. USBGuard hash-rules problem (already learned on onyx) 5. firewalld blocks tailscale0 6. /etc/skel/ empty 7. virtio-9p log capture replaces broken virtio-serial path Wave + verifier pattern (per ROADMAP lessons learned #4) validated: 9 parallel agents on distinct topics produced converging blocker list. The same pattern landed v0.5.31 four-bug fix from the prior 4-agent verification wave on v0.5.30 outcome.	2026-05-05 14:52:53 +01:00
veilor-org	2788b95a12	v0.5.31: kernel-install via /etc/kernel/cmdline + set-e leak + rescue glob Four-bug fix from 4-agent verification wave on v0.5.30 outcome. Bug 1 CRITICAL: --location=none made anaconda skip CollectKernelArgumentsTask (installation.py:149-151). --append= args never collected, BLS entries wrote with empty cmdline. Drop --location=none, let anaconda do its bootloader path; broad transaction_progress patch already silences the gen_grub_cfgstub class failure. Bug 2 CRITICAL: kernel-install reads /etc/kernel/cmdline as source of truth (per 90-loaderentry.install:84-95). Veilor never wrote that file so kernel-install fell through to /proc/cmdline (live ISO's). Add 3-path write: /etc/kernel/cmdline (Path A canonical), /etc/default/grub (Path B legacy), grubby --update-kernel=ALL (Path C last-writer guard). Plus explicit kernel-install add per kernel after Path A write. Bug 3: rescue BLS glob -0-rescue-.conf required trailing hyphen; F43 uses -0-rescue.conf. Fix: -0-rescue*.conf (matches both). Bug 4: set +e/set -e scope leak in %post. v0.5.30 closed manual bootloader block with set -e which re-enabled errexit for the rest of %post that was authored with set +e semantics. Result: any non-guarded command failure aborted the LUKS args injection block. Fix: remove the closing set -e. Files: overlay/usr/local/bin/veilor-installer. Verified: bash -n clean, ksvalidator clean.	2026-05-05 13:47:23 +01:00
veilor-org	e83483a077	v0.5.30: broad error suppression + manual bootloader + virtio log capture Three-layer fix for the persistent anaconda transaction failure that killed v0.5.28 (gen_grub_cfgstub) and v0.5.29 (aggregate dnf5 error). ## Layer 1: broad error suppression in transaction_progress.py dnf5 under RPM 6.0 + cmdline anaconda emits a final aggregate `error("transaction process has ended with errors..")` at end of transaction whenever its internal failure counter > 0, regardless of whether we suppressed individual script_error events. Reproduced twice. The narrow patch in v0.5.29 suppressed per-package errors but the aggregate still raised PayloadInstallationError and aborted the install before the bootloader phase ran. v0.5.30 patch turns the `elif token == 'error':` branch in process_transaction_progress into a log.warning. All four producers (cpio_error, script_error, unpack_error, generic error) now flow through to a warning + continue. Pattern matches both the original anaconda layout AND the v0.5.29 narrow-patched layout, so re-applying on top of either is a no-op. This brings us back to v0.5.28 broad-suppression behaviour. The side effect that bit us in v0.5.28 (silent grub2-efi-x64 scriptlet failure → empty /boot/efi/EFI/fedora/ → gen_grub_cfgstub fails) is addressed by Layer 2 below. ## Layer 2: bootloader install moved out of anaconda The generated install kickstart now has `bootloader --location=none`, which tells anaconda NOT to invoke its own bootloader install code path (and therefore NOT to call gen_grub_cfgstub). All grub work moves into the chroot %post block: 1. `dnf reinstall grub2-efi-x64 grub2-pc grub2-tools shim-x64 efibootmgr` — re-runs scriptlets in the chroot with full PID 1 systemd state, so the systemd-run-style triggers that anaconda's chroot truncates actually execute. 2. `grub2-install --target=x86_64-efi --efi-directory=/boot/efi --bootloader-id=fedora --no-nvram` — populates /boot/efi/EFI/fedora/ 3. `gen_grub_cfgstub /boot/grub2 /boot/efi/EFI/fedora` (or `grub2-mkconfig` fallback) — writes /boot/efi/EFI/fedora/grub.cfg. 4. `efibootmgr -c -d <disk> -p <part> -L "veilor-os" -l \EFI\fedora\shimx64.efi` — registers the NVRAM boot entry pointing at the signed shim. Each step logs to stdout and continues on failure (`set +e` block); diagnostics surface in the install log without aborting the whole %post. ## Layer 3: virtio-serial log capture in run-vm.sh Anaconda 43.x autodetects `/dev/virtio-ports/org.fedoraproject.anaconda.log.0` and streams program/packaging/storage/anaconda logs through it in real time, before any tmpfs / pivot, before networking, surviving kernel panic. Wiring it into run-vm.sh means the host gets a tail-able log file at `test/anaconda-vm-YYYYMMDD-HHMMSS.log` for every VM run. We've lost logs three times in a row to anaconda failures + tmpfs reboots. This breaks the loop. ## Diagnostic story Before this commit: VM aborts → live ISO reboots itself → /tmp/ tmpfs gone → no logs → guess what failed. Three days, two and a half false fixes. After this commit: VM aborts → host has /home/admin/ai-lab/_github/veilor-os/test/anaconda-vm-*.log with the actual scriptlet output, the actual exit codes, the actual file-trigger failures. Future debug becomes evidence-based. Files changed: kickstart/veilor-os.ks — broad error suppression patch overlay/usr/local/bin/veilor-installer — --location=none + manual grub test/run-vm.sh — virtio-serial chardev wiring Verified: bash -n clean, ksvalidator clean.	2026-05-05 11:59:35 +01:00
veilor-org	613d35402e	v0.5.29: narrow anaconda patch + LUKS UX + initramfs assertion Five-fix bundle from 7-agent research wave on the v0.5.28-final gen_grub_cfgstub failure. ## 1. Narrow the anaconda transaction_progress patch (CRITICAL) The v0.5.28 patch was too broad. It rewrote `process_transaction_progress` so every 'error' token in the transaction queue became a `log.warning`. That queue carries four distinct error classes: - cpio_error — payload extraction (genuinely fatal) - script_error — RPM 6.0 cmdline-mode scriptlet warning-as-error (the ONE we want to ignore) - unpack_error — payload corruption (genuinely fatal) - error — generic transaction error (genuinely fatal) By swallowing all four we silently masked grub2-efi-x64's posttrans failure mid-install. /boot/efi/EFI/fedora/ ended up incomplete → gen_grub_cfgstub then failed at the bootloader install phase with "gen_grub_cfgstub script failed" because its `set -eu` script couldn't read the missing files. v0.5.29 narrows the patch: override only the `script_error` callback inside transaction_progress.py to log a warning and NOT enqueue 'error'. The consumer (`process_transaction_progress`) reverts to upstream behaviour where cpio_error / unpack_error / error still raise PayloadInstallationError. Real install-fatal events keep aborting; only the F43-RPM-6.0 scriptlet regression is silenced. The patch is applied via `python3 -c` regex rewrite (more robust than nested sed across multi-line method bodies). ## 2. LUKS UX — `tries=5,timeout=0` (FIX) Default cryptsetup-generator unit allows ONE passphrase try with a 1m30s wait. One typo on a long passphrase = wait 1m30s, then the device-wait timer trips, then dracut emergency shell after 3min total. Brutal. Adding `rd.luks.options=luks-XXX=tries=5,timeout=0` gives five typo-friendly retries with no auto-timeout. ## 3. fbcon=nodefer on installed-system cmdline (FIX) Live ISO cmdline already has `fbcon=nodefer` (added in v0.5.27 to fix the real-laptop black-screen-after-dracut). The installed-system bootloader directive in the generated install ks did NOT carry it. Same KMS handoff happens on the installed system on the same hardware. Now both have the flag. ## 4. /etc/crypttab fallback assertion (BELT-BRACES) Anaconda's custom-partitioning code path normally writes /etc/crypttab for `--encrypted` part directives. Edge cases observed in F43+ where it doesn't. Without crypttab, systemd-cryptsetup-generator can still work from kernel cmdline alone, but cleanup paths and second-stage unlock both fall over. Adding a fallback `echo` that writes the canonical line if it's missing post-anaconda. ## 5. Initramfs LUKS module assertion (DEFENSIVE) Force-include `crypt + systemd-cryptsetup + plymouth` modules in initramfs via /etc/dracut.conf.d/10-veilor-luks.conf. dracut autodetects these when it sees an active LUKS mapping, but %post runs before the LUKS state is fully observable from the chroot. Plus we wipe stale initramfs (`rm -f /boot/initramfs-*.img`) before `--regenerate-all` so the regen actually rewrites bytes. Final assertion runs `lsinitrd \| grep -q cryptsetup` and surfaces a [ERR] line in build output if the module didn't make it. ## What this should fix After the man-db fix in v0.5.28-final, install proceeded past "Configuring xxx" cleanly but died at "Installing boot loader" with gen_grub_cfgstub. Root-cause was the over-broad patch from #1 above. After v0.5.29: - Install transaction completes (man-db excluded; non-man-db scriptlet warnings still suppressed; real errors still raise) - gen_grub_cfgstub runs against complete /boot/efi/EFI/fedora/ - Bootloader install completes - Reboot to disk lands at GRUB veilor-os entry - Kernel + initramfs load (cryptsetup confirmed present) - Plymouth LUKS prompt appears with text fallback - User has 5 tries, no timeout - Unlock → btrfs subvol mount → systemd → SDDM Files: kickstart/veilor-os.ks (+45 lines), overlay/usr/local/bin/veilor-installer (+50 lines). Verified: bash -n clean, ksvalidator clean. References: pyanaconda transaction_progress.py:110-136 (4 producers of 'error') pyanaconda bootloader/efi.py:194-201 (gen_grub_cfgstub call site) /usr/bin/gen_grub_cfgstub (set -eu wrapper for grub2-mkconfig stub) Fedora wiki Changes/RPM-6.0 dnf5 issue #2507 (RPM 6.0 scriptlet propagation regression)	2026-05-05 05:12:24 +01:00
veilor-org	fae677fb68	v0.5.28 (final): patch anaconda transaction_progress.py + exclude man-db THE actual root cause of the man-db transaction failure that killed three consecutive VM installs (v0.5.26 / v0.5.27 / v0.5.28). Confirmed via 7-agent research wave: - Fedora 43 ships RPM 6.0, which changed scriptlet failure propagation. Scriptlets that previously emitted "Non-critical error" warnings now bubble up as transaction-level errors. dnf5 issue #2507 documents the change. Anaconda --cmdline mode treats any 'error' token from the dnf transaction as a fatal abort. - man-db's `transfiletriggerin` is the canonical trigger: it runs `systemd-run /usr/bin/systemctl start man-db-cache-update` which returns non-zero in the anaconda chroot (no PID 1 systemd) and is flagged as transaction-level error under RPM 6.0. - We previously patched anaconda's transaction_progress.py on the BUILD HOST so livecd-creator could finish its own transaction. That patch lives only on the host running the build — never landed in the live rootfs the user installs from. Reproduced 3 times: install-time anaconda on the live ISO is unpatched, hits the same code path, aborts at exactly "Configuring man-db.x86_64". Two-layer fix: 1. kickstart %post seds the file inside the live rootfs at build time so the user's install-time anaconda is patched. Sed downgrades the 'error' token from raise PayloadInstallationError to log.warning. 2. Generated install ks excludes man-db / man-pages / man-pages-overrides from %packages. Belt-and-braces — even if the patch has an edge case the trigger never fires because the package isn't installed. Users install man pages post-firstboot. Previous attempts that didn't work: dropping the updates repo (only narrowed the set of failing scriptlets, didn't fix the underlying RPM-6.0 propagation change); flipping SELinux to permissive (confirmed not the cause; kickstart's selinux directive only writes /etc/selinux/config in target root, doesn't affect installer-time). Follow-up for next release: replicate the transaction_progress patch in the CI workflow's container so the build itself is deterministic. Currently the workflow has been greening on luck. Files: kickstart/veilor-os.ks (+25 lines), overlay/usr/local/bin/veilor-installer (+10 lines). Verified: bash -n clean, ksvalidator clean.	2026-05-05 03:46:00 +01:00
veilor-org	931a19ec93	v0.5.28 (cont): drop updates repo from generated install ks Anaconda's transaction died at "Configuring man-db.x86_64" in both v0.5.26 and v0.5.27 VM tests, reliably, days apart, against a freshly populated package cache. Same failure pattern, same package, with nothing in the visible error other than "The transaction process has ended with errors..". Pattern matches the same Fedora `updates` repo issue that the CI build kickstart already worked around by stripping the `updates` line entirely (`.github/workflows/build-iso.yml` line ~109). The installer-generated kickstart was adding the line back and re-introducing the bug for every user install. This commit aligns the install-time ks with the build-time ks: only the base `releases` repo is consumed by anaconda. Users who want updates run `dnf upgrade` post-install (or the v0.6 `veilor-update` wrapper). Trade-off: first-boot package versions are frozen to the Fedora 43 release date instead of including post-release updates. Acceptable — the alternative is "install reliably fails" which makes any freshness conversation moot. Verified locally: `bash -n` passes, ks template still well-formed. End-to-end re-validation goes through the next CI ISO + VM test run.	2026-05-05 02:52:22 +01:00
veilor-org	e848c7ffc3	v0.5.28 (partial): lock locale to en_US, roadmap post-install menu Install-flow change + roadmap update. The roadmap entry is the durable record; the code change is the immediate effect. ## Locale picker removed The "[4/4] Locale" prompt is gone. Locale is hardcoded to en_US.UTF-8 for the install. Two reasons: 1. The picker only offered en_GB and en_US, both of which install identically apart from the langtag string and a couple of date / currency conventions that nobody who's mid-install is thinking about. It's a fake choice that adds a screen. 2. `localectl set-locale` post-install handles every locale on earth in one command. The v0.7 `veilor-postinstall` first-login menu (see roadmap below) will offer a locale + keyboard layout switch with live preview, which is the right place for that decision. Step counters updated [1/4]→[1/3], [2/4]→[2/3], [3/4]→[3/3]. The Locale row stays in the confirm-summary box because users still want to see what they're getting installed. ## Roadmap - New section v0.5.27–v0.5.28 — documents the install-path stabilisation work explicitly so the bridge between "first green ISO" and "looks polished" is not invisible. Calls out the LUKS BLS fix that landed in v0.5.27 + the gum-input replacement scheduled for v0.5.28. - v0.6 — `veilor-doctor` description expanded: this is the post-install audit tool. Every user runs it weekly to see drift from baseline. - v0.6 — new entry `veilor-postinstall`: EndeavourOS-style first-login welcome menu, single TUI screen, asks once. Covers the "I just installed, what do I configure" gap in one explicit step instead of scattered docs.	2026-05-05 02:48:36 +01:00
veilor-org	1881c14ea7	v0.5.27: rd.luks.uuid via grubby, GRUB rebrand, fbcon=nodefer, ASCII gum cursor Critical install bug fix + cosmetic round-up + first formal test procedure document. ## Critical: LUKS unlock on first boot Generated installer kickstart's %post was injecting `rd.luks.uuid=…` into `/etc/default/grub` only. Fedora 43 uses BLS (Boot Loader Specification) entries in `/boot/loader/entries/.conf`; those are NOT regenerated by `grub2-mkconfig`. Result: the kernel boots without `rd.luks.uuid=`, dracut's cryptsetup-generator never spawns the unlock unit, plymouth has no password to ask for, and dracut-initqueue loops on dev-disk-by-uuid for ~3min before dropping to emergency shell. The fix layers both write paths: - `/etc/default/grub` — keeps the args around for future kernels (kernel-install reads this when adding new entries). - `grubby --update-kernel=ALL --args=...` — rewrites the `options` line of every existing BLS entry so the kernel that boots NEXT actually has the args. Verified by reading `/proc/cmdline` from the dracut emergency shell on a v0.5.26 install; old cmdline had only `root=UUID=… ro rootflags=subvol=root` and was missing the LUKS arg entirely. ## GRUB / branding - `/etc/default/grub` is sed'd to `GRUB_DISTRIBUTOR="veilor-os"` (was already there, kept). - BLS entries' `title` line is rewritten in-place to "veilor-os (<kver>)" for every kernel — `grub2-mkconfig` does not touch BLS titles, so this is the only path. - `/boot/loader/entries/-0-rescue-*.conf` is removed: the auto-built rescue entry was leaking "Fedora Linux" into the GRUB menu and showing a second boot option that nobody asked for. The rescue kernel image itself is left in /boot. - Hostname defaults to `veilor` (was inheriting the `localhost-live` name anaconda writes when the kickstart's network directive is ignored under cmdline mode). - `/etc/machine-info` adds `PRETTY_HOSTNAME="veilor-os"` so `hostnamectl status` and any consumer reading machine-info see the brand. ## Boot UX - `fbcon=nodefer` added to live-ISO bootloader cmdline. On real laptops with a hardware GPU, the kernel modeset blanks the framebuffer console mid-boot; without `nodefer` the installer banner draws into a frozen framebuffer and the user sees a black screen with a blinking cursor for ~30s. virtio-vga in QEMU doesn't trigger this so it never reproduced in VM. Symptom report on v0.5.26 was the trigger to investigate. ## Installer cosmetics - `GUM_CHOOSE_CURSOR` and `GUM_INPUT_PROMPT` switched from `❯ ` to `> `. The unicode arrow falls back to a fixed-width block on the linux fbcon font and lipgloss then duplicates that block at col +23, producing the "Install Install" double-render and the stray-T artifact in password fields. Plain ASCII renders identically across fbcon, virtio-vga, and X/Wayland gum runs. - `VERSION_ID` bumped 0.5.8 → 0.5.27 in the os-release drop-in. The installer banner reads this at runtime, so the live ISO + installed system both now show "veilor-os 0.5.27". ## Test procedure - `test/TESTING.md` — first canonical test procedure document. Splits VM (cheap iteration, hybrid sendkey + human passwords) from real hardware (mandatory for tag). Documents the standard test passwords (`veilortest1` for both LUKS and admin), the kill-and-relaunch step to skip CD on second boot, and the per-step pass/fail contract. - `test/METHOD-CHANGELOG.md` — append-only audit trail for changes to the procedure. Future releases that alter the test method must add an entry here with the why. - `test/test-runs/_TEMPLATE.md` — per-run report template. Each tagged release should land a filled report alongside it. ## test/run-vm.sh Decoupled QEMU monitor sock setup from auto-inject. Previously `NO_INJECT=1` (used to suppress autotype noise into prompts) also killed the monitor sock, leaving the VM undriveable. Monitor sock is now always exposed; only the inject helper is gated on the pubkey detection.	2026-05-05 01:43:00 +01:00
veilor-org	c89c73ee84	v0.5.26: log to /run, tolerate tee failure User hit `/usr/local/bin/veilor-installer: line 33: /usr/bin/tee: input/output error` on real-hardware install. Cause: LOG was `/var/log/veilor-installer.log`, which on the live ISO is backed by an overlay over squashfs. A bad sector / flaky USB → tee write fails → process substitution dies → installer aborts before the menu renders. Two changes: 1. Move LOG to /run/veilor-installer.log — pure tmpfs, never touches the live medium. Same path also unaffected by /var fill or overlay weirdness. 2. Wrap the `exec > >(tee -a $LOG) 2>&1` redirect in a writability probe. If the log can't be appended to (tmpfs OOM, fd exhaustion, anything), skip the tee and run the installer without on-disk persistence rather than crashing. Persistence is a nice-to-have for post-mortem debugging; the installer running is the must-have. This inverts the priority correctly.	2026-05-04 14:20:26 +01:00
veilor-org	b3509b4b06	v0.5.25: don't run veilor-firstboot on live ISO Live ISO boot chain showing extra step: boot → text scroll → veilor-firstboot prompts admin pw → installer veilor-firstboot.service was enabled in live ks but it's an INSTALLED system feature (forces admin pw set on first real boot). Made no sense to ask on live (no persistent admin user, throwaway VM, etc). Live ks now: doesn't enable veilor-firstboot, masks the unit so overlay-copied unit file can't auto-activate. Install ks chroot %post already enables it (correct path). After fix: boot → text scroll → installer banner directly	2026-05-04 04:08:40 +01:00
veilor-org	4dabbd8fcf	v0.5.24: live ISO — text-mode boot + GRUB veilor branding User wants full chained pipeline: GRUB veilor-os → plymouth text → branded gum installer → install progress → reboot → installed system text-clean. Live ISO was missing pieces from the install ks polish. v0.5.24 brings live ks into parity: - bootloader --append: add plymouth.enable=0 (kills fedora splash, exposes tty1 with gum installer banner immediately) - chroot %post: GRUB_DISTRIBUTOR="veilor-os" (menu title) - chroot %post: GRUB_CMDLINE_LINUX_DEFAULT="" (drop rhgb quiet) - chroot %post: plymouth-set-default-theme details (text scroll fallback if plymouth.enable=0 ignored) - grub2-mkconfig regen with new branding Result on next ISO build: - Boot from ISO → GRUB shows "veilor-os" entry - Pick veilor-os → text scroll (no fedora splash) - TTY1 lands on gum installer banner + menu (no plymouth swallow) - Install completes → reboot → installed system already has the same text-mode boot + LUKS prompt config from v0.5.22-23	2026-05-04 02:26:00 +01:00
veilor-org	6197f7bf89	v0.5.23: explicit rd.luks.uuid injection in chroot %post v0.5.22 plymouth details theme works (text scroll boot visible). But LUKS prompt still never fires — dracut spins on dev-disk-by-uuid for 2+ min then drops to emergency shell. Hypothesis: anaconda --cmdline mode skips/breaks the bootloader auto- add of rd.luks.uuid arg. Without it, cryptsetup-generator in initramfs has no UUID to create unlock unit for → no prompt → no unlock → dracut times out. Fix: chroot %post detects LUKS partition via blkid TYPE=crypto_LUKS, injects `rd.luks.uuid=luks-<uuid>` into GRUB_CMDLINE_LINUX if not already present. Belt-and-braces — if anaconda DID add it, sed checks first. Followed by grub2-mkconfig regen (already in script) so installed grub.cfg picks up the new cmdline arg.	2026-05-04 00:36:13 +01:00
veilor-org	abfba24512	v0.5.22: plymouth `details` theme — scrolling text boot, LUKS visible v0.5.21 set plymouth.enable=0 — plymouth-start.service still ran + ate LUKS keystrokes. Boot fell to dracut emergency shell. Better path: plymouth IS running but in TEXT mode via built-in `details` theme (scrolling boot log, no graphics, no fedora logo). LUKS prompt renders as text "Please enter passphrase for...:". Plymouth still owns the prompt → keystrokes go through. Changes: - Drop plymouth.enable=0 from cmdline (let plymouth run) - chroot %post: plymouth-set-default-theme details - Drop rhgb quiet from GRUB_CMDLINE_LINUX_DEFAULT (all kernel msgs visible) - dracut --force --regenerate-all (new theme baked into initramfs) Result: text scroll boot → text LUKS prompt → text scroll → SDDM. Onyx aesthetic. Branded plymouth theme deferred to v0.6.	2026-05-03 23:10:23 +01:00
veilor-org	68ebe6fdbe	v0.5.21: plymouth.enable=0 — text boot like onyx, plymouth pkg kept User wants onyx-style boot: pure text scroll → LUKS prompt → text scroll → SDDM. No fedora splash, no plymouth UI. Solution: keep plymouth PACKAGE installed (Fedora's dracut module ships LUKS-prompt machinery via plymouth), but disable plymouthd at runtime via kernel cmdline `plymouth.enable=0`. Effect: - plymouthd starts → reads cmdline → exits - systemd-ask-password sees no plymouth daemon → falls back to systemd-tty-ask-password-agent on /dev/console - LUKS prompt rendered as text "Please enter passphrase for /dev/dm-0: " - All kernel/systemd messages visible - SDDM still launches at graphical.target (real install) Applied to both: - LIVE ks bootloader --append (so live boot text-mode + installer visible on tty1, no splash hiding it) - Generated install ks bootloader --append (so installed system text-boots with LUKS prompt) v0.6 will rebrand plymouth theme + re-enable for branded splash. For v0.5.0 ship: minimal/text aesthetic matches user's onyx daily driver.	2026-05-03 21:59:58 +01:00
veilor-org	26d6ff277b	v0.5.20: revert plymouth removal — back to Fedora defaults for LUKS v0.5.10-v0.5.19 progressively ripped plymouth out (kernel cmdline, masks, dracut omit, package -plymouth, dracut regen) trying to get text-mode LUKS prompt. Each layer surfaced a new dependency: - plymouth.enable=0 didn't disable plymouth-start.service - /dev/null mask survived but plymouth ran from initramfs anyway - omit_dracutmodules in chroot didn't take (anaconda regen overrode) - package removal worked but dropped LUKS prompt machinery entirely - crypt+systemd-cryptsetup re-add via dracut regen-all also didn't Each fix added complexity. Net: 6 commits of plymouth fighting, boot still stuck at LUKS prompt. Strategic revert: re-add plymouth, restore Fedora defaults. plymouth is the standard LUKS-prompt path on Fedora — works out of box. v0.6 will swap fedora-logos + plymouth theme for veilor-themed splash. This commit: - Adds `plymouth` back to %packages - Removes -plymouth/-plymouth-plugin-label/-plymouth-system-theme - Removes plymouth.enable=0/rd.plymouth=0/logo.nologo/console=tty0 from kernel cmdline - Removes /etc/dracut.conf.d/99-veilor-no-plymouth.conf write - Removes dracut --regenerate-all call - Removes /dev/null symlink masks for plymouth-* units - Keeps GRUB_DISTRIBUTOR=veilor-os branding - Drops GRUB_TERMINAL_OUTPUT=console + GRUB_THEME edits Result: Fedora stock LUKS path. plymouth shows fedora splash + LUKS prompt → user types passphrase → boot continues.	2026-05-03 21:52:10 +01:00
veilor-org	15311f56e9	v0.5.19: dracut --regenerate-all (fix chroot glob expansion bug) v0.5.18 added crypt + systemd-cryptsetup to dracut.conf.d/99-veilor- no-plymouth.conf. Boot test still failed: dracut-initqueue stuck waiting on dev-disk-by-uuid → systemd-cryptsetup never fired. Diagnosis: %post chroot used bash glob `for kver in /lib/modules//`. In chroot, shell may be dash + nullglob unset → unmatched glob expands literally to "/lib/modules//" → dracut --kver "/lib/modules/*/" fails silently with `\|\| true`. Initramfs never regenerated → still contains the v0.5.14 omit_dracutmodules-only config without crypt. Fix: dracut --force --regenerate-all (walks /lib/modules internally, no shell glob needed). One call regens all kernel initramfses with the new dracut.conf.d in scope.	2026-05-03 18:39:13 +01:00
veilor-org	2be5692c74	v0.5.18: add crypt + systemd-cryptsetup + ask-password agent to initramfs v0.5.17 boot stuck at dracut-initqueue waiting for LUKS device that never unlocks. Plymouth removal also dropped the dracut machinery that prompts user for LUKS passphrase. Pure-text systemd-tty-ask- password-agent works in real root but isn't bundled into initramfs. Fix: dracut.conf.d/99-veilor-no-plymouth.conf: add_dracutmodules+=" crypt systemd-cryptsetup " install_items+=" /usr/bin/systemd-tty-ask-password-agent " Result: dracut bundles systemd-cryptsetup + ask-password binary into initramfs. cryptsetup-generator creates unit at boot, ask-password- agent prompts on tty1 in text mode "Please enter passphrase for...:". sendkey-friendly + works on real hardware.	2026-05-03 17:35:31 +01:00
veilor-org	8ebe3a9713	v0.5.17: text-mode boot — drop fedora branding, full hackery scroll Per user: bottom-screen "fedora" logo + spinner during boot is fedora- logos package + GRUB graphical theme. v0.5.17 strips it for now — v0.6 will re-introduce plymouth with veilor-black theme + LUKS-prompt- friendly config. Changes: 1. Kernel cmdline: add `logo.nologo console=tty0` - logo.nologo: suppress kernel boot logo (Tux/Fedora leaf) - console=tty0: explicit text console output - `quiet` already absent → all kernel msgs scroll 2. /etc/default/grub patches in chroot %post: - GRUB_DISTRIBUTOR="veilor-os" (menu titles read "veilor-os" not "Fedora Linux 43...") - GRUB_THEME= (empty — no graphical theme) - GRUB_TERMINAL_OUTPUT="console" (text-mode menu, no gfxterm) - Drop GRUB_BACKGROUND 3. Regen grub.cfg + EFI grub.cfg with new branding Result: pure text scroll boot, white-on-black, no fedora artifacts. veilor-os menu title in GRUB picker. "Hackery dope" aesthetic. For real users wanting splash → v0.6 ships plymouth + veilor theme.	2026-05-03 16:27:15 +01:00
veilor-org	77266faa4f	v0.5.16: sshd UseDNS no — fix banner timeout on NAT/slirp	2026-05-03 15:41:15 +01:00
veilor-org	d07adf3b14	v0.5.15: inject cloud-init seed pubkey as admin sshkey at install time v0.5.14 ships a working install but auto-install harness can't SSH- validate post-reboot — admin user has no authorized_keys, hardened sshd rejects all auth. SSH up + listening but no path to log in. Fix: detect_seed_pubkey() searches /dev/sr* for a NoCloud cidata volume (label "cidata"), parses ssh_authorized_keys: list from user-data, returns first key. generate_ks() then embeds as sshkey --username=admin "ssh-ed25519 AAAA... user@host" right after the user= directive. Anaconda creates /home/admin/.ssh/authorized_keys with right perms (700/600). Real users: drop a NoCloud seed iso next to install media (or via USB), pubkey lands automatically. auto-install.sh: existing run-vm.sh seed logic already builds the cidata iso with host pubkey. If no seed → directive line empty → anaconda treats as no-op → SSH validation blocked but install otherwise unaffected.	2026-05-03 14:35:36 +01:00
veilor-org	8861e12485	v0.5.14: remove plymouth package entirely v0.5.13 added omit_dracutmodules+=plymouth + dracut --force regen in chroot %post. Boot test still showed plymouth-start.service running. Theory: the chroot dracut --force --kver loop didn't fire (kver glob may have been empty in chroot), or anaconda regenerated initramfs AFTER our %post and ignored our config drop-in. Simpler fix: don't ship plymouth at all. Add `-plymouth -plymouth-plugin-label -plymouth-system-theme` to kickstart %packages. With no plymouth package on disk, dracut can't bundle it into initramfs regardless of dracut.conf state. The /etc/dracut.conf.d snippet + /dev/null masks from v0.5.12-13 stay as belt-and-braces — harmless once plymouth is absent.	2026-05-03 11:12:35 +01:00
veilor-org	1a0cf689a8	v0.5.13: omit plymouth from dracut + regen initramfs v0.5.12 added /dev/null symlinks for plymouth services on real root. Boot test confirmed plymouth STILL starts: it lives in initramfs (dracut module 90plymouth) which has its own bundled service files, unaffected by /etc/systemd/system/ masks on the installed btrfs. Two-layer fix: 1. /etc/dracut.conf.d/99-veilor-no-plymouth.conf: omit_dracutmodules+=" plymouth " Then `dracut -f --kver $kver` to regenerate initramfs sans plymouth. 2. Keep /dev/null symlinks for post-pivot real-root masking. Result: LUKS prompt rendered as text by systemd-tty-ask-password-agent on tty1 — sendkey-friendly, hardware-realistic.	2026-05-03 10:10:46 +01:00
veilor-org	e90d6ef662	v0.5.12: mask plymouth via /dev/null symlinks (systemctl mask N/A in chroot) v0.5.11 used `systemctl mask plymouth-*.service` in generated kickstart %post chroot block. systemctl needs systemd running, which it isn't in anaconda chroot — calls failed silently (\|\| true). Boot test confirmed: post-reboot showed both: Started plymouth-start.service - Show Plymouth Boot Screen Started systemd-ask-password-plymouth.path - Forward Password Requests Fix: write /dev/null symlinks directly (`ln -sf /dev/null /etc/systemd/system/<unit>`). Achieves what mask does without needing systemd. Also adds the path-activated unit (systemd-ask-password-plymouth.path) which actually pulls plymouth in during dracut-initqueue.	2026-05-03 09:08:36 +01:00
veilor-org	f588f15a6e	v0.5.11: mask plymouth services in chroot %post v0.5.10 added plymouth.enable=0 + rd.plymouth=0 to kernel cmdline, but `plymouth-start.service` still registered and ran in real-root boot. LUKS prompt remained invisible — dracut-initqueue stuck waiting for /dev/disk/by-uuid/<luks> to materialize. Belt-and-suspenders: mask plymouth-{start,quit,quit-wait,read-write, switch-root}.service in the generated kickstart's %post chroot so the units can never start. Effect: systemd-tty-ask-password-agent handles LUKS prompt directly on tty1 with text "Please enter passphrase for disk ...:" — sendkey- friendly + works on real hardware too.	2026-05-03 07:37:00 +01:00
veilor-org	38d702e14a	v0.5.10: disable plymouth during early boot for text LUKS prompt v0.5.9 GRUB-installs cleanly. Disk boots, dracut reaches cryptsetup.target, systemd-ask-password-plymouth.path armed. But plymouth never switches from boot-splash mode to password-prompt mode — sendkey'd passphrases bounce, dracut waits forever on dev-disk-by-uuid. Workaround: pass `plymouth.enable=0 rd.plymouth=0` to kernel cmdline. Eliminates plymouth-ask-password-plugin as a layer; LUKS prompt appears as plain text on tty1 ("Please enter passphrase for disk... :"). Bonus: aligns with hardening posture. Plymouth is graphical eye-candy running in pid 1's namespace during early boot. Fewer moving parts = smaller attack surface. veilor-os defaults to text boot; users wanting splash can re-enable post-install.	2026-05-03 06:32:32 +01:00
veilor-org	2511df6327	v0.5.9: drop --location=none from bootloader directive Auto-install round 4 reached emergency dracut shell post-reboot: Warning: /dev/disk/by-uuid/ecbd65ba-... does not exist Generating "/run/initramfs/rdsosreport.txt" Entering emergency mode. Root cause: `bootloader --location=none` in our generated kickstart literally tells anaconda DO NOT INSTALL GRUB. Earlier reviewer agent suggested `--location=none` thinking it meant "auto-detect EFI/BIOS", but that's wrong — none means none. Fix: drop --location entirely. Anaconda picks correct mode based on detected disk layout (GPT-EFI → grub2-efi-x64, GPT-BIOS → grub2-pc). Without GRUB written, the rebooted VM's UEFI firmware fell back to loading initramfs straight from somewhere, but cmdline lacked rd.luks.uuid= → couldn't find the encrypted root → emergency shell.	2026-05-03 05:20:48 +01:00
veilor-org	2784fbd6e9	ci: drop updates repo (3x 404 on its zchunk repodata)	2026-05-03 04:15:12 +01:00
veilor-org	f8fc89e399	v0.5.8: installer UX polish — pro design User-locked design changes for serious/pro feel: Banner: - Full VEILOR OS wordmark (figlet ANSI Regular block) - Version + date + live indicator: "veilor-os 0.5.8 · 2026-05-03 · live" - No tagline, no credit - Rounded gum border, dim grey accent Menu: - Drop "Welcome" header (banner = welcome enough) - Reorder + simplify: Install live · KDE live · shell ────── Reboot Power off - Cursor: ❯ (sharp angle, matches box-drawing weight) - Middle-dot · separators (cleaner than en-dash) - Visual separator line between primary/session actions Install flow: - Step indicators on each prompt: [1/4] [2/4] [3/4] [4/4] - Disk: "[1/4] Select install disk · WILL BE ERASED" - LUKS: "[2/4] Encryption · LUKS2 passphrase (min 8)" - Admin: "[3/4] Admin user · password for 'admin'" - Locale: "[4/4] Locale" Confirm screen: - Boxed (gum style --border rounded) - "WILL BE ERASED" colored red (FG=1) - "This action is irreversible" colored amber (FG=3) - gum confirm with --affirmative "Yes, install" / --negative "Cancel" Install progress: - gum spin with --show-output during anaconda run - Title: "Installing veilor-os to /dev/X · 10-30min · logs on tty2" - Success: green-bordered "✓ Install complete" box, 5s reboot countdown os-release: VERSION_ID 0.1 → 0.5.8	2026-05-03 03:46:36 +01:00
veilor-org	53949b0899	v0.5.7: drop LVM, native btrfs-on-LUKS partitioning Auto-install round 3 hit: Configuring storage Creating disklabel on /dev/vda Creating luks on /dev/vda3 Creating lvmpv on /dev/mapper/luks-... Creating btrfs on /dev/mapper/veilor-root Running in cmdline mode, no interactive debugging allowed. The exact error message is: mount failed: wrong fs type, bad option, bad superblock on /dev/mapper/veilor-root, missing codepage or helper program LVM+btrfs combination causes mount failure under anaconda --cmdline. mkfs.btrfs runs but post-create mount can't find a valid superblock. Fix: drop LVM intermediary. Use native btrfs-on-LUKS — same pattern Fedora KDE Spin uses by default. Cleaner, snapshot story unchanged (btrfs subvols give us root/home split + rollback potential without LVM's overhead). New layout: vda1: efi (600M) vda2: ext4 /boot (1024M) vda3: LUKS2 → btrfs (label=veilor) ├── subvol root → / └── subvol home → /home ksvalidator clean on the new template.	2026-05-03 02:45:16 +01:00
veilor-org	ac371bdc36	v0.5.6: anaconda --cmdline + XDG_RUNTIME_DIR for unattended install v0.5.5 fixed AnacondaError: 'LANG'. Auto-install harness then hit next crash: TypeError: expected str, bytes or os.PathLike object, not NoneType File "/usr/lib64/python3.14/site-packages/pyanaconda/display.py", line 223 wl_socket_path = os.path.join(os.getenv("XDG_RUNTIME_DIR"), ...) anaconda's display.setup_display() unconditionally tries to set up Wayland socket path. tty1 has no XDG_RUNTIME_DIR set. None gets passed to os.path.join → TypeError. Two-part fix: 1. export XDG_RUNTIME_DIR=/run/user/0 + mkdir, so even if anaconda probes it, the env var has a valid string value. 2. Pass --cmdline to anaconda. Fully unattended text-only mode, no Wayland/X/TUI. Right fit for our gum-driven kickstart flow where ks is self-contained (disk, pw, locale all pre-answered). Combined effect: anaconda goes straight from CLI parse → kickstart execute → reboot. No display subsystem at all. Surfaced by test/auto-install.sh round 2.	2026-05-03 01:35:52 +01:00
veilor-org	5e38412944	v0.5.5: export LANG before anaconda (fixes AnacondaError: 'LANG') Auto-install harness ran through gum installer cleanly but anaconda crashed at startup: File "/usr/lib64/python3.14/site-packages/pyanaconda/keyboard.py", line 152, in activate_keyboard sync_run_task(task_proxy) ... pyanaconda.modules.common.errors.general.AnacondaError: 'LANG' Anaconda's keyboard.activate_keyboard() reads $LANG and bombs if unset. TTY1 (where veilor-installer runs) inherits no locale by default; gum/whiptail don't set it. Fix: export LANG + LC_ALL = user's chosen locale (defaulting to en_GB.UTF-8) before invoking anaconda. Found via test/auto-install.sh end-to-end run.	2026-05-03 00:06:54 +01:00
veilor-org	dce276586f	ci: chown build/out before split (container created as root)	2026-05-02 23:20:36 +01:00
veilor-org	9921745c9d	test/auto-install.sh: auto-fetch + reassemble chunked ISO from ci-latest Workflow now publishes ISO as 1900M chunks. Test harness needs to: 1. gh release download --pattern '.iso.part-' 2. cat parts back into single ISO 3. verify sha256 of all parts If invoked with no ISO arg, auto-fetches from ci-latest release. Falls back to local ISO path if given as arg (existing behavior). Reassembled ISO lives at ~/veilor-iso/ci-latest/.	2026-05-02 22:50:37 +01:00
s8n	deef914064	v0.5.5: autonomous install test harness (#12 ) test/auto-install.sh boots ISO, drives gum installer via QEMU monitor sendkey with hardcoded test answers, waits for anaconda, reboots into installed system, SSHs in, runs validation checklist. Co-authored-by: veilor-org <admin@veilor.org>	2026-05-02 22:49:51 +01:00
veilor-org	da08047172	ci: split ISO into 1900M chunks for GH release upload GH release asset size limit = 2 GiB. Veilor ISO ~2.8 GiB (KDE base + hardening + grafted /veilor/ tree). zstd -19 only achieves 96.67% compression (squashfs already xz-compressed). Splitting is the fix. Workflow now: - Splits ISO with `split -b 1900M -d --suffix-length=2` - Drops original ISO before upload (would fail at >2 GiB) - Includes per-part sha256 for reassembly verification - Release notes include cat reassembly command test/auto-install.sh will need follow-up commit to download + cat the parts before booting.	2026-05-02 22:49:19 +01:00
veilor-org	73ac2cf96f	ci: grant contents:write + drop artifact upload-on-failure Two follow-ups to `75a68a1` (releases switchover): 1. action-gh-release got 403 "Resource not accessible by integration" because default GITHUB_TOKEN has read-only on contents. Added workflow-level `permissions: contents: write`. 2. Failure-path artifact upload still hit quota wall. Replaced with inline `tail` of build/out/build.log + anaconda program.log directly to job log. No artifact upload = no quota.	2026-05-02 22:13:44 +01:00
veilor-org	75a68a1187	ci: switch ISO publish from artifacts to GitHub Releases Artifact storage quota (50GB Pro tier) maxed out with ~18 iterations of 2.7GB ISOs. Quota recalc 6-12h not in our cadence. Builds succeed but upload step fails — wasting CI minutes + blocking testing. Switch to GitHub Releases (no storage quota): - Every successful build on main updates rolling `ci-latest` prerelease draft. Replaces files in place. - Tag-driven releases (v..*) keep their existing publish path. - Build logs remain as artifacts (small + opt-in failure only, retention=1d). User can `gh release download ci-latest --repo veilor-org/veilor-os` or browse to releases page. No more artifact quota wall.	2026-05-02 21:42:54 +01:00
veilor-org	0f4647577b	v0.5.4: installer UX polish — terse menu, VEILOR OS wordmark, hostname auto User boot-tested v0.5.2 + in-VM patch. Requested polish: - Banner: replace slant-figlet `veilor-os` + "hardened. branded. yours." tagline with figlet ANSI Regular `VEILOR OS` wordmark (5-line block). No tagline. Border preserved by gum style call. - Menu header: "Welcome. What would you like to do?" → "Welcome" - Menu labels: "Install veilor-os to disk" → "Install" "Try live — desktop (KDE Plasma)" → "live - (KDE)" "Try live — shell" → "live - shell" "Reboot" / "Power off" unchanged - Hostname prompt removed — hardcoded to "veilor". User can change post-install via hostnamectl. Cuts one prompt from install flow. Confirmation summary drops the Hostname row. - Locale options trimmed: en_GB.UTF-8, en_US.UTF-8 only (was 4 incl de_DE, fr_FR). i18n not v0.5 priority. Verified in-VM rendering of the menu changes via sed-patch on v0.5.2 ISO. ksvalidator + bash -n clean.	2026-05-02 21:10:04 +01:00
veilor-org	125e5f93af	ci: drop ISO artifact retention from 14 to 3 days Hit GitHub Actions artifact storage quota (50GB Pro tier) at 26 artifacts × ~2.7GB = 42GB. Each push burns ~2.7GB; 14d retention + frequent iteration = inevitable quota exhaustion. 3-day retention covers QEMU + spare-laptop test cycles. For long-term keep, attach to GH Releases on tag (PR #2 will wire that).	2026-05-02 07:21:44 +01:00
veilor-org	9fedb8592f	v0.5.3: fix installer require_tty before tee redirect QEMU boot test of v0.5.2 found service still status=1/FAILURE despite file present at /usr/local/bin/veilor-installer. Root cause via `bash -x`: `exec > >(tee -a "$LOG") 2>&1` ran BEFORE require_tty check; process substitution replaces fd1 with a pipe, so [[ -t 1 ]] returns false → require_tty bails out with [ERR] message. Order was self-inflicted bug from v0.5.0. Fix: move require_tty function definition + call BEFORE the tee redirect. Drop the redundant require_tty call in the entry block (would fail post-redirect).	2026-05-02 06:22:47 +01:00
veilor-org	ec4291293e	v0.5.2: move veilor-installer + veilor-firstboot to /usr/local/bin QEMU boot test of v0.5.1 (commit `3cbffaf`) revealed both scripts missing from /usr/local/sbin/ on running system, despite being in overlay/usr/local/sbin/ in the source tree. Root cause: Fedora's filesystem package (or post-install scriptlet) rewrites /usr/local/sbin → /usr/local/bin symlink AFTER kickstart %post --nochroot's overlay copy runs. The cp -a placed files in /usr/local/sbin/ as a real directory; the symlink replacement deleted them. Confirmed via tty diagnostic: `ls -la /usr/local` shows `lrwxrwxrwx ... sbin -> bin` with bin mtime predating sbin symlink ctime by ~5min — overlay copy ran first, scriptlet rewrote sbin second. Fix: move both binaries to overlay/usr/local/bin/ where they're safe from the symlink rewrite. Update all references: - kickstart/veilor-os.ks chmod path + chown + diagnostic ls - overlay/etc/systemd/system/getty@tty1.service.d/veilor-installer.conf ExecStart - overlay/etc/systemd/system/veilor-firstboot.service ExecStart - scripts/selinux/build-policy.sh fcontext + restorecon paths - generated install ks template inside veilor-installer Service drop-in stays at /etc/systemd/system/getty@tty1.service.d/ unchanged. The veilor-installer binary in /usr/local/bin/ is discoverable via $PATH same as before.	2026-05-02 05:33:22 +01:00
s8n	3cbffaf714	sec: AppArmor profile skeletons + audit shipping draft + veilor-firstboot SELinux module (#3 ) Co-authored-by: veilor-org <admin@veilor.org>	2026-05-02 04:39:39 +01:00
s8n	8127f32868	v0.6: pre-stage veilor-update + veilor-doctor CLI tools (#11 ) Two user-facing commands shipped in overlay/usr/local/bin/. Wraps dnf+flatpak update flow and read-only health diagnostic. Uses gum if available, plain output otherwise. No kickstart wiring yet beyond chmod — full integration in v0.6.0 release. Co-authored-by: veilor-org <admin@veilor.org> Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-02 04:39:33 +01:00

1 2 3

104 commits