These are very destructive procedures. I bear no responsibility for any damage done to your system.
Windows and Linux knowledge is required. This guide is tailored for my system. Your mileage may vary.
After wiping my Fedora disk and re-installing Fedora from scratch I no longer had my UEFI Windows boot entry available in the BIOS.
This means that I previously had only one EFI partition, on that Linux storage device, and that Windows piggybacked onto it.
Now that Fedora is reinstalled on the freshly wiped disk, I have lost the Windows boot entry.
My system looked like this (lsblk output with comments).
Notice the lack of EFI partition anywhere else except for the Linux storage device.
NAME              MAJ:MIN RM   SIZE RO TYPE  MOUNTPOINTS
sda                 8:0    1 447.1G  0 disk
└─sda1              8:1    1 447.1G  0 part
sdb                 8:16   1 223.6G  0 disk           <-- Linux ssd
├─sdb1              8:17   1   600M  0 part  /boot/efi
├─sdb2              8:18   1     1G  0 part  /boot
└─sdb3              8:19   1   222G  0 part
  └─luks-REDACTED 253:0    0   222G  0 crypt /home
                                             /
zram0             252:0    0     8G  0 disk  [SWAP]
nvme0n1           259:0    0 931.5G  0 disk
├─nvme0n1p1       259:1    0    16M  0 part
└─nvme0n1p2       259:2    0 931.5G  0 part
nvme2n1           259:3    0 465.8G  0 disk           <-- Windows nvme
├─nvme2n1p1       259:4    0    16M  0 part
└─nvme2n1p2       259:5    0 465.8G  0 part
nvme1n1           259:7    0   3.6T  0 disk
├─nvme1n1p1       259:8    0    16M  0 part
└─nvme1n1p2       259:9    0   3.6T  0 part
I could go back and learn how to put the Windows EFI boot option onto the existing Linux SSD EFI partition, but seeing that the Windows installer did this before, and it caused grief, I opted for the following:
Resize the Windows partition and create an EFI partition on the Windows disk for redundancy
First, write the Windows installer ISO to a USB stick (/dev/sdc here) and boot from it:

dd if=/home/username/Downloads/Win11_23H2_English_x64v2.iso of=/dev/sdc bs=4M
Once the installer is up, press Shift+F10 to get the command prompt and run diskpart. In the diskpart command prompt, free the C: drive letter from whatever volume currently holds it (select that volume first, then):

remove letter=C

Run list disk to identify the Windows disk and partition. In my case it was Disk 2 and Partition 2:

select disk 2
select part 2
assign letter=C

Leave diskpart by running exit and check whether the C: drive contains the correct partition. Then run diskpart again, select disk 2 and select part 2 once more, and carve out the new EFI partition:

shrink desired=500 minimum=500
create partition efi
select part 3
format fs=fat32 quick
assign letter=y
exit

Finally, write the Windows boot files onto the new EFI partition:

bcdboot C:\windows /s Y:
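After rebooting into Linux, you can sanity-check that the firmware picked up the new entry (assuming efibootmgr is installed):

efibootmgr | grep -i windows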
I want my Linux home server to have an encrypted ZFS root filesystem with raidz1. In my quest to realize this use-case, I found ZFSBootMenu, which has guides for major Linux distributions, including Fedora Workstation.
I decided to go with the ZFSBootMenu Fedora Workstation guide rather than try to hack something on my own.
The ZFSBootMenu Fedora Workstation guide covers the use-case with one block storage device and without raidz1.
Luckily, most of it is still valid. These are the changes I had to make for my system.
NOTE: The guide still needs to be followed; just substitute the relevant sections with the commands below.
export BOOT_DISK="/dev/nvme0n1"
export BOOT_PART="1"
export BOOT_DEVICE="${BOOT_DISK}p${BOOT_PART}"
export POOL_PART="2"
zpool labelclear -f /dev/nvme0n1p2
zpool labelclear -f /dev/nvme1n1p2
zpool labelclear -f /dev/nvme2n1p2
wipefs -a "$BOOT_DISK"
wipefs -a /dev/nvme0n1
wipefs -a /dev/nvme1n1
wipefs -a /dev/nvme2n1
sgdisk --zap-all /dev/nvme0n1
sgdisk --zap-all /dev/nvme1n1
sgdisk --zap-all /dev/nvme2n1
sgdisk --zap-all "$BOOT_DISK"
sgdisk -n "${BOOT_PART}:1m:+512m" -t "${BOOT_PART}:ef00" "/dev/nvme0n1"
sgdisk -n "${BOOT_PART}:1m:+512m" -t "${BOOT_PART}:ef00" "/dev/nvme1n1"
sgdisk -n "${BOOT_PART}:1m:+512m" -t "${BOOT_PART}:ef00" "/dev/nvme2n1"
sgdisk -n "${POOL_PART}:0:-10m" -t "2:bf00" "/dev/nvme0n1"
sgdisk -n "${POOL_PART}:0:-10m" -t "2:bf00" "/dev/nvme1n1"
sgdisk -n "${POOL_PART}:0:-10m" -t "2:bf00" "/dev/nvme2n1"
zpool create -f -o ashift=9 \
-O compression=lz4 \
-O acltype=posixacl \
-O xattr=sa \
-O relatime=on \
-O encryption=aes-256-gcm \
-O keylocation=file:///etc/zfs/zroot.key \
-O keyformat=passphrase \
-o autotrim=on \
-m none \
zroot raidz1 /dev/nvme1n1p2 /dev/nvme0n1p2 /dev/nvme2n1p2
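Once the pool is created, it's worth sanity-checking that the raidz1 vdev really contains all three partitions:

zpool status zroot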
I was testing the Fedora cloud image on KVM while using NoCloud cloud-init. I like to have my “trusted” home devices (and virts) publish their hostname via their DHCP client request towards my router.
My router has a script that generates static DNS entries based on the hostname value of a client’s DHCP request.
I’ve noticed that a few “cloud” images that I’ve been trying out don’t usually propagate the “hostname” value from the meta-data of NoCloud to the DHCP settings.
So far: Alpine, which uses dhclient via OpenRC, and Fedora cloud, which uses systemd + NetworkManager for its DHCP client.
I guess this is the standard. It’s stealthier this way, but I don’t need the stealth.
Notice the runcmd. This is the current naming scheme for Fedora cloud as of 2024/01:
#cloud-config
packages:
  - sudo
users:
  - name: myuser
    primary_group: myuser
    ssh_authorized_keys:
      - ssh-rsa YOUR_PUBKEY comment
    sudo: "ALL=(ALL) NOPASSWD:ALL"
    groups: wheel
    shell: /bin/bash
runcmd:
  - 'nmcli con modify "cloud-init eth0" ipv4.dhcp-hostname myhostname'
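For completeness, a minimal NoCloud meta-data file that pairs with the above might look like this (values are illustrative; local-hostname is where the hostname comes from in the first place):

instance-id: fedora-test-01
local-hostname: myhostname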
I was playing around with Alpine Linux cloud images. Apparently, cloud-init in Alpine Linux creates locked user accounts by default.
To get around this, I am using the * password hash (not sure if needed), which should not match any password.
In addition to this, I also have to unlock the account with runcmd, which happens after the user is created. This is different from bootcmd, which happens earlier.
It is worth noting that neither of these two hacks are needed in Fedora cloud images.
#cloud-config
packages:
  - sudo
users:
  - name: myuser
    passwd: "*"
    primary_group: myuser
    ssh_authorized_keys:
      - ssh-rsa YOUR_SSH_PUBLIC_KEY keycomment
    sudo: "ALL=(ALL) NOPASSWD:ALL"
    groups: wheel
    shell: /bin/ash
runcmd:
  - passwd -u myuser
Not a networking expert. A better explanation of this can be found in Mikrotik’s NAT Documentation
Assumption: Typical home LAN with a router that provides access to the internet.
I would like to access my home server by using the Public IP that’s assigned to the router’s “WAN” port.
I thought: simple! Just use a port forwarding rule (DST-NAT):
Chain: dstnat
Input interfaces: LAN (Mikrotik specific, interface lists)
DST port: 443
Protocol: TCP
DST address: <router-pub-ip>
Action: dst-nat
To-Address: <home-server-private-ip>
Unfortunately, when used from within the home network I get a timeout while trying to connect to the home server via the public IP.
What goes on “under the hood” is this: the router DST-NATs the packet to the home server, but the server sees the client’s LAN address as the source and replies to it directly. The client expects a reply from the public IP, so it drops the response and the connection times out.
To make this type of connectivity work, we also need to set up a SNAT (source NAT, masquerade… use your preferred term). This Source NAT is not the one we already have for LAN->WAN connectivity. This one is specific to LAN->Public-IP->Home-Server traversal, so, depending on our use case, we need to have something like this:
Chain: srcnat
Source Address: 192.168.0.0/16
Dst Address: <home-server-private-ip>
DST port: 443
Protocol: TCP
Action: masquerade
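For reference, the same two rules expressed as RouterOS CLI commands (a sketch; substitute the placeholders with your own addresses):

/ip firewall nat add chain=dstnat dst-address=<router-pub-ip> protocol=tcp dst-port=443 action=dst-nat to-addresses=<home-server-private-ip>
/ip firewall nat add chain=srcnat src-address=192.168.0.0/16 dst-address=<home-server-private-ip> protocol=tcp dst-port=443 action=masquerade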
Of course, if you have a wide range of ports you’d like to loop back from LAN to your home server (via the public IP), then this rule should be changed accordingly. It is even simpler if you want your home server to be a “catch all” for any LAN->Public-IP communication.
If your setup is slightly more robust, say, the home server(s) are on a separate subnet, then you don’t need the source NAT rule.
I’m not actually sure EXACTLY why passing back through the router is “magical”, but the masquerade rule is what forces the server’s replies back through the router instead of straight to the client, and that makes the loop work.
I’m by no means an expert on the topics of VFIO/IOMMU/PCI passthrough in Linux. I found fenguoerbian’s blog post halfway through documenting my steps, and I’d encourage you to go read it because it covers more use cases and is just more comprehensive.
Other good reads:
Who is this guide for? Me, really. Writing this mostly as a document of what I did to get where I am. Having this written down in Ansible or similar IaC won’t retell the whole story of the tools and articles used to figure out what needed to be done.
Have Alpine Linux be the libvirt host on bare-metal x86_64. Have some PCI (USB) devices available to be passed through to guests by using early VFIO binding.
The guide assumes that your (my) target system has VT-d or AMD’s equivalent supported and enabled in the BIOS. An Intel-based system is used here, so your kernel parameters and module options will differ for AMD.
Options during setup:
I run a DHCP server where I manage any static entries, so aside from choosing br0 at install time, only the router config needs to be modified.
As for libvirt requirements, ensure the tun driver loads on boot:
cat /etc/modules | grep tun || echo tun >> /etc/modules
The comfort of running virt-manager remotely costs us having to install dbus, polkit and some other dependencies, so I opted for a leaner system that I’ll manage with virsh when I SSH in.
The only thing that needs explanation is the libvirt-guests package, which makes the host gracefully shut down guests before shutting itself off.
apk add libvirt-daemon qemu-img qemu-system-x86_64 qemu-modules openrc vim pciutils usbutils wget
rc-update add libvirtd
rc-update add libvirt-guests
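To start the daemon right away instead of rebooting (standard OpenRC):

rc-service libvirtd start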
For this section, it’s best to first read up on the Arch Linux Wiki on how PCI devices relate to IOMMU groups.
First, let’s ensure IOMMU is enabled at boot. In /etc/update-extlinux.conf, add intel_iommu=on and iommu=pt to default_kernel_opts. For example:
cat /etc/update-extlinux.conf
default_kernel_opts="quiet rootfstype=ext4 intel_iommu=on iommu=pt"
Run:

update-extlinux

REBOOT! After the reboot, let’s figure out what PCI devices we want to pass through. In my case, I needed to pass through a few USB devices from the host.
This one is fairly easy. Just use lspci and make a note of the device(s) you are interested in.
It is often the case that we have to isolate more than just the device we want because they share an IOMMU group. More on that later.
This is where it gets tricky. USB devices (to my knowledge) are always a “child” of a PCI device. We need to figure out the PCI->USB relation before we proceed.
First step:
dmesg | grep 'usb [0-9]-[0-9]' | grep Product:
Find entries of devices you want to isolate from the host. In my case:
[ 2.399790] usb 3-1: Product: C-Media USB Headphone Set
[ 3.435788] usb 3-2: Product: Sonoff Zigbee 3.0 USB Dongle Plus
[ 4.353124] usb 4-2: Product: USB Audio CODEC
Now use fenguoerbian’s fantastic script:
for usb_ctrl in $(find /sys/bus/usb/devices/usb* -maxdepth 0 -type l); do
    pci_path="$(dirname "$(realpath "${usb_ctrl}")")"
    echo "Bus $(cat "${usb_ctrl}/busnum") --> $(basename $pci_path) (IOMMU group $(basename $(realpath $pci_path/iommu_group)))"
    lsusb -s "$(cat "${usb_ctrl}/busnum"):"
    echo
done
…which will print out this convenient list of IOMMU groups in relation to USB devices. Conveniently, all three of my USB devices are in the same IOMMU group. To be more precise, USB bus 3 and USB bus 4 are in the same IOMMU group:
Bus 3 --> 0000:00:1a.0 (IOMMU group 4)
...
Bus 003 Device 001: ID 1d6b:0001
...
Bus 004 Device 001: ID 1d6b:0001
...
Bus 003 Device 002: ID 0d8c:000c
...
Bus 4 --> 0000:00:1a.1 (IOMMU group 4)
...
Bus 003 Device 001: ID 1d6b:0001
...
Bus 004 Device 001: ID 1d6b:0001
...
Bus 003 Device 002: ID 0d8c:000c
...
But, if we want to pass through the relevant controllers, we have to isolate all the devices in IOMMU group 4. Let’s check out what we have in that whole group:
shopt -s nullglob
for g in $(find /sys/kernel/iommu_groups/* -maxdepth 0 -type d | sort -V); do
echo "IOMMU Group ${g##*/}:"
for d in $g/devices/*; do
echo -e "\t$(lspci -nns ${d##*/})"
done;
done;
Output:
...
IOMMU Group 4:
00:1a.0 USB controller [0c03]: Intel Corporation 82801JD/DO (ICH10 Family) USB UHCI Controller #4 [8086:3a67] (rev 02)
00:1a.1 USB controller [0c03]: Intel Corporation 82801JD/DO (ICH10 Family) USB UHCI Controller #5 [8086:3a68] (rev 02)
00:1a.2 USB controller [0c03]: Intel Corporation 82801JD/DO (ICH10 Family) USB UHCI Controller #6 [8086:3a69] (rev 02)
00:1a.7 USB controller [0c03]: Intel Corporation 82801JD/DO (ICH10 Family) USB2 EHCI Controller #2 [8086:3a6c] (rev 02)
...
I do have a whole other set of USB devices I can use on the host, so no problem there.
This concludes the USB detective work. Next up, we’ll isolate IOMMU group 4 from the OS so we can pass those devices through to the guest.
Ensure VFIO kernel drivers are loaded into the initramfs:
cat <<EOT > /etc/mkinitfs/features.d/vfio.modules
kernel/drivers/vfio/vfio.ko.*
kernel/drivers/vfio/vfio_virqfd.ko.*
kernel/drivers/vfio/vfio_iommu_type1.ko.*
kernel/drivers/vfio/pci/vfio-pci.ko.*
EOT
And add all devices from your IOMMU group to the ids= parameter of vfio-pci:
cat <<EOT > /etc/modprobe.d/vfio.conf
options vfio-pci ids=8086:3a67,8086:3a68,8086:3a69,8086:3a6c
options vfio_iommu_type1 allow_unsafe_interrupts=1
softdep igb pre: vfio-pci
EOT
Don’t forget to run:

mkinitfs

And verify that the drivers are in the initramfs by running:
mkinitfs -l | grep vfio
Having the modules ready is one thing, but we also need to invoke them early. In /etc/update-extlinux.conf, update the default_kernel_opts and modules sections to something like this:
grep '^default_kernel_opts\|^modules' /etc/update-extlinux.conf
Output:
(you don’t necessarily have the crypto stuff, just focus on iommu and vfio-pci here)
default_kernel_opts="cryptroot=UUID=eec5190e-eebd-4985-9abc-36a61341e038 cryptdm=root quiet rootfstype=ext4 intel_iommu=on iommu=pt"
modules=sd-mod,usb-storage,ext4,vfio,vfio-pci,vfio_iommu_type1,vfio_virqfd
Don’t forget to run:
update-extlinux
Reboot. Let’s check if we’re in business:
dmesg | grep vfio
And you should see something like:
[ 1.163469] vfio_pci: add [8086:3a67[ffffffff:ffffffff]] class 0x000000/00000000
[ 1.163515] vfio_pci: add [8086:3a68[ffffffff:ffffffff]] class 0x000000/00000000
[ 1.163543] vfio_pci: add [8086:3a69[ffffffff:ffffffff]] class 0x000000/00000000
[ 1.179966] vfio_pci: add [8086:3a6c[ffffffff:ffffffff]] class 0x000000/00000000
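You can also spot-check a single controller with lspci; the address below is from my IOMMU group, so adjust it to yours. The output should include a “Kernel driver in use: vfio-pci” line:

lspci -nnk -s 00:1a.0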
Now fetch a guest image. In my case:
wget https://github.com/home-assistant/operating-system/releases/download/7.5/haos_ova-7.5.qcow2.xz -O - | xzcat > home-assistant.qcow2
Maybe don’t use the name “default” as it might already exist, but:
virsh pool-define-as default dir - - - - "/var/lib/libvirt/images"
mv home-assistant.qcow2 /var/lib/libvirt/images/
…and refresh the storage pool:
virsh pool-refresh default
Earlier, when we discovered the whole IOMMU group, the lines began with:

00:1a.0 USB controller [0c03]:.....

Translate the PCI addresses you need into the format expected by --hostdev (00:1a.0 becomes pci_0000_00_1a_0):
virt-install \
--name "homeassistant" \
--vcpus 2 \
--cpu host \
--memory 4096 \
--sysinfo host \
--import \
--boot uefi \
--os-variant=alpinelinux3.14 \
--disk vol=default/home-assistant.qcow2,bus=virtio \
--network bridge=br0,mac=52:54:00:C4:2D:4A \
--graphics none \
--video none \
--sound none \
--input none \
--memballoon none \
--hostdev pci_0000_00_1a_0 \
--hostdev pci_0000_00_1a_1
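Assuming the import succeeded, the new guest should now show up and be manageable over SSH:

virsh list --all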
Et voilà!
Run:
xhost +local:root
sudo docker run --rm -ti --net=host --env="DISPLAY=$DISPLAY" ubuntu:16.04
Then, in the container:
export DEBIAN_FRONTEND=noninteractive
apt-get update
apt-get install net-tools wget python python-gtk2 python-gnome2 -y
wget https://download.foldingathome.org/releases/public/release/fahcontrol/debian-stable-64bit/v7.6/fahcontrol_7.6.13-1_all.deb
dpkg -i fahcontrol_7.6.13-1_all.deb
# Start it up:
FAHControl
FAHControl will try to connect to localhost:36330 by default. If you have the FAH client running on a remote host, you can port forward:
ssh <RemoteFoldingHost> -L 36330:localhost:36330
I was revisiting some of my old scripts, and found this messy piece of code that attempts to grab the latest “minimal” AWS AMI - HVM that’s EBS backed:
aws --query 'Images[*].[Name,ImageId]' \
--output text \
ec2 describe-images \
--owners amazon \
--filters \
"Name=root-device-type,Values=ebs" \
"Name=architecture,Values=x86_64" \
"Name=virtualization-type,Values=hvm" \
"Name=image-type,Values=machine" \
"Name=is-public,Values=true" | grep minimal
| sort | tail -n1 | awk '{print $2}'
I knew that there is a better way of doing this, but first I wanted to flex my JQ muscles before googling.
One thing I forgot is to sanitize the “query” portion of the previously used command.
Bad command, notice the --query:
aws --query 'Images[*].[Name,ImageId]' \
--output json \
ec2 describe-images \
--owners amazon \
--filters \
"Name=root-device-type,Values=ebs" \
"Name=architecture,Values=x86_64" \
"Name=virtualization-type,Values=hvm" \
"Name=image-type,Values=machine" \
"Name=is-public,Values=true"
Output I had to deal with:
[
[
"Windows_Server-2008-R2_SP1-English-64Bit-SQL_2012_RTM_SP2_Enterprise-2018.07.11",
"ami-ffe1e514"
],
[
"amzn-ami-hvm-2016.03.2.x86_64-ebs",
"ami-fff61890"
]
]
An increase in difficulty, for sure, but not impossible to filter the way I want. To reiterate: I want a single, latest “minimal” Amazon Linux AMI.
This is what I came up with:
aws --query 'Images[*].[Name,ImageId]' \
--output json \
ec2 describe-images \
--owners amazon \
--filters \
"Name=root-device-type,Values=ebs" \
"Name=architecture,Values=x86_64" \
"Name=virtualization-type,Values=hvm" \
"Name=image-type,Values=machine" \
"Name=is-public,Values=true" |
jq -r '
[
.[] | select(.[0] | test("^amzn-ami-minimal-hvm")) |
{
ami: .[1],
name: .[0],
day: (.[0] | match("\\d{8}") | .string)
}
] | sort_by(.day)[-1].ami
'
Translation:

- select(.[0] | test("^amzn-ami-minimal-hvm")) keeps only the “minimal” HVM image entries.
- day: (.[0] | match("\\d{8}") | .string) extracts the date stamp from the image name. The value of the “day” property will have the YYYYMMDD format.
- sort_by(.day)[-1].ami sorts by that date and takes the AMI ID of the latest entry.
This solves my task, but before I got to optimizing the JQ, I noticed the malicious --query. I was wondering why the pre-JQ AWS CLI output was so sparse! Removing the “malicious” --query option shows us that the output we have to deal with is substantial:
{
    "Images": [
        {
            "Architecture": "x86_64",
            "CreationDate": "2016-06-03T23:22:31.000Z",
            "ImageId": "ami-fff61890",
            "ImageLocation": "amazon/amzn-ami-hvm-2016.03.2.x86_64-ebs",
            "ImageType": "machine",
            "Public": true,
            "OwnerId": "137112412989",
            "State": "available",
            "BlockDeviceMappings": [
                {
                    "DeviceName": "/dev/xvda",
                    "Ebs": {
                        "DeleteOnTermination": true,
                        "SnapshotId": "snap-c70259f1",
                        "VolumeSize": 8,
                        "VolumeType": "standard",
                        "Encrypted": false
                    }
                }
            ],
            "Description": "Amazon Linux AMI 2016.03.2 x86_64 HVM EBS",
            "Hypervisor": "xen",
            "ImageOwnerAlias": "amazon",
            "Name": "amzn-ami-hvm-2016.03.2.x86_64-ebs",
            "RootDeviceName": "/dev/xvda",
            "RootDeviceType": "ebs",
            "SriovNetSupport": "simple",
            "VirtualizationType": "hvm"
        },
        ...
    ]
}
This allowed me to eliminate JQ. The only reason I previously used JQ was its ability to extract matches with the match() function, and I am not aware whether JMESPath can do this. Let’s use a proper --query this time!
aws --output text \
ec2 describe-images \
--owners amazon \
--filters \
"Name=root-device-type,Values=ebs" \
"Name=architecture,Values=x86_64" \
"Name=virtualization-type,Values=hvm" \
"Name=image-type,Values=machine" \
"Name=is-public,Values=true" \
--query '
Images[?starts_with(ImageLocation,`amazon/amzn-ami-minimal-hvm-`) == `true`] |
sort_by(@, &CreationDate)[-1:].ImageId
'
Much better, but the execution time is still 5-ish seconds, since most of the filtering happens client-side after AWS returns a lot of results.
The last thing to do was to check whether someone has done it better. The first result that came up was from an AWS blog post on how to do exactly what I was trying to do.
The first thing I was not aware of is that you can use wildcards in --filters, so I improved my last iteration:
aws --output text \
ec2 describe-images \
--owners amazon \
--filters \
"Name=root-device-type,Values=ebs" \
"Name=architecture,Values=x86_64" \
"Name=virtualization-type,Values=hvm" \
"Name=image-type,Values=machine" \
"Name=is-public,Values=true" \
"Name=name,Values=amzn-ami-minimal-hvm-*" \
--query '
sort_by(Images, &CreationDate)[-1:].ImageId
'
Execution time is now sub-second, and the code is much cleaner. This is thanks to this line: "Name=name,Values=amzn-ami-minimal-hvm-*"
However, and this is the second thing I was not aware of, the blog post shows the most deterministic way to get the latest image as per my spec using SSM. That’s how I found out about the AWS SSM Parameter Store. Quite handy!
So, the command I ended up going with is this:
aws ssm get-parameter \
--name /aws/service/ami-amazon-linux-latest/amzn-ami-minimal-hvm-x86_64-ebs \
--query 'Parameter.Value' \
--output text
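Since this prints a plain AMI ID, it nests nicely into other commands. For example (launching directly and the instance type are purely illustrative):

aws ec2 run-instances \
    --instance-type t2.micro \
    --image-id "$(aws ssm get-parameter \
        --name /aws/service/ami-amazon-linux-latest/amzn-ami-minimal-hvm-x86_64-ebs \
        --query 'Parameter.Value' \
        --output text)"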
Google is your friend, but you end up learning a lot by trying things out for yourself :)
I wanted to test out my Ansible setup that provisions some of the hosts I own, including my workstation. I’m currently switching from Fedora 28 to 29. Even though upgrades have been going without a hitch for me since Fedora 25, I want to do a hard reset and test my Ansible setup against a fresh Fedora 29.
I have a UEFI system, a GPT partition table on my SSD, and an encrypted XFS root partition that makes up the bulk of the drive. All of this is easily emulated with KVM/libvirt.
Because my linux workstation has 32GB of ram, I wanted to see what my options are when dealing with in-memory storage. To my knowledge, I have two options when it comes to libvirt and in-memory storage:
tmpfs is the more modern approach to in-memory storage on Linux, but it doesn’t use a RAM-backed block device that can be used outside of tmpfs (to my knowledge).
The brd kernel module accepts parameters to control how many /dev/ram* devices we get, and what their sizes are.
Set up a ~17.5GB ramdisk at /dev/ram0:
sudo modprobe brd rd_size=18432000 max_part=1 rd_nr=1
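A quick check that the device showed up with the expected size:

lsblk /dev/ram0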
Define a “ramblock” storage pool for libvirt:
sudo virsh pool-define-as --name ramblock --type disk --source-dev /dev/ram0 --target /dev
Build the ramblock storage pool:
sudo virsh pool-build ramblock
Start the storage pool:
sudo virsh pool-start ramblock
Create the volume. The volume name must be ram0p1:
sudo virsh vol-create-as ramblock ram0p1 18350316k
Create your VM and specify that you wish to use ram0p1 for your storage device (under the ramblock pool). I used the virt-manager GUI for this.
(Optional) To delete the volume with virsh, you need to do:
sudo virsh vol-delete ram0p1 --pool ramblock
…unfortunately, this is buggy. If that fails, do:
sudo parted /dev/ram0 rm 1
If you needed to use parted, give it a few minutes until the volume disappears from the list:
sudo virsh vol-list --pool ramblock
Destroy (stop) the volume-pool:
sudo virsh pool-destroy ramblock
Unload the brd kernel module (or suffer memory exhaustion!):
sudo rmmod brd
(Optional) Undefine the volume pool. It’s fine to leave it as it won’t auto-start unless you made it so:
sudo virsh pool-undefine ramblock
Create a mount point and mount a tmpfs of the desired size:

sudo mkdir -p /var/lib/libvirt/ramdisk-storage-pool
sudo mount -t tmpfs -o size=18000M tmpfs /var/lib/libvirt/ramdisk-storage-pool
Define the volume pool:
sudo virsh pool-define-as --name ramdisk --type dir --target /var/lib/libvirt/ramdisk-storage-pool
Start the storage pool:
sudo virsh pool-start ramdisk
Create the volume (naming is up to you):
sudo virsh vol-create-as ramdisk fedora29 18350316k
Create your VM and specify that you wish to use fedora29 for your storage device (under the ramdisk pool). I used the virt-manager GUI for this.
(Optional) To delete the volume with virsh, you need to do:
sudo virsh vol-delete fedora29 --pool ramdisk
Destroy (stop) the volume-pool:
sudo virsh pool-destroy ramdisk
Unmount tmpfs (or suffer memory exhaustion!):
sudo umount /var/lib/libvirt/ramdisk-storage-pool
(Optional) Undefine the volume pool. It’s fine to leave it as it won’t auto-start unless you made it so:
sudo virsh pool-undefine ramdisk
I have performed three tests:
|                   | brd-nocache | brd  | tmpfs | my SSD |
|-------------------|-------------|------|-------|--------|
| Latency (msec)    | 0.05        | 0.04 | 0.06  | 0.23   |
| Throughput (GB/s) | 2.9         | 3.1  | 3.5   | 0.416  |
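The exact benchmark invocations aren’t reproduced here, but a fio run along these lines (parameters are illustrative, not the ones behind the table) measures this kind of sequential-read throughput and latency:

# Point --filename at the device or file under test (/dev/ram0, the
# tmpfs-backed volume, or the SSD). direct=1 bypasses the page cache
# so the device itself is measured.
fio --name=seqread --filename=/dev/ram0 --rw=read --bs=1M \
    --ioengine=libaio --direct=1 --size=4G --group_reporting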
To my surprise, tmpfs performed best in terms of throughput, while brd with “hypervisor default” for cache had the best latency results.
All in-memory based tests had more consistent read speeds compared to my SSD that would have a much higher variation.
The latency benefit is obvious compared to the SSD.
Cached brd might be the best solution latency-wise for this particular use-case, but I consider tmpfs to be easier to set up.
Check whether a ramdisk-backed FS performs any better than tmpfs. I doubt it, because it uses brd as a backing store, but it’s worth checking.
This is very useful if you are on a fast machine. I use the following on my Core-i7 4770:
PACKER_KEY_INTERVAL=10ms packer <rest-of-packer-params>
Assuming you are running on a host that has KVM installed and working, you can do:
vagrant plugin install vagrant-libvirt
Because most vagrant images come with VirtualBox support, do check out Chef’s Bento project!
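After that, bringing up a box under KVM is the usual flow (the box name is just an example of a libvirt-capable box):

vagrant init generic/fedora29
vagrant up --provider=libvirt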