Things I figure out, break, fix, and occasionally understand — written down so I don't forget them.
Most of this comes from real work: setting up clusters that didn't want to cooperate, dashboards that took longer to build than they should have, and the kind of config errors you only make once (or twice). If something here saves you an hour of digging through docs, great.