Friday, 2024-03-01

*** adamcarthur55 is now known as adamcarthur502:52
opendevreviewGhanshyam proposed opendev/irc-meetings master: Retire TripleO: remove the TripleO meeting info  https://review.opendev.org/c/opendev/irc-meetings/+/91069202:58
opendevreviewMerged opendev/irc-meetings master: Retire TripleO: remove the TripleO meeting info  https://review.opendev.org/c/opendev/irc-meetings/+/91069203:28
tkajinamI'm wondering if there is an easy way to partially restore tripleo projects (like only zuul projects ?), to unblock a few projects07:29
tkajinamI know ideally we should address these config errors but we are close to freeze/release timings so it may be helpful if we can pend that work now till the release is made...07:30
fricklertkajinam: this should likely rather be discussed in an openstack context, but do you have examples where this is actually blocking? I would very much rather like to help to move forward and not have a release that somehow still depends on tripleo things09:23
*** ykarel is now known as ykarel|away14:33
opendevreviewMerged openstack/project-config master: Add OpenAPI related repos to OpenStackSDK project  https://review.opendev.org/c/openstack/project-config/+/91058014:39
TheJuliafrickler: tkajinam: One thing I suspect all involved might want to be careful is a desire to maintain continuity, where as maybe we don't need continuity. It is okay to let things go, after all. Just... difficult. :)14:54
TheJuliaJust thinking out loud, it might make sense to leverage coredumpctl for core files on CI nodes, in the event the run context of the process cannot write to the current working directory15:05
fungiwhat's involved in turning that on?15:07
TheJuliabasically resetting the sysctrl15:07
fungidoes that have to be done during boot or can it be done at runtime?15:07
TheJuliakernel.core_pattern=|/usr/lib/systemd/systemd-coredump %P %u %g %s %t %c %e15:08
TheJuliaruntime it looks like, although I'm still fighting to get dnsmasq to give me a core dump15:08
TheJuliaokay, apparently it is resource limiting which is preventing me at the moment15:08
fungiif you do manage to figure it out, that'll be the first step in collecting/recording crashdumps in jobs (obviously we'd want to avoid doing that for sensitive jobs where it might leak memory regions containing secrets or something)15:09
TheJuliayeah15:09
Clark[m]Ya that seems like something jobs should opt into explicitly rather than us setting a default to on15:11
Clark[m]But we can have a role to do it and make it easy15:11
TheJuliaso other challenge is the ulimit is also defaulted out to something stupidly low15:24
fungiheading out to grab lunch and vote but shouldn't be too long if anyone needs anything15:59
fungishould be back by 17:30 utc at the latest15:59
SvenKieskemhm, it is possible to have multiple parents inside a zuul job, isn't it?16:16
SvenKieskewhen can we replace dnsmasq with an evolved dnsdist? :D16:17
TheJulialooks like the only way to get a core, as deep as something like dnsmasq, is to manually tag the process because it gets its limits constrained for core dumps down to 0 somewhere along the way16:21
TheJuliaso prlimit ++ --pid option16:21
Clark[m]SvenKieske it is possible to have multiple parents but I'm not sure it is encouraged/documented. system-config-run-containers in opendev/system-config is an example of this16:22
SvenKieskeClark[m]: thanks, for now I have avoided doing it and just copied instead some stuff around, it's all in a testing stage currently, so not a big deal16:24
SvenKieskemy linter also yells at me when I add multiple parents, but that might just be misconfigured.16:24
Clark[m]You can't provide the parents as a list. The way you declare this is pretty specific16:25
SvenKieskeTheJulia: I fuzzily remember that the ability to being coredumped somehow is tied to CAP_PTRACE, do you happen to know if that is correct? because that might just be dropped, especially in containerized deployments.16:25
TheJulia... that might actually be it16:26
SvenKieskeClark[m]: ah I see, yeah it wants a string, so comma delimited or something I guess?16:26
TheJuliaI just... I'm already low on spoons and it being in a namespace has not helped :)16:26
TheJuliabut I got one16:26
SvenKieskewell that's why I wrote :) hope it helps16:26
Clark[m]No, it is more complicated than that. Hence the lack of docs and recommendation. You'll want to look at the example if you try it16:28
SvenKieskealso dnsmasq afaik does drop some caps on it's own (also a fuzzy and maybe wrong memory). we recently had to change podman caps afaik for dnsmasq to also let cap_net_raw through because cap_admin was not enough, there was an old ML thread linked where the handling of caps in dnsmasq was discussed16:29
SvenKieskeif you're interested it's all linked here, maybe there is something useful for your debugging as well there: https://bugs.launchpad.net/kolla-ansible/+bug/205528216:30
SvenKieskeI really wish for a better/modern dnsmasq alternative though :-/ it's a very useful software for sure, but I feel it's not really well equiped for various cloud scenarios.16:31
TheJuliaSvenKieske: so, I've got one of the folks who lives/breaths it in another window and it took him a while but in his case it was the systemd unit config on his test machine16:33
TheJulia:)16:33
SvenKieskeah, yeah well, systemd units also drop many capabilities by default these days16:33
TheJuliayeah16:34
SvenKieskeoh oh... something broken on the interwebz? ERROR! Error when finding available api versions from default (https://galaxy.ansible.com) (HTTP Code: 503, Message: Service Unavailable)16:35
SvenKieskemhm, works manually at least from my machine16:35
Clark[m]I would say use our proxy cache but it hasn't worked since galaxy redid all of its APIs16:42
SvenKieskeyeah well, it failed in opendev CI :D https://zuul.opendev.org/t/openstack/build/81a67d99f02a4591b681af74432376b016:55
Clark[m]Right but that was against the upstream servers. I can't make them not 50316:56
SvenKieskeI'm used to mirrors being flaky and crumbling CI and all, but I think it's the first time I saw an actual 503 error code in opendev ci16:56
SvenKieskesure :)16:56
Clark[m]It has nothing to do with us. 503 is a server error and the server is wherever galaxy hosts16:56
SvenKieskeyes I know, that was a sincere "sure"!16:58
SvenKieskecommunication via the internet is still hard.16:59
SvenKieskemhm, github also doesn't like me, prints an angry looking unicorn instead :D17:01
SvenKieskereload helps17:01
gtemasvenkieske - it's friday evening. What else do you expect, everybody is applying fixes now17:02
SvenKieskegtema: I personally am a big fan of read only friday, I have spent enough weekends fixing production after botched friday noon deployments :)17:02
SvenKieskeI guess I call it a day :D17:03
gtema:)17:03
SvenKieskedid you got any positive replies regarding the new rusty openstacksdk stuff btw?17:03
SvenKieskeI wanted to post to the mailinglist, that being said, to voice some public support there17:04
gtemanot really, you are still welcome17:04
SvenKieskewill do that!17:07
gtematy17:07
fungiokay, i'm back. looks like i missed some lively banter while lunching17:31
clarkbI'm finding myself have a lazy morning and this afternoon I'm doing taxes. Probably for the best after yesterday's fun :)17:43
clarkbI did start to get some initial centos 7 and xenial cleanup changes pushed to openstack requirements yesterday though17:43
fungiyeah, today seems like a good day for catching up on stuff, far less exciting than yesterday anyway17:54
clarkbalso it is kinda of snowing18:02
fungikinda snowing is the best kinda snowing18:03
TheJuliaclarkb: fungi: unless you guys scream in short order, I'm going to give ssh access to that held vm to one of RH's developers to prod the coredump some18:07
clarkbTheJulia: I think that is fine18:07
clarkbgeneral test nodes should be safe to hand out like that as they don't have any important credentials on them (just public keys)18:07
clarkbthis isn't universally true but should be for your held node18:07
clarkbthank you for the heads up18:07
fungiTheJulia: go for it18:09
clarkbthe centos 7 stuff definitely has more tendrils than buster or opensuse did. swift seems to still run jobs for it across their projects so I've asked them about lceaning up there18:14
clarkbthen a handful of otherplaces appear to have centos 7 jobs too18:14
fungitopic on python discourse was asking about retiring the manylinux1 standard since it's based on rhel/centos 718:24
timburkethe main tricky thing i see there is that centos 8 stream ships pip 9.0.3, which doesn't know about manylinux201018:37
*** ralonsoh_ is now known as ralonsoh19:08
prometheanfireI guess fyi, some recent python updates (beyond urwid) seem to have stopped allowing gertty to sync, I think maybe I'll try it in a venv, downgrading libs to see what fixes it20:17
corvusprometheanfire: side note: https://review.opendev.org/910358  looks like it's not compatible with old urwid version (2.1.2) i haven't tried it with newer20:41
corvusprometheanfire: let me know if you'd like a pip freeze from my venvs to set a lower bound to help with tracking anything down20:42
prometheanfirecorvus: sure, that could help, latest release of urwid and git master of gertty should work20:47
prometheanfireit's other libs that are not working as well anymore20:47
corvusprometheanfire: https://paste.opendev.org/show/b0GGmCDSccHVUu2Jk5nc/20:50
prometheanfireolder sqlalchemy will help a bit too21:02

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!