Wednesday, 2021-05-12

*** hamalq has quit IRC00:07
*** auristor has quit IRC00:21
*** auristor has joined #opendev00:28
*** whoami-rajat has quit IRC01:23
openstackgerritIan Wienand proposed openstack/diskimage-builder master: bootloader: remove extlinux/syslinux path  https://review.opendev.org/c/openstack/diskimage-builder/+/54112901:35
openstackgerritIan Wienand proposed openstack/diskimage-builder master: Futher bootloader cleanups  https://review.opendev.org/c/openstack/diskimage-builder/+/79087801:35
*** darshna has quit IRC01:51
*** tkajinam has quit IRC03:34
*** tkajinam has joined #opendev03:34
openstackgerritIan Wienand proposed opendev/zone-opendev.org master: Remove pbx.opendev.org  https://review.opendev.org/c/opendev/zone-opendev.org/+/79089503:46
openstackgerritMerged opendev/zone-opendev.org master: Remove pbx.opendev.org  https://review.opendev.org/c/opendev/zone-opendev.org/+/79089503:59
*** ykarel has joined #opendev04:18
ianw#status log Asterisk PBX service retired; see https://review.opendev.org/c/opendev/system-config/+/79019004:39
openstackstatusianw: finished logging04:39
*** whoami-rajat_ has joined #opendev04:44
*** stevebaker has quit IRC04:52
*** hemanth_n has joined #opendev04:55
*** marios has joined #opendev05:05
openstackgerritIan Wienand proposed openstack/diskimage-builder master: bootloader: remove extlinux/syslinux path  https://review.opendev.org/c/openstack/diskimage-builder/+/54112905:46
openstackgerritIan Wienand proposed openstack/diskimage-builder master: Futher bootloader cleanups  https://review.opendev.org/c/openstack/diskimage-builder/+/79087805:46
openstackgerritMerged opendev/system-config master: Cleanup ssl_cert_check puppet components  https://review.opendev.org/c/opendev/system-config/+/78965206:02
*** ralonsoh has joined #opendev06:07
*** slaweq has joined #opendev06:16
*** ralonsoh has quit IRC06:48
*** sboyron has joined #opendev06:48
*** ralonsoh has joined #opendev06:49
*** tinwood has quit IRC06:54
*** tinwood has joined #opendev06:57
*** fressi has joined #opendev07:00
*** ysandeep|away is now known as ysandeep07:05
*** jhesketh has quit IRC07:08
*** tosky has joined #opendev07:09
*** hashar has joined #opendev07:16
*** andrewbonney has joined #opendev07:17
*** rpittau|afk is now known as rpittau07:26
*** fressi has quit IRC07:38
*** fressi has joined #opendev07:44
*** amoralej|off is now known as amoralej07:46
*** jpena|off is now known as jpena07:59
openstackgerritBenjamin Schanzel proposed zuul/zuul-jobs master: Allow Specifying Log File Retention in s3 Uploader  https://review.opendev.org/c/zuul/zuul-jobs/+/79005008:06
openstackgerritMerged openstack/diskimage-builder master: dib-lint: match text/x-script.python  https://review.opendev.org/c/openstack/diskimage-builder/+/79036908:09
openstackgerritMerged openstack/diskimage-builder master: containerfile: automatically search for distro docker files  https://review.opendev.org/c/openstack/diskimage-builder/+/79036208:19
openstackgerritMerged openstack/diskimage-builder master: bootloader: disable BLS for Fedora  https://review.opendev.org/c/openstack/diskimage-builder/+/79054908:19
*** jhesketh has joined #opendev08:23
*** ysandeep is now known as ysandeep|lunch08:39
*** sshnaidm_ has joined #opendev08:48
*** sshnaidm has quit IRC08:50
openstackgerritMartin Kopec proposed opendev/irc-meetings master: Update interop-wg-meeting info  https://review.opendev.org/c/opendev/irc-meetings/+/79092308:54
*** dtantsur|afk is now known as dtantsur09:03
*** rpittau is now known as rpittau|bbl09:07
*** darshna has joined #opendev09:08
*** sshnaidm_ is now known as sshnaidm09:28
openstackgerritDmitriy Rabotyagov proposed opendev/lodgeit master: Add py3.8 support  https://review.opendev.org/c/opendev/lodgeit/+/77347910:00
openstackgerritDmitriy Rabotyagov proposed opendev/lodgeit master: Redesign manage.py to not use deprecated werkzeug.script  https://review.opendev.org/c/opendev/lodgeit/+/69337810:09
*** rpittau|bbl is now known as rpittau10:19
*** hashar has quit IRC10:30
*** sshnaidm is now known as sshnaidm|afk10:46
*** hashar has joined #opendev11:08
*** dtantsur is now known as dtantsur|brb11:19
*** ysandeep|lunch is now known as ysandeep11:19
*** jpena is now known as jpena|lunch11:30
fricklerinfra-root: we seem to have a significant number of jobs with post_failure where no logs have been uploaded. sadly no time to dig myself today12:17
fricklerhttps://zuul.opendev.org/t/openstack/build/65247750404549a0be5c858716045c33 is one example, but some more can be found e.g. on the status page currently in gate failures12:18
*** sshnaidm|afk is now known as sshnaidm12:19
*** jpena|lunch is now known as jpena12:27
fungithanks, probably one of the swift providers erroring on auth or write. will try to find it12:29
fungilooks like your example ran from ze1012:32
fungiWARNING:keystoneauth.identity.generic.base:Failed to discover available identity versions when contacting https://auth.cloud.ovh.net/12:34
funginothing obvious in progress at http://travaux.ovh.net/12:37
fungiyeah, looks like it's still happening based on a survey of timestamps in our executor lgs12:41
fungiTimeoutError is what's being raised12:41
fungii'll get an emergency patch in to stop uploading logs to ovh12:42
openstackgerritJeremy Stanley proposed opendev/base-jobs master: Temporarily stop uploading logs to OVH  https://review.opendev.org/c/opendev/base-jobs/+/79096112:48
openstackgerritJeremy Stanley proposed opendev/base-jobs master: Revert "Temporarily stop uploading logs to OVH"  https://review.opendev.org/c/opendev/base-jobs/+/79096212:48
fungiinfra-root: i'm going to bypass gating on the first of those and wip the second for now12:48
*** amoralej is now known as amoralej|lunch12:55
openstackgerritMerged opendev/base-jobs master: Temporarily stop uploading logs to OVH  https://review.opendev.org/c/opendev/base-jobs/+/79096112:55
fungi#status log Bypassed gating to merge https://review.opendev.org/790961 for temporarily disabling log uploads to one of our providers12:56
*** ykarel_ has joined #opendev12:56
openstackstatusfungi: finished logging12:56
*** ykarel has quit IRC12:58
*** ykarel_ is now known as ykarel13:00
*** rosmaita has joined #opendev13:05
*** whoami-rajat_ is now known as whoami-rajat13:17
*** timburke has quit IRC13:26
*** timburke has joined #opendev13:27
*** hemanth_n has quit IRC13:28
*** amoralej|lunch is now known as amoralej13:36
*** avass has quit IRC13:39
*** avass has joined #opendev13:39
*** calcmandan has quit IRC13:48
*** calcmandan has joined #opendev13:48
fungiheaded out to run some errands, should be back in an hour and will take a closer look at the ovh swift situation assuming nothing else catches fire in the meantime14:04
*** fressi has quit IRC14:04
*** hashar is now known as hasharAway14:09
*** ykarel_ has joined #opendev14:14
*** ykarel has quit IRC14:16
*** ykarel_ is now known as ykarel14:17
*** lucasagomes has joined #opendev14:20
*** dtantsur|brb is now known as dtantsur14:26
*** rosmaita has left #opendev14:53
*** pongboom2 has quit IRC14:58
fungiokay, back and catching up again15:10
*** hasharAway is now known as hashar15:18
*** mlavalle has joined #opendev15:22
fungilooks like all the builds which selected ovh for log uploads have finally completed as of ~50 minutes ago15:29
fungi#status notice Any builds with POST_FAILURE result and no available logs between 11:41 and 14:41 UTC today were related to an authentication endpoint problem in one of our providers and can be safely rechecked now15:30
openstackstatusfungi: sending notice15:30
-openstackstatus- NOTICE: Any builds with POST_FAILURE result and no available logs between 11:41 and 14:41 UTC today were related to an authentication endpoint problem in one of our providers and can be safely rechecked now15:30
fungiit looks like there was also a much shorter incident around 08:00 but i only counted a few builds which hit it15:31
fungilooks like the latest outage could be related to http://travaux.ovh.net/?do=details&id=50472 which is now marked "closed" but unfortunately the timing is unclear and the reason given is just "done"15:33
openstackstatusfungi: finished sending notice15:33
*** ysandeep is now known as ysandeep|dinner15:34
fungii'll exercise base-test a bit and see if i get a successful upload to ovh, then we can approve the revert15:34
*** ykarel has quit IRC15:37
openstackgerritJeremy Stanley proposed opendev/base-jobs master: DNM: exercise base-test  https://review.opendev.org/c/opendev/base-jobs/+/79101415:39
Tengu&1615:40
*** ysandeep|dinner is now known as ysandeep15:45
openstackgerritClark Boylan proposed opendev/zone-opendev.org master: Add zuul02  https://review.opendev.org/c/opendev/zone-opendev.org/+/79048016:03
openstackgerritClark Boylan proposed opendev/zone-opendev.org master: Swap zuul.opendev.org CNAME to zuul02.opendev.org  https://review.opendev.org/c/opendev/zone-opendev.org/+/79048216:03
openstackgerritClark Boylan proposed opendev/zone-opendev.org master: Reset zuul.o.o CNAME TTL to default  https://review.opendev.org/c/opendev/zone-opendev.org/+/79048316:03
*** lucasagomes has quit IRC16:03
clarkbfungi: do you think we can approve https://review.opendev.org/c/opendev/zone-opendev.org/+/790480/2 to avoid more merge conflicts?16:04
clarkbThen I'll plan to land https://review.opendev.org/c/opendev/system-config/+/790481 tomorrow first thing assuming my head is functionaing properly16:04
clarkb(and maybe we'll finish that swap tomorrow)16:04
*** marios is now known as marios|out16:05
fungidone16:07
clarkbthanks!16:07
clarkband if you haven't yet looking over https://etherpad.opendev.org/p/opendev-zuul-server-swap today would be great so we can call out any flaws with that plan16:08
openstackgerritMerged opendev/zone-opendev.org master: Add zuul02  https://review.opendev.org/c/opendev/zone-opendev.org/+/79048016:09
*** slaweq has quit IRC16:11
fungiabsolutely!16:12
*** rpittau is now known as rpittau|afk16:12
*** marios|out has quit IRC16:24
*** ralonsoh has quit IRC17:02
*** jpena is now known as jpena|off17:07
*** hashar has quit IRC17:10
*** hashar has joined #opendev17:11
*** ysandeep is now known as ysandeep|away17:27
*** amoralej is now known as amoralej|off17:34
fungiclarkb: not sure if you saw ianw's suggestion on 790481, but adding the new zuul server's inventory fqdn to the le altnames, if for no other reason than consistency. in a followup though, happy to push one if you haven't already17:48
*** andrewbonney has quit IRC17:56
*** dtantsur is now known as dtantsur|afk18:01
*** avass has quit IRC18:03
*** hashar has quit IRC18:04
*** avass has joined #opendev18:09
melwitthuh, I just noticed going to www.opendev.org fails "www.opendev.org’s DNS address could not be found"18:12
openstackgerritAde Lee proposed zuul/zuul-jobs master: Add role to enable FIPS on a node  https://review.opendev.org/c/zuul/zuul-jobs/+/78877818:17
fungimelwitt: you're right, there currently isn't one, but if we add one in https://opendev.org/opendev/zone-opendev.org/src/branch/master/zones/opendev.org/zone.db as a cname we'd want to make sure to add a redirect in https://opendev.org/opendev/system-config/src/branch/master/playbooks/roles/gitea/templates/gitea.vhost.j2 and altname for the certs in18:19
fungihttps://opendev.org/opendev/system-config/src/branch/master/inventory/service/host_vars/gitea01.opendev.org.yaml through 0818:19
melwittohh ok18:20
fungithankfully that would just be two changes in gerrit (the system-config change, and then the zone-opendev.org change setting depends-on the other)18:20
melwittok cool18:21
openstackgerritMartin Kopec proposed opendev/irc-meetings master: Update interop-wg-meeting info  https://review.opendev.org/c/opendev/irc-meetings/+/79092318:23
fungimelwitt: did you follow a link from somewhere to the www name, or just tried it on a whim?18:26
melwittfungi: tried it while testing something, needed dummy url so I just put www.opendev.org18:27
fungiahh, okay18:28
fungifrankly, we probably would have added it originally except we were doing direct layer-4 forwarding to gitea which didn't have an internal redirect mechanism, so would needed to have tacked on a separate system to handle the layer-7 redirects18:29
melwittoh ok18:30
fungisince then we injected apache in between haproxy and gitea to do ua-based filtering so we could deal with some misbehaving webcrawlers18:30
fungiso if we're okay with the idea of apache being there permanently, or at least something like it, then extending it to do redirects is likely fine18:31
fungialternatively we could host the redirect on a separate server, maybe static.opendev.org18:31
clarkbfungi: we should probably do that update to the actual change18:39
clarkbfungi: we also need a dns update for that altname18:39
clarkbthe reason I'm thinking we shouldn't do a followup is we'll generate one too many extra certs that we don't need18:40
fungiclarkb: oh, yep for the acme cname18:40
fungisystem-config-run-base failed in the gate on it anyway18:40
clarkboh I see you approved the change, but it failed18:40
clarkbya18:40
clarkbI can update it and push a zone update too18:40
fungiwfm, at the ready to reapprove18:41
openstackgerritClark Boylan proposed opendev/zone-opendev.org master: Swap zuul.opendev.org CNAME to zuul02.opendev.org  https://review.opendev.org/c/opendev/zone-opendev.org/+/79048218:45
openstackgerritClark Boylan proposed opendev/zone-opendev.org master: Reset zuul.o.o CNAME TTL to default  https://review.opendev.org/c/opendev/zone-opendev.org/+/79048318:45
openstackgerritClark Boylan proposed opendev/zone-opendev.org master: Add zuul02 acme cname  https://review.opendev.org/c/opendev/zone-opendev.org/+/79103918:45
clarkbfungi: https://review.opendev.org/c/opendev/zone-opendev.org/+/791039 that is the dns update18:45
openstackgerritClark Boylan proposed opendev/system-config master: Add zuul02 to inventory  https://review.opendev.org/c/opendev/system-config/+/79048118:47
openstackgerritClark Boylan proposed opendev/system-config master: Clean up zuul01 from inventory  https://review.opendev.org/c/opendev/system-config/+/79048418:47
clarkbfungi: ^ and that should update the le stuff if I did it correctly18:47
clarkbnope I think I did something wrong18:48
clarkboh wait no its fine I was looking at the child change18:48
clarkbdouble check me (also my brain isn't working so great)18:49
fungiyep, that looks correct to me18:50
corvusclarkb: qq why do we want a zuul02 cert?18:51
corvusdon't we have the le stuff set up so we can give zuul02 the zuul.o.o cert?18:51
fungicorvus: it's more for consistency so we can directly address the inventory fqdn of servers for testing connectivity when we have more than one18:52
fungijust ends up as a subject altname on the final cert18:53
corvusfungi: ok, we don't do that for zuul01 but i guess it's no time like the present to start.18:53
fungicorvus: yep, also zuul01 was in a weird state living in the openstack.org domain officially, so we didn't add records to that zone which weren't absolutely necessary, for a variety of reasons18:54
fungifor stuff in opendev.org it's trivial to include the inventory fqdns on certs18:54
openstackgerritMerged opendev/zone-opendev.org master: Add zuul02 acme cname  https://review.opendev.org/c/opendev/zone-opendev.org/+/79103918:56
clarkbya I think its mostly for consistency18:56
fungiright, though it might come in more handy after 5.018:57
fungiwhen we can expect to have more than one scheduler/web server18:57
clarkb++18:57
fungioutside of upgrade replacement transition periods i mean18:57
fungidoing that for the gitea certs has proven itself very useful, since it gives us an easy way for users to identify which backend they're hitting through a load balancer18:58
corvusi find it interesting we've never felt the need to scale out zuul-web18:59
fungii find it reassuring we've never needed to scale out zuul-web, that's for sure19:00
fungiseems to be highly efficient19:00
openstackgerritMartin Kopec proposed opendev/system-config master: refstack: trigger image upload  https://review.opendev.org/c/opendev/system-config/+/79104319:04
*** stevebaker has joined #opendev19:28
openstackgerritJeremy Stanley proposed opendev/bindep master: DNM: Exercise base-test job  https://review.opendev.org/c/opendev/bindep/+/79104819:36
*** hashar has joined #opendev19:43
*** slaweq has joined #opendev19:48
*** Jeffrey4l has quit IRC19:49
*** Jeffrey4l has joined #opendev20:00
*** sboyron has quit IRC20:13
spotzfungi clarkb - can you leet my RDO announcement through on openstack-discuss:)20:18
clarkbspotz: fungi is a list admin for that one and should be able to20:29
fungispotz: i did roughly an hour ago20:29
spotzHa sweet!20:29
spotzIt's so hhard cause you don't see your own posts20:30
fungii think that's a subscription setting, by default you receive copies of your own posts20:33
fungibut you can set your subscription to exclude them20:33
clarkbI have rechecked https://review.opendev.org/c/opendev/system-config/+/790481 looks like apache on the scheduler failed to start (I would expect and LE problem to have bubbled up sooner in the playbook?) I don't know I have much ability to debug further today, but maybe someone else can take a look if it fails again20:39
fungii've finished cooking/eating so can try to keep an eye on it20:43
fungihttps://review.opendev.org/791048 has 5 builds which successfully uploaded to ovh swift, so i've approved 790962 now20:47
openstackgerritMerged opendev/base-jobs master: Revert "Temporarily stop uploading logs to OVH"  https://review.opendev.org/c/opendev/base-jobs/+/79096220:52
openstackgerritMerged opendev/irc-meetings master: Update interop-wg-meeting info  https://review.opendev.org/c/opendev/irc-meetings/+/79092321:02
*** slaweq has quit IRC21:04
*** hashar has quit IRC21:13
openstackgerritMerged openstack/project-config master: Retire puppet-glare - Step 3: Remove Project  https://review.opendev.org/c/openstack/project-config/+/79008321:29
fungiclarkb: same job failed on the rerun... looking now to see if it's the same behavior21:32
fungiyup21:35
fungiUnable to reload service apache2: Job for apache2.service failed.21:35
*** openstackgerrit has quit IRC21:47
ianwsometimes we can see that if letsencrypt failed, there are some failure returns i think we don't catch correctly21:49
ianwthe acme log should reveal ... looking21:49
ianwit seems we didn't collect it, nor apache logs21:51
*** openstackgerrit has joined #opendev21:58
openstackgerritIan Wienand proposed opendev/system-config master: zuul job : collect some more logs  https://review.opendev.org/c/opendev/system-config/+/79105521:58
clarkbI wonder if I got something wrong with the zuul02.opendev.org altname addition?21:58
fungiahh, thanks, i got sidetracked with other stuff before i could dig deeper21:58
clarkbthough looking at it I don't see anything obviously wrong22:00
clarkbI guess we wait for the logs22:00
*** openstackgerrit has quit IRC22:04
*** timburke has quit IRC22:08
*** timburke has joined #opendev22:09
ianwCreating certs in /etc/letsencrypt-certs/zuul02.opendev.org22:23
ianwthat bit looks right22:23
*** darshna has quit IRC22:30
ianwSSLCertificateFile: file '/etc/letsencrypt-certs/zuul.opendev.org/zuul.opendev.org.cer' does not exist or is empty22:38
ianwshould be zuul02.opendev.org22:38
*** openstackgerrit has joined #opendev22:39
openstackgerritMerged opendev/system-config master: refstack: trigger image upload  https://review.opendev.org/c/opendev/system-config/+/79104322:39
ianwi have to do school run but can fix up ./zuul-web/templates/openstack.vhost.j2 after if you like22:41
*** tosky has quit IRC23:01
openstackgerritIan Wienand proposed opendev/system-config master: zuul-web : use hostname for LE cert  https://review.opendev.org/c/opendev/system-config/+/79106023:28
openstackgerritIan Wienand proposed opendev/system-config master: zuul-web : use hostname for LE cert  https://review.opendev.org/c/opendev/system-config/+/79106023:32
fungiaha, cool i was just looking at what needed to be edited there23:34
ianwi think if that's ok, remerge ontop of it and it should "just work" with the new hostname23:36
fungiyep23:37
fungiseems like it should be23:37
*** openstackgerrit has quit IRC23:49
*** mlavalle has quit IRC23:56

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!