Tuesday, 2022-07-05

opendevreviewIan Wienand proposed opendev/system-config master: [wip] setup trusted CA for host comms in testing  https://review.opendev.org/c/opendev/system-config/+/84856200:23
opendevreviewIan Wienand proposed opendev/system-config master: [wip] setup trusted CA for host comms in testing  https://review.opendev.org/c/opendev/system-config/+/84856201:03
opendevreviewMerged opendev/system-config master: gerrit: trigger rebuild of images to promote 3.6  https://review.opendev.org/c/opendev/system-config/+/84867601:33
opendevreviewIan Wienand proposed opendev/system-config master: [wip] setup trusted CA for host comms in testing  https://review.opendev.org/c/opendev/system-config/+/84856202:00
ianwwe have a gerrit 3.6 tag now02:01
opendevreviewwangxiyuan proposed openstack/diskimage-builder master: [WIP]Add openEuler 22.02 LTS support  https://review.opendev.org/c/openstack/diskimage-builder/+/84868002:08
opendevreviewIan Wienand proposed opendev/system-config master: [wip] setup trusted CA for host comms in testing  https://review.opendev.org/c/opendev/system-config/+/84856202:40
opendevreviewIan Wienand proposed opendev/system-config master: [wip] test https  https://review.opendev.org/c/opendev/system-config/+/84868503:31
opendevreviewIan Wienand proposed opendev/system-config master: [wip] test haproxy to gitea  https://review.opendev.org/c/opendev/system-config/+/84868703:37
*** ysandeep|out is now known as ysandeep|ruck04:02
opendevreviewIan Wienand proposed opendev/system-config master: [wip] setup trusted CA for host comms in testing  https://review.opendev.org/c/opendev/system-config/+/84856204:10
opendevreviewIan Wienand proposed opendev/system-config master: [wip] test https  https://review.opendev.org/c/opendev/system-config/+/84868504:10
opendevreviewIan Wienand proposed opendev/system-config master: [wip] test haproxy to gitea  https://review.opendev.org/c/opendev/system-config/+/84868704:10
opendevreviewIan Wienand proposed opendev/system-config master: [wip] setup trusted CA for host comms in testing  https://review.opendev.org/c/opendev/system-config/+/84856204:35
opendevreviewIan Wienand proposed opendev/system-config master: [wip] test https  https://review.opendev.org/c/opendev/system-config/+/84868504:35
opendevreviewIan Wienand proposed opendev/system-config master: [wip] test haproxy to gitea  https://review.opendev.org/c/opendev/system-config/+/84868704:35
*** ysandeep|ruck is now known as ysandeep|ruck|afk04:38
opendevreviewSteve Baker proposed openstack/diskimage-builder master: Support LVM thin provisioning  https://review.opendev.org/c/openstack/diskimage-builder/+/84014404:56
opendevreviewSteve Baker proposed openstack/diskimage-builder master: WIP Add thin provisioning support to growvols  https://review.opendev.org/c/openstack/diskimage-builder/+/84868804:56
*** ysandeep|ruck|afk is now known as ysandeep|ruck05:08
opendevreviewIan Wienand proposed opendev/system-config master: [wip] setup trusted CA for host comms in testing  https://review.opendev.org/c/opendev/system-config/+/84856205:36
ianwi'm kind of glad i pulled on this thread as i think that having a CA and SAN setup correctly will remove a handy bunch of testing hacks05:39
*** ysandeep|ruck is now known as ysandeep|ruck|afk06:18
*** ysandeep|ruck|afk is now known as ysandeep|ruck07:11
*** kopecmartin_ is now known as kopecmartin07:39
opendevreviewwangxiyuan proposed opendev/system-config master: Update openEuler mirror repo  https://review.opendev.org/c/opendev/system-config/+/84870308:11
*** ysandeep|ruck is now known as ysandeep|lunch08:15
opendevreviewwangxiyuan proposed openstack/diskimage-builder master: [WIP]Upgrade openEuler to 22.02 LTS  https://review.opendev.org/c/openstack/diskimage-builder/+/84868008:41
*** ysandeep|lunch is now known as ysandeep|ruck10:16
*** rlandy|out is now known as rlandy10:24
*** arxcruz is now known as arxcruz|brb10:59
*** dviroel|biab is now known as dviroel11:29
*** arxcruz|brb is now known as arxcruz12:03
*** ysandeep|ruck is now known as ysandeep|brb12:52
*** ysandeep|brb is now known as ysandeep|ruck13:01
*** dasm|off is now known as dasm13:15
opendevreviewgnuoy proposed openstack/project-config master: Add ACL to allow collaboration with Trilio devs  https://review.opendev.org/c/openstack/project-config/+/84874614:07
*** ysandeep|ruck is now known as ysandeep|out14:39
opendevreviewMerged opendev/system-config master: Redirect all Mailman sites from HTTP to HTTPS  https://review.opendev.org/c/opendev/system-config/+/84831914:50
fungidigging deeper, i don't think we actually need to run fix_url.py since it doesn't appear that any of the list configs override web_page_url anyway, but i'll do some more testing to confirm that observation once the deploy completes14:52
*** dviroel is now known as dviroel|lunch15:07
fungiokay, scratch that, it must get baked into something that config_list doesn't emit15:53
clarkbmeaning we do need to run the script afterall?15:53
fungithe moderator interface for the openstack-discuss ml kept trying to submit the form to http and wouldn't accept my actions15:54
fungiso i ran fix_url on that list and it's working now15:54
fungii'll do the same for the rest15:54
clarkbthanks!15:54
fungidone now for every list on all 7 list sites across both servers15:56
fungi#status log Moved all mailing list sites entirely to HTTPS15:57
opendevstatusfungi: finished logging15:57
clarkbwoot!15:57
fungii'll keep an eye out for any problems with the lists i moderate, and check some new posts to make sure headers have updated to reflect the updated urls15:58
*** marios is now known as marios|out16:03
*** dviroel|lunch is now known as dviroel16:22
corvusi'm going to start a rolling restart of zuul to pick up clarkb python image changes.  we'll be running under py 3.1016:36
clarkbIn theory the nodepool systems have already automatically udpated to python3.1016:38
clarkbAnd this is zuul catching up16:38
corvusthat was a later merge -- did it happen before the last restart?16:38
clarkbcorvus: the nodepool iamges auto update when the image updates now16:39
clarkbcorvus: since we can pretty safely restart them whenever without interrupting jobs16:39
corvusoh i thought it was daily?16:39
clarkbcorvus: the job runs hourly16:39
clarkbso ya I guess if it hasn't run the hourly yet then it may need to wait for a little bit, but not long16:40
corvusi think it was ~2 hours ago16:40
clarkbnl01's launcher was restarted about an hour ago16:40
clarkband that image is 3 hours old16:40
corvusrolling restart of mergers/executors in process16:42
clarkband an ubuntu xenial image is building on nb01 which was also restarted.16:42
clarkb2022-07-05 16:41:31,101 ERROR nodepool.zk.ZooKeeper: Error loading json data from image build /nodepool/images/fedora-35/builds/0000007464 there is some unhappy json in the db but nodepool seemed to skip over it successfully when doing an image listing16:42
clarkbI'll try to look into that later today once I'm through meetings and other things16:42
clarkbThe xenial build failed because the chroot'd install of os-testr hit 'distutils.errors.DistutilsError: Could not find suitable distribution for Requirement.parse('pbr>5.7.0')' I don't think that is related to our update of python16:47
clarkbsince that is nested away from the new version of python16:47
clarkbdib itself seems to be running well and doing what I expect16:47
fungicurrent versions of pbr should also work with xenial's python anyway16:58
clarkbya I think this is possibly the lack of sni ssl support talking to pypi16:59
fungioh, yeah that makes sense17:00
fungiand pbr is just the first thing it tries to grab17:00
fungivia setup_requires17:01
clarkbya17:01
clarkbcorvus: looks like ze06 may have stopped if you want to start it earlier than the others18:31
corvussure, why not? :)18:34
corvus2022-07-05 18:36:28,495 INFO zuul.ansible_manager:     python version = 3.10.4 (main, Apr 20 2022, 05:30:59) [GCC 10.2.1 20210110]18:36
corvuslooks nominal so far18:38
clarkbexcellent18:39
corvushttps://zuul.opendev.org/t/openstack/build/0e53970785d6428d98a6d54ff615abb1 completed sucessfully on ze0618:40
corvusi'll do the same for ze02 since i'm here18:42
clarkbfungi: I had to hard refresh the service-discuss archives page to get it to show me my agenda email. Not sure if that is related to https change (perhaps different cache settings?)18:51
fungiclarkb: thanks, when you visited the old url you got a blank page or an error or...19:02
clarkbfungi: I got the stale content without my email entry in the list of archived threads19:03
clarkbthen I did a hard refresh instead of a normal refresh and it appeared. This was well after I sent the email. Not sure if that was preexisting behavior with caches or not19:03
fungiahh, that's weird, it shouldn't have served that over http at all19:03
clarkbno it was all https19:03
clarkbhttps://lists.opendev.org/pipermail/service-discuss/2022-July/thread.html that url19:04
opendevreviewIan Wienand proposed opendev/system-config master: [wip] gitea-lb: test ssl connections during testing too  https://review.opendev.org/c/opendev/system-config/+/84877719:05
fungiyeah, if i visit that url with a fresh browser i get the current content over https. i don't have a stale browser to test with though19:06
clarkbya this may require checking opportunistically when new emails arrive. My hunch is this was a thing before too but calling it out just in case caching for https works differently enough to make it happen19:08
opendevreviewIan Wienand proposed opendev/system-config master: letsencrypt: selfsigned testing certs - use common CA, setup SAN  https://review.opendev.org/c/opendev/system-config/+/84856219:28
opendevreviewIan Wienand proposed opendev/system-config master: [wip] gitea-lb: test ssl connections during testing too  https://review.opendev.org/c/opendev/system-config/+/84877719:28
opendevreviewIan Wienand proposed opendev/system-config master: [wip] paste : move testing host to paste99, remove https hacks  https://review.opendev.org/c/opendev/system-config/+/84868519:55
ianw^ this is the overall gist; targeting the testing host as if it were production -- but also being careful to not actually confuse production with testing (e.g. using suffixes like "99")19:58
fungii got 99 servers, but that ain't one20:03
ianwi also wonder if we could do a better job with the LE "i've renewed your cert" handlers now.  they were in that big file because that was the only way i could convince ansible to find them, but i'm pretty sure that was a bug.  it might be fixed now20:27
ianws/were/are/20:27
corvusze01-06 all upgraded; 07-12 stopping now20:28
ianwinterestingly i was trying to see that the LB is passing data during system-config testing.  here's a sample of what it is doing -> https://paste.opendev.org/show/bBUMrJhSPOlHS1DRGhQv/ ... getting hit by various scanning bots20:51
ianwit's only up for 12 minutes or so20:51
fungithe internet is for self-propagating worms20:53
opendevreviewIan Wienand proposed opendev/system-config master: gitea-lb: test ssl connections during testing too  https://review.opendev.org/c/opendev/system-config/+/84877720:59
opendevreviewIan Wienand proposed opendev/system-config master: paste : move testing host to paste99, remove https hacks  https://review.opendev.org/c/opendev/system-config/+/84868520:59
opendevreviewIan Wienand proposed opendev/system-config master: [wip] test haproxy to gitea  https://review.opendev.org/c/opendev/system-config/+/84868720:59
fungiinfra-root: heads up that rackspace has notified us of an upcoming block storage maintenance on august 25 impacting afs01.ord and backup01.ord21:46
corvuslove it if they're on the same rust21:47
fungii'll try to swap out those volumes tomorrow21:47
*** dviroel is now known as dviroel|afk21:59
*** dasm is now known as dasm|off22:02
ianwServer balance_git_http/gitea99.opendev.org is DOWN, reason: Layer4 connection problem, info: "No route to host", ...22:09
ianwanyone know if haproxy logs when this comes back?22:10
ianw... oh, now i wonder ... is the /etc/hosts the same in the container as the host?  i wonder if haproxy container can't resolve gitea9922:11
clarkbI think it isn't mounted by default22:11
clarkbdocker will use google dns iirc22:11
ianwhttps://docs.docker.com/config/containers/container-networking/#dns-services22:13
ianw"Custom hosts defined in /etc/hosts are not inherited. To pass additional hosts into your container, refer to add entries to container hosts file in the docker run reference documentation."22:13
opendevreviewJames E. Blair proposed opendev/system-config master: WIP: Build a nodepool image  https://review.opendev.org/c/opendev/system-config/+/84879222:14
opendevreviewJames E. Blair proposed opendev/system-config master: WIP: Build a nodepool image  https://review.opendev.org/c/opendev/system-config/+/84879222:16
ianwhrm we explicitly set 22:34
ianwserver gitea99.opendev.org 198.72.124.63:3080 check22:34
clarkbianw: I think in production we also use the ip addresses to specify the backends22:35
clarkbthat way we don't rely on dns22:35
clarkbcorvus: looks like all executors and mergers are restarted on the new python3.10 zuul image22:36
clarkbjust scheduler services remaining22:36
ianwyeah, i'm unconvinced that we pass data through the lb in the system-config tests22:36
ianwhttps://review.opendev.org/845316 splits the logs out, making it easier to see22:37
ianwhttps://zuul.opendev.org/t/openstack/build/f89e7a94e2674e36ad7208d78873cc0e/log/gitea-lb01.opendev.org/haproxy.log ... all comes back NOSRV22:38
clarkbI feel like this did work at one time because we used the tests to verify the tls checking? But that did have problems too?22:38
ianwistm that it is testing to see if it can see gitea99, but it can't.  so it's half-testing the checking (checking that it's checking, but not checking that the check checked)22:43
ianwmaybe?  or the services aren't responding while things are loading or something22:44
ianwi think i'll have to hold a node to diagnose it.  anyway, it would be good to get to a point where the haproxy log is split out and clearly shows traffic through it, that is the ultimate goal22:44
corvusrestarting zuul0122:45
clarkbcorvus: any idea how long that restart takes now? iirc there were a few improvements made23:04
corvusit's done; i'll check23:06
corvus2022-07-05 22:44:55,071 DEBUG zuul.Scheduler: Configured logging: 6.1.1.dev3223:07
corvus2022-07-05 22:47:46,203 INFO zuul.Scheduler: Config priming complete (duration: 164.639 seconds)23:07
clarkbwow, under 3 minutes? And it took about 20 before? amazing improvement23:07
corvusyeah, it's a wee bit better :)23:08
corvusrestarting zuul02 now23:08
corvus#status log restarted all of zuul on 78b14ec3c196e7533ac2c72d95fba09c936e625a23:09
opendevstatuscorvus: finished logging23:10
ianwok, if i "curl https://gitea99.opendev.org:3081" from the load balancer it works, so the connection is alive23:11
ianw(under test)23:11
ianwhang on, it's looking for port 3080 ...23:16
ianwright ... after fixing the ports "curl --resolve opendev.org:127.0.0.1 https://opendev.org" works with the new bits (because gitea99.opendev.org has a SAN for opendev.org)23:23
*** dviroel|afk is now known as dviroel23:27
fungithat makes a bit more sense, yep23:29
opendevreviewIan Wienand proposed opendev/system-config master: gitea: fix loadbalancer forwarding in testing  https://review.opendev.org/c/opendev/system-config/+/84879323:34
ianw^ i haven't put that ontop of any of the testing CA/ssl changes23:37
opendevreviewSteve Baker proposed openstack/diskimage-builder master: Parse block device lvm lvs size attributes  https://review.opendev.org/c/openstack/diskimage-builder/+/83982923:41
opendevreviewSteve Baker proposed openstack/diskimage-builder master: Do dmsetup remove device in rollback  https://review.opendev.org/c/openstack/diskimage-builder/+/84786023:41
opendevreviewSteve Baker proposed openstack/diskimage-builder master: Support LVM thin provisioning  https://review.opendev.org/c/openstack/diskimage-builder/+/84014423:41
opendevreviewSteve Baker proposed openstack/diskimage-builder master: Add thin provisioning support to growvols  https://review.opendev.org/c/openstack/diskimage-builder/+/84868823:41
*** dviroel is now known as dviroel|out23:50
corvusmemory usage on zuul01/zuul02 looks good.  i think it's worth keeping an eye on that over the next week or so because of the python version change.23:59
clarkb++23:59

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!