Tuesday, 2020-06-23

*** DSpider has quit IRC00:20
openstackgerritIan Wienand proposed opendev/system-config master: Add a grafana/grafyaml image  https://review.opendev.org/73739700:29
openstackgerritIan Wienand proposed opendev/system-config master: Add a grafana/grafyaml image  https://review.opendev.org/73739700:30
openstackgerritIan Wienand proposed opendev/system-config master: Add a grafana/grafyaml image  https://review.opendev.org/73739700:33
*** xiaolin has joined #opendev00:43
*** sgw1 has quit IRC01:35
openstackgerritIan Wienand proposed opendev/system-config master: [wip] grafana deployment  https://review.opendev.org/73740601:42
*** sgw1 has joined #opendev01:50
*** mrunge_ has joined #opendev02:20
*** mrunge has quit IRC02:21
*** sgw1 has quit IRC02:47
*** shtepanie has quit IRC02:53
openstackgerritIan Wienand proposed opendev/system-config master: [wip] grafana deployment  https://review.opendev.org/73740603:02
openstackgerritIan Wienand proposed opendev/system-config master: [wip] grafana deployment  https://review.opendev.org/73740603:05
openstackgerritIan Wienand proposed opendev/system-config master: [wip] grafana deployment  https://review.opendev.org/73740603:11
*** sgw1 has joined #opendev03:28
openstackgerritIan Wienand proposed opendev/system-config master: [wip] grafana deployment  https://review.opendev.org/73740603:34
*** meiyanzheng has joined #opendev03:44
*** diablo_rojo has quit IRC03:58
openstackgerritIan Wienand proposed opendev/system-config master: [wip] grafana deployment  https://review.opendev.org/73740604:03
*** sgw1 has quit IRC04:06
*** ykarel|away is now known as ykarel04:10
*** sgw1 has joined #opendev04:18
openstackgerritIan Wienand proposed opendev/system-config master: [wip] grafana deployment  https://review.opendev.org/73740604:23
*** rchurch has quit IRC04:23
*** rchurch has joined #opendev04:24
auristorianw: with regards to the unexplained afs01.dfw traffic, what the graphs do not separate is traffic by service.   Therefore it is not known if the outbound traffic is fileserver to cache manager or volserver to volserver04:42
auristorbut if you compare the afs02.dfw graph for the same time period http://cacti.openstack.org/cacti/graph.php?action=zoom&local_graph_id=6405&rra_id=5&view_type=&graph_start=1592857482&graph_end=1592858713&graph_height=120&graph_width=500&title_font_size=1204:43
auristorthere is only 5m of traffic at the end of the time period.   that is most likely the volserver traffic from afs01.dfw to afs02.dfw.   The rest is most likely fileserver to cache manager traffic.  Either the rsync client fetching status and data or cache managers that read from /afs fetching new data after receiving volume callbacks.04:46
ianwauristor: probably what we should do is combine all the exit codes of the rsyncs for each distro, and only do a vos release if it reports changes were made05:00
ianwgoogling that, it appears the only way to tell if rsync did something of interest is log scraping of various types, so possible, but not as neat as just exit codes05:02
ianwstill, an item for the todo list05:03
*** ysandeep|PTO is now known as ysandeep05:18
openstackgerritFelix Edel proposed zuul/zuul-jobs master: [DNM] Test upload return values  https://review.opendev.org/73744105:51
openstackgerritIan Wienand proposed opendev/system-config master: [wip] grafana deployment  https://review.opendev.org/73740606:19
openstackgerritFelix Edel proposed zuul/zuul-jobs master: [DNM] Test upload return values  https://review.opendev.org/73744106:19
*** rpittau|afk is now known as rpittau06:34
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: Test multiarch release builds  https://review.opendev.org/73731506:58
openstackgerritFelix Edel proposed zuul/zuul-jobs master: Return upload_results in upload-logs-swift role  https://review.opendev.org/73356407:02
openstackgerritIan Wienand proposed opendev/system-config master: [wip] grafana deployment  https://review.opendev.org/73740607:07
AJaegerianw, fungi, clarkb, could you review https://review.opendev.org/#/c/735283 together with this etherpad, please? https://etherpad.opendev.org/p/-CBx0IaMT37oFBHdt8iV07:15
AJaegerPlease update the etherpad with your thoughts and let's see how to make this more consistent07:16
AJaegerinfra-root, codesearch still finds x/whitebox-tempest-plugin but that was renamed to openstack/whitebox-tempest-plugin07:23
*** calcmandan has quit IRC07:26
*** hashar has joined #opendev07:26
*** calcmandan has joined #opendev07:29
*** tosky has joined #opendev07:35
*** aannuusshhkkaa has quit IRC07:51
*** moppy has quit IRC08:01
*** moppy has joined #opendev08:01
openstackgerritJavier Peña proposed opendev/system-config master: Support CentOS for AFS mirror  https://review.opendev.org/73699608:06
*** hashar has quit IRC08:16
*** hashar_ has joined #opendev08:16
openstackgerritIan Wienand proposed opendev/system-config master: [wip] grafana deployment  https://review.opendev.org/73740608:20
*** ykarel is now known as ykarel|lunch08:23
*** hashar_ is now known as hashar08:25
*** ysandeep is now known as ysandeep|lunch08:37
openstackgerritVishal Manchanda proposed openstack/project-config master: Upadting horizon nodejs job name  https://review.opendev.org/73745708:37
*** ysandeep|lunch is now known as ysandeep08:50
openstackgerritVishal Manchanda proposed openstack/project-config master: Upadting horizon nodejs job name  https://review.opendev.org/73745708:52
openstackgerritRiccardo Pittau proposed openstack/diskimage-builder master: Convert multi line if statement to case  https://review.opendev.org/73447908:54
*** priteau has joined #opendev08:57
*** SotK has quit IRC09:08
*** owalsh has quit IRC09:08
*** tobiash has quit IRC09:08
*** owalsh_ has joined #opendev09:08
*** SotK has joined #opendev09:08
*** sshnaidm|afk is now known as sshnaidm|ruck09:11
*** tobiash has joined #opendev09:12
*** hashar is now known as hasharAway09:31
openstackgerritIan Wienand proposed opendev/system-config master: [wip] grafana deployment  https://review.opendev.org/73740609:32
*** ryohayakawa has quit IRC10:06
*** meiyanzheng has quit IRC10:07
*** rpittau is now known as rpittau|bbl10:12
*** hasharAway is now known as hasharLunch10:16
*** tkajinam has quit IRC10:17
*** jhesketh has quit IRC10:36
*** jhesketh has joined #opendev10:37
*** mrunge_ is now known as mrunge10:48
*** ykarel|lunch is now known as ykarel10:58
fricklermnaser: you have very strange rDNS in place for mirror01.ca-ymq-1.vexxhost.opendev.org: 149.45.204.199.in-addr.arpa domain name pointer abla-4.albalisaude2.com.br.11:11
*** DSpider has joined #opendev11:45
*** hasharLunch is now known as hashar11:51
*** lpetrut has joined #opendev11:51
*** ysandeep is now known as ysandeep|afk12:00
*** rpittau|bbl is now known as rpittau12:14
smcginnisWhat would be the best ensure-* role to use to make sure setuptools is present for calling "python setup.py --name"?12:19
AJaegersmcginnis: ensure-pip includes setuptools12:21
smcginnisAJaeger: Any idea why it would fail with that role? We have this pre playbook that runs:12:23
smcginnispip12:23
smcginnishttps://pypi.org/project/cinder/12:23
smcginnisBut got this failure: https://060ba5bdde00663d768c-19780c33aa00a3c0d825d79cd8c225b0.ssl.cf2.rackcdn.com/8ea79a3dba07789f8ab08066faa90cbfcc8a7837/release/propose-update-constraints/51a2e0c/job-output.txt12:23
smcginnisSearch for "2020-06-19 14:40:03.562345"12:24
smcginnisHmm, 2020-06-19 14:39:09.485384 | TASK [ensure-pip : Ensure setuptools]12:25
smcginnisOh, skipping: Conditional result was False12:26
smcginnisThat's only on Suse.12:26
*** ysandeep|afk is now known as ysandeep12:28
AJaegersmcginnis: that's from 19th of June - shouldn't this be fixed by now?12:29
AJaegersmcginnis: https://zuul.opendev.org/t/openstack/builds?job_name=propose-update-constraints# all looks green12:29
AJaegersmcginnis: So, let's ignore this specific failure - ok?12:30
smcginnisAJaeger: Oh, I think you are right. It was probably shortly after then that it was fixed. I'm just looking at too old of logs.12:30
AJaegerhope so ;)12:30
smcginnisYeah, I was wondering why I had seen some successful runs. That would make sense.12:30
smcginnisWe have a nightly job that runs that has failed, but I think that uses a different playbook that probably also needs an update.12:31
smcginnisWill have to track that down though.12:31
smcginnisThanks for looking AJaeger.12:31
AJaegerok, let's fix the nightly one ;)12:32
AJaegersmcginnis: failure on cinder?12:32
AJaegersmcginnis: just checked, didn't find one...12:35
smcginnisAJaeger: It's on openstack/requirements. Looks like propose-updates and release-wheel-cache:12:35
smcginnishttps://zuul.opendev.org/t/openstack/builds?pipeline=periodic&branch=master&project=openstack%2Frequirements12:35
smcginnisrelease-wheel-cache is: The task includes an option with an undefined variable. The error was: 'afs_volume' is undefined12:36
smcginnisBut propose-updates looks like it's missing virtualenv.12:36
AJaegerindeed12:36
smcginnisWhich is a little odd, since I see it being installed prior to the failure.12:37
smcginnisMaybe just needs the bit to make it global?12:37
AJaegerit worked before - infra-root, did we update something in the last 24h to break release-wheel-cache?12:38
AJaegerSee https://zuul.opendev.org/t/openstack/build/229fc8ce6db3464cbb9f50d0e3ed43a112:38
AJaegersmcginnis: I don't see anything obvious - that's why I asked for help ^12:39
smcginnisThat one does look like it could be a side effect of another change.12:40
AJaegersmcginnis: virtualenv is called from the script - and thus needs to be available globally12:40
AJaegeris the shell script invoked without the venv in PATH?12:41
smcginnisThat would be my guess.12:42
AJaegerwe have the ensure_global flag only for tox, not for virtualenv ;(12:43
AJaegermordred: any idea what do do here? ^12:43
smcginnisI wonder if it would work to just update this part to call "tox -e venv"? https://opendev.org/openstack/project-config/src/branch/master/playbooks/proposal/propose_update.sh#L35-L3812:44
AJaegermmh, might - sorry, have to step out for a bit now. Hope others can help further12:47
smcginnisThanks12:47
fricklerAJaeger: smcginnis: this is a role invocation without the var added, I'll push a fix https://opendev.org/openstack/project-config/src/branch/master/playbooks/wheel/release.yaml#L1312:48
smcginnisThanks frickler.12:48
openstackgerritJens Harbott (frickler) proposed openstack/project-config master: Fix wheel release playbook  https://review.opendev.org/73752512:50
fricklerinfra-root: ^^ the lastest commit was https://opendev.org/openstack/project-config/commit/92b378cc9e2ac95cf24520d1dc73986060c7ecfb which didn't touch this, maybe this is also ansible 2.9 fallout?12:51
smcginnisLooks like that was an error in there for a couple months looking at the git blame.12:52
smcginnisMaybe we just didn't notice that one until now.12:52
openstackgerritSean McGinnis proposed openstack/project-config master: Fix venv use in requirements propose_update.sh  https://review.opendev.org/73752612:54
smcginnisWould be good to know if that should work ^12:54
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: ensure-pip debian: update package lists  https://review.opendev.org/73752912:57
*** Dmitrii-Sh has quit IRC13:04
*** Dmitrii-Sh has joined #opendev13:05
openstackgerritThierry Carrez proposed zuul/zuul-jobs master: upload-git-mirror: check after mirror operation  https://review.opendev.org/73753313:05
mordredsmcginnis: that might work - we might also need to add an ensure_global flag to ensure-virtualenv too13:17
fungiproblem with that is that ensure-venv may install the venv module and set a variable saying to run `python3 -m venv`13:19
mnaserhave we by any chance made any recent changes to base jobs?13:19
mnaserit doesn't seem that way but i just had a no-log POST_FAILURE13:19
fungimordred: so just a simple symlink won't work for that case , might need a wrapper script?13:19
mnaserover here: https://review.opendev.org/#/c/733024/513:20
fungibrb13:20
mordredfungi: ugh13:20
mnaseroh there maybe something else transpiring here13:20
mnaseri'm seeing a bunch of other POST_FAILUREs13:20
mnaserhttps://zuul.opendev.org/t/openstack/builds13:20
mnaseran object storage provider having problems maybe?  i can't help troubleshoot more because we'll need some zuul executors logs to help uncover this i think13:21
mordredfungi: no, I donm't think that's right ... the role is ensure-virtualenv13:21
mordredmnaser: looking13:21
mnasermordred: feel free to paste a log and i can go digging after too, seems like you're dealing with another fire :)13:22
fricklermnaser: mordred: seems some other jobs are affected, too, likely an issue with one s3 provider13:22
fricklerlike https://zuul.opendev.org/t/openstack/build/1be32e52753c401caaa853179dd1982613:22
mordredmnaser: http://paste.openstack.org/show/795098/13:25
mordredkeystone v2 not available anymore :)13:25
mnaserah bon13:26
mnaseri think i can fix that13:26
mordredmnaser: I think we just need to supply default as domain name: http://travaux.ovh.net/?do=details&id=4217913:28
mnasermordred: identity_api_version: 3 too or that's optional?13:28
mordredmnaser: optional13:28
mordredwe should detect based on presence of domain parameters13:29
mordredhah. this is already fixed in openstacksdk ... I guess we have an old version there?13:30
mordredhow could we have an old version - these are running in the containers13:30
* mordred goes to look13:30
openstackgerritMohammed Naser proposed opendev/base-jobs master: ovh: start using keystone v3  https://review.opendev.org/73754013:31
mnasermordred, infra-core: ^ see above to avoid a flood of post-failures incoming13:31
mordredoh - no it's not13:31
mordredmnaser: can you do gra too?13:32
mnasermordred: https://github.com/openstack/openstacksdk/blob/master/openstack/config/vendors/ovh.json wat ?13:32
mordredoh - nevermind13:32
mnaser:)13:32
mnasermordred: i think this is because this is the openstacksdk version inside zuul's ansible13:32
mnasermordred: and we should probably double check nodepool's config also has the right info too and is using v3 too, dont think i can look at that13:33
mnaseruhh, the profile has said identity_api_version 3 since dec 201913:34
mordredyeah. the issue here is missing domain attributes - identity_api_version doesn't actually control this13:34
mnaserah13:34
mordredit's controlled by auth_type13:34
mordredbut auth_type of password does inferrance based on parameters13:34
*** sgw1 has quit IRC13:34
mordredBU?T13:34
mordredthe ovh profile has user_domain_name: Default and project_domain_name: Default13:35
mordredso I'm guessing that yeah, this is an issue with the ansible venv installs13:35
mordredall the more reason to get these bad-boys on to docker13:35
*** sgw1 has joined #opendev13:35
mordredmnaser: thanks for the patch!13:36
mnaseroh yeah we have to check the venv there13:36
mordredcorvus: I think we should maybe finish rolling out zuul-executor docker13:36
mnasermordred: do you feel like pip freeze there just to check if our theory is right?  im just curious.13:37
mordredmnaser: sure. one sec13:37
mordred>>> openstack.version.__version__13:43
mordred'0.41.0'13:43
mordredmnaser: ^^ yup13:44
mordredalso - I think we have a bug in zuul-manage-ansible13:44
fungimnaser: catching up, but today is probably the long-announced date that ovh was dropping v213:49
mordredyup13:51
mordredfungi: and - sdk has support for it - so all of the work was actually done to make it seamless...13:51
mordredfungi: EXCEPT13:51
mordredour virtualenvs have a stale version of sdk13:51
fungiwe don't continuously upgrade our ansible venvs on the executors, yep13:51
*** sshnaidm|ruck is now known as sshnaidm|afk13:51
fungistale versions of everything last i checked13:52
* mordred would like us to finish moving zuul-executors to docker13:52
fungiwhen zuul adds a new ansible version, manage-ansible creates the new venv for it, but we never upgrade ansible or anything else in it (or didn't last time i looked, though it's been a couple months so maybe that's gotten better)13:52
mnaseri dont know if zuul/nodepool can do this but it would be an interesting exercise of being able to push zuul-executors out to the clouds themselves13:53
mnaseri wonder how much of a performance impact that would have given the reduced RTT for running ansible13:53
mnaserone could theoretically profile something like this13:53
mnaserhmm.13:54
mordredmnaser: we have support for cloud-region tied zuul-executors13:54
mnasermordred, AJaeger, fungi: https://review.opendev.org/#/c/737540/ has failed in POST_FAILURE13:54
mordredsigh13:55
mnaseri guess its because base-jobs is a config-project and its not speculatively testing the secret, right?13:55
mordredyah13:55
mordredinfra-root: it might be worth force-merging mnaser's patch13:55
mnaseryeah so someone probably should force-merge that.. hoping i didn't do something bad :)13:55
mordredsince it's a config-project fix for a gate failure that is itself hitting the gate failure13:56
mordredinfra-root: I'm ready to force-merge unless someone thinks we shouldn't13:56
corvusmordred: +113:57
fungiplease do, i'm just getting settled back in13:57
fungibut i agree it's a fine stopgap13:57
openstackgerritMerged opendev/base-jobs master: ovh: start using keystone v3  https://review.opendev.org/73754013:57
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: ensure-pip debian: update package lists  https://review.opendev.org/73752913:58
*** DSpider has quit IRC13:58
mordredinfra-root: if folks could review https://review.opendev.org/#/c/733967/ - I can put the executors into the emergency file and do a one-at-a-time rollout14:00
*** roman_g has joined #opendev14:01
corvusmordred: lgtm14:01
mordredcorvus: think maybe I should go ahead and put them in emergency?14:04
* mnaser wonders if we should wait to double check ovh is back ok again14:04
mnaser(btw maybe we should throw a notice too)14:05
mordredmnaser: bah. let people be confused14:07
mordredit adds excitement14:07
mnaserfun ride ahead :)14:10
corvusmordred: yeah, i emergency now sounds reasonable14:11
mnaseroh yeah there's a lot of post_failure's14:12
mnaserim trying to find a successful ovh job to make sure what we did works14:12
mnaserwe merged at 9:57 am et, so ~16 minutes ago, trying to find a job that completed since14:14
mnaserhttps://storage.bhs.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_6f0/737497/2/check/puppet-openstack-syntax-5-ubuntu-bionic/6f0554c/14:14
mnaserthat was uploaded 6 minutes ago14:15
mordredmnaser: woot!14:15
mnaserso i think we're good14:15
mnaserkindof a bummer about all the orange in the status, but we can't really know to rerun those jobs, they'll just have to fail14:15
openstackgerritMonty Taylor proposed opendev/base-jobs master: Revert "ovh: start using keystone v3"  https://review.opendev.org/73755014:16
mnaseri think we _could_ perhaps technically add a retry on the upload job using another cloud to make it more resiliant14:16
mordredmnaser: that's potentially not a terrible idea14:16
mordredmnaser: I pushed up a revert but -2'd it - once we've rolled out the executor update we can land it14:17
mnaserthe trickiest bit i guess is just the 'figure out where you failed to upload and try something 8except* that'14:17
mnaseri wonder if also we could make the upload-logs failing not fail the job if the job passed14:19
*** DSpider has joined #opendev14:20
mnasercould be really useful for promote/post/release failures... and just in general, if it passed and logs failed, we're probably ok14:20
mordredhrm. that's an interesting thought14:20
mnaserlike maybe we can report a zuul warning saying we failed to push logs14:20
mnaserin the comment14:20
mnaserbut no need for us to fail the whole thing14:20
mordredyeah. bceause most of the time you only care about logs when a job failed anyway.14:21
mnaseryeah14:21
mordredcorvus: ^^ what do you think about that?14:21
mordredmnaser: or - perhaps have that behavior be dependent on type of pipeline or something. like - make it be a failure in check - but in release/post - make it not14:22
mordredespecially if you consider a release job that does some other thing (like, actually does the release) - that may have dependent jobs that should be triggered - but the log failure would cause the dependent jobs to not trigger even though a released artifact would have been published/pushed14:23
openstackgerritMerged openstack/project-config master: Fix wheel release playbook  https://review.opendev.org/73752514:24
corvusi worry a little bit about not noticing a failure, so making it pipeline-contingent may be a good compromise there.  but honestly, do we really want jobs to finish, or follow-on jobs to run without logs?14:24
corvusi mean, a build without a log is *really really bad*14:24
corvusespecially in release14:25
mordredcorvus: maybe mnaser's first thought - re-try log upload on a different target - would be a better thing to work on first?14:25
corvusyeah, i think that may be better14:25
* mnaser thinks this would be nice to integrate to the zuul role itself14:26
mordredI'm torn between which is worse - release job without logs - but that already did its release actions14:26
mnasermordred: well, the note here is it's not a release job without logs14:26
mnaserits a release job without public facing logs, the executor still technically has a copy14:26
mnaserits "not ideal and easy to access" but it's there14:26
corvusno, they're deleted14:27
mordredyeah14:27
corvusi mean, there's some ansible logs14:27
mordredbut they won't have the logs of the release14:27
mnaserok right, so we log the error but not the progress of the whole thing14:27
corvusso yeah, we might be able to figure out if something uploaded, but no guarantees14:27
mnaserwell unless we can grab the status of the 'run' phase and then you have a guarantee that it passed or didnt14:28
corvus(it all depends on how much of that info makes it to the executor debug log)14:28
mordredcorvus: how horrible of an idea would it be to add a $something to let us tell the executor to not clean a build dir14:28
corvusmordred: horrible :)14:28
mnaserif run passed and upload-logs failed, you know the run phase _should_ be complete14:28
corvusmordred: we'd never clean them up14:28
mordredcorvus: yeah. was mostly thinking about how to not delete logs that didn't actually get uploaded14:29
corvusi think we should first exhaust solutions that make the system work more reliably :)14:29
mordred++14:29
corvusso retry-other-provider has my vote for focus14:29
fungialso keep in mind that retrying the upload to another endpoint wouldn't necessarily have helped here, as all the ovh endpoints stopped being writeable for us ~simultaneously, so the job could easily have tried another which also failed14:30
mnaserah, upload-logs-swift takes one cloud only, not the whole list14:30
mnaserif only we wrote it to take a list, it would have been easier to make it better for everyone without affecting too much14:30
corvuswell, we can update the opendev usage, and if we get that solid, we can look at upgrading the role14:34
mnasercorvus: well, i was thinking of updating the role to accept a list or string (which it converts to a list) which shouldn't change user-facing behaviour first14:35
mnaserand then adding the 'fallback' stuff as stage 2, cause really no one should have been feeding that thing a list in the first place14:35
corvusmnaser: hrm, actually, i wonder if it would be better outside of the role?  if you put it in the swift role, then a user can only retry to other swifts.  but someone might want to retry swift/gce/aws14:38
*** mlavalle has joined #opendev14:39
*** sshnaidm|afk is now known as sshnaidm|ruck14:39
mnasercorvus: i like the idea of making it in a way so that all of our (zuul) users can get the increased benefit of 'reliability'.  in my experience base-job improvements will generally not be picked up by most users later14:39
*** ysandeep is now known as ysandeep|afk14:39
mnasermaybe eventually we could have a upload-object-store that takes a specific type of stringset which can do all sort of object storages14:39
corvusmnaser: i don't disagree with you14:40
corvusand yes, i'm suggesting that a meta-role may be the most appropriate here14:40
corvusmnaser: note that if we change the swift role, we will also need to change the gce role, because we're maintaining parity between them.14:41
mnasercorvus: yeah that seems fair, i'll probably have some questions (i guess the weird thing here is.. do you upload to another 'gce region'?) -- i'll have some questions14:42
corvusmnaser: i would suggest starting with the meta-role idea; it seems to me that would be the easiest as well as most robust and future-proof approach14:44
clarkbhrm I thought we switched to v3 across the board for ovh months ago. But I guess that was only in the control plane side of things14:45
clarkbas far as making this more robust pabelanger suggseted similar in the past but the way ansible works makes it difficult14:45
clarkbbecause we want to do successive random choices14:45
clarkbexcluding previous options14:45
*** ysandeep|afk is now known as ysandeep14:46
corvusyeah, it may need to be a custom module?14:46
clarkbyes I think so14:46
*** priteau has quit IRC14:52
*** owalsh_ is now known as owalsh15:07
*** _mlavalle_1 has joined #opendev15:23
*** mlavalle has quit IRC15:25
*** ykarel is now known as ykarel|afk15:33
*** ysandeep is now known as ysandeep|away15:36
*** lpetrut has quit IRC15:44
openstackgerritSean McGinnis proposed openstack/project-config master: Retire dragonflow project  https://review.opendev.org/73756615:49
*** ykarel|afk is now known as ykarel15:55
*** rpittau is now known as rpittau|afk16:00
*** diablo_rojo has joined #opendev16:02
openstackgerritMerged openstack/project-config master: Fix venv use in requirements propose_update.sh  https://review.opendev.org/73752616:04
*** sgw1 has quit IRC16:13
*** sgw1 has joined #opendev16:19
AJaegerinfra-root, we retired puppet-congress but there's still an open review - can you abandon that? https://review.opendev.org/#/q/project:openstack/puppet-congress+is:open Or leave as is?16:28
clarkbAJaeger: I have an abandon button16:28
clarkbAJaeger: do you hvae a link to the retirement change? I can put that in the abandon message16:29
AJaegerclarkb: http://lists.openstack.org/pipermail/openstack-discuss/2020-June/015555.html16:29
AJaegerclarkb: thanks16:29
AJaegerBut that means gerrit did not make the repo readonly?16:30
*** ykarel is now known as ykarel|away16:30
AJaegeroh, it's readonly, see it...16:31
clarkbya its read only now. maybe there was a race between switching to RO and the change being pushed16:31
AJaegerthat change was overlooked when retiring ;( it was a year old16:31
AJaegerthanks, clarkb16:31
openstackgerritSean McGinnis proposed openstack/project-config master: Remove dragonflow from infra  https://review.opendev.org/73757816:32
clarkboh thats a 2019 not 2020216:32
AJaegeryep16:32
openstackgerritSean McGinnis proposed openstack/project-config master: Remove dragonflow from infra  https://review.opendev.org/73757816:37
openstackgerritMerged openstack/project-config master: Stop translation stable branches on projects without Dashboard  https://review.opendev.org/72321716:49
toskyI lost tracko of the status: can we recheck the jobs which failed with POST_FAILURE?16:55
clarkbtosky: yes16:56
*** sgw1 has quit IRC16:56
clarkbtosky: any job which started in the last 3 hours should have the fix16:56
clarkb(roughly)16:56
toskyclarkb: thanks!16:56
fungitosky: in short, one of the swift providers to whom we upload build logs dropped keystone v2 api support today, and we were using too old openstacksdk in our ansible venvs on the zuul executors to do proper version discovery16:57
*** sgw1 has joined #opendev17:05
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: Test multiarch release builds  https://review.opendev.org/73731517:14
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: Test multiarch release builds  https://review.opendev.org/73731517:15
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: Test multiarch release builds  https://review.opendev.org/73731517:17
*** sshnaidm|ruck is now known as sshnaidm|afk17:24
*** factor has quit IRC17:25
*** hashar has quit IRC17:25
*** hashar has joined #opendev17:25
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: Test multiarch release builds  https://review.opendev.org/73731517:33
*** sgw1 has quit IRC17:34
*** sgw1 has joined #opendev17:50
*** xiaolin has quit IRC17:59
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: Test multiarch release builds  https://review.opendev.org/73731518:17
*** roman_g has quit IRC18:21
*** hashar is now known as hasharAway18:23
AJaegerianw, fungi, clarkb, could you review https://review.opendev.org/#/c/735283 together with this etherpad, please? https://etherpad.opendev.org/p/-CBx0IaMT37oFBHdt8iV - that for the python-jobs. Please update etherpad18:28
openstackgerritRafael Folco proposed openstack/diskimage-builder master: DNM: Debug py3 on dib 7  https://review.opendev.org/73642118:40
openstackgerritMerged zuul/zuul-jobs master: Simplify twine invocation for PyPI uploads  https://review.opendev.org/73593219:01
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: Test multiarch release builds  https://review.opendev.org/73731519:02
*** aannuusshhkkaa has joined #opendev19:08
*** sgw1 has quit IRC19:21
*** sgw1 has joined #opendev19:22
openstackgerritIan Wienand proposed opendev/system-config master: Add a grafana/grafyaml image  https://review.opendev.org/73739719:26
openstackgerritIan Wienand proposed opendev/system-config master: [wip] grafana deployment  https://review.opendev.org/73740619:26
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: Test multiarch release builds  https://review.opendev.org/73731519:31
clarkbfungi: re stale apache worker we could port the worker limits config from mirrors to files?19:49
clarkber I guess its static.opendev.org now not files19:49
corvusi wondered about the certcheck email; i guess it's safe to ignore?19:49
fungias mentioned in the meeting just now, running this returns inconsistent results: `echo | openssl s_client -connect zuul-ci.org:https -servername zuul-ci.org 2> /dev/null | openssl x509 -text | grep -i after`19:50
clarkbcorvus: if the stale worker eventually dies and goes away its fine, the cert has been updated. If it persists we can restart apache forcefully19:50
clarkbcorvus: on the mirrors we landed mpm config to give apache workers a request count limit19:50
clarkband the static server apache would hit those limits pretty quickly I think19:50
fungiyeah, basically we have a month for that worker to recycle before it will start causing intermittent cert failures for people19:50
clarkbthe mirrors would hit this semi frequently and since we landed that config for the mirrors it hasn't happened again (but it also hasn't been super long so may not have refreshed certs yet)19:51
openstackgerritIan Wienand proposed opendev/system-config master: [wip] grafana deployment  https://review.opendev.org/73740619:51
fungimost requests i'm getting a 2020-09-20 expiration, but occasionally i'll get back an older cert expiring 2020-07-2219:52
corvusyeah, i think we should limit the static worker life19:53
fungiso yes, manually issuing a service restart would clear this in the short term, but longer term we likely need something like clarkb describes19:53
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: Test multiarch release builds  https://review.opendev.org/73731519:53
fungiclarkb: looks like this was your change to set it for the mirrors: https://review.opendev.org/72787319:58
fungii'll propose somethnig similar for static19:59
clarkbthanks19:59
ianwAJaeger: thanks for pointing out release-wheel-cache ; that actually wasn't running until the last attempt.  this was because xenial arm64 wheel builds were borked until we merged https://review.opendev.org/73505520:03
corvusi have to go grocery shopping; biab.20:03
ianwoh, something i meant to bring up in the meeting was the changes in system-config to add centos support to base20:03
fungicorvus: enjoy your adventure, that's my tomorrow morning20:03
ianwi wasn't sure if people wanted to keep system-config focused on opendev actual production, or how much wiggle room there was20:03
fungii think we've previously asserted that system-config isn't being maintained/supported for reconsumption, it's a public entry point into the maintenance of our running systems and services20:05
fungiso if we're going to run some centos-based services then i could see adding support for it20:05
ianwthat is the thing, i don't think opendev has particular plans for that20:07
funginone that i'm aware of, anyway20:07
fungiwhat prompted the change?20:07
ianwi think it is wanting to setup infra compatible mirrors20:08
fungiwhat's an infra compatible mirror?20:08
ianwwell i mean mirrors that look like the mirrors opendev infra sets up, but not in opendev infra20:08
ianwin terms of paths/proxies, etc20:09
fungiif there's interest in collaborating on standardization of apache configuration and tooling around ci mirror systems, then we should do that outside system-config (it could of course be seeded by code from system-config) and then hopefully eventually consume that somehow20:09
ianwperhaps we should move some of the mirror config bits outside of system-config20:10
ianwheh, jinx20:10
fungibut yeah, not in the system-config repo20:10
openstackgerritJeremy Stanley proposed opendev/system-config master: Limit connections for static site Apache workers  https://review.opendev.org/73761920:15
fungiclarkb: corvus: ^ as discussed20:15
clarkbfungi: one msall but important thing on that20:24
mnaseris there any places where python-builder is used to build *multiple* projects into an image?20:27
mnaseruse case: building image for horizon with multiple plugins20:28
ianwmnaser: hrm, dib does pull in a bunch of siblings ...20:28
*** rchurch has quit IRC20:28
ianwmnaser: specifically this is what i'm talking about -> https://opendev.org/zuul/nodepool/src/branch/master/.zuul.yaml#L21620:30
*** rchurch has joined #opendev20:30
mnaserianw: interesting.. so i guess in my use case, id build a horizon image with all the plugins as siblings20:30
*** sgw1 has quit IRC20:31
ianwmnaser: yeah, i think so.  basically every sibling is copied and then gets installed with https://opendev.org/opendev/system-config/src/branch/master/docker/python-builder/scripts/assemble#L8120:32
mnaserright now our image jobs do a poor job of not actually using the zuul checkout.. sadly.. i'm going to clean that up20:33
ianwright, yeah that was the exact use case ... i wanted to make sure Depends-On: for openstacksdk, etc. worked when testing the nodepool containers20:34
mnaserianw: my only pain point is my dockerfile's are in a different repo than the code itself20:35
mnaserso making docker build locally pretty darn painful20:35
mnaserbecause in the year 2020, dockerfile's inside openstack projects is too much to ask for20:35
fungiclarkb: oh, yep, great catch. we may want to add an actual restart handler to cover that case20:37
ianwyeah, well i mean docker is broken on fedora for god knows how long with cgroups or whatever it is, i don't want to think about it20:37
ianwin 2021 people will reinvent dpkg and talk about how amazing it is we can share libraries :)20:38
fungiin 2021 people will reinvent makefiles yet again. like for the 20th time20:39
openstackgerritJeremy Stanley proposed opendev/system-config master: Limit connections for static site Apache workers  https://review.opendev.org/73761920:41
openstackgerritIan Wienand proposed opendev/system-config master: [wip] grafana deployment  https://review.opendev.org/73740620:43
*** sgw1 has joined #opendev20:47
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: Test multiarch release builds  https://review.opendev.org/73731521:04
*** shtepanie has joined #opendev21:11
clarkbfungi: ianw https://review.opendev.org/#/c/736389/ is an easy system-config cleanup if you have a moment21:20
clarkbrelated to dns cleanups21:20
clarkbianw: thanks!21:22
clarkbmordred: does your change to switch zuul executors to docker images depend on https://review.opendev.org/#/c/735739/ ?21:23
clarkbI've re enqueued that change into the gate so we should have it soon but wanted to call out it isn't merged yet21:23
openstackgerritIan Wienand proposed opendev/system-config master: [wip] grafana deployment  https://review.opendev.org/73740621:24
*** DSpider has quit IRC21:28
openstackgerritMohammed Naser proposed openstack/project-config master: Temporarily unretire incorrectly retired projects  https://review.opendev.org/73763621:29
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: Test multiarch release builds  https://review.opendev.org/73731521:30
openstackgerritMohammed Naser proposed openstack/project-config master: Temporarily unretire incorrectly retired projects  https://review.opendev.org/73763621:31
mordredclarkb: yes it does - and thanks!21:31
mordredclarkb: the executors are in emergency, but definitely let's wait until that lands to start rolling them out21:32
*** DSpider has joined #opendev21:38
*** factor has joined #opendev21:42
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: Test multiarch release builds  https://review.opendev.org/73731521:45
openstackgerritMerged opendev/system-config master: Limit connections for static site Apache workers  https://review.opendev.org/73761921:47
openstackgerritMerged opendev/system-config master: Remove elasticsearch01  https://review.opendev.org/73638921:47
*** hasharAway has quit IRC22:11
openstackgerritIan Wienand proposed opendev/system-config master: Add a grafana/grafyaml image  https://review.opendev.org/73739722:21
openstackgerritIan Wienand proposed opendev/system-config master: [wip] grafana deployment  https://review.opendev.org/73740622:21
openstackgerritMohammed Naser proposed openstack/project-config master: gerrit: change retired.config acls  https://review.opendev.org/73764922:23
openstackgerritMohammed Naser proposed openstack/project-config master: Move retired.config to external namespace  https://review.opendev.org/73765222:35
*** tosky has quit IRC22:42
*** clarkb has quit IRC22:44
openstackgerritMohammed Naser proposed openstack/project-config master: gerrit: change retired.config acls  https://review.opendev.org/73764922:48
openstackgerritMohammed Naser proposed openstack/project-config master: openstack: move all openstack projects to dedicated acl  https://review.opendev.org/73765422:48
*** tkajinam has joined #opendev22:53
*** _mlavalle_1 has quit IRC22:56
*** clarkb has joined #opendev22:58
mordredmnaser: wow. you're cleaning up!23:11
mnasermordred: doing my best!23:12
mordredmnaser: those two changes would be _way_ smaller if they were squahsed. :)23:19
mnasermordred: i need to pad mah statz -- but mainly because i wanted to make 2 simple easy to merge and one that might likely require manual intervention on its own23:26
mordredmnaser: dude. now you're gonna make me go pad _my_ stats23:29
* mordred hasn't done a good old fashioned patch-bomb in ages23:29
openstackgerritMerged openstack/project-config master: Move retired.config to external namespace  https://review.opendev.org/73765223:32
openstackgerritMerged openstack/project-config master: openstack: move all openstack projects to dedicated acl  https://review.opendev.org/73765423:32
clarkbmy irc connection dropped and I missed the second one23:33
clarkbalso firefox is not happy trying to open those diffs23:33
openstackgerritIan Wienand proposed opendev/system-config master: [wip] grafana deployment  https://review.opendev.org/73740623:36
fungigertty opened them quite happily, fwiw23:37
ianwclarkb: https://199.204.45.223 is a held node with a containerised grafana (self signed)23:40
clarkbianw: I can confirm it is both a self signed cert and a grafana with existing nodepool problems23:41
clarkbsomewhat reassuring that those issues don't mysteriously disappear with a redeploy23:41
ianwclarkb: yeah, login is admin/adminpassword23:41
ianwit's definitely something to do with $region ... change that to * in the UI and it finds stuff23:41
ianw"This panel is deprecated. Please migrate to the new Stat panel. "23:43
ianwworking variable definition -> http://paste.openstack.org/show/795123/23:49
ianwnot working defintiion -> http://paste.openstack.org/show/795124/23:51
clarkbianw: is it the definition: thatm akes the difference?23:51
ianwpossibly, i updated via the UI and that was what it wrote back23:52
ianwi guess this is what writes it? https://opendev.org/opendev/grafyaml/src/branch/master/grafana_dashboards/schema/template/query.py#L3223:54
clarkbianw: I think thats a schema for the yaml input23:54
clarkbthough maybe its a 1:1 input to output23:55
clarkbya I think that may be the case23:56
clarkbianw: it uses the schema to parse the datasources and dashboards then it passes that input straight into grafana I think23:59
clarkbianw: so we'd need to update the schema to take definition or add a step in there that copies query to definition23:59

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!