Wednesday, 2018-09-19

*** dayou_ has joined #openstack-infra00:01
*** dhill_ has joined #openstack-infra00:03
*** hamzy has joined #openstack-infra00:05
*** janki has quit IRC00:09
*** janki has joined #openstack-infra00:09
*** janki has quit IRC00:10
*** janki has joined #openstack-infra00:11
*** sthussey has quit IRC00:15
*** gyee has quit IRC00:19
*** jiapei has quit IRC00:24
*** jamesmcarthur has joined #openstack-infra00:26
ianwhrm, this erw cloud seems to have the same problem as the linaro cloud, in that we end up with keystone telling us to use internal end-points for a range of operations (i think)00:36
*** diablo_rojo has quit IRC00:36
fungicould that be why openstackclient seemed to hate me when i was trying to test access to it?00:37
ianwfungi: no, i think that was a typo in your config :)00:38
fungiokay, that sounds FAR more likely ;)00:38
ianwhowever, openstackclient did hate me when i was doing something similar and i ended up with https://review.openstack.org/#/c/601485/00:39
ianwand this time, i've noticed that the osc can log the password in plain per https://review.openstack.org/#/c/603528/00:39
ianwso we're averaging a 100% chance of openstackclient issues when bringing up a new cloud so far :)00:40
ianwoh, i see in the latest change it's now arm64ci.cloud; i'm not using that, let me see if that makes any difference00:43
ianwit points at the same thing anyway00:44
*** longkb has joined #openstack-infra00:46
*** jamesmcarthur has quit IRC00:59
*** harlowja has quit IRC01:02
ianwahhh, ok figured out the end-point thing.  "interfaces: public" helps01:07
*** diablo_rojo has joined #openstack-infra01:11
*** slaweq has joined #openstack-infra01:11
*** evrardjp has quit IRC01:12
*** slaweq has quit IRC01:15
*** jamesmcarthur has joined #openstack-infra01:16
*** mrsoul has quit IRC01:29
*** anteaya has joined #openstack-infra01:30
*** studarus has joined #openstack-infra01:31
anteayafolks may want to read my comments on this patch: https://review.openstack.org/#/c/602697 it is about a proposal to governance about social media use01:31
anteayaapart from the larger discussion01:31
studarusclarkb: instance 4ee38e14-1181-40b1-bc00-ef479514f21b has two private IP addresses assigned...  I'll do some more investigating. End result is we run out of ports on that subnet...01:32
anteayathere is a very helpful comment from a user from korea and I offered some thoughts, others may want to offer theirs01:32
anteayaalso that user may show up in channel and have some questions as I have suggested they do just that01:32
*** hongbin has joined #openstack-infra01:37
*** studarus has quit IRC01:38
*** diablo_rojo has quit IRC01:43
*** ykarel|away has joined #openstack-infra01:52
*** imacdonn has joined #openstack-infra01:54
imacdonnin case anyone's around .... something seems unwell with the gate ... 603194,1 has been in there for 10 hours, and seems to be in a loop .. it gets almost finished, then starts over01:56
*** dpawlik has joined #openstack-infra01:56
*** anteaya has quit IRC01:58
*** dpawlik has quit IRC02:00
openstackgerritMerged openstack-infra/git-review master: Always print failure case when testing remotes  https://review.openstack.org/60276702:00
*** anteaya has joined #openstack-infra02:06
clarkbimacdonn: I think that us because the changes ahead of it in the gate keep failing causing it to start over02:09
clarkbnot ideal, but zuul is doing what we have asked of it02:09
*** jamesmcarthur has quit IRC02:12
*** armax has quit IRC02:15
*** jamesmcarthur has joined #openstack-infra02:18
*** jamesmcarthur has quit IRC02:22
*** anteaya has quit IRC02:24
*** linshuicheng[m] is now known as linshuicheng[m]102:25
*** ijw has joined #openstack-infra02:31
*** ijw has quit IRC02:36
*** ykarel|away has quit IRC02:43
imacdonnclarkb: OK... hope it makes it through eventually ... kinda seems like zuul is only creating more work for itself by timing out (?) and repeating over and over02:44
*** jamesmcarthur has joined #openstack-infra02:45
*** _ari_ has quit IRC02:45
*** rascasoft has quit IRC02:45
*** apetrich has quit IRC02:51
*** armax has joined #openstack-infra02:52
*** vivsoni has joined #openstack-infra02:56
*** armax has quit IRC02:56
*** jistr has quit IRC03:00
*** jistr has joined #openstack-infra03:00
prometheanfireclarkb: looks like app-portage/gentoolkit is expected to be installed http://logs.openstack.org/46/602446/4/check/openstack-infra-base-integration-gentoo-17-0-systemd/7fd4e0c/job-output.txt.gz#_2018-09-18_19_30_14_10674303:01
prometheanfireclarkb: where is best to add that?03:02
prometheanfireprobably in one of the pre-elements or something03:04
*** ramishra has joined #openstack-infra03:08
openstackgerritMatthew Thode proposed openstack-infra/project-config master: Install gentoolkit on Gentoo  https://review.openstack.org/60354403:08
prometheanfireclarkb: well, let me know if it's the right place/element ^03:08
*** slaweq has joined #openstack-infra03:11
*** cgoncalves|pto has quit IRC03:16
*** slaweq has quit IRC03:16
*** cgoncalves has joined #openstack-infra03:17
*** eernst has quit IRC03:20
ianwfungi / gary_perkins : sent an email with some bits on the new cloud, thanks.  LMN thoughts on projects etc03:34
*** jamesmcarthur has quit IRC03:34
*** vivsoni has quit IRC03:37
*** jamesmcarthur has joined #openstack-infra03:40
*** jamesmcarthur has quit IRC03:50
*** udesale has joined #openstack-infra03:52
*** ykarel|away has joined #openstack-infra03:53
*** jamesmcarthur has joined #openstack-infra03:56
*** jamesmcarthur has quit IRC04:00
*** jamesmcarthur has joined #openstack-infra04:00
*** vivsoni has joined #openstack-infra04:04
*** jamesmcarthur has quit IRC04:05
*** vivsoni has quit IRC04:06
*** vivsoni has joined #openstack-infra04:06
*** yamamoto has joined #openstack-infra04:07
*** rfolco has quit IRC04:08
*** jamesmcarthur has joined #openstack-infra04:09
*** jamesmcarthur has quit IRC04:13
*** jaosorior_ is now known as jaosorior04:13
*** jamesmcarthur has joined #openstack-infra04:23
*** jamesmcarthur has quit IRC04:27
*** diablo_rojo has joined #openstack-infra04:28
*** bobh has quit IRC04:31
*** harlowja has joined #openstack-infra04:32
*** rkukura has quit IRC04:35
*** rkukura has joined #openstack-infra04:35
*** bobh has joined #openstack-infra04:37
*** bobh has quit IRC04:42
*** bobh has joined #openstack-infra04:51
*** roman_g has quit IRC04:52
*** bobh has quit IRC04:55
openstackgerritIlya Etingof proposed openstack-infra/git-review master: Improve exit code implementation  https://review.openstack.org/48026704:57
*** hamzy_ has joined #openstack-infra05:01
*** hamzy has quit IRC05:02
*** ykarel|away has quit IRC05:04
*** hongbin has quit IRC05:06
*** diablo_rojo has quit IRC05:07
*** hamzy has joined #openstack-infra05:09
*** sshnaidm has joined #openstack-infra05:10
*** hamzy_ has quit IRC05:11
*** sshnaidm has quit IRC05:11
*** slaweq has joined #openstack-infra05:11
*** slaweq has quit IRC05:15
*** hamzy has quit IRC05:17
*** hamzy has joined #openstack-infra05:18
*** hamzy has quit IRC05:23
*** hamzy has joined #openstack-infra05:24
*** rcernin has quit IRC05:30
*** rcernin has joined #openstack-infra05:30
*** quique|rover|off is now known as quiquell|rover05:33
ianwniedbalski: it too me *far* too long to realise, but "identinty_interface: public" doesn't actually do anything in clouds.yaml ... it's on "interface:" that works05:37
ianwonly05:37
ianwhowever, it is "identity_api_version:" for the api version, and you can prefix like "compute_interface" if you're calling certain parts of the client directly from code05:39
openstackgerritTristan Cacqueray proposed openstack-infra/zuul master: executor: enable zuul_return to update Ansible inventory  https://review.openstack.org/59009205:44
*** ykarel|away has joined #openstack-infra05:47
*** jtomasek has quit IRC06:00
*** jtomasek has joined #openstack-infra06:01
*** rcernin_ has joined #openstack-infra06:04
*** jtomasek has quit IRC06:06
*** rcernin has quit IRC06:06
*** holser_ has joined #openstack-infra06:12
*** janki has quit IRC06:17
*** holser_ has quit IRC06:18
*** slaweq has joined #openstack-infra06:18
*** ykarel|away is now known as ykarel06:21
*** holser_ has joined #openstack-infra06:21
*** slaweq has quit IRC06:23
*** harlowja has quit IRC06:24
*** dpawlik has joined #openstack-infra06:27
*** dpawlik has quit IRC06:28
*** dpawlik_ has joined #openstack-infra06:28
*** dpawlik_ has quit IRC06:30
*** dpawlik has joined #openstack-infra06:30
*** dpawlik has quit IRC06:31
*** dpawlik_ has joined #openstack-infra06:31
*** rcernin has joined #openstack-infra06:34
*** rcernin_ has quit IRC06:36
*** jtomasek has joined #openstack-infra06:38
*** chkumar|off is now known as chkumar|ruck06:53
*** slaweq has joined #openstack-infra06:55
*** janki has joined #openstack-infra06:55
*** apetrich has joined #openstack-infra06:57
*** slaweq has quit IRC07:00
*** rcernin has quit IRC07:06
*** olivierb has joined #openstack-infra07:09
*** hashar has joined #openstack-infra07:09
*** jamesmcarthur has joined #openstack-infra07:13
*** slaweq has joined #openstack-infra07:16
*** jamesmcarthur has quit IRC07:17
*** ykarel is now known as ykarel|lunch07:25
egonzalezhi, EPEL mirror still missing some packages which are on public epel https://dl.fedoraproject.org/pub/epel/7/x86_64/Packages/u/uwsgi-plugin-python2-2.0.17.1-1.el7.x86_64.rpm07:28
egonzalezwell,  package is actually there http://mirror.mtl01.inap.openstack.org/epel/7/x86_64/Packages/u/uwsgi-plugin-python2-2.0.17.1-1.el7.x86_64.rpm07:29
openstackgerritSimon Westphahl proposed openstack-infra/nodepool master: Cleanup of leaked resource for static driver  https://review.openstack.org/60008407:30
egonzalezbut building docker images in queens branch fails because the missing package, not in rocky or master07:30
openstackgerritSimon Westphahl proposed openstack-infra/nodepool master: Implement liveness check for static nodes  https://review.openstack.org/60151307:30
egonzalezany idea why can this be happening?07:30
egonzalezINFO:kolla.image.build.barbican-base: * epel: mirror.mtl01.inap.openstack.org07:32
egonzalezINFO:kolla.image.build.barbican-base:No package uwsgi-plugin-python available.07:32
ianwegonzalez: is it an upstream issue?  I'm not seeing any issues with the epel mirroring process07:34
ianwcan you link to the full logs?07:35
*** e0ne has joined #openstack-infra07:36
*** tosky has joined #openstack-infra07:37
*** jtomasek has quit IRC07:37
*** Gorian has quit IRC07:37
*** jesusaur has quit IRC07:39
*** pcrews has quit IRC07:39
*** jtomasek has joined #openstack-infra07:39
*** ginopc has joined #openstack-infra07:40
*** jesusaur has joined #openstack-infra07:44
*** e0ne has quit IRC07:48
strigazihello AJaeger, after this you proposed in magnum, http://git.openstack.org/cgit/openstack/magnum/commit/?id=4a1a4be0d315dbce44fd569b491c989a403017c0 the cover job is voting. How can we make it non-voting?07:51
egonzalezianw yep, the stable/queens jobs are failing due missing uswgi-plugin-python package missing http://logs.openstack.org/periodic-stable/git.openstack.org/openstack/kolla/stable/queens/kolla-publish-centos-binary/c3a643e/logs/build/000_FAILED_barbican-api.txt.gz07:56
ianwegonzalez : hrm, i think maybe that's a red herring07:57
ianwbefore that07:57
ianwTimeout on https://copr-be.cloud.fedoraproject.org/results/iwienand/zookeeper-el7/epel-7-x86_64/repodata/repomd.xml: (28, 'Connection timed out after 30001 milliseconds')07:57
ianwperhaps copr is having downtime?07:57
ianwegonzalez: actually, maybe again ... Determining fastest mirrors08:01
ianwINFO:kolla.common.utils.barbican-api: * epel: fedora-epel.mirror.iweb.com08:01
ianwi don't think it's using our mirror08:01
egonzalezianw the last attempt tries to use openstack mirrors INFO:kolla.common.utils.barbican-api:Determining fastest mirrors08:02
egonzalezINFO:kolla.common.utils.barbican-api: * epel: mirror.mtl01.inap.openstack.org08:02
egonzalezianw btw, this is the replace we use in gates for the mirrors https://github.com/openstack/kolla/blob/d609c318bf374217c5d2e40a9e51dd565581333d/tests/templates/template_overrides.j2#L4008:06
*** Emine has joined #openstack-infra08:06
egonzalezhrm, may be the fedora url changed, http://download.fedoraproject.org/pub to http://dl.fedoraproject.org/pub08:08
*** jpich has joined #openstack-infra08:08
egonzalezin master is getting the package from delorean, thats why is not failing08:14
*** rossella_s has quit IRC08:14
*** rossella_s has joined #openstack-infra08:15
*** roman_g has joined #openstack-infra08:17
openstackgerritSorin Sbarnea proposed openstack-infra/git-review master: Avoid UnicodeEncodeError on python 2  https://review.openstack.org/58353508:20
AJaegerstrigazi: remove the cover template, move it to check queue with "voting: false" to it.08:21
strigaziAJaeger: thanks08:22
AJaegerstrigazi: why don't you want it voting? Just curious08:22
AJaegerstrigazi: happy to review a change - but no time right now to do it myself08:22
strigaziAJaeger It is buggy, it has false negative, until we fix it, we have it non-voting08:22
AJaegerI see. You can also leave the template in in that case08:22
strigaziI didn't get the last comment08:23
AJaegerstrigazi: let me do it quickly for you...08:23
strigaziAJaeger: https://review.openstack.org/60300108:24
AJaegerhttps://review.openstack.org/603594 - I can abandon again ;)08:25
*** e0ne has joined #openstack-infra08:25
AJaegerstrigazi: my change is correct - your call on how to continue...08:25
* AJaeger needs to step out again08:26
*** e0ne has quit IRC08:26
strigaziAJaeger: thanks for your time, I really really apreciate it.08:26
*** jpena|off is now known as jpena08:31
*** e0ne has joined #openstack-infra08:33
*** ykarel|lunch is now known as ykarel08:41
*** alexchadin has joined #openstack-infra08:43
*** derekh has joined #openstack-infra08:49
*** owalsh has quit IRC08:51
*** owalsh has joined #openstack-infra08:56
*** dtantsur|afk is now known as dtantsur08:57
*** e0ne has quit IRC09:02
openstackgerritSorin Sbarnea proposed openstack-infra/zuul master: Assure that status tooltip is displayed on entire row  https://review.openstack.org/60350409:05
openstackgerritMarkos Chandras (hwoarang) proposed openstack-infra/system-config master: modules: opensuse-mirror: Switch to US mirror for OBS repositories  https://review.openstack.org/60361009:05
hwoaranginfra-root: ^^ could we get this in please? it's impacting jobs again :(09:06
*** gfidente has joined #openstack-infra09:12
*** dpawlik_ has quit IRC09:13
*** dpawlik has joined #openstack-infra09:13
*** markvoelker has quit IRC09:25
*** elod has quit IRC09:26
*** elod has joined #openstack-infra09:27
*** dangtrinhnt has joined #openstack-infra09:41
*** owalsh has quit IRC09:46
*** owalsh has joined #openstack-infra09:52
ssbarneaianw: it took some time to add unittests but it was possible, see https://review.openstack.org/#/c/583535/ -- tripled the efforth but now we can avoid regressions.09:52
*** ijw has joined #openstack-infra09:54
mordredslaweq: morning!09:58
*** ijw has quit IRC09:58
slaweqmordred: hi09:58
slaweqisn't it very early for You? :)09:58
mordredslaweq: yes, yes it is :)09:59
mordredslaweq: but - maybe that's a good thing for helping you track down the neutron thing?09:59
slaweqmordred: thx a lot10:00
slaweqmordred: basically I have two things two debug, one, more important is failing neutron-grenade-dvr-multinode job10:00
slaweqbut biggest problem with this one is that when I logged in to node, everything worked fine :/10:01
slaweqso I'm now trying to debug it by adding some additional logs and running it in gate10:01
slaweqmaybe I will figure out what's going on there10:01
mordredslaweq: ugh, that sounds like fun10:02
slaweqmordred: yup :)10:02
slaweqmordred: but maybe You can help me with this second thing10:02
mordredI hope so10:02
slaweqmordred: I still have no idea why in job neutron-tempest-plugin-dvr-multinode-scenario-zuulv3 (https://review.openstack.org/#/c/578796/) all FIP from subnode-2 are not reachable10:03
slaweqI suspect that there is some difference between this "new" multinode setup and old legacy job10:03
slaweqcan You maybe set for me on autohold both those jobs? I would recheck my patch and then I would be maybe able to log to both of them and compare config of nodes10:04
mordredslaweq: sure! also - we should pull in clarkb when he gets up on the floating ip being reachable thing - he is the master of the networking setup for multinode jobs10:05
mordredslaweq: autoholds are now set10:05
slaweqmordred: good to know that, so I will compare those configs and then I will get back to clarkb if I will need any help with this and if this will be really nodes' networking setup10:05
slaweqthx a lot10:06
slaweqis it on hold for specifi project or for any project?10:06
slaweqI mean, will it be holded if will fails on neutron patch instead of neutron-tempest-plugin?10:06
mordredoh - I did it for the same one as last time - so ...10:06
mordredyeah, openstack/neutron-tempest-plugin for neutron-tempest-plugin-dvr-multinode-scenario-zuulv310:07
mordredalthough I could set it for a different project if you prefer10:07
slaweqthat is what I want :)10:07
mordredsweet10:07
slaweqthx a lot10:07
slaweqif I will have such failed job I will come back to You :)10:07
dangtrinhntHi openstack-infra team. Sorry for interrupting your conversation. I'm trying to add myself as the channel's operator of the #openstack-searchlight channel (I'm the PTL) but helpless. And I cannot contact the last PTL. The open-infra docs says something about setting mask as full_mask but I don't think it's it.10:11
dangtrinhntIt would be great if someone can give me a hint. Thanks.10:12
ianwdangtrinhnt: are you messaging chanserv?10:20
dangtrinhntPardon me. I don't understand your question. :) I'm not the operator of #openstack-searchlight because I just joint last month.10:29
dangtrinhntoh, looks like you grant the rights for me. Many thanks.10:29
dangtrinhntianw10:29
*** markvoelker has joined #openstack-infra10:30
openstackgerritMerged openstack-infra/puppet-asterisk master: Ensure asterisk refresh happens last  https://review.openstack.org/60174910:31
*** aidin has joined #openstack-infra10:34
*** longkb has quit IRC10:34
openstackgerritMerged openstack-infra/system-config master: Turn on the future parser for pbx.o.o  https://review.openstack.org/60183710:37
*** longkb has joined #openstack-infra10:38
*** longkb has quit IRC10:39
*** longkb has joined #openstack-infra10:40
*** longkb has quit IRC10:42
*** e0ne has joined #openstack-infra10:47
*** aidin has left #openstack-infra10:50
*** markvoelker has quit IRC11:00
*** panda has joined #openstack-infra11:00
*** priteau has joined #openstack-infra11:16
*** imacdonn has quit IRC11:19
*** imacdonn has joined #openstack-infra11:20
*** jpena is now known as jpena|lunch11:20
*** pbourke has quit IRC11:22
*** pbourke has joined #openstack-infra11:24
*** dhill_ has quit IRC11:25
*** sambetts|afk is now known as sambetts11:35
*** ssbarnea has quit IRC11:39
*** owalsh has quit IRC11:42
*** rh-jelabarre has joined #openstack-infra11:46
*** _Cyclone_ has quit IRC11:48
*** ansmith has quit IRC11:49
*** _Cyclone_ has joined #openstack-infra11:52
quiquell|roverHello11:52
quiquell|roverdangtrinhnt, mordred: Do you know what happends with zuul_changes for this https://review.openstack.org/#/c/594145/ ?11:53
quiquell|roverIt has a Depends-On that point to multiple reviews11:53
quiquell|roverlooks like zuul_changes just show the puppet-tripleo ZUUL_CHANGES=openstack/puppet-tripleo:stable/queens:refs/changes/45/594145/311:54
AJaegerinfra-root, OVH mainteance should be over in 5 minutes and I'll then approve https://review.openstack.org/#/c/603174/ to get back the cloud. Anybody else to review?11:55
*** ssbarnea has joined #openstack-infra11:56
*** aperevalov has quit IRC11:56
*** udesale has quit IRC11:56
*** markvoelker has joined #openstack-infra11:57
openstackgerritGabriele Cerami proposed openstack-infra/project-config master: Allow push and push merge commit for tripleo-quickstart  https://review.openstack.org/60237711:59
*** ijw has joined #openstack-infra12:02
mordredquiquell|rover: I'm not sure about ZUUL_CHANGES (that's a legacy compat thing and I'm not 100% sure of the interactions), but I highly recommend switching the depends-on to use the url form instead of the changeid form - because regardless of what goes into zuul_changes, zuul will not merge that patch until all three matching changes have landed12:04
mordredheh. seems like you got that taken care of12:05
mordredAJaeger: I tossed a +2 on there12:05
*** ijw has quit IRC12:07
AJaegerthanks, mordred12:08
quiquell|rovermordred: thanks, we have change to full url12:11
quiquell|rovermordred: we are still depending on legacy ZUUL_CHANGES12:11
openstackgerritMerged openstack-infra/project-config master: Revert "OVH BHS1 Maintenance" - 2018-09-19 1200UTC  https://review.openstack.org/60317412:12
*** dotplus has left #openstack-infra12:14
*** rfolco has joined #openstack-infra12:16
*** jamesmcarthur has joined #openstack-infra12:18
*** e0ne has quit IRC12:20
*** hashar is now known as hasharAway12:21
*** jpena|lunch is now known as jpena12:24
*** ijw has joined #openstack-infra12:26
*** ijw has quit IRC12:30
*** jamesmcarthur has quit IRC12:31
*** markvoelker has quit IRC12:31
*** quiquell|rover is now known as quique|rover|lch12:33
*** kgiusti has joined #openstack-infra12:42
openstackgerritMerged openstack-infra/system-config master: modules: opensuse-mirror: Switch to US mirror for OBS repositories  https://review.openstack.org/60361012:44
AJaegerinfra-root, can you check the graphs for OVH BHS1? Is everything looking fine in that cloud? I see 10 launch errors/min and wonder whether thta is fine12:47
dpawlikAJaeger: We are still upgrading the infra12:49
dpawlikAJaeger: it is possible that some part of compute hosts has been disabled. Please wait one/two hours12:50
*** jamesmcarthur has joined #openstack-infra12:51
*** dhill_ has joined #openstack-infra12:52
*** alexchadin has quit IRC12:53
AJaegerdpawlik: oh - should we swithc them off again?12:53
dpawlikAJaeger: if you can just wait some time :)12:54
openstackgerritAndreas Jaeger proposed openstack-infra/project-config master: Revert "Revert "OVH BHS1 Maintenance" - 2018-09-19 1200UTC"  https://review.openstack.org/60374112:54
*** alexchadin has joined #openstack-infra12:54
dpawlikAJaeger: or maybe I will ask team responsible for upgrade if they can upgrade OS aggregation12:54
AJaegerdpawlik: just tell us when you're ready, please12:54
AJaegerinfra-root, can we promote 603741 ?12:55
AJaegermordred: ^12:55
EmilienMhello infra, can someone approve https://review.openstack.org/#/c/602869/ ? thanks12:55
*** trown|outtypewww is now known as trown|brb12:55
*** trown|brb is now known as trown12:56
*** vivsoni has quit IRC12:56
*** vivsoni has joined #openstack-infra12:56
dpawlikAJaeger: you are a "prio" :)12:57
AJaegerdpawlik: thanks ;) Just takes time right now to switch them off again, I don#t have the permissions to help immediately.12:58
*** kukacz_ is now known as kukacz13:00
*** ansmith has joined #openstack-infra13:04
*** holser_ has quit IRC13:04
*** holser__ has joined #openstack-infra13:04
AJaegerinfra-root, or change the value in nodepool for OVH BHS1 directly?13:09
mordredAJaeger: looking13:09
*** alexchadin has quit IRC13:11
openstackgerritMerged openstack-infra/project-config master: Revert "Revert "OVH BHS1 Maintenance" - 2018-09-19 1200UTC"  https://review.openstack.org/60374113:11
AJaegerthanks, mordred13:11
AJaegerdpawlik: ^13:12
mordredAJaeger: :5313:12
mordredgah13:12
mordredAJaeger: I also edited the file on nl04 directly13:12
AJaegermordred: thanks13:15
AJaegerdpawlik: ok, we should not launch anything anymore...13:15
*** alexchadin has joined #openstack-infra13:15
*** janki has quit IRC13:16
openstackgerritAndreas Jaeger proposed openstack-infra/project-config master: Revert "Revert "Revert "OVH BHS1 Maintenance" - 2018-09-19 1200UTC""  https://review.openstack.org/60376613:18
*** jamesmcarthur has quit IRC13:18
cmurphylove the triple revert13:18
AJaegerinfra-root, please take over once OVH is ready ^ - no further time13:18
*** jamesmcarthur has joined #openstack-infra13:19
*** sthussey has joined #openstack-infra13:20
*** quique|rover|lch is now known as quiquell|rover13:21
*** mriedem has joined #openstack-infra13:21
openstackgerritSlawek Kaplonski proposed openstack-infra/project-config master: Add openstack-python36 job to Neutron Grafana dashboard  https://review.openstack.org/59557313:28
*** tpsilva has joined #openstack-infra13:36
*** udesale has joined #openstack-infra13:38
*** bobh has joined #openstack-infra13:46
*** jamesmcarthur has quit IRC13:49
*** ginopc has quit IRC13:50
*** jamesmcarthur has joined #openstack-infra13:50
*** bobh has quit IRC13:51
*** rascasoft has joined #openstack-infra13:51
*** ginopc has joined #openstack-infra13:52
*** ginopc has quit IRC13:56
*** bobh has joined #openstack-infra13:58
*** janki has joined #openstack-infra13:59
*** bobh has quit IRC14:02
*** janki has quit IRC14:04
*** ykarel is now known as ykarel|away14:04
*** ginopc has joined #openstack-infra14:04
slaweqmordred: hi14:06
slaweqmordred: one of my jobs just failed: http://logs.openstack.org/96/578796/18/check/neutron-tempest-plugin-dvr-multinode-scenario-zuulv3/e1620d0/14:06
slaweqcan You check those nodes and add my ssh key to them?14:06
dpawlikslaweq: so sad :P14:06
slaweqdpawlik: why? I was waiting for that since morning :)14:07
*** aidin has joined #openstack-infra14:07
dpawlikslaweq: oh14:07
*** jtomasek has quit IRC14:08
*** bobh has joined #openstack-infra14:08
mordredslaweq: ok - looks like both jobs failed actually- so 198.72.124.232 and 198.72.124.237 are neutron-tempest-plugin-dvr-multinode-scenario-zuulv314:10
mordredslaweq: and 23.253.201.43 and 23.253.213.20 are neutron-grenade-dvr-multinode14:11
mordredslaweq: oh - wait - I got those backwards14:11
openstackgerritMatthew Thode proposed openstack-infra/project-config master: Install gentoolkit on Gentoo  https://review.openstack.org/60354414:11
slaweqmordred: thx a lot14:11
mordredslaweq: 23. are neutron-tempest-plugin-dvr-multinode-scenario-zuulv3 - 198. are neutron-grenade-dvr-multinode14:11
slaweqok, great :)14:11
slaweqthx14:11
mordredsure thing!14:11
*** bobh has quit IRC14:12
efriedGood UGT morning all. Random question for gerrit query mavens: Are wildcards/regexes of any kind supported? E.g. if I want to search by topic:bug/.* kind of thing?14:15
cmurphyefried: it works best if you double quote it and use ^ and $14:17
cmurphytopic:"^bug/.*$"14:17
efriedhmph, snot working for me.14:18
*** bobh has joined #openstack-infra14:19
cmurphyhmm maybe it doesn't work for topics14:19
efriedboo. Okay, thanks cmurphy.14:20
slaweqmordred: there is 2 small problems with neutron-grenade-dvr-multinode14:22
slaweqmordred: 1. it's upgrade from ocata to pike and that is not what I was looking for :/14:22
slaweqmordred: 2. in fact I wanted today to set autohold on neutron-tempest-plugin-dvr-multinode-scenario job in neutron-tempest-plugin instead of this grenade job :D14:22
slaweqbecause I wanted to compare it with setup of neutron-tempest-plugin-dvr-multinode-scenario-zuulv3 which should be the same but isn't for some reason14:23
*** alexchadin has quit IRC14:25
*** bobh has quit IRC14:27
clarkbslaweq: mordred dvr requires we use the test env overlay to route FIPs between nodes as they can be terminated on each node when using dvr14:29
clarkbthis isnt necessary without dvr because all FIPs are terminated on the controller and we run tempest there so all IPs are local14:29
*** alexchadin has joined #openstack-infra14:30
clarkbI would double check you have the test env overlay in place for br-ex on the dvr job14:30
mordredslaweq: well poo. so should I set an autohold for neutron-tempest-plugin-dvr-multinode-scenario on neutron-tempest-plugin?14:31
*** bobh has joined #openstack-infra14:32
clarkbwe have ascii art about this somewhere too14:33
clarkbbut the issue is routing arbitrary FIP range in arbitrary cloud netowkring14:33
smcginnisfungi: You had mentioned in a thread about making our cgit interface nicer. Has anyone looked at gogs.io?14:33
mordredsmcginnis: yah. I looked at it a while back14:34
clarkbwe do that with a vxlan overlay between test nodes which gives us direct attached routes to the larger overlay range thrn neutron assigns FIPs out of a subset14:34
mordredsmcginnis: I like it conceptually, but I haven't gotten far enough to make any formal suggestions14:34
smcginnismordred: Just a quick local trial, I kind of like it.14:34
mordredsmcginnis: luckily it DOES allow completely disabling pull requests, issues, wikis, etc14:34
smcginnismordred: And it might make it easier for those more familiar with github.14:35
smcginnis++14:35
clarkbgogs has a fork too also they were owned on github iirc14:35
clarkb(and dont self host)14:35
smcginnisI was wondering though, I wonder if we could somehow bridget that pull request interface into being able to submit reviews.14:35
smcginnisAnother way to make it easier for folks more familiar with github.14:35
mordredyah - that is a thing that is a much harder thing14:35
*** gouthamr_ is now known as gouthamr14:35
mordredbecause we'd need to make an entire UI interface that used gerrit as a backend14:35
smcginnisYeah, that would probably require a fair amount of noodling and work.14:36
mordredyup14:36
clarkbgitea14:36
*** bobh has quit IRC14:36
smcginnisclarkb: Oh, that looks nice too. Very similar.14:36
mordredclarkb: https://notabug.org/hp/gogs/ is the fork I was looking at14:36
mordredbut any of them would be fine14:36
clarkbgitea is a community managed fork of gogs because gogs went idle14:37
smcginniscgit is a litte... aged.14:37
* prometheanfire likes cgit :P14:37
clarkbBut my big issue with all of the  is non self host14:37
smcginnisprometheanfire is a little... aged.14:37
smcginnis:P14:37
clarkbif they arent good enough for their authors...14:37
*** udesale has quit IRC14:37
smcginnisclarkb: Hah, I chuckled when I noticed that too.14:37
mordredclarkb: notabug self-hosts14:38
prometheanfiresmcginnis: iirc we have a dev for it, so are likely to stay on it14:38
prometheanfirereally just depends on what you want though14:38
smcginnisYeah, it totally works. Just many people end up browsing the github mirrors instead since it's so darn purdy.14:39
clarkbmordred: ah neat14:39
prometheanfireI'm one of those, but more because of muscle memory14:39
mordredI like the simplicity of cgit and the fact that it doesn't require a database or anything lke that to run. however, I will admit that github/gogs are a nicer browsing experience for humans14:39
AJaegerclarkb: did you see scrollback about OVH?14:39
clarkbAJaeger: ya looks like dpawlik will give us tue go ahead?14:40
mordredand with a legit open source choice in gogs that we coudl use to make git.openstack.org more prettier, I don't think it's a bad idea to explore it14:40
*** dayou_ has quit IRC14:40
prometheanfireswitching to blame view, and going back in history is nice14:40
AJaegerclarkb: yes. Could you keep this on your radar, please? I'm busy right now...14:40
prometheanfirenot sure if cgit has the compare function that github has (which I use a ton)14:40
smcginnisI would be willing to help however I can if there is enough interest to make a change.14:40
smcginnisprometheanfire: ++14:40
*** bobh has joined #openstack-infra14:40
clarkbmordred: smcginnis github UI has some huge flaws for browsing though. md/rst are always rendered or raw so no line linking. Also line width is like 80 chars so glhf browsing wide code14:40
clarkbAJaeger: yup14:40
mordredsmcginnis: most of the effort will likely be around integrating with project creation workflow14:41
smcginnisclarkb: Yeah, that's about the only thing I like better in cgit. :D14:41
mordredsmcginnis, clarkb but with the opendev stuff coming up - perhaps it's worthy of a spec14:41
clarkbmordred: if we are picking priorities gerrit upgrade and actual opendev work is probably far ahead of this14:42
mordredI would be happy to volunteer to write one and then dump work on smcginnis :)14:42
mordredclarkb: oh - totally14:42
smcginnis:)14:42
slaweqclarkb: thx for tips, basically I'm using tempest-multinode-full job as parent for my job14:42
slaweqclarkb: so I was thinking that it will configure everything for me just fine14:43
*** quiquell|rover is now known as quique|rover|off14:43
mordredbut if we had some stuff written down, someone (such as smcginnis) could reasonably work on some of the pre-reqs while we work through the other priorities14:43
clarkbsmcginnis: mordred for code submission I think bridging PRs and changes in gerrit is likely to only lead to pain14:43
prometheanfiresmcginnis: if you have an account this should work14:43
clarkbnow you have to understand two workflows a d how they go together14:43
mordredclarkb: yah - I do not desire to do that14:43
prometheanfirehttps://try.gogs.io/gogs/gogs/compare/v0.11.53...v0.11.6614:43
*** alexchadin has quit IRC14:43
clarkbwhich is worse than just understanding the one14:44
mordredclarkb: I would be *strictly* interested in it as code browsing same as cgit is today14:44
prometheanfirewell, comparing commits works14:44
prometheanfirebut not tags?14:44
clarkbslaweq: I dont think so because neutron without dvr has different routing needs for the FIP range than neutron with dvr14:44
*** aidin has quit IRC14:44
clarkbslaweq: pretry sure you need to add the multinode networking overlay for br-ex14:44
mordredslaweq: do you want me to put in the autohold on the correct job/project?14:44
*** jtomasek has joined #openstack-infra14:45
mordredor is the stuff from clarkb good for now?14:45
slaweqmordred: I will try what clarkb is suggesting14:45
*** bobh has quit IRC14:45
slaweqwill ping You later if I will need Your help again14:45
slaweqthx a lot14:45
prometheanfiresmcginnis: looks like it's an open issue (compare tags/branches in gogs)14:45
mordredslaweq: cool14:46
prometheanfirehttps://github.com/gogs/gogs/issues/362114:46
clarkbslaweq: https://git.openstack.org/cgit/openstack-infra/devstack-gate/tree/multinode_setup_info.txt#n81 ascii art explaining the neutron scenarios with and without dvr14:47
smcginnisprometheanfire: Seems pretty basic (and easy to implement)14:47
prometheanfireya14:48
clarkbslaweq: actually that first diagram may be wrong, there is no br-ex vxlan tunnel in the non dvr case I think14:48
*** bobh has joined #openstack-infra14:48
prometheanfirelooks like gitea has the same problem14:48
slaweqclarkb: thx, I will check that14:49
clarkbslaweq: but in the dvr case we have to do the br-ex vxlan tunnel so that we can route for the FIP networking on top of regularl cloud netowrking14:49
clarkbslaweq: multi-node-bridge is the role name in openstack-infra/zuul-jobs. By default it creates a bridge with interfaces named br-infra on all the nodes14:52
clarkbslaweq: I think you can bridge br-ex onto that and it will work?14:52
clarkbor you can do like devstack-gate did and just make br-ex the bridge interface name on all the nodes (multi-node-bridge supports setting these variables)14:52
*** bobh has quit IRC14:53
openstackgerritsebastian marcet proposed openstack-infra/openstackid-resources master: Fix on update affiliation endpoint  https://review.openstack.org/60382414:54
openstackgerritMerged openstack-infra/openstackid-resources master: Fix on update affiliation endpoint  https://review.openstack.org/60382414:55
*** alexchadin has joined #openstack-infra14:57
*** bobh has joined #openstack-infra14:59
clarkbslaweq: let me know if you have questions on how to piece that together. The best place to start may be looking back to the old legacy dvr job defs14:59
clarkbbut I'm happy to help as I can too14:59
slaweqclarkb: ok, thx a lot. I will get back to You if I will need something14:59
*** janki has joined #openstack-infra15:03
*** bobh has quit IRC15:03
slaweqclarkb: so when I connected br-ex with br-infra with patch ports, connectivity works fine :)15:07
*** ykarel|away is now known as ykarel15:07
clarkbslaweq: cool that is what I expected15:07
clarkbslaweq: good to know that piece is working at least :)15:07
slaweqdo You think I can set br-infra as external bridge instead of br-ex in job definition then?15:07
clarkbslaweq: you could try it, it is ovs in part because it made integrating it into neutron that way simpler15:08
*** janki has quit IRC15:08
*** bobh has joined #openstack-infra15:10
*** Emine has quit IRC15:10
*** bobh has quit IRC15:14
slaweqclarkb: can You check if something like in https://review.openstack.org/#/c/578796/19/.zuul.yaml in L178 would be enough to replace br-infra with br-ex directly? because then it should also works IIUC, right?15:16
*** udesale has joined #openstack-infra15:17
clarkbslaweq: yes bridge_name to br-ex in the vars is what I would try first15:17
slaweqthx clarkb15:17
slaweqso lets now check for results :)15:17
*** alexchadin has quit IRC15:19
slaweqmordred: You can remove those nodes with neutron-tempest-plugin-dvr-multinode-scenario-zuulv315:21
slaweqI think I found what's wrong there, with big help from clarkb :)15:21
slaweqthx to both of You guys :)15:21
clarkbslaweq: maybe you want to take that ascii "art" I made in devstack-gate and give it a new home closer to where the new jobs will live15:22
clarkbslaweq: so that the next person doesn't need to work so hard to find that info :)15:22
*** bobh has joined #openstack-infra15:22
slaweqclarkb: You mean to store it somewhere in neutron repo?15:24
*** armax has joined #openstack-infra15:25
*** bobh has quit IRC15:27
clarkbslaweq: if it doesn't look too ugly as a comment next to the dvr jobs maybe do that?15:27
*** dtantsur is now known as dtantsur|brb15:28
slaweqclarkb: that is IMO good idea, I will add comment with link to this description there15:28
slaweqbut let's wait for result now :)15:28
slaweqthx once again for help15:28
fungismcginnis: one reason we switched to cgit is that it's also what kernel.org uses. it's possible newer cgit would also be nicer15:28
smcginnisfungi: Not that I saw. :)15:29
*** jamesmcarthur has quit IRC15:30
*** eernst has joined #openstack-infra15:30
fungibut yeah, i'm open to alternatives. we can even host them in parallel for a while fairly easily15:30
*** hasharAway is now known as hashar15:30
mordredfungi: ++15:30
*** bobh has joined #openstack-infra15:31
fungithough i would eventually like to add some fancy rewrite rules to allow us to use the same urls for browsing and git remotes15:31
mordredfungi: I would also like to do that15:31
fungii have a set i use for some of my personal projects and haven't run into any issues yet15:31
*** jamesmcarthur has joined #openstack-infra15:32
*** chkumar|ruck is now known as chkumar|off15:34
*** bobh has quit IRC15:36
*** gyee has joined #openstack-infra15:37
*** bobh has joined #openstack-infra15:37
*** bobh has quit IRC15:42
*** panda has quit IRC15:42
openstackgerritMarkus Hosch proposed openstack-infra/zuul master: Add support for authentication/STARTTLS to SMTP  https://review.openstack.org/60383315:42
openstackgerritMarkus Hosch proposed openstack-infra/zuul master: Add support for authentication/STARTTLS to SMTP  https://review.openstack.org/60383315:44
openstackgerritMarkus Hosch proposed openstack-infra/zuul master: Add support for authentication/STARTTLS to SMTP  https://review.openstack.org/60383315:46
*** dtantsur|brb is now known as dtantsur15:46
*** pcaruana has joined #openstack-infra15:47
*** bobh has joined #openstack-infra15:47
*** bobh has quit IRC15:47
*** bobh has joined #openstack-infra15:47
*** bdodd has joined #openstack-infra15:50
*** yamamoto has quit IRC15:53
*** ykarel is now known as ykarel|away15:56
clarkbfungi: are you in a spot to make the git review release today?15:57
clarkbapparently gerrit is waiting on us before making their next release15:57
fungiclarkb: yeah, was there anything urgent to get in, or should we just plan to have a couple of releases closer together?15:58
clarkbfungi: I think the most important item is whatever fix we need for gerrit (is that refs/for vs refs/otherthing) ?15:58
fungii thought it already merged... double checking16:00
clarkbya I think it did, just double checking there aren't other fixes that gerrit needs16:00
*** diablo_rojo has joined #openstack-infra16:00
clarkbaccording to the email thread that is the one16:00
clarkbI think we can do multiple releases if we need to16:01
clarkbget this fix out for gerrit to release16:01
*** ykarel|away has quit IRC16:01
fungissbarnea and others have pointed out 195043 601251 200860 480267 as further possibilities for inclusion16:02
clarkbI'll take a quick look16:02
clarkbhttps://review.openstack.org/#/c/601251/ seems appropriate given its similarity to the refs/for issue16:03
fungiagreed. i haven't had time to dig into the implementation but if it lgty it already has a +2 from jhesketh16:03
clarkbhttps://review.openstack.org/#/c/480267/3 is potentially backward incompatible change for scripts16:04
clarkbI don't think we should have ^ in this release as a result16:04
clarkbhttps://review.openstack.org/#/c/200860/ is -1'd so maybe we can skip it too16:04
clarkbhttps://review.openstack.org/#/c/195043/ is a new feature so that can probably go in with 480267 if we want it16:04
clarkbtldr I'll review 601251 now16:05
fungithanks for the quick overview16:05
fungistill trying to catch up this morning unfortunately16:05
clarkbI'm going to test that change16:06
clarkbit adds a comma to the push command I'm not sure is correct16:07
*** e0ne has joined #openstack-infra16:07
clarkbat least the previous +='s didn't add commas16:07
fungii need to go drop christine off at work real fast and then can push the tag one way or the other16:11
* fungi will brb16:11
*** agopi has quit IRC16:12
*** e0ne has quit IRC16:17
openstackgerritClark Boylan proposed openstack-infra/git-review master: Do Not Merge testing  https://review.openstack.org/60384216:19
clarkbfungi: I left comment on https://review.openstack.org/#/c/601251/3 but change lgtm. If my comments makes sense to you I would say approve it then we can tag that for a new release16:22
*** dtantsur is now known as dtantsur|afk16:23
*** olivierb has quit IRC16:26
*** yamamoto has joined #openstack-infra16:26
*** pcaruana has quit IRC16:27
*** udesale has quit IRC16:28
*** e0ne has joined #openstack-infra16:31
openstackgerritSorin Sbarnea proposed openstack-infra/git-review master: Avoid UnicodeEncodeError on python 2  https://review.openstack.org/58353516:32
*** zhangfei has joined #openstack-infra16:32
clarkbI'm going to grab breakfast now16:32
*** jpich has quit IRC16:32
fungithanks, back and taking a look now16:34
*** e0ne has quit IRC16:35
*** ginopc has quit IRC16:35
*** holser__ has quit IRC16:36
*** ykarel has joined #openstack-infra16:36
*** panda has joined #openstack-infra16:38
fungiclarkb: what do you think about 601823? seems like a fairly trivial bug fix we might also want to stuff in here16:41
*** ramishra has quit IRC16:42
*** bobh has quit IRC16:43
fungilooks like it may be redundant with your 532359 change now that i look closer16:45
*** agopi has joined #openstack-infra16:50
clarkbfungi: the int types thing was important I remember16:51
clarkbbecause gerrit was giving it as a json string but now a json int16:51
*** derekh has quit IRC16:51
clarkband you get type mismatches without that depending on the gerrit version16:51
clarkbfor that reason I'm inclined to go with my change (but I am clearly a biased opinion :) )16:51
clarkbfungi: I'd rather not approve my own change there though. Maybe you want to treat electrofelix +1 as good enough?16:53
fungido you think we should include it in today's release, or defer and make sure it solves the broken -m behavior?16:53
clarkbI think I tested it when I wrote it and it worked16:54
clarkbits definitely broken without that chagne so if we get it in now under the assumption it fixes it the worst that can happen is it will still be broken16:54
*** zhangfei has quit IRC16:56
openstackgerritMerged openstack-infra/project-config master: Add openstack-python36 job to Neutron Grafana dashboard  https://review.openstack.org/59557316:56
openstackgerritSorin Sbarnea proposed openstack-infra/zuul master: Assure that status tooltip is displayed on entire row  https://review.openstack.org/60350416:59
otherwiseguyis it just me or is the gate, like, super slow. Like things in for > 12 hours slow?16:59
clarkbjroll: about? curious if you had more info on the edge glance needs. Thinking that maybe nodepool actually solves that problem for them (maybe not in the most efficient manner)17:00
fungiotherwiseguy: one of our largest donors is performing upgrade maintenance and we've had to disable them temporarily17:00
clarkbotherwiseguy: top of the integrated openstack gate has been around for almost a day. its ^ as well as flaky testing17:00
otherwiseguyah. that would do it. :D17:00
fungithey thought they were going to be finished sooner, but it's taking longer than they expected17:00
logan-heh node requests at a 90 day high http://grafana.openstack.org/d/T6vSHcSik/zuul-status?panelId=17&fullscreen&orgId=1&from=now-90d&to=now17:01
fungiotherwiseguy: yeah, for gate resets in particular, you can probably blame drunken coding at the ptg ;)17:01
clarkbimacdonn: I meant to clarify further last night but it was late for me. That is how zuul operates. It builds a speculative future state based on code reviewers approving code (it effectively serializes all of those changes in the order zuul hears about the approvals), assumes they will all work because hey the humans said they will and tests them that way. If they all pass testing we can merge all of17:01
clarkbthem together17:01
otherwiseguyso...much...ptg...beer17:01
clarkbimacdonn: the trouble is when they start failing we have to discard those results, build a new speculative state that removes the broken change then start over17:02
clarkbimacdonn: the way to improve this is to write more reliable tests/software and avoid the gate restarts17:02
fungiotherwiseguy: one of the risks of holding a working event two blocks from a brewery i guess17:02
EmilienMhello infra, can I get +A on puppet-placement addition in project-config, thanks https://review.openstack.org/#/c/602869/17:02
clarkbwe tend to see it happen in waves. Gate will be incredibly reliable then degrade people make changes and not fix tests/software17:02
clarkbeventually it gets broken enough that people get annoyed and then fix it17:03
clarkbunfortunately without the likes of mtreinish sdague jogo and mriedem curating things they tend to be unhappy more often than in the past17:03
otherwiseguyclarkb, but it is so easy to just type 'recheck'! :D17:04
clarkball that to say if this bugs you, the best way to help is to start identifying bugs, http://status.openstack.org/elastic-recheck/data/integrated_gate.html, then fixing them http://status.openstack.org/elastic-recheck/gate.html17:04
otherwiseguyi mean, it'll probably eventually work, right? :p17:04
jrollclarkb: short version is, they want to run more localized glance servers to reduce bandwidth usage... there's a few different options. good point on nodepool, that might actually be reasonable. definitely more reasonable than building python apps that sync data between mysql instances. :)17:05
clarkbotherwiseguy: we'ev actually found cases where the bug was clearly in the change that was rechecked 50 times before it merged and it only passed the 1/50% of the time17:05
clarkbotherwiseguy: :/17:05
otherwiseguyclarkb, I don't doubt that.17:05
clarkbjroll: ya its not perfect and you have to accept some skew, but it will aggressively do its best to make sure every cloud it knows about has the images it knows about :)17:05
jrollclarkb: yeah, anything we do here is going to be eventually consistent. thanks, I'll add that to... something17:06
*** anteaya has joined #openstack-infra17:06
clarkbjroll: we've also got the ansible cloud launcher stuff which will upload images without thinking about how they were built17:07
clarkbit is far less aggressive thoug17:07
jrollyep17:07
clarkblogan-: everyone is excited to get to work after the PTG I guess17:07
*** trown is now known as trown|lunch17:09
openstackgerritMerged openstack-infra/git-review master: Use new %topic=XXXX syntax for topic pushes  https://review.openstack.org/60125117:09
openstackgerritMonty Taylor proposed openstack-infra/nodepool master: Add support for async tasks  https://review.openstack.org/60385017:11
clarkbthere have been ~16 unittest failurse in the integrated openstack gate over the last ~day17:13
clarkbmriedem: looks like the majority of them may be nova changes? is that known?17:14
*** zhangfei has joined #openstack-infra17:18
prometheanfiregate just busy today?17:20
imacdonn^^ :)17:20
imacdonnclarkb: attempting to digest above ... I guess I kinda/sorta get it, but would probably need to understand more about the internals of zuul than I have capacity for today ;)17:21
mordredimacdonn: digesting zuul internals one chunk at a time is recommended ;)17:22
imacdonn:)17:23
clarkbimacdonn: the short version of it is if there is a test failure we have to restart everything without the failing change in consideration for merging17:23
clarkbimacdonn: this process is optimized for systems that have working test suites and right now we don't appear to have that17:24
imacdonnclarkb: does that apply across projects .. e.g. if a nova job fails, does glance have to start over ?17:24
clarkbimacdonn: it applies to all projects in the same gate queue. In this case nova cinder glance keystone swift neutron (and probably a couple others) share a queue17:24
imacdonnclarkb: OK, that explains some of what I'm seeing (affecting both cinder and glance)17:26
clarkbimacdonn: all of the tripleo projects are in a separate shared queue, and so on17:26
imacdonnright17:26
*** sambetts is now known as sambetts|afk17:27
prometheanfireand project-config :P17:27
*** jpena is now known as jpena|off17:30
clarkblooks like glance unittests just timed out on inap. I don't see anything pointing at the test node. I looks like the glance tests just stop17:33
clarkbI wonder if we are going to have to accept this type of flakyness as we ramp up the python3 efforts17:33
clarkbdhellmann: ^ we might try to track this as a metric (even if its just success rate of python3X tests vs python27 tests?)17:33
*** jamesmcarthur has quit IRC17:34
dhellmannclarkb : when I find some time I was going to try to pull some stats about the number of rechecks needed to land the patches for the migration17:34
openstackgerritMerged openstack-infra/git-review master: Fix compare_review's use of fetch_review  https://review.openstack.org/53235917:34
dhellmannif teams find the new 3.6 tests to be more flakey than the existing 3.5 tests then that seems like an indication of a potential issue with the language version17:36
clarkbfungi: ^ tagging time?17:36
fungiyep, was just double-checking that all the changes we approved have merged17:36
clarkbdhellmann: this glance failure was python35 and nova seems to have a few python35 unittest failures too. I think in some cases the set of tests we ran under python3 was constrained and now we are constraining less ( I could be wrong about that though )17:36
dhellmannthat did used to be true for many teams. it may have changed recently.17:37
openstackgerritSorin Sbarnea proposed openstack-infra/git-review master: Clean up vestigal scripting in cmd.py  https://review.openstack.org/56729717:39
clarkbreminds me a lot of gate behavior around feature freeze fwiw17:39
clarkboh this job that timed out against glance is a functional python35 job not unittests17:40
clarkbwhich is likely new17:40
openstackgerritSorin Sbarnea proposed openstack-infra/git-review master: Allow choosing which field to use as author when naming branch  https://review.openstack.org/44457417:42
fungiclarkb: ssbarnea: okay, i think i'm ready to tag git-review 185fb8d (current master branch tip) as 1.27.0... any last-minute objections?17:43
*** aidin has joined #openstack-infra17:45
*** zhangfei has quit IRC17:45
clarkbfungi: not from me17:48
*** aidin has quit IRC17:49
ssbarneaclarkb: there are other changes pending but this should not stop you from doing it, go! we have no reason not to make another release in one week.17:49
*** Tim_ok has joined #openstack-infra17:50
fungissbarnea: agreed, i mostly want to stop holding up the gerrit community from making a new release. it's very generous that they wanted to block on modernization in git-review17:50
ssbarneai am always running git-review from master branch, so I doubt we would break the world.17:51
mriedemclarkb: like this? http://logs.openstack.org/72/600372/1/gate/openstack-tox-py35/dcfd363/testr_results.html.gz17:51
clarkbmriedem: ya that was one of the changes I found via e-r uncategorized17:52
mriedemi've seen those tests fail in weird ways before, but not in a rash17:52
clarkbmriedem: there are a whole back in the last day, day and a half that seem to be nova changes in elastic-recheck uncategorized list for the gate17:53
openstackgerritMerged openstack-infra/system-config master: Create the OpenStack discussion mailing list  https://review.openstack.org/60278117:53
clarkb(some are not noav too, but majority appaer to be nova)17:53
mriedemmy guess would be eventlet something or other17:54
mriedemhttp://logstash.openstack.org/#dashboard/file/logstash.json?query=message%3A%5C%22sqlalchemy.exc.ResourceClosedError%3A%20This%20result%20object%20does%20not%20return%20rows.%20It%20has%20been%20closed%20automatically.%5C%22%20AND%20tags%3A%5C%22console%5C%22&from=7d doesn't hit much17:55
roman_gHello all. With Zuul queue being so long, I'd like to ask the following question: is there a "fast lane" queue in Zuul to run after merge? "Post" queue is quite slow. All I need is to add a tag with commit id to existing docker image (get commit id->pull image->add tag->push new tag), and I would expect it to happen as fast as possible right after merge, so that there is always an image in17:56
roman_gcontainer repository with a tag corresponding to the latest git commit id.17:56
clarkbmriedem: http://logs.openstack.org/03/602403/1/gate/openstack-tox-py35/b4c9214/testr_results.html.gz is another nova failure17:56
mriedemyeah there is a IndexError: tuple index out of range in there17:57
mriedembut i don't know where that is coming from17:57
AJaegerclarkb, fungi, we also have less capacity right now with with OVH BHS down - missing 159 nodes - that explains part of the backlog17:58
clarkbroman_g: the current prioritization is gate,release|tag > check > post,periodic17:59
roman_g"pull image->add tag->push new tag" could also be replaced with just API call, I've just checked docs17:59
roman_gclarkb: thank you.18:00
clarkbroman_g: you can potentially run that as a null nodeset job then, which won't need nodes to be assigned which will allow it to run quickly18:00
clarkbroman_g: the gotcha is you have to operate in a very constrained environment where basically all you can do is talk to external services18:00
roman_gclarkb: oh, that's cool. I didn't think it's possible.18:01
clarkbyou cannot install additional software, but python should be there for you to talk http for example18:01
clarkbalso you may have to use native ansible? we cannot shell on localhost or can we?18:01
clarkbdpawlik: I assume we shouldn't enable ovh again?18:02
roman_gclarkb: so my usecase fits pretty well then: get latest commit id, and make api call (Ansible works perfectly).18:02
clarkbdpawlik: specifically bhs1?18:02
clarkbroman_g: ya that should work18:02
AJaegerconfig-core, could you review https://review.openstack.org/#/c/603282/ and https://review.openstack.org/603199 (will recheck once 603282 is in), please?18:02
roman_gclarkb: which queue would you recommend then? release?18:02
clarkbmriedem: http://logs.openstack.org/94/603194/1/gate/openstack-tox-py35/e37d161/testr_results.html.gz is another db looking failure like nova's but in cinder18:02
fungiclarkb: ssbarnea: https://pypi.org/project/git-review/ now shows 1.27.0 as current18:03
fungii'll follow up to the ml thread in a sec18:03
roman_gclarkb: or still "post", but without nodes?18:03
clarkbroman_g: post, the precedence is for node assignments, because there are no node assignments I think you can run immediately18:03
clarkbroman_g: yup exactly that18:03
roman_gclarkb: cool. Thank you!18:04
ssbarnealets hope we don't have to unrelease it, previous one was like 10 months ago,...18:04
clarkbssbarnea: chances are we will roll forward18:04
clarkbrather than delete a release18:04
clarkbroman_g: if you get something working it may be worth sharing as part of the zuul-jobs roles/jobs if generally applicable18:04
mriedemclarkb: yeah those all use the same oslo.db opportunistic db test fixtures for walking the schema migrations18:05
mriedemi seem to remember years ago we had to bump the timeout on those tests,18:05
mriedemmaybe we need to do that again18:05
clarkbroman_g: other things to note if you don't specify a nodeset you get the default nodeset. So you have to actually specify a nodeset that has no nodes in it. Also you have to explicitly use localhost and not all in the ansible to run on localhost18:05
roman_gclarkb: how can I disable node assigned for a specific job? I think by default all my jobs are run in a nodeset18:06
ssbarneaclarkb: this only if the fix is quick, hiding from pypi should be first action if serious bug is found. had to do this only twice so far with jira library.18:06
clarkbroman_g: let me find an example18:06
roman_gclarkb: -job: ... nodeset: localhost ??18:06
mriedemclarkb: yeah https://review.openstack.org/#/c/370805/18:08
*** trown|lunch is now known as trown18:08
clarkbroman_g: https://git.openstack.org/cgit/openstack-infra/system-config/tree/.zuul.yaml#n211 like that18:08
clarkbmriedem: is that due to accumulation of migrations? we don't roll them up anymore I guess?18:09
mriedemmight be part of it,18:09
mriedemand just slower nodes in general b/c of spectre/meltdown patces18:09
mriedem*patches18:09
clarkbnewer kernels do seem to help with that at least18:10
clarkb(maybe this is the motivation to move to bionic more quickly18:10
clarkbat least with infra's testing on rax we got much better performance with latest xenial HWE kernel compared to older HWE kernels18:11
logan-i wonder how much difference it would make to run hwe kernels on our nodepool hvs18:11
roman_gclarkb: thank you!!18:13
fungilogan-: we're happy to help figure out what the performance difference looks like if we can18:14
fungimapping builds back to hypervisor hosts and coming up with baseline performance numbers is nontrivial though18:14
clarkbgraphite has job timing data per cloud18:15
clarkbwe'd probably just look at that for something like nova unittests18:15
clarkband see if we notice a change18:15
fungiit does, but yeah depends on whether nova unittests have a consistent performance profile18:15
clarkbthey should be pretty consistent on specific clouds I would expect18:16
fungiand filter by success only obviously18:16
clarkbdevstack/tempest jobs are likely to be better measures of meltdown performance impact though18:16
fungion the guest side or on the host side?18:16
fungior both?18:17
clarkbguest side for sure since there are plenty of syscalls on the guest kernel18:17
clarkbI'm not sure how that maps into the ways kvm was affected18:17
mriedemclarkb: well https://bugs.launchpad.net/cinder/+bug/179336418:17
openstackLaunchpad bug 1793364 in OpenStack Compute (nova) "mysql db opportunistic unit tests timing out intermittently in the gate" [High,Confirmed]18:17
logan-btw clarkb, I think you were right that the limestone mirror issue was due to a full disk. It seems like the ELK stuff we deployed last week was eating too much local storage. we are moving it to rbd volumes. i also noticed several nova failures you just posted were on limestone nodes, and in general I see a spike in failures from the 15th thru the 18th on limestone18:17
logan-https://i.imgur.com/B5Zwj4a.jpg18:17
mriedemand it's not a simple timeout,18:17
logan-(gist is here: https://gist.github.com/logan2211/76e7a86fccb04a4db9de0ba96fb83f4e)18:18
mriedemhere is a failure18:18
mriedemnova.tests.unit.db.test_migrations.TestNovaMigrationsMySQL.test_models_sync [664.512994s] ... FAILED18:18
mriedemfrom a passing run:18:18
mriedemnova.tests.unit.db.test_migrations.TestNovaMigrationsMySQL.test_models_sync [39.644814s] ... ok18:18
clarkblogan-: interesting18:18
clarkbmriedem: woah18:18
mriedemso we're clearly losing a context switch or something with eventlet18:18
fungithat's going to be fun to track down :/18:18
clarkblogan-: I guess that is because our disks are thin provisioned so even though our mirror wasn't using much disk on the hv the hv ran out disk from noisy neighbor ELK stack?18:19
logan-yeah18:20
clarkbmakes sense18:20
* clarkb upgrades git-review to be new release tester18:20
fungibleeding edge!18:20
clarkbI need to step away from the computer for a few minutes, but when I get back I'll draft up a dev list status update on why things are slow (BHS1 maintenance, flaky tests, etc)18:21
clarkbhopefully that will help people understand what is going on18:21
fungithanks clarkb!18:22
fungii'm about to context-switch to configuring the future of our mailing lists following the next ansible pulse18:22
fungiand then i should be able to send out the announcement about that plan18:22
yumiriamsmcginnis: hi, i submitted this patch: https://review.openstack.org/#/c/599720, and i investigated why lvm lio job was failing18:23
ssbarneafungi: Thanks for git-review work! maybe it would be a good idea to add two more cores to git-review, so you would not become overloaded (maybe even documenting them somewhere so people will know who to add as reviewer, i am sure that not everyone knows how to dig gerrit ACL in order to find people that can review).18:25
*** ykarel has quit IRC18:26
yumiriamsmcginnis: i think i figured out what was causing the problem, i'll have to change the tempest_roles in tempest configuration, could you help me to do it?18:26
*** anteaya has quit IRC18:27
*** zhangfei has joined #openstack-infra18:32
fungissbarnea: yes, i agree. we should have a talk with the infra ptl about it when he's not quite so busy ;)18:33
mriedemclarkb: welp, i don't have any good ideas on how to fingerprint this in e-r18:33
mriedemthe indexerror is probably the killer18:35
mriedembut i don't know where it starts18:35
mriedemand we can't do multi-line fingerprinting for context18:35
clarkbmriedem: if it is a traceback the whole traceback should be indexed as one event ?18:36
mriedemonly in screen logs18:36
mriedemnot console18:36
*** ijw has joined #openstack-infra18:36
clarkbah18:36
mriedemmy guess is the indexerror comes from reading a buffer in eventlet18:38
mriedemthat's in the stacktrace18:38
clarkbglance python2 functional job just reset the whole gate so not python3 specific there18:39
*** ijw has quit IRC18:42
*** zhangfei has quit IRC18:45
*** vkmc is now known as vkmc|afk18:49
roman_gQuestion: where can I see logs for zuul "post" jobs being published on merge?18:57
mriedemclarkb: http://logstash.openstack.org/#dashboard/file/logstash.json?query=message%3A%5C%22%20%20connection.scalar(select(%5B1%5D))'%5C%22%20AND%20tags%3A%5C%22console%5C%22&from=7d18:57
mriedemthat will surely get our categorization numbers up18:57
mriedemcount per 1h | (1890 hits)18:57
clarkbroman_g: easiset way to find them is from the builds tab of the zuul status page18:57
clarkbmriedem: that is a lot of hits18:57
clarkbmriedem: I'm writing email to the dev list about helping with e-r and fixing some of these bugs18:57
clarkbmriedem: I'll put it on etherpad if you are interested in reading it first18:57
mriedemheh to make sure you are PC enough? :P18:58
mriedemWWMD18:58
clarkbmriedem: more so that you don't feel I've thrown you to the wolves :P18:58
mriedemare there wolves?18:58
mriedemi remember sending several "the gate is melting and it's ALL YOUR FAULT" emails over the years18:58
mriedembut like you said, in the old days with mtreinish and jogo18:59
*** ijw has joined #openstack-infra18:59
*** jamesmcarthur has joined #openstack-infra18:59
mriedemby all means though, raise the alarm18:59
clarkbmriedem: "If you'd like to help let mriedem or myself know and we'll gladly work with you to get elasticsearch queries added to elastic-recheck. We are likely less help when it comes to fixing functional tests in Glance, but I'm happy to point people in the right direction for that as much as I can.19:01
clarkber that was meant to have an end quote19:01
clarkbbut thats the tldr of throwing you to the wolves :)19:01
openstackgerritMatt Riedemann proposed openstack-infra/elastic-recheck master: Add query for mysql opportunistic test bug 1793364  https://review.openstack.org/60387419:02
openstackbug 1793364 in OpenStack Compute (nova) "mysql db opportunistic unit tests timing out intermittently in the gate (bad thread switch?)" [High,Confirmed] https://launchpad.net/bugs/179336419:02
roman_gclarkb: thanks. Actually I found it easier to find logs by filtering on Builds page.19:02
roman_g*filtering by job name19:02
mriedemclarkb: e-r review wolves, sure19:02
mriedemi'm down19:02
openstackgerritMatt Riedemann proposed openstack-infra/elastic-recheck master: convert docs to PTI  https://review.openstack.org/55939619:04
AJaegermriedem: your change is not needed - but there's more to fix ^19:07
AJaegermriedem: will you take care of it?19:07
mriedemAJaeger: it's not my change,19:09
mriedemi'm just rebasing it19:09
mriedemAJaeger: if you want it, it's yours19:09
AJaeger;)19:10
mriedemclarkb: is this a known issue? http://logs.openstack.org/17/595317/1/gate/build-openstack-sphinx-docs/b8849f2/job-output.txt.gz#_2018-09-18_18_33_45_35349319:12
mriedemhttp://logstash.openstack.org/#dashboard/file/logstash.json?query=message%3A%5C%22rsync%20error%3A%20unexplained%20error%20(code%20255)%20at%5C%22&from=7d19:12
clarkbmriedem: I think that is a limestone node (based on ipv6 usage) and I think logan- said he was fixing some stuff there?19:13
openstackgerritAndreas Jaeger proposed openstack-infra/elastic-recheck master: convert docs to PTI  https://review.openstack.org/55939619:13
clarkbmriedem: it is odd that it timed out then failed to reconnect19:13
clarkblogan-: ^19:13
AJaegermriedem: done ^19:13
mriedemdefinitely hitting most on limestone-regionone19:14
mriedemAJaeger: thanks19:14
mriedemclarkb: is there a bug for that?19:14
clarkbmriedem: I don't think so19:14
clarkbmriedem: I think we only just started to get a handle on it a few minutes ago in scrollback (about an hour ago)19:15
logan-ya after I found the ELK disk usage stuff yesterday I moved some things around to resolve it. today looks much smoother: https://i.imgur.com/B5Zwj4a.jpg19:15
clarkbmriedem: logan- we can stick a query in e-r to make sure it has been resolved19:15
mriedemi've got it19:16
mriedemhttps://bugs.launchpad.net/openstack-gate/+bug/179337019:16
openstackLaunchpad bug 1793370 in OpenStack-Gate ""Collect sphinx build html" fails with "rsync error: unexplained error (code 255) at io.c(226) [Receiver=3.1.1]" on limestone nodes" [Undecided,New]19:16
openstackgerritMatt Riedemann proposed openstack-infra/elastic-recheck master: Add query for ansible ssh rsync fail bug 1793370  https://review.openstack.org/60387819:18
openstackbug 1793370 in OpenStack-Gate ""Collect sphinx build html" fails with "rsync error: unexplained error (code 255) at io.c(226) [Receiver=3.1.1]" on limestone nodes" [Undecided,New] https://launchpad.net/bugs/179337019:18
AJaegerdpawlik: how is OVH BHS coming along? You mentioned some time ago 1 or 2 h which is long over - is it more complicated or are you done?19:24
*** gema has joined #openstack-infra19:32
clarkbAJaeger: my bhs1 vm just stopped responding then started again. Reading between the lines I would assume that was a live migration related to updates19:32
clarkbAJaeger: possibly still in progress given that19:33
AJaegerclarkb: dpawlik's ask for 1 or 2 hours was 6 hours ago, so that's why I was asking. Yes, might be they had some surprises ;(19:36
* AJaeger signs off for today19:37
fungii hear upgrading openstack isn't simple19:38
clarkbfungi: they are doing the fun route of jumping multiple versions too aiui19:39
fungidoesn't help that they seem to be going from juno to newton19:39
fungiyeah, that19:39
fungiwell, i think i have the initial openstack-discuss@l.o.o configuration done. i guess it's time to write up the announcement about it19:40
fungiwe said we should subscribe the old lists to the new list around october 24 (two weeks before the summit) and then turn off the old lists around november 21 (the week after the summit) right?19:44
clarkbssbarnea: fungi as far as more core reviewers I'm happy to add them. I think we should be careful to get fixes like https://review.openstack.org/#/c/428700/6 in over adding new features like https://review.openstack.org/#/c/195043/19:44
clarkbwe've made git-review fairly stable and it doesn't break often for existing users (but as in the case of that one bug fix may not work for new users)19:44
*** anteaya has joined #openstack-infra19:45
clarkband from there maybe make the test suite a bit more robust and easier/quicker to run. THen consider features?19:45
fungitesting it against newer gerrit was also suggested as something worth prioritizing, i think19:46
*** roman_g has quit IRC19:46
*** zhangfei has joined #openstack-infra19:46
clarkb++19:47
clarkbbasically I'd like to see is get our feet wet again on the project by fixing bugs and building confidence in testing19:47
*** roman_g has joined #openstack-infra19:47
clarkbif we end up in a good spot for that then consider features again. Since I think part of why we stopped doing feature work was we broke too many people too often in the past19:48
fungiwell, that coupled with not having any appreciable testing19:51
clarkbssbarnea: are you volunteering ? :)19:51
clarkbzxiiro may be interested as well as electrofelix?19:51
zxiirohappy to help if I'm needed. We depend extensively on git-review so makes sense that we spend some cycles helping out there.19:53
fungimy only real reservation about adding more core reviewers is to make sure we share the opinion that adding more features to git-review needs to meet a very high bar these days to avoid scope creep (and that there are even places where we might do well to deprecate/remove some existing features)19:56
*** pfallenop has joined #openstack-infra19:56
fungikeeping focus on it doing a specific set of tasks very well, and blending with use of other common posix/unix command-line utilities via pipes and so on19:58
fungiand considering that some sorts of possible features may be better implemented as separate git subcommands rather than more git-review options19:59
*** zhangfei has quit IRC19:59
clarkbor `use gertty`19:59
clarkbwhich is excellent software for doing all your gerrit activites on the command line19:59
fungifor fancy use cases, i definitely feel like more robust gerrit clients are a better option20:00
fungigit-review is intended as a streamlined tool for pushing one or a series of commits to a gerrit for review20:00
notmynameclarkb: I just got back from lunch and say your email about the zuul issues. it's fantastic. thanks20:00
clarkbnotmyname: I'm glad someone found it useful :)20:02
mriedemclarkb: this should help https://review.openstack.org/#/c/603900/20:06
mriedemshould have been done weeks ago20:06
clarkbmriedem: thanks20:06
notmynameclarkb: with a quick glance at the elastic recheck page, I see one with "swift" in the name. it happened once and two days ago and has since passed. is there a way I can categorize it as "doesn't matter" or transient or resolved or soemthing?20:06
mriedemnotmyname: is the bug marked as invalid?20:07
mriedemor which one on http://status.openstack.org/elastic-recheck/index.html ?20:07
mriedemif http://status.openstack.org/elastic-recheck/data/integrated_gate.html#cross-swift-py3520:08
mriedemit doesn't matter, ignore20:08
notmynameya, that second one20:08
mriedemhttp://status.openstack.org/elastic-recheck/index.html is really the "oh shit stuff is on fire"20:08
notmynameah, ok20:08
mriedemhttp://status.openstack.org/elastic-recheck/data/integrated_gate.html is what we haven't fingerprinted yet20:08
mriedemOverall Categorization Rate: 15.2%  is terrible,20:08
mriedemthat should be closer to 50%20:08
clarkbya the uncategorized page is more of a "hey we should look at things if that categorization rate is hihg"20:08
mriedemmeaning, the gate is failing for reasons we aren't tracking20:08
clarkband start at the top since those have the most occurences20:09
mriedemright20:09
clarkbnotmyname: thank you for checking though :)20:09
mriedemthe categorization rate should bump up within the hour20:09
mriedemand then we can do another pass20:09
notmynameah, ok. as a quick heuristic, I'd just searched for "swift" since that's something I can be sortof useful with. the one instance is some issue where zuul couldn't install python (or pyyaml). and it's passed since then. so... ignore it20:10
*** jamesmcarthur has quit IRC20:10
clarkbnotmyname: ya if its a one off you can ignore it. e-r is largely oriented around identifying and tracking persistent issues20:10
clarkbbut it may also tell us we have no persistent issues and they are all one offs20:10
clarkband then we have more work to do probably :)20:10
fungiwell, we have more work to do regardless... just a question of what we'll be working on ;)20:12
clarkbone thing we might want to add to the uncategorized page is the project that was tested when it failed20:13
notmynameoh wait!20:13
clarkbsince we moved to the generic job names with zuulv320:13
notmynameclarkb: mriedem: the one thing I saw is in fact the same (or similar to a tracked issue on the e-r page. how do I associate it?20:13
clarkbnotmyname: you can update the query in elastic-recheck/queries/$bugnumber.yaml to include matches for your issue20:14
notmyname"cross-swift-py35 : 1 Uncategorized Fails. 0.0% Classification Rate (1 Total Fails)"  --->> https://bugs.launchpad.net/openstack-gate/+bug/144913620:14
openstackLaunchpad bug 1449136 in OpenStack-Gate "Pip fails to find distribution for package" [Undecided,New]20:14
clarkbnotmyname: to test that go to logstash.openstack.org and plug in the existing query to get the existing result. THen modify as necessary to include the other results20:14
mriedemyeah http://status.openstack.org/elastic-recheck/#1449136 has a logstash link to take you to the existing query directly20:14
mriedemand then you can add an OR to the query20:15
mriedemto see if you get more hits20:15
clarkbthis bit is the most hand wavy of the lot (you kind of have to figure out lucene's query syntax) but if you point me at a log line that should match and the existing bug I can try to help come up with some thing20:15
notmynameclarkb: http://logs.openstack.org/83/602183/1/gate/cross-swift-py35/aa80654/job-output.txt.gz#_2018-09-18_08_35_12_68814120:15
clarkbnotmyname: ah yup (thats a known issue we think we fixed in limestone, but lets do this for tracking and learning purposes)20:16
*** pfallenop has quit IRC20:16
prometheanfirenotmyname: sup20:16
*** pfallenop has joined #openstack-infra20:16
prometheanfirerecheck'd blaming clarkb :D20:16
clarkbnotmyname: http://status.openstack.org/elastic-recheck/gate.html#1449136 is the existing bug entry on e-r. if you click the little logstash link under that it opens up the logstash ui with that query20:17
notmynameyeah. I'm there20:18
notmynametrying to figure out why it doesn't already match20:18
clarkbnotmyname: I think because in your example there is the serialized newling so its \nNo matching20:18
clarkband lucene treats that as a different token than No20:19
clarkbSo we can update the query to be (message:"No matching distribution found for" OR message:"\nNo matching distribution found for") rest of query here20:19
notmynameyep. running it now20:19
clarkbyup that seems to work. So last step is to update the query in https://git.openstack.org/cgit/openstack-infra/elastic-recheck/tree/queries/1449136.yaml to include that new bit20:20
clarkbnotmyname: (message:"No matching distribution found for" OR message:"\nNo matching distribution found for") AND tags:console AND voting:1 AND build_queue:gate <- is my query fwiw20:21
clarkbthe voting:1 and build_queue:gate stuff is added by elastic-rechech which is why you won't see it in the query file I linked above20:21
notmynameoh, I ddin't have the buil.... ah yes20:21
clarkb(you don't have to add that thwn you edit the file in the elastic-recheck repo)20:21
notmynameyeah, the added message went from 92 matches to ... a lot more than that20:22
clarkbindeed20:22
clarkb(there were issues with the mirror in limestone crashing, we think since fixed. But updating the query will give us a good level of detail into whether or not that is the case20:22
fungidhellmann: we said we should subscribe the old lists to the new -discuss list around october 24 (two weeks before the summit) and then turn off the old lists around november 21 (the week after the summit) right? or were we looking for tighter timing on that?20:23
dhellmannfungi : I don't remember the dates. Did we have notes on that in the etherpad?20:24
clarkbyes I tried to write those notes down on the etherpad20:25
fungimm, i guess that would have been the infra pad. checking20:25
dhellmannI think the specific dates you give make sense and mesh with the rougher descriptions I'm seeing in the etherpad20:26
dhellmanna tighter period would work for me, too, but I know there was a lot of concern about giving folks time for the transition20:26
fungiyeah, the etherpad just says "cut over First week of December"20:27
dhellmannit might be confusing in a different way to have too long of a period for the transition20:27
*** armax has quit IRC20:27
clarkbI think the concern with week after summit is US thanksgiving20:27
openstackgerritJohn Dickinson proposed openstack-infra/elastic-recheck master: update query 1449136 to match some more queries  https://review.openstack.org/60390520:27
notmynameclarkb: ^^20:27
clarkb(I personally don't have the holiday concern that others had)20:28
zxiiroclarkb: does gertty support the latest version of Gerrit yet? last time I tried it didn't work for the Gerrit versions LF deploys.20:28
fungiso maybe we resolved to subscribe the new list to the old lists around the summit and then disable them during the first week of december?20:28
clarkbzxiiro: unsure, I don't actually use gertty20:28
clarkbzxiiro: I imagine corvus would like to fix those problems though20:28
fungizxiiro: when did you last try?20:28
dhellmannfungi : yeah, that seems to match the etherpad better20:28
zxiiroah ok, yeah it sounds like a cool tool but it didn't work on our Gerrit systems last time I tried.20:29
zxiirofungi: I tried at the Vancouver Summit.20:29
zxiiroso a few months ago now...20:29
clarkbnotmyname: looks great, I will let mriedem double check things as I don't review these as often as he does20:29
clarkbmriedem: can you review https://review.openstack.org/60390520:29
notmynamehere's hoping it helps find some major blockers :-)20:29
fungizxiiro: yeah, then it's possibly still an issue worth investigating20:29
*** jtomasek has quit IRC20:29
*** jamesmcarthur has joined #openstack-infra20:30
mriedemlooking20:30
fungidhellmann: clarkb: okay, so more like subscribe the new list to the old lists around november 19 (monday after summit) and disable old lists december 7 (first friday of the month)?20:31
clarkbfungi: that wfm20:31
fungior we could disable on december 2 (first monday)20:31
clarkbactually I think I prefer switching on monday20:32
fungier, i guess that's december 320:32
clarkbpeople more likely to take it as a todo to address that20:32
*** kgiusti has left #openstack-infra20:32
clarkbwhereas on friday you leave it until next week and forget and blah20:32
fungiokay, so phase 1 now to nov 19, phase 2 nov 19 to dec 3, phase 3 begins dec 320:33
dhellmannwfm20:33
fungithat's 2 weeks in phase 220:33
mriedemnotmyname: clarkb: is it just me, or does that bring the hits for that query up from ~62 to ~3741 in 7 days?20:33
dhellmannwith an email announcing all of this ~soon20:33
clarkbmriedem: ya there was a limestone sadness, it may match multiples per job though20:34
fungidhellmann: yes, ~soon is this afternoon or tomorrow morning (i already have the new ml configured)20:34
clarkbmriedem: our mirror was off in limestone for several hours monday and several hours tuesday :(20:34
dhellmannperfect20:34
fungijust making sure i know what dates to communicate20:34
clarkbmriedem: we think we've fixed that, but we can add the query to double check20:34
mriedemok, +W20:34
*** hemna_ has quit IRC20:36
*** priteau has quit IRC20:39
*** anteaya has quit IRC20:43
clarkbfyi I have approved https://review.openstack.org/#/c/603766/1 to enable bhs1 again20:44
clarkbthank you amorin20:44
*** ansmith has quit IRC20:45
*** dklyle has quit IRC20:47
openstackgerritMerged openstack-infra/project-config master: Revert "Revert "Revert "OVH BHS1 Maintenance" - 2018-09-19 1200UTC""  https://review.openstack.org/60376620:53
clarkbI think ^ will apply in about half an hour20:57
*** trown is now known as trown|outtypewww21:04
fungiheaded to grab an early dinner, but should be back soonish21:16
*** hashar has quit IRC21:23
tbarronhmm, I do '/msg chanserv topic #openstack-manila blah blah' and get 'You are not authorized to perform this operation'.  I think this was working, or am I forgetting something obvious?21:23
clarkbtbarron: our channels are +t so only channels ops can set the topic21:24
*** jamesmcarthur has quit IRC21:24
clarkbtbarron: you are listed as having ops in the channel according to chanserv so you should be able to op up, set the topic, then deop21:25
*** armax has joined #openstack-infra21:25
clarkbtbarron: /msg chanserv op #openstack-manila tbarron21:25
clarkbthen /msg chanserv deop #openstack-manila tbarron when done iirc21:26
*** jamesmcarthur has joined #openstack-infra21:26
*** agopi has quit IRC21:28
*** jamesmcarthur has quit IRC21:30
*** armax has quit IRC21:30
tbarronclarkb: thanks but it tells me I'm not authorized when I attempt to op up21:31
clarkbtbarron: are you identified with nickserv?21:32
tbarronclarkb: yes21:32
tbarronclarkb: will double check, but I logged in as usual with pass etc.21:32
clarkbnickserv says you are not logged in21:33
clarkb/msg nickserv acc tbarron21:33
clarkbthe 1 means not logged in21:33
clarkbwe have reenabled bhs1 now. I am going to watch it for a bit21:35
*** bdodd has quit IRC21:35
*** armax has joined #openstack-infra21:36
tbarronclarkb: hmm, i'll shutoff my client and reconnect, when I connect I have to do /quote PASS tom_barron:mypw everytime21:37
tbarronclarkb: and maybe i need to bounce my bouncer (znc) but its configuration worked the last time I changed the topic, during PTG21:40
*** sthussey has quit IRC21:40
tbarronclarkb: not an emergency clearly :)21:40
clarkbtbarron: you should just need to reidentify with nickserv and not bounce your bouncer21:40
clarkbchances are you were disconnected from freenode for some short period then things didn't renegotiate properly on reconnect (could be due to a netsplit or similar)21:41
clarkbtbarron: /msg nickserv identify $pw21:41
clarkbthen you can op up and set the topic21:41
*** dklyle has joined #openstack-infra21:42
clarkbBHS1 seems to be happy21:45
clarkberrors about no valid host found now that we are near/at quota but I think that was happening before21:45
clarkbdoesn't seem like there were errors prior to that point either which is impressive since we just asked the cloud to boot 150 something instances21:46
clarkbmordred: are you in a place to discuss the zuul CD things I've learned since PTG?21:47
clarkbmordred: I think it would be helpful to run the problems and thoughts on how to address them by someone21:47
mordredclarkb: my brain is pretty toast from today - how about we talk through it as soon as you get up tomorrow and I'll try to not be rabbitholed in weird async loops21:48
clarkbmordred: ok21:48
tbarronclarkb: all works now, thanks21:53
*** owalsh has joined #openstack-infra21:57
ianwtbarron: i use znc too and if you setup sasl the problem disappears21:58
ianwbecause it authenticates before it starts21:58
*** owalsh has quit IRC21:58
tbarronianw: and I do have it setup and authenticate at startup, not sure what went wrong21:59
tbarronianw: sasl21:59
tbarronianw: never had to separately '/msg nickserv identify $pw' before.21:59
tbarronianw: some glitch, will  see if it repeats22:00
ianwyeah, with many parts doesn't take much to go wrong; similarly i thought i had it working and it stopped too, but i think it was config and docker images and overlays etc too22:00
ianwi am sympathetic to the discussions that irc can be a little difficult to get going reliably :)22:01
clarkbianw: is this related to running docker comamnds as root?22:01
clarkbfwiw my setup is weechat in screen in the cheapest ovh VM. Seems to have stayed up during their upgrades so far :)22:01
tbarronianw: it's karma for me not being as sympathetic to that pov as I should have been22:01
*** roman_g has quit IRC22:02
ianwclarkb: not in this case.  but i have learnt some lessons about that contributing a few bits to testinfra22:02
ianwthe tox there runs a bunch of containers22:02
tbarroni was thinking, oh, just put in a bit of effort one time and forget about it afterwards.  And sure enough, I forgot.22:02
ianwand it's not enjoyable when you run "tox" and suddenly your VT switches to an Alpine linux console prompt22:02
ianwhow the heck that happens I don't want to know22:03
clarkbianw: this is one reason we've pushed back on needing root to run tox in openstack land22:07
clarkbturns out you can do all sorts of terrible things this way :(22:07
ianwafter reading a bit, it seems that the reason you need to be root for docker in fedora is that using a "docker" group is really just root by another, much less obvious name22:08
ianwhttps://developer.fedoraproject.org/tools/docker/docker-installation.html22:09
clarkbyup its like being in the kvm/libvirt group22:09
clarkbyou can abuse the daemon to gain root22:09
clarkbthis is actually one of my concerns with dox vs tox22:10
clarkbdox requires you to give the tests root essentially22:10
clarkbwhereas with tox you don't have that and can isolate it under a normal user as long as the tests themselves don't need that access22:10
ianwthat's ok, let's just isolate the docker tests in a VM!22:10
clarkbnotmyname: do you have a link to the ansible + zuul syntax error you had that OSA table helped you debug?22:15
notmynameclarkb: yes... maybe? I know the patch that caused it22:15
clarkbnotmyname: I can find it from there (curious beacuse other users are reporting that syntax errors result in no logs and that did result in logs for you so identifying the cause may be helped along with that additional data)22:16
* prometheanfire throws a line out22:17
prometheanfirehttps://review.openstack.org/#/c/603544/ clarkb ianw ^ ?22:17
ianwclarkb: has tristanC given you a tour of his ML log analysis tool?22:17
notmynameclarkb: patch set 27 in this file: https://review.openstack.org/#/c/601686/27/.zuul.yaml (the list didn't have a space on lines 245 and 246)22:18
clarkbianw: I've gotten some tidbits here and there when we end up in the same physical location22:18
*** gema has quit IRC22:19
clarkbnotmyname: and you got an error message to show OSA? I"m actually not seeing where the errors was reported. Maybe we don't report it like I thought we did22:22
clarkbianw: prometheanfire: isn't the pkgmap just a name lookup? we have to add the package to the package list too?22:22
prometheanfireclarkb: down lower in the pkg map I mask it (default is "")22:23
clarkboh itsalready there just with the not mapped name22:23
notmynameon that one, I'm not sure of what error was reported (or not). corvus also pointed out that run doesn't support a list (yet).22:23
ianwclarkb: it is already listed in https://git.openstack.org/cgit/openstack-infra/project-config/tree/nodepool/elements/infra-package-needs/package-installs.yaml22:23
notmynameclarkb: trying to think if there was a different error we talked to you about22:23
clarkbnotmyname: there was one you went to the OSA table in the bar for and it turned out to be a yaml syntax error or ansible syntax error iirc22:24
clarkbI'm assuming it was in that change so looking for that on other patchsets now22:25
notmynameyeah, would have been the same gerrit change22:26
clarkbnotmyname: found it http://logs.openstack.org/86/601686/32/check/swift-multinode-rolling-upgrade/4ed9236/job-output.txt.gz#_2018-09-14_19_36_25_291563 thanks22:28
notmynameclarkb: good :-)22:29
*** agopi has joined #openstack-infra22:30
openstackgerritMerged openstack-infra/project-config master: Install gentoolkit on Gentoo  https://review.openstack.org/60354422:37
prometheanfirecool, anyone mind kicking a gentoo build?  I can wait as well (D&D tonight)22:40
*** rcernin has joined #openstack-infra22:44
*** Tim_ok has quit IRC22:53
*** tpsilva has quit IRC22:54
*** armax has quit IRC22:59
* fungi doesn't have nearly so exciting an evening planned as to include d&d23:03
fungiprometheanfire: what is it you need done? delete old gentoo images so new ones get built sooner?23:04
*** armax has joined #openstack-infra23:07
*** armax has quit IRC23:08
*** bobh has joined #openstack-infra23:11
*** jamesmcarthur has joined #openstack-infra23:14
prometheanfirefungi: wfm23:15
prometheanfirefungi: how are things on the coast?23:15
*** gfidente has quit IRC23:15
*** ansmith has joined #openstack-infra23:15
fungistill here! happy about that23:16
fungii'll delete the images in a sec23:16
*** jamesmcarthur has quit IRC23:19
*** slaweq has quit IRC23:19
prometheanfirecool23:21
fungiprometheanfire: new image building gentoo-17-0-systemd-0000000877 on nb0123:27
*** dklyle has quit IRC23:29
fungihope your dice remain lucky both on the table and off!23:32
* fungi rolls a save vs food coma and consults the relevant chart23:33
* fungi gets back to composing an e-mail abotu e-mail23:33
prometheanfirecool :D23:36
mnaseryay23:36
mnaserhopefully with ovh back the gate gets churned through overnight23:36
openstackgerritMerged openstack-infra/git-review master: Clean up vestigal scripting in cmd.py  https://review.openstack.org/56729723:43
*** mriedem is now known as mriedem_away23:44
openstackgerritIan Wienand proposed openstack-infra/system-config master: Add notes on manual host configuration runs  https://review.openstack.org/51651023:53
openstackgerritMerged openstack-infra/system-config master: Use zuul-sphinx README.rst detection  https://review.openstack.org/59622523:54

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!