Friday, 2018-07-27

*** AhmadM has quit IRC00:01
pabelangerclarkb: yah, would agree00:01
pabelangerI guess we try restarting zuul services tomorrow?00:03
*** david-lyle has joined #openstack-infra00:03
clarkbya in theory demand will be way down and smcginnis won't be waiting on release jobs00:04
pabelangerwfm00:04
smcginnis++00:05
*** harlowja has joined #openstack-infra00:05
*** dklyle has quit IRC00:06
*** linkmark has quit IRC00:07
*** bobh has quit IRC00:08
*** caphrim007_ has quit IRC00:12
*** dklyle has joined #openstack-infra00:12
*** dklyle_ has joined #openstack-infra00:14
*** david-lyle has quit IRC00:14
*** jamesde__ has quit IRC00:15
*** rcarrill1 has joined #openstack-infra00:15
*** dklyle has quit IRC00:17
*** jamesden_ has joined #openstack-infra00:17
*** rcarrillocruz has quit IRC00:17
*** caphrim007 has joined #openstack-infra00:19
*** gyee has quit IRC00:23
*** caphrim007 has quit IRC00:24
openstackgerritAustin Sun proposed openstack-infra/project-config master: Add gerritbot for StarlingX (openstack/stx-*) projects to #starlingx IRC channel  https://review.openstack.org/58591900:32
*** felipemonteiro has quit IRC00:55
*** slaweq has joined #openstack-infra01:05
*** slaweq has quit IRC01:10
*** harlowja has quit IRC01:17
*** xarses_ has quit IRC01:17
*** bobh has joined #openstack-infra01:46
*** yamahata has quit IRC01:48
*** dklyle has joined #openstack-infra01:49
*** dklyle_ has quit IRC01:50
*** bobh has quit IRC01:52
*** larainema has joined #openstack-infra01:58
openstackgerritMatt Riedemann proposed openstack-infra/elastic-recheck master: Add query for live migration unbound vif bug 1783917  https://review.openstack.org/58638901:59
openstackbug 1783917 in OpenStack Compute (nova) "live migration raises NovaException: Unsupported VIF type unbound convert '_nova_to_osvif_vif_unbound'" [Undecided,New] https://launchpad.net/bugs/178391701:59
*** david-lyle has joined #openstack-infra02:01
*** dingyichen has joined #openstack-infra02:02
*** dklyle has quit IRC02:02
*** david-lyle has quit IRC02:03
*** dklyle_ has joined #openstack-infra02:04
openstackgerritMerged openstack-infra/elastic-recheck master: Add query for live migration unbound vif bug 1783917  https://review.openstack.org/58638902:15
openstackbug 1783917 in OpenStack Compute (nova) "live migration fails with NovaException: Unsupported VIF type unbound convert '_nova_to_osvif_vif_unbound'" [High,Confirmed] https://launchpad.net/bugs/178391702:15
*** gongysh has joined #openstack-infra02:22
*** psachin`` has joined #openstack-infra02:23
*** psachin`` has quit IRC02:33
*** chinna100 has quit IRC02:35
*** myoung has quit IRC02:39
*** caphrim007 has joined #openstack-infra02:45
*** caphrim007 has quit IRC02:50
*** rosmaita has quit IRC02:55
*** armax has quit IRC03:00
*** armax has joined #openstack-infra03:04
*** yamamoto has joined #openstack-infra03:09
*** armax has quit IRC03:12
*** armax has joined #openstack-infra03:16
*** dave-mccowan has quit IRC03:24
*** armax has quit IRC03:29
*** eernst has quit IRC03:29
*** annp has quit IRC03:31
*** xarses_ has joined #openstack-infra03:32
*** annp has joined #openstack-infra03:32
*** links has joined #openstack-infra03:52
*** gongysh has quit IRC03:52
*** yamahata has joined #openstack-infra03:54
*** diablo_rojo has quit IRC03:58
*** mriedem has quit IRC04:04
*** mschuppert has joined #openstack-infra04:06
*** slaweq has joined #openstack-infra04:11
*** slaweq has quit IRC04:16
openstackgerritTobias Henkel proposed openstack-infra/zuul master: Add a dequeue command to zuul client  https://review.openstack.org/9503504:21
*** pcaruana has joined #openstack-infra04:28
*** pcaruana has quit IRC04:30
*** diablo_rojo has joined #openstack-infra04:46
*** gongysh has joined #openstack-infra04:50
*** kjackal has joined #openstack-infra04:58
*** yamamoto has quit IRC04:59
*** yamamoto has joined #openstack-infra05:01
*** diablo_rojo has quit IRC05:01
openstackgerritMerged openstack-infra/zuul master: Add a dequeue command to zuul client  https://review.openstack.org/9503505:03
*** n-saito has joined #openstack-infra05:03
*** slaweq has joined #openstack-infra05:13
*** cshastri has joined #openstack-infra05:14
*** pgadiya has joined #openstack-infra05:16
*** pgadiya has quit IRC05:16
*** Bhujay has joined #openstack-infra05:17
*** slaweq has quit IRC05:17
*** Bhujay has quit IRC05:21
*** kjackal has quit IRC05:39
*** quiquell has joined #openstack-infra05:41
*** diablo_rojo has joined #openstack-infra05:46
*** annp has quit IRC05:51
*** annp has joined #openstack-infra05:52
*** AJaeger has quit IRC05:52
*** zigo_ has joined #openstack-infra05:53
*** zigo has quit IRC05:53
*** AJaeger has joined #openstack-infra05:55
openstackgerritIan Wienand proposed openstack-infra/project-config master: Add trigger-readthedocs-webhook job  https://review.openstack.org/58344906:05
openstackgerritIan Wienand proposed openstack-infra/openstack-zuul-jobs master: Use webhook update for docs-on-readthedocs  https://review.openstack.org/58383406:06
*** hashar has joined #openstack-infra06:15
*** alexchadin has joined #openstack-infra06:15
*** mgoddard has joined #openstack-infra06:41
*** pguimaraes has joined #openstack-infra06:47
*** mgoddard has quit IRC06:50
*** tesseract has joined #openstack-infra06:52
*** kjackal has joined #openstack-infra06:54
*** dhajare has joined #openstack-infra06:56
*** rcernin has quit IRC07:00
*** ccamacho has joined #openstack-infra07:20
*** dtantsur|afk is now known as dtantsur07:21
*** zoli|gone is now known as zoli07:30
*** shardy has joined #openstack-infra07:37
*** jtomasek has joined #openstack-infra07:48
*** jtomasek has quit IRC07:48
*** jtomasek has joined #openstack-infra07:49
*** alexchadin has quit IRC07:52
*** rpittau has quit IRC07:57
*** rpittau has joined #openstack-infra07:57
*** pblaho has joined #openstack-infra07:58
*** dtantsur is now known as dtantsur|bbl08:00
*** alexchadin has joined #openstack-infra08:05
*** jpich has joined #openstack-infra08:06
*** jamesmcarthur has joined #openstack-infra08:06
*** jamesmcarthur has quit IRC08:11
*** mgoddard has joined #openstack-infra08:12
*** roman_g has quit IRC08:13
tinwoodmorning08:15
tinwoodI'm having a problem with a nova-lxd backport to stable/queens; the issue is that although the branch it is proposed to is stable/queens in the nova-lxd project, it is actually testing against the master in the nova project;  I'm sure I just have to twiddle some bits somewhere in the various project configs, but I'm not sure where to start.  The issues are the tests openstack-tox-{py27|py35}.  I tried an override in .zuul.conf, but that didn't work?  Any pointers, please?08:17
AJaegertinwood: have a change up to check log files?08:19
*** bauzas is now known as PapaOurs08:19
tinwoodAJaeger, sure: https://review.openstack.org/#/c/584900/08:19
tinwoodAJaeger, I added openstack-tox-py27-stable-queens as a test; that's coming out; I was trying to see if an override would work at all.08:20
*** e0ne has joined #openstack-infra08:22
kashyapHey folks, I'm hitting a "POST_FAILURE" state for the 'nova-live-migration' CI job (for this patch: https://review.openstack.org/#/c/567258/); is this a Zuul problem?08:22
AJaegertinwood: why do you think it checks out master?08:22
tinwoodAJaeger, I'll find the relevant line I think is doing it:08:22
AJaegerkashyap: check the log files, please08:22
kashyapWhen I look in the log (http://logs.openstack.org/58/567258/10/check/nova-live-migration/1998129/job-output.txt.gz)08:22
kashyap2018-07-26 18:18:54.522168 | primary | ERROR08:22
kashyap2018-07-26 18:18:54.541519 | primary | {08:22
kashyap2018-07-26 18:18:54.541725 | primary |   "msg": "SSH Error: data could not be sent to remote host \"147.75.38.160\". Make sure this host can be reached over ssh",08:22
*** diablo_rojo has quit IRC08:22
kashyap2018-07-26 18:18:54.541843 | primary |   "unreachable": true08:22
kashyap2018-07-26 18:18:54.541953 | primary | }08:23
kashyapAJaeger: Yeah, the above is what the job-output seems to say08:23
AJaegerkashyap: please no multi-line pastes - you can link to that line...08:23
tinwoodAJaeger, in the py27 tox run:08:23
tinwood2018-07-26 16:17:15.870344 | ubuntu-xenial |     from microversion_parse import middleware as mp_middleware08:23
tinwood2018-07-26 16:17:15.870445 | ubuntu-xenial | ImportError: cannot import name middleware08:23
AJaegerkashyap: could have been a cloud problem, I would recheck08:23
*** derekh has joined #openstack-infra08:23
kashyapAJaeger: Ah, I too am averse to multi-line paste.  But anything around 5 lines or less OK (if done only once :-))08:24
tinwoodAJaeger, and I think that middleware appeared in master; it doesn't appear to be in stable/queens. (or I'm not checking correctly).08:24
kashyapAJaeger: Thanks!  I'll do the 'recheck'08:24
AJaegertinwood: http://logs.openstack.org/00/584900/5/check/openstack-tox-py27/a9cc494/job-output.txt.gz#_2018-07-26_16_12_23_456162 has stable/queens in it08:24
tinwoodAJaeger, yes, for nova-lxd.  But "nova" is still master; so it fails (I think!)08:25
AJaegertinwood: ah, nova! let me check...08:25
AJaegertinwood: http://logs.openstack.org/00/584900/5/check/openstack-tox-py27/a9cc494/job-output.txt.gz#_2018-07-26_16_12_23_030838 does not check out nova at all08:25
AJaegertinwood: you check it out in tox.ini yourself, that's your problem. You should use required-projects and follow the work we did for horizon and neutron to be able to install them from source08:26
tinwoodAJaeger, I'm confused then. How can nova-lxd be tested at all without nova?  (it doesn't mock everything out and the import error is in nova)08:27
AJaegertinwood: check how the networking projects and the -dashboard ones are set up. You need the same setup.08:27
tinwoodAJaeger, okay, re: neutron and horizon.  I'll do some research on that; thanks for the pointers! :)08:27
AJaegertinwood: your tox.ini has egit+https://github.com/openstack/nova.git#egg=nova - that is master08:27
* tinwood sometimes only sees the wood, and didn't notice the trees.08:28
AJaegerif you add to the jobs a "required-projects: openstack/nova", then we would check out the correct branch for you ;)08:28
tinwoodAJaeger, thanks -- I will try that.08:28
AJaegertinwood: sorry, no more time to guide you through the rest - if you need more help, best ask rest of team later.08:28
tinwoodAJaeger, no problem; I'll look at the work in the other projects you mentioned and see where I get to.  Thanks very much.08:29
*** mgoddard has quit IRC08:34
*** jaosorior has quit IRC08:38
*** pbourke has quit IRC08:41
*** roman_g has joined #openstack-infra08:41
*** vivsoni has quit IRC08:41
*** pbourke has joined #openstack-infra08:42
*** lifeless has quit IRC08:54
*** jaosorior has joined #openstack-infra08:58
*** vivsoni has joined #openstack-infra09:05
*** d0ugal has joined #openstack-infra09:08
*** lifeless has joined #openstack-infra09:11
*** caphrim007 has joined #openstack-infra09:14
*** caphrim007 has quit IRC09:18
*** quiquell has quit IRC09:27
*** quiquell has joined #openstack-infra09:27
*** zoli is now known as zoli|lunch09:32
*** rcarrill1 is now known as rcarrillocruz09:41
*** andymccr- has joined #openstack-infra09:47
*** jaosorior has quit IRC09:49
*** andymccr_ has quit IRC09:50
*** johnthetubaguy has quit IRC09:52
*** stakeda has quit IRC10:03
*** andymccr has quit IRC10:04
*** andymccr- is now known as andymccr10:05
*** sshnaidm|afk has quit IRC10:09
*** alexchadin has quit IRC10:26
*** yamamoto has quit IRC10:30
*** alexchadin has joined #openstack-infra10:39
*** kjackal has quit IRC10:43
*** gongysh has quit IRC10:47
*** dtantsur|bbl is now known as dtantsur10:49
*** zoli|lunch is now known as zoli11:02
*** cshastri has quit IRC11:02
*** dhill_ has quit IRC11:03
*** dave-mccowan has joined #openstack-infra11:06
*** larainema has quit IRC11:15
*** cshastri has joined #openstack-infra11:17
*** yamamoto has joined #openstack-infra11:18
*** quiquell has quit IRC11:21
*** vivsoni has quit IRC11:23
*** quiquell has joined #openstack-infra11:24
*** jamesde__ has joined #openstack-infra11:31
*** jamesden_ has quit IRC11:32
*** jamesde__ has quit IRC11:34
*** zul has quit IRC11:34
*** kjackal has joined #openstack-infra11:34
*** alexchadin has quit IRC11:34
*** alexchadin has joined #openstack-infra11:35
*** alexchadin has quit IRC11:35
*** alexchadin has joined #openstack-infra11:36
*** alexchadin has quit IRC11:36
*** alexchadin has joined #openstack-infra11:36
*** alexchadin has quit IRC11:37
*** quiquell has quit IRC11:40
*** d0ugal has quit IRC11:41
*** quiquell has joined #openstack-infra11:42
*** quiquell has quit IRC11:43
*** rh-jelabarre has joined #openstack-infra11:47
*** n-saito has quit IRC11:48
*** yamamoto has quit IRC11:54
*** tpsilva has joined #openstack-infra11:54
*** rfolco|off is now known as rfolco|ruck11:55
*** yamamoto has joined #openstack-infra11:59
*** pawelzny has joined #openstack-infra12:00
*** linkmark has joined #openstack-infra12:03
*** pawelzny has left #openstack-infra12:05
*** boden has joined #openstack-infra12:07
*** briancurtin has quit IRC12:07
*** sshnaidm has joined #openstack-infra12:08
*** sshnaidm is now known as sshnaidm|off12:08
*** caphrim007 has joined #openstack-infra12:14
*** dhill_ has joined #openstack-infra12:15
*** alexchadin has joined #openstack-infra12:16
*** edmondsw has joined #openstack-infra12:17
*** johnthetubaguy has joined #openstack-infra12:17
*** caphrim007 has quit IRC12:18
*** alexchadin has quit IRC12:20
*** zul has joined #openstack-infra12:22
*** armaan has joined #openstack-infra12:22
openstackgerritTobias Urdin proposed openstack-infra/system-config master: Mirror puppet5 for Ubuntu Bionic  https://review.openstack.org/58652612:22
*** jcoufal has joined #openstack-infra12:25
*** wolverineav has joined #openstack-infra12:26
*** annp has quit IRC12:27
*** zul has quit IRC12:28
*** zul has joined #openstack-infra12:30
*** alexchadin has joined #openstack-infra12:33
*** pblaho has quit IRC12:33
*** diablo_rojo has joined #openstack-infra12:33
*** trown|outtypewww is now known as trown12:34
*** mriedem has joined #openstack-infra12:35
*** zul has quit IRC12:36
*** agopi has quit IRC12:37
*** zul has joined #openstack-infra12:37
*** armaan has quit IRC12:41
*** yamamoto has quit IRC12:42
*** zul has quit IRC12:44
*** armaan has joined #openstack-infra12:45
*** zul has joined #openstack-infra12:45
*** rlandy has quit IRC12:48
*** rlandy has joined #openstack-infra12:48
*** armaan has quit IRC12:49
*** armaan has joined #openstack-infra12:50
*** zul has quit IRC12:51
*** zul has joined #openstack-infra12:53
*** armaan has quit IRC12:54
*** zul has quit IRC12:58
*** yamamoto has joined #openstack-infra12:59
*** zul has joined #openstack-infra13:00
*** rcarrill1 has joined #openstack-infra13:01
*** hamerins has joined #openstack-infra13:01
*** rcarrill2 has joined #openstack-infra13:03
*** rpioso|afk is now known as rpioso13:03
*** rcarrillocruz has quit IRC13:03
*** sthussey has joined #openstack-infra13:04
*** rcarrill2 is now known as rcarrillocruz13:05
*** rcarrill1 has quit IRC13:06
*** rcarrill1 has joined #openstack-infra13:08
openstackgerritMatthieu Huin proposed openstack-infra/zuul master: scheduler: return project_canonical in status page  https://review.openstack.org/58245113:08
*** rcarrillocruz has quit IRC13:10
*** zul has quit IRC13:14
*** zul has joined #openstack-infra13:15
AJaegerconfig-core, could you go over the review queues, please? We have collected a few changes... Some older changes with already one +2 are: https://review.openstack.org/#/c/584366/ https://review.openstack.org/#/c/581096/ https://review.openstack.org/#/c/581064/ https://review.openstack.org/58549313:16
*** jcoufal has quit IRC13:16
*** mandre is now known as mandre_away13:18
*** rcarrillocruz has joined #openstack-infra13:19
pabelanger+313:19
*** jcoufal has joined #openstack-infra13:20
*** zul has quit IRC13:21
*** rcarrill1 has quit IRC13:21
*** rcarrill1 has joined #openstack-infra13:22
*** agopi has joined #openstack-infra13:22
*** zul has joined #openstack-infra13:23
*** rcarrillocruz has quit IRC13:25
*** pblaho has joined #openstack-infra13:25
*** pblaho has quit IRC13:25
*** pblaho has joined #openstack-infra13:25
AJaegerthanks, pabelanger13:25
*** mdrabe has joined #openstack-infra13:26
*** zul has quit IRC13:28
*** caphrim007 has joined #openstack-infra13:29
*** caphrim007_ has joined #openstack-infra13:29
*** rosmaita has joined #openstack-infra13:30
*** zul has joined #openstack-infra13:30
*** auristor has quit IRC13:30
*** rcarrill1 is now known as rcarrillocruz13:31
*** jistr is now known as jistr|mtg13:32
*** kgiusti has joined #openstack-infra13:32
*** caphrim007 has quit IRC13:33
*** myoung has joined #openstack-infra13:35
*** caphrim007_ is now known as caphrim00713:35
*** rcarrill1 has joined #openstack-infra13:35
*** quiquell has joined #openstack-infra13:35
AJaegerconfig-core, a few more changes for review https://review.openstack.org/582317 https://review.openstack.org/#/c/585966/ https://review.openstack.org/#/c/586280/ https://review.openstack.org/#/c/585919/13:36
*** alexchadin has quit IRC13:36
openstackgerritAndreas Jaeger proposed openstack-infra/infra-manual master: Remove Zuul v2 content  https://review.openstack.org/58654913:36
openstackgerritMerged openstack-infra/openstack-zuul-jobs master: Remove sahara-extra legacy jobs, moving in-tree  https://review.openstack.org/58109613:37
openstackgerritMerged openstack-infra/openstack-zuul-jobs master: Remove legacy-tempest-dsvm-nova-os-vif  https://review.openstack.org/58549313:37
openstackgerritMonty Taylor proposed openstack-infra/zuul-jobs master: Add a yarn role  https://review.openstack.org/58655013:37
*** auristor has joined #openstack-infra13:37
*** rcarrillocruz has quit IRC13:37
AJaegermordred: shall I +2A https://review.openstack.org/#/c/511827/ and https://review.openstack.org/#/c/511823/ ? Or do you wait for the rest first?13:38
mordredAJaeger: I think they're both safe to land - let's go ahead13:39
*** yamamoto has quit IRC13:39
AJaegerdone - +313:39
openstackgerritJoshua Hesketh proposed openstack-infra/zuul master: Add instructions for deploying zuul with openSUSE  https://review.openstack.org/58125513:41
*** hamerins has quit IRC13:41
*** rcarrillocruz has joined #openstack-infra13:41
openstackgerritMonty Taylor proposed openstack-infra/zuul master: Change npm reference to yarn  https://review.openstack.org/58655213:41
*** alexchadin has joined #openstack-infra13:42
*** zul has quit IRC13:43
openstackgerritMerged openstack-infra/infra-manual master: Drop references to system-required template  https://review.openstack.org/57918013:43
*** rcarrill1 has quit IRC13:43
*** hamerins has joined #openstack-infra13:45
*** yamamoto has joined #openstack-infra13:45
*** rcarrill1 has joined #openstack-infra13:48
*** alexchadin has quit IRC13:49
*** rcarrillocruz has quit IRC13:50
*** rcarrillocruz has joined #openstack-infra13:51
*** rcarrill1 has quit IRC13:54
*** ianychoi has joined #openstack-infra13:56
openstackgerritMerged openstack-infra/project-config master: Add pbrx-release team for pbrx  https://review.openstack.org/58436613:58
openstackgerritMerged openstack-infra/project-config master: import networking-cisco jobs into net-cisco tree  https://review.openstack.org/58106413:58
*** zul has joined #openstack-infra13:58
*** ianychoi_ has quit IRC14:00
openstackgerritMerged openstack-infra/system-config master: Replace mirror01.us-west-1.packethost.openstack.org  https://review.openstack.org/58633714:06
*** links has quit IRC14:10
openstackgerritMerged openstack-infra/project-config master: Remove legacy scenario multinode job for cinder  https://review.openstack.org/58385314:11
*** jamesmcarthur has joined #openstack-infra14:12
*** pblaho has quit IRC14:14
*** eharney has joined #openstack-infra14:15
*** jlvacation is now known as jlvillal14:18
*** shardy has quit IRC14:18
*** r-daneel has joined #openstack-infra14:18
openstackgerritMerged openstack-infra/project-config master: Set zuul_output_dir in site-variables  https://review.openstack.org/51182714:18
*** felipemonteiro has joined #openstack-infra14:19
*** dhajare has quit IRC14:20
mordredpabelanger: lookie there ^^ !14:21
mordredpabelanger: progress!14:21
pabelangeryay14:22
*** diablo_rojo has quit IRC14:22
pabelangermordred: I think once we restart zuul today (assuming today) I can finish up some testing that depends on rsync patch. Then will reply again to ML with some notes14:22
* mordred does a little dance14:24
*** alexchadin has joined #openstack-infra14:26
*** mdrabe has quit IRC14:26
*** links has joined #openstack-infra14:29
*** alexchadin has quit IRC14:30
*** ramishra has quit IRC14:31
pabelangerclarkb: email sent to packethost to debug building nodes again, only 3 nodes in use ATM14:37
pabelangerhttp://grafana.openstack.org/d/U462abNik/nodepool-packethost14:37
*** jistr|mtg is now known as jistr14:39
*** shardy has joined #openstack-infra14:40
*** felipemonteiro_ has joined #openstack-infra14:42
*** felipemonteiro has quit IRC14:46
*** diablo_rojo has joined #openstack-infra14:46
*** dhajare has joined #openstack-infra14:46
*** ramishra has joined #openstack-infra14:49
*** efried is now known as fried_rice14:49
Adri2000hello, would appreciate a quick review for this planet.o.o addition https://review.openstack.org/#/c/586254/ :)14:51
*** links has quit IRC15:02
mordredpabelanger, clarkb: http://logs.openstack.org/62/586262/4/check/openstack-tox-lower-constraints/535b826/job-output.txt.gz#_2018-07-27_13_30_48_57567615:03
mordredpabelanger, clarkb: also - mirror issues there15:03
*** alexchadin has joined #openstack-infra15:04
*** mdrabe has joined #openstack-infra15:05
pabelangerugh15:05
pabelangerlet me see if server is off again15:06
pabelangeryah, cannot SSH again15:07
pabelangerthis is new mirror too15:07
pabelangerI'll reboot via API again15:07
*** r-daneel_ has joined #openstack-infra15:08
*** r-daneel has quit IRC15:09
*** r-daneel_ is now known as r-daneel15:09
openstackgerritMonty Taylor proposed openstack-infra/zuul master: web: add /{tenant}/job/{job_name} route  https://review.openstack.org/55097815:12
openstackgerritMonty Taylor proposed openstack-infra/zuul master: web: add /{tenant}/projects and /{tenant}/project/{project} routes  https://review.openstack.org/55097915:12
openstackgerritMonty Taylor proposed openstack-infra/zuul master: web: add /{tenant}/pipelines route  https://review.openstack.org/54152115:12
mordredpabelanger: joy15:12
*** ramishra has quit IRC15:13
pabelangermordred: clarkb: is back, but maybe we should disable packethost until we resolve mirror issues15:13
mordredyeah15:13
mordredseems reasonable15:13
*** ginopc has joined #openstack-infra15:13
mnaserfwiw we have an expected insane spike today of infra, but after that i should be able to restore our stuff and be able to actively troubleshoot stuff together15:13
*** ginopc has quit IRC15:14
openstackgerritPaul Belanger proposed openstack-infra/project-config master: Disable packethost in nodepool  https://review.openstack.org/58660115:14
pabelangermnaser: clarkb: ^15:15
pabelangerwill follow up again with john on email15:15
*** cshastri has quit IRC15:15
*** e0ne has quit IRC15:16
pabelangermnaser: yay15:16
smcginnisAre the mirror issues mentioned above the reason for a bunch of job failures with a finger:// link as the result with RETRY_LIMIT status?15:21
mordredsmcginnis: probably?15:22
*** eernst has joined #openstack-infra15:23
mordredsmcginnis: got an example?15:23
pabelangerHmm15:23
pabelangerlooking15:23
smcginnisTake your pick? http://zuul.openstack.org/15:24
smcginnis:)15:24
pabelangernope15:24
pabelangerwe have broken jobs15:24
*** alexchadin has quit IRC15:24
pabelanger2018-07-27 15:06:23,037 DEBUG zuul.AnsibleJob: [build: 635c8ce2eaf34e21a490fea197fed485] Ansible output: b"The error appears to have been in '/etc/zuul/site-variables.yaml': line 11, column 1, but may"15:24
quiquellHello15:25
quiquellI see some retry_limit at zuul.o.o at the gates, are they important ?15:25
quiquellHere for example 586499,115:25
mordreduhoh15:26
pabelanger1 sec15:26
pabelangerhave patch15:26
mordredpabelanger: we're gonna need to force-merge it I'm betting15:26
openstackgerritPaul Belanger proposed openstack-infra/project-config master: Fix site-variables typo  https://review.openstack.org/58660215:26
pabelangeryes15:26
pabelangermordred: ^15:26
pabelangerthen kick puppet15:26
mordredpabelanger: on it15:26
*** eernst has quit IRC15:27
openstackgerritMerged openstack-infra/project-config master: Fix site-variables typo  https://review.openstack.org/58660215:27
pabelangernow we need to kick.sh all zuul-executors15:27
pabelangeror wait15:27
mordredpabelanger: you on that or want me to?15:27
smcginnisSurprised the linter jobs let that through in the first palce.15:27
smcginnis*place15:27
mordredI believe we might not be linting that file - which is a clear mistake15:28
pabelangermordred: i can15:28
* mordred works on getting linters applied to that file to prevent in future15:28
*** eernst has joined #openstack-infra15:28
pabelangerkicking now15:29
prometheanfiregate sad? https://review.openstack.org/58655315:29
pabelangeryes, fixing now15:29
*** annp has joined #openstack-infra15:30
prometheanfirethanks15:30
prometheanfirerechecks needed?15:30
prometheanfireguess bot will announce if so15:30
pabelangeryah, we'll announce15:31
pabelangeronce fixed15:31
smcginnisMight be an opportune time to do that restart now I suppose.15:31
mordredhow about: #status alert A zuul config error slipped through and caused a pile of job failures. A fix has been applied, but jobs showing finger urls and RETRY_LIMIT will need to be rechecked.15:32
smcginnismordred: That looks good to me.15:33
mordredsmcginnis: not a terrible idea ... but I don't know if we want to conflate/confound things?15:33
pabelangerkick.sh still running, a little slow due to executors load15:33
mordredinfra-root: since we just bombed out a ton of jobs for people, should we take this opportunity to restart zuul?15:33
quiquellmordred: That's why we see some retry_limit at the gates ?15:34
mordredquiquell: yup15:34
quiquellDamn ok15:34
mordredhow about: #status alert A zuul config error slipped through and caused a pile of job failures with retry_limit - a fix is being applied and should be back up in a few minutes15:34
*** kashyap has left #openstack-infra15:34
*** andymccr has quit IRC15:34
mordrednow - and we can follow up with a fix announcement when kick is done?15:34
pabelangerlet's wait, load is really high on executor and puppet is running slow15:35
smcginnisThat might be good to head off all the questions that will be coming this way.15:35
mordred#status alert A zuul config error slipped through and caused a pile of job failures with retry_limit - a fix is being applied and should be back up in a few minutes15:35
openstackstatusmordred: sending alert15:35
*** andymccr has joined #openstack-infra15:35
*** quiquell is now known as quique|luckyluck15:36
pabelangerwe might have to stop zuul-executors for puppet to run, ansible-playbooks launching a lot of builds15:36
*** quique|luckyluck is now known as quiquell15:37
*** xinliang3 has joined #openstack-infra15:37
*** xinliang has quit IRC15:37
-openstackstatus- NOTICE: A zuul config error slipped through and caused a pile of job failures with retry_limit - a fix is being applied and should be back up in a few minutes15:38
*** ChanServ changes topic to "A zuul config error slipped through and caused a pile of job failures with retry_limit - a fix is being applied and should be back up in a few minutes"15:38
* clarkb catches up15:38
mordredclarkb: tl;dr - we don't lint the site-variables file and landed a variable addition with a syntax error which is now causing all zuul jobs to fail15:39
*** quiquell has quit IRC15:39
*** hongbin_ has joined #openstack-infra15:39
corvuswhy did we change site vars?15:39
mordredto add the zuul_output_dir variable15:39
pabelangerokay, puppet has started fixing the files15:39
mordredwoot15:40
clarkbpabelanger: re packethost we lost the new mirror node too?15:40
mordredfor the logging change series - we missed a closing double quote15:40
pabelangerclarkb: only new mirror node, mirror01 still running15:40
clarkbhuh15:40
clarkbwell that change is approved for whenever zuul is running jobs again15:40
pabelangerokay, kick.sh is done15:41
mordredwoot15:41
roman_gHello. Please, review/+2/merge: minor thing - adding statusbot to the #airshipit channel - https://review.openstack.org/#/c/581704/15:41
openstackstatusmordred: finished sending alert15:42
*** diablo_rojo has quit IRC15:42
pabelangerI can see jobs running again15:42
mordredshall we send #status ok ?15:43
*** yamamoto has quit IRC15:44
clarkbmordred: are we also going to restart zuul?15:44
clarkbor shall we wait on that?15:44
pabelangerokay, just saw a successful job15:45
pabelangercan send ok, or restart zuul.15:45
pabelangerhappy to do either15:45
corvusi think we've spent our allotment of good will for the day.15:46
*** mdrabe has quit IRC15:46
clarkbya if it is already running jobs then we may want to wait15:46
*** mdrabe has joined #openstack-infra15:46
pabelangerwfm15:46
clarkbthe problem was a yaml syntax error in a file we didn't lint?15:46
corvusyep15:47
corvusclarkb: change in question: https://review.openstack.org/51182715:47
openstackgerritMonty Taylor proposed openstack-infra/project-config master: Add linters check to make sure site-variables is yaml  https://review.openstack.org/58660415:47
*** yamamoto has joined #openstack-infra15:48
mordredfwiw, I tried yamllint first - but it got unhappy about a line being too long and I didn't really feel like trying to deal with that15:48
*** yamamoto has quit IRC15:49
*** yamamoto has joined #openstack-infra15:49
corvus++15:49
*** yamamoto has quit IRC15:49
*** yamamoto has joined #openstack-infra15:50
*** yamamoto has quit IRC15:50
corvusi actually think that's best.  i don't think i want to force everyone to figure out how to break "{{ some really long jinja formatted string }} | with_filters_and_stuff" across 2 lines in yaml and still maintain internal validity.15:50
mordredcorvus: me either15:50
*** yamamoto has joined #openstack-infra15:50
openstackgerritJames E. Blair proposed openstack-infra/zuul-jobs master: Add upload-logs-swift role  https://review.openstack.org/58454115:51
pabelangermordred: send all clear?15:51
mordredhow about #status ok Zuul config error has been rectified (and a test added to prevent in the future) Please recheck any RETRY_LIMIT failures from the last half hour or so15:52
pabelanger++15:52
corvuswfm15:52
clarkbmordred: should we add pyyaml to the deps list in tox.ini?15:52
clarkbmordred: status message lgtm15:52
*** gyee has joined #openstack-infra15:52
mordredclarkb: probably yes15:52
mordredoh - no - it already has ansible and ansible-lint15:53
mordredso it's got pyyaml15:53
*** yamamoto has quit IRC15:54
clarkbapproved (I even double checked it failed against the old broken file)15:55
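The kind of check discussed above can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions, not the content of change 586604: it assumes PyYAML is installed (per the discussion, it comes along with ansible), the function name is made up for illustration, and the sample value for zuul_output_dir is hypothetical, chosen only to reproduce a missing closing double quote.

```python
import yaml  # PyYAML; assumed to be available (it is pulled in by ansible)


def yaml_syntax_error(text):
    """Return None if text parses as YAML, otherwise the parser's message.

    Hypothetical helper name; it only checks syntax, not the meaning of
    any variable.
    """
    try:
        yaml.safe_load(text)
    except yaml.YAMLError as exc:
        return str(exc)
    return None


# Hypothetical file contents: the first value lacks its closing quote,
# the second is the same line with the quote restored.
broken = 'zuul_output_dir: "{{ ansible_user_dir }}/zuul-output\n'
fixed = 'zuul_output_dir: "{{ ansible_user_dir }}/zuul-output"\n'

for name, text in (("broken", broken), ("fixed", fixed)):
    error = yaml_syntax_error(text)
    print(name, "is not valid YAML" if error else "parses cleanly")
```

In a linters job one would presumably read the real file and exit non-zero when the function returns an error; the file path and the job wiring are left out here because the log does not show them. A plain parse like this also sidesteps the line-length complaint that came up with yamllint, since it only checks that the file loads.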
*** rpittau has quit IRC15:58
*** r-daneel_ has joined #openstack-infra15:58
mordredyay!15:58
openstackgerritMerged openstack-infra/project-config master: Minor typo fix: duplicate HDD label  https://review.openstack.org/58628015:58
openstackgerritMerged openstack-infra/project-config master: Add gerritbot for StarlingX (openstack/stx-*) projects to #starlingx IRC channel  https://review.openstack.org/58591915:59
*** r-daneel has quit IRC16:00
*** r-daneel_ is now known as r-daneel16:00
*** shardy has quit IRC16:01
*** openstackgerrit has quit IRC16:04
*** jpich has quit IRC16:06
corvusi have to run some errands, biab.16:08
*** zoli is now known as zoli|gone16:09
*** zoli|gone is now known as zoli16:09
*** openstackgerrit has joined #openstack-infra16:10
openstackgerritsebastian marcet proposed openstack-infra/openstackid master: Fixed UX on ODIC RP initiated logout  https://review.openstack.org/58660816:10
*** lbragstad_ is now known as lbragstad16:11
*** links has joined #openstack-infra16:14
*** mriedem is now known as mriedem_away16:15
*** trown is now known as trown|lunch16:15
*** jamesmcarthur has quit IRC16:16
*** links has quit IRC16:17
*** links has joined #openstack-infra16:17
*** 07IADF5PV has joined #openstack-infra16:18
*** armax has joined #openstack-infra16:21
openstackgerritMerged openstack-infra/zuul-jobs master: Add role to ensure per-node output dirs exist  https://review.openstack.org/51182316:21
*** yolanda has quit IRC16:22
*** links has quit IRC16:23
*** gema has quit IRC16:26
*** harlowja has joined #openstack-infra16:27
*** gema has joined #openstack-infra16:27
*** yolanda has joined #openstack-infra16:29
*** derekh has quit IRC16:30
*** tesseract has quit IRC16:32
*** fried_rice is now known as fried_rolls16:33
clarkbpabelanger: should we manually disable packethost while we wait for zuul to catch up?16:34
openstackgerritMerged openstack-infra/project-config master: Disable packethost in nodepool  https://review.openstack.org/58660116:37
clarkbpabelanger: ^ nevermind that should get in soon enough16:39
*** annp has quit IRC16:39
*** pguimaraes has quit IRC16:39
pabelangerkk16:43
*** john_studarus has joined #openstack-infra16:49
*** felipemonteiro__ has joined #openstack-infra16:52
*** felipemonteiro_ has quit IRC16:52
*** diablo_rojo has joined #openstack-infra17:00
clarkbcorvus: pabelanger mordred thinking ahead here a bit zuul demand tends to fall off around 2-3pm pacific. Thoughts on planning to restart then? It may even happen quicker on a friday17:02
*** panda|rover is now known as panda|rover|off17:03
*** trown|lunch is now known as trown17:05
pabelangershould be around still17:06
*** felipemonteiro_ has joined #openstack-infra17:06
pabelanger2pmPST a little better17:06
*** yamahata has quit IRC17:07
openstackgerritMerged openstack-infra/project-config master: Add linters check to make sure site-variables is yaml  https://review.openstack.org/58660417:09
*** felipemonteiro__ has quit IRC17:10
*** dtantsur is now known as dtantsur|afk17:10
openstackgerritMerged openstack-infra/openstackid master: Fixed UX on ODIC RP initiated logout  https://review.openstack.org/58660817:11
*** felipemonteiro__ has joined #openstack-infra17:18
*** felipemonteiro_ has quit IRC17:18
*** hashar has quit IRC17:19
clarkbjohnsom: https://review.openstack.org/#/c/585184/2 any reason for me to not approve that now?17:19
johnsomclarkb No, that is fine. Thanks for the heads up17:20
johnsomclarkb Looks like the job doesn't have a link to the rendered docs anymore. Is that fixed somewhere?17:22
*** mriedem_away is now known as mriedem17:23
clarkbjohnsom: which job?17:23
johnsomclarkb The job Carlos linked: https://review.openstack.org/#/c/585185/17:23
openstackgerritMerged openstack-infra/project-config master: Drop requirements-check from renderspec  https://review.openstack.org/58475317:24
clarkbhttp://logs.openstack.org/85/585185/2/check/openstack-tox-docs/7cb8e2d/ for anyone else following along17:24
clarkbjohnsom: I'm not sure why that happened. Maybe because its running the normal tox role which doesn't know about the docs artifacts17:25
johnsomPreviously the job link was to the rendered docs. It was pretty handy. Trying to remember where that was configured17:25
clarkbmordred: AJaeger ^ have been involved with that I think17:25
johnsomThis was the old config: http://git.openstack.org/cgit/openstack-infra/openstack-zuul-jobs/tree/zuul.d/jobs.yaml#n27317:29
clarkbjohnsom: I don't think the change above changed that (since you cannot depends on a trusted repo and have it change pre merge)17:30
clarkbjohnsom: some other change must've changed that17:30
*** harlowja has quit IRC17:31
clarkbah ya the change I just approved will switch you to that job17:32
clarkbjohnsom: it should work as expected once that change merges17:32
johnsomHmm, ok17:32
*** felipemonteiro_ has joined #openstack-infra17:34
openstackgerritMerged openstack-infra/project-config master: Update oslo.limit testing jobs  https://review.openstack.org/57969017:34
openstackgerritMerged openstack-infra/project-config master: Run propose-updates for requirements-constraints to Python3.6  https://review.openstack.org/58556917:34
openstackgerritMerged openstack-infra/project-config master: Switch to publish-openstack-docs-pti in Octavia  https://review.openstack.org/58518417:34
*** felipemonteiro__ has quit IRC17:37
*** yamahata has joined #openstack-infra17:44
*** dhajare has quit IRC17:48
openstackgerritMerged openstack-infra/project-config master: Change neutron CI dashboard to a week view  https://review.openstack.org/58329918:02
*** 07IADF5PV has quit IRC18:06
*** myoung is now known as myoung|lunch18:09
*** jtomasek has quit IRC18:10
*** mriedem1 has joined #openstack-infra18:14
*** mriedem has quit IRC18:14
*** john_studarus has quit IRC18:15
*** harlowja has joined #openstack-infra18:15
*** diablo_rojo has quit IRC18:16
*** gema has quit IRC18:17
*** jamesmcarthur has joined #openstack-infra18:19
*** mriedem1 is now known as mriedem18:22
*** jamesmcarthur has quit IRC18:23
*** dhajare has joined #openstack-infra18:26
openstackgerritMerged openstack-infra/project-config master: Set up translation for cloudkitty-dashboard  https://review.openstack.org/58231718:27
*** Qiming has quit IRC18:27
*** jamesmcarthur has joined #openstack-infra18:27
*** pguimaraes has joined #openstack-infra18:29
johnsomclarkb Nope, this just isn't configured for those new jobs: https://review.openstack.org/#/c/586625/18:29
johnsomIt doesn't look like the new job is even collecting the rendered docs....18:29
*** Qiming has joined #openstack-infra18:29
clarkbjohnsom: I don't think that is the new job18:30
clarkbthat is the old one18:30
johnsomNope, the old one rendered this stuff18:30
clarkbIts the job that was running before the change I approved18:31
johnsomNo, this is a DNM job I put in after the project-config change merged to test it18:31
johnsomLooking at the job definitions it is clearly missing18:31
*** rkukura has quit IRC18:32
clarkbah18:32
johnsomThis is an example of the old job http://logs.openstack.org/64/585864/2/check/build-openstack-sphinx-docs/df0c553/html/18:32
johnsomI am super booked today, but maybe later I can take a minute and propose fixes18:32
*** sshnaidm|off has quit IRC18:32
clarkbI think the reason this is happening is openstack-tox-docs is "run tox docs env"18:33
clarkband implies nothing about copying additional artifacts18:33
clarkb(at least as it is currently written)18:33
johnsomNo, that is perfectly fine, it's just that the post task is not collecting the output and not setting the success link18:33
clarkbright18:33
johnsomlol, maybe we are saying the same thing...18:34
clarkblooking at openstack-zuul-jobs more closely what is the difference between publish-openstack-sphinx-docs and publish-openstack-docs-pti supposed to be? functionally it's this job that doesn't publish the docs vs the one that does18:34
corvusclarkb: i'll be around (more reliably than this morning) then18:35
clarkbcorvus: ok, I think we can probably do a restart around then assuming demand falls off as it has in prior days18:36
openstackgerritMerged openstack-infra/zuul master: Add debug message to job freezing  https://review.openstack.org/58275018:36
openstackgerritMerged openstack-infra/project-config master: Aslo apply the py35 job for trove-dashboard queens  https://review.openstack.org/58596618:36
*** njohnston has joined #openstack-infra18:41
*** jamesmcarthur has quit IRC18:43
njohnstonHi!  I saw a job failure with an odd reason why, wanted to let you know.  So the test run for this neutron-functional job actually passed, but the failure happened in the post-testing portion.  trying to install os-log-merger it got 'No route to host' installing from pypi mirror on http://mirror.us-west-1.packethost.openstack.org18:45
njohnstonhttp://logs.openstack.org/89/585489/10/check/neutron-functional/0f04b1e/job-output.txt.gz#_2018-07-27_16_36_40_70833018:45
*** diablo_rojo has joined #openstack-infra18:45
clarkbnjohnston: thanks for the heads up. That mirror had crashed earlier today, we've removed that cloud region from nodepool temporarily while we figure out why that happened18:46
njohnstonThanks clarkb and all the infra team for being on it as always!18:46
njohnstonFYI the error in question happened about 10 minutes ago18:46
clarkbnjohnston: the logged timestamp was from just over 2 hours ago (16:36UTC vs 18:47UTC now)18:47
*** r-daneel_ has joined #openstack-infra18:47
*** r-daneel has quit IRC18:47
*** r-daneel_ is now known as r-daneel18:47
njohnstonoh gosh, sorry about that, I misread18:48
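The age of the failure can be checked directly from the timestamp embedded in the log URL anchor. This is a minimal sketch: the anchor string and the "18:47UTC now" value come from the chat, while the zero seconds on the current time are an assumption.

```python
from datetime import datetime, timedelta, timezone

# Timestamp taken from the log URL anchor "#_2018-07-27_16_36_40_708330".
anchor = "2018-07-27_16_36_40_708330"
logged = datetime.strptime(anchor, "%Y-%m-%d_%H_%M_%S_%f").replace(
    tzinfo=timezone.utc
)

# "18:47UTC now" from the chat; the seconds are assumed to be zero.
now = datetime(2018, 7, 27, 18, 47, tzinfo=timezone.utc)

age = now - logged
print(f"failure was logged {age} ago")
```

Both values are treated as UTC, so no timezone conversion is involved in the subtraction.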
*** gema has joined #openstack-infra18:49
*** gema has quit IRC18:54
*** v1a4 has joined #openstack-infra18:55
*** fried_rolls is now known as fried_rice18:59
*** sthussey has quit IRC19:03
*** hamerins has quit IRC19:03
*** r-daneel has quit IRC19:04
*** gema has joined #openstack-infra19:05
openstackgerritPaul Belanger proposed openstack-infra/zuul-jobs master: Copy inventory file first for validate-host  https://review.openstack.org/58663919:11
clarkbpabelanger: re ^ the plan is to remove that task from validate hosts entirely19:14
clarkbpabelanger: the new log-inventory role is used instead19:14
pabelangerclarkb: oh19:15
pabelangeras long as we run it before validate-host, I'm fine with it19:15
clarkbpabelanger: https://review.openstack.org/#/c/563789/219:15
clarkbpabelanger: you might suggest flipping the order of the roles in that change19:16
pabelangeryah19:16
clarkbpabelanger: feel free to just push a new ps to that change19:17
clarkb(to get the order right)19:17
*** myoung|lunch is now known as myoung19:18
*** jtomasek has joined #openstack-infra19:18
openstackgerritPaul Belanger proposed openstack-infra/project-config master: Use log-inventory in base jobs  https://review.openstack.org/56378919:19
pabelangerdone19:19
*** jtomasek has quit IRC19:19
clarkbianw: I think we can work to get ^ in next week now that feature freeze is behind us19:19
*** rtjure has quit IRC19:25
pabelangerclarkb: node requests have dropped to 22, seem to have caught up now19:28
*** rkukura has joined #openstack-infra19:28
clarkbya maybe after lunch we go for it19:30
clarkbI need to eat now and expect corvus is doing similar19:30
*** eharney has quit IRC19:31
*** rtjure has joined #openstack-infra19:35
zxiirowe released JJB 2.2.0 this morning but I don't see it in pypi yet. Seems like the release job didn't kick off?19:36
zxiiroany way to force it to start?19:36
openstackgerritMerged openstack-infra/zuul master: fix zuul from scratch user and group creation  https://review.openstack.org/58125419:37
openstackgerritMerged openstack-infra/zuul master: Add instructions for deploying zuul with openSUSE  https://review.openstack.org/58125519:37
clarkbzxiiro: it may have gotten caught up in the zuul misconfiguration fallout that caused a bunch of tests to fail early19:38
clarkbzxiiro: we can reenqueue it19:38
fungihttp://zuul.openstack.org/builds.html confirms that's what happened19:38
clarkbI'm grabbing lunch now and can look at enqueuing it after19:38
clarkbor maybe someone else will beat me to it19:38
fungii'll take a quick look19:39
zxiiromakes sense19:39
zxiirois there a keyword to requeue these things? or do I need to poke you guys when these things happen?19:39
*** lbragstad_ has joined #openstack-infra19:40
*** lbragstad has quit IRC19:41
funginope19:41
fungii ran the following from a shell on zuul.o.o: sudo zuul enqueue-ref --tenant=openstack --trigger=gerrit --pipeline=release --project=openstack-infra/jenkins-job-builder --ref=refs/tags/2.2.0 --newrev=014aa39477bad6eadba3c351ce24935e749cd23519:42
fungithe jobs for it should be queuing up already19:42
zxiirofungi: ok, thanks!19:43
funginope to there being a user-facing api for it (yet), yep to needing to raise it with one of the zuul admins (or tag a higher numbered release)19:43
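The re-enqueue command quoted above can also be assembled programmatically, which helps when the same call has to be repeated for several tags. This is a minimal sketch that only builds and prints the argument list; the helper name build_enqueue_ref_command is hypothetical, and the flags and values are copied from the command in the log.

```python
import shlex


def build_enqueue_ref_command(tenant, trigger, pipeline, project, ref, newrev):
    """Build the argv list for the `zuul enqueue-ref` call quoted in the log.

    The helper name is hypothetical; the flags are the ones shown in the chat.
    """
    return [
        "sudo", "zuul", "enqueue-ref",
        f"--tenant={tenant}",
        f"--trigger={trigger}",
        f"--pipeline={pipeline}",
        f"--project={project}",
        f"--ref={ref}",
        f"--newrev={newrev}",
    ]


cmd = build_enqueue_ref_command(
    tenant="openstack",
    trigger="gerrit",
    pipeline="release",
    project="openstack-infra/jenkins-job-builder",
    ref="refs/tags/2.2.0",
    newrev="014aa39477bad6eadba3c351ce24935e749cd235",
)
# Quote each argument so the printed line is safe to paste into a shell.
print(" ".join(shlex.quote(arg) for arg in cmd))
```

Nothing is executed here; the printed line would still have to be run by a Zuul admin on the scheduler host, as described in the chat.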
*** jamesmcarthur has joined #openstack-infra19:47
*** dhajare has quit IRC19:49
fungizxiiro: it's running now http://zuul.openstack.org/stream.html?uuid=e3241a29ac6743a2af7a767b37ab9edb&logfile=console.log19:49
zxiirogreat!19:50
*** eernst has quit IRC19:51
*** eernst has joined #openstack-infra19:53
*** eernst has quit IRC19:53
*** eernst has joined #openstack-infra19:54
*** rlandy is now known as rlandy|brb19:55
*** lbragstad_ is now known as lbragstad19:56
fungizxiiro: and seems to have worked fine this time: https://pypi.org/project/jenkins-job-builder/19:57
*** hamerins has joined #openstack-infra19:57
zxiirowoohoo19:57
corvusclarkb, fungi, mordred, pabelanger: i have replenished myself with a burrito at my local taqueria:  https://imgur.com/a/0yeGj8I19:59
pabelanger++20:00
*** ccamacho1 has joined #openstack-infra20:00
fungii'm technically still vacationing but am back in the cabin for a bit should help be required20:00
corvusi swear i didn't put that there :)20:00
*** ccamacho has quit IRC20:01
clarkbI had a giant bowl of fresh picked blueberries I mixed into cereal :)20:04
*** jcoufal has quit IRC20:05
*** e0ne has joined #openstack-infra20:06
*** kgiusti has left #openstack-infra20:07
*** rosmaita has quit IRC20:11
clarkbzuul==3.1.1.dev200  # git sha 22ad98a is the commit we have installed on all of the zuul nodes20:13
clarkbthat seems to be tip of master20:13
*** slaweq has joined #openstack-infra20:15
pabelangerlooks like it20:17
*** rlandy|brb is now known as rlandy20:17
*** jmorgan1 has quit IRC20:18
fungi22ad98a matches what i have too20:22
*** e0ne has quit IRC20:22
fungifor master branch tip on origin remote20:22
*** dtruong_ has quit IRC20:23
*** dtruong_ has joined #openstack-infra20:26
*** jmorgan1 has joined #openstack-infra20:28
*** dmsimard has quit IRC20:29
*** jamesmcarthur has quit IRC20:30
*** jamesmcarthur has joined #openstack-infra20:31
*** trown is now known as trown|outtypewww20:31
openstackgerritJames E. Blair proposed openstack-infra/zuul-jobs master: Add upload-logs-swift role  https://review.openstack.org/58454120:38
openstackgerritClark Boylan proposed openstack-infra/zuul master: Point Suse users are zookeeper releases page  https://review.openstack.org/58668220:39
*** slaweq has quit IRC20:40
openstackgerritJames E. Blair proposed openstack-infra/zuul-jobs master: Add upload-logs-swift role  https://review.openstack.org/58454120:40
*** felipemonteiro_ has quit IRC20:40
*** felipemonteiro_ has joined #openstack-infra20:40
*** v1a4 has quit IRC20:40
clarkbpabelanger: corvus should we work on restarting now?20:44
pabelangerready here20:46
corvusit's still a bit busy, but i think it'd recover quickly.  i reckon we could go for it.20:49
clarkbya its busy but not backlog busy I think we go for it too20:49
fungisounds good20:50
corvusclarkb: what's your preferred sequence?20:50
corvusi think we worked this out last time, and you had come up with the optimal20:50
*** jamesmcarthur has quit IRC20:50
clarkbprobably should've written it down :)20:50
corvusit's in eavesdrop somewhere :)20:50
clarkbI think it was stop scheduler, then stop mergers (to avoid merge failures), then stop executors. Start scheduler as soon as others are told to stop then start mergers and executors20:51
clarkbwith a save and restart job queues bookending that20:51
corvusyeah, with a note that we need to wait for the mergers to stop (fast) before starting them, then wait for the executors to stop (slow) before starting them20:52
pabelanger+120:53
corvus(i mean, if you follow the rule don't start it until it's stopped, you're fine, but considering the executors take 15m to stop, it's worth a note :)20:53
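The restart sequence described above can be written down as an ordered plan with a simple check for the "don't start it until it's stopped" rule. This is only a sketch of the ordering as discussed in the channel; the step names and both function names are hypothetical and nothing here talks to real services.

```python
SERVICES = ("scheduler", "mergers", "executors")


def restart_plan():
    """Ordered steps for a full Zuul restart, following the chat discussion."""
    return [
        "save queues",
        "stop scheduler",
        "stop mergers",                # stopped early to avoid merge failures
        "stop executors",
        "start scheduler",             # as soon as the others were told to stop
        "wait for mergers to stop",    # fast
        "start mergers",
        "wait for executors to stop",  # slow, roughly 15 minutes per the log
        "start executors",
        "restore queues",
    ]


def starts_follow_stops(plan):
    """Return True if every service is started only after it was stopped."""
    return all(
        plan.index(f"stop {svc}") < plan.index(f"start {svc}")
        for svc in SERVICES
    )


plan = restart_plan()
for number, step in enumerate(plan, start=1):
    print(f"{number:2d}. {step}")
print("ordering ok:", starts_follow_stops(plan))
```

Keeping the waits as explicit steps makes the slow executor shutdown visible in the plan instead of leaving it as a side note.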
*** david-lyle has joined #openstack-infra20:54
clarkbI'm happy to drive it or let someone else do it if they want a go at it20:54
corvusclarkb: i feel like i've done the last few planned full restarts, so why don't you take a drive.  should be able to do it all from puppetmaster with ad-hoc ansible commands20:55
clarkbya I tend to do the executors and mergers from there since there are many of them then do the zuul01 stuff on host20:55
clarkbgive me a minute to prep some commands but then I'll start20:56
*** dklyle_ has quit IRC20:57
corvusclarkb: hang on just a second20:57
clarkbok20:57
*** diablo_rojo has quit IRC20:58
corvusclarkb: i've been working on spiffing up pabelanger's playbooks.  i think i have a stop playbook which matches our process.  maybe we can try that out and confirm it works?20:58
*** eernst_ has joined #openstack-infra20:58
corvushttp://paste.openstack.org/show/726768/20:58
clarkbsure20:58
* clarkb looks20:58
corvuspabelanger: ^20:58
*** eernst has quit IRC20:59
corvuswe'll want to run that with "-f 20" or something for all the executors21:00
pabelangercorvus: looks right at first glance21:00
clarkbcorvus: we may be able to optimize it a bit by running some of the stops in parallel, but otherwise I think that should work21:00
clarkbdon't forget to save queues first as well21:00
corvusand obviously, it's not supposed to return until the executors are all stopped, but we'll be wanting to start things before then.  but hopefully we can at least confirm that the stop commands and wait_for's all work21:00
clarkbyup and we can work things on a side channel while that runs21:01
corvus++21:01
clarkbcorvus: do you want to run it?21:02
corvusclarkb: probably best if you do; you can self-coordinate the starts, etc.21:02
clarkbok21:03
*** edmondsw has quit IRC21:04
clarkb`sudo ansible-playbook -f 20 /home/clarkb/playbooks/stop_zuul.yaml` is what I'll be running that look right to you all?21:04
corvusclarkb: yep21:05
*** edmondsw has joined #openstack-infra21:05
clarkbok I'll save queues first then run that here in a minute21:05
*** yamahata has quit IRC21:05
*** eernst has joined #openstack-infra21:05
*** eernst_ has quit IRC21:05
*** r-daneel has joined #openstack-infra21:05
clarkbqueues saved, now running playbook21:07
clarkbok I think there is a bug21:08
clarkbbut it should be fine21:08
fungii guess someone will push that up as an addition to the system-config repo once it's tested? or do we have somewhere to start putting zuul-specific deployment and lifecycle management bits?21:08
clarkbor maybe not21:08
clarkbthe wait didn't wait as long as I expected but the scheduler is indeed stopped21:08
corvusfungi: yeah, i'm prepping that now21:08
fungiawesome21:08
clarkbI'm starting scheduler and tertiary services now21:08
corvusclarkb: scheduler takes maybe 10 seconds to stop?  the executors should be a really obvious test21:09
*** edmondsw has quit IRC21:10
clarkbcorvus: it returned from executors already too21:10
corvusokay so that's still broke.21:10
*** pbourke has quit IRC21:10
clarkbchecking if they stopped now (scheduler and mergers should be starting now)21:10
clarkbI double checked ps before starting those though21:11
corvusthere are indeed no pidfiles in /var/run/zuul on the executors21:11
clarkbya executors have not stopped yet21:11
corvusi wouldn't expect them to be removed until it's stopped21:12
clarkbsummary to this point is queues saved and all services have been asked to stop. Scheduler, web, fingergw and mergers have been started after stopping. Waiting on executors to actually stop now so they can be restarted21:12
clarkbcorvus: possible bug in the init script21:12
clarkbcorvus: also looks like paramiko still tries the ed25519 key :/21:13
*** njohnston has left #openstack-infra21:13
clarkbit would be nice if people didn't configure broken keys on gerrit21:13
clarkb(but maybe that is a gerrit bug in that if you don't have one it makes one?)21:14
clarkbenqueuing changes now21:14
corvusclarkb: hrm, i'll look into the paramiko issue.21:15
corvusclarkb: (oh! i bet our init scripts remove the pid file on stop.  i think we can stop doing that now)21:15
clarkbcorvus: ya I think that may be the init script bug21:16
*** e0ne has joined #openstack-infra21:16
*** r-daneel has quit IRC21:16
clarkbzuul mergers are stopping I'll start them one by one so that we can start processing jobs21:16
corvusclarkb: you mean executors?21:17
clarkbyes sorry21:17
fungii believe at start gerrit generates any "missing" host keys it thinks it supports (i've certainly never manually generated host keys when bootstrapping it)21:17
pabelangercorvus: I think you are right, pid file deleted by init script21:18
pabelangercan look into that21:18
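Because the init script removes the pid file as soon as stop is requested, waiting on the pid file returns before the process has actually exited. One alternative, shown here only as a sketch and not as the fix proposed in the channel, is to poll the process itself. The function names are hypothetical and the approach is POSIX-only.

```python
import os
import subprocess
import sys
import time


def pid_is_running(pid):
    """Return True if a process with this pid currently exists (POSIX only)."""
    try:
        os.kill(pid, 0)  # signal 0 only checks for existence, nothing is sent
    except ProcessLookupError:
        return False
    except PermissionError:
        return True  # the process exists but belongs to another user
    return True


def wait_for_exit(pid, timeout=30.0, interval=0.1):
    """Poll until the process is gone; return False if the timeout expires."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        if not pid_is_running(pid):
            return True
        time.sleep(interval)
    return False


# Demonstration with a short-lived child process.
child = subprocess.Popen([sys.executable, "-c", "pass"])
child.wait()  # reap the child so it does not linger as a zombie
child_gone = wait_for_exit(child.pid, timeout=5.0)
print("child exited:", child_gone)
```

A pid read from the pid file before issuing the stop could be passed to wait_for_exit, with the timeout sized for the slow executor shutdown mentioned earlier.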
*** yamahata has joined #openstack-infra21:18
corvusok hrm21:21
clarkbAll services are running new code now. Check queue is being reenqueued (gate is done)21:22
corvususing the known_hosts that puppet wrote does not cause paramiko to use rsa.  but it's a valid known_hosts -- if i use it with openssh, it's fine.  no complaints about unknown keys.21:22
corvusbut if i clear it out and use openssh and accept the host key (so openssh writes it), *that* makes paramiko use the rsa key21:23
clarkbweird21:23
corvusthe main difference being that openssh is writing the hashed index21:23
openstackgerritPaul Belanger proposed openstack-infra/puppet-zuul master: Stop deleting PIDFILE on stop  https://review.openstack.org/58669921:23
clarkbmaybe paramiko treats hashed index as higher priority than unhashed?21:23
pabelangercorvus: ^pidfile for zuul21:23
clarkbcheck is enqueued, from my position of performing the mechanical changes we are done21:25
clarkbI haven't seen any errors other than the ssh host key failure to opendaylight21:25
pabelangerclarkb: corvus: not today, we should also look to delete git repo on mergers / executors to ensure they are now created with 0755 permissions. As they still are 777.21:26
corvusoh.  the port.21:27
*** pbourke has joined #openstack-infra21:27
clarkbcorvus: oh!21:27
corvus[git.opendaylight.org]:29418,[2600:1f14:421:f500:7b21:2a58:ab0a:2d17]:29418 ssh-rsa21:28
corvusAAAAB3NzaC1yc2EAAAABIwAAAQEAyRXyHEw/P1iZr/fFFzbodT5orVV/ftnNRW59Zh9rnSY5Rmbc9aygsZHdtiWBERVVv8atrJSdZool75AglPDDYtPICUGWLR91YBSDcZwReh5S9es1dlQ6fyWTnv9QggSZ98KTQEuE3t/b5SfH0T6tXWmrNydv4J2/mejKRRLU2+oumbeVN1yB+8Uau/3w9/K5F5LgsDDzLkW35djLhPV8r0OfmxV/cAnLl7AaZlaqcJMA+2rGKqM3m3Yu+pQw4pxOfCSpejlAwL6c8tA9naOvBkuJk+hYpg5tDEq2QFGRX5y1F9xQpwpdzZROc5hdGYntM79VMMXTj+95dwVv/8yTsw==21:28
corvusthat's what openssh generates when i tell it to disable hashing.  that makes it and paramiko happy21:28
corvusi think since zuul should keep retrying, it's probably connected to ODL during one of the times when i manually had a working config in place.  so we probably don't need to revert or restart.  we can just fix going forward.21:29
clarkbcorvus: ++21:29
clarkbzuul web seems happy, log streaming works. failed jobs appear to be valid failures21:29
corvus2018-07-27 21:28:40,548 DEBUG zuul.Scheduler: Processing trigger event <GerritTriggerEvent comment-added git.opendaylight.org/mdsal master 74289,7 Verified:0, Code-Review:0>21:30
pabelangeryay21:30
fungiahh, yep, without the port number it assumes it's for the default ssh port and probably just ignores it21:30
clarkbwe should double check that we report success/failure properly since we made that change to didalljobssucceed21:30
clarkbfungi: whats odd is openssh seems to consider it valid for 29418 too based on corvus' testing21:31
fungiodd that openssh treats it more like a port wildcard21:31
fungiyep, that21:31
corvusmaybe it's a generous fallback behavior21:32
corvusor, maybe it's an exploitable error :)21:32
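The working entry above uses the bracketed "[host]:port" form that known_hosts expects for servers on a non-default port. A minimal sketch of generating such a line follows; the function name is hypothetical and the key material is a placeholder, not the real ODL host key.

```python
DEFAULT_SSH_PORT = 22


def known_hosts_entry(hostnames, port, key_type, key_b64):
    """Format one unhashed known_hosts line for the given hosts and port."""
    if port == DEFAULT_SSH_PORT:
        hosts = list(hostnames)
    else:
        # Non-default ports are written as [host]:port.
        hosts = [f"[{name}]:{port}" for name in hostnames]
    return f"{','.join(hosts)} {key_type} {key_b64}"


# Placeholder key material; only the host field format matters here.
entry = known_hosts_entry(
    ["git.opendaylight.org"], 29418, "ssh-rsa", "PLACEHOLDERKEY"
)
print(entry)
```

An entry written without the brackets and port names the host on port 22, which is consistent with the paramiko behaviour seen in the chat.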
*** rfolco|ruck is now known as rfolco|off21:33
openstackgerritJames E. Blair proposed openstack-infra/system-config master: Add playbooks to start/stop/restart zuul  https://review.openstack.org/58670621:33
corvusi'll work on updating the kesy21:33
corvuskeys21:33
corvusthere was an empty line between the two keys, presumably because {gerrit_ssh_host_key} has a newline at the end of it, and i put a \n after that.  would you like me to keep doing that (to protect against us removing the newline from gerrit_ssh_host_key in the future), or remove the extra \n i added to keep the file tidy?21:38
clarkbit is a machine managed and read file I think its probably fine to have extra whitespace if that makes the config mgmt easier to read21:38
openstackgerritJames E. Blair proposed openstack-infra/system-config master: Fix syntax of gerrit host ssh keys  https://review.openstack.org/58671021:39
fungii like the newline in the template as insurance against a missing newline in the variable itself21:39
fungior, rather, i think expecting newlines at the end of variable values is a mistake21:40
corvus(also, this is something that would be great to make better when we next revisit how we handle these data; string concatenation is not going to scale here)21:40
fungibetter to assume they'll be missing and end up with a blank line than to end up accidentally concatenating two of them21:40
fungiyeah, that too. string concatenation is a poor substitute for an actual template21:41
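The newline concern can be sidestepped by normalising each key before joining, so the result does not depend on whether a variable happens to end with a newline. This is a sketch of that idea, not what the change under review does; the function name and sample values are hypothetical.

```python
def render_known_hosts(entries):
    """Join key entries so each ends with exactly one newline.

    Surrounding whitespace is stripped first, so a trailing newline in an
    input value neither creates a blank line nor gets lost.
    """
    lines = [entry.strip() for entry in entries]
    return "".join(line + "\n" for line in lines if line)


# One value ends with a newline and one does not.
rendered = render_known_hosts([
    "review.example.org ssh-rsa KEYONE\n",
    "[git.example.org]:29418 ssh-rsa KEYTWO",
])
print(rendered, end="")
```

With this approach neither a blank line nor two entries run together can appear, whichever way the inputs are terminated.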
*** boden has quit IRC21:42
openstackgerritMerged openstack-infra/zuul master: Point Suse users are zookeeper releases page  https://review.openstack.org/58668221:43
pabelangerokay, I have to leave now, but can check back later this evening!21:43
clarkbpabelanger: thanks21:43
clarkbpabelanger: also its friday you should probably do something other than work this evening :)21:43
*** kjackal has quit IRC21:44
*** eernst has quit IRC21:44
*** pguimaraes has quit IRC21:44
pabelanger++21:45
*** rtjure has quit IRC21:46
*** eernst has joined #openstack-infra21:48
*** eernst_ has joined #openstack-infra21:49
*** eernst has quit IRC21:49
*** neiloy has quit IRC21:53
*** eernst_ has quit IRC21:53
*** eernst has joined #openstack-infra21:54
*** eernst has quit IRC21:59
*** e0ne has quit IRC22:00
*** sshnaidm|off has joined #openstack-infra22:04
*** hamerins has quit IRC22:05
*** felipemonteiro_ has quit IRC22:06
*** shaner has quit IRC22:06
*** shaner has joined #openstack-infra22:07
*** rh-jelabarre has quit IRC22:07
*** rtjure has joined #openstack-infra22:13
corvusso... anyone have thoughts on how to get openstacksdk installed on the executors?22:13
corvusi guess we could just add it to puppet with the pip3 provider?22:14
clarkbcorvus tobiash seemed to think we could add it as a zuul dep22:14
clarkb(I'm not sure that is best route though)22:15
*** rh-jelabarre has joined #openstack-infra22:15
corvushrm, i'm not sure i'm keen on that.  zuul-jobs could end up having a lot of optional dependencies.  i guess we could argue that *this one* is special....22:15
clarkbmaybe list it as an extras require?22:15
corvusif we get around to supporting multiple ansible versions, and we have zuul build the venvs, then we could add things like this to a zuul config file so it could install them.  that sounds ideal, but that's rather a bit in the future.22:16
clarkbI'd be ok with an explicit out of band install to start that will be forward compatible if we add it to the dep list22:16
corvuswe should also probably think about image builds....22:17
openstackgerritMerged openstack-infra/zuul-jobs master: Add upload-logs-swift role  https://review.openstack.org/58454122:17
corvusie, if we want to run our executors from pbrx containers22:17
*** neiloy has joined #openstack-infra22:17
clarkbDoes pbrx handle extras requires yet? that may be a good option there22:18
corvusthat's probably an easy one too.  there's a way to add extra packages to the zuul-executor image by adding them to setup.cfg.22:18
clarkbthen you can pip install zuul[executor] and get that stuff only on that image22:18
corvusara is added like that.22:18
corvusclarkb: yeah, it uses extras22:18
corvushttp://git.zuul-ci.org/cgit/zuul/tree/setup.cfg#n5522:18
corvusokay, i think i like the idea of us adding it to puppet and extras, but keeping it out of the main requirements file for now.22:19
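The extras approach can be illustrated by reading an extras section from a setup.cfg-style file. This is a sketch under stated assumptions: the section layout, the extra name "executor" and its contents are hypothetical stand-ins, not a copy of Zuul's real setup.cfg.

```python
import configparser

# Hypothetical setup.cfg fragment; the real file may differ.
SETUP_CFG = """
[extras]
executor =
    ara
    openstacksdk
"""


def extras_requirements(cfg_text, extra):
    """Return the list of packages declared for one extra."""
    parser = configparser.ConfigParser()
    parser.read_string(cfg_text)
    # Multi-line values come back as one string; split on whitespace.
    return parser["extras"][extra].split()


packages = extras_requirements(SETUP_CFG, "executor")
requirement = "zuul[executor]"  # what would be passed to pip install
print(requirement, "pulls in", packages)
```

Packages listed this way are only installed when the extra is requested, which keeps the main requirements file minimal as discussed.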
openstackgerritJames E. Blair proposed openstack-infra/zuul master: Add openstacksdk to executor extras  https://review.openstack.org/58671722:20
*** shaner has quit IRC22:23
*** shaner has joined #openstack-infra22:24
openstackgerritJames E. Blair proposed openstack-infra/puppet-zuul master: Install openstacksdk on zuul executors  https://review.openstack.org/58671822:24
*** sambetts_ has quit IRC22:24
*** sambetts_ has joined #openstack-infra22:26
*** tpsilva has quit IRC22:26
clarkbboth changes lgtm. I think generally being helpful with dependency lists even at the cost of disk is probably a good thing22:26
corvusclarkb: yeah, except that the actual requirements.txt is very frequently automatically installed in a way that is difficult or impossible for people to override.  some users of zuul have requested that we keep the fixed requirement to a minimum.  so it's a tricky balancing act.22:31
corvushappily, openstacksdk doesn't impose a lot of additional requirements of its own.22:32
fungiextras are also problematic come uninstall or upgrade time22:32
clarkbcorvus: ya I think extras is a reasonable compromise, you have to opt into using them but its still an automatable thing if you chose to consume them22:32
clarkbfungi: how so?22:32
clarkbfungi: you can put them on the same pip install/uninstall command and it should sort it all out22:33
fungilast i heard (unless it's been fixed) pip doesn't know that packages installed via extras are dependencies when you do upgrades22:33
clarkbfungi: ya they are like distinct lists, you have to list them together if using them22:33
fungithough, yes, including the extras on the upgrade command i guess works22:33
fungias for container images, if the size isn't significantly different i expect having one zuul image which can be used in a variety of configurations for different combinations of subservices might be more user-friendly22:34
clarkbfungi: I think the tricky thing doing that is that (docker at least) bakes in the command to run along with the image itself22:35
fungibut i'm not familiar enough with container ecosystems to know for sure whether that makes a difference to people using such installation methods22:35
clarkbso you end up needing different images for different commands even if they are basically the same22:35
fungimmm, so if you wanted a container which ran multiple daemons how would it handle that?22:36
clarkbfungi: I think the container crowd would say not to do that, but aiui if you want that you make your command to run an init system22:36
fungii have a hard time believing docker containers only ever have one running process per container22:36
clarkbfungi: thats the design pattern22:36
fungilots of software expects supporting processes22:37
clarkbfungi: ya 'sidecar' containers in k8s terminology are how you handle that22:37
* fungi checks to see how many processes his web browser is currently comprised of22:37
clarkbForking like a web browser or zuul-executor is fine22:37
clarkbbut it won't start distinct processes for you22:38
clarkbjust the one (and that one has to fork)22:38
corvusfungi: the pbrx system is building a container image for each setuptools entrypoint.  so we have a distinct executor image (but some common things are shared in a base image which gets layered in)22:38
*** agopi has quit IRC22:38
clarkbcorvus: ya and I think the design there is largely dictated by how docker expects to run containers (one top level process per container image)22:39
corvusand yeah, there's even a "zuul" image for running the zuul CLI22:39
fungiamusing22:39
corvus(so, in the container world, when we need to enqueue, the command will be a wee bit longer, but substantially similar -- we'll still log into zuul01 and "run the enqueue command")22:40
fungiso we wouldn't run zuul-web and apache in the same container, there'd be a zuul-web container and an apache container and they'd communicate via virtual interfaces exposed into each of those?22:41
corvusfungi: i think you can turn off the network namespacing to make that less weird, and i think that's what mordred has planned22:42
clarkbfungi: yes (except I think to start we are using the host's networking so they will see regular ol' eth0 and have to use distinct ports)22:42
clarkbcorvus: yup in part because it makes things like the zuul enqueue commands simpler22:42
clarkb(since they rely on gearman which relies on tcp)22:42
*** hongbin_ has quit IRC22:42
corvusat the end of the day, it's probably going to feel more like a (very baroque) chroot for us.  :)22:43
clarkbfungi: if we wanted different network stacks we can do that too and it sets up a bridge-like system, like neutron22:44
corvus(albeit, one which is not convenient to cd into)22:44
clarkbfungi: but then you have to sort out how to plug one system into another and that is made much easier if you jump to kubernetes22:44
clarkb(though I use docker swarm on my single node server at home and it works well enough for that)22:44
clarkbcorvus: you can exec a shell into a container pretty easily as long as the shell exists on the image22:44
corvusyep22:45
clarkbdocker exec22:45
openstackgerritJames E. Blair proposed openstack-infra/zuul master: WIP: Support line comments in Gerrit  https://review.openstack.org/57703522:48
fungidoesn't sound too bad, just cumbersome to operate (in exchange for being less cumbersome to deploy i suppose)22:49
fungiand more portable (albeit in much the same way that java applications are "portable")22:50
corvusyeah -- conceivably with some helper scripts installed for each of the images, it could be nearly transparent; "zuul enqueue" could just work22:50
clarkbfungi: ya control over what is installed and when it changes is a big part of the draw. You can do neat stuff with namespacing beyond that but it seems to be the major win for most22:52
*** shaner has quit IRC22:57
*** shaner has joined #openstack-infra22:59
*** mschuppert has quit IRC23:03
clarkbthis conversation inspired me to look at changes to the zuul container spec again. Other than one small nit I think it is ready to go23:08
clarkbcorvus: ^23:08
clarkband now I must find somewhere to plug laptop into the wall23:08
*** rpioso is now known as rpioso|afk23:09
*** harlowja has quit IRC23:09
*** neiloy has quit IRC23:15
openstackgerritJames E. Blair proposed openstack-infra/zuul master: Support line comments in Gerrit  https://review.openstack.org/57703523:20
*** myoung has quit IRC23:22
*** shaner has quit IRC23:23
pabelangerclarkb: If okay with you, I might experiment with upgrading ze01 hwe kernel this weekend, now that we've restarted zuul services.  Based on reading kernel release notes, I think it might be what is giving us such an improvement on ze11.o.o for ansible-playbooks23:27
*** shaner has joined #openstack-infra23:29
mordredpabelanger: that would be a great piece of info to learn if it turns out to be the case23:29
pabelangerindeed, for reasons unknown, ze11.o.o is way ahead with number of concurrent ansible-playbook processes. Hopefully ze01.o.o gets the same boost with latest kernel23:30
*** rh-jelabarre has quit IRC23:33
clarkbya I think it is worth trying23:33
openstackgerritMerged openstack-infra/zuul master: scheduler: return project_canonical in status page  https://review.openstack.org/58245123:34
pabelangerclarkb: actually, looks like it's already installed, just need to reboot the server to pick it up23:35
pabelangerI'll do that in a bit23:36
*** rlandy has quit IRC23:37
*** gongysh has joined #openstack-infra23:43
mordredpabelanger: what's the change in the kernel changelog?23:45
mordredor release notes or whatever?23:46
*** diablo_rojo has joined #openstack-infra23:47
clarkbThere were pti improvements iirc23:49
pabelangerlet me see where I found it23:52
pabelangerhttps://kernelnewbies.org/Linux_4.15#Memory_management23:53
mordredpabelanger: thanks23:53
openstackgerritJames E. Blair proposed openstack-infra/zuul master: Support line comments in Gerrit  https://review.openstack.org/57703523:53
pabelangersome interesting patches for speedup23:53
pabelangerhave zero idea, if that is the reason23:53
pabelanger:)23:53
*** wolverineav has quit IRC23:53
*** wolverineav has joined #openstack-infra23:54
mordredpabelanger: it's a worthy hypothesis23:54
mordredif it turns out to be true, you should write a little blog post about it23:55
mordredseems like the sort of thing people would like - and we've got the ability to show the impact on a real world workload at scale23:55
mordred"we upgraded the kernel and you'll never believe what happened next ..."23:55
*** eernst has joined #openstack-infra23:56
openstackgerritJames E. Blair proposed openstack-infra/zuul master: Support line comments in Gerrit  https://review.openstack.org/57703523:56
openstackgerritMichael Johnson proposed openstack-infra/openstack-zuul-jobs master: Fix publish-openstack-docs-pti check/gate html  https://review.openstack.org/58672923:56
openstackgerritPaul Belanger proposed openstack-infra/zuul-jobs master: Add role to fetch zuul logs from nodes  https://review.openstack.org/58334623:58
pabelangermordred: ++23:58
*** wolverineav has quit IRC23:59
