Wednesday, 2018-12-12

*** tosky has quit IRC00:01
*** smarcet has quit IRC00:07
clarkbmelwitt: there is definitely something going on there and I undersatnd it less after looking at your example00:08
clarkblater in the console log it outputs the route table and ip-route:default via 10.1.0.1 dev eth0 is in there00:09
clarkbwhich is what I think WARN: failed: route add -net "0.0.0.0/0" gw "10.1.0.1" would give you00:09
clarkband if that can happen without the disk trouble than maybe that is just noise00:09
clarkbnow I wonder if there is a neutron bug00:10
clarkband I just got distracted by shiny error and warning messages00:10
melwittyeah, I noticed that too in the route table dump but didn't understand it00:10
clarkbmelwitt: we may want to get someone from neutron to look at some of these with fresh eyes and see if they notice anything less shiny and distracting00:11
melwitthaha, indeed those errors and warnings are shiny00:11
*** _alastor_ has joined #openstack-infra00:11
melwittok. I think sean-k-mooney might be able to help here, so I'll ask tomorrow00:12
clarkbianw: I am able to login to github with the new account given the details you provided00:13
clarkbianw: so that seems to work00:13
clarkbmelwitt: also given the number of hits changing the query finds (and themall being failures) maybe we should update the bug and query00:13
clarkbit is odd that if the route add was unrelated that they would all be failures so maybe there is a thread there to pull on00:13
*** wolverineav has quit IRC00:14
melwittI was thinking similar, that if the WARN is unrelated, why is it all failures. I did notice a lot of dupes in the build uuids though, so I'm not sure if there's anything I can do to reduce those with the query too00:14
melwittlots of hits were on the same build uuid I mean00:15
clarkbe-r will dedup them when it makes the graphs iirc00:15
clarkband ya that will happen if multiple cirros boots have the same issue in one job00:15
melwittk00:15
clarkbthere isn't a built in uniq filter we can use with elasticsearch though00:15
*** smarcet has joined #openstack-infra00:18
*** wolverineav has joined #openstack-infra00:18
clarkbin ansible if a var is set in one role via defaults and vars dir yaml files are those vars not available to subsequent roles?00:21
clarkbhttp://logs.openstack.org/25/624525/1/check/openstack-infra-multinode-integration-centos-7/05ed114/job-output.txt.gz#_2018-12-11_23_58_47_946410 implies that that may be the case00:21
clarkbdmsimard: pabelanger ^ for some reason I thought vars were global but I must be misunderstanding?00:21
*** dklyle has quit IRC00:21
openstackgerritmelanie witt proposed openstack-infra/elastic-recheck master: Update query for bug 1808010  https://review.openstack.org/62453300:21
openstackbug 1808010 in OpenStack-Gate "Tempest cirros boots fail due to lack of disk space" [Undecided,New] https://launchpad.net/bugs/180801000:21
*** wolverineav has quit IRC00:23
melwittclarkb: you know what though, we only index the cirros log if a job fails, so we would only see the WARN message on a job failure, even if it's unrelated to the fail (for example, if it's always emitted)00:25
melwitt*we only collect the cirros log if a job fails00:25
openstackgerritClark Boylan proposed openstack-infra/zuul-jobs master: Use mirrors if available when installing OVS on centos  https://review.openstack.org/62452500:27
clarkbmelwitt: oh good point since tempest dumps that data out on failure only00:28
clarkbso ya could be entirely unrelated and we have some other networking bug there00:28
melwittyeah00:29
clarkbwe probaly do want to sort out the cirros issues even if they aren't failures (so that they don't become shiny fly traps for us in the future)00:29
clarkbbut definitely lower priority if that is the case00:29
*** dklyle has joined #openstack-infra00:34
*** xek__ has joined #openstack-infra00:37
*** tpsilva has quit IRC00:37
*** smarcet has quit IRC00:39
*** xek_ has quit IRC00:39
clarkbmelwitt: ok, rereading the tempest portion of the log from logs.openstack.org/76/582376/8/gate/tempest-full-py3/a8f62b6/ it appears the paramiko says it is connecting so tcp works00:43
clarkbthe failure is with ssh public key auth00:43
clarkbmelwitt: that is the same as your example00:43
clarkbmelwitt: failed to get http://169.254.169.254/2009-04-04/meta-data/public-keys/0/openssh-key is an error from yours00:46
clarkbmelwitt: is config drive enabled on your job?00:46
*** wolverineav has joined #openstack-infra00:48
*** Swami has quit IRC00:49
*** armax has joined #openstack-infra00:49
clarkbb'failed to get http://169.254.169.254/2009-04-04/meta-data/public-keys' error on mine00:53
clarkbnow I'm fairly certain we are seeing a bug in the instance -> neutron metadataapi -> nova (metadata)api services00:53
clarkbI've got to go now to keep an eye on kids, but hopefully ^ is useful00:56
*** jamesmcarthur has joined #openstack-infra00:57
ianwclarkb: thanks, if we get votes on https://review.openstack.org/#/c/624531/ i'll consider that a sign to add permissions to the account00:58
*** jamesmcarthur has quit IRC00:59
*** agopi has joined #openstack-infra01:03
*** jamesmcarthur has joined #openstack-infra01:04
*** dklyle has quit IRC01:04
*** rlandy has quit IRC01:05
*** dave-mccowan has joined #openstack-infra01:15
*** sthussey has quit IRC01:19
*** mriedem has quit IRC01:23
*** rh-jelabarre has quit IRC01:24
*** _alastor_ has quit IRC01:25
*** markvoelker has quit IRC01:41
*** jamesmcarthur has quit IRC01:41
*** jamesmcarthur has joined #openstack-infra01:42
*** anteaya has quit IRC01:45
*** jamesmcarthur has quit IRC01:46
*** hwoarang has quit IRC01:48
*** hwoarang has joined #openstack-infra01:53
*** jamesmcarthur has joined #openstack-infra02:00
*** armax has quit IRC02:02
*** jamesmcarthur has quit IRC02:05
*** pots has joined #openstack-infra02:05
*** mrsoul has quit IRC02:09
openstackgerritClark Boylan proposed openstack-infra/zuul-jobs master: Use mirrors if available when installing OVS on centos  https://review.openstack.org/62452502:11
clarkbmwhahaha: ^ I think I got it right that time02:11
clarkbmelwitt: my ssh debugging script that I added to tempest a while back should try and print the authorized user ssh keys02:15
clarkbmelwitt: mriedem looking at http://logs.openstack.org/76/582376/8/gate/tempest-full-py3/a8f62b6/job-output.txt.gz#_2018-12-11_11_17_44_640445 there are no contents in that so I think that confirms the issue is related to metadta service not providing that data back02:16
*** wolverineav has quit IRC02:17
*** wolverineav has joined #openstack-infra02:18
*** armax has joined #openstack-infra02:18
*** xarses has joined #openstack-infra02:21
*** wolverineav has quit IRC02:22
*** xarses_ has joined #openstack-infra02:22
*** jrist has quit IRC02:25
*** xarses has quit IRC02:26
*** fuentess has quit IRC02:26
clarkbwhat is really weird is the q-meta log file shows that path beying GETted and repsonding with a 20002:27
clarkbhowever it took 10 seconds looks like which is maybe beyond that cirros timeout?02:27
clarkbmelwitt: mriedem: I wonder if the slowness logged at http://logs.openstack.org/76/582376/8/gate/tempest-full-py3/a8f62b6/controller/logs/screen-n-api-meta.txt.gz#_Dec_11_10_46_57_286705 is leading to cirros client to timeout and igve up02:29
clarkbany idea where to look for that lost time?02:29
*** bobh has quit IRC02:30
clarkbhttps://github.com/XANi/cirros/blob/master/src/usr/bin/ec2metadata#L5 the cirros timeout is 10 seconds so we are just over02:35
*** hongbin has joined #openstack-infra02:44
*** bhavikdbavishi has joined #openstack-infra02:48
*** jamesmcarthur has joined #openstack-infra02:56
*** ykarel|away has joined #openstack-infra02:58
*** agopi has quit IRC02:59
*** xarses_ has quit IRC03:01
*** bobh has joined #openstack-infra03:03
*** ykarel|away has quit IRC03:05
*** agopi has joined #openstack-infra03:06
*** bobh has quit IRC03:07
*** apetrich has quit IRC03:15
*** jamesmcarthur has quit IRC03:24
*** jamesmcarthur has joined #openstack-infra03:25
*** armax has quit IRC03:27
*** psachin has joined #openstack-infra03:27
melwittclarkb: thanks for all that info. this is the code that could get time consuming https://github.com/openstack/nova/blob/master/nova/api/metadata/base.py#L117-L119 it makes at least one call to neutron for security groups. looking through it now (not already familiar with it)03:29
*** wolverineav has joined #openstack-infra03:30
*** jamesmcarthur has quit IRC03:31
*** bobh has joined #openstack-infra03:32
*** bobh has quit IRC03:36
*** jamesmcarthur has joined #openstack-infra03:36
*** jamesmcarthur has quit IRC03:42
*** jamesmcarthur has joined #openstack-infra03:44
*** Tengu has quit IRC03:51
*** dave-mccowan has quit IRC03:52
*** hwoarang has quit IRC03:56
*** Tengu has joined #openstack-infra03:58
*** yamamoto has quit IRC04:02
*** hwoarang has joined #openstack-infra04:03
*** yamamoto has joined #openstack-infra04:06
*** ykarel|away has joined #openstack-infra04:08
*** bobh has joined #openstack-infra04:14
openstackgerritClark Boylan proposed openstack-infra/zuul-jobs master: Use mirrors if available when installing OVS on centos  https://review.openstack.org/62452504:17
*** udesale has joined #openstack-infra04:17
*** bobh has quit IRC04:18
*** jamesmcarthur has quit IRC04:23
*** bobh has joined #openstack-infra04:23
*** jamesmcarthur has joined #openstack-infra04:24
*** bobh has quit IRC04:27
*** bobh has joined #openstack-infra04:31
*** yamamoto has quit IRC04:31
*** bobh has quit IRC04:34
*** pots has quit IRC04:43
*** bobh has joined #openstack-infra04:43
*** pots has joined #openstack-infra04:44
*** bobh has quit IRC04:48
*** yamamoto has joined #openstack-infra04:50
*** yamamoto has quit IRC04:52
*** yamamoto has joined #openstack-infra04:53
*** hongbin has quit IRC04:55
openstackgerritMichael Johnson proposed openstack-infra/project-config master: Add publish-to-pypi for octavia-lib  https://review.openstack.org/62457404:56
prometheanfirejohnsom: :D04:57
*** jamesmcarthur has quit IRC04:58
johnsomprometheanfire: coming to a g-r near you soon...04:58
*** xarses has joined #openstack-infra04:59
*** xarses has quit IRC04:59
*** xarses has joined #openstack-infra04:59
*** bobh has joined #openstack-infra05:00
*** bobh has quit IRC05:04
openstackgerritIan Wienand proposed openstack-infra/zuul-jobs master: mirror-workspace-git-repos: Explicitly show HEAD of checked out branches  https://review.openstack.org/62184005:06
openstackgerritIan Wienand proposed openstack-infra/zuul-jobs master: Update test-mirror-workspace-git-repos, add test  https://review.openstack.org/62457505:06
ianwAJaeger: to your question yesterday, i think we need to merge ^ and then it will test ^^05:07
*** ykarel|away has quit IRC05:09
*** bobh has joined #openstack-infra05:09
*** bobh has quit IRC05:14
*** wolverineav has quit IRC05:17
*** dhellmann has quit IRC05:19
*** rtjure has quit IRC05:19
*** dhellmann has joined #openstack-infra05:20
*** rtjure has joined #openstack-infra05:22
*** wolverineav has joined #openstack-infra05:24
*** ykarel|away has joined #openstack-infra05:25
*** ykarel|away is now known as ykarel05:26
*** jamesmcarthur has joined #openstack-infra05:34
openstackgerritIan Wienand proposed openstack-infra/zuul-jobs master: Add a note on testing  https://review.openstack.org/62457805:44
*** bobh has joined #openstack-infra05:46
*** bobh has quit IRC05:51
*** dklyle has joined #openstack-infra05:51
*** dklyle has quit IRC05:56
*** _alastor_ has joined #openstack-infra06:06
*** wolverineav has quit IRC06:09
*** hwoarang has quit IRC06:16
*** hwoarang has joined #openstack-infra06:22
*** dayou has quit IRC06:22
*** dayou has joined #openstack-infra06:23
*** bobh has joined #openstack-infra06:26
*** slaweq has joined #openstack-infra06:29
*** yboaron_ has joined #openstack-infra06:29
*** lpetrut has joined #openstack-infra06:30
*** bobh has quit IRC06:31
*** ramishra has quit IRC06:32
*** ramishra has joined #openstack-infra06:33
*** bobh has joined #openstack-infra06:34
*** bhavikdbavishi1 has joined #openstack-infra06:36
*** bhavikdbavishi has quit IRC06:37
*** bhavikdbavishi1 is now known as bhavikdbavishi06:37
*** bobh has quit IRC06:39
*** jamesmcarthur has quit IRC06:42
*** _alastor_ has quit IRC06:45
*** kjackal has joined #openstack-infra06:52
openstackgerritMerged openstack-infra/project-config master: Add openstack/os-api-ref to #openstack-doc  https://review.openstack.org/62301307:05
openstackgerritMerged openstack-infra/project-config master: Remove openstack/osc-placement from #openstack-nova  https://review.openstack.org/62298707:05
*** quiquell|off is now known as quiquell07:13
*** e0ne has joined #openstack-infra07:13
*** masayukig[m] has joined #openstack-infra07:13
*** bobh has joined #openstack-infra07:15
*** xarses_ has joined #openstack-infra07:17
*** jamesmcarthur has joined #openstack-infra07:18
*** bobh has quit IRC07:19
*** jamesmcarthur has quit IRC07:23
*** wolverineav has joined #openstack-infra07:25
*** openstackgerrit has quit IRC07:29
*** dklyle has joined #openstack-infra07:29
*** wolverineav has quit IRC07:29
*** rcernin has quit IRC07:30
*** alexchadin has joined #openstack-infra07:31
*** jamesmcarthur has joined #openstack-infra07:40
*** jamesmcarthur has quit IRC07:44
*** lpetrut has quit IRC07:46
*** pgaxatte has joined #openstack-infra07:50
*** xarses has quit IRC07:50
*** xarses has joined #openstack-infra07:50
*** bobh has joined #openstack-infra07:55
*** apetrich has joined #openstack-infra07:55
*** bobh has quit IRC08:00
*** ahosam has joined #openstack-infra08:01
*** jamesmcarthur has joined #openstack-infra08:01
*** ahosam has quit IRC08:01
*** ahosam has joined #openstack-infra08:01
*** ahosam has quit IRC08:02
*** shardy has joined #openstack-infra08:05
*** jtomasek has joined #openstack-infra08:05
*** jamesmcarthur has quit IRC08:05
*** dayou has quit IRC08:10
*** bobh has joined #openstack-infra08:13
*** openstackgerrit has joined #openstack-infra08:16
openstackgerritMerged openstack-infra/system-config master: Mirror Stein on Ubuntu from Cloud Archive  https://review.openstack.org/62123108:16
*** bobh has quit IRC08:17
*** yboaron_ has quit IRC08:18
*** shardy has quit IRC08:21
*** shardy has joined #openstack-infra08:22
*** jamesmcarthur has joined #openstack-infra08:22
*** imacdonn has quit IRC08:23
*** imacdonn has joined #openstack-infra08:23
*** jamesmcarthur has quit IRC08:26
*** dayou has joined #openstack-infra08:28
*** e0ne has quit IRC08:31
*** dklyle has quit IRC08:32
evrardjpis https://review.openstack.org/#/c/624484/2 scary , or a welcomed addition ?08:33
*** bhavikdbavishi has quit IRC08:35
*** ccamacho has joined #openstack-infra08:37
*** dayou has quit IRC08:43
*** markvoelker has joined #openstack-infra08:44
*** dayou has joined #openstack-infra08:44
*** bobh has joined #openstack-infra08:45
*** tosky has joined #openstack-infra08:46
*** markvoelker has quit IRC08:49
openstackgerritJean-Philippe Evrard proposed openstack-infra/zuul-jobs master: Add docker insecure registries feature  https://review.openstack.org/62448408:51
*** jpena|off is now known as jpena08:55
*** jonher_ has joined #openstack-infra08:57
*** fresta_ has joined #openstack-infra08:57
*** jonher has quit IRC09:00
*** jonher_ is now known as jonher09:00
*** jpich has joined #openstack-infra09:01
*** fresta has quit IRC09:01
*** jamesmcarthur has joined #openstack-infra09:04
*** yamamoto has quit IRC09:06
*** yboaron_ has joined #openstack-infra09:07
*** jamesmcarthur has quit IRC09:08
*** mtreinish has quit IRC09:11
*** yamamoto has joined #openstack-infra09:11
*** dtantsur|afk is now known as dtantsur09:19
*** bhavikdbavishi has joined #openstack-infra09:19
*** jamesmcarthur has joined #openstack-infra09:20
*** bhavikdbavishi has quit IRC09:23
*** jamesmcarthur has quit IRC09:24
quiquellGood morning09:31
amorinmorning09:31
quiquellWe are having issues with fedora28 nodepool images09:31
quiquellamorin: o/09:31
Tenguah.... cool, I was wondering if it was due to my patch not being in-sync with master.09:32
quiquellamorin: Do you know something about them ?09:32
amorinnop, sorry09:34
quiquellamorin: Who can help me with that ?09:34
*** psachin is now known as psachin|session09:35
*** jamesmcarthur has joined #openstack-infra09:36
amorinmaybe frickler ?09:36
amorinor fungi, but they are on another tz09:37
*** markvoelker has joined #openstack-infra09:39
*** jamesmcarthur has quit IRC09:41
fricklerquiquell: maybe you could start by explaining what you think the issue might be09:41
*** lpetrut has joined #openstack-infra09:41
frickleramorin: did you see my remarks yesterday? I'm not sure how we should proceed now, does is still make sense to continue with the 20-nodes-benchmarking setup?09:42
amorinfrickler: yes I saw it09:43
quiquellfrickler: yep sorry, the thing is that we have a script at tripleo with the shebag "python3 -s"09:43
quiquellfrickler: -s modifier means don't use user site package, meaning don't use pip stuff09:43
quiquellfrickler: but fedora28 has setuptools from pip since it's excluded from dnf.conf09:43
amorinso, for now I think it's not needed to move forward until we fix our RAM issue09:43
quiquellfrickler: so it fails using "pkg_resources" that it's at setuptools09:44
amorinanyway I moved your instances, one per host as said09:44
*** derekh has joined #openstack-infra09:44
*** lpetrut has quit IRC09:45
frickleramorin: so your plan would be to install more RAM into the hosts? or adapt the quota settings to the RAM that is actually installed? I'm assuming the latter would be faster09:46
fricklerquiquell: is this a new issue? also, it sounds to me like the solution would be to either amend your script or the fedora setup. I'm guessing the former might be easier. also ianw seems to be our resident fedora expert, maybe he has some further insight09:48
amorinthe plan is a little bit different, because we are something that is leaking some ram on the host itself, not related to instances09:48
amorinbut the side effect is that instances wont have enough ram available09:48
frickleramorin: ah, o.k.09:48
amorinso adding more ram or setting quota is not the solution09:48
*** bhavikdbavishi has joined #openstack-infra09:48
amorinwe need to work deeper on our leak09:48
quiquellfrickler: We have started to test python3 stuff using fedora28 for future centos versions, so it's new09:49
quiquellfrickler: I suppose the "-s" modifier is there for a reason, have to ask around09:49
quiquellfrickler: But I don't know why nodepool f28 is coocked with exclusions at dnf.conf and using pip setuptools09:49
quiquellfrickler: Someone told me about, people not messing around with the images or the like09:50
fricklerquiquell: I can only guess that we need to avoid pip >= 10 because it broke devstack09:50
*** jamesmcarthur has joined #openstack-infra09:51
quiquellfrickler: ok, so it would be ok for a job to modify dnf.conf and install setuptools from dnf ?09:51
quiquellfrickler: in case it's not using devstack09:52
frickleramorin: o.k., so I'm going to assume that there is nothing to be done currently on infra-root side, we'll wait for your further feedback.09:52
quiquellssbarnea|rover: ^09:53
*** gfidente has joined #openstack-infra09:53
amorinfrickler: yes09:53
fricklerquiquell: in theory anything that works for your job and doesn't break things globally would seem fine I guess09:53
quiquellfrickler: fair enough, thanks!09:55
*** jamesmcarthur has quit IRC09:56
quiquellfrickler: Do you have a pointer to part of the code that cooks the f28 image and add the exclusion at dnf.conf ?09:57
quiquellssbarnea|rover: ^09:57
fricklerquiquell: looks like this is what is installing the exclude: http://git.openstack.org/cgit/openstack/diskimage-builder/tree/diskimage_builder/elements/pip-and-virtualenv/install.d/pip-and-virtualenv-source-install/04-install-pip#n14509:59
quiquellfrickler: thanks so much10:00
ssbarnea|roverfrickler: am I correct to assume that the only reason for installing from pip and doing the exclude was because it was not available on base os?10:01
ssbarnea|roverso if we change the logic into: "install from rpm if available, else install from pip and add exclude" would be fine, right?10:02
AJaegerquiquell: ianw and pabelanger did most of the fedora work, best discuss with them10:03
*** electrofelix has joined #openstack-infra10:04
*** sshnaidm|afk is now known as sshnaidm10:05
quiquellAJaeger: ack thanks10:05
*** e0ne has joined #openstack-infra10:08
*** jamesmcarthur has joined #openstack-infra10:12
*** jamesmcarthur has quit IRC10:17
*** jamesmcarthur has joined #openstack-infra10:19
*** quiquell is now known as quiquell|brb10:19
*** yamamoto has quit IRC10:21
*** pbourke has quit IRC10:26
*** pbourke has joined #openstack-infra10:28
*** jamesmcarthur has quit IRC10:28
*** jamesmcarthur has joined #openstack-infra10:35
fricklerquiquell|brb: the main issue is that the os version might be too new for what is needed in testing, i.e. the pip >= 10 issue I was mentioning earlier10:39
*** jamesmcarthur has quit IRC10:40
*** quiquell|brb is now known as quiquell10:40
quiquellfrickler: the pip >= 10 is only affecting devstack ?10:41
*** jamesmcarthur has joined #openstack-infra10:43
fricklerquiquell: most likely not, but for devstack I know that in the current state it would hard fail. see https://github.com/pypa/pip/issues/4805 for some context10:43
*** abregman has joined #openstack-infra10:46
abregmanhey, what does this means? "Incompatible requirement found!"10:50
*** jamesmcarthur has quit IRC10:53
*** yamamoto has joined #openstack-infra10:54
fricklerabregman: you probably need to give us a bit more context in order to be able to come up with a helpful answer10:58
abregmanfrickler: reqruiement-check fails for this patch https://review.openstack.org/#/c/62452111:01
*** jamesmcarthur has joined #openstack-infra11:01
*** udesale has quit IRC11:02
*** udesale has joined #openstack-infra11:03
fricklerabregman: so that's mainly a question for the requirements team, but it seems to show the details two lines earlier http://logs.openstack.org/21/624521/6/check/requirements-check/4bb9da8/job-output.txt.gz#_2018-12-12_09_58_02_84767211:03
*** lpetrut has joined #openstack-infra11:03
*** yamamoto has quit IRC11:03
*** jamesmcarthur has quit IRC11:06
fricklerabregman: IIUC the short answer is that requirements should go into the global list first before you use them in your project. folks in #openstack-requirements probably can explain this in more detail11:06
*** abregman has quit IRC11:07
*** abregman has joined #openstack-infra11:07
abregmanfrickler: sorry, I disconnected. did you write something?11:08
fricklerabregman: yes I did, sadly eavesdrop seems to have dropped it, too:11:14
fricklerabregman: so that's mainly a question for the requirements team, but it seems to show the details two lines earlier http://logs.openstack.org/21/624521/6/check/requirements-check/4bb9da8/job-output.txt.gz#_2018-12-12_09_58_02_84767211:14
fricklerabregman: IIUC the short answer is that requirements should go into the global list first before you use them in your project. folks in #openstack-requirements probably can explain this in more detail11:14
*** jamesmcarthur has joined #openstack-infra11:14
*** jamesmcarthur has quit IRC11:18
*** bhavikdbavishi has quit IRC11:19
*** yamamoto has joined #openstack-infra11:24
openstackgerritNatal Ngétal proposed openstack/diskimage-builder master: [Configuration] Add missing py37 and corrected default envlist.  https://review.openstack.org/62467011:25
aspiersfungi: I notice that gerritbot doesn't announce changes to openstack/governance-* anywhere (except -uc) - should I fix that?11:26
*** yamamoto has quit IRC11:29
*** jamesmcarthur has joined #openstack-infra11:29
*** markvoelker has quit IRC11:30
*** e0ne has quit IRC11:30
*** rfolco has joined #openstack-infra11:30
aspiersit could announce in #openstack-{dev,tc}, since openstack/governance changes are already announced there11:30
*** jamesmcarthur has quit IRC11:34
*** smarcet has joined #openstack-infra11:37
*** jamesmcarthur has joined #openstack-infra11:37
openstackgerritMerged openstack-infra/system-config master: Ectomy some Jenkins out of the docs  https://review.openstack.org/43645211:53
*** witek has quit IRC11:55
*** witek has joined #openstack-infra11:55
*** jamesdenton has quit IRC11:56
*** dtantsur is now known as dtantsur|brb11:56
*** smarcet has quit IRC11:56
*** tpsilva has joined #openstack-infra11:57
*** jamesdenton has joined #openstack-infra12:00
*** markvoelker has joined #openstack-infra12:05
*** yamamoto has joined #openstack-infra12:08
*** yamamoto has quit IRC12:11
*** yamamoto has joined #openstack-infra12:11
*** jamesmcarthur has quit IRC12:16
*** jamesmcarthur has joined #openstack-infra12:17
*** jamesmcarthur has quit IRC12:21
*** abregman has quit IRC12:24
*** jamesmcarthur has joined #openstack-infra12:25
*** bhavikdbavishi has joined #openstack-infra12:39
pabelangerfrickler: quiquell: AJaeger: ssbarnea|rover: There is a long history of how we install pip / setuptool / virtualenv on images, you can see some of that in the DIB url linked above. I too am actually running into this issue with dnf.conf excludes, my though is to modify dnf.conf during pre.yaml run and removed the excludes, however I have no idea what this is going to break.12:39
pabelangerI think the better solution might be to drop excludes from dnf.conf and switch to using dnf versionlock, where we can pin the version of packages we have installed: https://dnf-plugins-core.readthedocs.io/en/latest/versionlock.html12:40
pabelangerhowever, I don't believe we are going to produce images with versions of pip / setuptools from distro over latest. As we want to keep version the same across distros for testing reasons12:41
*** abregman has joined #openstack-infra12:41
pabelangerianw: ^12:41
abregmanfrickler: thanks, I'll check with them12:42
quiquellpabelanger: The pinning of RPM package would be better solution12:42
pabelangerquiquell: yes, I think so also, right now an exclude results in 404 from dnf for packages, breaking cfgmgmt if trying to install those packages12:43
*** jpena is now known as jpena|lunch12:43
ssbarnea|roveri doubt pinning would work, i think that test images should respect default distro setup/behavior and try not to alter it, or the result of "testing the code on xyz distro" would not be trustable. It worth nothing to know that a tool works on a specific distro if you.... if you didn't use distro packages.12:44
ssbarnea|roverpip got smarter recently and now is able to fail to upgrade a distro package instead of breaking the diso packge. in the past it was upgrading distro and causing very hard to discover bugs. most often where when the distro package was updated, yum was failing to update that package.12:46
*** yamamoto has quit IRC12:49
ssbarnea|roverthis conflict between pip and system packager is a permanent source of issues. if i remember correctly debian did something smart: system packages are in different location than pip installed ones, so you cannot really override system packages with pip.12:49
quiquellssbarnea|rover: well I suppose --user should be the default for pip so they get separated12:49
ssbarnea|roverclearly I am in favour or eradicating use of excludes.12:50
*** rh-jelabarre has joined #openstack-infra12:50
quiquellssbarnea|rover: ack, let's play a little removing it to see if it affects our tripleo jobs or not12:50
*** bobh has quit IRC12:51
*** bobh has joined #openstack-infra12:55
*** bobh has quit IRC12:58
*** jamesmcarthur has quit IRC12:59
*** xarses has quit IRC13:03
*** jamesmcarthur has joined #openstack-infra13:06
*** abregman has quit IRC13:06
*** bobh has joined #openstack-infra13:06
*** trown|outtypewww is now known as trown13:07
*** bobh has quit IRC13:11
*** jamesmcarthur has quit IRC13:14
*** jamesmcarthur has joined #openstack-infra13:15
*** bobh has joined #openstack-infra13:15
*** yamamoto has joined #openstack-infra13:16
*** yamamoto has quit IRC13:18
fricklerinfra-root: devstack and tempest have now switched to running on bionic per default on master, as a result we are currently using 150 bionic vs. 200 xenial nodes. I'm thinking we should adjust the min-ready count somehow, maybe set to 10 for both as a first step?13:19
*** bobh has quit IRC13:20
*** dtantsur|brb is now known as dtantsur13:20
pabelangerfrickler: min-ready doesn't really affect how many jobs get launched to use images, it is more about first job for node request.  There is also new logic for realitive priority, so smaller projects using xenial will likely be run first before those changed to use bionic13:20
pabelangerbasically, if we want and watch, it will al level out13:21
pabelangerI'd actually be okay with dropping min-ready down to like 1, our clouds boot pretty fast theses days13:23
openstackgerritFilippo Inzaghi proposed openstack-infra/python-storyboardclient master: Don't quote {posargs} in tox.ini  https://review.openstack.org/60917613:23
*** yamamoto has joined #openstack-infra13:23
fricklerpabelanger: I would subscribe to the latter, rax is around 5 minutes, ovh currently bursting up to 20 mins13:24
*** jamesmcarthur has quit IRC13:24
*** jamesmcarthur has joined #openstack-infra13:24
pabelangerfrickler: sure, in the case of openstack, the system is so busy, min-ready has minimal affect on making jobs faster. Only if there is zero patches in the queue, and number of nodes running < min-ready. Otherwise, PRs just need to wait x minutes for resoures to boot13:25
fungiyeah, min-ready only really matters when we're not using all our capacity, then nodepool pre-boots some instances to have them ready for new requests. more often than not when people really care about responsiveness is when we're already under a backlog anyway. the main risk is in having min-ready too high for node types we almost never use, so they sit there chewing up some of our capacity to no13:25
fungipurpose13:25
fungibut yeah, when there are already backlogged requests for those node types, min-ready is entirely irrelevant13:26
*** markvoelker has quit IRC13:28
pabelanger+113:31
*** rlandy has joined #openstack-infra13:33
fricklero.k., so leave things as is? or make ubuntu-xenial==1, too, to avoid it looking more equal than others?13:36
*** jamesmcarthur has quit IRC13:38
*** jamesmcarthur has joined #openstack-infra13:38
pabelangerlets confirm with clarkb, but my preference would be to set all to min-ready: 1 and let nodepool boot them as needed. Our system is busy enough to not really benefit from it now with new node request system13:40
*** eharney has joined #openstack-infra13:40
*** jamesmcarthur has quit IRC13:43
*** bhavikdbavishi has quit IRC13:44
openstackgerritJens Harbott (frickler) proposed openstack-dev/hacking master: Fix coverage job  https://review.openstack.org/62469913:48
*** jpena|lunch is now known as jpena13:48
*** jamesmcarthur has joined #openstack-infra13:49
*** bobh has joined #openstack-infra13:53
*** dtantsur is now known as dtantsur|brb13:56
*** markvoelker has joined #openstack-infra14:01
*** aojea has joined #openstack-infra14:01
openstackgerritMerged openstack-infra/elastic-recheck master: add query for os-vif pyroute2 open files  https://review.openstack.org/62441214:04
*** kgiusti has joined #openstack-infra14:13
*** mriedem has joined #openstack-infra14:13
*** fresta has joined #openstack-infra14:17
*** jonher_ has joined #openstack-infra14:17
*** jonher has quit IRC14:21
*** jonher_ is now known as jonher14:21
*** fresta_ has quit IRC14:21
*** tk81 has joined #openstack-infra14:30
*** irclogbot_1 has quit IRC14:32
*** jrist has joined #openstack-infra14:37
*** e0ne has joined #openstack-infra14:38
*** irclogbot_1 has joined #openstack-infra14:41
*** yamamoto has quit IRC14:44
*** ykarel has quit IRC14:47
dmsimardCan anyone remind me how to check if there are any meetbot meetings in progress other than with the calendar ?14:52
fungiit's hard to know now that lots of teams are running meetings in their own channels14:52
fungii used to just look at the meeting channels since i lurk in all of them14:53
fungibut that doesn't work any longer14:53
dmsimardfungi: right, that was my point14:53
*** Miouge has quit IRC14:55
fungii don't know of any other way than by parsing the calendar data, no. maybe a quick python script to check whether a specific time overlaps any of the time ranges in the yaml data would be useful to add to the yaml2ical repo or the meetings repo14:55
*** irclogbot_1 has quit IRC14:55
*** jamesmcarthur has quit IRC14:55
fungialternatively, it might be possibel to parse the meetbot daemon's log directly on eavesdrop.o.o to see if it records any meetings starting which it didn't record ending14:56
fungier, possible14:56
fungidepending on whether you care about finding running meetings which aren't on the schedule or which have run over their allotted times14:56
*** Miouge has joined #openstack-infra14:57
*** ykarel has joined #openstack-infra14:58
*** jamesmcarthur has joined #openstack-infra15:04
openstackgerritMatt Riedemann proposed openstack-infra/elastic-recheck master: Add query for network-vif-plugged timeout bug 1808171  https://review.openstack.org/62472815:09
openstackbug 1808171 in neutron "TaggedBootDevicesTest.test_tagged_boot_devices intermittently fails waiting for network-vif-plugged event which neutron does not send" [Medium,Confirmed] https://launchpad.net/bugs/180817115:09
*** jamesmcarthur has quit IRC15:09
evrardjpI am trying to use Zuul's tool/encrypt_secret.py can someone tell me what I should use as endpoint for infra?  http://susepaste.org/view//531360515:16
mriedemssbarnea|rover: were you going to update this e-r query? https://review.openstack.org/#/c/621004/15:17
*** markvoelker has quit IRC15:18
fungievrardjp: https://zuul.openstack.org/ should work, but your project name is openstack/openstack-helm-images not just openstack-helm-images right?15:18
evrardjpfungi: oh right. Thanks!15:18
evrardjpme silly.15:18
ssbarnea|rovermriedem: planning to, just too many things to do. if you want it faster feel free to update it.15:18
mriedemok15:19
fungievrardjp: https://docs.openstack.org/infra/manual/zuulv3.html#secret-variables15:20
*** irclogbot_1 has joined #openstack-infra15:20
*** yamamoto has joined #openstack-infra15:20
evrardjpfungi: it's just me being silly -- I went through that doc, just ... my bad.15:21
evrardjpoh wait.15:21
evrardjpI didn't see that docs15:21
evrardjpbut yeah, it's still my bad.15:21
fungievrardjp: the bit in the infra manual is specific to our deployment of zuul, so that you know the correct tenant name and url15:21
*** jrist has quit IRC15:21
fungiit does link to the more general feature info in zuul's documentation for those who are interested in the details though15:22
*** alexchadin has quit IRC15:22
evrardjpyeah :) it's easier to copy and paste and not make mistakes like a silly eventingmonkey15:22
evrardjpwoops15:22
evrardjpevrardjp15:22
evrardjpbad tab!15:22
evrardjpsorry eventingmonkey15:22
*** lpetrut has quit IRC15:22
evrardjpseems a bad end of day for me15:22
fungior a good time to end your day perhaps? ;)15:23
fungievrardjp: i also like using --infile and --outfile options with that tool to cut down on copy/paste issues with the plaintext and encrypted secret data too15:24
evrardjpfungi: Yeah I am concerned of those :)15:25
*** dtantsur|brb is now known as dtantsur15:25
evrardjpfungi: on top of that it's not me who'll do said copy and paste -- intermediaries are generally bad for those things15:25
evrardjpthanks for the advice fungi !15:28
fungiany time!15:28
evrardjp(including the end of day one, but sadly I can't take that one)15:28
*** jamesmcarthur has joined #openstack-infra15:31
openstackgerritMerged openstack-infra/elastic-recheck master: Add query for glance-api proxy error bug 1808063  https://review.openstack.org/62452415:37
openstackgerritMerged openstack-infra/elastic-recheck master: Add query for network-vif-plugged timeout bug 1808171  https://review.openstack.org/62472815:37
openstackbug 1808063 in OpenStack-Gate "glanceclient.exc.HTTPBadGateway: 502 Proxy Error during server snapshot" [Undecided,Confirmed] https://launchpad.net/bugs/180806315:37
openstackbug 1808171 in neutron "TaggedBootDevicesTest.test_tagged_boot_devices intermittently fails waiting for network-vif-plugged event which neutron does not send" [Medium,Confirmed] https://launchpad.net/bugs/180817115:37
openstackgerritFrank Kloeker proposed openstack-infra/irc-meetings master: [trivial] fix I18n meeting-id  https://review.openstack.org/62474215:40
*** lpetrut has joined #openstack-infra15:48
*** armax has joined #openstack-infra15:53
*** jrist has joined #openstack-infra15:54
*** lpetrut has quit IRC15:56
*** yamamoto has quit IRC15:58
*** yamamoto has joined #openstack-infra15:58
*** kjackal has quit IRC16:04
*** kjackal has joined #openstack-infra16:07
*** ykarel is now known as ykarel|away16:10
*** smarcet has joined #openstack-infra16:13
*** gyee has joined #openstack-infra16:19
*** ykarel|away has quit IRC16:19
*** udesale has quit IRC16:19
*** smarcet has quit IRC16:20
JpMaxManHey fungi16:23
JpMaxManI had a couple of questions about https://review.openstack.org/#/c/624523/ - should I ask here or message you directly?16:24
*** smarcet has joined #openstack-infra16:26
clarkbJpMaxMan: usually best to ask the channel then others can answer too if they are able16:27
JpMaxMancool - so with regards to the Missing zuul/main.yaml change - I'm not entirely sure what that is referring to....16:28
*** rfolco is now known as rfolco_doctor16:28
clarkbhttps://docs.openstack.org/infra/manual/creators.html#add-project-to-zuul is the piece of documentation that explains that. Basically zuul takes action on repos that it has been told about, so we have to edit the zuul config to tell it about your new project16:29
clarkb(otherwise zuul ignores events from that project repo)16:29
JpMaxMansince it is a sandbox I think we would want Zuul to ignore it right?16:30
*** bhavikdbavishi has joined #openstack-infra16:31
clarkbpossibly. At the very least you need zuul to run the noop jobs to get your Verified +1 and +2 votes so that things can merge16:31
clarkbbut zuul involvment can stop there16:31
clarkb(though you may want to have zuul do other useful tasks)16:32
*** Alvass has joined #openstack-infra16:32
*** _alastor_ has joined #openstack-infra16:32
JpMaxManalright - I'll add that - open to suggestions if you think it should do more?16:32
AlvassHi, I might have found a bug with zuul-executor executor/server.py, not sure where I should report this16:33
JpMaxManmy second question was regarding specifying just the master branch.  how do I specify that in the upstream: paramater of projects.yaml16:33
clarkbAlvass: zuul tracks bugs on https://storyboard.openstack.org and has #zuul on freenode for IRC discussion. There is also a zuul-discuss mailing list at lists.zuul-ci.org16:34
clarkbAlvass: any one of those three locations would be a good place to start16:34
Alvassclarkb thanks16:34
fungiJpMaxMan: sorry, in a meeting. our automation is going to perform it's initial import from all branches and tags from whatever repo you list in the "upstream" parameter16:37
fungiyou could create a fork and delete all the branches/tags you don't want imported, then list that instead16:37
JpMaxMangot it - so its up to the upstream repo to be clean16:38
JpMaxMansounds good16:38
JpMaxManok I'll work on it - thanks!16:38
*** _alastor_ has quit IRC16:40
openstackgerritMatt Riedemann proposed openstack-infra/elastic-recheck master: Update query for bug 1721093  https://review.openstack.org/62100416:40
openstackbug 1721093 in OpenStack-Gate "Zuul v3 tasks can end up in an UNREACHABLE state" [Undecided,Confirmed] https://launchpad.net/bugs/172109316:40
*** Alvass has quit IRC16:47
openstackgerritMerged openstack-infra/irc-meetings master: [trivial] fix I18n meeting-id  https://review.openstack.org/62474216:52
*** electrofelix has quit IRC16:53
openstackgerritJp Maxwell proposed openstack-infra/project-config master: Adding the netlify-sandbox project  https://review.openstack.org/62452316:53
*** yboaron_ has quit IRC16:55
*** pgaxatte has quit IRC16:57
*** ykarel has joined #openstack-infra17:03
*** amuller has joined #openstack-infra17:03
amullerhi ho, hi ho. I'm trying to have a gate job require a github project (in this case skydive)17:03
amullercheck out: https://review.openstack.org/#/c/624494/5/.zuul.yaml17:03
openstackgerritJp Maxwell proposed openstack-infra/project-config master: Adding the netlify-sandbox project  https://review.openstack.org/62452317:03
amullergetting this error: http://logs.openstack.org/94/624494/4/check/neutron-tempest-plugin-dvr-multinode-scenario/9f22757/job-output.txt.gz#_2018-12-12_15_08_47_95801517:04
amullerany idea what is the syntax to add a github project in the 'required projects' list?17:04
fungiamuller: they would need to release packages to pypi17:05
amullerfungi: so they do have a python client on pypi17:05
amullerI'm getting that via requirements.txt17:06
amullerwhat I need in the job is the devstack plugin they defined in their github repo17:06
fungioh, required-projects in zuul, not global requirements in openstack, got it17:06
*** quiquell is now known as quiquell|off17:06
fungipretty sure to be able to have it as a required-project (so that you can take advantage of cross-repo dependencies) it needs to be included in the zuul "openstack" tenant's main list of repositories for the github.org connection17:07
amullerI don't need cross-repo dependencies, I don't think17:07
amullerreally what I'm trying to do is to pull in the skydive devstack plugin17:07
fungier, i guess it's github.com (you can see how much time i spend on the site)17:07
clarkbI'm guessing the underlying issue here is that set error on clone17:08
clarkbso the github plugin can't be cloned from github17:08
amullerso should I set ERROR_ON_CLONE to False for jobs that want to use skydive?17:09
clarkbamuller: a better approach may be to clone skydive into the correct location outside of devstack first17:10
clarkbthen the devstack check will still apply to everything else, ut you can get the repo on disk17:10
amullerhmm17:11
amullernot sure how to do that and still to get the devstack plugin executed on stack.sh17:11
clarkbya add a pre-run playbook that clones the repo to /opt/stack/skydive (or whatever the path is supposed to be)17:11
amullerit's weird, locally it all works fine17:11
clarkbamuller: its a sanity check for gating because devstack doesn't know how to set up repos to test changes under test17:11
clarkbamuller: instead zuul does that for devstack, then we disable any cloning in devstack so devstack doesn't undo zuul'17:12
clarkbs work17:12
clarkbI think corvus has talked about making required projects less strict so you can require some project from github and have zuul set it up for you, but not pull in the other implied zuul config stuff17:13
clarkbif/when this happens it will make this simpler, but isn't implemented yet17:13
amullernew to zuul and these type of changes... so I add a new playbook in the repo, and add it to the pre-run list for the jobs I'm interested in?17:15
amullerand the playbook will git clone the github repo to the correct dir17:15
amullerand how do I ensure that devstack will use that plugin?17:15
clarkbamuller: yup I think that will make devstack happy in that context because skydive will be present and no cloning will need to happen17:16
openstackgerritMerged openstack-infra/elastic-recheck master: Update query for bug 1721093  https://review.openstack.org/62100417:16
clarkbamuller: you enable it as before. The difference is devstack will notice the repo already exists and won't have to clone it so no error happens17:16
openstackbug 1721093 in OpenStack-Gate "Zuul v3 tasks can end up in an UNREACHABLE state" [Undecided,Confirmed] https://launchpad.net/bugs/172109317:16
amulleryeah got it17:16
amullerok17:16
amullerlet's see if I can Google my way to get the Ansible playbook to do what it needs to do :)17:16
amullerclarkb++17:17
clarkbamuller: feel free to push something up even if it doesn't seemquite right. We can probably help once we've got error messages from ansible :)17:17
amullerfor the ansible playbook I should be able to develop that locally instead of pushing 10 patchsets17:17
amullerhopefully zuul will run it the same way17:18
amulleras far as users and whatever17:18
*** graphene has joined #openstack-infra17:18
fungibut also, once you do push it, zuul can run it on your proposed addition and provide you feedback too17:18
*** psachin|session has quit IRC17:20
*** e0ne has quit IRC17:21
*** jamesmcarthur has quit IRC17:22
*** jamesmcarthur has joined #openstack-infra17:22
clarkbfrickler: mriedem melwitt to TL;DR the cirros ssh issues, they all root cause down to metadata being slow, but in the case of running out of disk we are attempting to use config drive first (which would ignore the metadata server), but since disk is full that fails then we fall back to metadata then that times out and fails?17:25
clarkbso thats actually a couple bugs in one. That is a fun one :)17:25
mriedemmetadata api being slow?17:26
mriedemas in the guest doesn't get network info fast enough from the meta api?17:26
*** jpich has quit IRC17:26
clarkbmriedem: yup, cirros has a 10 second timeout on that network request, and nova was taking 10.something seconds to respond17:26
melwittI couldn't pinpoint what part of it was taking > 10 seconds. but I left notes on the launchpad bug17:26
*** dtantsur is now known as dtantsur|afk17:26
melwittI assumed a call to neutron but was having trouble matching up the metadata API log to the neutron log17:27
clarkbmriedem: in the case of the disk issues in the logs, we are trying config-drive first and that fails so we fall back on metadata after17:27
mriedemok, there is a known db query perf bug in the metadata api where we're doing some unnecessary joins17:27
mriedemmaybe removing that would speed things up17:27
clarkbbut in the case melwitt found I don't think we attempt config drive at all so that explains why we see the same problem in both cases17:27
mriedemclarkb: want to link nova to that bug?17:27
clarkbmriedem: sure17:27
mriedemwe likely need some profiling of the metadata api code since it's rarely touched17:27
clarkbdone17:27
clarkbit sounded like frickler was talking to cirros about fixing the config-drive failures on their end17:28
melwittmriedem: yeah, it's really hard to tell because each thing is just a singular log message17:28
melwittas far as looking at it from a failed run sans profiling17:28
mriedemhttps://bugs.launchpad.net/nova/+bug/1799298 was the other thing17:29
openstackLaunchpad bug 1799298 in OpenStack Compute (nova) rocky "Metadata API cross joining instance_metadata and instance_system_metadata" [Medium,Triaged]17:29
clarkband the note about the route add failing seems to be just noise since the route is tehre when the route table is echoed17:29
mriedemworkday reported that17:29
mriedemok added a note to https://bugs.launchpad.net/openstack-gate/+bug/180801017:31
openstackLaunchpad bug 1808010 in OpenStack-Gate "Tempest cirros boots fail due to lack of disk space" [Undecided,New]17:31
clarkbI should update the bug title too17:31
fungiheading out for a brisk, chilly walk but will be back well before the storyboard meeting at 19:0017:32
clarkbnew title is a bit long but hopefully captures what is going on17:32
melwittthis bug is heating up17:34
melwittI read through the db cross join bug and ML thread, kinda hard to believe that could take > 10 seconds. but what do I know17:35
clarkbmelwitt: we may want to check against the dstat information to see if the system is under heavy load at that point, could explain the extra slowness17:36
melwittgood thinkin, I shall look17:36
clarkbI've been using https://lamada.eu/dstat-graph/ you can dump the dstat.csv file from devstack into there17:36
*** graphene has quit IRC17:38
*** ianychoi has joined #openstack-infra17:38
*** graphene has joined #openstack-infra17:40
*** trown is now known as trown|lunch17:41
clarkbfungi: that sounds really nice actually. I kinda want to take a walk, but everyone just left for a dentist visit so I'll enjoy the peace and quiet at home instead17:42
melwittoh, neat17:43
*** panda is now known as panda|off17:43
clarkbfungi: can you let me know when you are back? I'd like to approve cmurphy's puppet4 futureparser chagne for lists.o.o but figure you know that service much better than I do so your help if something goes sideways would be nice :)17:46
*** ginopc has quit IRC17:46
mriedemclarkb: the metadata api does need to query neutron to get security group information, so that's one slow thing17:53
clarkbmriedem: does the api need to do that when asking for public keys?17:53
mriedemand another long-standing known api performance bug, that nova doesn't cache the security group info17:53
mriedemclarkb: no, it's just part of the metadata response17:53
mriedemprobably to match ec217:54
clarkbah17:54
* clarkb finds breakfast17:57
*** _alastor_ has joined #openstack-infra18:03
*** derekh has quit IRC18:04
*** kjackal has quit IRC18:04
*** boden has joined #openstack-infra18:09
openstackgerritClark Boylan proposed openstack-infra/system-config master: Set iptables forward drop by default  https://review.openstack.org/62450118:10
openstackgerritClark Boylan proposed openstack-infra/system-config master: Import install-docker role  https://review.openstack.org/60558518:10
clarkbI think I've got ^ correct this time18:10
*** graphene has quit IRC18:11
*** rfolco_doctor is now known as rfolco18:12
*** jpena is now known as jpena|off18:17
melwittclarkb: that dstat graph site is awesome. looking at the data from the run, cpu usr is at 100% at 10:46:57 when the > 10 secs metadata API reply happened, and also memory usage nearly maxed http://logs.openstack.org/76/582376/8/gate/tempest-full-py3/a8f62b6/controller/logs/dstat-csv_log.txt18:18
*** mriedem is now known as mriedem_lunch18:21
clarkbcorvus: mordred: can you check my comment on https://review.openstack.org/#/c/622964/2 about that inventory generation picking up the magnum nodes? I don't think we want that but maybe it will be fine18:21
clarkbmelwitt: not surprising :/ probably want to make any easy performance improvements to metadata retrieval, then if this persists reduce general system overhead too18:22
melwittaye18:22
melwittmriedem_lunch is investigating18:23
mordredclarkb: we probably do want a flag. that said - there is at least one update to the magnum hosts we'll want to make with ansible before we go live with them for anything18:23
mordredclarkb: so ansibling our magnum hosts in general may not be crazy18:23
clarkbmordred: ya I'm more thinking we don't want our install exim and iptables stuff running on them18:23
clarkbso we need to split them off18:23
mordredclarkb: but - I'm sure things will blow up ... yeah18:23
*** aojea has quit IRC18:23
clarkbmelwitt: mriedem_lunch sounds good, thanks18:24
*** bhavikdbavishi has quit IRC18:25
*** gfidente is now known as gfidente|afk18:30
*** jrist has quit IRC18:33
*** shardy has quit IRC18:35
clarkbmordred: do we want to merge the inventory script as is (I mean it works and won't get auto applied to anything) or should we refine it further?18:36
mordredclarkb: I think lets just land it and we can always make it better18:37
mordredclarkb: my hunch is that we'll find it more pleasant to just add an entry to the inventory file by hand when making a new node - and that script is really just there for in-case-we-need-ot18:38
clarkbmordred: k I'll approve18:38
*** jamesmcarthur has quit IRC18:44
*** Swami has joined #openstack-infra18:44
Shrewsfungi: left you a comment on https://review.openstack.org/62321118:45
*** wolverineav has joined #openstack-infra18:46
*** wolverineav has quit IRC18:46
*** wolverineav has joined #openstack-infra18:47
*** smarcet has quit IRC18:49
*** trown|lunch is now known as trown18:50
fungiclarkb: cmurphy: yeah, i'm around now to watch lists.o.o18:52
clarkbfungi: ok I'll go ahead and reapprove that now I guess18:54
openstackgerritJp Maxwell proposed openstack-infra/project-config master: Adding the netlify-sandbox project  https://review.openstack.org/62452318:54
*** dabukalam has joined #openstack-infra18:55
*** wolverineav has quit IRC18:57
fungiShrews: thanks! i've replied18:58
Shrewsfungi: ah, for some reason i assumed task #'s may not be unique19:00
Shrews(thus the need to include Story)19:00
fungithey are globally unique19:01
fungiat least within a given sb deployment19:01
fungii think what we'd like (but isn't implemented in its-storyboard yet) is for the story footer to trigger a story comment, a la "related-bug"19:02
*** wolverineav has joined #openstack-infra19:03
fungiso people can use it when they want a commit to refer to a story without addressing a particular task within that story19:03
fungimany of us thought that was already implemented since it was part of the original specification19:03
fungibut it seems to have been left as a future exercise for someone who enjoys java19:03
Shrewsfungi: i do not enjoy java, ftr19:06
Shrewsinfra history has documented this fact  :)19:07
*** wolverineav has quit IRC19:07
*** wolverineav has joined #openstack-infra19:09
fungiheh, yes i was sort of harkening back to the age of the wip feature19:09
*** ykarel has quit IRC19:10
*** mriedem_lunch is now known as mriedem19:11
openstackgerritMerged openstack-infra/system-config master: Add a script to generate the static inventory  https://review.openstack.org/62296419:12
mriedemclarkb: corvus: can one of you link me to the zuul queueing change made recently to put tripleo in it's own queue or whatever19:12
mriedemi want to send out a gate status update while we have some fixes approved19:12
clarkbmelwitt: mriedem want to update https://review.openstack.org/#/c/624533/1 with my comment?19:13
clarkbmriedem: https://review.openstack.org/#/c/624246/ is the tripleo-ci change, not merged yet19:13
clarkbcorvus was also mentioning we might be ebtter off setting that in project-config19:13
clarkb(I'm happy to start with ^ and if that doesn't work move to project-config though)19:13
fungimriedem: https://review.openstack.org/62359519:13
clarkbya ^ is the zuul feature and https://review.openstack.org/#/c/624246/ uses that new feature to group things19:14
fungimriedem: at least for the feature implementation in zuul19:14
mriedemok19:14
fungiand then we got the scheduler restarted with that in place so we can configure19:14
melwittclarkb: oh, yup, will do19:16
openstackgerritmelanie witt proposed openstack-infra/elastic-recheck master: Update query for bug 1808010  https://review.openstack.org/62453319:19
openstackbug 1808010 in OpenStack-Gate "Tempest cirros ssh setup fails due to lack of disk space causing config-drive setup to fail forcing fallback to metadata server which fails due to hitting 10 second timeout." [Undecided,New] https://launchpad.net/bugs/180801019:19
*** e0ne has joined #openstack-infra19:20
clarkbinfra-root https://review.openstack.org/#/c/605585/16 and its parent both pass ansible testinfra integration testing now. The parent change switches our default FORWARD rule to DROP from ACCEPT19:25
clarkbI believe this to be safe but we should watch it carefully. Then 605585 is the base change to get docker onto servers so we can start dockering services19:25
*** e0ne_ has joined #openstack-infra19:27
amullerclarkb: of course it fails: http://logs.openstack.org/94/624494/6/check/neutron-tempest-plugin-dvr-multinode-scenario/d2d8b5a/job-output.txt.gz#_2018-12-12_18_32_52_01583719:27
*** e0ne has quit IRC19:27
amullerdo I need the playbook to become a certain user?19:28
clarkbamuller: I think that dir will be owned by the stack user19:28
clarkbyou should be able to set become: yes and whatever the flag is to set user to stack and then it will work19:28
Shrewsclarkb: what is /etc/docker/daemon.json?19:29
Shrewsis that created when the service starts?19:29
amullerclarkb: thanks I'll try that19:29
clarkbShrews: that is the docker daemon configuration file19:30
*** e0ne_ has quit IRC19:30
clarkbShrews: I use it locally to point docker at my zfs pool for volume use, we seem to be using it to allow for ipv619:30
Shrewsclarkb: ah i see it used in the playbooks now. funny i've never noticed that file19:31
Shrewsclarkb: do we want to test that the service is actually running?19:32
clarkbmaybe? (I've mostly picked this change up by way of mordred and ianw to get the existing tests working around firwall rules)19:33
clarkbI think we are relying on the packaging to do that for us but double checking it starts docker daemon may be a good idea19:34
*** e0ne has joined #openstack-infra19:34
clarkblet me add that19:34
Shrewsclarkb: http://git.openstack.org/cgit/openstack-infra/system-config/tree/testinfra/test_base.py#n10419:36
Shrewsfor an example19:36
openstackgerritClark Boylan proposed openstack-infra/system-config master: Import install-docker role  https://review.openstack.org/60558519:36
clarkbthe service is called docker on my local machine but process is dockerd. I think I got ^ correct19:36
Shrewsclarkb: lgtm otherwise. i'll +2 if the new check passes19:38
Shrewsi have a feeling we'll need a check for distro on the service name though (just from past experience)19:39
*** e0ne_ has joined #openstack-infra19:39
clarkbShrews: currently the test is scoped to bionic only19:40
Shrewsclarkb: oh, you only test on bionic. that should be fine then  :)19:40
clarkbbut ya if we add more platforms we'll need that maybe19:40
clarkb(it is using upstream packaging not distro packaging too, so may be consistent across platforms)19:40
*** e0ne has quit IRC19:41
Shrewsyeah19:41
clarkbfungi: cmurphy just a few minutes away from merging the lists.o.o future parser change19:44
*** e0ne has joined #openstack-infra19:44
clarkbthen we wait for puppet to run there19:44
openstackgerritMerged openstack-infra/elastic-recheck master: Update query for bug 1808010  https://review.openstack.org/62453319:44
openstackbug 1808010 in OpenStack-Gate "Tempest cirros ssh setup fails due to lack of disk space causing config-drive setup to fail forcing fallback to metadata server which fails due to hitting 10 second timeout." [Undecided,New] https://launchpad.net/bugs/180801019:44
*** e0ne_ has quit IRC19:45
*** e0ne has quit IRC19:45
*** e0ne has joined #openstack-infra19:46
fungiclarkb: cool, i'm tailing syslog on lists.o.o watching for puppet activity19:46
mriedemso i think i'm going to just start pushing patches to skip cinder tests that have had reported bugs forever for which no one is working on19:48
clarkbmriedem: in tempest?19:48
mriedemhttp://status.openstack.org/elastic-recheck/ - look for 'backup19:48
mriedemor just move cinder backup tests to their own job so they aren't part of the integrated gate19:49
*** tk81 has quit IRC19:49
fungijungleboyj might be interested in a consolidated list of those19:49
fungiand yeah, testing them only against cinder changes could be a viable compromise19:50
mriedemi already dumped a bit in -cinder a few minutes ago19:50
mriedemi will send a separate proposal to the ML19:50
fungiahh, yep i see that. i lurk in there but wasn't paying close attention19:51
* jungleboyj hangs my head in shame19:51
fungithanks mriedem!19:51
clarkbfwiw I really do think there is value in debugging these failurse. Seems like in the last week myself and you and ovh and inap and tripleo have identified a bunch of things that can be approved in a variety of places19:52
clarkbbasically the test failures are valuable information if we act on them19:52
clarkbthe problem is we don't act on them often19:52
mriedemi've debugged a few of the cinder bugs,19:52
AJaegerclarkb: are you fine adding another repo to openstack-infra? See https://review.openstack.org/#/c/624523/ for netlify-sandbox19:52
jungleboyjmriedem:  Yes, thank you for your help.19:52
mriedemseveral of them are due to the fact that cinder-api does rpc calls to cinder-volume,19:52
mriedemwhich times out the REST API response19:52
fungigetting someone active in cinder involved in helping debug those seems like a reasonable expectation19:52
mriedemor the RPC response19:52
jungleboyjThe problem we are having is no one else picking these up and helping.19:52
mriedemcinder can't scale with RPC blocking calls everywhere19:52
clarkbAJaeger: yes, I mentioned yseterday that given the similar desire for other docs hosting with zuul and opendev I don't mind hosting it under -infra19:53
jungleboyjfungi:  Agreed.19:53
AJaegerclarkb: thanks, then I'll +219:53
openstackgerritMerged openstack-infra/system-config master: Turn on the future parser for lists.openstack.org  https://review.openstack.org/61565619:53
clarkbAJaeger: Its not a perfect location today, but it will work and it should be valuable to us.19:53
jungleboyjAlso have noticed more check and gate issues popping up.19:53
clarkbjungleboyj: yes I've been banging that drum since about the PTG now19:53
mriedemjungleboyj: yes, see the email i just sent19:53
AJaegerclarkb: great19:53
*** e0ne has quit IRC19:54
*** Adri2000 has quit IRC19:54
clarkbjungleboyj: I think the objectvie data collection shows our testing (and software?) is less reliable in recent months than in the past19:54
fungiAJaeger: yeah, i double-checked with clarkb before suggesting the infra namespace, just haven't had a chance to follow up on the review19:54
clarkbthere are a variety of reasons for that, but I think it mostly has to do with people not actively identifying and fixing issues as they come up19:54
jungleboyjclarkb:  Sadly you are right.19:55
fungifor intermittent/nondeterministic failures it's all too easy to "recheck" and hope it becomes someone else's problem to solve19:55
jungleboyjclarkb:  In Cinder we have seen a focus on users fixing issues in their drivers, which is good, but not in fixing general issues.19:55
clarkbmriedem: thanks for the email update, the bug by bug rundown is good stuff19:56
*** Adri2000 has joined #openstack-infra19:56
fungionce folks are trained to recheck until things pass, they recheck new nondeterministic bugs in until we grind to a halt because jobs won't succeed any more19:56
dansmithclarkb: mriedem ++19:56
AJaegerthanks, fungi19:57
*** smarcet has joined #openstack-infra19:58
clarkbfungi: lists.o.o will get puppeted on the run starting at 2000UTC19:58
* jungleboyj read mriedem s email.19:59
clarkbfwiw I've also got a fix for an issue affecting a bunch of tripleo jobs up and dmsimard has suggested a simpler/better way to fix it and ahs offered a patch for that. So we're trackign things down outside of the integrated gate or even just nova as well19:59
fungiclarkb: thanks. most recent puppetage on lists.o.o completed at 19:51:4419:59
clarkbOVH thinks that may have tracked down the source of slowness there, we have to wait for them to fix it though20:00
fungimriedem: i like "fracas"20:00
funginot a word i get to see very often20:01
mriedemjungleboyj: cinder-only version just sent20:01
clarkbfungi: once lists.o.o is confirmed happy I need to pop out for lunch and dinner prep. We are trying to do family thing today since we are leaving town for holidays to see other family20:02
clarkbso I may become a bit more afk as the day wears on20:02
fungiclarkb: noted, thanks for the heads up20:02
*** vkmc has quit IRC20:03
*** wolverineav has quit IRC20:05
mriedemfungi: i use fracas as much as humanly possibe20:06
fungigood call20:07
*** vkmc has joined #openstack-infra20:07
dansmithfungi: don't encourage him.20:08
*** bobh has quit IRC20:08
clarkbmriedem: total brainstorm thinking out loud mode here. But maybe You, myself, and say frickler can ask for say 1-3 volunteers that are interested in walking through the process of digging into failures, writing e-r bugs, then fixing things and try to start building a group of people that can jump in when things get really bad (and maybe periodically jump in to fix stuff and keep things running20:09
clarkbhappily)20:09
clarkbProblem is I'm getting on a plane next week then holidays happen so now isn't a great time, but maybe early january ish we do that?20:09
clarkbAnother idea is maybe we take the featurefreeze/RC period seriously and really push on this stuff then20:09
clarkbI think some of the problem definitely is that OpenStack is this giant piece of machinery and understanding the moving parts is hard20:10
*** kmalloc is now known as notmorgan20:10
*** notmorgan is now known as morgan20:10
fungiyeah, having a succession plan so mriedem doesn't feel obligated to jump on this stuff constantly would be great20:11
clarkbso if we can get people moving past that obstacle maybe we get more help on this20:11
jungleboyjmriedem: clarkb We do have a person that has been digging into Cinder issues more lately.20:11
mriedem¯\_(ツ)_/¯20:11
rm_workcould someone possibly poke at https://review.openstack.org/#/c/624574/ ? :P20:11
mriedemi'm out the week of xmas and the week of jan 720:12
mriedemour spec freeze in nova is jan 1020:12
jungleboyjHad been asking whoami-rajat to spend time looking at bugs.  Sounds like his time might be best focused on the check and gate issues?20:12
mriedemand i'm already behind on a bunch of crap for the people that pay my bills20:12
jungleboyjmriedem:  Feel your pain there.20:12
*** bobh has joined #openstack-infra20:12
mriedemso sure i'm willing to help there, and we've had some summit talks about this as well20:13
clarkbjungleboyj: maybe not best focused, but I do think it helps overall since nova needs cinder to be reliable to merge code and vice cersa20:13
fungithe descendants of sdague, j0g0 and mtreinish. sjmsquad20:13
mriedembut dealing with the gate is just kind of my extra curricular right now while i'm procrastinating on blueprint work20:13
clarkbjungleboyj: basically we don't want people to be super siloed on this type of work. The tests test "OpenStack"'s IaaS and if things don't work together well no one is happy20:13
mriedemhttp://status.openstack.org/elastic-recheck/#1763712 is an excellent example of that20:14
mriedemnova volume attach tests fail b/c cinder times out20:14
mriedemb/c of rpc call20:14
rm_workthanks clarkb :)20:14
fungiyeah, having people with a strong background in cinder looking at these problems would be great. having them focus exclusively on cinder bugs less so20:14
dansmithmriedem: on that one, do we time out waiting for them to wait, or would the rpc heartbeat help them help us?20:14
mriedemdansmith: c-api gets a messaging timeout from c-vol, which returns a 500 response to nova-compute20:15
*** e0ne has joined #openstack-infra20:15
dansmithso, could help...20:15
mriedemn-cpu isn't polling cinder-api for state changes20:15
mriedemi don't think so,20:15
mriedemdefault http response time is 60 seconds isn't it?20:15
dansmithdefault where?20:15
mriedemi believe for one of the cinder backup bugs i identified that the long rpc timeout could help20:16
jungleboyjfungi:  clarkb  I wasn't saying having them focus on Cinder bugs.  Focus on check/gate issues that we are causing.20:16
mriedemb/c it was an rpc timeout between c-bak and c-vol over rpc20:16
*** bobh has quit IRC20:16
dansmithif we're timing out at 60s then we'd generally not see the 50020:16
*** e0ne has quit IRC20:16
jungleboyjmriedem:  Not seeing the Cinder list in e-mail.  Did you sent it?20:16
clarkbjungleboyj: ya, I'm saying that longer term having them also debug neutron or nova or whatever is also valuable. For example my involvement in this has largely been to rule out/fix infra issues. Through that I've learned quite a bit about debugging openstack in general and so try to help in general too20:16
dansmithso I said it'd could help because if ours is 120s or something, it might actually complete with the longer timeout20:17
mriedemjungleboyj: yar20:17
mriedemjungleboyj: http://lists.openstack.org/pipermail/openstack-discuss/2018-December/000868.html20:17
mriedemdidn't tag for [cinder]20:17
clarkbjungleboyj: so maybe the starting point is "here are cinder specific check/gate issues" but if we can buidl that into "here are check/gate issues" thats even better20:17
jungleboyjAh, I just had to ask, it showed up.20:17
mriedemb/c it's more than just cinder20:17
*** jcoufal has joined #openstack-infra20:17
mriedemdansmith: i thought there used to be a thing for response timeouts in the api, but maybe that was when we had eventlet20:18
mriedemmordred: does ksa define any kind of http response timeout?20:18
clarkbfungi: lists should puppet soon20:18
clarkb(I'm watching on the bridge side)20:18
fungiyep, still watching in one of my numerous array of terminals20:18
dansmithmriedem: what's the operation we're waiting for, by the way?20:18
dansmithbecause timeout or not, we shouldn't be making synchronous calls that take that long anyway20:18
mriedemdansmith: this https://developer.openstack.org/api-ref/block-storage/v3/#update-an-attachment20:19
mriedemwhich is essentially the new version of os-initialize_connection,20:19
dansmithmriedem: why does that take a long time?20:19
mriedemwhich creates the export20:19
mriedemidk20:19
mriedemthis is why i would like cinder people to figure it out :)20:19
jungleboyjeharney: smcginnis  Can you guys take a quick look at the discussion between mriedem clarkb  and I above?20:20
mriedemit's been awhile since i tried digging into where time is spent there,20:20
mriedemmaybe c-vol is using a lock in a bad way,20:20
mriedeme.g. maybe something is holding a lock during a periodic task or something and we're stuck waiting on that20:20
dansmithmriedem: yeah, just trying to think of what could be taking that long legitimately20:20
dansmithyeah, sounds fishy to me20:20
mriedemi think we've also at times seen weird things where tgtadm takes more than a minute20:21
dansmithjungleboyj: ^20:21
jungleboyjI have seen all kinds of weird behavior when running on slow nodes.20:21
jungleboyjIt is hard to debug/fix though.20:21
*** auristor has quit IRC20:22
clarkbjungleboyj: if we can at least debug the source that helps us identify what we can change to reduce the slowness20:22
jungleboyjclarkb:  Agreed.20:22
clarkbeven if we can't fix that in the software itself, we might be able to take that info back to the cloud (as we've done with OVH)20:22
clarkbthat might also indicate a case where we want to isolate that specific test or set of tests so that they aren't competing for cpu time or memory etc20:23
clarkbit may still be slow, but probably more relaible if we can do that20:23
clarkb(basically there are things we can do if we identify the source of the slowness, but I agree it isn't always easy)20:24
mriedemdansmith: this is the backup bug where i identified we could use long_rpc_timeout https://bugs.launchpad.net/cinder/+bug/173948220:24
openstackLaunchpad bug 1739482 in Cinder "test_snapshot_backup fails to build backup due to MessagingTimeout" [Medium,Confirmed]20:24
mriedemsee comment 220:24
jungleboyjThe case where I saw issues here was due to disk contention.  Tried setting up an OpenStack cloud in a way that wasn't working well.  Pounding one disk too much.20:24
dansmithlocally I sometimes seem weird iscsi behavior when the source/dest are the same node20:24
dansmithso I wonder if we're causing ourselves pain by doing all-in-one here20:24
clarkbfungi: seems to have been a noop. So I think we are good on that futureparser change?20:25
jungleboyjdansmith:  Interesting.  I haven't seen that so much.20:25
clarkbfungi: if you confirm I'm gonna go make lunch20:25
dansmithlike tgtd and iscsiadm fighting for locks or buffer flushes or something20:25
fungiclarkb: complete and total noop, yes. go enjoy lunch!20:25
*** bobh has joined #openstack-infra20:27
jungleboyjmriedem: I am going to try and get eharney smcginnis e0ne and whoami-rajat together to discussion these issues and put together a plan to help with them.  That is the best place to start.20:27
mriedemthanks20:28
mriedemdansmith: maybe i was making stuff up about ksa / response timeouts https://github.com/openstack/keystoneauth/blob/ccf6cb79033b2083d9177823094f7836eb68ae0d/keystoneauth1/session.py#L24820:28
mriedemksa sessions have a timeout, but default to nothing20:28
openstackgerritMerged openstack-infra/project-config master: Add publish-to-pypi for octavia-lib  https://review.openstack.org/62457420:28
* dansmith nods20:28
dansmithmriedem: the individual clients might set that I guess, but..20:28
mriedemalthough the CLI sets 600 https://github.com/openstack/keystoneauth/blob/ebe781a3ea0386d6ff088a84e8dde26e538b856d/keystoneauth1/loading/session.py#L11620:28
mriedembut that's way high20:29
dansmithmriedem: anyway, it doesn't sound like waiting longer for that really makes sense in the long term if there's some silly issue we'd just be covering up20:29
*** jrist has joined #openstack-infra20:30
mriedemthis also reminds me that we used to get debug logs in nova's logs about calls using cinderclient but we don't get that debug logging anymore,20:31
mriedemmaking it hard to trace requests across services20:32
mriedemi think something might have changed in ksa recently with how that logging is setup20:32
*** bobh has quit IRC20:32
*** _alastor_ has quit IRC20:32
*** dklyle has joined #openstack-infra20:39
* mriedem moves to cinder20:39
*** bobh has joined #openstack-infra20:41
*** wolverineav has joined #openstack-infra20:44
*** bobh has quit IRC20:46
*** wolverineav has quit IRC20:52
*** bobh has joined #openstack-infra20:52
*** wolverineav has joined #openstack-infra20:55
*** smarcet has quit IRC20:55
*** yboaron_ has joined #openstack-infra20:56
*** bobh has quit IRC20:57
*** rcernin has joined #openstack-infra21:06
slaweqmordred: hi21:07
slaweqmordred: can You help me with one thing?21:07
slaweqmordred: we have in neutron neutron-functional job, which is still using legacy-dsvm-base as parent, I now want to move it to zuulv3, do You know what job template should I use for it? Should it be devstack-tox-functional ?21:09
slaweqor maybe someone else from infra team can help me?21:09
ianwfrickler: oh, hrm, switching devstack testing to bionic has caused dib issues because zypper isn't available on bionic.  i don't know what we're going to do about building opensuse-minimal on bionic hosts, this has been a known issue that i just forgot about21:11
*** bobh has joined #openstack-infra21:11
clarkbianw: possible we can use the xenial zypper package on bionic?21:13
ianwclarkb: when i looked, it's all a huge bundle of c++ ... and i'm assuming abi's are such that it won't work21:13
clarkbslaweq: what is the general setup of the neutron function job? It runs devsatck in a minimal setup then exectues tox or tempest or something else?21:14
ianwslaweq / clarkb: yeah looks like it runs tox at the end? http://git.openstack.org/cgit/openstack/neutron/tree/neutron/tests/contrib/post_test_hook.sh#n6421:15
ianwi think you're in the ballpark with devstack-tox-functional21:15
clarkb++21:15
clarkbianw: re zypper maybe suse ships a staticly compiled option to bootstrap things?21:16
*** bobh has quit IRC21:16
*** amuller has quit IRC21:16
ianwhrm, when i looked previously i never saw anything like that but maybe21:16
slaweqclarkb: currently it clones devstack but then runs our script: https://github.com/openstack/neutron/blob/master/neutron/tests/contrib/gate_hook.sh#L69 and then run dsvm-functional tests with tox21:16
clarkbthe new devstack jobs won't run devstack-gate hooks anymore. I think the idea there is to have your run stage execute that But maybe that happens after tox is invoked? that is the piece that might need figuring out21:17
ianwi don't want to blow out testing, but we should maybe run bionic and xenial tests for the nodepool dsvm dib tests ... since all our builders are actually xenial, but it would also be nice to have bionic gated too for future sanity21:18
clarkbianw: ++21:18
*** xek__ has quit IRC21:18
slaweqianw: clarkb: if whole devstack will be run, it is also fine for our tests - that should works too, thx then - I will play with this devstack-tox-functional job then21:18
ianwi guess we could upgrade builders, but we'd have to isolate opensuse-minimal to an older xenial host, in a similar way i guess to how we isolate arm64 builds to nb0321:19
rm_workcould someone poke along https://review.openstack.org/#/c/624551/ ?21:19
rm_workpretty please? (octavia-lib release)21:20
ianwoh, but we're trying for puppet free bionic hosts, which means a whole thing around ansible-ising dib install etc etc ... hrm21:20
clarkbrm_work: you'll want to ping the release team about that. #openstack-release21:20
rm_workclarkb: ah ok, i looked quickly at past +2s and it seemed mostly people here :P21:22
rm_worki'll go poke them there tho21:22
pabelangerianw: we had the same issue with zypper missing in xenial, but somebody fixed it. Maybe dirk, cannot remember21:22
*** yboaron_ has quit IRC21:24
*** bobh has joined #openstack-infra21:26
ianwpabelanger: hrm, i guess I'll just be thankful that I have no memory of that :)21:26
ianwit doesn't sound like fun21:26
pabelangerianw: I want to say somebody had commit rights for ubuntu, but can't completely remember21:27
*** bobh has quit IRC21:30
ianwi thought there was a bug, maybe not, so i filed https://bugs.launchpad.net/ubuntu/+source/zypper/+bug/1808230 which is content free, but at least something we can refer to21:30
openstackLaunchpad bug 1808230 in zypper (Ubuntu) "Zypper unavailable on bionic" [Undecided,New]21:30
openstackgerritHongbin Lu proposed openstack-infra/project-config master: Rename neutron ryu jobs  https://review.openstack.org/62481421:30
*** jamesmcarthur has joined #openstack-infra21:33
*** jcoufal has quit IRC21:34
*** bobh has joined #openstack-infra21:40
*** bobh has quit IRC21:44
*** lpetrut has joined #openstack-infra21:44
*** _alastor_ has joined #openstack-infra21:47
cmurphyclarkb: fungi sorry i wasn't around, i assume it went okay with lists.o.o?21:47
cmurphythere are a bunch more queued up if you're feeling adventurous21:48
openstackgerritDavid Moreau Simard proposed openstack-infra/zuul-jobs master: Vendor the RDO repository configuration for installing OVS  https://review.openstack.org/62481721:49
fungicmurphy: yep, totally fine--thanks again!!!21:49
*** slaweq has quit IRC21:49
*** bobh has joined #openstack-infra21:51
*** bobh has quit IRC21:55
openstackgerritDavid Moreau Simard proposed openstack-infra/zuul-jobs master: Vendor the RDO repository configuration for installing OVS  https://review.openstack.org/62481721:57
*** bobh has joined #openstack-infra21:58
*** dklyle has quit IRC22:00
*** bobh has quit IRC22:03
*** trown is now known as trown|outtypewww22:04
*** slaweq has joined #openstack-infra22:06
ianwdmsimard: heh, great minds think alike :)22:07
clarkbhrm http://logs.openstack.org/85/605585/17/check/system-config-run-docker/77e61f6/job-output.txt.gz#_2018-12-12_19_48_59_704238 indicates the docker service wasn't running, but reading the docker-ce packaging it should be22:08
*** jamesmcarthur has quit IRC22:08
clarkb"Setting up docker-ce (5:18.09.0~3-0~ubuntu-bionic) ...", "update-alternatives: using /usr/bin/dockerd-ce to provide /usr/bin/dockerd (dockerd) in auto mode", "Created symlink /etc/systemd/system/multi-user.target.wants/docker.service -> /lib/systemd/system/docker.service.", "Job for docker.service failed because the control process exited with error code.", "See \"systemctl status docker.service\"22:09
clarkband \"journalctl -xe\" for details."22:09
clarkbianw: ^ any ideas. Also Shrews ++ for the usggestion to test this which catches ^22:09
*** slaweq has quit IRC22:10
ianwhrm, i saw something like this when i put an invalid ipv6 config in maybe?22:10
ianwi think these jobs should probably start capturing syslog and things from the host22:11
clarkbianw: there is the ipv6 config in there still, any idea if the current ps has a valid config?22:11
ianwi thought it was valid, but i have been known to be wrong :)22:12
* clarkb googles docker.json config22:12
clarkbooh we can also set iptables to false22:13
*** jamesmcarthur has joined #openstack-infra22:15
*** jamesmcarthur has quit IRC22:15
*** slaweq has joined #openstack-infra22:15
*** jamesmcarthur has joined #openstack-infra22:18
*** slaweq has quit IRC22:20
clarkbI think the ipv6 setting is valid. Now wondering if we have to specify some other config that we aren't specifying22:21
*** bobh has joined #openstack-infra22:21
*** boden_ has joined #openstack-infra22:21
ianwi hate json config files22:21
ianwclarkb: is the best thing to do get log capturing at this point?  seems like it will be useful work in the future anyway22:22
clarkbianw: likely22:22
*** mnasiadka_ has joined #openstack-infra22:23
*** benj_- has joined #openstack-infra22:23
clarkbI really need to figure out apparmor and libvirt locally so that I can boot VMs again22:25
* clarkb does that now since its impacted a couple things at this point22:25
*** bobh has quit IRC22:26
*** _alastor_ has quit IRC22:26
*** boden_ has quit IRC22:26
*** uberjay has joined #openstack-infra22:29
*** mnasiadka has quit IRC22:29
*** andreykurilin has quit IRC22:29
*** boden has quit IRC22:29
*** rh-jelabarre has quit IRC22:29
*** verdurin has quit IRC22:29
*** masayukig[m] has quit IRC22:29
*** eumel8 has quit IRC22:29
*** lewo has quit IRC22:29
*** uberjay_ has quit IRC22:29
*** smcginnis has quit IRC22:29
*** benj_ has quit IRC22:29
*** logan- has quit IRC22:29
*** mnasiadka_ is now known as mnasiadka22:29
*** rh-jelabarre has joined #openstack-infra22:29
*** wolverineav has quit IRC22:31
*** logan- has joined #openstack-infra22:31
*** verdurin has joined #openstack-infra22:31
*** gouthamr has quit IRC22:32
*** wolverineav has joined #openstack-infra22:32
*** lpetrut has quit IRC22:32
*** irdr has quit IRC22:33
*** irdr has joined #openstack-infra22:34
*** gouthamr has joined #openstack-infra22:35
*** bobh has joined #openstack-infra22:38
*** jamesmcarthur has quit IRC22:40
*** slaweq has joined #openstack-infra22:42
*** bobh has quit IRC22:42
*** slaweq has quit IRC22:47
*** kgiusti has left #openstack-infra22:49
openstackgerritIan Wienand proposed openstack-infra/system-config master: Collect syslogs from nodes in ansible tests  https://review.openstack.org/62482722:54
*** bobh has joined #openstack-infra23:00
*** jamesmcarthur has joined #openstack-infra23:00
openstackgerritIan Wienand proposed openstack-infra/system-config master: [wip] Collect syslogs from nodes in ansible tests  https://review.openstack.org/62482723:03
clarkbok I think I have working libvirt again. Now just waiting on bionic image to download23:03
* clarkb reviews ^23:03
*** bobh has quit IRC23:04
clarkbianw: left a note, I think the second stat may overwrite the previous?23:06
*** smcginnis has joined #openstack-infra23:07
*** rkukura_ has joined #openstack-infra23:08
ianwoh yeah, good point, it only works if the first is a negative result and the second is positive23:09
*** rkukura has quit IRC23:11
*** rkukura_ is now known as rkukura23:11
*** bobh has joined #openstack-infra23:12
openstackgerritClark Boylan proposed openstack-infra/zuul-jobs master: Vendor the RDO repository configuration for installing OVS  https://review.openstack.org/62481723:14
*** bobh has quit IRC23:16
openstackgerritMerged openstack-infra/system-config master: Prefix install_openstacksdk variable  https://review.openstack.org/62146223:19
openstackgerritMerged openstack-infra/system-config master: Add support for enabling the ARA callback plugin in install-ansible  https://review.openstack.org/61122823:19
openstackgerritMerged openstack-infra/system-config master: Enable ARA reports for system-config bridge CI jobs  https://review.openstack.org/61721623:19
dmsimardianw: ^ yay, thanks for your help23:19
ianwif we're going to start collecting even more logs, we should start from that base23:20
*** gfidente|afk has quit IRC23:24
*** jamesmcarthur has quit IRC23:28
*** jamesmcarthur_ has joined #openstack-infra23:28
*** rlandy is now known as rlandy|bbl23:28
*** jamesmcarthur_ has quit IRC23:30
*** jamesmcarthur has joined #openstack-infra23:30
openstackgerritIan Wienand proposed openstack-infra/system-config master: [wip] Collect syslogs from nodes in ansible tests  https://review.openstack.org/62482723:31
*** dklyle has joined #openstack-infra23:35
openstackgerritIan Wienand proposed openstack-infra/system-config master: [wip] Collect syslogs from nodes in ansible tests  https://review.openstack.org/62482723:43
*** dklyle has quit IRC23:45
*** weshay is now known as weshay_pto23:46
*** diablo_rojo has joined #openstack-infra23:47
*** auristor has joined #openstack-infra23:49
jonherzuul is not doing gate checks for https://review.openstack.org/624482 am I missing something?23:50
*** bobh has joined #openstack-infra23:52
*** jamesmcarthur has quit IRC23:53
*** bobh has quit IRC23:56

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!