Wednesday, 2018-09-26

*** ooolpbot has joined #tripleo00:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION00:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537400:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)00:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256000:10
*** ooolpbot has quit IRC00:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)00:10
*** jamesdenton has quit IRC00:36
openstackgerritIan Wienand proposed openstack/diskimage-builder master: Allow debootstrap to cleanup without a kernel  https://review.openstack.org/60469200:42
openstackgerritEmilien Macchi proposed openstack/tripleo-quickstart-extras master: Introduce undercloud_container_cli parameter  https://review.openstack.org/60051200:50
openstackgerritEmilien Macchi proposed openstack/tripleo-quickstart-extras master: Fix quickstart undercloud selinux configuration  https://review.openstack.org/60270300:50
weshayhrm... scenario00100:51
*** rlandy has joined #tripleo01:05
*** ooolpbot has joined #tripleo01:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION01:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537401:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256001:10
*** ooolpbot has quit IRC01:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)01:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)01:10
EmilienMweshay: timeouts?01:16
*** rlandy has quit IRC01:19
*** mrsoul has joined #tripleo01:19
*** phuongnh has joined #tripleo01:22
*** mschuppert has quit IRC01:23
weshayone01:28
weshayit failed twice01:28
weshaysince we turned off validations01:28
*** owalsh_ has joined #tripleo01:29
*** owalsh has quit IRC01:33
*** rh-jelabarre has quit IRC01:34
openstackgerritEmilien Macchi proposed openstack/ansible-role-redhat-subscription master: Add support for RHSM Pools  https://review.openstack.org/60529001:44
*** jamesdenton has joined #tripleo01:46
openstackgerritwes hayutin proposed openstack/tripleo-quickstart-extras master: libvirt standalone deployment  https://review.openstack.org/59107701:55
openstackgerritwes hayutin proposed openstack/tripleo-quickstart-extras master: f28 support for quickstart  https://review.openstack.org/59165201:57
openstackgerritwes hayutin proposed openstack/tripleo-quickstart-extras master: libvirt standalone deployment  https://review.openstack.org/59107701:58
*** ooolpbot has joined #tripleo02:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION02:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537402:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)02:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256002:10
*** ooolpbot has quit IRC02:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)02:10
openstackgerritwes hayutin proposed openstack/tripleo-quickstart-extras master: libvirt standalone deployment  https://review.openstack.org/59107702:37
*** psachin has joined #tripleo02:40
*** apetrich has quit IRC02:42
*** ooolpbot has joined #tripleo03:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION03:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537403:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)03:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256003:10
*** ooolpbot has quit IRC03:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)03:10
pvchi tripleo team03:14
pvcany one knows why my NetworkDeployment is not completing?03:15
pvcor should i wait a little bit more03:15
openstackgerritMerged openstack/tripleo-heat-templates stable/rocky: Stop cap granting to empty pool when telemetry disabled  https://review.openstack.org/60473403:22
*** skramaja has joined #tripleo03:25
*** ramishra has joined #tripleo03:26
*** udesale has joined #tripleo03:53
*** itlinux has joined #tripleo03:55
itlinuxhello all, can someone point me to the right docs on how to do minor updates on PIKE BM. I get this issue.. http://paste.openstack.org/show/730915/03:56
*** ykarel has joined #tripleo03:59
*** ooolpbot has joined #tripleo04:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION04:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537404:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)04:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256004:10
*** ooolpbot has quit IRC04:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)04:10
*** pcaruana has joined #tripleo04:14
*** nawar has joined #tripleo04:24
*** mcornea has quit IRC04:25
*** shyamb has joined #tripleo04:26
*** shyamb has quit IRC04:31
*** shyamb has joined #tripleo04:34
*** pcaruana has quit IRC04:38
*** Petersingh has joined #tripleo04:46
*** Petersingh is now known as Petersingh|afk04:47
*** zb has joined #tripleo04:58
*** dxiri has joined #tripleo05:00
*** zaneb has quit IRC05:01
*** nawar has quit IRC05:06
*** abishop_ has quit IRC05:08
*** ooolpbot has joined #tripleo05:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION05:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537405:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256005:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)05:10
*** ooolpbot has quit IRC05:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)05:10
*** abishop has joined #tripleo05:12
Tenguhello there05:14
*** Petersingh|afk is now known as Petersingh05:18
jaosorioryoo05:19
itlinuxhello all..05:21
pvchello all05:21
itlinuxjaosorior: and Tengu:05:21
pvchello itlinux05:21
jaosorioritlinux: hey! how's it going?05:21
itlinuxjust trying to figure the update minor issue.. need your help there..05:22
itlinux http://paste.openstack.org/show/730915/05:22
itlinuxany tips on what to look for on this..05:22
*** dxiri has quit IRC05:22
itlinuxit's driving me crazy :)05:22
jaosorioritlinux: I don't have a lot of knowledge about the update and upgrades workflow yet :/05:22
jaosoriorunsupported parameters for pacemaker_cluster05:22
itlinuxok.. I will check with M.. what timezone is he at?05:23
jaosoriorsounds like you're using the wrong version of the pacemaker ansible module05:23
itlinuxthat's default I have not changed anything on pacemaker..05:23
jaosoriornot pacemaker itself05:24
jaosoriorbut the pacemaker ansible module05:24
jaosorioruhm...05:24
jaosoriorbandini: ^^05:24
itlinuxI just deployed the overcloud .. so there is nothing I changed and used one of the default img05:24
itlinuxahh the italian guy! ok05:24
itlinuxI will reach out to him and see what he has to say :)05:24
*** shyamb has quit IRC05:28
*** shyamb has joined #tripleo05:36
*** nawar has joined #tripleo05:38
quique|rover|offGood morning05:38
*** quique|rover|off is now known as quiquell|rover05:40
quiquell|roverykarel: you there ?05:40
ykarelquiquell|rover, yes05:40
quiquell|roverykarel: Did you found something about timeouts and overcloud deploy ?05:41
*** pcaruana has joined #tripleo05:43
*** apetrich has joined #tripleo05:46
Tenguquiquell|rover: heya! for what I know, weshay patch was merged yesterday evening (CET), and apparently it did help a bit.05:47
Tenguquiquell|rover: also, my patch for the selinux issue is working, and should be hopefully gating today. maybe.05:47
Tengueventually.05:48
quiquell|roverTengu: Cool, I still see some timeouts, but at least now we don't have to look at validations05:48
Tenguquiquell|rover: hmm ok. well, zuul's pretty loaded as well as the gate.05:49
ykarelquiquell|rover, no i didn't looked much there, but what i saw is stack creation and config-download were ok, just overcloudrc action stuck for long05:49
*** cylopez has joined #tripleo05:51
quiquell|roverykarel: Yep, found that at one timeout, didn't find it elsewhere05:52
ykarelquiquell|rover, really? i saw in multiple jobs05:52
quiquell|roverykarel: Can you add some of them to the lp of the timeout ?05:55
*** jistr has quit IRC05:55
quiquell|roverykarel: or open new one, so we close the validations ?05:55
ykarelquiquell|rover, okk will look in some time05:55
*** jistr has joined #tripleo05:56
quiquell|roverykarel: Or give me the logs you found I will open it05:56
*** jfrancoa has joined #tripleo05:57
*** mschuppert has joined #tripleo05:59
quiquell|rovermschuppert: Good morning, we have a queens promotion06:01
*** iranzo has joined #tripleo06:02
mschuppertquiquell|rover: perfect! will rerun the job06:03
quiquell|rovermschuppert: Let me know if it works06:04
Tenguykarel quiquell|rover : lemme know if I can help a bit06:04
mschuppertquiquell|rover: sure06:04
quiquell|roverykarel: openning the lp with the overcloud issue06:05
*** ksambor has joined #tripleo06:06
openstackgerritMartin Schuppert proposed openstack/puppet-tripleo stable/queens: Revert "Revert "SSL support for haproxy -> novnc proxy connection""  https://review.openstack.org/59414506:07
quiquell|roverykarel: https://bugs.launchpad.net/tripleo/+bug/1794406:07
openstackLaunchpad bug 17944 in samba (Ubuntu) "samba: new changes from Debian require merging" [Medium,Fix released] - Assigned to Adam Conrad (adconrad)06:07
*** sanjayu_ has joined #tripleo06:08
*** shyamb has quit IRC06:09
*** nawar has quit IRC06:09
*** ooolpbot has joined #tripleo06:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION06:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537406:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)06:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256006:10
*** ooolpbot has quit IRC06:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)06:10
*** shyamb has joined #tripleo06:11
*** nawar has joined #tripleo06:11
*** ratailor has joined #tripleo06:12
nawarhi06:12
*** jtomasek has joined #tripleo06:13
*** holser_ has joined #tripleo06:15
*** dmacpher_ has quit IRC06:15
ykarelquiquell|rover, wrong link06:15
quiquell|roverykarel: https://bugs.launchpad.net/tripleo/+bug/179441806:15
openstackLaunchpad bug 1794418 in tripleo "Overcloud deploy error creating overcloudrc" [Critical,Triaged] - Assigned to Quique Llorente (quiquell)06:15
*** dmacpher has joined #tripleo06:16
ykarelquiquell|rover, adding few more links and the relation to timeout06:17
jaosoriorholser_: around?06:18
holser_jaosorior yeah06:18
quiquell|roverykarel: Let's work that06:18
holser_Good morning06:18
quiquell|roverykarel: Do you think it can be realted to the guard added to tripleoclient ?06:18
jaosoriorholser_: was it you that added the topic to the stein forum that's titled: "Zero footprint installer, interests and progress" ?06:19
ykarelquiquell|rover, might be but not sure, so i asked exactly when we started seeing it and in which branches, if it's around the time merged and master only then it can be related06:19
holser_jaosorior let me have a look at etherpad...06:19
* holser_ looking06:20
jaosoriorholser_: https://etherpad.openstack.org/p/tripleo-forum-stein06:20
holser_jaosorior nope ....06:22
holser_I don't recall that I added that06:22
nawari need your help06:22
jaosoriorholser_: the color matched yours... so it must have been someone else.. don't know who though06:22
jaosoriorneed to poke them to get more info on it06:22
holser_I understand but it was not me06:22
jaosoriorholser_: sure, no problem.06:23
jaosoriorthanks06:23
holser_You may add comment and delete in a couple weeks if noone cares06:23
nawarI'm trying to scale up my env of 2 nodes and add new compute but I'm getting this exception: KeyError: 'passwords' when redeploy06:24
jaosoriornawar: what version?06:28
ykarelquiquell|rover, commented https://bugs.launchpad.net/tripleo/+bug/1794418/comments/106:30
openstackLaunchpad bug 1794418 in tripleo "Overcloud deploy error creating overcloudrc" [Critical,Triaged] - Assigned to Quique Llorente (quiquell)06:30
*** dtrainor has quit IRC06:33
*** yprokule has joined #tripleo06:34
*** aufi has joined #tripleo06:37
openstackgerritMerged openstack/tripleo-heat-templates master: Allow a containerized logrotate to access docker  https://review.openstack.org/59627406:37
*** shyamb has quit IRC06:38
nawarqueens06:42
*** quiquell|rover is now known as quique|rover|brb06:43
*** rdopiera has joined #tripleo06:43
*** chkumar|off is now known as chkumar|ruck06:44
chkumar|ruckykarel: is there some problem with overcloud deploy in timedout?06:45
ykarelchkumar|ruck, which problem?06:45
chkumar|ruckykarel: looking at this bug https://bugs.launchpad.net/tripleo/+bug/1794418/06:46
openstackLaunchpad bug 1794418 in tripleo "Overcloud deploy error creating overcloudrc" [Critical,Triaged] - Assigned to Quique Llorente (quiquell)06:46
ykarelchkumar|ruck, yes06:46
ykarelseen in multiple jobs so issue is there06:46
openstackgerritJose Luis Franco proposed openstack/tripleo-heat-templates master: Check if openstack-glance-registry is enabled before stopping it.  https://review.openstack.org/60358106:46
nawar jaosorior: queens06:50
openstackgerritMerged openstack/tripleo-heat-templates stable/rocky: undercloud/stackrc: unset OS_* variables  https://review.openstack.org/60493606:51
chkumar|ruckTengu: Hello06:52
chkumar|ruckTengu: http://logs.rdoproject.org/openstack-periodic/git.openstack.org/openstack-infra/tripleo-ci/master/legacy-periodic-tripleo-ci-centos-7-ovb-1ctlr_1comp-featureset020-master/5efcbac/logs/undercloud/home/zuul/undercloud_reinstall.log.txt.gz fs020 is still failing in master at same step with different error06:53
chkumar|ruckTASK [Create /var/lib/config-data directory] ***********************************06:53
chkumar|ruck2018-09-26 01:33:54 | fatal: [undercloud]: FAILED! => {"changed": false, "msg": "path /var/lib/config-data/crond/etc/../usr/share/zoneinfo/UTC does not exist", "path": "/var/lib/config-data/crond/etc/../usr/share/zoneinfo/UTC", "state": "absent"}06:53
Tenguchkumar|ruck: err, that's not related to my selinux work, that's for sure.06:56
Tenguo____O06:57
Tenguthere isn't any recurse nor absent in the code.06:57
Tenguwtf06:57
Tenguchkumar|ruck: my patch was successful at least: https://review.rdoproject.org/r/#/c/16354/06:58
*** shyamb has joined #tripleo06:58
Tenguchkumar|ruck: BUT it's still not merged.06:59
Tengumight be that?06:59
openstackgerritMerged openstack/tripleo-specs master: Support for Podman in Stein  https://review.openstack.org/60248006:59
openstackgerritMerged openstack/tripleo-specs master: Improve upgrades_tasks CI coverage with standalone for Stein  https://review.openstack.org/57985406:59
chkumar|ruckTengu: Ah thanks :-)06:59
*** quique|rover|brb is now known as quiquell|rover07:00
quiquell|roverchkumar|ruck: o/07:00
chkumar|ruckTengu: two patches are in zuul merge queue07:00
chkumar|ruckhttps://review.openstack.org/#/c/605039/ and https://review.openstack.org/#/c/602703/07:00
chkumar|ruckhope this will promote master also07:00
chkumar|ruckquiquell|rover: \o/07:00
Tenguchkumar|ruck: first one is mine07:00
chkumar|ruckquiquell|rover: I am not sure this is a timedout case on rdo cloud http://logs.rdoproject.org/openstack-periodic/git.openstack.org/openstack-infra/tripleo-ci/master/legacy-periodic-tripleo-ci-centos-7-multinode-1ctlr-featureset018-master/918dfed/job-output.txt.gz07:02
chkumar|ruckyesterday I have seen this type of one also07:02
quiquell|roverchkumar|ruck: fs037 is still non voting wy do we have it at master criteria ?07:02
quiquell|roverchkumar|ruck: fs037 is updates07:02
*** rcernin has quit IRC07:02
chkumar|ruckquiquell|rover: nope07:02
chkumar|ruckquiquell|rover: https://github.com/rdo-infra/ci-config/blob/master/ci-scripts/dlrnapi_promoter/config/master.ini#L2807:03
chkumar|ruckit is still non-voting that's why07:03
quiquell|roverchkumar|ruck: Ahh ok, I think I open old version07:04
chkumar|ruckquiquell|rover: anyway fs037 is broken07:05
chkumar|ruckquiquell|rover: good to have a bz07:05
chkumar|ruckquiquell|rover: I will be looking into telemetry tempest issue why it is not working after fixes07:05
chkumar|ruckand some tempest podman stuff07:05
chkumar|ruckfor fs02707:05
*** amoralej|off is now known as amoralej07:06
*** cylopez has quit IRC07:07
*** cylopez has joined #tripleo07:07
*** sanjayu_ has quit IRC07:08
quiquell|roverchkumar|ruck: fs027 arrg ok07:10
*** ooolpbot has joined #tripleo07:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION07:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537407:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)07:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256007:10
*** ooolpbot has quit IRC07:10
quiquell|roverchkumar|ruck: I will look into overcloud deploy timeout07:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)07:10
chkumar|ruckquiquell|rover: cool!07:10
quiquell|roverchkumar|ruck: the overcloud deploy timeout is not promotion blocker ?07:10
*** waleedm has joined #tripleo07:12
nawardo I report bug ? or not07:13
openstackgerritMerged openstack/tripleo-common master: config: ignore missing server_id from the stack  https://review.openstack.org/60448307:14
openstackgerritMerged openstack/tripleo-common stable/queens: Avoid getting one-empty-element-list in blacklisted_hostnames.  https://review.openstack.org/60416507:14
*** Petersingh is now known as Petersingh|afk07:20
*** psachin has quit IRC07:21
quiquell|roverykarel: I don't find any pre 21th timeout with the overcloudrc issue, have to be related07:21
ykarelquiquell|rover, okk good to keep an eye, i think u are merging that in rocky too07:22
ykarelso good to get some feedback from someone from mistral07:23
quiquell|roverykarel: It's like we have convert a tripleoclient failure into a timeout07:23
*** shardy has joined #tripleo07:23
ykarelyes07:23
quiquell|roverykarel: Failure can be to throw proper exception instead of just return in the guard07:24
quiquell|roverykarel: But we have to find why zaqar websocket connection is not open at the time07:25
quiquell|roverykarel: Can be infra07:25
ykarelno idea07:25
*** Petersingh|afk is now known as Petersingh07:27
*** psachin has joined #tripleo07:27
*** shyamb has quit IRC07:29
pvchi07:33
pvcanyone experience stuck on this part07:34
pvc2018-09-26 07:08:36Z [overcloud.Controller.0.NetworkDeployment]: CREATE_IN_PROGRESS  state changed07:34
*** shyamb has joined #tripleo07:35
Tenguchkumar|ruck: -.- post_failure. damn zuul.07:36
*** bogdando has joined #tripleo07:43
*** shyamb has quit IRC07:45
*** jpich has joined #tripleo07:48
*** chem has joined #tripleo07:51
*** jpena|off is now known as jpena07:51
quiquell|roverykarel: I see warning at zaqar around the problem of overcloud deploy http://logs.openstack.org/47/604447/2/check/tripleo-ci-centos-7-containers-multinode/e18e77f/logs/undercloud/var/log/containers/zaqar/zaqar-server.log.txt.gz#_2018-09-25_20_02_53_06307:57
*** rcernin has joined #tripleo07:57
quiquell|roverAlso a WebSocket connection closed: None07:57
*** Petersingh is now known as Petersingh|lunch07:57
ykarelquiquell|rover, i guess those warning should be in success job as well07:58
ykarelso seems unrelated if that's the case07:58
ykarelwebsocket can be related07:58
*** ykarel is now known as ykarel|lunch07:59
openstackgerritRedHat RDO CI proposed openstack/tripleo-heat-templates master: GATE CHECK for TripleO  https://review.openstack.org/60429808:00
openstackgerritRedHat RDO CI proposed openstack/tripleo-heat-templates stable/rocky: GATE CHECK for TripleO  https://review.openstack.org/60429308:00
*** akrivoka has joined #tripleo08:00
openstackgerritBogdan Dobrelya proposed openstack/tripleo-heat-templates stable/queens: Allow a containerized logrotate to access docker  https://review.openstack.org/60155508:02
openstackgerritBogdan Dobrelya proposed openstack/tripleo-heat-templates stable/pike: Allow a containerized logrotate to access docker  https://review.openstack.org/60534808:02
openstackgerritBogdan Dobrelya proposed openstack/tripleo-heat-templates stable/rocky: Allow a containerized logrotate to access docker  https://review.openstack.org/60534908:03
*** Guest42266 has joined #tripleo08:04
openstackgerritMarios Andreou proposed openstack-infra/tripleo-ci master: Removes EXTRA_TAGS from toci_gate_test&quickstart.sh.j2  https://review.openstack.org/60476808:08
thervequiquell|rover: What kind of issues do you see related to zaqar?08:10
*** ooolpbot has joined #tripleo08:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION08:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537408:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)08:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256008:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179441808:10
*** ooolpbot has quit IRC08:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)08:10
openstackLaunchpad bug 1794418 in tripleo "Overcloud deploy error creating overcloudrc" [Critical,Triaged] - Assigned to Quique Llorente (quiquell)08:10
quiquell|rovertherve: Running tripleo.deployment.v1.create_overcloudrc08:11
therveThat last bug?08:11
quiquell|rovertherve: give us this https://bugs.launchpad.net/tripleo/+bug/179441808:11
openstackLaunchpad bug 1794418 in tripleo "Overcloud deploy error creating overcloudrc" [Critical,Triaged] - Assigned to Quique Llorente (quiquell)08:11
therveLooking08:12
quiquell|rovertherve: suspect is https://review.openstack.org/#/c/603802/08:12
holser_bandini I have assigned https://bugzilla.redhat.com/show_bug.cgi?id=1631674 back to me08:12
*** moshele has joined #tripleo08:12
openstackbugzilla.redhat.com bug 1631674 in python-tripleoclient "[UPGRADES][14] MySQL passwords ain't synchronized during when running deploy_steps_playbook.yaml" [Urgent,New] - Assigned to michele08:12
quiquell|rovertherve: What I don't really know is why do we end in a loop or why the websocket connection is close08:12
quiquell|rovertherve: The review fixed an issue at tripleoclient failint at closed connection08:13
quiquell|rovertherve: Maybe doing a "rerturn" is not correct08:13
quiquell|rovertherve: But I really want to know why the connection is closed08:13
*** hjensas has joined #tripleo08:15
*** gkadam has joined #tripleo08:16
thervequiquell|rover: http://logs.openstack.org/91/587491/4/check/tripleo-ci-centos-7-containers-multinode/f618803/logs/undercloud/home/zuul/overcloud_deploy.log.txt.gz#_2018-09-25_08_20_00 shows the log08:21
therveSo we don't get into your return clause08:21
*** ykarel|lunch is now known as ykarel08:21
therveThat return is bogus though. It should really be an exception08:23
quiquell|rovertherve: I don't think it's exception, just do yield with empty values08:24
quiquell|roverykarel: wait_for_messages at stuff without connection is like "We don't have more messages"08:24
quiquell|rovertherve: ^08:24
thervequiquell|rover: Why?08:24
*** noama has joined #tripleo08:26
openstackgerritChandan Kumar proposed openstack/tripleo-quickstart-extras master: Add podman support to validate-tempest role  https://review.openstack.org/60535608:26
therveIf we still publish messages, the client won't wait for them08:26
*** shyamb has joined #tripleo08:26
openstackgerritChandan Kumar proposed openstack/tripleo-quickstart master: Switch fs027 to deploy with podman  https://review.openstack.org/60051708:28
quiquell|rovertherve: The guard fixed an issue of tripleo-client failing beacuase an Exception was rise there (stack was correctly installed though)08:29
quiquell|rovertherve: was like a false negative08:29
quiquell|rovertherve: If I add a raise there we go back to the issue08:29
*** sanjayu_ has joined #tripleo08:29
quiquell|rovertherve: now feels like we are going to the next issue after cleaning up the previous08:29
thervequiquell|rover: Or maybe it was intermittent and nothing changed?08:30
*** jistr has quit IRC08:30
therveThe fact that one CI run passed doesn't mean much08:30
quiquell|rovertherve: Can i add raise there and tripleoclient not failing ? I don't want a false negative again08:30
quiquell|rovertherve: It was not passing08:30
*** fhubik has joined #tripleo08:30
therveWell it was before, so what changed?08:30
therveAt the very least, I would try to catch the error from recv, instead of returning right away08:31
therveIt's possible that we read a bunch of messages successfully, and then fail08:31
*** jistr has joined #tripleo08:31
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-quickstart master: Add new featureset 056 for standalone upgarde.  https://review.openstack.org/60536308:33
*** derekh has joined #tripleo08:33
*** derekh has joined #tripleo08:34
*** shardy has quit IRC08:35
*** shardy has joined #tripleo08:36
quiquell|rovertherve: but where is the infinite loop ?08:37
quiquell|rovertherve: the while true ?08:37
thervequiquell|rover: Yeah? You didn't break if the connection is opened08:37
quiquell|rovertherve: maybe we are not consuming stuff with recv ?08:37
thervequiquell|rover: Can we get the ansible-errors.json file?08:38
therveIt's not in the CI logs08:38
quiquell|rovertherve: The issue is exactly that, after the long run the file is not there08:39
thervequiquell|rover: Should we fix that instead? :)08:39
quiquell|rovertherve: Hehe agre we were just suspicous of the guard (and I am not ok with the 'return')08:40
Tenguso, validations weren't the culprit for the time_out apparently :(08:41
*** nawar has quit IRC08:42
quiquell|roverTengu: We have other issues looks like, but now that we have remov validation it's easier08:42
quiquell|roverTengu: reducing valirabes looking at timeouts is gold08:42
quiquell|roverTengu: gates are better also08:43
Tengu:)08:43
Tenguapparently time_outs are mainly for t-h-t changes.08:43
Tenguquiquell|rover: if you want a fresh look: http://logs.openstack.org/39/605039/1/check/tripleo-ci-centos-7-scenario004-multinode-oooq-container/1a10cd5/  time_out  and that one got a post_failure: http://logs.openstack.org/39/605039/1/check/tripleo-ci-centos-7-scenario003-multinode-oooq-container/beb1223/08:45
*** owalsh_ is now known as owalsh08:45
quiquell|roverTengu: hate you!!! this is different :-(08:46
Tengusorry ;)08:47
quiquell|roverno overcloudrc issue08:47
chkumar|ruckTengu: post_Failure have everything passing08:47
quiquell|roverTengu: 1:30 overcloud deploy08:47
quiquell|roverthat's really wrong08:47
*** tosky has joined #tripleo08:48
Tenguchkumar|ruck: ah, it doesn't fail the whole thing?08:48
quiquell|roverTengu: later08:48
chkumar|ruckTengu: it is just a post failure failed to upload to logs somehow08:48
quiquell|roverchkumar|ruck: But the other is not08:48
chkumar|ruckhttp://logs.openstack.org/39/605039/1/check/tripleo-ci-centos-7-scenario003-multinode-oooq-container/beb1223/job-output.txt.gz08:49
*** apetrich has quit IRC08:49
chkumar|ruckquiquell|rover: yes, other is timed out08:49
quiquell|roverchkumar|ruck: damn overcloud ARA task are wrong... we have to fix it08:50
*** nawar has joined #tripleo08:51
sshnaidmstevebaker, hi, I still wait for explanation why you merged this patch when there is discussion in progress there? https://review.openstack.org/#/c/599358/08:52
*** chkumar|ruck has quit IRC08:58
*** chandankumar has joined #tripleo08:59
*** chandankumar is now known as chkumar|ruck09:00
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-quickstart master: Add new featureset 056 for standalone upgarde.  https://review.openstack.org/60536309:00
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-quickstart master: Put create repo script into its own tasks file.  https://review.openstack.org/60536909:00
*** ykarel is now known as ykarel|away09:01
*** sileht has quit IRC09:02
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Refactor workflow actions  https://review.openstack.org/60308009:03
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: convert PlansActions to named exports to avoid using 'this' in thunks  https://review.openstack.org/60308109:03
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Convert NodesActions to named expoers to avoid using 'this' in thunks  https://review.openstack.org/60308209:03
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Convert ValidationsActions to named exports to avoid using 'this' in thunks  https://review.openstack.org/60308309:03
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Convert WorkflowExecutionsActions to named exports to avoid using 'this' in thunks  https://review.openstack.org/60308409:03
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Convert CurrentPlanActions to named exports to avoid using 'this' in thunks  https://review.openstack.org/60308509:03
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Convert FlavorsActions to named exports to avoid using 'this' in thunks  https://review.openstack.org/60308609:03
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Convert I18nActions to named exports to avoid using 'this' in thunks  https://review.openstack.org/60308709:03
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Convert LoggerActions to named exports to avoid using 'this' in thunks  https://review.openstack.org/60308809:03
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Convert LoginActions to named exports to avoid using 'this' in thunks  https://review.openstack.org/60308909:03
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Convert NetworksActions to named exports to avoid using 'this' in thunks  https://review.openstack.org/60309009:03
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Convert EnvironmentConfigurationActions to named exports to avoid using 'this' in thunks  https://review.openstack.org/60309109:03
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Convert NotificationActions to named exports to avoid using 'this' in thunks  https://review.openstack.org/60309209:03
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Convert ParametersActions to named exports to avoid using 'this' in thunks  https://review.openstack.org/60309309:03
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Convert RegisterNodesActions to named exports to avoid using 'this' in thunks  https://review.openstack.org/60309409:03
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Convert RolesActions to named exports to avoid using 'this' in thunks  https://review.openstack.org/60309509:03
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Convert StacksActions to named exports to avoid using 'this' in thunk  https://review.openstack.org/60309609:03
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Convert ZaqarActions to named exports to avoid using 'this' in thunk  https://review.openstack.org/60309709:03
*** Petersingh|lunch is now known as Petersingh09:04
*** ykarel|away has quit IRC09:05
*** dtrainor has joined #tripleo09:07
quiquell|rovertherve: The run from the guard review http://logs.openstack.org/02/603802/1/check/tripleo-ci-centos-7-containers-multinode/1ae99fc/09:08
quiquell|rovertherve: it passes09:08
*** kopecmartin|off is now known as kopecmartin09:09
therveOK let's see09:09
*** ooolpbot has joined #tripleo09:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION09:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537409:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)09:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256009:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179441809:10
*** ooolpbot has quit IRC09:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)09:10
openstackLaunchpad bug 1794418 in tripleo "Overcloud deploy error creating overcloudrc" [Critical,Triaged] - Assigned to Quique Llorente (quiquell)09:10
chkumar|rucksshnaidm: bogdando https://review.openstack.org/#/c/605038/ needs +W on this09:10
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Convert PlansActions to named exports  https://review.openstack.org/60308109:10
*** moshele has quit IRC09:10
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Convert NodesActions to named expoers  https://review.openstack.org/60308209:10
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Convert ValidationsActions to named exports  https://review.openstack.org/60308309:10
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Convert WorkflowExecutionsActions to named exports  https://review.openstack.org/60308409:10
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Convert CurrentPlanActions to named exports  https://review.openstack.org/60308509:10
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Convert FlavorsActions to named exports  https://review.openstack.org/60308609:10
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Convert I18nActions to named exports  https://review.openstack.org/60308709:10
*** dtantsur|afk is now known as dtantsur09:10
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Convert LoggerActions to named exports  https://review.openstack.org/60308809:10
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Convert LoginActions to named exports  https://review.openstack.org/60308909:10
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Convert NetworksActions to named exports  https://review.openstack.org/60309009:10
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Convert EnvironmentConfigurationActions to named exports  https://review.openstack.org/60309109:10
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Convert NotificationActions to named exports  https://review.openstack.org/60309209:10
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Convert ParametersActions to named exports  https://review.openstack.org/60309309:10
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Convert RegisterNodesActions to named exports  https://review.openstack.org/60309409:10
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Convert RolesActions to named exports  https://review.openstack.org/60309509:10
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Convert ZaqarActions to named exports  https://review.openstack.org/60309709:10
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-quickstart-extras master: WIP: Add necessary bits for N-1->N standalone upgrade.  https://review.openstack.org/60473609:11
pvchi09:12
pvcanyone know why i cant ssh to my overcloud instance but its state is running?09:13
thervequiquell|rover: "Notifying subscriber" is missing09:13
shardypvc: didn't we discuss this yesterday?09:13
pvci cant install libguetfs :(09:13
quiquell|rovertherve: where ?09:13
pvcso i cant customize the image09:13
thervequiquell|rover: http://logs.openstack.org/91/587491/4/check/tripleo-ci-centos-7-containers-multinode/f618803/logs/undercloud/var/log/containers/zaqar/zaqar.log.txt.gz#_2018-09-25_08_19_59_50709:14
pvcbut it is stack on NetworkDeployment shardy09:14
therveWe get the message, but it's not pushed to websocket09:14
shardypvc: the nova ACTIVE state only means the node booted, it doesn't care about networking etc, so my guess is there is some issue with the network09:14
shardypvc: Ok that probably confirms network issues, you can also set the root password  via cloud-init without customizing the image, sec09:14
pvcyes09:14
shardyout of interest why can't you install libguestfs?09:15
pvcplease :(09:15
pvcwhere09:15
pvcthere is an issue on our firewall device09:15
pvcbut i dont have a privilege to view it09:15
pvcso i cant do anything09:15
shardypvc: Ok one moment I'll find a cloud-init example09:15
*** rcernin has quit IRC09:16
shardyhttps://github.com/openstack/tripleo-heat-templates/blob/master/firstboot/userdata_root_password.yaml09:16
shardyThen you'd include an environment file like:09:16
quiquell|rovertherve: good and bad looks the same to me for req-2a4f6fca-d09e-4174-965c-ac225d0b60d609:17
quiquell|rovertherve: at good one09:17
*** salmankhan has joined #tripleo09:17
pvcwhere will i put hte password sharda09:17
pvcshardy*09:17
shardygive me a moment please09:18
thervequiquell|rover: I wonder if it's not a race condition09:18
thervequiquell|rover: http://logs.openstack.org/91/587491/4/check/tripleo-ci-centos-7-containers-multinode/f618803/logs/undercloud/var/log/containers/zaqar/zaqar-server.log.txt.gz#_2018-09-25_08_20_00_50509:18
therveIt's happening *after* the response has been sent09:18
*** sileht has joined #tripleo09:19
shardypvc: http://paste.openstack.org/show/730925/09:19
dtantsurjaosorior: morning! did you have a chance to submit the undercloud edge forum proposal?09:20
*** waleedm has quit IRC09:20
pvci will just run this shardy openstack overcloud deploy --templates -e root_password_env.yaml09:20
shardypvc: yes, but if it's baremetal don't you have some nic config scripts as well?09:21
shardys/scripts/templates09:21
thervequiquell|rover: http://paste.openstack.org/show/730927/09:21
thervetripleo is too fast09:21
shardythat should give you root access via the ipmi console, anyway09:21
therveI never tought I'd type that09:21
*** waleedm has joined #tripleo09:21
*** waleedm has quit IRC09:22
pvcyes im just running the09:22
pvcopenstack overcloud deploy09:22
quiquell|rovertherve: So we are waitting before subscriber is created at zaqar ?09:22
pvci already created the file root_password shardy09:23
*** waleedm has joined #tripleo09:23
shardypvc: Ok, well we have a default network config, normally that's not what you want for baremetal boxes though so you may want to check the docs09:23
shardyat least if you have access you can then debug it09:23
thervequiquell|rover: No, we send the message before waiting for it09:23
therveAnd there is no mechanism to catch up09:24
chemquiquell|rover: hum, a little question, when your inheriting from a job def, if you set, say "playbooks" variable, is that an override or are they merged with those of the parent definition ?09:24
pvcit is okay now shardy openstack overcloud deploy --templates -e root_password_env.yaml09:24
quiquell|roverchem: override09:24
quiquell|roverchem: Don't know if you can concatenate them tough09:24
shardypvc: yes that's what I put in the paste, but as I mentioned you may well need some additional -e arguments to pass a valid network configuration for your hardware09:25
shardypvc: which nodes are you able to connect to and which are not working?09:25
chemquiquell|rover: thanks, that's what I was thinking, but it would have been cool to have merging :)09:25
pvci can access my compute but controller cant09:25
quiquell|rovertherve: so, we ask the client to create overcloudrc, the client send it to zaqar and zaqar send it to mistral ?09:25
shardyhttps://github.com/openstack/tripleo-heat-templates/blob/master/overcloud-resource-registry-puppet.j2.yaml#L3409:25
pvcwe will deploy again wait09:25
shardyhttps://github.com/openstack/tripleo-heat-templates/blob/master/overcloud-resource-registry-puppet.j2.yaml#L2709:26
thervequiquell|rover: No, we ask the client, the client sends it to mistral, mistral answers to zaqar09:26
shardypvc: Ok, that is most likely because we default to a bridged config that works for VMs for the Controller, but there's a noop config for the other roles09:26
quiquell|rovertherve: and client wait for message at zaqar ?09:26
shardyI expect you'll need to pass a valid configuration that depends on your hardware and network setup09:26
openstackgerritBogdan Dobrelya proposed openstack/tripleo-quickstart master: Better support for the local devbox cases  https://review.openstack.org/59356709:26
pvcWhere can i find a docs about passing a valid configuration?09:27
thervequiquell|rover: Yes09:27
thervequiquell|rover: But if mistral answers too quickly, we don't get it09:27
ssbarneaquiquell|rover: https://review.openstack.org/#/c/605021/ really needed, I was hit by it while running reproducer, lucky bogdan already created a CR to fix it.09:27
quiquell|rovertherve: it answers before we wait fo rit ?09:27
thervequiquell|rover: Yes09:27
*** ssbarnea|bkp has quit IRC09:28
quiquell|rovertherve: so we have to start waitting before sending with another thread or the like09:28
shardyhttps://docs.openstack.org/tripleo-docs/latest/install/advanced_deployment/network_isolation.html#creating-custom-interface-templates09:28
quiquell|rovertherve: or registering callback09:29
thervequiquell|rover: No the change I pasted is good enough09:29
pvcbut shardy internet connection is not an issue right since i already have the image?09:29
therveYou just need to create the websocket client beforehand09:29
shardypvc: ^^ it's in the network isolation section, because in most cases baremetal setups want to use that, but even without you may need to configure the nics, or at least disable that default bridge setup09:29
shardypvc: yes09:29
quiquell|rovertherve: Ahh I see09:29
pvcjust do give you a overview shardy im using a Lenovo Server09:30
quiquell|rovertherve: This can also cause connection not there I suppose and old kind of issues09:30
quiquell|rovertherve: and about the guard return thing, going to raise an exception09:31
quiquell|rovertherve: is that ok ?09:31
quiquell|rovertherve: thanks so much man !!09:31
thervequiquell|rover: I'd just catch it same way as timeout09:31
shardypvc: Ok well the controllers get configured via https://github.com/openstack/tripleo-heat-templates/blob/master/net-config-bridge.j2.yaml09:32
shardypvc: so for baremetal I expect you will at least need to set NeutronPublicInterface https://github.com/openstack/tripleo-heat-templates/blob/master/puppet/role.role.j2.yaml#L61009:33
openstackgerritAthlan-Guyot sofer proposed openstack-infra/tripleo-ci master: WIP: New workflow for standalone upgrade  https://review.openstack.org/60470609:33
shardye.g if that interface isn't the first nic on the box it won't work09:33
openstackgerritThomas Herve proposed openstack/python-tripleoclient master: Start websocket client before workflows  https://review.openstack.org/60537709:33
shardyyou can use the real interface name there, or os-net-config sorts the active (link up) interfaces, which doesn't always automatically mean "nic1" is the right device outside of basic VM setups09:33
thervequiquell|rover: ^^^, I fixed the other ones in that file09:34
shardyit's kind of hard to make that a default that works for any hardware/network setup09:34
thervedidn't check elsewhere09:34
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-quickstart master: Put create repo script into its own tasks file.  https://review.openstack.org/60536909:34
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-quickstart master: Add new featureset 056 for standalone upgarde.  https://review.openstack.org/60536309:34
openstackgerritAthlan-Guyot sofer proposed openstack-infra/tripleo-ci master: WIP: New workflow for standalone upgrade  https://review.openstack.org/60470609:35
quiquell|rovertherve: some of them are ok, will fix all09:35
pvcshardy should i edit this https://github.com/openstack/tripleo-heat-templates/blob/master/puppet/role.role.j2.yaml#L5509:35
pvcto the first nic of my controller baremetal09:36
shardypvc: No, you pass NeutronPublicInterface in the parameter_defaults of a -e file like in my paste09:36
shardyparameter_defaults:09:36
shardy  NeutronPublicInterface: em209:37
shardyor whatever09:37
shardyprobably worth browsing the docs as there are a lot of examples of this sort of thing09:37
*** dsneddon has quit IRC09:37
thervequiquell|rover: I can do it in the same patch, that'll make more sense no?09:37
pvchttp://paste.openstack.org/show/730929/ shardy09:38
quiquell|rovertherve: are you preparing the patch with the solution ?09:38
thervequiquell|rover: https://review.openstack.org/60537709:39
therveThought you saw it09:39
quiquell|rovertherve: nope, thanks !09:40
pvcbut the allocation pool im using in the crtplane subnet is the real ip of the baremetel shardy09:40
quiquell|rovertherve: Checking if we suffer it elswhere09:41
pvcwhere can i put the file shardy?09:41
pvci put here09:42
pvcdirectory /usr/share/openstack-tripleo-heat-templates09:43
quiquell|rovertherve: found another09:43
thervequiquell|rover: Yeah me too09:43
quiquell|rovertherve: plan_management.py09:43
openstackgerritThomas Herve proposed openstack/python-tripleoclient master: Start websocket client before workflows  https://review.openstack.org/60537709:43
thervequiquell|rover: ^^09:43
quiquell|rovertherve: maybe this have to be encapsulated in a function09:43
thervequiquell|rover: I'll leave you the refactoring :)09:44
quiquell|rovertherve: hufff, we the latency of openstack merges it would take ages :-)09:44
quiquell|roverreFUCKtoring09:44
shardypvc: you should put user generated -e files in your home directory, not /usr/share/openstack-tripleo-heat-templates - that's owned by an RPM package09:44
shardyand you need root to write to it...09:44
shardywhere in your home dir is up to you, for testing anywhere is fine, for production a lot of folks have a git controlled directory and a wrapper script09:45
quiquell|rovertherve: about proper error message, I suppose is difficult since we don't have the socket09:45
pvcnoted on this09:45
pvcit is working now09:45
quiquell|rovertherve: Humm why don't we see this in rocky, adding the guard has found this ?09:46
quiquell|rovertherve: the guard is not merged to rocky yet09:46
pvci will let you know shardy openstack overcloud deploy --templates -e root_password_env.yaml -e config.yaml09:46
pvcthis is my config.yaml http://paste.openstack.org/show/730929/  shardy09:46
shardypvc: Ok, is "int1" a device name on your hardware?09:47
openstackgerritBogdan Dobrelya proposed openstack/tripleo-quickstart-extras master: Only ask for a prompt when safe teardown requested  https://review.openstack.org/60537909:48
*** shardy is now known as shardy_mtg09:48
pvcyes09:49
pvcthe device name of my hardware09:49
*** nawar has quit IRC09:50
*** nawar has joined #tripleo09:50
*** shyamb has quit IRC09:51
*** gfidente has joined #tripleo09:53
*** waleedm has quit IRC10:01
*** shyamb has joined #tripleo10:04
*** jfrancoa has quit IRC10:04
pvcshardy it doesnt get an ip address10:04
pvcbut the root password worked10:04
pvci can login to it now10:04
*** jfrancoa has joined #tripleo10:05
pvcbut if we restart the network service it can get an IP address shardy*10:05
quiquell|roverjaosorior: Possible fix for new timeouts https://review.openstack.org/#/c/605377/10:06
quiquell|roverchkumar|ruck: ^10:06
pvcand it stack on NetworkDeployment shardy10:09
*** ooolpbot has joined #tripleo10:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION10:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537410:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)10:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256010:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179441810:10
*** ooolpbot has quit IRC10:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)10:10
openstackLaunchpad bug 1794418 in tripleo "Overcloud deploy error creating overcloudrc" [Critical,In progress] - Assigned to Thomas Herve (therve)10:10
openstackgerritQuique Llorente proposed openstack/python-tripleoclient master: Raise proper exception at webscocket close  https://review.openstack.org/60538710:11
pvcshardy are you there10:11
quiquell|rovertherve: About the guard https://review.openstack.org/#/c/605387/10:11
*** shardy_mtg has quit IRC10:11
quiquell|rovertherve: this is it ?10:11
pvcrdo hi10:12
pvcquiquell|rover hi10:12
openstackgerritQuique Llorente proposed openstack/python-tripleoclient master: Raise proper exception at webscocket close  https://review.openstack.org/60538710:12
chkumar|ruckquiquell|rover: ack check10:14
pvcchkumar|ruck hi10:14
chkumar|ruckquiquell|rover: do we need to keep doc guard patch for rocky?10:15
quiquell|roverchkumar|ruck: You have read my mind, going to refactor it with the exception10:15
chkumar|ruckquiquell|rover: hehe10:15
quiquell|roverchkumar|ruck: but it has +2 there :-)10:15
pvcanyone not busy here?10:15
quiquell|roverchkumar|ruck: Let's do the right thing10:16
*** shyamb has quit IRC10:16
*** fhubik is now known as fhubik|brb10:18
quiquell|roverchkumar|ruck: Humm thinking about it, let's first test the exception at master10:18
chkumar|ruckquiquell|rover: yup, that would be better10:18
quiquell|roverchkumar|ruck: And then if the stuff is not merged rocky/queens let refactor it10:18
chkumar|ruckcool10:18
quiquell|roverchkumar|ruck: if merged, well cherry-pick10:18
*** dciabrin has quit IRC10:21
chkumar|ruckquiquell|rover: ack10:21
quiquell|roverbogdando: We need this to fix timeouts https://review.openstack.org/#/c/605377/10:21
chkumar|ruckpvc: Hello10:21
quiquell|roverchkumar|ruck: we have to merge this https://review.openstack.org/#/c/605377/10:21
pvchi chkumar10:21
pvcmy overcloud deploy it stacking on NetworkDeployment , after checking the instance there is an issue on network service. "no link present check cable"10:22
chkumar|ruckpvc: may be marios bogdando Tengu can help on that ^^10:23
pvchi marios bogdando Tengu10:23
Tengupvc: heya. are all your network interfaces connected to something?10:24
pvcwhat do you mean connected to something Tengu10:24
chkumar|ruckjaosorior: we need to patch https://review.openstack.org/#/c/605377/2 for timed out10:24
pvci override the Network as specificy by shardy here http://paste.openstack.org/show/730929/10:25
Tengupvc: well, if "no link present check cable" is shown, that seems to point to some hardware issue, like a disconnected network interface.10:25
Tengupvc: or maybe faulty cable10:25
pvcbut the network interface is not the same on the network interface of my baremetal10:26
pvcit is okay?10:26
pvcbut if we do an ifup <interface name> it can get an IP address10:26
*** Petersingh_ has joined #tripleo10:27
*** Petersingh_ is now known as Petersingh|afk10:29
openstackgerritHarald Jensås proposed openstack/python-tripleoclient master: Undercloud Validations - Deprecated (replaced/removed) opts  https://review.openstack.org/60492310:29
*** Petersingh has quit IRC10:29
*** dciabrin has joined #tripleo10:33
openstackgerritChandan Kumar proposed openstack/tripleo-quickstart master: Remove dependency on github for cloning  https://review.openstack.org/60538810:34
chkumar|ruckmarios: sshnaidm can we merge this https://review.openstack.org/#/c/575588/ running tempest on standalone10:37
chkumar|rucklater on we can extend it to full tempest api and scenario test10:37
sshnaidmchkumar|ruck, +w10:39
*** phuongnh has quit IRC10:39
chkumar|rucksshnaidm: thanks :-)10:39
openstackgerritSorin Sbarnea proposed openstack/tripleo-quickstart master: Assure copied image is owned by current user  https://review.openstack.org/59642810:41
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-quickstart master: Put create repo script into its own tasks file.  https://review.openstack.org/60536910:42
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-quickstart master: Add new featureset 056 for standalone upgarde.  https://review.openstack.org/60536310:42
*** dmacpher has quit IRC10:43
*** dmacpher has joined #tripleo10:44
pvchi rdo10:45
pvcTengu what do i need to do?10:45
*** Petersingh|afk is now known as Petersingh10:51
openstackgerritSorin Sbarnea proposed openstack/tripleo-quickstart master: Assure copied image is owned by current user  https://review.openstack.org/59642810:52
jaosoriordtantsur: I did10:53
*** gvrangan has joined #tripleo10:53
jaosoriorchkumar|ruck: ack10:53
dtantsurthanks jaosorior10:54
openstackgerritSorin Sbarnea proposed openstack/tripleo-quickstart master: Assure copied image is owned by current user  https://review.openstack.org/59642810:54
*** shyamb has joined #tripleo10:55
pvcdtantsur im deploying now10:56
openstackgerritDerek Higgins proposed openstack/tripleo-heat-templates master: Add scenario 012 - overlcoud baremetal+ansible-ml2  https://review.openstack.org/57960310:59
*** thrash|g0ne is now known as thrash10:59
openstackgerritDerek Higgins proposed openstack-infra/tripleo-ci master: [WIP] Testing ansible-ml2 job  https://review.openstack.org/58229411:01
openstackgerritQuique Llorente proposed openstack/tripleo-quickstart master: Use rdo mirror for q,p,o buildlogs at promotions  https://review.openstack.org/60432411:02
quiquell|roverchkumar|ruck: What else do we have to look up ?11:07
*** aufi has quit IRC11:07
*** ooolpbot has joined #tripleo11:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION11:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537411:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256011:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179441811:10
*** ooolpbot has quit IRC11:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)11:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)11:10
openstackLaunchpad bug 1794418 in tripleo "Overcloud deploy error creating overcloudrc" [Critical,In progress] - Assigned to Thomas Herve (therve)11:10
mschuppertquiquell|rover: that issue got resolved with the promotion! but there is a new one with gem versions http://logs.openstack.org/45/594145/5/check/puppet-openstack-unit-4.8-centos-7/4d03794/job-output.txt.gz#_2018-09-26_08_06_58_110082 -> https://review.openstack.org/#/c/605350/11:11
*** udesale has quit IRC11:12
pvchi dtantsur11:12
*** psachin has quit IRC11:13
pvchi etingof dtantsur. two of my server doesnt get an ip address and its error is Determining IP information for ens4f0... failed; no link present.  Check cable?11:13
dtantsurpvc: could you please not spam us in several channels simultaneously?11:14
openstackgerritGabriele Cerami proposed openstack-infra/tripleo-ci master: [WIP] add a job to test the reproducer  https://review.openstack.org/60423211:15
openstackgerritHonza Pokorny proposed openstack/tempest-tripleo-ui master: Add basic project structure  https://review.openstack.org/57573011:15
pvcim sorrry dtantsur11:15
openstackgerritMartin André proposed openstack/tripleo-common master: Add wrapper for openshift-ansible docker command  https://review.openstack.org/60539911:15
*** pcaruana has quit IRC11:15
openstackgerritMartin André proposed openstack/tripleo-heat-templates master: Introduce OpenShiftGlusterNodeVars heat param  https://review.openstack.org/60472411:16
openstackgerritMartin André proposed openstack/tripleo-heat-templates master: Make glusterfs the default sc when deploying with CNS  https://review.openstack.org/60472511:16
openstackgerritMartin André proposed openstack/tripleo-heat-templates master: Consolidate openshift-ansible global variables  https://review.openstack.org/60472611:16
openstackgerritMartin André proposed openstack/tripleo-heat-templates master: Add heat param for openshift prerequisites playbook  https://review.openstack.org/60433811:16
openstackgerritMartin André proposed openstack/tripleo-heat-templates master: Do not wipe disks on OpenShift gluster nodes  https://review.openstack.org/60512711:16
openstackgerritMartin André proposed openstack/tripleo-heat-templates master: Remove unused networks from OpenShift roles  https://review.openstack.org/60472711:16
openstackgerritMartin André proposed openstack/tripleo-heat-templates master: Deploy openshift all in one in scenario009  https://review.openstack.org/60378011:16
openstackgerritMartin André proposed openstack/tripleo-heat-templates master: Use openshift-ansible container instead of RPMs  https://review.openstack.org/58386811:16
*** mathlin has quit IRC11:16
openstackgerritMichal Pryc proposed openstack/instack-undercloud master: Fix curly brackets for ntp::servers: to prevent HTML escaping  https://review.openstack.org/60540011:17
openstackgerritMartin André proposed openstack/tripleo-common master: Add wrapper for openshift-ansible docker command  https://review.openstack.org/60539911:19
*** jpena is now known as jpena|lunch11:21
*** Petersingh is now known as Petersingh|afk11:23
*** psachin has joined #tripleo11:28
honzajaosorior: mandre: could you have a look at this patch? https://review.openstack.org/#/c/589572/11:30
*** pvc_ has joined #tripleo11:30
*** aufi has joined #tripleo11:30
chkumar|ruckquiquell|rover: currently everything is under controle11:31
jaosoriorhonza: ack11:32
*** Petersingh|afk has quit IRC11:32
*** pvc has quit IRC11:32
honzamwhahaha: hey, about this patch https://review.openstack.org/#/c/599400/ I admittedly don't know much about the way networking is configured in oooq, but i'm happy to work on it given some guidance --- would you mind elaborating on your last comment?11:33
honzajaosorior: thanks!11:34
mandrehonza: shouldn't we make the expire period configurable?11:34
chkumar|ruckquiquell|rover: for queens fs002 and fs020 is problematic with overcloud prepare image ction_spec_name\nInvalidActionException: Failed to find action [action_name=baremetal_introspection.get_status]\n'}11:35
chkumar|ruckhttp://logs.rdoproject.org/openstack-periodic-24hr/git.openstack.org/openstack-infra/tripleo-ci/master/legacy-periodic-tripleo-ci-centos-7-ovb-1ctlr_1comp-featureset020-queens/aebdd5c/logs/undercloud/home/zuul/overcloud_prep_images.log.txt.gz and11:35
chkumar|ruckhttp://logs.rdoproject.org/openstack-periodic-24hr/git.openstack.org/openstack-infra/tripleo-ci/master/legacy-periodic-tripleo-ci-centos-7-ovb-1ctlr_1comp-featureset002-queens-upload/fd3bef2/logs/undercloud/home/zuul/overcloud_prep_images.log.txt.gz11:35
honzamandre: good point!11:35
chkumar|ruckquiquell|rover: I am not sure what to do, as it is seen previously also in periodic jobs11:35
mandrehonza: we can possibly interpret a value as11:36
mandreas 'turn if off'11:36
mandreso we don't need to introduce a new param11:37
chkumar|ruckquiquell|rover: and one with http://logs.rdoproject.org/openstack-periodic-24hr/git.openstack.org/openstack-infra/tripleo-ci/master/legacy-periodic-tripleo-ci-centos-7-multinode-1ctlr-featureset016-pike/a1840be/logs/undercloud/var/log/mistral/executor.log.txt.gz11:37
chkumar|ruckfor pike11:37
honzamandre: so just replace the expires_by_type with a default?11:38
honzamandre: ie if you're configuring the expire period, you might as well configure the mimetypes11:39
honzaor contenttypes or whatever it's called :)11:39
mandrehonza: yeah that makes sense11:39
chkumar|ruckquiquell|rover: for master promotion only fs020 is blocking na?11:40
chkumar|ruckrocky green11:40
chkumar|ruckqueens fs020 and fs00211:40
chkumar|ruckpike fs01611:40
quiquell|rovermschuppert: How is the rootwrap going ?11:42
*** dprince has joined #tripleo11:45
*** lblanchard has joined #tripleo11:47
openstackgerritQuique Llorente proposed openstack/python-tripleoclient stable/rocky: Add a guard to break if no connection  https://review.openstack.org/60380411:47
weshaychkumar|ruck, quiquell|rover if you guys want a break I can attend the program call11:50
weshaywe're green, thanks to you guys11:50
quiquell|roverweshay: thanks11:50
*** raildo has joined #tripleo11:51
*** nawar has quit IRC11:53
*** nawar has joined #tripleo11:54
chkumar|ruckweshay: thanks :-)11:54
*** shyamb has quit IRC11:55
weshayrdo cloud jobs still at 30% success rate :(11:56
openstackgerritHonza Pokorny proposed openstack/puppet-tripleo master: Add apaches expires directive for js and css files  https://review.openstack.org/58957211:58
openstackgerritQuique Llorente proposed openstack/python-tripleoclient master: Raise proper exception at webscocket close  https://review.openstack.org/60538711:58
*** rh-jelabarre has joined #tripleo11:58
*** trown|outtypewww is now known as trown11:59
*** shyamb has joined #tripleo12:00
openstackgerritQuique Llorente proposed openstack/python-tripleoclient stable/rocky: Add a guard to break if no connection  https://review.openstack.org/60380412:01
*** Petersingh|afk has joined #tripleo12:02
openstackgerritSorin Sbarnea proposed openstack/tripleo-quickstart master: Sudo virt-resize for libvirt reproducer  https://review.openstack.org/60502112:05
openstackgerritDaniel Alvarez proposed openstack/tripleo-heat-templates master: Configure http/https on OVN Metadata service to talk to Nova  https://review.openstack.org/60540612:06
*** leanderthal has joined #tripleo12:09
*** ooolpbot has joined #tripleo12:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION12:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537412:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)12:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256012:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179441812:10
*** ooolpbot has quit IRC12:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)12:10
openstackLaunchpad bug 1794418 in tripleo "Overcloud deploy error creating overcloudrc" [Critical,In progress] - Assigned to Thomas Herve (therve)12:10
openstackgerritQuique Llorente proposed openstack-infra/tripleo-ci master: Switch previous release of master from 'queens' to 'rocky'  https://review.openstack.org/59077412:10
*** apetrich has joined #tripleo12:11
weshaychkumar|ruck, arxcruz any idea why we have no tempest.html or stackwiz? http://logs.openstack.org/19/603419/2/gate/tripleo-ci-centos-7-undercloud-containers/12fa54c/logs/undercloud/home/zuul/tempest.log.txt.gz12:12
*** Petersingh|afk is now known as Petersingh12:12
chkumar|ruckweshay: I have opened a bug for the same , it is seen at multiple places where tests failed12:13
openstackgerritwes hayutin proposed openstack/tripleo-common master: Remove non-voting jobs from the gate  https://review.openstack.org/60341912:13
mschuppertquiquell|rover: rootwrap is good. we need https://review.openstack.org/60535012:15
chkumar|ruckweshay: https://bugs.launchpad.net/tripleo/+bug/179366512:15
openstackLaunchpad bug 1793665 in tripleo "Fs016/17 periodic jobs fails to generate tempest results once tempest run finishes" [Critical,Triaged]12:15
chkumar|ruckweshay: I think it requires a lot of fixes at where result generaiton is done12:15
quiquell|rovermschuppert: so you need master promotions no ?12:16
weshaychkumar|ruck, arxcruz it's such a error prone piece of our work12:16
weshaybreaks very often12:16
chkumar|ruckweshay: I need to do some clean there, breaking tempest result generaiton seperate from stackviz12:17
chkumar|ruckweshay: I will be looking tomorrow12:17
*** dtantsur is now known as dtantsur|brb12:18
*** rlandy has joined #tripleo12:18
quiquell|roverchkumar|ruck: master fs020 is different now, https://logs.rdoproject.org/openstack-periodic/git.openstack.org/openstack-infra/tripleo-ci/master/legacy-periodic-tripleo-ci-centos-7-ovb-1ctlr_1comp-featureset020-master/5efcbac/logs/undercloud/home/zuul/undercloud_reinstall.log.txt.gz#_2018-09-26_01_33_5412:18
quiquell|roverchkumar|ruck: it's not the selinux stuff12:18
*** dprince has quit IRC12:18
*** shardy has joined #tripleo12:19
chkumar|ruckquiquell|rover: nope12:19
weshayjistr, do you want to update the depends-on https://review.openstack.org/#/c/590774/12:19
chkumar|ruckquiquell|rover: fs020 failing at same task /var/lib/config-data dir12:19
*** pdeore has joined #tripleo12:19
quiquell|roverchkumar|ruck: Do we have a bug for that ?12:20
*** quiquell|rover is now known as quique|rover|lch12:20
chkumar|ruckquiquell|rover: https://bugs.launchpad.net/tripleo/+bug/1794251 check the description bottom12:20
openstackLaunchpad bug 1794251 in tripleo "[master] undercloud reinstall failed with invalid selinux context: [Errno 95] Operation not supported" [Critical,In progress] - Assigned to Cédric Jeanneret (cjeanner)12:20
quique|rover|lchchkumar|ruck: ack12:20
*** ratailor has quit IRC12:20
chkumar|ruckquique|rover|lch: when I filed the bz two with selinux 2 with different12:20
*** panda|off is now known as panda12:21
quique|rover|lchchkumar|ruck: Looks like there is a solution to disable selinux upstream12:22
mschuppertquiquell|rover: we have stable branches for puppet-openstack_spec_helper12:22
*** agopi has quit IRC12:24
chkumar|ruckquique|rover|lch: I am not sure disabling selinux is the right thing12:24
openstackgerritJiri Stransky proposed openstack-infra/tripleo-ci master: Switch previous release of master from 'queens' to 'rocky'  https://review.openstack.org/59077412:24
jistrweshay: done, thanks12:25
jistrfyi quique|rover|lch ^12:25
jistr(updated https://review.openstack.org/#/c/590774/ so if you push a new patch set, please fetch first)12:25
pvc_hi guys may i ask?12:28
pvc_jistr12:28
*** jpena|lunch is now known as jpena12:28
pvc_shardy hello12:28
*** pvc_ has left #tripleo12:29
openstackgerritBogdan Dobrelya proposed openstack/tripleo-quickstart master: Fix .bashrc path for XDG exports  https://review.openstack.org/59310312:29
*** pvc_ has joined #tripleo12:29
pvc_hi shardy12:29
*** udesale has joined #tripleo12:30
openstackgerritYurii Prokulevych proposed openstack/tripleo-upgrade master: [WIP] Add rocky specific parameters to custom NICs.  https://review.openstack.org/60540712:30
arxcruzweshay: chkumar|ruck looking into it12:30
shardypvc_: hi12:31
pvc_hi shardy. it seems that my controller failing on starting network service Determining IP information for ens4f0... failed; no link present.  Check cable?12:32
pvc_i think this is a hardware issue?12:32
pvc_but if i ifup ens4f0 it have an IP address12:32
*** amoralej is now known as amoralej|lunch12:33
shardypvc_: as I said earlier, the default configuration creates an ovs bridge, which is then brought up - is br-ex configured on your node?12:34
openstackgerritYurii Prokulevych proposed openstack/tripleo-upgrade master: [WIP] Add rocky specific parameters to custom NICs.  https://review.openstack.org/60540712:35
openstackgerritAthlan-Guyot sofer proposed openstack-infra/tripleo-ci master: WIP: New workflow for standalone upgrade  https://review.openstack.org/60470612:35
shardypvc_: if you want to disable that configuration, do something like OS::TripleO::Controller::Net::SoftwareConfig: /usr/share/openstack-tripleo-heat-templates/net-config-noop.yaml in the resource_registry section of an -e environment file12:35
pvc_on overcloud node or undercloud node?12:36
shardyhttps://github.com/openstack/tripleo-heat-templates/blob/master/overcloud-resource-registry-puppet.j2.yaml#L3412:36
shardypvc_: which node is failing?12:36
pvc_Controller node12:36
shardyOk, that is the node to debug (!)12:36
shardypvc_: what OpenStack version is this?12:37
pvc_im using ocata shardy12:37
pvc_im sorry Compute node is failed shardy12:39
pvc_i will reset the bios of the failed node shard y12:40
*** ubijtsa is now known as assassin12:40
openstackgerritBogdan Dobrelya proposed openstack/tripleo-quickstart master: Fix .bashrc path for XDG exports  https://review.openstack.org/59310312:40
*** psachin has quit IRC12:41
EmilienMbandini: when you have time: https://review.rdoproject.org/r/#/c/16280/12:41
EmilienMbandini: and https://review.openstack.org/#/c/600849/12:41
openstackgerritRafael Folco proposed openstack-infra/tripleo-ci master: Remove timeout logic  https://review.openstack.org/58906812:41
shardypvc_: Ok, as I said earlier for most baremetal deploymens you need to actually do some work to configure at least one interface - it might be that net-config-bridge.yaml is enough for a very simple setup, but normally folks require more configuration e.g multiple nics, bonding, isolated vlans etc12:42
*** gvrangan has quit IRC12:42
quique|rover|lchjistr: ack, going to remove the note that weshay mention12:43
pvc_this is not enought right shardy http://paste.openstack.org/show/730929/12:43
*** quique|rover|lch is now known as quiquell|rover12:44
mschuppertcan I please get reviews on https://review.openstack.org/#/c/587066/ https://review.openstack.org/#/c/604272/12:44
openstackgerritHonza Pokorny proposed openstack/puppet-tripleo master: [ui] Add option to configure apache expires  https://review.openstack.org/58957212:46
*** assassin has left #tripleo12:46
openstackgerritQuique Llorente proposed openstack-infra/tripleo-ci master: Switch previous release of master from 'queens' to 'rocky'  https://review.openstack.org/59077412:47
openstackgerritMarios Andreou proposed openstack-infra/tripleo-ci master: WIP: trying to remove QUICKSTART_RELEASE into the job definitions  https://review.openstack.org/60541012:47
quiquell|rovermandre: ^ weee !!!12:47
*** shyamb has quit IRC12:49
*** jcoufal has joined #tripleo12:49
openstackgerritSorin Sbarnea proposed openstack/tripleo-quickstart master: DNM: fc28 mega-testing-patch  https://review.openstack.org/60541112:50
openstackgerritBogdan Dobrelya proposed openstack/tripleo-quickstart master: Fix .bashrc path for XDG exports  https://review.openstack.org/59310312:54
*** janki has joined #tripleo12:55
openstackgerritEmilien Macchi proposed openstack/tripleo-common stable/rocky: config: ignore missing server_id from the stack  https://review.openstack.org/60541212:56
*** nawar has left #tripleo12:57
*** jfrancoa has quit IRC12:57
*** jfrancoa has joined #tripleo12:58
*** skramaja has quit IRC12:59
*** Guest42266 is now known as florianf13:01
*** tzumainn has joined #tripleo13:02
*** gfidente has quit IRC13:02
*** Petersingh has quit IRC13:07
*** sshnaidm is now known as sshnaidm|mtg13:07
openstackgerritBogdan Dobrelya proposed openstack/tripleo-quickstart master: Allow sudoing for non root user via wheel group  https://review.openstack.org/60541513:07
jankiEmilienM, hi13:08
*** Petersingh has joined #tripleo13:09
*** ooolpbot has joined #tripleo13:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION13:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537413:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256013:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179441813:10
*** ooolpbot has quit IRC13:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)13:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)13:10
openstackLaunchpad bug 1794418 in tripleo "Overcloud deploy error creating overcloudrc" [Critical,In progress] - Assigned to Thomas Herve (therve)13:10
weshaychkumar|ruck, ?13:11
*** Petersingh_ has joined #tripleo13:12
*** Petersingh has quit IRC13:12
openstackgerritCédric Jeanneret proposed openstack/tripleo-quickstart-extras master: Add podman support for log collection  https://review.openstack.org/60509013:14
*** ohsnap has joined #tripleo13:14
openstackgerritMichele Baldessari proposed openstack/puppet-pacemaker master: Rely on path for CLI calls when possible  https://review.openstack.org/60489113:16
*** Petersingh_ has quit IRC13:19
openstackgerritArx Cruz proposed openstack/tripleo-quickstart-extras master: WIP - Fix stackviz  https://review.openstack.org/60541913:21
*** Petersingh has joined #tripleo13:21
openstackgerritBrent Eagles proposed openstack/tripleo-heat-templates master: Handle LP openvswitch meta-package on upgrade  https://review.openstack.org/60520013:21
*** Petersingh has quit IRC13:23
*** Petersingh has joined #tripleo13:23
openstackgerritChandan Kumar proposed openstack/tripleo-quickstart-extras master: Add podman support to validate-tempest role  https://review.openstack.org/60535613:25
*** chkumar|ruck is now known as chandankumar13:26
EmilienMjanki: hey, in a meeting now. I'll answer later if you have any question13:26
*** mschuppert has quit IRC13:26
*** mrsoul has quit IRC13:26
*** agopi has joined #tripleo13:27
*** vinaykns has joined #tripleo13:27
Tengubandini: heya! are you here?13:28
Tengubandini: did the keepalived docker image or provisionning change lately? I get a weird error now when I try to deploy a podman-driven undercloud: modprobe: ERROR: could not insert 'ip_vs': Operation not permitted13:29
*** panda has quit IRC13:32
*** dtantsur|brb is now known as dtantsur13:33
*** abishop has quit IRC13:40
bandiniTengu: sort of here ;) seems keepalived is trying to modprobe ip_vs and fails13:41
*** amoralej|lunch is now known as amoralej13:41
Tengubandini: it's new, isn't it?13:41
bandininot sure if it is a new thing (i'd say it is less likely)13:41
Tenguhmm.13:41
bandinimaybe, although I am not sure if we updated keepalived recently13:41
Tengudidn't get that issue while deploying an undercloud until today. last try before my PTO -.-'13:41
Tengubandini: not really sure a *container* should be allowed to modprobe anything anyway.13:42
bandiniTengu: what version is in the container?13:42
bandiniTengu: exactly13:42
Tenguthat should be in the prep_host13:42
bandiniversion == version of keepalived13:42
Tengubandini: wait, getting a clean env for the deploy13:42
Tengushould be up in sec.13:42
*** mschuppert has joined #tripleo13:44
*** slaweq has quit IRC13:45
openstackgerritMerged openstack-infra/tripleo-ci master: Increase post-timeout to 1 hour  https://review.openstack.org/60518513:48
Tengubandini: still waiting - deploy's running, should get the containers shortly.13:49
openstackgerritUdi Kalifon proposed openstack/tempest-tripleo-ui master: Selenium infra  https://review.openstack.org/60542413:49
*** gfidente has joined #tripleo13:50
openstackgerritSteven Hardy proposed openstack/tripleo-heat-templates master: Remove unused bootstrap-config.yaml  https://review.openstack.org/60500913:56
openstackgerritSteven Hardy proposed openstack/tripleo-heat-templates master: Convert *tasks from bootstrap_nodeid to short_bootstrap_node_name  https://review.openstack.org/60543013:56
Tengubandini: hm. in fact keepalived has nothing to do on the undercloud... right?13:58
Tengubandini: docker.io/tripleomaster/centos-binary-keepalived:48a0d56e4ba547ab10a35888138370fb1ec74a97_31a6245613:58
bandiniTengu: keepalived on the undercloud is used for the VIPs when you use TLS (which is the default nowadays)13:59
*** mjturek has joined #tripleo13:59
bandinii.e. it is always used13:59
Tengubandini: duh. ok.13:59
Tengubandini: does the hash answer your question, or should I do some blackmagic in order to get another version number?14:00
*** zb is now known as zaneb14:01
*** mcornea has joined #tripleo14:01
*** mcornea has quit IRC14:02
*** mcornea has joined #tripleo14:02
bandini [root@undercloud-0 pacemaker]# docker run -it --net=host --user=root  docker.io/tripleomaster/centos-binary-keepalived:48a0d56e4ba547ab10a35888138370fb1ec74a97_31a62456 sh -c 'rpm -q keepalived'14:03
bandinikeepalived-1.3.5-6.el7.x86_6414:03
openstackgerritDmitry Tantsur proposed openstack/diskimage-builder master: Add an element to configure iBFT network interfaces  https://review.openstack.org/39178714:03
bandiniTengu: from a quick look we did not upgrade much there, so maybe something else changed14:03
Tengubandini: hmm ok. maybe the bootstrap part? will check.14:04
Tengubandini: https://github.com/openstack/kolla/blob/master/docker/keepalived/extend_start.sh  that's where it's called.14:06
Tengubut still.... I don't think I saw that before.14:07
openstackgerritDmitry Tantsur proposed openstack/diskimage-builder master: Add an element to configure iBFT network interfaces  https://review.openstack.org/39178714:07
Tengubandini: would you be OK to move that modprobe into the tripleo-heat-templates/docker/services/keepalived host_prep_tasks instead?14:08
TenguI can of course produce a patch in that way.14:08
*** abishop has joined #tripleo14:09
*** ooolpbot has joined #tripleo14:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION14:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537414:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)14:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256014:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179441814:10
*** ooolpbot has quit IRC14:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)14:10
openstackLaunchpad bug 1794418 in tripleo "Overcloud deploy error creating overcloudrc" [Critical,In progress] - Assigned to Thomas Herve (therve)14:10
shardyjaosorior: Hey is OS::TripleO::NodeTLSData still used anywhere, seems to be set to OS::Heat::None in both ./environments/enable-tls.yaml and ./overcloud-resource-registry-puppet.j2.yaml ?14:11
shardyjaosorior: the reason for my question is I need to modify puppet/extraconfig/tls/tls-cert-inject.yaml as it references bootstrap_nodeid, but AFAICS it's not used anymore14:11
shardyso perhaps I can just delete it?14:12
*** janki has quit IRC14:12
weshaymwhahaha, fetching the containers in pre is a no-go? https://review.openstack.org/#/c/580037/14:14
mwhahahaweshay: pretty much, the more i look at it the less beneficial it is14:14
jaosoriorshardy: right, it's not used anymore. Tengu refactored the TLS setup to be ansible based now :D14:14
shardyjaosorior: Ok thanks, I'll remove it in my series reworking bootstrap things14:15
weshaymwhahaha, pulling the containers at one time and comparing across providers?14:15
Tengu:)14:15
*** udesale has quit IRC14:15
weshaynot worth it?14:15
mwhahahaweshay: the problem is pulling all the containers takes too long (like 40+ mins)14:16
mwhahahaweshay: so unless you know up front what containers you're going to need, it's wasted time14:16
mwhahahaweshay: pre also shares run's timeout so there is no gain14:16
Tenguquiquell|rover: my patch is entering the gate (for the selinux recurse thingy) :)14:16
Tenguquiquell|rover: cross you fingers!14:17
*** panda has joined #tripleo14:21
dtantsurslagle: hey! I'm not sure how your proposal at https://etherpad.openstack.org/p/tripleo-edge-squad-status is better than just have N underclouds per location with templates stored in git..14:22
openstackgerritDan Macpherson proposed openstack/ansible-role-openstack-operations master: Adding Backup and Restore Operations  https://review.openstack.org/60443914:23
openstackgerritHonza Pokorny proposed openstack/tempest-tripleo-ui master: Add basic project structure  https://review.openstack.org/57573014:23
slagledtantsur: well, better or worse, that's to be decided i guess. but the point is that you don't have an undercloud at each edge site14:26
dtantsurslagle: right, so.. you don't have edge undercloud.14:26
dtantsuryou have some remote software configuration appliances, but not what people are used to: nodes management via ironic, etc14:27
slagledtantsur: you have something that can execute a deployment. a container with everything embedded perhaps to execute ansible or whatever14:27
dtantsurso, it's kind of Federation idea, no?14:27
dtantsura big undercloud talking to smaller ones?14:27
*** moguimar has quit IRC14:27
slagledtantsur: yes in a way, if it includes "just enough ironic" to also do baremetal14:27
*** dxiri has joined #tripleo14:28
slaglewhere the data was federated, as opposed to one big cloud (undercloud)14:28
dtantsurslagle: I see. What I don't like here is the amount of TripleO-specific code to write to make it happen..14:28
dtantsurslagle: now, re RabbitMQ/MariaDB: how is it going to be solved for the overcloud? or is it going to also be federation?14:28
slagledtantsur: i don't see a lot of tripleo specific code14:29
dtantsurslagle: well, the federation itself, no?14:30
shardydtantsur: also there are two layers to this problem - for each controlplane datacentre (region?) you could probably in some cases have a director per deployment, but for the compute/storage "far edge" nodes (think a radio antenna) it's unlikely any deployment hardware would be acceptable due to footprint14:30
slagledtantsur: ironic would have to support that14:30
dtantsurshardy: right, this is what we're trying to figure out14:30
slagledtantsur: i don't see tripleo adding that onto ironic14:30
*** moguimar has joined #tripleo14:30
slagledtantsur: but one HUGE undercloud isn't the answer14:30
dtantsurslagle: welllll.. we may :) there'll be a session on that. but that's probably T+14:30
dtantsurslagle: it's not on HUGE, it's distributed14:31
slagleit is huge when we talk about the number of distributed sites14:31
dtantsurwe're talking distributed vs federated (I assume you also expect federated nova, glance, etc)14:31
EmilienMbandini, bogdando : thx for the review. Will address.14:31
slagledtantsur: for the undercloud? no, i don't14:31
dtantsurslagle: for the overcloud. each service must support federation, no?14:32
slagledtantsur: i'm talking about the deployment tool. how it needs to work so that we can deploy edge architectures14:32
dtantsurright, but I'm trying to see the whole picture14:32
dtantsurwe've spent some time making undercloud NOT a special snowflake, but a case of the overcloud14:32
slagleif one huge (ok, "distributed") cloud won't work for the overcloud, it won't work for undercloud either14:32
dtantsurso I'm trying to understand how the federation idea plays with all overcloud services14:32
shardyeach region is an independent cloud, and federation just enables authentication/authorization via Keystone in each region, with some central IdP14:33
shardyI'm not clear why all services need to "support federation", e.g what is missing?14:33
dtantsurokay, so it's keystone-level federation, not service-level14:33
EmilienMbandini: I replied to https://review.rdoproject.org/r/#/c/16280/7/paunch-container-shutdown14:33
slagleshardy: i don't see how they do14:33
dtantsuri.e. different endpoints for each, say, nova, in each location14:33
slagle*why14:33
EmilienMbandini: do we need to follow up with a patch? I haven't checked how pacemaker containers are configured for restart policy14:33
quiquell|roverTengu: crossed, hand and feet14:34
shardydtantsur: yeah there were a few ideas on that, but it seems like either one central keystone with multiple regions, Federated access to each region, or possibly some new model where keystone can lazily replicate from a central IdP for each region14:34
Tengu:D14:34
*** artom has quit IRC14:34
shardythe last one I'm not super clear on, but IIRC it's not currently possible14:34
dtantsurshardy: okay, so then we don't need ironic to support federation itself14:34
dtantsurthere will be tripleo code that will know which ironic to talk to (and.. mm.. get rid of nova first?)14:35
bandiniEmilienM: no I think we're good there, thanks14:35
dtantsurslagle: this is what I'm talking about re a lot of tripleo code ^^^14:35
slagledtantsur: oh to get rid of nova? or know which ironic to talk to?14:35
shardydtantsur: it'd probably be something like an ansible playbook with a list of underclouds, one per region14:36
dtantsurslagle: both :) but getting rid of nova is planned already (hopefully)14:36
shardythe harder problem is what to do with the "far edge" compute only clusters14:36
EmilienMbandini: thx for the help btw14:36
dtantsurwell, my hard problem is that it's no longer TripleO in any sense of "OpenStack on OpenStack"14:37
dtantsure.g. you cannot just talk to ironic to provision a node14:37
dtantsuryou need to talk to some playbooks that find an ironic to talk to14:37
shardydtantsur: well it's exactly the same for the controlplane14:37
*** Vorrtex has joined #tripleo14:37
shardyit's just a special case for some specific compute cluster deployments14:37
slagledtantsur: what i was proposing on the etherpad, was a self contained zero-footprint installer that could install an Ironic at the edge if needed. in that case, there is only one14:37
dtantsurslagle: one per location, no?14:37
slagleand then use that Ironic to provision local compute/storage14:38
bogdandoslagle, mwhahaha: "but the point is that you don't have an undercloud at each edge site" that part confises me. I know there is a related rfe for AIO productized as 1:1 bundled underclouds to all in one node managed by it. So it is very netural to expect these two can play together... and they do not!14:38
openstackgerritUdi Kalifon proposed openstack/tempest-tripleo-ui master: Selenium infra  https://review.openstack.org/60542414:38
slaglebut honestly, i wouldn't even focus on solving baremetal provisioning at the edge14:38
slaglefeels like boiling the ocean14:38
bogdando(assumed an edge site may be represened by that productized AIO+undercloud bundle)14:38
dtantsurwell, something has install all this baremetals14:38
shardyyeah, I think just use deployed server, at least for the first pass14:38
slagleyes, something does. i don't think we have to provide it right on day 114:39
mwhahahabogdando: yea i thought that was the point of having an all in one deployable via an undercloud (and the undercloud can deploy multiples)14:39
dtantsurmmmok, come back to me when customers start complaining :D14:39
bogdandootherwise, if we go with "non-productized" AIO, which is standalone, we end up with non productized Edge as well :) messy14:39
* dtantsur imagines someone running around with a RHEL CD in all locations14:39
slaglepigeons, perhaps14:40
bogdandoand that "something that can execute a deployment" replacing control plane for edge sites, is not really a life cycle  tool, can't do upgrades, for example14:40
dtantsurpigeons++14:40
shardydtantsur: FWIW I think the edge cases we're initially targetting will be more like embedded appliances - they'll build hundreds of them with a fixed spec and a standard image, so you just need some way to power them up and point a configuration tool at the running nodes14:40
shardye.g consider a cell phone antenna site14:40
mwhahahaRFC 254914:41
bogdandoif acting disconnected from the central UC14:41
shardythere are certainly other use-cases to consider too, but perhaps as a second step?14:41
dtantsurwhat I'm trying to understand is how much more we can do beyond "make undercloud consume less memory, so that it fits into a small computer"14:41
dtantsurbecause for me this ^^^ sounds like our plan14:41
slaglebogdando: i think we can have lifecycle management, but just not with the architecture we have today in tripleo14:41
dtantsurplus some playbooks, dirt and sticks14:41
bogdandoagree with far edge sites not having UC tho14:42
slaglebogdando: everything can't be centrally managed. that's one of the core issues with edge we are trying to solve14:42
shardydtantsur: well that's one goal, but kind of orthoganal to edge clusters which only contain 3 nodes?14:42
slaglebogdando: plus, the UC will never scale to manage thousand(s) node scale14:42
slaglecentrally14:42
Tengubandini: adding the "modprobe" in the services/keepalived.yaml host_prep_tasks solves my issue, I'll provide a patch once my mtg is over.14:42
dtantsurokay, let me ask it in a more provocative way: what's the benefit of using tripleo here instead of e.g. kolla-ansible?14:42
Tengubandini: that was easy for once :).14:43
dtantsurno central API, no bm provisioning14:43
slagledtantsur: we still need those. just perhaps not at the edge14:43
dtantsurslagle: right, and I'm asking about Edge. Why use TripleO there? What is going to make our differentiator?14:43
shardydtantsur: the value is all the control-plane nodes in each DC can still do BM provisioning etc, then you can manage day-2 operations for both controlplane and edge compute sites with one tool14:44
shardye.g TripleO14:44
*** cylopez has quit IRC14:44
bandiniTengu: right but do we actually need that modprobe for what we do in keepalived?14:44
shardymaybe kolla-ansible can do that, I have no idea14:44
slagledtantsur: because we use tripleo for the undercloud, and we want to have it support the edge14:44
Tengubandini: can't say - we don't want to run it from a container, that's all I can say. I'm no keepalived guru :/14:44
dtantsurshardy: without real undercloud on Edge, this is not the same tool. in the real DC you use Ironic, Mistral and co, on Edge you use.. pidgeons and ansible?14:44
slagledtantsur: it's not just "run some ansible playbooks"14:44
dtantsurslagle: I know why we want it, I'm asking why a customer would.14:44
shardydtantsur: it may be helpful to define your vision of "Edge", here I think we're talking about distributed compute mostly14:45
Tengubandini: in a first step we can move it out of the container, so that we're iso-compatible. You OK with that?14:45
bandiniTengu: yeah me neither, I'd be surprised if we need that kernel module only to add a couple of VIPs though (I might be mistaken though)14:45
shardydtantsur: meh - we already support several modes of deployment with don't use Ironic, I don't see how this is any different really14:45
bandiniTengu: worksforme as long as we don't forget it (as it will bite us eventually ;)14:45
Tenguwell, seeing its name, it might be needed, in fact, bandini14:45
slagledtantsur: you seem to be coming from the position that the tool can't evolve.14:45
openstackgerritMehdi Abaakouk (sileht) proposed openstack/puppet-tripleo master: ceilometer: escape % in crontab  https://review.openstack.org/59942114:45
shardyand yeah, it mostly uses ansible because that's what $everybody asked for14:45
slagledtantsur: is TripleO not TripleO because we added support for pre-provisioned nodes?14:45
dtantsurshardy: well, it's opt-out, not the only option14:45
slagledtantsur: i can't argue that point14:45
dtantsurslagle: no, but it will when you remove the provisioning14:46
slagleit's not totally sensical to me14:46
dtantsurwell, IMO14:46
shardydtantsur: sure there's still much utility in Ironic, we're just saying maybe not (yet at least) for tiny edge deployments14:46
slagledtantsur: we're not removing it14:46
dtantsurmy main point is: centralized and uniform control plane is a killer feature of tripleo14:47
dtantsurI think sacrificing it will reduce our utility compared to more lightweight solutions14:48
dtantsurI don't insist you put Ironic where it does not belong (less bug reports for me)14:48
dtantsur(although I'm pretty sure somebody will come with a bug asking for Ironic at Edge UC quite soon)14:48
slagledtantsur: what i'm proposing is a way to still get the benefit from the centralized management, but in a distributed/disconnected/portable fashion14:48
slaglei think we need to think beyond our existing architecture or just multi-node undercloud or multi-undercloud14:49
slagleneither of those scale to thousands of nodes on their own14:49
*** ade_lee has joined #tripleo14:49
openstackgerritGabriele Cerami proposed openstack-infra/tripleo-ci master: [WIP] add a job to test the reproducer  https://review.openstack.org/60423214:49
dtantsurslagle: maybe I don't quite undercloud your proposal? it seems like we're going to have the central undercloud with playbooks (behind Mistral?) only, and whatever is one Edge mostly hidden from operators using that central undercloud.14:50
openstackgerritCédric Jeanneret proposed openstack/tripleo-heat-templates master: Load ip_vs module from the host, NOT from the container  https://review.openstack.org/60544614:50
Tengubandini: -^14:50
slagledtantsur: i'd like us to still use the UC to do the planning, and rendering of the deployment, along with some state tracking14:51
slagledtantsur: not necessarily "live" state14:51
dtantsurslagle: it's still mistral+heat+ansible, right?14:51
slaglebut things like what container image versions were used, what ip's, etc14:51
shardyYeah, I don't think it'll be hidden, there are a few options, e.g multiple compute-only plans which scale out independent of the controlplane14:52
dtantsurbut no direct control plane access for tasks like introspection?14:52
slagledtantsur: but then separate the actual applying of that deployment, so that it doesnt have to be driven from the UC14:52
shardyjust because we don't use Ironic doesn't mean the undercloud can't manage those clusters14:52
*** PhilSliderS has quit IRC14:52
slagledtantsur: we are already quite far along with config-download+export14:52
slagleyou get a container image that can do the deployment14:52
slagleyes it uses ansible, but i consider that implementation detail almost14:52
openstackgerritMartin Schuppert proposed openstack/tripleo-heat-templates master: Add nova file_backed_memory and memory_backing_dir support for qemu.conf  https://review.openstack.org/60436014:53
dtantsurokay, I think I see what the plans are, thanks all. I guess I cannot help much in this, since the ironic part stays unchanged (where it stays).14:53
slagledtantsur: and then management in git of the plan, state, rendered deployment, etc. instead of a centralized swift14:53
dtantsursorry to nitpick, but a git server is also centralized (unlike the protocol itself)14:54
*** sileht has quit IRC14:54
dtantsurbut this probably does not matter much14:54
slaglesure, i mean ultimately you need a truth of record. centralized14:55
*** sileht has joined #tripleo14:56
bogdandoAlso I think penguins may fit the case better than pigeons14:57
slaglemy thinking with ideas around git is that i think it's quite nice with how we've ended up using it with config-download14:57
bogdandothey can swim far distance across islands, for example, and still carry on more CDs with RHEL than over the air :)14:57
slaglebut it is still kind of hidden behind swift14:57
slagleand i'd like the plan to be in git as well, so we have some clear history there. which would be super helpful14:57
slagleand then when you consider the edge, where we may be doing deployments that are completely disconnected, we can't exactly be reporting status back over a message bus14:58
slagleso saving some state locally seems useful14:58
dtantsurbogdando++14:58
slaglewith the ability to "sync back" (git push) back to the centralized management14:58
* dtantsur thinks that picking AMQP for an RPC implementation was a weird idea to begin with..14:59
dtantsurI like the idea of git for history management though15:00
dtantsurdo we need a Git as a Service in OpenStack? :)15:00
*** pcaruana has joined #tripleo15:01
slaglei hope not :)15:01
*** Petersingh has quit IRC15:02
*** Petersingh has joined #tripleo15:02
thervethrash: toure: I think I found the issue with the messaging timeout15:05
therveWell root cause at least, don't know why it's happening yet15:05
thrashtherve: do tell15:06
thervethrash: It looks like SIGHUP messes up with the worker, and the RPC client ends up broken15:06
openstackgerritCédric Jeanneret proposed openstack/tripleo-heat-templates master: Load isci_tcp module from the host.  https://review.openstack.org/60545015:06
*** moguimar has quit IRC15:06
thervethrash: We end up here: https://github.com/openstack/oslo.messaging/blob/master/oslo_messaging/_drivers/impl_rabbit.py#L65615:07
therveBingo managed to reproduce15:07
thrashtherve: so, instead of waiting 48 hours... SIGHUP the API?15:08
thervethrash: Yep. I did it 5-6, and now it's in a broken state15:08
thrashtherve: Nice work15:08
thrashtherve: sighup in the host? Or from within the container?15:09
*** moguimar has joined #tripleo15:09
thervethrash: I think both works, the process id is exposed in the host15:09
thrashtherve: ack15:09
thervethrash: I used "sudo docker exec mistral_api  kill -SIGHUP" though15:09
openstackgerritCédric Jeanneret proposed openstack/tripleo-heat-templates master: Load dm-multipath module from the host.  https://review.openstack.org/60545215:10
thervethrash: I used "sudo docker exec mistral_api  kill -SIGHUP 1" though15:10
thrashtherve: gotcha15:10
*** ooolpbot has joined #tripleo15:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION15:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537415:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)15:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256015:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179441815:10
*** ooolpbot has quit IRC15:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)15:10
openstackLaunchpad bug 1794418 in tripleo "Overcloud deploy error creating overcloudrc" [Critical,In progress] - Assigned to Thomas Herve (therve)15:10
thrashtherve: That was interesting timing... Look at the bug that just scrolled by15:10
Tengubandini: found a couple more of modprobe within containers...15:11
* Tengu tracks them down15:11
thrashtherve: don't know if related, but interesting :)15:11
thrashtherve: but yeah... I just ran it once.. and bang.15:11
therve\o/15:12
thrashrestart the container and it works again.15:12
thervethrash: So it's related to https://github.com/openstack/mistral/blob/master/mistral/api/service.py#L5115:13
openstackgerritSorin Sbarnea proposed openstack/tripleo-quickstart master: use the debug callback (humananly_readable)  https://review.openstack.org/60480215:13
thrashtherve: but didn't we determine that the container is not running via wsgi?15:14
thervethrash: Why is that important?15:14
bandiniTengu: ack, nice. maybe open a bug and add a Related-Bug: XYZ for each review/fix?15:14
* toure reading backlog15:14
Tengubandini: yeah, might be an idea ^^'15:14
thrashtherve: because that code is the WSGIService...15:14
thervethrash: We don't run it via wsgi, but it runs the WSGIService anyway15:15
thrashtherve: I'm a dolt. :P15:15
therve:D15:15
Tengubandini: https://bugs.launchpad.net/tripleo/+bug/1794550 - will update the commits.15:17
openstackLaunchpad bug 1794550 in tripleo "Some kernel modules are loaded from containers" [Medium,Triaged] - Assigned to Cédric Jeanneret (cjeanner)15:17
bandiniTengu: awesome, thanks15:17
*** janki has joined #tripleo15:17
openstackgerritCédric Jeanneret proposed openstack/tripleo-heat-templates master: Load isci_tcp module from the host.  https://review.openstack.org/60545015:19
*** iranzo has quit IRC15:19
openstackgerritCédric Jeanneret proposed openstack/tripleo-heat-templates master: Load ip_vs module from the host  https://review.openstack.org/60544615:20
openstackgerritCédric Jeanneret proposed openstack/tripleo-heat-templates master: Load dm-multipath module from the host.  https://review.openstack.org/60545215:20
Tengudone.15:20
Tengujust the openvswitch, I don't know where it should be loaded.15:20
*** moguimar has quit IRC15:20
*** yprokule has quit IRC15:21
TenguEmilienM: -^^^ a few reviews :)15:21
jaosorioro15:22
Tenguhttps://review.openstack.org/#/q/status:open+topic:bug/1794550  shorter.15:22
Tengujaosorior: guess you'll be happy to see some modprobe dropped from the containers :)15:23
EmilienMTengu: will look after my meetings15:23
TenguEmilienM: np. will sign-off for today :).15:23
*** artom has joined #tripleo15:23
EmilienMack15:23
jaosoriorTengu: nice!!15:23
thervethrash: Surprisingly hard to reproduce a second time15:24
toureyeah I issued the SIGHUP15:24
tourenow the api doesn't reconnect :)15:24
Tengujaosorior: ;) thought so. anyway. see you tomorrow folks15:24
jaosoriorTengu: have a good one!15:25
Tengusame for you ;)15:25
toureoh crap I forgot I cherry picked a change for the eventlets15:25
thrashtoure: I don't think that's even in play. Red herring.15:26
thrashtherve: You're right. Hard to reproduce...15:26
*** chandankumar is now known as chkumar|off15:26
toureok15:27
*** quiquell|rover is now known as quique|rover|off15:27
*** quique|rover|off is now known as quique|off15:28
*** kopecmartin is now known as kopecmartin|ruck15:28
*** sshnaidm|mtg is now known as sshnaidm15:29
*** marios|rover has joined #tripleo15:29
*** dsneddon has joined #tripleo15:31
thervethrash: Something like that http://paste.openstack.org/show/730950/ maybe, but it'd be nice to reproduce it more15:31
*** leanderthal has quit IRC15:31
thrashtherve: I'm wondering if there is a race condition with the SIGHUP and the healthcheck that's happening every two seconds...15:32
tourethrash I was going to mention, maybe the original theory of haproxy swamping us15:32
thrashtherve: the question... Where is the SIGHUP coming from anyway?15:32
bogdandothrash: it comes from logrotate post script I think15:33
thrashbogdando: Ahhh...15:33
*** jtomasek has quit IRC15:34
*** PhilSliderS has joined #tripleo15:34
*** sanjayu_ has quit IRC15:35
thrashtherve: bogdando toure the picture is becoming quite a bit clearer...15:36
jankiEmilienM, hey. I have commented on the patch. am logging off now. thanks :)15:36
*** janki has quit IRC15:36
EmilienMjanki: ack, will look asap15:36
tourethrash so you think it is a race condition with logrotate and haproxy polling15:37
thrashtoure: not sure about the haproxy part...15:37
thrashtoure: but I'm thinking that if a request comes in at just the wrong time you hit that "forked after connection established"15:38
thrashand when you hit that... Hosed.15:38
thrashbasically... request -> SIGHUP -> reply -> BORKED15:38
thrashbut that's completely speculation15:39
toureyup that makes sense so if we slow the haproxy polling this should reduce the pressue of the race, which I know is a bandaid more than a fix15:39
thrashtoure: better to keep the error from happening, which I think therve's idea should alleviate.15:40
touretrue15:40
thrashI think it's reasonable to clean up the rcp_clients on reset.15:41
*** hjensas has quit IRC15:42
toure+115:42
* toure will test the theory 15:42
*** jtomasek has joined #tripleo15:45
thrashtoure: problem is... I can't reproduce it again.15:45
thrashtherve: I'll keep trying to trigger it... Not having any luck as of yet.15:45
toureI have twice on my systems15:45
*** numans has joined #tripleo15:46
thrashtoure: gonna let mine sit for a bit.15:46
*** thrash is now known as thrash|biab15:47
toureack15:47
openstackgerritMerged openstack/ansible-role-redhat-subscription master: Add support for RHSM Pools  https://review.openstack.org/60529015:51
*** panda is now known as panda|bbl15:52
ohsnapso in the past i deployed a test/dev env using packstack. today im redoing it using tripleo-quickstart, this has been sitting here a little over an hour: TASK [undercloud-deploy : Install the undercloud]15:52
*** holser_ has quit IRC15:54
*** jfrancoa has quit IRC15:54
*** jfrancoa has joined #tripleo15:56
*** Petersingh is now known as Petersingh|away15:57
rh-jelabarreI'm trying to map a router to a network (openstack router set --external-gateway provider_network external) and I keep getting a "BadRequestException: Unknown error".  Any suggestions of what to look into?  Or at least figure out what the error means?15:58
*** dxiri has quit IRC15:58
*** ohsnap has quit IRC15:59
*** Petersingh|away has quit IRC16:00
openstackgerritBogdan Dobrelya proposed openstack/tripleo-quickstart master: Inject undercloud user into the cloud image  https://review.openstack.org/60306916:01
openstackgerritDavid Peacock proposed openstack/tripleo-heat-templates master: docker-puppet.py: used dedicated hiera entry, not uuid  https://review.openstack.org/55918216:02
*** rdopiera has quit IRC16:04
EmilienMbnemec: thx for the email, long life to OVB16:07
bnemecEmilienM: np. I'm hoping at some point we can do some testing with tripleo-ci on the new branch to make sure it doesn't break anything.16:08
*** dtantsur is now known as dtantsur|afk16:08
bnemecAnd if it does to get it fixed before making the switch.16:09
EmilienMyes +216:09
openstackgerritDavid Peacock proposed openstack/tripleo-heat-templates master: docker-puppet.py: used dedicated hiera entry, not uuid  https://review.openstack.org/55918216:09
*** ramishra has quit IRC16:09
*** noama has quit IRC16:10
openstackgerritSorin Sbarnea proposed openstack/tripleo-quickstart-extras master: Improve output of Verify Sphinx build task  https://review.openstack.org/60040316:10
*** ooolpbot has joined #tripleo16:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION16:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537416:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256016:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179441816:10
*** ooolpbot has quit IRC16:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)16:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)16:10
openstackLaunchpad bug 1794418 in tripleo "Overcloud deploy error creating overcloudrc" [Critical,In progress] - Assigned to Thomas Herve (therve)16:10
*** ksambor has quit IRC16:15
*** fhubik|brb has quit IRC16:15
*** florianf is now known as florianf|afk16:17
*** kopecmartin|ruck is now known as kopecmartin|off16:20
*** aufi has quit IRC16:21
*** jpich has quit IRC16:22
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates master: Introduce OS::TripleO::Services::Podman  https://review.openstack.org/60423516:24
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates master: undercloud: deploy podman  https://review.openstack.org/60522116:24
EmilienMchkumar|off: nice work on tempest, thanks16:25
EmilienMchkumar|off: I commented though16:25
openstackgerritEmilien Macchi proposed openstack/tripleo-quickstart master: Switch fs027 to deploy with podman  https://review.openstack.org/60051716:25
*** gkadam has quit IRC16:25
openstackgerritDavid Peacock proposed openstack/puppet-tripleo master: adding deployment_type fact in support  https://review.openstack.org/60547816:27
openstackgerritDavid Peacock proposed openstack/tripleo-heat-templates master: docker-puppet.py: used dedicated hiera entry, not uuid  https://review.openstack.org/55918216:28
*** panda|bbl is now known as panda16:35
*** dxiri has joined #tripleo16:35
*** trown is now known as trown|lunch16:40
openstackgerritBogdan Dobrelya proposed openstack/tripleo-quickstart master: Document the KVM accelerated mode for building VMs  https://review.openstack.org/60548316:40
*** sanjayu_ has joined #tripleo16:48
*** akrivoka has quit IRC16:49
TenguEmilienM: just had a discussion, my patches for the modprobe are NOT enough, and should NOT be merged - can you w-1 them ? I'm not on my work laptop, I have no access - for the records: https://review.openstack.org/#/q/status:open+topic:bug/1794550  thank you :)16:52
*** gfidente has quit IRC16:55
*** artom has quit IRC16:58
*** jfrancoa has quit IRC16:58
*** jfrancoa has joined #tripleo16:58
*** derekh has quit IRC17:01
*** jfrancoa has quit IRC17:02
*** thrash|biab is now known as thrash17:03
thrashtoure: ok... repro after sitting for a bit.17:03
thrashtoure: so I'm going to apply therve's idea and let it sit again.17:04
*** artom has joined #tripleo17:04
EmilienMTengu: ok17:05
TenguEmilienM: apparently I'll need to hit kolla-ansible - and persist the modprobe in some way across reboot. I was sure it couldn't be that easy :)17:06
*** salmankhan has quit IRC17:06
EmilienMTengu: done17:06
TenguEmilienM: great, thanks!17:07
EmilienMTengu: I WIPed the THT patches.17:07
Tengujust want to ensure nobody push anything for now.17:07
EmilienMTengu: ci is in bad shape no worries :D17:08
Tengu"good news then"17:08
Tengu#orNot ;)17:08
*** jpena is now known as jpena|off17:09
*** AJaeger has joined #tripleo17:10
*** ooolpbot has joined #tripleo17:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION17:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537417:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256017:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)17:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179441817:10
*** ooolpbot has quit IRC17:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)17:10
openstackLaunchpad bug 1794418 in tripleo "Overcloud deploy error creating overcloudrc" [Critical,In progress] - Assigned to Thomas Herve (therve)17:10
AJaegerbogdando: are you around to discuss https://review.openstack.org/#/c/588587/4/zuul.d/layout.yaml17:10
AJaegerssbarnea: that's your change that I augmented so that it can merge ^17:10
openstackgerritBrent Eagles proposed openstack/tripleo-heat-templates master: WIP: configure the undercloud host  https://review.openstack.org/60548917:11
bogdandoAJaeger: hi. Yes please. Tho I think the comment you've provided answers it fully :)17:11
AJaegerbogdando: ok - wanted to be around to discuss further if needed...17:11
bogdandothanks for the exmplanation!17:11
AJaegeryou're welcome, bogdando17:12
AJaegerthanks for reviewing17:12
openstackgerritSteven Hardy proposed openstack/tripleo-heat-templates master: Convert *tasks from bootstrap_nodeid to short_bootstrap_node_name  https://review.openstack.org/60543017:15
openstackgerritSteven Hardy proposed openstack/tripleo-heat-templates master: Remove unused tls-cert-inject.yaml template  https://review.openstack.org/60549117:15
openstackgerritSteven Hardy proposed openstack/tripleo-heat-templates master: Add SERVICE_bootstrap_node_ip values to allNodesConfig  https://review.openstack.org/60549217:15
*** shardy has quit IRC17:20
*** dciabrin has quit IRC17:20
*** bogdando has quit IRC17:25
slaglethrash: do you recall the reasoning for using the same queue name of "tripleo" for everything? can't we accidentally show messages from other workflows when polling the queue in tripleoclient?17:29
tourethrash sounds good17:29
thrashslagle: iirc, it was the UI that drove that.17:31
thrashslagle: but no, because the workflow id is checked.17:32
thrashslagle: *execution id17:32
thrashslagle: at least on the CLI side. The CLI is somewhat synchronous, so it knows what it has executed, and looks for messages specifically from that execution.17:33
slaglethrash: i don't see that in the code. at least not the way i'm reading it :)17:33
*** hjensas has joined #tripleo17:34
slaglei see a check on the execution id, but that's only so that we don't bail on getting messages due to a sub-workflow going complete17:34
slagleand we've already yielded the payload to the caller before that check17:35
thrashslagle: hmmm17:36
slaglehttps://github.com/openstack/python-tripleoclient/blob/master/tripleoclient/workflows/base.py#L6117:36
slaglethat's where I'm looking17:36
thrashslagle: yep... was just looking at that... And I think you're right.17:36
thrashslagle: I knew there was a check... But...17:37
slaglewith tripleoclient, we've never really had a reason that someone might run 2 workflows at the same time, so i wouldn't be surprised if this had gone unnoticed17:38
*** AJaeger has left #tripleo17:39
slaglebut with the status and failures commands I've added that use workflows, we got a bug report of some deployment output polluting the output of the failures command17:39
thrashslagle: exactly17:39
slagleb/c a deployment was ongoing at the time17:39
thrashslagle: should be a simple enough fix. Check for the execution id or the root execution id.17:40
thrashbefore yield.17:40
slagleyea, i think so. just wanted to double check my reasoning around the situation :)17:40
thrashslagle: no worries... Looks like you got it right. :)17:41
*** trown|lunch is now known as trown17:44
openstackgerritwes hayutin proposed openstack/tripleo-quickstart-extras master: libvirt standalone deployment  https://review.openstack.org/59107717:53
openstackgerritwes hayutin proposed openstack/tripleo-quickstart-extras master: WIP: udpate reproducer to install required deps  https://review.openstack.org/60083617:54
openstackgerritwes hayutin proposed openstack/tripleo-quickstart-extras master: f28 support for quickstart  https://review.openstack.org/59165217:54
openstackgerritwes hayutin proposed openstack/tripleo-quickstart-extras master: libvirt standalone deployment  https://review.openstack.org/59107717:54
openstackgerritwes hayutin proposed openstack/tripleo-quickstart-extras master: f28 support for quickstart  https://review.openstack.org/59165217:54
openstackgerritwes hayutin proposed openstack/tripleo-quickstart-extras master: WIP: enable fedora-28 for the reproducer  https://review.openstack.org/60249217:54
openstackgerritMerged openstack/python-tripleoclient master: Start websocket client before workflows  https://review.openstack.org/60537717:57
*** jcoufal has quit IRC18:03
*** ooolpbot has joined #tripleo18:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION18:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537418:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)18:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256018:10
*** ooolpbot has quit IRC18:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)18:10
*** slaweq has joined #tripleo18:12
openstackgerritAlex Schultz proposed openstack/python-tripleoclient stable/rocky: Start websocket client before workflows  https://review.openstack.org/60549918:17
*** amoralej is now known as amoralej|off18:20
itlinuxhello all can someone give me a pointer on this issue http://paste.openstack.org/show/730956/18:21
itlinuxtrying to do an update (minor) on pike18:21
openstackgerritRafael Folco proposed openstack-infra/tripleo-ci master: Remove toci_jobtype definition from v3 jobs  https://review.openstack.org/59386318:22
*** slaweq has quit IRC18:22
*** pcaruana has quit IRC18:24
openstackgerritayoung proposed openstack/tripleo-specs master: Global Galera Database  https://review.openstack.org/60055518:29
*** zaneb has quit IRC18:37
*** zaneb has joined #tripleo18:37
weshaymwhahaha, I think https://review.openstack.org/#/c/603419/ is timing out18:45
weshayhttp://zuul.openstack.org/stream.html?uuid=715bb91a744c45f6afec643ce29ddbdb&logfile=console.log\18:45
openstackgerritRafael Folco proposed openstack/tripleo-quickstart master: Run scenario001-multinode-oooq-container job for config/* changes  https://review.openstack.org/60242518:48
mwhahahaweshay: of course it is18:49
weshaymwhahaha, it's your fault18:53
mwhahahanot my fault someone switching things to non-voting and didn't remove them from the gate :D18:53
*** jcoufal has joined #tripleo18:56
*** abishop_ has joined #tripleo18:58
*** abishop has quit IRC19:00
openstackgerritJames Slagle proposed openstack/python-tripleoclient master: Use sync action get_deployment_failures  https://review.openstack.org/60505819:05
openstackgerritDan Sneddon proposed openstack/tripleo-heat-templates master: Ping default gateways before controllers  https://review.openstack.org/60422919:07
*** ooolpbot has joined #tripleo19:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION19:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537419:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256019:10
*** ooolpbot has quit IRC19:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)19:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)19:10
*** asbishop has joined #tripleo19:12
*** asbishop is now known as abishop19:13
*** abishop_ has quit IRC19:15
*** pdeore has quit IRC19:15
*** salmankhan has joined #tripleo19:17
touretherve looks like your suggestion works19:18
tourethrash ^^19:18
toureback in a bit19:19
*** toure is now known as toure|biab19:19
*** jcoufal has quit IRC19:21
*** salmankhan has quit IRC19:22
openstackgerritEmilien Macchi proposed openstack/paunch master: podman: create/delete systemd unit files when restart policy is used  https://review.openstack.org/60084919:22
EmilienMbandini: ^ ready for review again, comments addressed19:23
openstackgerritEmilien Macchi proposed openstack/paunch master: Stop hardcoding 'docker' and make it more generic  https://review.openstack.org/60129019:23
*** slaweq has joined #tripleo19:25
mwhahahaweshay: how many cpus do the CI instances have? 4?19:28
mwhahahanm is 819:29
openstackgerritwes hayutin proposed openstack/tripleo-quickstart-extras master: WIP: udpate reproducer to install required deps  https://review.openstack.org/60083619:33
openstackgerritwes hayutin proposed openstack/tripleo-quickstart-extras master: WIP: enable fedora-28 for the reproducer  https://review.openstack.org/60249219:33
openstackgerritAlex Schultz proposed openstack/ansible-role-container-registry master: Allow docker image download/upload concurrency  https://review.openstack.org/60551119:42
openstackgerritChandan Kumar proposed openstack/tripleo-quickstart-extras master: Add podman support to validate-tempest role  https://review.openstack.org/60535619:46
*** sanjayu_ has quit IRC19:46
openstackgerritAlex Schultz proposed openstack/tripleo-heat-templates master: Increase docker pull/push concurrency  https://review.openstack.org/60551419:50
thrashtoure|biab: therve I had the opposite.19:50
thrashtoure|biab: therve It didn't seem to work for me.19:51
openstackgerritAlex Schultz proposed openstack/tripleo-common master: Increase upload concurrency  https://review.openstack.org/60551519:52
openstackgerritSam Doran proposed openstack/tripleo-ansible master: Use generic names for container platform  https://review.openstack.org/60084719:54
openstackgerritAlex Schultz proposed openstack/tripleo-common master: Increase upload concurrency  https://review.openstack.org/60551519:54
weshaymwhahaha, pedal to the metal19:58
mwhahahanot sure if it'll help, but worth a try19:59
*** dsneddon has quit IRC20:02
*** dsneddon has joined #tripleo20:02
*** agopi has quit IRC20:06
*** bnemec has quit IRC20:10
*** ooolpbot has joined #tripleo20:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION20:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537420:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256020:10
*** ooolpbot has quit IRC20:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)20:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)20:10
openstackgerritJames Slagle proposed openstack/python-tripleoclient master: Filter messages not from waiting execution  https://review.openstack.org/60552020:10
weshaymwhahaha, ok.. finally deploying standalone on f28 w/ centos containers20:10
weshayw/ the reproducer scripts20:10
mwhahahaalways a bonus20:10
thrashtoure|biab: therve stop() gets called also. Not a bad idea to just clear them there as well... Should get around race conditions20:13
EmilienMare we getting CI back one day or?20:13
EmilienMwhere are we?20:13
openstackgerritAlex Schultz proposed openstack/tripleo-heat-templates master: Update standalone role  https://review.openstack.org/60515620:14
*** toure|biab is now known as toure20:14
tourethrash hrmm...20:15
*** bnemec has joined #tripleo20:15
thrashtoure: The repro is not 100%20:15
tourethrash therve one way to reproduce the issue is to move the logrotate from daily to hourly20:15
thrashtoure: which leads me to believe it is definitely a race condition20:15
toureack20:16
thrashtoure: sure. But still *time*20:16
tourethrash true20:16
thrashBut again, it really depends on whether the race happens... I've concated the SIGHUP and the action call into a single cli20:16
thrashtoure: and that *sometimes* worked20:17
tourebut we should run into the race much quicker if it is kicking every hour20:17
toureah20:17
thrashtoure: I'm testing out adding the client reset to the stop function as well20:17
thrashtoure: do it every minute. :)20:18
tourekk20:18
tourehehe20:18
thrashtoure: or just cron the SIGHUP. :)20:18
thrashtoure: that's probably the better way.20:18
touretrue20:18
thrashheck, that could be every 5 seconds.20:18
* toure creates a crontab for SIGHUP20:18
toure:)20:18
thrashor probably 2020:18
thrashin fact, I think I'm gonna do that. Cron the SIGHUP, and then watch the action call20:19
toure* * * * * sudo docker exec mistral_api kill -SIGHUP 1 >/dev/null 2>&120:20
toure:)20:21
openstackgerritEmilien Macchi proposed openstack/paunch master: podman: create/delete systemd unit files when restart policy is used  https://review.openstack.org/60084920:22
openstackgerritEmilien Macchi proposed openstack/paunch master: Stop hardcoding 'docker' and make it more generic  https://review.openstack.org/60129020:22
openstackgerritAlex Schultz proposed openstack/tripleo-quickstart-extras master: Update standalone environment file  https://review.openstack.org/60552320:23
tourethrash got it20:23
tourethe cleanup() function works20:25
thrashtoure: from?20:25
tourehttp://paste.openstack.org/show/730963/20:26
tourethrash ^^20:26
thrashtoure: sure... but we need a much larger sample size to be certain.20:26
toureyup and I will leave it running all nigth :p20:27
thrashtoure: biab20:27
*** thrash is now known as thrash|biab20:27
toureeither ack20:27
tourethrash|biab when you get back here is the setup20:32
tourehttp://paste.openstack.org/show/nbn7LVPRox2obnisF6z7/20:32
stevebakermorning20:33
EmilienMstevebaker: salut20:34
EmilienMmwhahaha: what jobs are timeouting the most?20:34
mwhahahai don't know at the moment20:34
EmilienMok let's look grafana20:34
EmilienMweshay: can you have an answer to that ?20:34
mwhahahacistatus.tripleo.org seems to think everything is grand (i think it's lying)20:35
stevebakermwhahaha: I've got some more context for talking about concurrent layer copying for docker/skopeo20:36
*** raildo has quit IRC20:36
* weshay reads20:36
mwhahahastevebaker: yea? the max-concurrent-download stuff in docker seems to have little effect (i tested)20:36
mwhahahastevebaker: i did throw a patch up to improve the tripleo-common number of upload workers20:36
weshayEmilienM, what's the question20:37
weshay720:37
weshayis the answer20:37
mwhahahacause that in my testing seemed to have a better impact on total time20:37
EmilienMweshay: question is what jobs timeout the most20:37
EmilienMwhat's the new URL for cockpit?20:37
stevebakermwhahaha: it does allow more layers to be downloaded concurrently, but yeah I didn't see a huge time difference in a test of pulling two images with mostly shared layers20:37
EmilienM38.145.34.131:3000 is down for me20:37
weshayhttp://dashboard-ci.tripleo.org/d/cEEjGFFmz/cockpit?orgId=1&panelId=61&fullscreen20:37
EmilienMoh ok20:38
weshayscen001 and containers-multinode20:38
mwhahahastevebaker: https://review.openstack.org/#/q/topic:concurrent-settings+(status:open+OR+status:merged) is what i proposed a bit ago20:38
EmilienMtripleo-ci-centos-7-undercloud-containers - 2h2020:38
mwhahahastevebaker: i think the tripleo-common one might have some improvement, but the other i would agree doesn't seem to help when the layers are so shared (like ours)20:38
EmilienMwhile it takes 25 min to deploy an undercloud w/o oooq in my env20:39
EmilienMit makes me cry20:39
stevebakermwhahaha: as for skopeo, the news is bad. No concurrent copies, and most of the combinations of src->dest can't detect that the destination already has the layer, so the copy happens again. Until that is fixed, buildah in CI will be sloooow20:39
mwhahahastevebaker: oh noes20:39
EmilienMomg20:40
weshaydear lord don't cry :)20:40
weshaythere there20:40
EmilienMcan't we build our own images with pre-fetched containers in a local registry?20:41
weshay+120:41
EmilienMand our CI jobs would just update the layers?20:41
weshayI like that plan20:41
EmilienMis infra against us having our own centos7 image?20:41
weshayDIB that sucks in the containers20:41
EmilienMwe could build our own image on top of current centos720:42
EmilienMand install things that take time20:42
weshayEmilienM, that would get us our daily updates w/ dlrn-current too20:42
stevebakerEmilienM: what do you mean a local registry? per CI regsion?20:42
EmilienMlike openstack-selinux (almost 5 min to install)20:42
weshaythat's normal20:42
EmilienMstevebaker: no I mean deploying a docker registry with our containers20:42
weshayEmilienM, but installed and populated at image creation right?20:42
EmilienMso when the VM starts, we alraedy have a registry and our CI job would not pull everything from scratch20:42
weshayvs.. pre.yaml20:42
weshay+100020:42
* weshay starts to cry20:43
mwhahahagood luck with that20:43
EmilienMmwhahaha: why? I don't think that's impossible20:43
weshay\/skick /sban mwhahaha20:43
EmilienMstevebaker: it's technically possible you think?20:43
weshayianw, ^20:43
EmilienMif yes, I'll ping infra20:43
weshaywhose turn is it to buy scotch for the infra guys?20:44
stevebakerEmilienM: do the VM images get rebuilt often enough for the layers to be current?20:44
weshaystevebaker, ya20:44
mwhahahano they don't which is kinda teh problem20:44
EmilienMstevebaker: every night we could do20:44
weshayalmost every day20:44
weshayya20:44
EmilienMthe only thing that will cost is :20:45
weshaythe image is 8.6 gb20:45
weshaynow20:45
EmilienM1) an image to store in more (so more storage for our cloud providers)20:45
EmilienM2) more time in the image build process20:45
EmilienMyeah I'm afraid about the size with our containers20:45
weshayhttps://nb01.openstack.org/images/20:45
EmilienMI think the first question inrfa will ask is what size would it take20:45
EmilienMand I'ma fraid of the answer20:46
stevebakerweshay: is it possible to just run CI against container images which are built more frequently? so current-tripleo is promoted images, but we do CI against a daily-tripleo tag or something?20:46
weshaystevebaker, so the containers would just be updated w/ dlrn-current20:46
weshayya.. that is fine20:46
weshayso we'd have that in the process..20:46
EmilienMI think what takes a bunch of time in our CI is that we run yum update in our containers20:46
weshayI know.. we could isolate the services more effectively on the bm20:46
EmilienMI mean we have to20:46
stevebakeryes that is the slowest part, if there hasn't been a promotion for a while, updating every image with the dlrn-current repo gets slower and slower20:47
EmilienMif we could come up with a size that it would take20:47
mwhahahaso you realize that the containers are like 17G right?20:47
EmilienMmaybe we can request infra20:47
weshaylolz20:47
weshay\/skick mwhahaha20:47
EmilienMwell20:47
EmilienMthe image is stored once20:47
EmilienMie no history20:47
mwhahahashouldn't be20:47
EmilienMwe won't store multiple versions of this image20:48
mwhahahai'm talking about the downloaded containers20:48
mwhahahajust a single set20:48
EmilienMI know20:48
EmilienMI'm trying to find a solution here :D20:48
weshayit would be more efficient, even if the image was large20:48
EmilienMbut to me we download the same things in our CI jobs20:48
mwhahahai think we need to work on other problems20:48
EmilienMso let's try to cache them in the image20:48
weshaythat's why the images are built daily20:48
weshayto keep them updated20:48
weshaythis just takes to the next level w/ containers20:49
mwhahahayes cause we need more complexity20:49
weshayhow else can sell rhel820:49
weshayMORE COMPLEXITY20:49
* mwhahaha closes laptop, opens window, escapes20:49
mwhahahaif you need me, i'll be doing something useful like taking anap20:49
* mwhahaha would rather we stop doing scenarios and only have a single undercloud job and convert all the services to single light weight standalones20:50
mwhahahai think we'd get more use out of that20:50
EmilienMmwhahaha: why aren't we doing it now20:50
EmilienMwhat does it take?20:50
EmilienMwe need an env/role per scenario20:50
EmilienMlet's start with scenario00120:50
mwhahahatime/people20:50
weshay$$20:51
EmilienMI thought we agreed to work on that20:51
mwhahahayea it's be 3 days20:51
mwhahahaDO TRY AND GIVE PEOPLE SOME TIME20:51
EmilienMno20:51
EmilienMok let me do it now20:51
* EmilienM disappears20:51
weshayEmilienM, converting scenario001 multinode to a standalone?20:51
EmilienMyes20:52
EmilienMit's not a big deal20:52
mwhahahayou can't convert it straight20:52
weshayand the standalone has enough resources?20:52
EmilienMwe need a role with the services20:52
EmilienMand an environment20:52
mwhahahayou need to pull the services out and split itnto multiple jobs20:52
weshayya20:52
EmilienMwhat's the big deal here?20:52
weshayI'm all for busting out the scenarios into combinations that would fit standalone20:52
EmilienMit's 1 yaml file thingy20:53
mwhahahano it's not20:53
weshaynice spec20:53
* weshay thinks EmilienM sounds like morazi20:53
*** Vorrtex has quit IRC20:53
EmilienMcome on20:53
weshaylolz20:53
stevebakerwoah20:53
weshaymwhahaha, EmilienM let's pause20:53
weshaywe know we want to work on standalone20:54
weshaysure20:54
weshaybut let's go pursue the DIB/image w/ containers20:54
mwhahahak you go do that20:54
mwhahahawe still need the standalone thing20:54
mwhahahafor f28 and other gates20:54
EmilienMscenario001 is a standalone with a dedicated env file20:54
EmilienMoverriding the serviecs20:54
EmilienMand that's it20:54
mwhahahaEmilienM: no it's not because there's not enough resources in CI to run scneario001 on a single box20:54
weshayI'm on TASK [Run docker-puppet tasks (bootstrap tasks) for step 3] **********************************************20:54
weshayw/ f2820:54
weshay:)20:54
EmilienMmwhahaha: just trash the multinode scenario00120:55
EmilienMin master20:55
weshaymwhahaha, just tell prad to make ceil more efficient20:55
EmilienMwe can do it with zuul super easily20:55
* mwhahaha gives up and goes to work on other things20:55
weshaymwhahaha, it's worth trying once20:55
mwhahahak get it done20:55
EmilienMmwhahaha: ok tell me what's complicated20:55
weshayEmilienM, too many services for one box20:56
mwhahahai already did but you keep blasting past it20:56
weshaybut maybe it would work20:56
EmilienMand I don't want to give up20:56
weshaymaybe not20:56
EmilienMnothing has merged20:56
EmilienMmwhahaha: then we reduce the services20:56
mwhahaharight to what20:56
weshayEmilienM, which one?20:56
EmilienMscenario00120:56
*** agopi has joined #tripleo20:56
EmilienMit's always broken20:56
mwhahahayou need to actually come up with a plan on what you're doing20:56
weshayya.. but which service20:56
weshayfaker20:56
EmilienMfirst let's make it non voting20:56
EmilienMwell let's stop to test autoscaling20:57
weshayhttps://github.com/openstack/tripleo-heat-templates/blob/master/README.rst#service-testing-matrix20:57
EmilienMlet's reduce the tempest tests that we run20:57
EmilienMso we can drop Heat maybe in this scenario20:57
mwhahahai'd rather we trash all the scenarios, and run a single standalone multinode (with my new standalone role) and then create individual standalone configurations for the various other services20:57
EmilienMHeat is already tested on the containerized undercloud20:57
mwhahahanot really20:57
mwhahahabut sure20:57
mwhahahasince we stopped doing actual heat, it's not really "Tested"20:58
mwhahahai know20:58
EmilienMI guess we need some trade offs20:58
mwhahahaEmilienM: let's come up with all the services and list them out how we want to est them in CI (etherpad?)20:58
mwhahahaand we can figure out new featuresets for them20:58
*** trown is now known as trown|outtypewww20:59
mwhahahaassuming standalone rather than muiltinode20:59
EmilienMso why services can't run on a single node?20:59
EmilienMwhat changes from having an overcloud?21:00
EmilienMonce we run the playbooks to deploy, the heat instance isn't running anymore21:00
mwhahaharam21:00
EmilienMso nothing should take memory on the host, (beside ansible)21:00
weshay########################################################21:01
weshayDeployment successfull!21:01
weshayf2821:01
*** abishop has quit IRC21:03
matbuceph21:03
matbuvi hosts21:03
matbuddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddd:q21:03
matbuddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddd:q:q!21:03
matburmddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddd:q:q!21:03
matbuddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddd:q:q!21:03
* stevebaker googles how to quit vi21:04
weshaydid matbu just die?21:04
EmilienMhe's stuck in vi21:05
matbuEmilienM: oops :)21:06
*** slaweq has quit IRC21:06
matbumy term are totally wierd,21:06
matbusorry for the noise21:06
stevebakerit cheered up my day21:07
mwhahahaEmilienM: so if we only have 8G, a default install of the normal standalone takes 8Gs.  Scenario* has way more services21:07
EmilienMright21:07
weshaymatbu, glad you are alive and well21:08
matbuweshay: yep just got a 90 baremetal nodes deployed21:09
weshaymatbu, where did yo get 90 nodes?21:09
EmilienMi think we don't want to know21:10
*** ooolpbot has joined #tripleo21:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION21:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537421:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)21:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256021:10
*** ooolpbot has quit IRC21:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)21:10
matbuweshay: no no you won't use for CI :) its the scale lab21:10
*** matbu has quit IRC21:10
*** matbu has joined #tripleo21:11
weshay90 nodes killed matbu's irc21:11
EmilienMweshay, mwhahaha : https://ethercalc.openstack.org/TripleOStandaloneCICoverage21:11
matbu:)21:11
EmilienMmaybe we could dress a list that way21:12
EmilienMI just drafted something super quick21:12
weshaydammit21:13
weshayI hate people that are naturally on cocaine21:13
*** panda is now known as panda|off21:13
EmilienMmwhahaha: maybe we could start by dressing a list of actions for the CI team21:17
EmilienMto give visibility on the work that you think needs to be done for standalone21:17
EmilienMthis spreadsheet will help if we wants to re-do scenarios21:18
EmilienMweshay: who from CI team can work on that topic, in short term?21:18
weshayEmilienM, it's going to be a sprint topoic21:21
weshaytopic21:21
weshaywe all can21:21
weshayminus the ruck/rover21:21
mwhahahaEmilienM: so is this the new version of the scenarios or are you documenting the old ones21:22
EmilienMweshay: when is next sprint starting?21:22
EmilienMmwhahaha: new21:22
weshayEmilienM, thrs21:22
mwhahahaso i'd like to get rid of the scenario names21:22
weshaytomorrow21:22
mwhahahacause the last thing we need is more magic decoder rings to figure out ci21:23
weshayagree, but also will note the names are the least of our problems21:23
EmilienMI have to go but I'm back in 1h, and all evening21:25
EmilienMmostly21:25
* weshay is going back to Red Rocks tonight :)21:25
EmilienMcan we capture some sort of todo list (high level)21:25
mwhahahasee he starts trouble and then just leaves21:25
EmilienMmwhahaha: ...21:25
EmilienMlet's create a list of things we need to do from high level so it's easier to split the work21:28
EmilienMI'll draft something when I'm back on etherpad or something21:28
EmilienMif nobody started before21:28
* EmilienM brb21:28
weshayEmilienM, you should come to our planning mtg21:31
weshaymwhahaha, EmilienM jaosorior fyi.. I'm seeing much more success from rdo jobs21:37
weshayI think the +1 for 3rd party can be expected again21:38
weshayhttp://dashboard-ci.tripleo.org/d/cEEjGFFmz/cockpit?orgId=1&panelId=207&fullscreen21:38
mwhahahaEmilienM, weshay: ok so i listed the major services and their current coverage under undercloud/ovb: https://ethercalc.openstack.org/TripleOStandaloneCICoverage21:46
mwhahahaare we missing anything?21:46
mwhahahaso we'd likely want to fill gaps in service coverage via standalone21:47
*** toure is now known as toure|gone21:48
openstackgerritMerged openstack/tripleo-quickstart-extras master: Support ARA statistics in InfluxDB for longest tasks  https://review.openstack.org/58023821:54
weshaysec21:54
*** lblanchard has quit IRC22:02
openstackgerritSteve Baker proposed openstack/tripleo-common master: Set prepare neutron_driver from NeutronMechanismDrivers  https://review.openstack.org/60495322:09
*** ooolpbot has joined #tripleo22:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION22:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537422:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)22:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256022:10
*** ooolpbot has quit IRC22:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)22:10
openstackgerritMerged openstack/tripleo-quickstart-extras master: Calculate ARA metrics for overcloud  https://review.openstack.org/60490022:14
openstackgerritMerged openstack/tripleo-quickstart master: Set containerized_undercloud for OpenShift featureset  https://review.openstack.org/60280222:14
openstackgerritMerged openstack/tripleo-common master: Switch to openshift 3.10  https://review.openstack.org/59682022:17
openstackgerritMerged openstack/tripleo-common master: Switch to origin-docker-build  https://review.openstack.org/59930722:17
stevebakermwhahaha: permission to merge this? https://review.openstack.org/#/c/60495222:20
openstackgerritSam Doran proposed openstack/tripleo-quickstart master: Add YAML standard out callback plugin  https://review.openstack.org/60554322:24
openstackgerritArx Cruz proposed openstack/tripleo-quickstart-extras master: WIP - Fix stackviz  https://review.openstack.org/60541922:27
openstackgerritSteve Baker proposed openstack/tripleo-quickstart master: Make setup repo task output visible when errored  https://review.openstack.org/59935822:28
openstackgerritMerged openstack/tripleo-common master: Add container-registry image to openshift master role  https://review.openstack.org/60211222:33
*** rcernin has joined #tripleo22:35
*** tosky has quit IRC22:37
beagleswe still have a "do not workflow" in effect?22:38
* beagles glances at channel status, duh22:38
EmilienMbeagles: hey22:53
EmilienMdid you have a chance to look at the sidecar podman container thingy?22:53
*** rcernin has quit IRC22:53
EmilienMweshay: yes I could join your planning22:54
EmilienMweshay: invite me22:54
beaglesEmilienM: nyet - but can tomorrow a.m. Going to link up with bogdan22:54
EmilienMbeagles: excellent22:54
beaglesEmilienM: so what's the story with the gate, is there progress on sorting out why the timeouts?22:55
*** rcernin has joined #tripleo22:55
EmilienMbeagles: not really, bunch of timeouts22:55
beaglesmeh22:55
EmilienMbut there are some wips22:55
EmilienMalex is looking at concurrency22:55
beaglesI think the last I was following it was timeouts on getting data around (pulling container images, etc)? is that still the thing?22:56
EmilienMmwhahaha: ack for the spreadsheet, so we would have "standalone-ceph" for ex?22:56
EmilienMbeagles: see https://review.openstack.org/#/q/topic:concurrent-settings22:56
beaglesEmilienM: ack thanks22:58
EmilienMbeagles: could you please create a card in https://trello.com/b/S8TmOU0u/tripleo-podman and update with progress if any23:00
EmilienMit helps to track our efforts, thx23:00
beaglesEmilienM: sure thing23:01
EmilienMthx23:01
*** eggmaster has quit IRC23:08
*** eggmaster has joined #tripleo23:08
*** ooolpbot has joined #tripleo23:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION23:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537423:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256023:10
*** ooolpbot has quit IRC23:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)23:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)23:10
EmilienMbeagles: so there is another approach23:11
EmilienMbeagles: look https://review.openstack.org/#/c/604659/223:11
EmilienMI'm going to update https://review.openstack.org/#/c/604180/ to use that23:12
EmilienMwe would configure these containers with cap SYSADMIN & mount /var/lib/container23:12
EmilienMso they can do things that we need23:12
EmilienMbut privileged mode is too high for what we need23:12
EmilienM(credits to stevebaker for the great idea)23:12
beaglesah so this implements what steve baker was talking about it in the email thread23:12
beaglesthat'd be cool23:12
EmilienMyea23:13
EmilienMbeagles: again we want an approach that isn't heavy and complicated23:13
EmilienMand that keeps our problem solved: separate neutron container from the services that it manages like haproxy/keepalived/dnsmasq etc23:13
beaglesEmilienM: yeah, I was just going to say - I'm all for easy23:13
EmilienMso we can restart them independently23:13
EmilienMcool23:14
EmilienMbeagles: so yeah let's try that23:14
beaglesEmilienM: right on23:14
EmilienMbeagles: I'm working on pacemaker bits now https://review.openstack.org/#/c/604180/23:14
EmilienMbut i'll let you figure out the neutron thing23:14
beaglesEmilienM: ack23:14
EmilienMbasically we need the containers configured with cap_drop=SYSADMIN and mount /var/lib/containers23:14
mwhahahaEmilienM: so yea we'd likely want a standalone-ceph, but that's a weird one where we'll probably want an ovb-ceph as well23:15
beaglesright23:16
EmilienMmwhahaha: why would we need ovb-ceph? I would hope we can keep ovb as lighter as possible.23:16
mwhahahaEmilienM: one that excercises mistral to deploy ceph-ansible23:16
mwhahahaEmilienM: we could do a 1 uc 1 all-in-one (with ceph)23:16
mwhahahato be lighter23:16
mwhahaharather than an HA one23:16
EmilienMah right, dat mistral23:17
mwhahahawe could leave 1 multinode job that maybe runs ceph23:17
mwhahahado a 1 uc 1 all-in-one (with ceph) multinode23:18
mwhahahato replace the existing one23:18
mwhahahabut these are the types of things that need to be thought out and aren't just "go do a thing"23:18
mwhahahathough a 1uc/1all-in-one(with ceph) is a better test if we want to properly excercise mistral, etc23:19
EmilienMyes23:19
mwhahahai meant on ovb but anyway23:19
*** mcornea has quit IRC23:20
mwhahahafrom a "basic tripleo deployment" in CI, i'd like to stadandarize on https://review.openstack.org/#/c/605156/23:20
mwhahahaall the other services would get coverage via a standalone unless it needs something special (ocavia/ceph)23:20
EmilienMI also think we'll have to make tradeoffs23:21
EmilienMtesting autoscaling requires all telemetry + heat23:21
EmilienMit's clearly not working for us23:21
EmilienMtimeouts etc23:21
EmilienMI would be ok to have heat + basics and actually test heat api23:22
mwhahahadoes that get covered in the full tempest run we do for promotions?23:22
EmilienMnot sure about that23:22
mwhahahawe should see, because if it does i think that would tick that box23:22
mwhahahathats the other thing that would be nice to know is the tempest tests for these feature sets23:22
EmilienMif we could have "advanced" testing in RDO CI (with higher timeouts)23:22
mwhahahabecause we could be deploying all this stuff but not even bothering to exercise it23:22
EmilienMand "simpler" tests in our gate23:22
EmilienMit would help imho23:23
mwhahahaEmilienM: that's why i recommend additional ovb jobs (other than the HA ones)23:23
EmilienM:-o23:23
EmilienMdo we have capacity?23:23
mwhahahaof course not23:23
mwhahahabut if we didn't run fs001 and ds35 on everything, maybe23:23
mwhahahawhy do we run it on just about everything again?23:23
EmilienM(is that the time where you blame me? :-))23:24
mwhahahai've been blaming you since you started this several hours ago23:24
EmilienManyway23:24
mwhahahaSO THERE23:24
EmilienMyou can't blame me23:24
mwhahaha:D23:24
mwhahahai can and will23:24
EmilienMI work with one eye since Monday23:24
mwhahahaand you can't stop me23:24
EmilienMso i'm half processing?23:24
mwhahahaARRRRRmilienM23:24
mwhahahait would also be beneficial to properly scope jobs to specific areas in the tripleo code base23:25
mwhahahai tend to think that we excessively run things (i'm looking at you tripleo-quickstart)23:25
*** rh-jelabarre has quit IRC23:26
mwhahahaspeaking of random things, is someone driving the ovn switch?23:29
EmilienMbeagles: ^23:30
mwhahahacause that likely adds additional minimum requirements for some jobs23:30
mwhahahawith new servies and such23:30
*** tzumainn has quit IRC23:34
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates master: Allow to run bootstrap containers in privileged mode.  https://review.openstack.org/60053323:35
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates master: Set proper setype for service directories  https://review.openstack.org/60053423:35
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates master: Allow to deactivate SELinux separation for selected containers  https://review.openstack.org/60053523:35
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates master: Support podman when tagging container for Pacemaker  https://review.openstack.org/60418023:35
*** dxiri has quit IRC23:35
EmilienMstevebaker: https://review.openstack.org/#/c/604180/12/docker/services/pacemaker/cinder-backup.yaml23:36
EmilienMsomething like this ^ see cap23:36
EmilienMI haven't tested it (in progress now)23:36
*** rcernin_ has joined #tripleo23:41
openstackgerritAlex Schultz proposed openstack/tripleo-docs master: Update standalone docs  https://review.openstack.org/60352223:41
stevebakerEmilienM: looks good23:42
*** rcernin has quit IRC23:43
mwhahahaEmilienM: so why do we run pacemaker on containers-multinode?23:47
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates master: Support podman when tagging container for Pacemaker  https://review.openstack.org/60418023:47
* mwhahaha sighs23:47
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates master: Support podman when tagging container for Pacemaker  https://review.openstack.org/60418023:47
mwhahaha:q23:47
EmilienMmwhahaha: because at some point we said pacemaker would be default on overcloud23:48
EmilienMto avoid keepalived23:48
mwhahahayet we didn't make that a thing23:48
mwhahahabecause if that was a thing, we should have fixed it in the resource-registry23:49
mwhahahaand not in the CI scenarios23:49
*** openstackgerrit has quit IRC23:49
* mwhahaha flips tables and will resume hating life tomorrow23:49
EmilienMlet's put thisway :23:49
mwhahahatomorrow: roofers, so i shall enjoy banging my head against the wall and hearing it on my roof23:49
EmilienMthere is room for improvment23:49
*** rrubins__ has quit IRC23:54
*** openstackgerrit has joined #tripleo23:58
openstackgerritAlex Schultz proposed openstack/ansible-role-chrony master: Revert "Remove zuul configuration"  https://review.openstack.org/60555723:58
openstackgerritAlex Schultz proposed openstack/ansible-role-chrony master: Fix .gitreview  https://review.openstack.org/60555823:59

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!