14:02:29 #startmeeting TripleO Edge Squad Meeting 14:02:29 Meeting started Thu Jan 31 14:02:29 2019 UTC and is due to finish in 60 minutes. The chair is slagle. Information about MeetBot at http://wiki.debian.org/MeetBot. 14:02:31 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 14:02:32 ping slagle, csatari, jaosorior, owalsh, fultonj, gfidente, hjensas, jtomasek, bogdando, dtantsur, rbrady, d0ugal, toure, abishop 14:02:34 #info remove or update your nick from the Meeting Template on the etherpad if you want (or don't want) to be ping'd for the start of the meeting 14:02:34 The meeting name has been set to 'tripleo_edge_squad_meeting' 14:02:38 o/ 14:02:40 o/ 14:02:41 Jose Luis Franco proposed openstack/tripleo-common master: Make the image pulling optional for tripleo-container-tag role. https://review.openstack.org/634239 14:02:42 #link https://etherpad.openstack.org/p/tripleo-edge-squad-status 14:02:43 o/ 14:02:45 Anyone can use the #link, #action, #help, #idea, #agreed, and #info commands, not just the moderatorǃ 14:02:48 hey! 14:02:49 beagles: makes wonder how on earth things promoted to that state in the repos ... 14:02:52 o/ 14:03:00 o/ (might be afk for a few mins) 14:03:32 #topic Agenda 14:03:35 * Review past action items 14:03:37 * Goals/action items for the week 14:03:48 #topic Review past action items 14:03:53 o/ 14:04:02 our AI's are from 2 weeks ago as we didn't meet last week 14:04:42 * slagle slagle/fultonj try multi-ceph deployments with DCN 14:05:04 i didn't get to 14:05:11 oh good me neither :) 14:05:11 slagle: did you? 14:05:18 punt to next week? 14:05:21 sure 14:05:23 Natal Ngétal proposed openstack/tripleo-common master: [Core] Be pep8 compliant. https://review.openstack.org/634241 14:05:26 i've only done it with standalone 14:05:32 #action slagle/fultonj try multi-ceph deployments with DCN 14:05:46 * slagle review https://etherpad.openstack.org/p/tripleo-edge-glance-deployment 14:06:22 abishop: i took a look. seems bogdando and I had a similar question about glance-api at the edge using the central DB 14:06:43 does it require a direct DB connection from the edge to the central site? 14:07:06 quick answer (and I'll update etherpad) is yes, it does 14:07:22 all part of the common control plane 14:08:00 what functionality is affected if the connection is lost? 14:09:04 well, I think the whole central control plane model presumes control plane is key to everything 14:09:10 I think the anwer is the same as for DCN in general 14:09:18 everything but running workloads 14:09:21 if connectino is lost, no new instances, no new volumes, no nothing 14:09:31 bogdando: ack! 14:09:51 ok, makes sense. just wanted to double check 14:10:03 which is an accepted limitation of DCN from what i have heard 14:10:12 right 14:11:06 mschuppert added libvirt-guest support too, so we could restart the VMs on reboot 14:11:12 node reboot 14:11:21 abishop: do you plan to propose the work from your repo as patches, etc? 14:11:27 owalsh: oh nice, on ephemeral volumes? 14:11:27 abishop: just wondering what the next steps are 14:11:54 fultonj: yea, I expect there are caveats... volumes, networking etc... 14:11:54 owalsh: cinder backed VMs too? 14:12:05 slagle: right now they're mainly a set of THT parameter values, but no new parameters 14:12:19 so not sure what I'd be patching 14:12:34 abishop: mostly just thinking environment files, roles, and docs 14:13:00 fultonj: not quite sure TBH 14:13:43 slagle: I can ponder, and will start with fresh review of what's out there (in tht tree) and suggest a place to make some updates 14:14:20 I'm still working on getting a glance expert involved (jokke) but his time is extremely limited 14:14:30 ack 14:14:46 ok, and the last action item... 14:14:53 * slagle continue investigating glance and nova image caching status/options 14:15:00 guess we kind of already talked about it 14:15:11 unless there are other proposals not caputred in the etherpad/ 14:15:13 ? 14:17:03 owalsh: the instances get started again via nova, so yes neutron needs to be available, also cinder if volumes are used. 14:17:23 mschuppert: so we'd need the control plane 14:17:29 #topic Goals/action items for the week 14:17:32 owalsh: yes 14:17:38 dang 14:17:58 there was an item regarding the split control plane ci 14:18:24 update: the job is extracting control plane data and putting it where it needs to be on the separate compute node 14:18:27 progress ^ 14:18:30 fultonj: i had attempted to keep it moving forward when you were in brno 14:18:36 slagle: yes, thank you 14:18:41 i haven't revisited yet though :/ 14:18:43 it helped 14:18:51 i'll take another look. is there a current issue? 14:18:54 the current status is... 14:18:56 #action fultonj (et al) get split control plane ci job to not crash on podman container restart 14:19:00 #link http://logs.openstack.org/88/615988/14/check/tripleo-ci-centos-7-split-controlplane-standalone/cd1a87f/logs/subnode-2/home/zuul/standalone_deploy.log.txt.gz 14:19:09 #link https://review.openstack.org/#/c/615988/ 14:19:11 oh ok. yea i had to switch the job to podman 14:19:24 yeah, so controller goes up with podman fine; compute fails on ^ 14:19:43 error restarting $container0, $container1, ... on step 3 14:22:15 so, https://review.openstack.org/#/c/632089/ needs more eyes 14:22:27 fultonj: strange. wonder if it's a podman issue or something else 14:22:42 attempts describe negative scenarios and expectations for failure modes supported 14:23:14 slagle: right. i was going to try to reproduce in my env. last time i did edge in my env was w/ docker 14:23:33 #action review https://review.openstack.org/#/c/632089/ 14:23:37 bogdando: will review it 14:23:46 Jose Luis Franco proposed openstack/tripleo-heat-templates master: Do not pull image while tagging pcmk images in upgrade_tasks. https://review.openstack.org/634243 14:24:56 anything else that folks want to highlight this week? 14:25:31 Sagi Shnaidman proposed openstack/tripleo-quickstart master: Use force_tcg by libguestfs is not ok https://review.openstack.org/633444 14:25:49 mschuppert: when you said 'yes' to owalsh... 14:26:10 did you mean reboots of instances on edge nodes 14:26:21 which were backed by cinder and ephemeral volumes? 14:26:27 or is that still not known? 14:27:01 fultonj: yes, compute needs neutron to be up. doesn't matter if ephemeral or cinder 14:27:54 mschuppert: thanks. so is the problem then that neutron may not be up? 14:28:02 fultonj: yes 14:28:14 no rebooting, no live migration will be possible indeed 14:28:35 is that someone we need to track or simply state as a known limitation? 14:28:50 running workloads keep running. if they stopped running, then you need the ctl plane back 14:29:02 even with that nova chnage, no reboot 14:29:08 that should be a known limitation IMO, we can't do that w/o control plane 14:29:17 works for me 14:29:20 thanks mschuppert 14:29:41 :) 14:31:44 thanks folks! 14:31:46 #endmeeting