Monday, 2019-04-08

*** Liang__ has joined #openstack-helm00:50
*** Liang__ is now known as LiangFang00:55
*** irclogbot_1 has quit IRC01:44
*** JangwonLee_ has joined #openstack-helm02:15
*** JangwonLee has quit IRC02:18
openstackgerritRahul Khiyani proposed openstack/openstack-helm-infra master: rabbitmq securityContext  https://review.openstack.org/64291802:42
*** pgaxatte has joined #openstack-helm06:05
*** Nishant_ has joined #openstack-helm06:24
*** dpawlik has joined #openstack-helm06:45
pgaxattehello06:50
pgaxattei'm reading the docs on how to add authentication for private registries: https://docs.openstack.org/openstack-helm/latest/ko_KR/specs/support-OCI-image-registry-with-authentication-turned-on.html06:52
pgaxatteis this still active?06:52
pgaxattei mean there 3 work items at the end, are they stil activetely worked on?06:53
*** jamesgu has quit IRC06:58
*** nmimi has joined #openstack-helm07:13
*** dimitris_ has joined #openstack-helm07:19
*** alisanhaji has joined #openstack-helm07:24
*** jsuchome has joined #openstack-helm07:28
*** alisanhaji has quit IRC07:29
evrardjpo/07:37
evrardjppgaxatte: I don't know. Maybe ask tomorrow in the meeting?07:37
pgaxatteevrardjp: alright will do, what time are the meetings?07:39
evrardjppgaxatte: here is an ICS file for your calendar :)07:40
evrardjphttp://eavesdrop.openstack.org/#OpenStack-Helm_Team_Meeting07:40
evrardjptl:dr; 1500 utc07:41
pgaxatte:)07:41
pgaxattethanks07:41
*** witek has joined #openstack-helm07:47
*** itxaka has joined #openstack-helm07:49
*** alisanhaji has joined #openstack-helm07:53
*** nick_kar has joined #openstack-helm07:55
*** happyhemant has joined #openstack-helm08:00
*** roman_g has joined #openstack-helm08:15
*** witek has quit IRC09:20
*** witek has joined #openstack-helm09:31
*** LiangFang has quit IRC10:04
pgaxattehello again11:59
pgaxattewe're trying to deploy the multinode scripts on top of Kubernetes 1.1412:00
pgaxattehas anyone succeeded already? is openstack-helm limited to a certain version of k8s?12:00
pgaxattewe were able to spawn on kubernetes 1.1212:01
portdirectWhat errors have you been seeing with 1.14?12:01
portdirectI've not run on that yet, but we test with 1.1312:01
pgaxatteportdirect: this kind of errors: http://paste.openstack.org/show/749001/12:02
pgaxattedoes it work out of the box on 1.13?12:03
portdirectIt should12:03
pgaxattethe error we get looks like something was deprecated or moved out of beta in k8s 1.1412:03
portdirectIt is possible that the storage class schema has changed/been tightened up slightly12:04
portdirectAgreed12:04
portdirectThis should be fairly simple to fix, at the very worst we may need a conditional dependent on version of k8s tiller is talking to12:05
pgaxatteportdirect: we're digging in ceph-mon helm charts and we'll report back if we find something12:26
portdirectI think ceph-client will be what's needs attention here at 1st glance12:33
*** spiette has quit IRC12:40
pgaxatteportdirect so far we see that there is an issue when creating the ceph-mon-keyring secret12:40
pgaxattethe lead we are exploring is maybe upgrading the kubectl used in the pod12:41
pgaxatteit's a v1.10.3 maybe it's a bit too old to work with k8s 1.1412:41
*** spiette has joined #openstack-helm12:43
pgaxattethe problem is coming from there apparently12:50
pgaxatteusing the same version of kubectl as in the pod we could not create the secret by hand but we succeeded when using kubectl 1.1212:51
portdirectNice!12:52
portdirectIf you could make a ps to openstack-helm-images bumping the version of kubectl in the image it would be really appreciated12:53
*** parasitid has joined #openstack-helm12:53
pgaxatteon my way :)12:53
*** bh526r has joined #openstack-helm12:55
parasitidhi ppl12:55
parasitidhas anyone ever tried to base an openstack-helm deploymennt based on rook instead of oshi/ceph ?12:56
parasitidis it a good idea or not12:56
parasitid?12:56
portdirectparasitid, ive done some very simple sanity tests12:58
portdirectboth rook and the osh-infra charts provide vanilla ceph12:58
portdirectso it would be perfectly valid to use rook if so desired12:59
portdirectthe one thing that would chnage would be needing to treat the ceph cluster as an `external` one - jayahn has a wip ps for how to do this in gerrit13:00
openstackgerritPete Birley proposed openstack/openstack-helm-infra master: HTK: Update k8s-entrypoint container macro to add security context  https://review.openstack.org/65051913:03
parasitidoki13:07
parasitidthanks13:07
parasitidcould you point me his wip on gerrit ?13:08
parasitidor is it already in the docs how to use an external ceph cluster ?13:08
portdirectparasitid: https://review.openstack.org/#/c/586992/13:09
parasitidah13:13
parasitidcheers13:13
parasitidok13:16
parasitidso ive partially read it13:16
parasitidit seems to me that, according to the https://review.openstack.org/#/c/586992/9/doc/source/install/developer/deploy-with-existing-ceph.rst13:17
parasitidwe still have to use some osh ceph tools to do ceph-provisionning13:18
parasitidam i correct ?13:18
pgaxatteportdirect, i'm not used to storyboard but here it is: https://storyboard.openstack.org/#!/story/200539713:19
portdirectyou shoudl be able to skip the 1st section as i belive rook provides its own storageclass13:19
portdirectlines 53 onwards will be required if you intend to use ceph with cinder/glance/nova13:20
parasitidok13:20
openstackgerritdiwakar thyagaraj proposed openstack/openstack-helm-infra master: Add Docker default AppArmor profile to Ceph-osd Change apparmor for ceph-osd into gate script Make MAC section of daemonset optional in case nothing is specified in values  https://review.openstack.org/64763813:27
openstackgerritdiwakar thyagaraj proposed openstack/openstack-helm-infra master: Add default AppArmor profile to Fluentbit Make MAC profile annotation optional because nothing is currently defined in values.yaml  https://review.openstack.org/64788113:31
*** jsuchome has quit IRC13:53
openstackgerritLuna Das proposed openstack/openstack-helm-infra master: Add docker-default apparmor profile for elasticserach.  https://review.openstack.org/64999014:02
openstackgerritHemanth Nakkina proposed openstack/openstack-helm-infra master: [WIP] Helm chart for sriov-cni plugin  https://review.openstack.org/65090814:04
openstackgerritDeepak proposed openstack/openstack-helm-images master: Makefile and Docker file for calicoctl-utility containers  https://review.openstack.org/64974214:06
*** Nishant_ has quit IRC14:08
*** dpawlik has quit IRC14:13
*** kranthikirang has joined #openstack-helm14:22
*** michael-beaver has joined #openstack-helm14:25
openstackgerritkranthi kiran guttikonda proposed openstack/openstack-helm-infra master: ceph-mon, tenat-ceph storageclass schema  https://review.openstack.org/65042814:27
*** jamesgu has joined #openstack-helm14:27
evrardjpparasitid: I think I would be happy to follow what's going on on that level (rook support work)14:33
openstackgerritIan Howell proposed openstack/openstack-helm master: WIP/DNM - Add the upgrade lifecycle step  https://review.openstack.org/64918914:34
*** howell has joined #openstack-helm14:34
openstackgerritMerged openstack/openstack-helm master: change the way to get tunnel device  https://review.openstack.org/64390914:36
openstackgerritRandeep Jalli proposed openstack/openstack-helm-infra master: This commit adds docker-default apparmor profile for prometheus-alertmanager Add in prometheus-alertmanager gate script as a script  https://review.openstack.org/65037314:39
*** pgaxatte has quit IRC14:40
parasitidevrardjp: ok14:45
parasitidwill try to keep you informed14:45
evrardjpthanks parasitid  :D14:46
openstackgerritRandeep Jalli proposed openstack/openstack-helm-infra master: Add Docker default AppArmor profile to Ceph-osd Change apparmor for ceph-osd into gate script Make MAC section of daemonset optional in case nothing is specified in values  https://review.openstack.org/64763814:52
*** lemko has joined #openstack-helm14:53
*** cfriesen has joined #openstack-helm14:54
happyhemantevrardjp: Hi I actually trying to deploy horizon on openstack but got some strange error in logs. may be you are famillier with this and could help me. https://www.irccloud.com/pastebin/4XD0qqLC/14:55
happyhemantI dont get it this "no listening sockets available"14:56
*** jsuchome has joined #openstack-helm14:56
jsuchomeportdirect: Hi, I think your comments in https://review.openstack.org/#/c/642844/ were addressed ...14:57
*** cfriesen has quit IRC14:59
jsuchomeAny reason not to merge this? https://review.openstack.org/#/c/642415/ As it is just addition of new script, it's not affecting upstream at all...14:59
*** sthussey has joined #openstack-helm15:01
happyhemantevrardjp: I also get this error when i apply my horizon manifest file. https://www.irccloud.com/pastebin/do3Gegui/15:31
portdirectdwalt / mattmceuen ^ any thoughts?15:33
*** itlinux_ has quit IRC15:33
openstackgerritItxaka Serrano Garcia proposed openstack/openstack-helm-images master: Add tempest suse image and version ARG  https://review.openstack.org/65093315:40
openstackgerritJagan Mohan Kavva proposed openstack/openstack-helm-infra master: Add docker-default (enforce) AppArmor profile to openvswitch  https://review.openstack.org/65094015:57
openstackgerritItxaka Serrano Garcia proposed openstack/openstack-helm master: Fix configmap-etc values for tempest  https://review.openstack.org/65094816:03
openstackgerritPete Birley proposed openstack/openstack-helm-infra master: Allow multiple containers per daemonset pod  https://review.openstack.org/64595816:08
openstackgerritPete Birley proposed openstack/openstack-helm-infra master: WIP: MariaDB: Update backup scripts  https://review.openstack.org/65095016:20
*** michaelbeaver has joined #openstack-helm16:21
*** michael-beaver has quit IRC16:24
*** itlinux has joined #openstack-helm16:28
*** witek has quit IRC16:29
openstackgerritMerged openstack/openstack-helm-infra master: HTK: Update k8s-entrypoint container macro to add security context  https://review.openstack.org/65051916:34
*** unicell has joined #openstack-helm16:37
openstackgerritMerged openstack/openstack-helm-infra master: Ceph: fix overriding ceph monitor hosts value  https://review.openstack.org/64927916:38
*** michaelbeaver has quit IRC16:40
openstackgerritRandeep Jalli proposed openstack/openstack-helm-infra master: Add docker-default apparmor profile for elasticserach.  https://review.openstack.org/64999016:42
*** unicell has quit IRC16:45
openstackgerritRandeep Jalli proposed openstack/openstack-helm-infra master: This commit adds docker-default apparmor profile for prometheus-alertmanager Add in prometheus-alertmanager gate script as a script  https://review.openstack.org/65037316:50
openstackgerritRandeep Jalli proposed openstack/openstack-helm-infra master: This commit adds docker-default apparmor profile for prometheus-node-exporter.  https://review.openstack.org/65038616:50
openstackgerritRandeep Jalli proposed openstack/openstack-helm-infra master: This commit adds docker-default apparmor profile for prometheus-openstack-exporter.  https://review.openstack.org/65038816:50
openstackgerritRandeep Jalli proposed openstack/openstack-helm-infra master: Add docker-default apparmor profile for prometheus process exporter.  https://review.openstack.org/65039516:50
openstackgerritRandeep Jalli proposed openstack/openstack-helm-infra master: Add Docker default AppArmor profile to Ceph-osd Change apparmor for ceph-osd into gate script Make MAC section of daemonset optional in case nothing is specified in values  https://review.openstack.org/64763816:51
openstackgerritRandeep Jalli proposed openstack/openstack-helm-infra master: Add docker-default apparmor profile for elasticserach.  https://review.openstack.org/64999016:51
openstackgerritRandeep Jalli proposed openstack/openstack-helm-infra master: [WIP]Change gate job to see if running just libvirt in the AppArmor gate passes  https://review.openstack.org/65096116:54
openstackgerritRandeep Jalli proposed openstack/openstack-helm-infra master: [WIP]Change gate job to see if running just libvirt in the AppArmor gate passes  https://review.openstack.org/65096116:55
openstackgerritRandeep Jalli proposed openstack/openstack-helm-infra master: Add Docker default AppArmor profile to Ceph-osd Change apparmor for ceph-osd into gate script Make MAC section of daemonset optional in case nothing is specified in values  https://review.openstack.org/64763817:08
*** unicell has joined #openstack-helm17:17
openstackgerritMerged openstack/openstack-helm master: Add network policy ingress rule to mariadb  https://review.openstack.org/63829917:37
openstackgerritRandeep Jalli proposed openstack/openstack-helm-infra master: Add Docker default AppArmor profile to Ceph-osd Change apparmor for ceph-osd into gate script Make MAC section of daemonset optional in case nothing is specified in values  https://review.openstack.org/64763817:46
openstackgerritdiwakar thyagaraj proposed openstack/openstack-helm-infra master: Add default AppArmor profile to Fluentbit Make MAC profile annotation optional because nothing is currently defined in values.yaml  https://review.openstack.org/64788118:04
openstackgerritMerged openstack/openstack-helm-images master: Add rocky release script  https://review.openstack.org/64241518:05
openstackgerritRandeep Jalli proposed openstack/openstack-helm-infra master: Add Docker default AppArmor profile to Ceph-osd Change apparmor for ceph-osd into gate script Make MAC section of daemonset optional in case nothing is specified in values  https://review.openstack.org/64763818:07
openstackgerritLuna Das proposed openstack/openstack-helm-infra master: [WIP]Add docker-default apparmor profile for elasticserach.  https://review.openstack.org/64999018:13
*** michael-beaver has joined #openstack-helm18:15
*** lemko has quit IRC18:15
openstackgerritRandeep Jalli proposed openstack/openstack-helm-infra master: [WIP]Add docker-default apparmor profile for elasticserach.  https://review.openstack.org/64999018:16
*** itlinux has quit IRC18:19
*** bh526r has quit IRC18:19
*** bh526r has joined #openstack-helm18:20
*** alanmeadows has quit IRC18:20
*** mattmceuen has quit IRC18:20
*** megheisler has quit IRC18:20
*** lamt has quit IRC18:20
*** adrianreza has quit IRC18:20
*** mattmceuen has joined #openstack-helm18:20
*** adrianreza has joined #openstack-helm18:20
*** alanmeadows has joined #openstack-helm18:21
*** ChanServ sets mode: +o alanmeadows18:21
*** megheisler has joined #openstack-helm18:25
*** lamt has joined #openstack-helm18:34
*** flaviosr_ has joined #openstack-helm18:37
*** flaviosr has quit IRC18:38
*** michael-beaver has quit IRC18:39
openstackgerritGage Hugo proposed openstack/openstack-helm master: Add wait.resources.type to armada manifest  https://review.openstack.org/64850318:42
*** michael-beaver has joined #openstack-helm18:44
*** happyhemant has quit IRC18:54
openstackgerritDoug Aaser proposed openstack/openstack-helm-infra master: [WIP] Patroni inclusion work for HA Postgres  https://review.openstack.org/64438818:55
openstackgerritRandeep Jalli proposed openstack/openstack-helm-infra master: Add Docker default AppArmor profile to Ceph-osd Change apparmor for ceph-osd into gate script Make MAC section of daemonset optional in case nothing is specified in values  https://review.openstack.org/64763818:57
*** witek has joined #openstack-helm19:03
*** alisanhaji has quit IRC19:05
openstackgerritMeghan Heisler proposed openstack/openstack-helm-infra master: Add wait.resource for LMA services to armada manifest  https://review.openstack.org/64914119:05
openstackgerritDoug Aaser proposed openstack/openstack-helm-infra master: [WIP] Patroni inclusion work for HA Postgres  https://review.openstack.org/64438819:06
*** howell has quit IRC19:10
*** jsuchome has quit IRC19:10
*** witek has quit IRC19:25
*** cfriesen has joined #openstack-helm19:33
openstackgerritPete Birley proposed openstack/openstack-helm-infra master: WIP: MariaDB: Update backup scripts  https://review.openstack.org/65095019:53
dwaltportdirect: happyhemant: thanks. Responded to your message in #airshipit19:54
cfriesenportdirect: thought I'd give you a heads-up on something weird I saw.  was booting up controller nodes and one of them didn't come up due to disk pressure.  The other mariadb nodes were stuck in /tmp/start.py because check_if_cluster_data_is_fresh() kept evaluating to false (because the one node wasn't updating its timestamp)20:00
portdirectthis is kinda by design20:01
portdirectas we need to be be able to be sure which node leads the cluster20:01
cfriesenit does mean that if a single node isn't working your whole DB is down20:02
cfriesenon startup, at least20:02
portdirecton a full cluster restart, all three pods need to come up before the cluster will reform20:02
*** itxaka has quit IRC20:03
cfriesenI'm envisioning recovering from a power outage or something...could get stuck with an extended outage until someone comes in and manually fixes it up.20:03
portdirectas it stands today this is correct20:03
portdirectwe are working on some methods to mitigate this however20:03
cfriesenah?20:03
portdirectmostly centered around the need to have some way of moving the pod from a dead node to one that is running20:04
portdirecta 'reaper' if you will20:04
portdirecthere there needs to be some sanity checking - eg if the workload is in `nodelost` or `unknown` for x lins and there is no rbd client connected, then it should be safe to get the pod running on another node20:05
cfriesenportdirect: there's a ~5min timeout in kubernetes to prevent you from accessing the backing store volume on a new node20:05
portdirectif we go to the other extreme - we do however expose ourselves to data loss20:06
cfriesen(apparently the only way around the k8s timeout is to delete the node entirely)20:06
portdirecteg - what if the pod thats not come up has received some tx that has not yet been synced to other pods?20:06
*** jaypipes has quit IRC20:07
*** jaypipes has joined #openstack-helm20:07
cfriesenportdirect: yeah, it's a tough call.  do we recover automatically but risk some data loss, or extend the outage?20:07
openstackgerritGage Hugo proposed openstack/openstack-helm master: Add credential delete hook to keystone chart  https://review.openstack.org/62413120:07
portdirectcfriesen: we went for the latter, as we felt it safest20:08
portdirectand we can shorten it with some mitigations (eg the above)20:08
*** jaypipes_ has joined #openstack-helm20:08
portdirectbut if there was a viable suggestion to relax the 'paranoia' here, we would welcome it20:08
*** jaypipes has quit IRC20:12
portdirectcfriesen: other than the issue above - how has your testing been going?20:16
openstackgerritPete Birley proposed openstack/openstack-helm-infra master: WIP: MariaDB: Update backup scripts  https://review.openstack.org/65095020:16
openstackgerritdiwakar thyagaraj proposed openstack/openstack-helm-infra master: Add default AppArmor profile to Fluentbit Make MAC profile annotation optional because nothing is currently defined in values.yaml  https://review.openstack.org/64788120:17
openstackgerritDoug Aaser proposed openstack/openstack-helm-infra master: [WIP] Patroni inclusion work for HA Postgres  https://review.openstack.org/64438820:17
cfriesenportdirect: I think there are a few issues that will be brought up if they haven't been already.20:23
portdirectcfriesen: nice (or not, as the case may be :) ) it would be great to get your findings tracked so we can ensure we address them20:25
*** unicell has left #openstack-helm20:48
*** jaypipes_ is now known as jaypipes20:51
openstackgerritMeghan Heisler proposed openstack/openstack-helm-infra master: Add egress network policy to LMA services  https://review.openstack.org/64255521:07
cfriesenportdirect: had a thought...what about specifying a timeout in the values file.  the system would behave as it does now initially, but any nodes that haven't updated themselves after the timeout expires would have their information wiped from the state configmap21:23
portdirecthow would this protect against potential data loss though?21:30
portdirector would we work on the assumption that if you set/enabled this option that you'd be accepting of the risk?21:31
cfriesenportdirect: right..so basically the operator could say "if the system hasn't come up within 30 minutes" (or whatever the threshold is) then we want to come up anyway even if it means data loss21:37
cfriesencould default to zero, meaning we wait forever21:37
portdirectcfriesen: how many nodes are in a starling-x cluster?21:42
portdirecti wonder if a deployment and single replica may make more sense here?21:42
cfriesenwe have one or two controller nodes.  the single node case is easy.  the dual-node case is trickier because galera doesn't like that scenario.21:43
cfriesenif we have compute nodes we run a garbd instance on a compute node21:43
cfriesenbut the two-node-only case is tricky.  what we've ended up doing is running a separate script that will bootstrap the database on the "active" controller if things go south.21:44
alanmeadowsactually purging PVC data seems a bit drastic21:44
cfriesenalanmeadows: not purging pvc data, just configmap data21:45
alanmeadowsor are you saying just config state?21:45
alanmeadowsi see21:45
cfriesenportdirect: I tried a deployment with a single replica, but ran into problem if the node the replica was on died.  the pod would start up just fine on the running controller, but kubernetes woudln't let the new node access the PV until after a ~5min timeout21:46
alanmeadowsin the single replica case, it may just make sense to setup some sort of external reaper/watcher that can both potentially help unlock the PVC and expire the pod on the lost node; in the very specific two-node-only case the right solution may just be not to be using galera and create an Active-Active or Active-Standby mysql chart?21:57
alanmeadowsI mean sometimes galera is right, sometimes it isn't....21:58
cfriesenalanmeadows: are you aware of any mechanism to unlock the PVC to let it be accessed by another node?  I couldn't find one.22:13
cfrieseneven force-deleting the pod on the "dead" node didn't allow me to access the PV on the running node until the timeout was up22:15
cfriesenthe k8s folks seemed to think the only way to speed it up was to delete the kubernetes node and re-provision it once it came back up22:16
alanmeadowsin my experience this has been very storage backend specific22:25
cfriesenif we had a PV capable of multi-attach it'd be much simpler, but at the moment we don't22:26
alanmeadowse.g. in older versions of k8s on ceph based volumes, locks were created that you could clear (or had to if things got gummed up).  Now it checks for other in use "active" clients prior to allowing an attachment, which in theory should be relatively quick22:27
alanmeadowsother backends may have other mechanisms for allowing reattachment22:27
alanmeadowsor rather, early reattachment22:28
portdirectcfriesen: re read-write many PVC, you may want to try cephfs?22:29
cfriesenyeah, I think it's on the list to add eventually.  are you folks using it?22:29
portdirectNot atm, though its deployed and available with the ceph charts today.22:30
* alanmeadows contemplates whether your situation is improved with galera atop cephfs22:30
alanmeadows;-)22:30
portdirectThough I really wonder if Alan's suggestion of moving off galera is the right choice here22:31
portdirectAs trying to fit a system requiring a quorum of nodes into an even number is always gonna present challenges22:32
*** kranthikirang has quit IRC22:32
alanmeadowscertainly makes more sense in making a two node installation a first class citizen22:32
cfriesenI think if we had multi-attach PVs we could use a deployment instead of a statefulset and then force-kill the pod if we know the node it's on has gone down.22:32
portdirectOne thing that does trouble me here22:33
cfriesenI know, it's a hack. :)22:33
*** michaelbeaver has joined #openstack-helm22:33
portdirectIs the issue you are seeing occurring on more than just a single node reboot?22:33
portdirectSorry, on a single node reboot22:33
cfriesenthe specific scenario in question is if you have a power outage or something so all nodes are down.  then all the nodes come up except for one22:34
cfriesenwhat'll happen is that all the nodes sit there forever in /tmp/start.py22:35
*** michael-beaver has quit IRC22:35
*** michaelbeaver has quit IRC22:37
openstackgerritGeorg Kunz proposed openstack/openstack-helm-images master: Adding support for DPDK to openvswitch image  https://review.openstack.org/65015223:11
openstackgerritDmitrii Kabanov proposed openstack/openstack-helm-images master: [Ceph] Update Ceph repository and key  https://review.openstack.org/65102823:29
openstackgerritDmitrii Kabanov proposed openstack/openstack-helm-images master: [Ceph] Update Ceph repository and key  https://review.openstack.org/65102823:34
openstackgerritDmitrii Kabanov proposed openstack/openstack-helm-images master: [Ceph] Update Ceph repository and key  https://review.openstack.org/65103623:40

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!