15:00:39 #startmeeting monasca 15:00:43 Meeting started Wed Dec 19 15:00:39 2018 UTC and is due to finish in 60 minutes. The chair is witek. Information about MeetBot at http://wiki.debian.org/MeetBot. 15:00:44 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 15:00:46 The meeting name has been set to 'monasca' 15:01:36 hi joadavis 15:02:18 today the first time in #openstack-monasca 15:03:04 =-O 15:03:27 just posted a short reminder in the old channel 15:03:45 agenda for today: 15:03:49 https://etherpad.openstack.org/p/monasca-team-meeting-agenda 15:03:56 hey all 15:04:06 hi dougsz 15:04:20 #topic Vitrage integration 15:04:55 we had some discussion with Ifat and Joseph about the integration with Vitrage 15:05:17 and how the monitored resources (or alarms) should be identified in Vitrage 15:05:33 here the email list discussion: 15:05:35 http://lists.openstack.org/pipermail/openstack-discuss/2018-December/000806.html 15:06:37 I didn't reply to his last email, but have been trying to dig up any alarm details that may be helpful. I haven't explored the topic deeply before 15:06:53 I think what he proposed is on the right track 15:07:22 as I understand it, metrics from vms and metrics from hosts have different metric names, which may help 15:08:11 in the change from Bartosz (POC from Berlin) he uses custom dimensions: resource_type and resource_id 15:08:55 do you think we could use such schema for all metrics? 15:08:59 some metrics have the resource_id already, but I don't think any have type 15:09:31 I think you could infer the type from the metric itself 15:09:54 most of the checks have the possibility to configure own `custom` dimensions per check instance 15:10:11 but that's e.g. not the case for libvirt 15:11:14 Is there a resource_type in vitrage they are trying to match with? 15:11:45 let me check their docu 15:12:43 https://docs.openstack.org/vitrage/latest/contributor/vitrage-template-format.html 15:13:02 they define entity type, but I guess it's not what is meant 15:13:39 as resource_type I understand e.g: node, instance, disc, interface 15:14:13 that is the impression I got too. That it would be used to differentiate hosts from vms, etc 15:15:43 OK, and how should we treat e.g http_check? 15:15:57 that's pretty generic 15:16:27 true, good point. That is probably most generic 15:16:59 I think it's difficult to create a global mapping for metric of that type 15:18:00 similarly process 15:18:33 http_alive_status does have service field. For a vm it adds component and resource_id, but for a host those fields are missing 15:19:49 do you think we could add a resource_type to all metrics? it might be useful, just don't know how much work that would be 15:20:58 not sure how we would define a resource_id for a process, as it is not an OpenStack resource in all cases 15:21:32 Similar questions come up for aggregated metrics. If the metric used to trigger the alarm is aggregated across all VMs, then we don't have a resource_id 15:23:00 That might be a use case we should discuss with them 15:23:22 I have joined the self-healing meeting today morning (EU time) 15:23:38 and we had a longer discussion with Ifat 15:23:46 cool 15:24:08 she wants to propose a spec and start to define use cases we should cover 15:24:50 That sounds like a good idea. A starting set of use cases could keep us from digging too deep 15:25:06 there is another self-heeling meeting today in 2 hours, I guess 15:25:56 I have to think over this mapping thing, still don't have a clear picture on it 15:26:26 here the logs from today's meeting: 15:26:28 http://eavesdrop.openstack.org/meetings/self_healing/2018/self_healing.2018-12-19-09.02.log.html 15:26:43 thanks 15:27:29 If nothing comes up, I may join in at 9am PST 15:27:44 thanks, I'm not sure myself yet 15:27:53 should we move on? 15:28:05 Speaking of SIGs, have you seen anything about the auto-scaling sig starting meetings? 15:28:13 no 15:28:20 I think it has been quiet for a week 15:28:37 I don't think it needs meetings yet, but I would like to start contributing to their use cases too 15:29:16 yes, next topic. :) 15:29:23 yes, we could publish our documentation there 15:29:36 #topic spaces in dimension names 15:30:05 yes, I saw Matthias' comments on the bug this morning 15:30:27 I think it may be an acceptable workaround, but will check with the team and let support comment 15:30:46 so, the reported case is about deployment using collecting libvirt metrics 15:31:13 the dimensions zone and tenant_name are set by the plugin 15:31:20 and include spaces 15:31:53 are in general dimensions values with spaces something we should support? 15:33:00 fair question 15:33:37 looking at the designs and documentation, I think there was an assumption that fields like zone and tenant_name would be unique identifiers and not long descriptions 15:33:54 unique, human parsable identifiers. :) 15:34:11 apparently API accepts it and it's not explicitly forbidden in the API ref 15:34:40 I'm not sure how in this case someone set values iwth spaces without causing trouble at the cli or api, but you are right that it appears it accepted it 15:35:10 in general, it does seem like we want dimensions to be flexible and there may be other fields that we would want to support spaces in 15:35:58 I haven't looked in to the persister code for influxdb. How hard would it be to add escaping or something to support spaces? 15:35:58 could you check if Cassandra persister supports it? 15:36:15 I asked the team, but I don't think James is online yet. :) 15:36:35 InfluxDB has syntax for this, should not be very difficult 15:36:37 I'll follow up witg them today 15:37:03 the problem is more our matrix Python/Java + InfluxDB/Cassandra 15:37:12 thanks joadavis 15:37:14 yes, I was surprised it was a problem for influx 15:38:08 OK, let's follow up on this after holidays 15:38:17 will do 15:38:35 #topic events listener 15:39:26 dougsz has added small comments 15:39:28 Spec looks approved, though Doug had a couple good nits. I think we can merge and I'll do a quick follow-up to fix the nits 15:39:42 unless you want me to fix it first. :) 15:39:43 OK, let's do this 15:39:55 it's up to you :) 15:39:56 Sounds good to me 15:40:24 just gave W+1 15:40:31 thanks 15:40:39 :) 15:40:52 great job, thanks joadavis and Ashwin 15:41:08 yep, thanks guys, will be a really nice feature 15:41:57 thanks dougsz for constructive remarks and discussion 15:42:25 I certainly didn't do any of the hard work :) 15:42:38 I was finally watching the RH presentation on monitoring with Prometheus from Berlin summit. It is interesting when they come to some of the same conclusions, and have a similar event listener component. :) 15:42:56 :) 15:43:57 getting back to this presentation, I think we could also consider adding support for collectd 15:44:28 I've been pondering what a good solution for container monitoring will be. collectd may be a good choice 15:44:51 cadvisor is also a nice one for containers 15:44:52 I'm still slightly clueless though. :) 15:45:20 yes... was it ceilometer that was using cAdvisor? 15:45:51 hmm, pass 15:46:33 Merged openstack/monasca-specs master: Monasca Events Publishing Spec https://review.openstack.org/583803 15:46:40 witek: we would have a Monasca Agent plugin to harvest collectd metrics? 15:47:04 Or were you thinking about something that submits them directly to the API? 15:47:11 I rather thought about monasca plugin for collectd 15:47:27 Ah, I see. 15:47:42 I think it's main advantage is that it's lightweight and performant 15:48:12 I'm all for light weight. on of the reoccurring concerns when deploying monasca is that it is too heavy 15:48:31 And here we are talking about adding Events support to it :) 15:48:40 :) 15:49:10 At least it is modular though, so the collector can always be turned off in favour of collect or something. 15:49:29 The statsd part is actually very useful to us. 15:49:34 modular is a good thing 15:49:44 +1 15:51:04 if we don't have anything else for today, I would like to wish you Merry Christmas 15:51:18 and have a good start in the new year! 15:51:18 It may be worth defining a 'monasca light stack' for smaller deployments. 15:51:27 Merry Christmas! 15:51:53 Thanks, Merry Christmas to you all too 15:52:02 Hope you get some quality time off... 15:52:38 I'm looking forward to time with the kids 15:53:00 I need to line up some interactive science experiments with them. :) 15:54:04 tomorrow we have a new additional meeting in the morning 15:54:29 and then a break until Jan, 2nd 15:54:44 thank you guys and bye bye 15:55:06 #endmeeting