08:00:10 #startmeeting vitrage 08:00:11 Meeting started Wed Jan 3 08:00:10 2018 UTC and is due to finish in 60 minutes. The chair is ifat_afek. Information about MeetBot at http://wiki.debian.org/MeetBot. 08:00:12 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 08:00:14 The meeting name has been set to 'vitrage' 08:00:15 Hi :-) 08:00:24 Hi :D 08:00:32 Hi o/ 08:01:05 \o/ 08:03:06 Hi 08:03:21 Agenda: 08:03:33 • Status and Updates 08:03:33 • Queens roadmap follow-up 08:03:34 • Open Discussion 08:03:39 #topic Status and Updates 08:03:49 One general update: Vitrage was added to kolla-ansible 08:03:54 #link https://review.openstack.org/#/c/432608/ 08:04:12 My update: I am still working on the execute-mistral refactoring. I pushed the change that supports different template validation and template loading per template versions. 08:04:19 #link https://review.openstack.org/#/c/529332 08:04:29 As part of this change, I changed the structure of the existing execute_mistral action. 08:04:45 Now I’m working on the last part that is needed to complete the integration with Mistral – I’m writing the code to add functions to the templates. 08:05:04 The first (and for now the only) function will be get_attr(entity, attr). For example: get_attr(instance, name). 08:05:14 That’s it for me 08:05:29 Who’s next? 08:06:11 My update 08:06:18 Still working on suspect alarm 08:06:20 #link https://review.openstack.org/#/c/519250/ 08:06:50 I realized that how to aggregate it with monitored alarm need to be clarified 08:07:10 Thanks to idan_hefetz reminding 08:07:41 I'm not quite familiar with the value normalization and aggregation in current code. 08:07:50 Looking into it and will update the blueprint soon. 08:08:11 Besides I added it to the topic for coming PTG 08:08:31 #link https://etherpad.openstack.org/p/vitrage-ptg-rocky 08:09:11 yujunz: Good idea. In general there are four states: 08:09:18 This part seems to be a major deviation from ZTE's use case 08:09:18 1. state - as appears in the datasource 08:09:31 2. vitrage state - the deduced state 08:09:56 3. aggregated state - combined from the datasource(s) and vitrage. Could be practically anything, depending on the datasource 08:10:33 4. operational (aka normalized) state - one of 4-5 predefined values, needed by the UI (to show red/yellow/green status) 08:10:42 But I’m not sure this is what you are talking about... 08:10:43 As far as I know, we prioritize the value by time, currently in vitrage, it is ordered by configured priority then create time. 08:11:07 Ok, so it is an interesting discussion 08:11:45 I'll discuss with the ZTE's project team to see how to resolve it. 08:11:48 The issue is the aggregated state. The operational state is a normalized version of the aggregated one, so I think this part can remain the same 08:12:13 Yes. Thank you for the inputs, ifat_afek 08:12:22 Ok, we’ll be happy to hear your ideas 08:12:39 yujunz: is your issue only with state or all the actions? 08:12:51 With raise_alarm action 08:13:06 I assume also with set state? 08:13:19 OK, i will try to think about it too :) 08:13:32 I think state is not used in our fault model at the moment 08:13:40 All the RCA is about alarms 08:13:41 Ok. 08:13:58 That's all from me 08:14:22 i will update o/ 08:14:46 update on graph snapshot aka(with its new name) graph persistor. 08:14:55 #link https://review.openstack.org/#/c/530650 08:15:10 It is a part from Vitrage HA and History Vision 08:15:19 #link https://docs.openstack.org/vitrage/latest/contributor/vitrage-ha-and-history-vision.html 08:15:49 I'm working now on patch 4 - considering and fixing all the comments. 08:16:07 Please review and ask everything that comes in your mind 08:17:06 I'm done updating 08:17:59 Thanks. Anyone else? 08:18:19 I'll update. 08:18:30 I am working on sending parsed alarms to the message bus last week. 08:18:38 Bp: #link https://review.openstack.org/#/c/486812/12/specs/queens/approved/snmp-parsing-service.rst 08:19:07 And now I am adding tempest tests. 08:19:31 But I'm confused that should datasources futher process parsed alarms, and which datasource will process them? 08:19:46 How are you adding the tempest tests? do you have a mock that sends snmp traps? 08:20:02 I understood that you have your own ZTE datasource, right? 08:20:50 Yes, I have a mock rhat sends snmp traps. 08:21:44 If graph has added an alarm vertex, the test is passed. 08:21:50 We add our own alarm datasource that process with snmp triggered traps, but community protogenetic alarm datasource does not deal with snmp triggered traps. 08:22:17 Should I add codes in the datasource Doctor to process with them? 08:22:44 That’s a good question… so you would like to have a tempest so you could know if something breaks, but your datasource is not upstream 08:23:26 Yes. 08:24:22 idan_hefetz changed the Doctor datasource so it can handle different types of alarms. Maybe you can use it. I’m not sure about it, and idan_hefetz is not here at the moment. But I think this should probably be the solution 08:25:04 Let us know if you need help with that 08:25:46 Ok. Then I will process with the snmp alarms in Doctor datasource. 08:25:50 And note that the tempest tests are just about to move to a new location: https://github.com/openstack/vitrage-tempest-plugin 08:26:05 Thanks o/ 08:26:17 That's all from me. 08:26:25 This was defined as a Queens community goal, and it will happen once the following change is approved: 08:26:34 #link https://review.openstack.org/528528 08:27:17 Also note that currently changes in Vitrage will trigger the tempests; but changes in vitrage-tempest-plugin will NOT trigger the tempests. This is another change that has to be done 08:27:57 And BTW, thanks yujunz for reminding me: you are all welcome to add ideas to the PTG etherpad: 08:28:05 #link https://etherpad.openstack.org/p/vitrage-ptg-rocky 08:28:20 Anyone else has updates? 08:29:46 #topic Queens roadmap follow-up 08:29:59 A reminder about the Queens release timeframe: 08:30:04 The release deadline for python-vitrageclient: January 25 08:30:14 This week is also declared as the “feature freeze”, although we are not committed to it (as we are following the cycle-with-intermediary release model). 08:30:19 The release candidate deadline is February 5-9 08:30:41 I updated some of the statuses on the etherpad with the roadmap: 08:30:48 #link https://etherpad.openstack.org/p/vitrage-ptg-queens 08:32:05 Important tasks that I believe will be finished by January 25: Mistral integration, configurable notifications, graph persistor, templates CRUD 08:32:23 That’s it for me 08:33:16 For the complete features related to proactive RCA, it seems not possible to catch up with the date. 08:33:30 But I do hope to complete suspect alarm in this release. 08:33:51 Ok. Will you be able to use it, if it’s only this part? 08:35:40 I would say not fully functional but viable. 08:35:47 Ok 08:37:09 #topic Open Discussion 08:37:22 Anything else we should discuss? 08:37:37 A bit curious on a patch under review 08:37:39 #link https://review.openstack.org/#/c/530629/ 08:37:53 What is the purpose of adding controller node? 08:38:29 Well - we are showing controller nodes in our Nokia solution, and Yuval Adar realized that Nova can actually give us this information. 08:39:17 The question is, actually, why filter it *out*. This is the current behavior, that only hosts are returned by the nova.host driver 08:39:38 And ideally, we would like Vitrage to have as much information as possible, right? 08:39:44 OK. I was thinking it is not a part of virtual infrastructure resource, but the manager. 08:40:22 Vitrage can be used to show resources of all layers - infra, physical, virtual, etc. 08:40:29 But this is the TripleO controller 08:41:23 OK. Understood. 08:42:11 Another question is about Rocky PTG time table. Is the time slot allocated? 08:42:35 I might be interested to join some discussion in other projects 08:42:40 All we have is the etherpad 08:42:42 Trying to make a time table 08:42:51 I still didn’t see an agenda of the PTG. Have you? 08:43:13 You mean from the organizer? 08:43:21 Yes 08:43:25 No 08:43:39 We should know, as a first stage, on which dates we have the room available 08:43:42 I am also wondering why there is so few updates from the PTG organizer... 08:43:53 But it might be a good time to start adding ideas to the etherpad 08:44:20 These two weeks everyone is on vacation… but I understand what you are talking about 08:44:31 There are few discussion in the mailing list. I don't know where to get more information. 08:44:40 Once I hear something I will notify everyone 08:44:49 About the PTG? 08:44:50 How was your last attendance to PTG? 08:44:54 in Denvor? 08:45:07 They sent a document with proposed agenda, and after a few weeks it was updated 08:45:44 The agenda was very high level: first two days were cross-project, and the next three days were for project specific discussions 08:46:00 On the project-specific, only rooms were allocated, and the agenda was managed by internal etherpads 08:46:28 Then the created an html page where the PTLs (or someone else) could send update whenever a new discussions was started 08:46:42 OK. Let's wait and see. Thank you for the information ifat_afek 08:46:55 I got the feeling that the agenda kept changing 08:47:29 It’s not as organized ahead as the big summits 08:47:41 Anyway, I’ll keep you updated once I know something 08:48:18 That would be good. Thank you 08:49:02 Any other issue? 08:49:48 No more from me 08:49:59 Ok 08:50:08 See you next week :-) 08:50:20 bye 08:50:32 bye 08:50:47 #endmeeting