15:00:06 #startmeeting monasca 15:00:07 Meeting started Wed Sep 23 15:00:06 2015 UTC and is due to finish in 60 minutes. The chair is rhochmuth. Information about MeetBot at http://wiki.debian.org/MeetBot. 15:00:08 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 15:00:10 The meeting name has been set to 'monasca' 15:00:15 roll call 15:00:25 o/ 15:00:26 o/ 15:00:27 o/ 15:00:31 o/ 15:00:34 o/ 15:01:04 Agenda looks a little light today at, https://etherpad.openstack.org/p/monasca-team-meeting-agenda 15:01:17 1. Review Gerrit bugs 15:01:17 1. Need more folks contributing to reviews 15:01:17 2. Update on DevStack 15:01:17 3. Update on Tempest Tests 15:01:37 Are there other items to discuss 15:01:48 How about an update on performance from TWC 15:01:53 o/ 15:01:53 as well as Grafana 15:01:58 sure we can do that 15:02:27 I also have an update on Ceilosca 15:02:31 is anyone attending from fujitsu? 15:02:40 ok, let's talk about ceilosca too 15:02:48 can we add timing of next tagging of monasca-agent and pub to pypi? 15:02:49 yes, but I have to leave earlier 15:04:15 ok, we'll discuss agent tags too 15:04:15 #topic reviews 15:04:16 One area I want to get help on is more reviews 15:04:47 are people getting added to reviews, but not reviewing? 15:04:49 i think, i could help a bit 15:05:00 there are a bunch of reviews that I've +1'd, but have been just sitting for a couple of days 15:05:19 i would like to see the code flowing 15:05:22 twc can be better too 15:06:04 in the old days, i would just +2 things, but it align with the overall process better we need multiple companies one each review commit 15:06:24 agree 15:06:29 i think we also needed automated tests 15:06:43 ideally, a company shouldn't be +2 their own work, but we're not quite there yet 15:06:46 we just don't have enough tests to feel confident in submittals 15:06:50 but we have to move in that direction 15:06:51 ddieterly: +1 Every patch should have unit test 15:07:19 i'd also like to see integration tests 15:07:50 ddieterly: so the integratino tests is what i'm working on 15:07:54 right? 15:08:10 that is what tempest tests are for 15:08:10 yea, not sure how much coverage we have on those 15:08:51 well, i think coverage will be good at the API syntactic level 15:09:04 and basic functionality 15:09:14 do the tempest tests use a 'real' db? 15:09:21 yes, they do 15:09:26 tempest uses the API 15:09:34 they are written against the Monasca API 15:09:44 cool 15:10:02 so, getting back to reviews, just need more help in that area 15:10:28 witek, are all of tomasz's reviews then ready to get committed? 15:10:59 i will take a look at them again 15:11:04 he has about 5 or 6 reviews sittign in the queue, 15:11:05 thanks 15:11:25 sorry about the speed, we actually test each commit prior to +2 15:11:29 that is the theory 15:11:39 so the turn-aroudn time can be gated by testing 15:11:53 hopefully, when we have tempest tests up and running this will be automated 15:12:08 so, we'll have a little more confidence 15:12:15 +1 on that 15:12:25 rhochmuth: in the meantime can you please post a comment when the tests have been done and were successful? 15:12:29 are we going to require tempest tests for each patch? 15:12:30 #topic devstack 15:12:42 rhochmuth: I think this will give more confidence to the other reviewers that the patch works 15:12:42 sorry, i changed topics 15:12:54 np 15:12:58 #help Help is needed on reviews 15:13:20 ddieterly i think i need to get the first round of tempest tests done and integrated 15:13:40 then we can hopefully have a process where each feature has tests written againt it 15:13:49 so, i think the answer is yes 15:13:55 +1 15:14:03 +1 15:14:07 what is the eta on the tempest stuff? 15:14:31 i'm going to try and have some code up for review this week 15:14:42 i have tests for metrics and alarm definitions complete 15:14:49 will work through the rests of the API 15:14:59 actually, i don't think everyhting will be done this week 15:15:15 but it will be a good start and time to merge and get feedback 15:15:35 #topic devstack 15:15:57 how is devstack going? 15:15:59 ok, i'm in the process of converting all the ansible to bash in monasca-api/devstack 15:16:23 hope to have a complete monasca deployment in devstack in 2-3 weeks 15:16:27 sounds exciting 15:16:33 very 15:16:39 :-) 15:16:59 so, you've put some reviews up 15:17:10 yea, already have 15:17:31 can you post it please? 15:17:40 #fabiog #ddieterly was asking about using ansible 15:17:47 i said can't do that 15:18:00 what is the reason to convert ansible to bash ? 15:18:01 has to be independent of any deployment tool 15:18:14 https://review.openstack.org/#/q/owner:%22Deklan+Dieterly%22+status:merged,n,z 15:18:45 #Matthias_Christi i think the problem is they don't want Ansible in Devstack 15:18:51 Matthias_Christi: it is the devstack way 15:19:09 #fabiog is that your understanding 15:19:17 bash is the linqua franca of devstack 15:19:41 for better or for worse... 15:20:25 yea, c'est la geurre 15:20:25 rhochmuth: yep 15:20:46 so, basically 2-3 weeks on devstack, then we'll move onto ci/cd integration 15:20:58 ok, next topic? 15:21:08 hopefully 15:21:15 we discussed the tempest tests already 15:21:22 so i'll move to performance update 15:21:26 #performance 15:21:31 #topic performance 15:21:47 that's me 15:21:54 yes 15:22:10 was out last week, in a conf this week, so not much movement, but 15:22:20 vertica has agreed current perf is a bug 15:22:28 awesome! 15:22:35 no fix yet 15:22:44 do they have an eta 15:23:04 the bug is when 10 or so queries are in parallel 15:23:04 no eta yet, pushing them on that 15:23:23 what do you mean by 10 queries in parallel 15:23:35 asking them to back port to the current version (hot fix), instead of waiting for Q4 major release 15:23:46 yes, 10 queries in parellel goes into the toilet 15:24:08 that explains our puzzlement over the performance at least 15:24:08 omg 15:24:15 so, i'm waiting on other projection work, until that's fixed 15:24:27 (pre-join queries, schema change...) 15:24:32 your review is sitting up there, should we wait on that 15:24:40 bklei: wow! and no customers found that before ? 15:24:44 exactly ddieterly 15:24:56 that switches from inner joins to ... 15:25:02 apparently there is one other big customer. but boo on their testing 15:25:04 should we wait on that 15:25:05 i'm surprised that vertica had this bug 15:25:10 kinda serious 15:25:48 the vertica luster has been dulled a bit 15:25:58 also, would limiting the concurrency to <10 be a work-around 15:26:24 doesnt' sound like a good idea, but maybe it is faster 15:26:31 the bug# is VER-40005 15:26:45 "Time in plan is high when 10+ queries are run concurrently on 3 node cluster" 15:27:01 but the engineer thinks it's beyond just a 3 node cluster 15:27:23 anyhoo, not much more to say on that till i get an eta 15:27:32 will pressure them 15:27:51 do you want your review for inner join conversion to just wait then 15:28:09 let's wait, not sure it's necessary, may be working around their bug 15:28:25 ok 15:28:29 thx though 15:28:45 #bklei thanks for working through this 15:28:46 it wasn't a dramatically better improvement 15:28:49 sure 15:29:15 i think we should still make the change because it is much simpler to implement and reaon about 15:29:17 nice to know we know how to code and that the bug was in Vertica 15:29:39 we just need to make the change globally 15:30:37 i'm not opposed to the change #ddieterly, just want a bug fix in vertica before spending much time on tweaking stuff above it 15:30:48 yea, understood 15:31:01 so let's see how fast vertica can respond 15:31:14 and then we can measure again with fixes hopeflly 15:31:20 and then switch to new code 15:31:20 agreed, hope to have better info next week 15:31:31 ok, cool 15:31:32 yup 15:31:46 #topic grafana 15:32:02 rbak was adding support to Grafana 2.0 15:32:17 #bklei do you know how that is progressing? 15:32:19 rbak is sort of here, but in a conf with me -- he's made some progress 15:32:31 ok 15:32:43 more investigation, no eta or idea of the quantity of work yet. 15:32:54 ok 15:32:56 but it's sexy, and i'm confident he'll get it done 15:33:12 sexy 15:33:14 templates are awesome 15:33:15 bklei: really? 15:33:35 ok, we'll check back next week 15:33:39 very cool, and it doesn't sound too difficult to add us as a proper 'data source' 15:33:42 yup 15:34:01 #topic ceilosca 15:34:32 #fabiog you are up 15:35:01 fabiog: you there? 15:35:12 crickets 15:35:14 rhochmuth: sorry 15:35:36 ok, so we have started to do tests with Ceilosca 15:35:54 and what we found is that Influlx 0.9.1 suffers all sort of issues 15:36:06 it times out with the Persister 15:36:15 :( 15:36:17 and it prevents to store large amounts of data 15:36:29 but when we updated to 0.9.4.1 it worked really well 15:36:43 and we didn't have to change a single line of code or configuration 15:36:44 problem solved :-) 15:36:49 are we done 15:36:50 just switched the package 15:36:56 fabiog: did you have a chance to test the api end points with 9.4? 15:36:56 rhochmuth: kind of 15:37:15 i'm concerned that the api might be broken with 9.4 15:37:21 ddieterly: yes, because the ceilometer call in returns call the monasca API 15:37:25 so, there were changes i thoguht in .9.3 that were incompatible with .9.1 15:37:27 and we get data back 15:37:47 #bmotz was looking at this 15:37:48 we need to check all the functionality of the end points 15:38:01 so I would like to see the mini-mon installation to be upgraded to use 0.9.4.1 15:38:07 would that be possible? 15:38:15 fabiog: go for it 15:38:36 i don't have a problem updating 15:38:43 ddieterly: ok, so I will try to figure out what and where to change and I will submit a patch 15:38:44 just have to address any issues that occur 15:38:58 influxdb keeps changing their api, even though they are supposed to be frozen 15:39:00 it is a very simple change in the ansible vars 15:39:13 yeah, so the change in Ansible is trivial 15:39:20 the problem is finding all the problems 15:39:34 goes back to comprehensive integration tests 15:39:51 #bmotz isn't here today, but you might contact directly to see what he might have uncovered 15:40:02 ddieterly: right. In the ceilosca case seems to work fine, so if there are bugs are not major 15:40:08 he was also looking at swithching from http to binary protocol 15:40:12 ddieterly: meaning stop working 15:40:51 rhochmuth: ok, I will get in touch with bmotz 15:40:57 thx 15:41:09 does cceilosca exercise a good portion of the api? 15:41:11 if it is all working, then let's switch 15:41:24 we can also help to get some testing here 15:41:28 possibly 15:41:32 ddieterly: pretty much queries with dimensions 15:41:36 i'll need to coordinate with our test team 15:41:40 ddieterly: and post of new measurements 15:41:41 alarm history, metrics, measurements, statistics 15:41:59 ddieterly: no alarm history or anything related to alarms 15:42:03 the rest yes 15:42:13 merge-metrics flag, all parameters, etc 15:42:20 pagination 15:42:24 ddieterly: probably not all 15:42:29 oh, yea! 15:42:30 and definitely not pagination 15:42:39 how in the world did i forget that one 15:42:43 what, how can you forget paginatino 15:42:54 sorry, inside joke 15:43:57 #fabiog so i think plan to to upgrade 15:44:07 check with ben motz at cray 15:44:22 we'll need to get some testing done 15:44:33 to validate completely 15:44:35 rhochmuth: I will send an email in the mailing list asking him for issues 15:44:42 thx 15:44:52 but 0.9.4.1 is a huge step forward in performance 15:45:00 awesome! 15:45:04 +1 15:45:12 you might want to ask the influxdb guys if there is any compatibility issues that they know of 15:45:14 sounds like they are finally getting the streamign raft stuff done 15:45:34 #ddieterly can you send an email and cc folks 15:45:46 sure 15:45:49 thx 15:46:01 #fabiog wnat are you seeing in performance 15:46:08 with the latests influxdb 15:46:42 rhochmuth: I see that Ceilosca is around 30/40% faster 15:46:57 rhochmuth: and it seems that is getting better when the data set is bigger 15:47:03 which is what I was expecting 15:47:10 cool 15:47:18 right now we are doing a sort of worst case scenario test 15:47:42 like we are doing open ended query of get samples with 1M measurements 15:47:53 so we need to switch to CURL instead of cello client 15:48:00 because the client times out 15:48:14 but up to 750K the client worked 15:48:20 in both cases 15:48:30 and in both cases failed at 1M 15:48:37 how is ceilosca comparing to native ceilometer 15:48:48 with the latests influxdb 15:48:58 my preliminary estimates is that is between 30 to 40% faster 15:49:08 nice 15:49:23 but I think the real advantage will be in the amount of data we can store in Ceilosca compared to Ceilometer 15:49:34 i would like to understand the tests and how that maps onto the overall system too 15:49:38 I want to test with 10M measurements and see what happens 15:49:46 I think Ceilometer will fail 15:49:49 10M is small 15:50:16 right but we could not get Ceilo to work with more than 6M in the past 15:50:23 ohhh 15:50:44 let's see 15:50:59 the other thing I am not so sure is that Rally is really representative of the tests 15:51:07 I had strange results with it 15:51:22 so maybe we will use 10 runs using curl instead 15:51:36 it will be definitely more work but the results are more realistic 15:51:49 it will be undoubtedly the time a client will take to receive the data 15:52:26 bklei: how much data you have in Monasca in prod? 15:52:40 bklei: just to have an idea of a real prod env 15:53:06 so I can try to emulate that 15:53:12 we've got around maybe 5 months of data, in vertica terms, getting close to 1TB 15:53:33 bklei: but in terms of measurements per metric 15:53:41 is 1M, 10M, 100K ... 15:54:04 i can get that to you, don't know off the top of my head 15:54:20 bklei: I would appreciate that 15:54:28 thanks 15:54:35 sure, will get u some #'s later today 15:54:39 np 15:54:42 ok, 5 minutes left 15:54:54 should we continue with ceilosca or any other topics, or close 15:55:02 I am done unless there are questions 15:55:23 i just want to plug a couple of libvirt plugin patches 15:55:26 no further questions your honor 15:55:32 rhochmuth: we need to start talking about summit topics and sessions 15:55:56 agenda for next week, I guess 15:55:58 we don't have any summit session 15:56:13 i think it will be hard to get, as they are over-booked and we aren't a openstack project yet 15:56:24 ok 15:56:32 but, we could reach out to other teams on various topics 15:56:56 yes, I want to talk to the Congress guys 15:57:02 there was some areas with the ceilometer team, such as alarming and componentization 15:57:12 Yes, Congress was the other one that comes to mind 15:57:27 There has been work in adding autoc-scaling to heat for MOnasca 15:57:37 I'll be getting an update on that tomorrow 15:57:55 Do you want me to contact Congress? 15:58:20 rhochmuth: sure I'll do that 15:58:28 MaaS is our big priority at TWC, and so we're very focused on libvirt plugin -- would like to see https://review.openstack.org/#/c/225337/ and https://review.openstack.org/#/c/226405/ in soon and the agent tagged... 15:58:55 #bklei ok, i've given a +1 to one of those i believe 15:59:01 if others can review that would be great 15:59:05 por favor 15:59:07 as soon as merged i can tag 15:59:16 +1!! 15:59:17 thx 15:59:27 i was hoping david schroeder at hp could take a look 15:59:32 not sure he was around this week 15:59:35 me too, it's his baby 15:59:36 but will try and ping him 15:59:53 thx 16:00:02 ok, coming to an end 16:00:05 thanks everyone 16:00:17 ciao! 16:00:18 thx #rhochmuth 16:00:26 #endmeeting