Wednesday, 2015-11-18

*** slogan621 has joined #openstack-telemetry00:00
*** llu has quit IRC00:01
*** jaypipes has quit IRC00:04
*** rbak has quit IRC00:04
*** ddieterly has joined #openstack-telemetry00:22
*** terriyu has quit IRC00:38
*** thorst has joined #openstack-telemetry00:44
*** thorst has quit IRC00:51
*** thorst has joined #openstack-telemetry00:51
*** thorst has quit IRC00:56
*** liusheng has joined #openstack-telemetry01:03
*** thumpba has joined #openstack-telemetry01:06
*** ljxiash has joined #openstack-telemetry01:06
*** thumpba has quit IRC01:12
*** ildikov has quit IRC01:15
*** ildikov has joined #openstack-telemetry01:16
*** thorst has joined #openstack-telemetry01:23
*** thorst has quit IRC01:23
*** thorst has joined #openstack-telemetry01:23
*** thorst has quit IRC01:24
*** thorst has joined #openstack-telemetry01:25
*** thorst has quit IRC01:29
*** jaypipes has joined #openstack-telemetry01:33
*** slogan621 has quit IRC01:37
*** pradk has quit IRC01:38
*** boris-42 has joined #openstack-telemetry01:50
*** Liuqing has joined #openstack-telemetry01:58
openstackgerritRohit Jaiswal proposed openstack/ceilometer: Consistent publisher_id and event_type for polling and api  https://review.openstack.org/24034202:00
*** prashantD has quit IRC02:01
*** jwcroppe has quit IRC02:21
*** chaozhechen_ has joined #openstack-telemetry02:29
*** CheneyChen has joined #openstack-telemetry02:29
*** fawadkhaliq has joined #openstack-telemetry02:42
openstackgerritliusheng proposed openstack/aodh: Don't send notificaton when recording alarm change  https://review.openstack.org/24672702:45
*** ddieterly has quit IRC02:46
*** Liuqing has quit IRC02:55
*** Liuqing has joined #openstack-telemetry02:55
*** thorst has joined #openstack-telemetry02:57
*** thorst has quit IRC02:57
*** ddieterly has joined #openstack-telemetry03:05
*** ddieterl_ has joined #openstack-telemetry03:06
*** ddieterly has quit IRC03:10
*** prashantD has joined #openstack-telemetry03:26
*** lvdongbing has joined #openstack-telemetry03:27
lvdongbingLianhao Lu, are you there?03:31
*** liusheng has quit IRC03:35
*** ViswaV has quit IRC03:45
openstackgerritliusheng proposed openstack/aodh: Don't send notificaton when recording alarm change  https://review.openstack.org/24672703:47
*** liusheng has joined #openstack-telemetry03:48
*** ViswaV has joined #openstack-telemetry03:49
*** prashantD_ has joined #openstack-telemetry03:55
*** prashantD has quit IRC03:56
*** lvdongbing has quit IRC04:04
*** lvdongbing has joined #openstack-telemetry04:04
*** fawadkhaliq has quit IRC04:06
*** khushbu has joined #openstack-telemetry04:07
*** khushbu has quit IRC04:08
*** ddieterl_ has quit IRC04:11
*** hparekh2 has quit IRC04:20
*** fawadkhaliq has joined #openstack-telemetry04:27
*** hparekh has joined #openstack-telemetry04:27
openstackgerritliusheng proposed openstack/ceilometer: Move the content of ReleaseNotes to README.rst  https://review.openstack.org/24674304:52
*** jwcroppe has joined #openstack-telemetry05:03
openstackgerritliusheng proposed openstack/ceilometer: Fix a indent nit of enforce_limit method  https://review.openstack.org/24674405:04
openstackgerritliusheng proposed openstack/ceilometer: Fix an indent nit of enforce_limit method  https://review.openstack.org/24674405:05
openstackgerrityuntongjin proposed openstack/ceilometer-specs: event to sample publisher  https://review.openstack.org/22392605:05
*** ddieterly has joined #openstack-telemetry05:11
*** ddieterly has quit IRC05:17
*** prashantD_ has quit IRC05:19
*** jaypipes has quit IRC05:20
*** khushbu_ has joined #openstack-telemetry05:23
*** khushbu_ has quit IRC05:24
*** khushbu_ has joined #openstack-telemetry05:27
*** khushbu_ has quit IRC05:34
*** khushbu_ has joined #openstack-telemetry05:39
*** khushbu_ has quit IRC05:46
*** jwcroppe has quit IRC05:50
*** jwcroppe has joined #openstack-telemetry05:50
*** jwcroppe has quit IRC05:54
*** nadya_ has joined #openstack-telemetry06:01
*** Liuqing has quit IRC06:10
*** Liuqing has joined #openstack-telemetry06:11
*** ddieterly has joined #openstack-telemetry06:13
*** nadya_ has quit IRC06:18
*** ddieterly has quit IRC06:19
*** changbl has quit IRC06:22
*** changbl has joined #openstack-telemetry06:23
*** liusheng has quit IRC06:36
*** rcernin has joined #openstack-telemetry06:37
*** liusheng has joined #openstack-telemetry06:37
*** Liuqing has quit IRC07:05
*** Liuqing has joined #openstack-telemetry07:06
*** ddieterly has joined #openstack-telemetry07:16
*** ddieterly has quit IRC07:20
*** eglynn has quit IRC07:32
*** hparekh has quit IRC07:33
openstackgerritliusheng proposed openstack/ceilometer: Move the content of ReleaseNotes to README.rst  https://review.openstack.org/24674307:34
*** hparekh has joined #openstack-telemetry07:40
*** belmoreira has joined #openstack-telemetry07:57
*** Ala has joined #openstack-telemetry07:58
*** liusheng has quit IRC07:59
*** liusheng has joined #openstack-telemetry08:00
*** eglynn has joined #openstack-telemetry08:12
*** ddieterly has joined #openstack-telemetry08:17
*** safchain has joined #openstack-telemetry08:20
*** shardy has joined #openstack-telemetry08:21
*** ddieterly has quit IRC08:21
*** Liuqing has quit IRC08:23
*** Liuqing has joined #openstack-telemetry08:24
*** fawadkhaliq has quit IRC08:41
*** fawadkhaliq has joined #openstack-telemetry08:42
openstackgerritMehdi Abaakouk (sileht) proposed openstack/python-gnocchiclient: resouce id must be url quoted  https://review.openstack.org/24680208:52
*** Liuqing has quit IRC08:53
*** Liuqing has joined #openstack-telemetry08:54
*** fawadkhaliq has quit IRC08:57
openstackgerritBéla Vancsics proposed openstack/ceilometer: Reduced source code by extracting duplicated code  https://review.openstack.org/23202009:00
openstackgerritJulien Danjou proposed openstack/gnocchi: carbonara: add a __repr__ for AggregatedTimeSerie  https://review.openstack.org/24507609:04
openstackgerritJulien Danjou proposed openstack/gnocchi: carbonara: implement an integer sampling attribute  https://review.openstack.org/24507509:04
openstackgerritJulien Danjou proposed openstack/gnocchi: carbonara: make offset conversion consistent  https://review.openstack.org/24507409:04
openstackgerritJulien Danjou proposed openstack/gnocchi: archive_policy: enforce types  https://review.openstack.org/24507309:04
openstackgerritJulien Danjou proposed openstack/gnocchi: _carbonara: dedicated methods to store raw timeserie  https://review.openstack.org/24507209:04
openstackgerritJulien Danjou proposed openstack/gnocchi: storage: support storage upgrade  https://review.openstack.org/24507009:04
openstackgerritJulien Danjou proposed openstack/gnocchi: cli: allow to upgrade in 2 passes  https://review.openstack.org/24507109:04
openstackgerritJulien Danjou proposed openstack/gnocchi: carbonara: deprecate TimeSerieArchive  https://review.openstack.org/24090509:04
openstackgerritJulien Danjou proposed openstack/gnocchi: Rename dbsync to upgrade  https://review.openstack.org/24506909:04
openstackgerritJulien Danjou proposed openstack/gnocchi: Add missing PrettyTable dependency  https://review.openstack.org/24680709:04
*** yassine__ has joined #openstack-telemetry09:12
*** nadya_ has joined #openstack-telemetry09:15
*** ddieterly has joined #openstack-telemetry09:17
*** ddieterly has quit IRC09:22
*** Liuqing has quit IRC09:32
*** lvdongbing has quit IRC09:36
*** ljxiash has quit IRC09:50
*** r-mibu has left #openstack-telemetry09:59
*** Liuqing has joined #openstack-telemetry10:04
*** eglynn has quit IRC10:15
*** ddieterly has joined #openstack-telemetry10:18
*** ddieterly has quit IRC10:22
*** jwcroppe has joined #openstack-telemetry10:30
*** jwcroppe has quit IRC10:34
*** fawadkhaliq has joined #openstack-telemetry10:36
*** ildikov has quit IRC11:05
*** exploreshaifali has joined #openstack-telemetry11:05
openstackgerritJulien Danjou proposed openstack/gnocchi: carbonara: add a __repr__ for AggregatedTimeSerie  https://review.openstack.org/24507611:13
openstackgerritJulien Danjou proposed openstack/gnocchi: carbonara: implement an integer sampling attribute  https://review.openstack.org/24507511:13
*** prashantD has joined #openstack-telemetry11:17
*** prashantD has quit IRC11:21
openstackgerritMehdi Abaakouk (sileht) proposed openstack/ceilometer: gnocchi: use gnocchiclient instead of requests  https://review.openstack.org/23753811:38
openstackgerritMehdi Abaakouk (sileht) proposed openstack/ceilometer: Use keystoneauth1 instead of manual setup  https://review.openstack.org/23753711:38
*** sergio__nubeliu has joined #openstack-telemetry11:43
*** exploreshaifali has quit IRC11:49
*** exploreshaifali has joined #openstack-telemetry11:53
*** ildikov has joined #openstack-telemetry11:54
*** exploreshaifali has quit IRC12:03
*** khushbu_ has joined #openstack-telemetry12:05
*** khushbu_ has quit IRC12:05
*** khushbu_ has joined #openstack-telemetry12:09
*** khushbu_ has quit IRC12:10
openstackgerritJulien Danjou proposed openstack/gnocchi: carbonara: add a __repr__ for AggregatedTimeSerie  https://review.openstack.org/24507612:15
openstackgerritJulien Danjou proposed openstack/gnocchi: carbonara: deprecate TimeSerieArchive  https://review.openstack.org/24090512:15
*** khushbu has joined #openstack-telemetry12:16
*** khushbu has quit IRC12:16
openstackgerritJulien Danjou proposed openstack/gnocchi: carbonara: deprecate TimeSerieArchive  https://review.openstack.org/24090512:16
*** khushbu_ has joined #openstack-telemetry12:19
*** ddieterly has joined #openstack-telemetry12:19
*** khushbu_ has quit IRC12:20
*** wolsen has quit IRC12:20
*** alejandrito has joined #openstack-telemetry12:23
*** ddieterly has quit IRC12:24
*** ddieterly has joined #openstack-telemetry12:24
*** jkraj has joined #openstack-telemetry12:28
*** thorst has joined #openstack-telemetry12:28
*** khushbu has joined #openstack-telemetry12:33
*** khushbu has quit IRC12:33
alejandritojd__, why is it that when querying the measures for a metric that has this archive policy (http://pastebin.com/7ff9e0Bm) every new day, i see the points from 00:00 hs of today and not those of the previous day/s ?12:36
alejandritojd__, are we interpreting something wrong ? maybe about definition of the policy?12:37
jd__alejandrito: can you paste me what you get and what you would expect?12:37
alejandritojd__, sure12:39
*** fawadkhaliq has quit IRC12:42
alejandritojd__, http://pastebin.com/f6bWma8A i see the measures from today since 00:00hs here is a metric show output ( http://pastebin.com/FNVP0tUm )12:49
jd__alejandrito: can you show me the details of the metric ? like its archive policy12:54
jd__your archive policy is pretty weird too :)12:58
alejandritojd__, it's meant for showback purposes :) this one is no good ? http://pastebin.com/FNVP0tUm13:01
jd__you keep some datapoints for 60 years13:01
alejandritojd__, yup :D its like saying ... FOR EVER13:04
jd__alejandrito: I'm still waiting for the metric details, can you show me? :)13:05
alejandritojd__, sorry this one ? http://pastebin.com/FNVP0tUm13:06
*** gordc has joined #openstack-telemetry13:07
*** gordc_ has joined #openstack-telemetry13:09
*** jwcroppe has joined #openstack-telemetry13:10
*** gordc has quit IRC13:13
*** CheneyChen has quit IRC13:13
*** chaozhechen_ has quit IRC13:13
jd__ah yeah13:14
jd__thanks13:14
jd__so yeah looks like you miss points13:14
*** jwcroppe has quit IRC13:15
*** jwcroppe has joined #openstack-telemetry13:15
*** jwcroppe_ has joined #openstack-telemetry13:17
*** jwcroppe has quit IRC13:20
sergio__nubeliujd__: yes but there is a pattern, in all cases we miss points from the day before and older13:24
*** prashantD has joined #openstack-telemetry13:25
*** ljxiash has joined #openstack-telemetry13:28
*** prashantD has quit IRC13:29
*** prashantD has joined #openstack-telemetry13:30
*** Liuqing_ has joined #openstack-telemetry13:31
*** prashantD has quit IRC13:31
*** tomoiaga has joined #openstack-telemetry13:31
jd__sergio__nubeliu: what do you mean?13:31
jd__ah right13:32
jd__yeah something is weird13:32
*** Liuqing has quit IRC13:33
*** gordc_ has quit IRC13:34
jd__sergio__nubeliu: alejandrito if you can send me the Carbonara file stored in Ceph that'd probably help me13:39
jd__maybe it's normal, maybe not but I'm confused13:40
openstackgerritRohit Jaiswal proposed openstack/ceilometer: Consistent publisher_id and event_type for polling and api  https://review.openstack.org/24034213:42
*** kbyrne has joined #openstack-telemetry13:43
sergio__nubeliujd__: alejandrito will send you the file in a few minutes13:44
*** chaozhechen_ has joined #openstack-telemetry13:49
*** ddieterly has quit IRC13:54
*** julim has joined #openstack-telemetry13:55
*** dan-t has joined #openstack-telemetry13:56
*** changbl has quit IRC14:01
*** exploreshaifali has joined #openstack-telemetry14:04
*** bapalm has joined #openstack-telemetry14:08
*** boris-42 has quit IRC14:08
*** jaypipes has joined #openstack-telemetry14:23
alejandritojd__, im back14:23
*** lsmola has quit IRC14:26
*** ddieterly has joined #openstack-telemetry14:26
*** ddieterly has quit IRC14:26
*** ddieterly has joined #openstack-telemetry14:27
alejandritojd__, before going into the ceph carbonara content, im having metricd giving this : http://pastebin.com/cCR0xwBN14:27
alejandritojd__, can it have something to do with "missing" data ?14:27
jd__alejandrito: that explains probably everything indeed14:28
jd__since it creates an empty timeserie each time14:28
jd__that explains why you're losing metrics14:29
*** lsmola has joined #openstack-telemetry14:29
jd__alejandrito: I imagine you see no reason for that data corruption to happen on your side? Ceph is OK?14:29
jd__sileht: you have any idea why so many read errors would happen on Ceph?14:30
silehtjd__, not really14:30
*** lsmola has quit IRC14:31
alejandritojd__, the first message happened today at 00:48 im looking at ceph right now.14:31
jd__ok14:31
jd__keep me in touch14:31
*** openstackgerrit has quit IRC14:31
alejandritojd__, sileht to see if i see something ... because the measure_ are still being created14:32
alejandritojd__, we have no error on Ceph and the health output is OK14:32
*** openstackgerrit has joined #openstack-telemetry14:32
jd__alejandrito: you got no write issues before 00:48?14:32
alejandritojd__, nope ... and our developers told us that they saw same behaviour past week, but ceph has a 180 days uptime with no errors whatsoever14:33
jd__alejandrito: ok,I'm gonna try to write a patch to grab more info on this14:35
*** jkraj has quit IRC14:37
*** ljxiash has quit IRC14:39
*** ljxiash has joined #openstack-telemetry14:41
*** bapalm has quit IRC14:43
jd__alejandrito: what coordination_url do you use?14:43
*** bapalm has joined #openstack-telemetry14:43
jd__alejandrito: how many computers are running gnocchi-metricd?14:43
*** ddieterly has quit IRC14:44
*** lsmola has joined #openstack-telemetry14:44
alejandritojd__, wow ... the critical thing about this ... is that the corrupted messages keep appearing and all my metrics / measures are disappearing :O , i have just one metricd vm running ... let me double check the coordination url14:44
*** fawadkhaliq has joined #openstack-telemetry14:44
*** lsmola has quit IRC14:45
*** lsmola has joined #openstack-telemetry14:45
*** lsmola has quit IRC14:45
alejandritojd__, coordination_url = file:///var/lib/gnocchi/locks14:45
*** lsmola has joined #openstack-telemetry14:46
jd__alejandrito: ok and how many servers run metricd?14:46
alejandritojd__, just one with one worker14:46
jd__alejandrito: ok thanks!14:47
openstackgerritMehdi Abaakouk (sileht) proposed openstack/ceilometer: gnocchi: use gnocchiclient instead of requests  https://review.openstack.org/23753814:53
openstackgerritMehdi Abaakouk (sileht) proposed openstack/ceilometer: Use keystoneauth1 instead of manual setup  https://review.openstack.org/23753714:53
*** ddieterly has joined #openstack-telemetry14:57
alejandritojd__, sileht well ... i can confirm that the data corruption messages keep appearing and all the measures are gone for all the metrics14:58
alejandritojd__, sileht they appear in this manner http://pastebin.com/LXqqPQS014:59
jd__yeah makes sense, if I can say so14:59
*** rbak has joined #openstack-telemetry15:01
*** ddieterly has quit IRC15:01
*** ddieterly has joined #openstack-telemetry15:01
alejandritojd__, im just keeping everything in this state ( holding the developers ) just for you to debug something here if you want to take advantage of this "corrupted" env15:05
jd__alejandrito: yeah thanks, I'm trying to write a patch with more debug stuff15:06
alejandritojd__, one side question in what gnocchi resource type can i put all these samples ? ( we kept all of these in ceilometer ) http://pastebin.com/j0MqXjUW15:08
*** llu-laptop has joined #openstack-telemetry15:09
*** llu-laptop is now known as llu15:10
*** Liuqing_ has quit IRC15:11
jd__alejandrito: generic?15:12
jd__not sure we have anything for that yet15:12
*** exploreshaifali has quit IRC15:12
alejandritojd__, oka15:13
alejandritojd__, one resource type could be "hypervisor" for example15:13
alejandritojd__, waiting for your patch to debug ^_^15:13
*** larainema has quit IRC15:16
*** larainema has joined #openstack-telemetry15:18
openstackgerritLianhao Lu proposed openstack/ceilometer: Fix an indent nit of enforce_limit method  https://review.openstack.org/24674415:21
jd__alejandrito: can you try this http://paste.openstack.org/show/479265/ ? it's a simple approach that will first tell us what's read15:21
jd__and why it's corrupted15:21
jd__I'd like to see if it's a totally empty file, or a partial file for example15:21
alejandritojd__, trying RIGHT NOW !15:22
jd__ok thanks15:22
jd__I'm gonna check librados doc at the same time as I'm not really familiar with it, to see if I can grab extra info15:22
jd__alejandrito: just restart metricd after applying the patch, that's the only one affected here15:23
alejandritojd__, oka perfect, can i ask how to apply the patch ?15:23
* alejandrito :$15:23
jd__alejandrito: go to the source dir of Gnocchi and type: patch -p1 -i yourpatch15:24
jd__it should apply it seamlessly :)15:24
jd__you can download the raw patch here http://paste.openstack.org/raw/479265/15:24
alejandritojd__, perfect, yourpatch is the filename ? and then python setup.py install ?15:25
jd__yes15:26
*** chaozhechen_ has quit IRC15:27
*** pradk has joined #openstack-telemetry15:28
*** yprokule has joined #openstack-telemetry15:28
openstackgerritZi Lian Ji proposed openstack/ceilometer-specs: Enable LBaaS V2 for Ceilometer  https://review.openstack.org/24413915:28
*** ildikov has quit IRC15:28
*** yprokule has quit IRC15:30
*** yprokule has joined #openstack-telemetry15:30
*** yprokule has quit IRC15:32
*** yprokule has joined #openstack-telemetry15:32
alejandritojd__, ok, running15:33
alejandritojd__, want me to pastebin you what i see ? or want the whole file by mail ?15:33
jd__alejandrito: just paste me what you see15:33
alejandritojd__, i'll just paste when the data corruption message appears15:34
jd__alejandrito: 👍15:34
alejandritojd__, http://pastebin.com/LV76HUmR15:37
jd__lol damn it15:37
jd__alejandrito: I'm gonna fix my debug patch now :)15:39
jd__Python and bytes…15:39
alejandritojd__, hahahaha ! great15:39
jd__alejandrito: http://paste.openstack.org/show/479271/15:42
openstackgerritJulien Danjou proposed openstack/gnocchi: carbonara: deprecate TimeSerieArchive  https://review.openstack.org/24090515:50
alejandritojd__, testing15:53
*** annasort_ has quit IRC15:54
alejandritojd__,  should i apply this one over the already patched ? or over the original one ? because its giving me this : http://pastebin.com/0Mb5LvTJ15:55
jd__alejandrito: no revert the original one15:56
jd__git checkout -f15:56
jd__that should remove the previous patch15:56
alejandritojd__, great15:56
alejandritojd__, patched, restarting everything15:56
*** pradk_ has joined #openstack-telemetry15:58
alejandritojd__, running ... haven't hit a corrupted message yet16:00
*** liamji has joined #openstack-telemetry16:06
alejandritojd__, http://pastebin.com/8RkKwhQQ16:07
*** tomoiaga has quit IRC16:08
*** rcernin has quit IRC16:09
jd__alejandrito: ok http://paste.openstack.org/show/479279/ this is a simpler version with only the len of the content16:11
jd__alejandrito: just send me the corrupted file then if you can16:11
alejandritojd__,  that log will give me a ceph corrupted filename , so i can send you that file via email ? is that it ?16:12
*** rickyrem has joined #openstack-telemetry16:13
*** jaypipes has quit IRC16:13
jd__alejandrito: no, do you need the filename?16:13
jd__this will just give me the size of the corrupted file16:13
alejandritojd__, hmmmm ... then i didnt get what i need to do based on that patch, sorry J :(16:14
jd__alejandrito: the 3rd one?16:14
alejandritojd__, yeahp16:14
jd__yeah it does not print out any file name16:14
jd__just the length of the corrupted data16:14
jd__the filename should be pretty easy it has the metric id in it16:14
alejandritojd__, oka doing it16:15
silehtjd__, alejandrito the object name in rados is 'gnocchi_<metric_id>_<aggregation_method>'16:15
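A minimal Python sketch of the naming pattern sileht quotes here. The helper name `rados_object_name` is hypothetical and not part of Gnocchi; only the `gnocchi_<metric_id>_<aggregation_method>` pattern comes from the discussion, and the sample metric id is the one used later in this log.

```python
def rados_object_name(metric_id: str, aggregation_method: str) -> str:
    """Build an object name following the pattern quoted in the channel.

    Hypothetical helper for illustration only, not a Gnocchi function.
    """
    return f"gnocchi_{metric_id}_{aggregation_method}"


name = rados_object_name("a2beb7f1-65c1-473c-8f6d-fc912452a6bb", "mean")
print(name)
```

The resulting string is what would be passed as the object name to a command such as `rados -p gnocchi get`.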
jd__hm except that I think metricd will replace it by a non corrupted version16:16
silehttrue16:16
jd__alejandrito: wait a sec, I'll update the patch to remove the creation of a new file when corrupted data16:16
alejandritojd__, great so i can send you the original file16:16
*** rickyrem1 has joined #openstack-telemetry16:16
alejandritojd__, WAITING16:17
jd__exactly16:17
*** rickyrem has quit IRC16:17
jd__alejandrito: http://paste.openstack.org/show/479281/ should do it16:17
alejandritojd__, applying16:17
*** wolsen has joined #openstack-telemetry16:19
*** edmondsw has quit IRC16:19
*** Ephur has joined #openstack-telemetry16:20
*** fawadkhaliq has quit IRC16:20
*** jfluhmann has joined #openstack-telemetry16:22
alejandritojd__, http://pastebin.com/YsdvFsT616:23
*** jwcroppe_ has quit IRC16:24
jd__alejandrito: hum you're sure you've applied it and reinstalled?16:24
jd__looks like the old error16:24
*** belmoreira has quit IRC16:24
alejandritojd__, oh good ... i hate myself ... sorry J16:24
jd__I know I'm bad but I hope not that bad16:24
jd__:D16:24
alejandritojd__,  AJJAJAAJAJAJAJA my bad16:25
*** exploreshaifali has joined #openstack-telemetry16:25
alejandritojd__, running16:25
openstackgerritBéla Vancsics proposed openstack/ceilometer: Reduced the complexity of the send method  https://review.openstack.org/24701516:26
alejandritojd__, http://paste.openstack.org/show/479288/ getting rados object ... via email ?16:28
*** edmondsw has joined #openstack-telemetry16:28
jd__alejandrito: is it big?16:28
jd__email should be ok I don't think it's that big16:28
alejandritojd__, let me check16:29
jd__16384 is not big16:29
jd__but it is really suspicious indeed16:29
jd__cc sileht16:29
jd__is it a block size or something sileht ?16:29
silehtjd__, from the rados API point of view we didn't deal with block size16:31
jd__hmhm16:31
alejandritojd__, rados -p gnocchi get gnocchi_a2beb7f1-65c1-473c-8f6d-fc912452a6bb_mean16:32
alejandritojd__, 18K email ?16:32
jd__alejandrito: awesome16:32
jd__yup yup16:32
alejandritojd__, sending16:32
jd__i'll decode msgpack manually so fun16:32
alejandritojd__, hahahahaah ! if it is to know the root cause  ... hope you get a good one !16:33
alejandritojd__,  emailing16:33
openstackgerritBéla Vancsics proposed openstack/ceilometer: Reduced complexity of get_meter_statistics method  https://review.openstack.org/24702116:34
silehtjd__, bs is 819216:34
silehtjd__, I was wrong, we deal with the block size in the driver16:34
jd__sileht: with the offset in reading?16:35
silehtjd__, yes16:35
silehtjd__, I guess the culprit is around this piece of code16:35
alejandritojd__, sileht  email sent16:36
jd__sileht: though alejandrito just said the result of getting with rados -p is 18K16:36
jd__so not sure Gnocchi is wrong16:36
alejandritoyeahp ... maybe because the file just got data after being recreated because of the previous corruption16:37
jd__sileht: -rw-------  1 jd  staff   18168 Nov 18 17:37 gnocchi_corrupted_ceph16:38
jd__the file is not corrupted it's brand new16:38
jd__but it is 18168 not 1638416:38
jd__so yeah you're right we're misreading it for whatever reason16:38
openstackgerritPradeep Kilambi proposed openstack/gnocchi: Ensure file basepath exists  https://review.openstack.org/24534816:39
openstackgerritBéla Vancsics proposed openstack/ceilometer: Reduced the complexity of the get_events method  https://review.openstack.org/24702916:46
*** jwcroppe has joined #openstack-telemetry16:46
openstackgerritZi Lian Ji proposed openstack/ceilometer-specs: Enable LBaaS V2 for Ceilometer  https://review.openstack.org/24413916:47
openstackgerritMehdi Abaakouk (sileht) proposed openstack/gnocchi: ceph: fix computation of read offset  https://review.openstack.org/24703116:48
silehtjd__, WDYT ? https://review.openstack.org/24703116:48
jd__sileht: makes no sense to me16:50
silehtjd__, I will write a test16:50
jd__sileht: even read(object_name, offset=0) should read the whole file16:50
*** liamji has quit IRC16:50
jd__not only 16k16:50
jd__IIUC http://docs.ceph.com/docs/v0.94/rados/api/python/16:50
silehtjd__, read uses an 8k buffer16:51
jd__ah yeah16:51
jd__I thought the C API had no length16:51
*** elemoine has joined #openstack-telemetry16:52
silehtI was wrong it has a length but we use the default16:52
jd__so 8k?16:52
silehtyes16:53
jd__yeah your patch seems right now that I think about it16:53
jd__it's late -_-16:53
jd__sileht: alejandrito can try it16:53
*** jaypipes has joined #openstack-telemetry16:53
*** fawadkhaliq has joined #openstack-telemetry16:54
*** fawadkhaliq has quit IRC16:55
*** fawadkhaliq has joined #openstack-telemetry16:55
openstackgerritBéla Vancsics proposed openstack/ceilometer: Reduced the complexity of the __init__ method  https://review.openstack.org/24703716:55
*** khushbu_ has joined #openstack-telemetry16:57
*** prashantD has joined #openstack-telemetry16:58
*** belmoreira has joined #openstack-telemetry16:58
openstackgerritMehdi Abaakouk (sileht) proposed openstack/gnocchi: ceph: fix computation of read offset  https://review.openstack.org/24703117:01
silehtjd__, alejandrito : with a test: https://review.openstack.org/24703117:01
alejandritosileht, jd__ just got back, let me read everything :P17:02
jd__sileht: 👍17:03
jd__i'll approve if alejandrito +117:03
silehtdamn it, my unicode font is broken again on term !17:04
*** khushbu_ has quit IRC17:06
*** sileht has quit IRC17:10
*** sileht has joined #openstack-telemetry17:11
jd__alejandrito: got it?17:14
* alejandrito is reading17:14
alejandritojd__, perfect, so .. want me to apply https://review.openstack.org/#/c/247031/ and re-try everything ? without deleting and recreating the ceph pool ?17:17
*** khushbu_ has joined #openstack-telemetry17:19
*** rickyrem1 has quit IRC17:20
*** ViswaV has quit IRC17:20
*** khushbu_ has quit IRC17:21
jd__alejandrito: yeah17:21
*** exploreshaifali has quit IRC17:22
*** ViswaV has joined #openstack-telemetry17:22
alejandritojd__,  trying17:22
*** exploreshaifali has joined #openstack-telemetry17:23
*** khushbu_ has joined #openstack-telemetry17:24
*** belmoreira has quit IRC17:25
*** changbl has joined #openstack-telemetry17:29
*** Ala has quit IRC17:29
alejandritojd__, sileht running17:30
*** ildikov has joined #openstack-telemetry17:30
*** gordc has joined #openstack-telemetry17:33
alejandritojd__, sileht with the existing data should i not see "corrupted" messages anymore ? because im seeing them17:34
jd__alejandrito: you should not, you're sure you applied it? :(17:35
alejandritojd__, sileht let me double check ... cause maybe i didnt "setup installed" again :(17:35
* jd__ crosses his fingers17:36
* alejandrito f*** again ... re-running ^_^17:37
*** yassine__ has quit IRC17:38
*** khushbu_ has quit IRC17:38
*** elemoine has quit IRC17:39
alejandritojd__, sileht not seeing corrupted till now17:41
alejandritojd__, it should have happened by now, i'm +1 on merging the fix17:43
jd__kewl17:43
jd__thanks alejandrito17:43
alejandritojd__, i didnt quite get what the problem was even after reading everything17:44
alejandritojd__, can you clarify to me please ?17:44
jd__alejandrito: Gnocchi was reading the file from Ceph in a wrong manner :(17:45
jd__so it was only reading the first few Kb17:45
jd__and so that made the file appear corrupted17:45
*** exploreshaifali has quit IRC17:47
*** exploreshaifali has joined #openstack-telemetry17:48
openstackgerritJulien Danjou proposed openstack/gnocchi: carbonara: deprecate TimeSerieArchive  https://review.openstack.org/24090517:49
alejandritojd__, reading the fix to understand the difference between content being summarized as offset and data17:49
alejandritojd__, sileht so .... do you think THIS is the reason behind my original message from today about not having yesterdays data ?17:58
jd__alejandrito: yes17:59
jd__alejandrito: you'll tell us if we were wrong tomorrow :))18:00
*** yprokule has quit IRC18:00
alejandritojd__, hope to tell you you were right ! what i dont understand is WHY the data started being misread from one day to the next ?18:00
jd__alejandrito: I think it depends on the file size18:01
*** khushbu_ has joined #openstack-telemetry18:02
jd__it's likely the way read() returned our data and how the function misbehaved was transparent for certain file sizes18:02
*** belmoreira has joined #openstack-telemetry18:04
*** khushbu_ has quit IRC18:05
*** belmoreira has quit IRC18:05
alejandritojd__, i see, what i dont understand (and sorry again) is why incrementing offset by the len of the "data" read at a given moment is any different from incrementing it by the len of the content variable ( and i know im missing something, thats why i want to know :D )18:06
openstackgerritMehdi Abaakouk (sileht) proposed openstack/ceilometer: gnocchi: use gnocchiclient instead of requests  https://review.openstack.org/23753818:08
openstackgerritMehdi Abaakouk (sileht) proposed openstack/ceilometer: Use keystoneauth1 instead of manual setup  https://review.openstack.org/23753718:08
alejandritocc sileht ^^18:12
openstackgerritMerged openstack/ceilometer: Move the content of ReleaseNotes to README.rst  https://review.openstack.org/24674318:20
*** fawadkhaliq has quit IRC18:21
*** jaypipes has quit IRC18:34
*** jwcroppe_ has joined #openstack-telemetry18:52
*** jwcroppe has quit IRC18:55
*** harlowja has quit IRC19:05
*** jwcroppe_ is now known as jwcroppe19:06
*** nadya_ has quit IRC19:06
*** harlowja has joined #openstack-telemetry19:08
*** prashantD has quit IRC19:20
*** prashantD has joined #openstack-telemetry19:23
*** bapalm has quit IRC19:23
*** elemoine has joined #openstack-telemetry19:24
*** bapalm has joined #openstack-telemetry19:26
*** vishwanathj has quit IRC19:29
*** nadya has joined #openstack-telemetry19:37
*** prashantD has quit IRC19:42
*** prashantD has joined #openstack-telemetry19:44
*** ddieterly has quit IRC19:45
gordcalejandrito: http://docs.ceph.com/docs/v0.94/rados/api/python/#rados.Ioctx.read19:46
*** pradk_ has quit IRC19:47
gordcalejandrito: from what i can tell, by default, read only returns a limited chunk of data.19:47
*** KrishR has joined #openstack-telemetry19:47
gordcthe offset uses data and not content because offset is a sum of all past data lengths.19:48
gordcif we take content it'll be too long as content is already the sum of all data. i think to use len(content), offset shouldn't be adding the total each loop19:49
*** onder has quit IRC19:50
*** onder has joined #openstack-telemetry19:53
alejandritogordc, let me read what you stated19:56
gordcit's probably a bit convoluted what i typed. basically 'content' is the aggregate of 'data' https://github.com/openstack/gnocchi/blob/master/gnocchi/storage/ceph.py#L20019:59
alejandritogordc, NOW i think i get it ^_^20:00
gordc:) we could probably use len(content) as offset but if it works, it works.20:01
alejandritogordc, you mean ... not offset += len(content) but offset = len(content)20:02
gordcalejandrito: right20:02
alejandritogordc, totally understood20:03
* gordc writes ceph expert on cv20:04
alejandritogordc, what i can't believe is that ... EVERY time until today that a ceph file in gnocchi was bigger than 8K (the default block size on read), the second read was going to be ok, but in THAT SECOND LOOP, offset would get a wrong value, so ... any file bigger than 16K was (or IS, since the fix is not merged) going to come back as a CORRUPTED FILE20:09
* alejandrito doesn't want to believe that, but it seems so :O20:09
alejandritogordc, well ... it's true ... since our metrics didn't last more than a day (because they exceeded the 16K)20:10
gordcyeah, that makes sense. 8k seems to be the default read limit, so after the first two reads, it should still make sense... and after that, the offset jumps exponentially rather than incrementally.20:11
alejandritogordc, exactly, that was it20:11
gordcgood catch by sileht... subtle errors are always the biggest.20:12
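The offset bug discussed above can be reproduced with a small sketch. It assumes a stand-in `FakeIoctx` class (hypothetical, not part of the rados bindings) whose `read(key, length=8192, offset=0)` mimics the chunked behaviour of `rados.Ioctx.read`; the two loop functions are illustrative names and are not the actual gnocchi code.

```python
class FakeIoctx:
    """Hypothetical in-memory stand-in for rados.Ioctx (illustration only)."""

    def __init__(self, objects):
        self._objects = objects

    def read(self, key, length=8192, offset=0):
        # Return at most `length` bytes starting at `offset`,
        # and an empty bytes object once the offset is past the end.
        return self._objects[key][offset:offset + length]


def read_all_buggy(ioctx, name):
    """Read loop with the offset mistake described in the chat."""
    offset = 0
    content = b""
    while True:
        data = ioctx.read(name, offset=offset)
        if not data:
            break
        content += data
        # Bug: content is already the sum of every chunk read so far,
        # so adding its length each loop makes the offset overshoot.
        offset += len(content)
    return content


def read_all_fixed(ioctx, name):
    """Read loop that advances the offset by the chunk just read."""
    offset = 0
    content = b""
    while True:
        data = ioctx.read(name, offset=offset)
        if not data:
            break
        content += data
        offset += len(data)
    return content


if __name__ == "__main__":
    payload = bytes(i % 251 for i in range(40000))  # larger than 16K
    ioctx = FakeIoctx({"obj": payload})
    print(read_all_fixed(ioctx, "obj") == payload)
    print(read_all_buggy(ioctx, "obj") == payload)
```

In this sketch the buggy loop reads offsets 0 and 8192 correctly, then jumps to 24576 and skips a whole chunk, which matches the observation that only objects above 16K are affected. Setting `offset = len(content)` instead would be an equivalent fix.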
*** ddieterly has joined #openstack-telemetry20:13
alejandritogordc, yeah , thankfully we could keep the "corrupted" environment long enough to debug with jd__ and sileht , im happy ^_^20:23
gordcalejandrito: appreciate you being guinea pig for gnocchi :)20:24
alejandritogordc, (o.o)20:24
jd__alejandrito: yeah I think we sucked at our testing, thanks to you we found a big bug :p20:30
jd__we didn't test large archive enough20:30
*** harlowja has quit IRC20:36
*** nadya has quit IRC20:36
openstackgerritOpenStack Proposal Bot proposed openstack/ceilometer: Updated from global requirements  https://review.openstack.org/24710820:38
*** harlowja has joined #openstack-telemetry20:41
*** thorst has quit IRC20:47
*** thorst has joined #openstack-telemetry20:48
*** thorst has quit IRC20:49
alejandritojd__, i'm really happy that we found it ... i remember the developers saying ... "that data disappears at 00hs" and me saying ... IMPOSSIBLE hahahahahaha20:49
*** thorst has joined #openstack-telemetry20:49
*** thorst has quit IRC20:49
*** thorst has joined #openstack-telemetry20:50
gordcalejandrito: it's the cloud. anything can disappear.20:51
alejandritogordc, anything but ceilometer data ! ^_^20:52
*** shardy_ has joined #openstack-telemetry20:52
gordcalejandrito: :)20:52
*** thorst_ has joined #openstack-telemetry20:53
*** thorst has quit IRC20:54
*** sergio__nubeliu has quit IRC20:55
*** thorst_ has quit IRC20:57
*** shardy_ has quit IRC21:00
*** openstack has joined #openstack-telemetry21:03
*** alejandrito has quit IRC21:04
*** thorst has joined #openstack-telemetry21:11
*** elemoine has quit IRC21:12
openstackgerritGeorge Peristerakis proposed openstack/ceilometer: Load a directory of YAML event config files  https://review.openstack.org/24717721:13
*** thorst has quit IRC21:15
openstackgerritGeorge Peristerakis proposed openstack/ceilometer: Load a directory of YAML event config files  https://review.openstack.org/24717721:18
*** changbl has joined #openstack-telemetry21:19
openstackgerritOpenStack Proposal Bot proposed openstack/ceilometer: Updated from global requirements  https://review.openstack.org/24710821:22
*** thorst has joined #openstack-telemetry21:24
*** thorst_ has joined #openstack-telemetry21:25
*** thorst has quit IRC21:28
*** rickyrem has joined #openstack-telemetry21:33
*** rickyrem has quit IRC21:43
*** julim has quit IRC21:51
*** changbl has quit IRC22:02
*** llu has quit IRC22:24
openstackgerritgordon chung proposed openstack/aodh: support queue based communication between evaluator and notifier  https://review.openstack.org/24721122:26
openstackgerritMerged openstack/ceilometer: Fix an indent nit of enforce_limit method  https://review.openstack.org/24674422:29
*** dan-t has quit IRC22:45
openstackgerritOpenStack Proposal Bot proposed openstack/ceilometer: Updated from global requirements  https://review.openstack.org/24710822:47
*** jaypipes has joined #openstack-telemetry22:47
*** edmondsw has quit IRC22:49
*** gordc has quit IRC22:50
*** rjaiswal has joined #openstack-telemetry22:52
*** safchain has quit IRC22:55
openstackgerritMerged openstack/gnocchi: Ensure file basepath exists  https://review.openstack.org/24534822:57
openstackgerritMerged openstack/gnocchi: ceph: fix computation of read offset  https://review.openstack.org/24703122:57
*** pradk has quit IRC23:19
*** ddieterly has quit IRC23:27
*** rbak has quit IRC23:35
*** thorst_ has quit IRC23:55
*** ddieterly has joined #openstack-telemetry23:56
openstackgerritMerged openstack/ceilometermiddleware: Updated from global requirements  https://review.openstack.org/24712523:58
