Wednesday, 2015-05-06

notmynameand all the unscheduled stuff is below the lines at the bottom after the friday section00:00
*** minwoob has quit IRC00:02
mattoliveraunotmyname: i see it.. my etherpad tab was out of date :P00:04
notmynameor rather, "eventually consistent"?00:05
mattoliveraunotmyname: we could combine fast-post and updateable object sysmeta spec maybe00:07
mattoliverauor do we want to talk about testing00:08
notmynameyeah, I was thinking of combining those two00:10
notmynamethe testing one was intended to be more as an intro to what/how to test swift, especially for newer contributors. not something that's good for a working session00:12
notmynameok, I'll check back tomorrow. i'm going home now00:12
*** NM has quit IRC00:16
mattoliveraunotmyname: k, have a great evening :)00:28
*** ho has joined #openstack-swift00:29
*** zhill_ has quit IRC00:43
-openstackstatus- NOTICE: Restarted gerrit due to stuck stream-events connections. Events since 23:49 were missed and changes uploaded since then will need to be rechecked.00:43
*** mgagne has joined #openstack-swift00:53
mgagneI'm currently trying to setup tempest to test swift and I'm getting this error: "Insufficient number of users provided. ObjectExpiryTest,PublicObjectTest have allocated all the credentials for this allocation request" Anyone with tempest experience who could give me a hand?00:55
mgagneI have accounts.yaml setup with 2 accounts with the operator_role role assigned to them. Do I need more accounts?00:56
mgagneok, somehow I had garbage left in my lock_path folder. I cleaned it and it worked. thanks01:01
zaitcevodd, but all is well that ends well01:12
mgagnezaitcev: in fact, it looks like tempest leaves lock behind. If I run tempest a second time, tests start to fail again01:15
*** haigang has joined #openstack-swift01:53
*** jrichli has quit IRC01:56
*** panbalag has quit IRC01:57
*** haigang has quit IRC02:05
*** wbhuber has joined #openstack-swift02:32
*** bill_az has quit IRC02:39
*** vinsh has joined #openstack-swift02:53
*** fifieldt has joined #openstack-swift03:11
*** NM has joined #openstack-swift03:13
*** wbhuber has quit IRC03:19
*** fifieldt has quit IRC03:20
*** NM has quit IRC03:28
*** vinsh has quit IRC03:31
*** vinsh has joined #openstack-swift03:32
*** vinsh has quit IRC03:36
*** tdasilva has quit IRC03:41
*** links has joined #openstack-swift03:41
*** aix has joined #openstack-swift03:44
*** zhill_ has joined #openstack-swift03:50
*** aix has quit IRC03:50
*** zhill_ has quit IRC03:51
*** tdasilva has joined #openstack-swift04:00
*** aix has joined #openstack-swift04:17
*** proteusguy has quit IRC04:24
*** proteusguy has joined #openstack-swift04:29
*** portante has quit IRC04:40
*** portante has joined #openstack-swift04:45
peluse wbhuber, so I looked and don't have a quick answer for you - will mess with it some more in the morn and we may have to grab someone else for some more advanced mocking04:46
*** silor has joined #openstack-swift04:58
*** bkopilov has quit IRC05:04
*** tamizh_geek has joined #openstack-swift05:05
*** SkyRocknRoll has joined #openstack-swift05:10
*** proteusguy has quit IRC05:11
*** bkopilov has joined #openstack-swift05:13
*** bkopilov has quit IRC05:18
*** bkopilov has joined #openstack-swift05:28
*** bkopilov has quit IRC05:33
*** bkopilov has joined #openstack-swift05:34
*** ppai has joined #openstack-swift05:40
*** early has quit IRC05:49
*** early has joined #openstack-swift05:59
charznotmyname: 20mins(run_seconds) for ec_huge_test scenarios are too long, it almost took 3+ hours to finish a run. I dropped the tests.06:32
*** zaitcev has quit IRC06:35
*** yuanz has joined #openstack-swift06:36
*** bkopilov has quit IRC06:36
charznotmyname: I'm going to change run_seconds to operation_count in ec_huge_test. And also reduce the number of run_seconds, I'll start from 120s, 240s, 480s to 20mins.06:37
charznotmyname: And see which one is close to result(20mins). And I'll use the number for run_seconds.06:39
*** yuan has quit IRC06:39
*** bkopilov has joined #openstack-swift06:57
*** joeljwright has quit IRC06:58
*** geaaru has joined #openstack-swift07:31
openstackgerritHisashi Osanai proposed openstack/swift: WIP: Enable Role-based access control using oslo.policy in Swift  https://review.openstack.org/14993007:48
*** tamizh_geek has quit IRC07:49
*** tamizh_geek has joined #openstack-swift07:50
openstackgerritHisashi Osanai proposed openstack/swift: WIP: Enable Role-based access control using oslo.policy in Swift  https://review.openstack.org/14993007:52
*** jistr has joined #openstack-swift07:59
*** acoles_away is now known as acoles08:13
*** jordanP has joined #openstack-swift08:14
acolesnotmyname: ack no meeting today08:15
*** haomaiw__ has quit IRC08:21
*** chlong has quit IRC08:22
*** ppai has quit IRC08:41
*** proteusguy has joined #openstack-swift08:43
cschwedenotmyname: just one question regarding summit schedule: the test framework has been moved to Friday, so it will be more like a working session then?08:47
*** kota_ has joined #openstack-swift08:47
kota_notmyname: ack for no meeting today. Thanks for the information :)08:48
acoleskota_: you get more sleep ;)08:49
kota_acoles: thanks, today is the last day of the long holidays in Japan, so I'll enjoy it :D08:49
acolesyay!08:50
*** haomaiwa_ has joined #openstack-swift08:50
kota_k, leaving for dinner, see you tomorrow08:51
*** kota_ has quit IRC08:51
*** ppai has joined #openstack-swift08:55
*** clduser_ has quit IRC08:58
*** swifterdarrell has quit IRC08:58
*** torgomatic has quit IRC08:58
*** rsFF has quit IRC08:58
*** clduser_ has joined #openstack-swift09:00
*** swifterdarrell has joined #openstack-swift09:00
*** torgomatic has joined #openstack-swift09:00
*** rsFF has joined #openstack-swift09:00
*** sendak.freenode.net sets mode: +vv swifterdarrell torgomatic09:00
*** bkopilov has quit IRC09:07
*** bkopilov has joined #openstack-swift09:08
*** joeljwright has joined #openstack-swift09:13
*** bkopilov has quit IRC09:14
*** ho has quit IRC09:14
*** bkopilov has joined #openstack-swift09:15
*** bkopilov has quit IRC09:21
*** bkopilov has joined #openstack-swift09:21
*** bkopilov has quit IRC09:29
*** bkopilov has joined #openstack-swift09:30
*** bkopilov has quit IRC09:35
*** joeljwright has quit IRC10:03
openstackgerritMerged openstack/swift: Fix account replication during pre-storage-policy upgrade  https://review.openstack.org/17796410:17
*** panbalag has joined #openstack-swift10:22
*** joeljwright has joined #openstack-swift10:35
*** ppai has quit IRC11:16
*** ppai has joined #openstack-swift11:33
*** aix has quit IRC11:38
*** bkopilov has joined #openstack-swift11:38
*** mherdad has joined #openstack-swift12:01
mherdadswift client check number of replication deploy ?12:02
*** joeljwright has quit IRC12:06
*** SkyRocknRoll has quit IRC12:10
*** ppai has quit IRC12:13
*** erlon has joined #openstack-swift12:15
*** thurloat_isgone is now known as thurloat12:20
*** gyee has joined #openstack-swift12:22
*** SkyRocknRoll has joined #openstack-swift12:23
*** proteusguy has quit IRC12:23
*** mherdad has quit IRC12:26
*** mherdad has joined #openstack-swift12:27
*** aix has joined #openstack-swift12:27
*** NM has joined #openstack-swift12:40
*** tamizh_geek has quit IRC12:46
*** tamizh_geek has joined #openstack-swift12:46
*** proteusguy has joined #openstack-swift12:50
*** tamizh_geek has quit IRC12:51
*** openstackgerrit has quit IRC12:53
*** openstackgerrit has joined #openstack-swift12:53
*** links has quit IRC12:54
*** ppai has joined #openstack-swift12:58
*** tab__ has joined #openstack-swift12:59
*** bill_az has joined #openstack-swift13:00
*** joeljwright has joined #openstack-swift13:10
*** mherdad has quit IRC13:17
*** esker has joined #openstack-swift13:26
*** joeljwright has quit IRC13:34
*** openstackgerrit has quit IRC13:38
*** openstackgerrit has joined #openstack-swift13:38
*** gyee has quit IRC13:45
*** gyee has joined #openstack-swift13:46
*** zaitcev has joined #openstack-swift14:02
*** ChanServ sets mode: +v zaitcev14:02
*** shakamunyi has joined #openstack-swift14:09
*** Nadeem has joined #openstack-swift14:12
*** shakamunyi has quit IRC14:13
*** wbhuber has joined #openstack-swift14:16
*** minwoob has joined #openstack-swift14:35
*** shakamunyi has joined #openstack-swift14:35
*** shakamunyi has quit IRC14:36
*** shakamunyi has joined #openstack-swift14:36
*** vinsh has joined #openstack-swift14:40
*** joeljwright has joined #openstack-swift14:42
*** jrichli has joined #openstack-swift14:50
*** ppai has quit IRC14:50
*** gyee has quit IRC15:03
*** gyee has joined #openstack-swift15:06
*** marcusvrn_ has joined #openstack-swift15:07
*** aix has quit IRC15:09
*** joeljwright has quit IRC15:30
*** Nadeem has quit IRC15:33
*** openstackgerrit has quit IRC15:37
*** openstackgerrit has joined #openstack-swift15:37
*** joeljwright has joined #openstack-swift15:39
*** silor has quit IRC15:50
*** SkyRocknRoll has quit IRC15:56
*** silor has joined #openstack-swift15:57
*** shakamunyi has quit IRC15:58
*** jordanP has quit IRC15:59
*** tamizh_geek has joined #openstack-swift16:08
*** silor has quit IRC16:10
*** proteusguy has quit IRC16:13
*** gyee has quit IRC16:13
*** tab_ has joined #openstack-swift16:20
*** Fin1te has joined #openstack-swift16:21
*** tab_ has quit IRC16:22
*** proteusguy has joined #openstack-swift16:25
*** cdelatte has quit IRC16:26
*** jistr has quit IRC16:29
*** cdelatte has joined #openstack-swift16:33
notmynamegood morning16:36
notmynamecschwede: no, for now that fishbowl session is in the "overflow" bucket. if we talk about it on friday, great. but it's not scheduled16:37
notmynamereminder that there is no meeting today16:37
*** tamizh_g_ has joined #openstack-swift16:41
*** joeljwright has quit IRC16:41
*** tab_ has joined #openstack-swift16:42
*** marcusvrn_ is now known as marcusvrn16:42
*** tamizh_geek has quit IRC16:43
*** jordanP has joined #openstack-swift16:46
*** wolsen_ is now known as wolsen16:47
*** shakamunyi has joined #openstack-swift16:49
*** Nadeem has joined #openstack-swift16:50
*** zhill_ has joined #openstack-swift16:52
*** fthiagogv has joined #openstack-swift16:56
notmynameacoles: http://xkcd.com/1521/17:04
*** Fin1te has quit IRC17:04
*** shakamunyi has quit IRC17:05
*** gyee has joined #openstack-swift17:08
acolesnotmyname: lol17:08
*** tamizh_geek has joined #openstack-swift17:09
hurricanerixmorning17:09
*** tamizh_g_ has quit IRC17:10
*** Nadeem has quit IRC17:16
*** shakamunyi has joined #openstack-swift17:19
*** gyee has quit IRC17:22
openstackgerritMerged openstack/swift: Don't quarantine on missing data files  https://review.openstack.org/18031717:27
*** bill_az has quit IRC17:27
*** marcusvrn has quit IRC17:29
*** tamizh_g_ has joined #openstack-swift17:32
*** tamizh_geek has quit IRC17:33
openstackgerritAlistair Coles proposed openstack/swift: Add POST capability to ssync for .meta files  https://review.openstack.org/13849817:42
*** acoles is now known as acoles_away17:43
*** gyee has joined #openstack-swift17:46
openstackgerritChristian Schwede proposed openstack/swift: Add missing statsd metrics section for object-reconstructor  https://review.openstack.org/18065517:54
*** jkugel has joined #openstack-swift17:57
*** Nadeem_ has joined #openstack-swift17:57
*** Nadeem_ has quit IRC17:58
*** tamizh_geek has joined #openstack-swift18:05
*** tamizh_g_ has quit IRC18:05
*** Nadeem_ has joined #openstack-swift18:07
*** Nadeem_ has quit IRC18:15
*** Fin1te has joined #openstack-swift18:23
*** jordanP has quit IRC18:34
*** gyee has quit IRC18:37
*** geaaru has quit IRC18:39
*** fthiagogv has quit IRC18:45
*** jkugel has quit IRC18:47
*** zhill_ has quit IRC18:47
*** openstackgerrit has quit IRC18:47
*** vinsh has quit IRC18:47
*** clduser_ has quit IRC18:48
*** swifterdarrell has quit IRC18:48
*** torgomatic has quit IRC18:48
*** rsFF has quit IRC18:48
*** bsdkurt1 has quit IRC18:48
*** morganfainberg has quit IRC18:48
*** alpha_ori has quit IRC18:48
*** redbo has quit IRC18:48
*** tanee has quit IRC18:48
*** early has quit IRC18:48
*** wer has quit IRC18:48
*** raginbajin has quit IRC18:48
*** briancline has quit IRC18:48
*** InAnimaTe has quit IRC18:48
*** early has joined #openstack-swift18:49
*** bsdkurt1 has joined #openstack-swift18:49
*** wer has joined #openstack-swift18:49
*** tanee has joined #openstack-swift18:49
*** redbo has joined #openstack-swift18:49
*** alpha_ori has joined #openstack-swift18:49
*** morganfainberg has joined #openstack-swift18:49
*** raginbajin has joined #openstack-swift18:49
*** briancline has joined #openstack-swift18:49
*** InAnimaTe has joined #openstack-swift18:49
*** sendak.freenode.net sets mode: +v redbo18:49
*** jkugel has joined #openstack-swift18:49
*** zhill_ has joined #openstack-swift18:49
*** openstackgerrit has joined #openstack-swift18:49
*** vinsh has joined #openstack-swift18:49
*** clduser_ has joined #openstack-swift18:49
*** swifterdarrell has joined #openstack-swift18:49
*** torgomatic has joined #openstack-swift18:49
*** rsFF has joined #openstack-swift18:49
*** sendak.freenode.net sets mode: +vv swifterdarrell torgomatic18:49
dencavalGuys, isn't it possible to create a bp for swift-bench at launchpad?18:51
torgomaticdencaval: blueprints are terrible; if you want people to actually see your words and discuss them, submit a spec to the swift-specs repository18:52
dencavaltorgomatic Hm, I'll do it. Thanks.18:54
*** minwoob_ has joined #openstack-swift19:02
*** minwoob has quit IRC19:03
*** wbhuber_ has joined #openstack-swift19:04
*** annegentle has joined #openstack-swift19:06
*** wbhuber has quit IRC19:06
*** gyee has joined #openstack-swift19:07
*** zul has quit IRC19:14
*** silor has joined #openstack-swift19:16
*** zul has joined #openstack-swift19:16
*** wbhuber_ has quit IRC19:18
*** bsdkurt1 has quit IRC19:18
*** morganfainberg has quit IRC19:18
*** alpha_ori has quit IRC19:18
*** redbo has quit IRC19:18
*** tanee has quit IRC19:18
*** annegentle has quit IRC19:19
*** wbhuber_ has joined #openstack-swift19:19
*** bsdkurt1 has joined #openstack-swift19:19
*** tanee has joined #openstack-swift19:19
*** redbo has joined #openstack-swift19:19
*** alpha_ori has joined #openstack-swift19:19
*** morganfainberg has joined #openstack-swift19:19
*** sendak.freenode.net sets mode: +v redbo19:19
*** annegentle has joined #openstack-swift19:19
*** Nadeem has joined #openstack-swift19:19
zaitcevno meeting today?19:20
*** morganfainberg has quit IRC19:20
*** morganfainberg has joined #openstack-swift19:21
notmynameno meeting today19:22
*** Fin1te has quit IRC19:25
ahalehmm, so im failing to install a working saio today :(19:32
redboliberasurecode?19:33
ahalenope, recon middleware import error weirdness19:34
ahale'ImportError: No module named middleware.recon'19:34
*** silor has quit IRC19:35
ahalewell i didn't need recon anyway, I only wanted a single object-server and it's fine without that in pipeline19:35
openstackgerritJanie Richling proposed openstack/swift: WIP - working on the encryption feature.  https://review.openstack.org/15790719:45
*** dencaval has quit IRC19:55
claygnotmyname: torgomatic: I went ahead and opened lp bug # 1452431 for the ring issue19:59
claygnotmyname: torgomatic: although I also re-validated that the qa cluster isn't suffering from lp bug #145243120:00
openstackLaunchpad bug 1452431 in OpenStack Object Storage (swift) "some parts replicas assigned to duplicate devices" [Undecided,New] https://launchpad.net/bugs/145243120:00
notmynameclayg: thanks20:04
*** wbhuber__ has joined #openstack-swift20:09
*** tamizh_geek has quit IRC20:10
*** thurloat is now known as thurloat_isgone20:12
*** wbhuber_ has quit IRC20:12
*** gyee has quit IRC20:13
*** breitz has quit IRC20:16
*** breitz has joined #openstack-swift20:16
*** tamizh_geek has joined #openstack-swift20:17
*** zhill has quit IRC20:18
*** Nadeem has quit IRC20:28
*** tamizh_geek has quit IRC20:36
*** bkopilov has quit IRC20:46
mattoliverauMorning all, I'm up early cause I get to wait at the airport for a flight. Another day, another city :p20:57
jrichlimorning!  where are you going?20:58
*** bkopilov has joined #openstack-swift21:00
mattoliverauJust up to Canberra, the city I was born and raised in. Have a house tenanted out, tenants are moving out so I need to go and inspect the property. I lead an exciting life :p21:01
mattoliverauI'll stay up for most the weekend and see family while I'm there (and hack from cafes of course)21:01
jrichliI hope you find good things at the house :-)  Have fun with fam21:04
notmynamemattoliverau: hope you have a safe flight21:05
mattoliverauThanks y'all21:11
*** esker has quit IRC21:28
*** david-lyle has quit IRC21:29
*** InAnimaTe has quit IRC21:34
*** DCWillia_ has joined #openstack-swift21:37
*** NM has quit IRC21:46
claygnotmyname: lp bug #1452468 is the thing for the PUT with the post_quorum_timeout setting - hopefully torgomatic can pick it up21:52
openstackLaunchpad bug 1452468 in OpenStack Object Storage (swift) "ec PUT post commit response is to quick" [Undecided,New] https://launchpad.net/bugs/145246821:52
claygnotmyname: I need to bounce off and look at swifterdarrell's segfault thing21:52
notmynameok21:52
notmynamethanks for the test21:53
notmynameI hope torgomatic and acoles_away can think on it21:53
claygI think this is another one of those bugs acoles_away already told us about :)21:54
notmynametorgomatic: acoles_away: peluse: also, here are some logs for a GET that show some weird behavior (independent of the PUT issue in the bug above): https://gist.github.com/notmyname/3179e59c4a01b7bdc87121:55
claygthere was one with timestamps or etags or something in the reconstructor and peluse and I were like - nah it's fine - I think we did the same thing to him with "if final_phase and need_quorum" and he was like - I think final_phase always means not need_quorum?21:55
notmynameclayg: raising the post put timeout has really helped, but not completely solved it: https://gist.github.com/notmyname/8ab7131d3e948a7dfecd21:56
*** jrichli has quit IRC21:57
claygnotmyname: well the way you have it broken down there i can't tell if those errors are maybe on GET's right?21:58
notmynameyeah, they could be21:58
claygthe GET is a separate issue - I think there's some overwrites in there and we know the ECObjectController is sorta persnickety about mixed etags?21:58
*** jkugel has quit IRC21:58
pelusejust reading the PUT issue now...22:07
notmynamein-office conversation has speculated that its "overwrites" that are causing the errors, so I'm going to try to prove that with a new run22:07
notmynamemy plan, for those who are curious, is to have all this info available to anyone who wants it (and maybe even a pretty summary) at the summit22:07
notmynamehmm..but there were a few errors on a read-only test22:07
*** thurloat_isgone is now known as thurloat22:09
peluseclayg, yeah the final phase never needs quorum.  those two flags were just added to reuse _get_put_responses for sending data and commit22:09
pelusewhich makes me question my memory as to why we needed both :)22:10
mattoliverauI'm turning the phone off, be back online later22:15
*** tab_ has quit IRC22:17
claygpeluse: I think when it was all smushed together with the replicated PUT path it made sense - now it's all crazy town22:20
claygI think I broke it when I merged to master cause I tried to get clever and make the policy.quorum a property instead of a function that takes a number - idk22:21
peluseclayg, yeah, I'm looking at a few things I don't quite "get" anymore.  I do recall the quorum confusion there towards the end...22:24
*** DCWillia_ has quit IRC22:27
pelusehmmm, I guess post_quorum_timeout was a pre-EC thing22:27
*** DCWillia_ has joined #openstack-swift22:28
peluseclayg, have you already traced through the PUT path line by line to make sure none of these flags got flipped and/or quorum related variables got misused during all the PUT path refactoring?22:30
notmynamehere's more on the GET error. has a traceback in it: https://gist.github.com/notmyname/0d6874edf9c2b865031222:30
claygnotmyname: that may be yet a different issue?  it looks sorta like a chunk read timeout?22:33
*** DCWillia_ has quit IRC22:33
pelusenotmyname, hmmm, somehow we call decode before having all our frags? that's interesting too22:33
notmynameI found it when I was trying to find the 6 errors in the read only test (from above)22:33
claygpeluse: yeah I thought there was an explicit test for len(fragments) > num_needed_to_decode that will fail the request (i.e. stop sending segments) before calling decode.22:34
claygnotmyname: maybe it was a duplicated fragment index in the payload - do you have the reconstructor running while you're benchmarking?22:35
notmyname"No object-reconstructor running"22:36
notmynameactually that surprised me. I expected it to be running22:36
*** annegentle has quit IRC22:38
peluseI've got to take off for a few hrs.  Keep me posted, I have some time tonight if needed and again on Fri but Thu is totally shot to hell for me22:38
peluseis this only repro'able so far on a real cluster under load?22:39
notmynamepeluse: it's the only place I've tried.22:39
pelusecool, back later...22:41
notmynameok, updated the scenarios to remove updates and restarted the tests22:43
*** DCWillia_ has joined #openstack-swift22:44
*** annegentle has joined #openstack-swift22:45
*** wbhuber__ has quit IRC22:46
*** DCWillia_ has quit IRC22:47
*** DCWillia_ has joined #openstack-swift22:48
*** DCWilliams_VA has joined #openstack-swift22:51
*** DCWillia_ has quit IRC22:52
*** DCWilliams_VA has quit IRC22:56
*** macredcape has quit IRC22:56
torgomaticpeluse: doesn't the EC commit phase need a quorum of 2? as in, if we only manage to drop 0 or 1 .durable files, it's not enough?23:00
*** annegentle has quit IRC23:02
notmynameclayg: nope :-(23:03
claygtorgomatic: well i mean - yeah that's how the code wants to work - but I think I broke it - sorry23:03
claygoh... no i misread you23:04
claygI mean I think we need more than 023:04
torgomaticnotmyname: clayg: yeah, that'd have to be it... there's no way to pass fewer than ec_ndata items through the GET code, so it'd have to be a duplicated index or something that makes pyeclib throw data away23:04
claygand in a system that might fail - I think more than 1 is good23:04
notmynameclayg: my "nope" was wrt the Update phase doing the errors23:04
torgomaticclayg: yeah, which is why we picked 2, right? or something like ec_ndata // 2 + 1 ?23:04
claygI guess someone picked 2 - but it turns out there was some plumbing that got horked or maybe it only always looked like it worked because commits were quick and post_quorum_timeout was long enough23:05
torgomaticclayg: so your assertion is that we're landing ec_ndata + 1 FAs on disk okay, but then we're looking to land ec_ndata + 1 .durable files on disk, but they don't complete quickly enough?23:05
torgomatic(just so I'm clear)23:05
notmynamehttps://gist.github.com/notmyname/3738929f75b25dd06c7123:05
claygtorgomatic: well... I think the test attached to the bug is pretty clear on the *failure* - I'm less sure about how we go from "this doesn't work" -> "how it should work is this" -> "here this works like we want"23:07
claygbecause - well I don't know how it should work :\23:07
claygit should not 50323:07
notmynameok, same read issue. gets 6 200s and some 404s and returns 50323:07
notmynameno traceback23:08
claygnotmyname: oh ok, so no object-reconstructor running - no overwrites - still getting 503's23:09
claygnotmyname: but you're not seeing the liberasure "not enough fragments" tracebacks?  It's just 6 200's from the object servers and then a 50323:09
notmynamein this case, no liberasure string in the entire logs23:10
notmynamewell, wait23:11
notmynamemaybe I can lurn to grep23:11
torgomaticclayg: yeah, I'm having trouble understanding the bug from that unit test :(23:11
notmynameI see the liberasure error some. not many times. definitely way less than the number of errors I have23:12
*** ondergetekende has quit IRC23:13
notmynamehttps://gist.github.com/notmyname/b0bd85c0b068e0eec895  <-- 404s 200s, resulting in a 503 (with the liberasure traceback)23:14
*** asettle is now known as asettle-bfast23:14
*** asettle-bfast has quit IRC23:14
notmynameoh, interesting23:15
*** ondergetekende has joined #openstack-swift23:15
*** chlong has joined #openstack-swift23:16
notmynamethe requests with the tracebacks don't have a proxy log line. the lines that had the proxy log a 503 don't have a traceback23:18
* notmyname isn't sure if that's interesting or not23:18
claygtorgomatic: so if you PUT and some of the commits are slower than post_quorum_timeout - you get a 50323:19
*** zhill_ has quit IRC23:20
claygso in that unittest you can either make post_quorum_timeout longer - or make more of the requests not be slow to commit - and it will pass23:21
claygbut it'd be cool like if it worked like for realzy on jd's cluster23:21
torgomaticclayg: maybe something like this? https://gist.github.com/smerritt/d8ca1e378c23d66c65e023:21
torgomaticinstead of needing $quorum, then 2, then $quorum again23:21
* torgomatic goes to try it with that unit test23:21
notmynamenot just my cluster. it's everyone's cluster ;-)   /music plays. and credits23:21
*** proteusguy has quit IRC23:22
*** vinsh has quit IRC23:23
torgomaticnotmyname: does that mean I get to go home now?23:25
claygtorgomatic: maybe - how'd your patch go?23:25
notmynametorgomatic: you're already home!23:25
claygtorgomatic: wait - aren :D23:25
claygbeat me23:25
*** kei_yama has joined #openstack-swift23:26
torgomaticheh, my house is full of kids. I'm at a coffee shop for my own sanity.23:26
torgomaticclayg: so that patch makes your test pass23:26
torgomaticI'm gonna refine it a tiny bit here; it could really use a comment or two23:27
claygtorgomatic: for sure!23:27
claygtorgomatic: i wasn't even sure what I was doing really besides just making a unittest that did like the qa cluster and watching it 50323:27
torgomaticwell, I was gonna update my gist, but I'll just chuck it into Gerrit and see what happens23:29
openstackgerritSamuel Merritt proposed openstack/swift: EC: don't 503 on marginally-successful PUT  https://review.openstack.org/18079523:36
torgomaticclayg: alright, try that on for size23:36
notmynamewhat am I missing?23:37
notmynamethat renames a variable and adds a comment23:37
torgomaticnotmyname: isn't there a unit test in there?23:37
torgomaticor did I screw that up?23:37
torgomaticno, looks like it's there23:37
notmynameyeah, the test file changes are there23:38
notmynameI was just looking at the non-test first23:38
torgomaticSUBTLETY!23:38
claygtorgomatic: wait wait?23:38
torgomaticclayg: what'd I do?23:38
claygso you didn't need to *fix* anything to make the test pass?23:38
notmynameyeah, that's what I'm saying23:39
torgomaticheh, min_conns is defined before that try: and is used both before and after it23:39
torgomaticI'm updating it with a new value23:39
torgomaticthe minimum goes down once it's time for .durable files23:39
* clayg 's head explodes23:40
*** ho has joined #openstack-swift23:44
hogood morning!23:44
notmynametorgomatic: clayg: I'm going to apply that patch live and see what happens23:44
claygnotmyname: is doing it live!23:45
claygnotmyname: only thing it will help with is you can lower your post_quorum_timeout back down23:45
torgomaticnotmyname: http://www.strengthleader.com/wp-content/uploads/2014/07/Jenga.jpg23:46
*** david-lyle has joined #openstack-swift23:47
claygtorgomatic: don't breathe23:50
notmynamepatched and tests have restarted23:50
notmynamealso lowered the post put timeout back to 0.523:51
claygnotmyname: so torgomatic and I need more info on the GET failures - like we need to expect the available and requested fragments from the GET that failed23:55
claygnotmyname: can you run a scenario that will generate the 503 on GET (pure write, not overwrite) and halt then try to see if those objects will continue to 503 if you GET with curl?23:55
claygnotmyname: i'm not even clear if there's two bugs or one - the one with the liberasure decode thing I think torgomatic and I have good idea how that might happen - is that the only one?23:56
claygI mean I know there's the overwrite bug - but the remaining failures are from non-overwrite workloads correct?23:56
notmynameyeah, no object is deleted so fetching the object that failed on PUT should still be available23:57
*** DCWillia_ has joined #openstack-swift23:57
notmynamecorrect. no overwrites right now23:57
claygnotmyname: oh I thought we were done with PUT bugs :P23:57
notmynamethe increased proxy timeout is for that23:57
claygnotmyname: but yeah if you can run a scenario that will trigger the 503 on GET (no overwrite) and then we can try to GET it again - that'd be *greataaaaat*23:57
notmynameso that's what i'm testing23:58
claygfine23:58
openstackgerritSamuel Merritt proposed openstack/swift: EC: don't 503 on marginally-successful PUT  https://review.openstack.org/18079523:58
* clayg has so moved on past "someone should probably check if the passing unittest actually fixes the issue"23:58
claygI have so much confidence in torgomatic it's not even funny23:58
