Monday, 2020-08-17

openstackgerritGhanshyam Mann proposed openstack/python-swiftclient master: [goal] Migrate testing to ubuntu focal  https://review.opendev.org/74460500:26
openstackgerritGhanshyam Mann proposed openstack/pyeclib master: [goal] Migrate testing to ubuntu focal  https://review.opendev.org/74462300:37
*** psachin has joined #openstack-swift03:38
*** m75abrams has joined #openstack-swift04:11
*** evrardjp has quit IRC04:33
*** evrardjp has joined #openstack-swift04:33
*** mikecmpbll has joined #openstack-swift07:50
*** dsariel has joined #openstack-swift08:14
*** dsariel is now known as dsariel|PTO08:14
*** tdasilva has quit IRC09:11
*** tdasilva has joined #openstack-swift09:18
*** ChanServ sets mode: +v tdasilva09:18
*** mikecmpbll has quit IRC09:53
*** tkajinam has quit IRC09:55
*** mikecmpbll has joined #openstack-swift09:55
*** tkajinam has joined #openstack-swift11:27
*** mikecmpbll has quit IRC11:56
*** mikecmpbll has joined #openstack-swift12:04
*** jv_ has joined #openstack-swift14:03
*** tkajinam has quit IRC14:06
openstackgerritGhanshyam Mann proposed openstack/swift master: [goal] Migrate testing to ubuntu focal  https://review.opendev.org/74460714:17
*** cschwede has joined #openstack-swift14:30
*** ChanServ sets mode: +v cschwede14:30
*** psachin has quit IRC14:45
*** m75abrams has quit IRC14:50
*** gyee has joined #openstack-swift14:58
*** dsariel|PTO has quit IRC15:44
*** zaitcev has joined #openstack-swift17:01
*** ChanServ sets mode: +v zaitcev17:01
*** mikecmpbll has quit IRC17:04
*** mikecmpbll has joined #openstack-swift17:06
claygtimburke: it's possible I never really groked the second half of that EC GET error handling code.  But theory is we're adding 404s in more cases than we should, and the is_success(best_bucket.status) is more restrictive...17:09
timburkeyeah, that's my thinking -- the idea in https://github.com/openstack/swift/blob/2.25.0/swift/proxy/controllers/obj.py#L2458-L2463 seemed to be *just* to fill out for non-durable frags17:31
timburkein general, i'm not really sure about the use of best_response there when we're *already* doing filtering and making decisions about which responses to consider -- it leads to really funky (and not-uncommonly *useless*) logging when we get to https://github.com/openstack/swift/blob/2.25.0/swift/proxy/controllers/base.py#L1928-L192917:46
timburkei've seen lines like "Object returning 503 for []" when the actual responses gathered were 4x timeout, 1 200, 1 404, 2x 404 from handoffs17:48
timburkeso it *looks like* there was some massive error limiting going on or something, but in reality we *really tried* to find something and got a *very* mixed result17:49
claygyeah i'm worried about continuing to extend this "hide it from best_response" strategy 😬18:13
*** hoonetorg has joined #openstack-swift18:29
*** cschwede has quit IRC19:16
clayg@timburke I feel like https://github.com/openstack/swift/blob/master/test/unit/proxy/controllers/test_obj.py#L2860 should be hitting the case we're worried about19:25
claygmaybe it's significant the test name says "will 503" then it asserts 404?19:25
claygwell, maybe if there's non-durable it *should* 404 🤔19:26
claygI guess `and best_bucket and not best_bucket.durable` was mostly a guard on `best_bucket == None` - is it possible the logic is correct in practice despite best_bucket always being true and bad buckets being durable?19:34
openstackgerritClay Gerrard proposed openstack/swift master: wip: tests for mixed frag 404 vs 503  https://review.opendev.org/74659320:03
clayg@timburke I could really use some help zero-ing a test in on the failure you said you saw when playing with probes?20:03
*** mikecmpbll has quit IRC20:09
*** mikecmpbll has joined #openstack-swift20:12
timburkeok, i think i found my edge case again; wrote it up on p 74659321:09
patchbothttps://review.opendev.org/#/c/746593/ - swift - wip: tests for mixed frag 404 vs 503 - 1 patch set21:09
timburkebasically, there were a *ton* of timeouts, mixed in with one 404 from primaries. the idea with p 744942 was that we should ignore the 404 and assume that there *likely* is data on the other servers -- or at least, it's worth the client trying again to find out21:12
patchbothttps://review.opendev.org/#/c/744942/ - swift - Client should retry when there's just one 404 and ... - 5 patch sets21:12
timburkei'm increasingly convinced that the EC controller is going to need to just *entirely replace* best_response21:13
timburkemaybe even make it behave more like a coroutine, where we feed responses to it and it either hands you a response or a sentinel that says "i need more responses"21:15
claygyes!  best_response can pound sand22:13
clayg@timburke i'm getting a 503 on EC when I stop my object servers22:13
timburkeon which patch?22:14
claygdo you think there's a regression in https://review.opendev.org/#/c/711342/or not?22:14
patchbotpatch 711342 - swift - Extend concurrent_gets to EC GET requests - 16 patch sets22:14
claygoh, sorry, difference between GET and HEAD - I guess there *is* a regression there22:16
claygwell, that sucks22:17
timburkeoh, funny... i suppose because it just punts up to the base controller at that point22:17
timburke🤔 maybe HEAD *should* confirm there's at least ndata frags out there...22:18
claygwell, it doesn't bother me so much that it's different - but I am disappointed there was no test on master for the behavior 😞22:26
*** rcernin has joined #openstack-swift22:33
claygso the problem is we don't know if `not bucket.durable` is a good request that wasn't durable or a bad request 😞22:36
timburkemay have been one of those cases where it felt like "oh, well *obviously* if everything's down, we oughta 503"22:36
timburkeyup22:36
openstackgerritClay Gerrard proposed openstack/swift master: wip: tests for mixed frag 404 vs 503  https://review.opendev.org/74659322:57
*** tkajinam has joined #openstack-swift23:07
mattoliveraumorning23:20

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!