Wednesday, 2021-01-13

*** ianychoi has joined #openstack-swift00:14
*** openstackgerrit has joined #openstack-swift00:40
openstackgerritTim Burke proposed openstack/swift master: reconciler: Prevent problematic middlewares from being used  https://review.opendev.org/c/openstack/swift/+/77052200:40
DHEThis is from an old-ish version of swift, but I ran into something. swift-ring-builder set_info doesn't update the version of the builder file. caused the composite ring builder to refuse to load the updates01:33
*** tkajinam has quit IRC01:41
*** tkajinam has joined #openstack-swift01:42
*** baojg has joined #openstack-swift01:52
*** tkajinam has quit IRC02:09
*** tkajinam has joined #openstack-swift02:10
*** rcernin has quit IRC02:57
*** rcernin has joined #openstack-swift03:18
*** rcernin has quit IRC03:21
*** rcernin has joined #openstack-swift03:21
*** psachin has joined #openstack-swift03:34
*** rcernin has quit IRC04:35
*** rcernin has joined #openstack-swift04:35
openstackgerritMatthew Oliver proposed openstack/swift master: Reclaim shards when root container is reclaimed  https://review.opendev.org/c/openstack/swift/+/77052904:46
*** gyee has quit IRC05:08
*** evrardjp has quit IRC05:20
*** evrardjp has joined #openstack-swift05:24
*** evrardjp has quit IRC05:33
*** evrardjp has joined #openstack-swift05:35
*** rcernin_ has joined #openstack-swift05:42
*** rcernin has quit IRC05:42
*** m75abrams has joined #openstack-swift06:05
*** timburke__ has quit IRC06:52
*** dsariel has joined #openstack-swift06:55
*** rcernin_ has quit IRC07:28
*** openstackgerrit has quit IRC07:47
*** diablo_rojo__ has quit IRC08:01
*** rcernin_ has joined #openstack-swift08:06
*** rpittau|afk is now known as rpittau08:25
*** rcernin_ has quit IRC08:26
*** dasp_ has quit IRC09:16
*** dasp has joined #openstack-swift09:17
*** hoonetorg has joined #openstack-swift09:44
*** openstackgerrit has joined #openstack-swift13:47
openstackgerritAlistair Coles proposed openstack/swift master: prefactor: Pass s3api Config to S3Requests  https://review.opendev.org/c/openstack/swift/+/72909113:47
openstackgerritAlistair Coles proposed openstack/swift master: s3api: Make allowable clock skew configurable  https://review.opendev.org/c/openstack/swift/+/77000513:47
*** dsariel has quit IRC14:11
*** dsariel has joined #openstack-swift14:12
*** klamath_atx has quit IRC15:23
*** psachin has quit IRC15:28
*** _david_sohonet has joined #openstack-swift15:30
acolestimburke: I was tactical in fixing https://review.opendev.org/c/openstack/swift/+/729091 - there is more work needed on the conf handling, but I didn't want to take it all on while blocking https://review.opendev.org/c/openstack/swift/+/77000515:37
_david_sohonetSince upgrading from 2.17 to 2.25.1 (including an OS upgrade to Ubuntu 18.04) we have been having proxy-server hangs, approximately once every 2 days. This is happening across multiple physical servers. When this occurs, the proxy-server does not respond to a TCP connection at all. The proxy-server is listening on localhost:80 and directly on the server, nc 127.0.0.1 80 times out.  eventlet is 0.25.1-2ubuntu1~cloud0 and python ve15:39
_david_sohonetrsion is 3.6.9. tcpdump shows SYN sent over loopback but no accept() is occuring, no respsonce to SYN.15:39
_david_sohonetCan anyone advise?15:39
_david_sohonetWe have a system 'hung' as above at the moment.  Error log is not showing any issues. The last message was : Jan 13 09:41:06 SM-X9DBL4B-S-3-LAP12-US proxy-server: Client disconnected on read of....    That was a number of hours ago, nothing since then.15:50
DHEthere's a hack that might work. edit /usr/lib/python3.6/logging/__init__.py and set logThreading = False near the top. path might not be exactly right but this should help you find it15:51
DHEcorrection: logThreads = false15:52
_david_sohonetwould a restart of proxy-server be required ?15:52
DHEoh yes15:53
_david_sohonetA restart of the process will resolve the issue, so was hoping there was some data I could gather before doing that.15:54
DHE/proc/$(pid of a hung proxy-server process)/wchan   does it mention futex ?15:54
_david_sohonet# cat /proc/31892/wchan15:56
_david_sohonetpoll_schedule_timeout15:56
_david_sohonet# cat /proc/31899/wchan15:56
_david_sohonetfutex_wait_queue_me15:56
_david_sohonetcat /proc/31900/wchan15:56
_david_sohonetfutex_wait_queue_me15:56
_david_sohonetyes15:56
DHEyeah this is exactly what I think it is15:56
DHEjust go ahead and restart it15:56
_david_sohonet$ curl http://localhost/healthcheck15:57
_david_sohonetOK15:57
_david_sohonetIt recovered and is serving requests again.15:57
DHEhttps://bugs.launchpad.net/swift/+bug/1895739 you're hitting this15:57
openstackLaunchpad bug 1895739 in OpenStack Object Storage (swift) "Proxy server sometimes deadlocks while logging client disconnect" [Undecided,In progress]15:57
_david_sohonetThank you DHE15:57
DHEat least i'm 90% sure15:57
_david_sohonetAFAIK we always see client disconnect in the error logs before the hangs15:58
*** m75abrams has quit IRC16:01
*** dsariel has quit IRC16:34
*** dsariel has joined #openstack-swift16:34
_david_sohonetWe have a misbehaving client application ( ffmpeg / libavfilter ) which appears to never properly close it's GETs ( IIRC they are range requests ), so creates lots of 499s GETs and "Client Disconnect" messages in the error log16:47
*** prometheanfire has left #openstack-swift16:47
_david_sohonetDHE: Will setting logThreads to false in /usr/lib/python3.6/logging/__init__.py  mitigate the bug ?16:49
*** gyee has joined #openstack-swift17:00
*** baojg has quit IRC17:41
*** baojg has joined #openstack-swift17:41
clayg_david_sohonet: we've been having a lot more healthy proxies since we upgrade to eventlet  > 0.28 with https://github.com/eventlet/eventlet/commit/9f49f0bbb9c18d7478d7c5e2e677ae6452845ab517:52
claygi think that logging hang only effects py3 - the eventlet hang was also related to logging, but effected py2 as well17:54
_david_sohonetThanks clayg17:59
*** timburke has joined #openstack-swift17:59
*** ChanServ sets mode: +v timburke17:59
*** rpittau is now known as rpittau|afk18:05
*** baojg has quit IRC18:11
*** baojg has joined #openstack-swift18:11
*** timburke_ has joined #openstack-swift18:13
*** ChanServ sets mode: +v timburke_18:13
*** timburke has quit IRC18:15
*** baojg has quit IRC18:18
*** baojg has joined #openstack-swift18:19
*** _david_sohonet has quit IRC18:23
DHEI've been using this hack for months now successfully. but I realize messing with system packages like this is also frowned upon18:43
DHEclayg: so this fixes py3 hangs?18:44
claygi think Tim is running with that fix in his py3 home cluster and still sees issues - so no it doesn't fix all the py3 hangs18:44
claygi'm not even sure it fixes all the py2 hangs - but we were also rebooting hung proxies all the time until we tracked down and fixed the eventlet bug18:45
*** _david_sohonet has joined #openstack-swift18:57
*** _david_sohonet has quit IRC19:12
*** _david_sohonet has joined #openstack-swift19:14
*** paladox has quit IRC19:34
*** baojg has quit IRC19:39
*** paladox has joined #openstack-swift19:39
*** baojg has joined #openstack-swift19:39
zigoI got 14 unit test failures when building swift in Debian Sid: http://paste.openstack.org/show/801608/19:55
zigotimburke_: Any idea what's going on?19:55
*** hoonetorg has quit IRC20:04
openstackgerritTim Burke proposed openstack/swift master: reconciler: Prevent problematic middlewares from being used  https://review.opendev.org/c/openstack/swift/+/77052220:08
timburke_zigo, not off hand -- what version of mock is it using? and which python 3.9? fwiw, i regularly test with py39 on my fedora dev box (mock==4.0.2, python==3.9.1 currently)20:15
timburke_it's also weird that it doubles-up each test :-/20:15
timburke_ah! looks like upgrading to mock==4.0.3 lets me repro20:23
zigotimburke_: Mock 4.0.3, Python 3.9.1.20:28
zigoI found it weird too indeed.20:28
timburke_i suspect (though haven't confirmed) that https://github.com/testing-cabal/mock/commit/902eea1 triggered it20:28
zigotimburke_: Should I try to revert this patch in the Mock package and try to build swift?20:37
timburke_ah! it's a swift (test) bug. i'll have a patch up shortly20:38
claygtimburke_: you mean you think we did something UNSEEMLY in a TEST!?20:38
clayg😱20:39
zigotimburke_: Ah, great, ping me when you have it! :)20:42
zigoThanks.20:42
openstackgerritTim Burke proposed openstack/swift master: Fix __exit__ calls  https://review.opendev.org/c/openstack/swift/+/77066620:42
timburke_zigo, ^^^20:42
* zigo tries the patch20:43
timburke_clayg, is the shocking part that the unseemly thing only affected tests? ;-)20:44
zigotimburke_: This fixes things for me indeed, thanks !20:49
timburke_👍20:49
*** adeberg has quit IRC20:53
timburke_almost meeting time!20:57
kota_good morning20:58
mattoliveraumorning20:59
mattoliveraukota_: o/21:00
kota_mattoliverau: o/21:01
acoleskota_: mattoliverau good morning21:01
*** theintern_ has joined #openstack-swift21:02
*** hoonetorg has joined #openstack-swift21:18
*** hoonetorg has quit IRC21:43
*** baojg has quit IRC21:43
*** baojg has joined #openstack-swift21:44
openstackgerritAlistair Coles proposed openstack/swift master: WIP: ssync: sync non-durable fragments from handoffs  https://review.opendev.org/c/openstack/swift/+/77004721:49
*** rcernin has joined #openstack-swift21:59
acolesso in my virtual env, 'which swift-Object-server' gives me /vagrant/swift/.tox/py36/bin/swift-Object-server but 'ls /vagrant/swift/.tox/py36/bin/' only shows the lower case swift-object-server22:04
*** klamath_atx has joined #openstack-swift22:11
*** dsariel has quit IRC22:15
*** baojg has quit IRC22:52
*** baojg has joined #openstack-swift22:53
claygyeah, my .tox dir is in ~/ - I use the vtox and venv wrappers which set the toxworkdir to get the heck out of that shared filesystem22:59
*** timburke_ has quit IRC23:11
*** timburke_ has joined #openstack-swift23:11
*** ChanServ sets mode: +v timburke_23:11
*** baojg has quit IRC23:33
*** baojg has joined #openstack-swift23:34
*** theintern_ has quit IRC23:34

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!