Monday, 2020-02-03

*** rcernin has joined #openstack-swift [02:27]
*** rcernin has quit IRC [03:31]
*** psachin has joined #openstack-swift [04:29]
*** manuvakery has joined #openstack-swift [04:39]
*** jvisser has quit IRC [04:55]
*** viks___ has joined #openstack-swift [04:57]
*** evrardjp has quit IRC [05:33]
*** evrardjp has joined #openstack-swift [05:34]
*** zaitcev has quit IRC [05:36]
manuvakery: hi guys .. a few days back I set the weight of a disk (DISK12) to zero but did not remove it from the ring. All the partitions moved to other disks and everything seems fine. Now I am getting the below error in the replicator [06:02]
manuvakery: https://www.irccloud.com/pastebin/NtlOGa9S/replicator_error [06:02]
manuvakery: Is it because the disk is still present in the ring? [06:04]
seongsoocho: manuvakery: Hi, please check the owner of the directory /srv/node/disk12 (swift:swift) [06:21]
manuvakery: @seongsoocho> the owner is not swift for now. I was under the impression that if I set the weight to 0, swift won't do anything on that disk [06:23]
seongsoocho: manuvakery: I think the replicator makes the objects directory if it does not exist. Check this code: https://github.com/openstack/swift/blob/2.23.1/swift/obj/replicator.py#L813 [06:28]
manuvakery: seongsoocho: yeah correct. thanks [06:31]
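
The ownership check seongsoocho suggests can be scripted. The following is a minimal sketch, not part of Swift: `owner_ids` and `is_owned_by` are hypothetical helper names, and the caller is assumed to know the numeric uid and gid the device directory should have.

```python
import os


def owner_ids(path):
    """Return (uid, gid) of the path itself, without following symlinks."""
    st = os.lstat(path)
    return st.st_uid, st.st_gid


def is_owned_by(path, uid, gid):
    """True if the path is owned by exactly this numeric uid and gid."""
    return owner_ids(path) == (uid, gid)
```

On a storage node the expected ids could be looked up with `pwd.getpwnam("swift").pw_uid` and `grp.getgrnam("swift").gr_gid`, then passed in together with the device path from the log, /srv/node/disk12.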
seongsoocho: :-) [06:31]
manuvakery: seongsoocho: I can see one more error in the object-error.log, not related to the disk permission but connected with the hidden lock file swift uses to acquire the directory lock [06:41]
manuvakery: https://www.irccloud.com/pastebin/zmOVNiC8/rep-error [06:42]
manuvakery: the error is only on this specific folder [06:42]
seongsoocho: manuvakery: what version do you use? [06:43]
manuvakery: stein [06:45]
manuvakery: if I do an ls on /srv/node/disk5/objects/210847/.lock I can see [06:48]
manuvakery: https://usercontent.irccloud-cdn.com/file/aOmMUUNs/Selection_040.png [06:48]
manuvakery: somehow there is an additional .lock directory [06:49]
seongsoocho: manuvakery: oh.. The .lock should be a file, not a directory. That's weird. [06:50]
manuvakery: how about moving the contents from the .lock dir to /srv/node/disk5/objects/210847 manually .. any harm? [06:52]
seongsoocho: hmm .. What is the date of .lock/hashes.pkl? If it is not today, how about renaming the .lock directory to .lock_tmp? I'm not sure about the side effects of moving the contents manually. [06:58]
manuvakery: seongsoocho: -rw------- 1 swift swift 1027 Feb  3 06:16 hashes.pkl it's today [07:01]
seongsoocho: oh ... Then we need help from someone else. [07:05]
manuvakery: seongsoocho: ok will wait then [07:06]
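
The symptom discussed above, a `.lock` entry that is a directory where a regular file is expected, can be detected without following symlinks. This is a minimal sketch under that assumption: `path_kind` is a hypothetical helper, not a Swift function.

```python
import os
import stat


def path_kind(path):
    """Classify a path as 'file', 'directory', 'missing' or 'other'."""
    try:
        mode = os.lstat(path).st_mode
    except FileNotFoundError:
        return "missing"
    if stat.S_ISREG(mode):
        return "file"
    if stat.S_ISDIR(mode):
        return "directory"
    return "other"
```

For the path from the log, `path_kind("/srv/node/disk5/objects/210847/.lock")` returning "directory" would confirm what the screenshot shows.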
*** tkajinam has quit IRC [07:36]
*** tkajinam has joined #openstack-swift [07:38]
*** pcaruana has joined #openstack-swift [07:43]
*** tkajinam_ has joined #openstack-swift [08:01]
openstackgerrit: David Sariel proposed openstack/swift master: DNM - dd probe tests  https://review.opendev.org/705063 [08:04]
*** tkajinam has quit IRC [08:04]
*** rpittau|afk is now known as rpittau [08:05]
*** rdejoux has joined #openstack-swift [08:08]
*** tesseract has joined #openstack-swift [08:14]
*** tkajinam_ has quit IRC [08:38]
*** mikecmpbll has joined #openstack-swift [09:00]
*** tkajinam has joined #openstack-swift [11:28]
*** rpittau is now known as rpittau|bbl [11:42]
*** psachin has quit IRC [13:16]
*** rpittau|bbl is now known as rpittau [13:34]
*** jvisser has joined #openstack-swift [14:04]
*** tkajinam has quit IRC [14:16]
clayg: manuvakery: seongsoocho: I've seen files unexpectedly be directories before - but I'm never quite sure if it was file system corruption or if an operator accidentally mis-typed something on a node [14:41]
clayg: manuvakery: seongsoocho: I don't think swift would automatically repair from this - ideally the code would quarantine the `.lock` directory [14:41]
clayg: manuvakery: seongsoocho: there is some potential to create "dark data" if the information in those suffixes has been disconnected from the cluster for a long time [14:42]
clayg: you should move it out of the way (i.e. manually quarantine, as seongsoocho suggested) and let replication finish [14:42]
clayg: ... then inspect the contents of those quarantined suffixes with tools like swift-object-info and inspect the primary nodes and container listings for those objects [14:43]
clayg: most likely they can be safely deleted 🤞 [14:43]
manuvakery: clayg: you mean to say I can rename the .lock dir to .lock_tmp and let replication take care of the rest? [15:15]
clayg: yes, but I might even suggest moving it *out* of the data dir tree entirely - into the quarantine directory is always a safe place [15:23]
clayg: but yes, move it out of the way for now and see if replication can get things back on track [15:24]
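
Moving the stray directory out of the data tree, as clayg recommends, could look roughly like the sketch below. It is an illustration only: `move_aside` and the `holding_dir` argument are hypothetical names, and the exact layout of Swift's quarantine directory is not assumed here. The caller chooses the destination.

```python
import os
import shutil
import time


def move_aside(bad_path, holding_dir):
    """Move bad_path into holding_dir under a timestamped name.

    Returns the new location. Refuses to overwrite an existing entry.
    """
    bad_path = bad_path.rstrip(os.sep)
    os.makedirs(holding_dir, exist_ok=True)
    # Drop the leading dot so the moved entry is no longer hidden.
    base = os.path.basename(bad_path).lstrip(".") or "entry"
    dest = os.path.join(holding_dir, "%s-%d" % (base, int(time.time())))
    if os.path.exists(dest):
        raise FileExistsError(dest)
    shutil.move(bad_path, dest)
    return dest
```

Keeping `holding_dir` on the same device as the source lets `shutil.move` perform a plain rename instead of a copy, and nothing is deleted, so the contents remain available for later inspection.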
clayg: I am curious if there's any .data files in those suffixes or if they're all empty? [15:24]
manuvakery: Ok.. yes there is a .data file inside the .lock dir [15:28]
clayg: interesting... so if you use the swift-object-info tool after you've moved the data out of the way you may gain some additional insight into the nature of the data in the corrupt path [15:37]
clayg: for example - it will tell you where all the primary homes for that object should be - if the primaries already have a copy of the same data then you can definitely delete it! [15:37]
*** rdejoux has quit IRC [15:48]
manuvakery: Ok. I will try that tomorrow. Thanks clayg [15:48]
clayg: of course!  do keep us posted 😁 [15:49]
DHE: just want to verify something. I'm using ssync-based replication for my object servers and it seems I'm limited to 1 replication session per disk. so the maximum number of streams to a server is limited to the number of disks. I feel like that's not the intended result. [15:51]
*** gyee has joined #openstack-swift [16:04]
*** rpittau is now known as rpittau|afk [16:33]
timburke: DHE, have a look at replication_concurrency and replication_concurrency_per_device: https://github.com/openstack/swift/blob/2.24.0/etc/object-server.conf-sample#L167-L178 [16:38]
DHE: ah, that's what I was looking for... I swear I looked... [16:39]
DHE: thanks [16:39]
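
To see what a given config sets for the two options timburke names, the file can be read with Python's standard `configparser`. This is a sketch with assumptions: the `[app:object-server]` section name is taken to match the sample file, the values 8 and 2 are purely illustrative and not recommendations, and `read_ssync_limits` is a hypothetical helper rather than Swift's own config loader.

```python
import configparser

# Illustrative fragment only; the numbers are not tuning advice.
SAMPLE = """
[app:object-server]
replication_concurrency = 8
replication_concurrency_per_device = 2
"""


def read_ssync_limits(text, section="app:object-server"):
    """Return (replication_concurrency, replication_concurrency_per_device)."""
    parser = configparser.ConfigParser()
    parser.read_string(text)
    return (
        parser.getint(section, "replication_concurrency"),
        parser.getint(section, "replication_concurrency_per_device"),
    )
```

If either option is commented out or missing, `getint` raises `configparser.NoOptionError`, which signals that the server falls back to its built-in default.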
timburke: FWIW, i'm pretty sure the default for replication_concurrency_per_device came out of a desire to match the pre https://github.com/openstack/swift/commit/e199192c behavior -- no idea about the default for replication_concurrency, but it seems to have come in with the original ssync code, so probably pulled out of thin air [16:40]
DHE: I was thinking relative to rsync where I think the suggested default is 4 [16:40]
timburke: we should consider updating those defaults to something more like what people actually tend to run with [16:40]
DHE: stupid 10gig switch has a firmware bug that causes the port to negotiate at 1gig speeds, so this is capped anyway... [16:42]
timburke: :-( [16:45]
*** evrardjp has quit IRC [17:33]
*** evrardjp has joined #openstack-swift [17:34]
*** mikecmpbll has quit IRC [17:44]
*** mikecmpbll has joined #openstack-swift [18:40]
*** diablo_rojo has joined #openstack-swift [19:02]
*** mikecmpbll has quit IRC [19:02]
*** mikecmpbll has joined #openstack-swift [19:03]
*** zaitcev has joined #openstack-swift [19:05]
*** ChanServ sets mode: +v zaitcev [19:05]
*** mikecmpbll has quit IRC [19:09]
*** tesseract has quit IRC [20:01]
*** irclogbot_1 has quit IRC [20:05]
*** irclogbot_3 has joined #openstack-swift [20:06]
*** rdejoux has joined #openstack-swift [21:06]
*** rdejoux has quit IRC [21:10]
mattoliverau: Morning [22:11]
openstackgerrit: Tim Burke proposed openstack/swift master: WIP: run probe tests on CentOS 8  https://review.opendev.org/690717 [22:12]
*** tkajinam has joined #openstack-swift [22:53]
openstackgerrit: Tim Burke proposed openstack/swift master: py3: Fix up probe tests  https://review.opendev.org/705578 [23:15]
openstackgerrit: Tim Burke proposed openstack/swift master: probe tests: Work when fronted by a TLS terminator  https://review.opendev.org/705579 [23:15]
openstackgerrit: Tim Burke proposed openstack/swift master: WIP: run probe tests on CentOS 8  https://review.opendev.org/690717 [23:44]
