Lagspike saga

Announcements about major changes in Haven & Hearth.

Re: Lagspike saga

Postby loftar » Tue Mar 12, 2019 9:22 pm

I realized a couple of days ago that I could use the 32 GB swap partition that I had allocated specifically to the NVME drives to debug this. Since the swap isn't normally in use, I deactivated it and trimmed those entire partitions. Immediately afterward, the lagspikes decreased substantially, but since then they seem to have been coming back again, so it wasn't exactly a resounding success.

Having looked at more I/O traces since then, however, I just can't help but get the feeling that it should be TRIM-related, because the only I/O operations that seem to be getting high latency are writes, and at the times when they happen, large batches of writes are often completed at once, after they have experienced high latency together. Not sure what to make of that, but it's not like it didn't seem to do "something", and it's not like it couldn't be that 32 GB is just not enough (though that analysis also does sound quite optimistic). Perhaps I should try to reserve more unused space on the drives and try again. That will take some downtime, however, so I think I'll ponder it a bit more.
"Object-oriented design is the roman numerals of computing." -- Rob Pike
User avatar
loftar
 
Posts: 8926
Joined: Fri Apr 03, 2009 7:05 am

Re: Lagspike saga

Postby Granger » Tue Mar 12, 2019 9:44 pm

As far as I understood the bcache documentation you should be able to completely detach (remove) a cache device (performance will be *meh* then as they'll be written back to the base device which then will take over service) so you should be able to repartition the NVMe's on-the-fly.
⁎ Mon Mar 22, 2010 ✝ Thu Jan 23, 2020
User avatar
Granger
 
Posts: 9263
Joined: Mon Mar 22, 2010 2:00 pm

Re: Lagspike saga

Postby loftar » Tue Mar 12, 2019 10:10 pm

Granger wrote:As far as I understood the bcache documentation you should be able to completely detach (remove) a cache device (performance will be *meh* then as they'll be written back to the base device which then will take over service) so you should be able to repartition the NVMe's on-the-fly.

True enough. I was thinking that performance might be bad enough that I might as well shut down, but I guess there's no reason to draw that conclusion in advance.
"Object-oriented design is the roman numerals of computing." -- Rob Pike
User avatar
loftar
 
Posts: 8926
Joined: Fri Apr 03, 2009 7:05 am

Re: Lagspike saga

Postby Granger » Tue Mar 12, 2019 10:19 pm

loftar wrote:
Granger wrote:As far as I understood the bcache documentation you should be able to completely detach (remove) a cache device (performance will be *meh* then as they'll be written back to the base device which then will take over service) so you should be able to repartition the NVMe's on-the-fly.

True enough. I was thinking that performance might be bad enough that I might as well shut down, but I guess there's no reason to draw that conclusion in advance.

On the other hand: the last worlds ran on the old server quite nicely, that one only had HDDs...
⁎ Mon Mar 22, 2010 ✝ Thu Jan 23, 2020
User avatar
Granger
 
Posts: 9263
Joined: Mon Mar 22, 2010 2:00 pm

Re: Lagspike saga

Postby Grog » Sun Mar 17, 2019 9:08 pm

The lagspikes now are insane.
Favourite thread: viewtopic.php?f=9&t=3388
User avatar
Grog
 
Posts: 2730
Joined: Mon Feb 08, 2010 11:42 pm
Location: Germany

Re: Lagspike saga

Postby Fierce_Deity » Tue Mar 26, 2019 5:31 am

Seems to be especially bad right now.
Fierce_Deity
 
Posts: 783
Joined: Thu Feb 12, 2015 4:11 pm

Re: Lagspike saga

Postby Granger » Wed Apr 03, 2019 7:30 pm

loftar wrote:
Granger wrote:As far as I understood the bcache documentation you should be able to completely detach (remove) a cache device (performance will be *meh* then as they'll be written back to the base device which then will take over service) so you should be able to repartition the NVMe's on-the-fly.

True enough. I was thinking that performance might be bad enough that I might as well shut down, but I guess there's no reason to draw that conclusion in advance.

As a polite reminder: At what point in time do you plan to tackle this?
⁎ Mon Mar 22, 2010 ✝ Thu Jan 23, 2020
User avatar
Granger
 
Posts: 9263
Joined: Mon Mar 22, 2010 2:00 pm

Re: Lagspike saga

Postby jordancoles » Thu Apr 04, 2019 11:15 am

Getting laggy peaks every 3 seconds or so, had to log off because it's not playable
Duhhrail wrote:No matter how fast you think you can beat your meat, Jordancoles lies in the shadows and waits to attack his defenseless prey. (tl;dr) Don't afk and jack off. :lol:

Check out my pro-tips thread
Image Image Image
User avatar
jordancoles
 
Posts: 14015
Joined: Sun May 29, 2011 6:50 pm
Location: British Columbia, Canada

Re: Lagspike saga

Postby Sarge » Thu Apr 04, 2019 11:53 am

It's been particularly bad since yesterday.
factnfiction101 wrote:^I agree with this guy.
User avatar
Sarge
 
Posts: 2029
Joined: Fri Oct 09, 2009 3:41 am

Re: Lagspike saga

Postby Omnipotent » Thu Apr 04, 2019 6:29 pm

+1 to temporary shutdown for possible fix. I don't mind waiting even if its down for an entire day.

The last 48 hours have been particularly bad as has been mentioned by Sarge & Coles. Not sure if its just the bots from major factions or what, but it can be quite unpleasant at times of severe lag. Anything would be better than nothing.
User avatar
Omnipotent
 
Posts: 291
Joined: Wed Aug 19, 2009 9:55 pm
Location: California

PreviousNext

Return to Announcements

Who is online

Users browsing this forum: Python-Requests [Bot] and 21 guests