Web Squad ISP

Status
Not open for further replies.
I've read everything here about WACS and upstream providers and whatnot, and to be brutally honest I couldn't care less. Is there no redundancy or other measure in place to stop the entire connection going to %$#@ when this happens?

These past two weeks have probably been the worst fixed-line connection I've had in 5+ years: frequent drops in connection, high latency, packet loss. No one else I know has had any issues on their ISPs, and I've asked around.

You guys came through for me in a hurry so I'm not keen to jump ship, but this is getting super frustrating.

So what's the plan here? When can we expect stability, and are you working to improve redundancy in providers?

Let me start off by assuring you, we do have redundancy - this is why you're still up despite our primary WACS route being down. I know it's little consolation, and I too am very frustrated at the turn of events - I've been putting on all the pressure I can to get this issue resolved, as well as looking at alternative solutions going forward.

A little technical explanation if you want:
Unfortunately, when a transit service loses a route, routes need to re-converge (i.e. all the dead routes need to be pulled out of routing tables on a global scale and new, optimal routes need to be calculated). When it happens on a smaller table (like a NAP peer), this is negligible - a packet or two drops and you're back up and running. On a global table (about 800k prefixes for IPv4 and 90k+ for IPv6), this can take a little longer. But there is definitely redundancy on our network, and there is redundancy on our upstream's networks. Generally, international outages are simple: the service goes down, there is one failover episode and it's done. What's happened here is what we call flapping. The circuit is up, then down, then up. That means we're constantly needing to adjust our tables, our upstreams theirs, and so on - which is a worst-case scenario. Think of it as flipping the switch on a fluorescent light fitting, continuously. At some point the starter takes strain and the light doesn't come on as quickly. Routers aren't designed to recalculate routes constantly. Even best-of-breed routers take 15-45s to recalculate routing tables and ensure traffic goes where it needs to.
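
To put rough numbers on the single-failover versus flapping difference described above, here is a minimal sketch. It is a toy model, not how any router actually works: the function name `degraded_seconds`, the 30s convergence figure (picked from the middle of the 15-45s range quoted above) and the event timings are all assumptions for illustration. Each link state change opens a re-convergence window, and overlapping windows are merged.

```python
def degraded_seconds(event_times, convergence_s):
    """Total seconds spent re-converging, in a toy model.

    event_times   -- timestamps (seconds) of link state changes (up or down)
    convergence_s -- assumed time to re-converge after each change
    Overlapping re-convergence windows are merged, not double counted.
    """
    total = 0.0
    covered_until = float("-inf")  # end of the time already counted
    for t in sorted(event_times):
        end = t + convergence_s
        start = max(t, covered_until)  # skip time already counted
        if end > start:
            total += end - start
        covered_until = max(covered_until, end)
    return total


CONVERGENCE_S = 30  # assumption: middle of the 15-45s range

# One clean failover: a single state change.
single_failover = [0]

# Hypothetical flapping circuit: a state change every 60s, ten times.
slow_flap = [i * 60 for i in range(10)]

# Hypothetical faster flap: a state change every 20s, ten times,
# so each change lands before the previous re-convergence finishes.
fast_flap = [i * 20 for i in range(10)]

print(degraded_seconds(single_failover, CONVERGENCE_S))  # 30.0
print(degraded_seconds(slow_flap, CONVERGENCE_S))        # 300.0
print(degraded_seconds(fast_flap, CONVERGENCE_S))        # 210.0
```

In this model a clean failover costs one 30s window, while the fast-flapping circuit never finishes converging before the next change arrives, so the network stays unsettled for the whole 210s stretch.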

If you look around on this forum, you'll see mentions of other networks that were affected. The particular upstream has about 21 downstream networks including us, so we're certainly not alone. In addition, they have had a pretty rock-solid network (until now) and are extremely responsive in assisting us with routing issues and route improvements (which is a rarity in the wholesale space, and allows us to be more flexible) - so we've been patient with these issues until now. We've got other transit providers in play, and we immediately see a spike in traffic on their networks when this happens. Unfortunately, it's not instant - but we're doing everything we can to make it smoother. That said, we do have an active order for yet another provider, but their lead times are long (once again, I've been on their case today).

Of course, we'll be requesting an RFO for this afternoon's outage and stressing that this is simply unacceptable as these outages have been too regular. From our side, you have my assurance we are working on this, and if this provider can't sort this out, we'll move to another.
 
So I'm getting more packet loss again now, and to add insult to injury I couldn't even get onto this forum through my browser, or even Carbonite.co.za. Weirdly, international traffic seems fine but local was down. Using my MTN cell to post this. What is going on?!
 
Do you have any traces from your PC for us to check? This forum and Carbonite are on Cloudflare. I'm running from a Vuma trenched link right now and able to access both. I ran a trace from your test router last night; it ran all night and didn't show a single drop.


Looks like Cloudflare is serving from CPT instead of JHB. Checking whether there are any issues on that side.
 
Same.. mybb browser takes forever to load. Olivedale/Vumatel

 
Everything appears to have stalled for a bit. Octotel Melkbosstrand. Started pretty much exactly at 17:00. Something up?
 
Here is a screenshot. Please have a look at the top of the image for the packet loss: it is jumping between 27% and 70%-plus, and this has been happening for days now. Today it is completely unplayable. This is not COD, as I am the only one having this issue:

1601997531868.png
 
lol.. Just heard my son complain about his packet loss too. And his mate's ping shot up to 1000 on RAIN.

WACS causing chaos with the boys :D
 
Update: Situation seems to be under control. We're monitoring closely - will elaborate more a little later once we're sure a) this is the root issue, b) it's sorted and c) it's sorted. Apologies for the vagueness here - I know you're used to my nice long-winded explanations.
 
Much better here now, but a bit erratic.
 
Any details will help; we've just brought back NAP CPT - this is where the issue started. So you may see some route adjustments as this comes back online.

Don't have the necessary software installed to give you more detail as I don't game, but things are much better from just a browser point of view.

:p I did reboot my PiHole around the time the outage happened. Hopefully that didn't knock over your network
 
It's all your fault!! :p
 
No problem, thanks for the update. :laugh: yup, it was all that PiHole's fault. No, there was a larger force at play here.. but it seems under control, still watching closely.
 
Yep, got in an hour of game time with no packet loss and decent latency. Holding thumbs for continued stability!
 
Ironically I couldn't reply earlier because once again the connection was down.

I hear you about them being big and having however many other downstream networks, but again, from a consumer point of view, your providers are of little concern to me; it's your service that sucks right now. Particularly because no one else I've checked with has had any issues.

While I appreciate the explanation, and I don't know enough about networks to argue the point, I would expect packet loss sure, but a complete connection failure every time? From pure anecdotal experience:
  • In a game with 4 other people on 3 other ISPs I'm the only one to lose connection.
  • On a meeting with 11 other people on who knows how many ISPs, I'm the only one to lose connection.
So I guess let's see how this goes. I hope you guys find a way to sort out the issues ASAP.
 