Replies: 1 comment 1 reply
-
|
@sjaakiejj - Did you find a solution to this? I have a websocket application and I need to guarantee a 1:1 ratio, 1 client to 1 pod. I was hoping the Knative with |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi all,
We've got a system where we run a fairly CPU-heavy process as part of incoming websocket requests. To ensure the system both protects itself from running out of resources and can handle sudden high loads, we're looking into Knative Serving as a potential solution, given that the
containerConcurrencyproperty seems to do exactly what we need.Effectively we're looking to limit each container to 10 active websocket connections and force the auto scaler to add new instances if we need more. We've setup our deployment yaml as follows:
However, when applying a load of 40 active socket connections to this service, all of them go to the same pod. I checked netstat to get the number of active connections in the pod, and it returned 210
ESTABLISHEDrecords on port8080.Displaying the list of pods, I get this:
It seems no matter how much load I throw at it, it refuses to scale. There are no errors in the autoscaler logs but also no indications that it attempts to scale. I feel like I'm missing something obvious, but I can't figure it out based on the documentation.
Is this expected to work the way I think it should? What's the best way to debug issues like this?
Beta Was this translation helpful? Give feedback.
All reactions