
Kong under load keyspace configuration #543

Closed · opened Sep 15, 2015 · 5 comments

@rafael (Contributor) commented Sep 15, 2015

Hi all -

I'm trying to do some small load testing on Kong (0.4.2), but the moment I trigger some concurrent connections I start getting errors. I'm doing all of this testing inside a Kubernetes cluster on gcloud. The cluster consists of 4 n1-highcpu-4 instances (16 vCPUs, 14.4 GB memory in total). Inside the Kubernetes cluster I'm running a Cassandra (2.1.7) cluster with 4 containers/nodes, and also 4 Kong nodes. I'm not putting any limit on how many resources the Docker containers can take from the cluster.

Once I start doing something like this:

ab -n 10000 -c 20 http://104.197.45.83/users

This doesn't seem like a crazy load, but I start seeing sporadic errors on Kong like the following:

2015/09/15 03:27:12 [error] 61#0: *60379 [lua] responses.lua:61: cb(): Cassandra error: Failed to read frame header from 10.176.0.13: timeout, client: 10.240.232.213, server: _, request: "GET /users HTTP/1.0", host: "104.197.45.83"
2015/09/15 03:27:12 [error] 61#0: *60375 [lua] responses.lua:61: cb(): Cassandra error: Failed to read frame header from 10.176.0.13: timeout, client: 10.240.199.154, server: _, request: "GET /users HTTP/1.0", host: "104.197.45.83"
2015/09/15 03:27:12 [error] 61#0: *60350 [lua] responses.lua:61: cb(): Cassandra error: Failed to read frame header from 10.176.0.13: timeout, client: 10.240.116.150, server: _, request: "GET /users HTTP/1.0", host: "104.197.45.83"
2015/09/15 03:27:17 [error] 61#0: *62613 [lua] responses.lua:61: cb(): Cassandra error: Failed to read frame header from 10.176.3.13: timeout, client: 10.240.232.213, server: _, request: "GET /users HTTP/1.0", host: "104.197.45.83"

and on the Cassandra side:

java.io.IOException: Error while read(...): Connection reset by peer
    at io.netty.channel.epoll.Native.readAddress(Native Method) ~[netty-all-4.0.23.Final.jar:4.0.23.Final]
    at io.netty.channel.epoll.EpollSocketChannel$EpollSocketUnsafe.doReadBytes(EpollSocketChannel.java:675) ~[netty-all-4.0.23.Final.jar:4.0.23.Final]
    at io.netty.channel.epoll.EpollSocketChannel$EpollSocketUnsafe.epollInReady(EpollSocketChannel.java:714) ~[netty-all-4.0.23.Final.jar:4.0.23.Final]
    at io.netty.channel.epoll.EpollEventLoop.processReady(EpollEventLoop.java:326) ~[netty-all-4.0.23.Final.jar:4.0.23.Final]
    at io.netty.channel.epoll.EpollEventLoop.run(EpollEventLoop.java:264) ~[netty-all-4.0.23.Final.jar:4.0.23.Final]
    at io.netty.util.concurrent.SingleThreadEventExecutor$2.run(SingleThreadEventExecutor.java:116) ~[netty-all-4.0.23.Final.jar:4.0.23.Final]
    at io.netty.util.concurrent.DefaultThreadFactory$DefaultRunnableDecorator.run(DefaultThreadFactory.java:137) ~[netty-all-4.0.23.Final.jar:4.0.23.Final]
    at java.lang.Thread.run(Thread.java:745) [na:1.7.0_79]

I'm kind of puzzled because, as far as I can tell, neither the containers nor the cluster seem to be under heavy stress. I also ran the cassandra-stress tool against the Cassandra cluster directly and couldn't find any issues.

Do you guys have any idea of what might be going on here?

rafael changed the title "Kong under load issue - question" to "Kong under load issue" Sep 15, 2015
@rafael (Contributor, Author) commented Sep 16, 2015

So my issue ended up being the replication factor, which was set to 1 in the keyspace. Even though I had a bigger cluster, all the requests were being routed to the same node, and eventually it started timing out. Once I updated the replication factor, I stopped getting errors.
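
For reference, a minimal sketch of that update in cqlsh, assuming Kong's default keyspace name of kong (the replication factor of 2 here is illustrative):

-- Raise the replication factor on the existing keyspace (CQL 3 syntax).
ALTER KEYSPACE kong
  WITH REPLICATION = {'class': 'SimpleStrategy', 'replication_factor': 2};

After raising the replication factor, running nodetool repair on each node streams the existing data to its new replicas.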

I'm going to see if I can put together a pull request to make the replication factor a parameter in kong.yml.

@thibaultcha (Member)

Since we already discussed this on Gitter, I'm going to sum it up here for future reference:

Even if it is not explicit, I did not consider the replication factor of 1 in the migration a problem, because one can manually create a keyspace with any configuration and then run the migrations on it; the already-created keyspace would not be overridden.
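
For example, a minimal sketch of creating the keyspace by hand in cqlsh before running any migration (kong is the default keyspace name; the replication factor is illustrative):

-- Create the keyspace up front so the migrations reuse it as-is.
CREATE KEYSPACE IF NOT EXISTS kong
  WITH REPLICATION = {'class': 'SimpleStrategy', 'replication_factor': 2};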

That said, I've been wanting to add those options to the kong.yml file, but lacked the time/prioritisation (since it is already possible, as just explained). Ideally, if you wish to implement it, the file should provide options for keyspace creation as described here. Something like:

cassandra:
  contact_points:
    ...
  keyspace: ...
  #
  # Keyspace options. Set those before running Kong or any migration.
  # See http://docs.datastax.com/en/cql/3.1/cql/cql_reference/create_keyspace_r.html
  #
  # Replica placement strategy class for the keyspace.
  strategy_class: SimpleStrategy
  # Required if class is SimpleStrategy; otherwise, not used. 
  # The number of replicas of data on multiple nodes.
  replication_factor: 1
  # Required if class is NetworkTopologyStrategy. Each entry maps a data
  # center name to the number of replicas of data in that data center.
  data_centers:
    dc1: 2
    dc2: 4
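
With strategy_class set to NetworkTopologyStrategy, those options would presumably translate into CQL along these lines (a sketch, not the exact statement the migrations would issue):

-- Two replicas in dc1, four in dc2, mirroring the data_centers map above.
CREATE KEYSPACE kong
  WITH REPLICATION = {'class': 'NetworkTopologyStrategy', 'dc1': 2, 'dc2': 4};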

At least that was my original idea. If you don't get to it, I'll probably implement this.

@cagdast commented Sep 16, 2015

I faced the same problem with a two-node Cassandra cluster. The issue was solved after changing the replication factor from 1 to 2.
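
As a quick sanity check on Cassandra 2.1 (the version used in this thread), the keyspace's current replication settings can be read from the system tables; on Cassandra 3.0+ the equivalent table is system_schema.keyspaces:

-- Shows the strategy class and options for every keyspace.
SELECT keyspace_name, strategy_class, strategy_options
FROM system.schema_keyspaces;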

thibaultcha changed the title "Kong under load issue" to "Kong under load keyspace configuration" Sep 21, 2015
@thibaultcha (Member)

Related to #350

thibaultcha added a commit that referenced this issue Sep 30, 2015
Possibility to configure the replication strategy used by the created
keyspace and its options.

Implements #543 and #350
thibaultcha self-assigned this Sep 30, 2015
thibaultcha added this to the 0.6.0 milestone Oct 15, 2015
@thibaultcha (Member)

Implemented with #634.
