Tag: Spring

See Techempower. This repository contains homemade java benchmarks using spring-mvc, spring-webflux and netty-http/netty-tcp servers based on reactor-netty. gin and gnet are also included. wrk is used as client. gobench is also considered but it is not so good as wrk.

# benchmarking plaintext
# ./wrk -c 1000 -t 30 -d 30s http://10.xx.xx.xx:8124/text

1 2	# benchmarking plaintext # ./wrk -c 1000 -t 30 -d 30s http://10.xx.xx.xx:8124/text

Environment 1

Server: 8C16G vm
Client: 4C8G vm * 2

Server	Server Throughput	Server CPU
spring-mvc	25k ~ 30k /s	~600%
spring-webflux	90k ~ 110k /s	~780%
go-gin	110k ~ 120k /s	~600%
go-gnet	110k ~ 120k /s	~270%
netty-http	110k ~ 120k /s	~480%
netty-tcp	110k ~ 120k /s	~360%

2 VM Clients are not able to fully utilize the server capability. The initial attempts were benchmarking only first 4 cases. And the go-gnet results made me wonder, it can give much more throughput. After reading the source of it, I found go-gnet case is actually a TCP server with very very little of HTTP implementation to fulfill the benchmark, which is unfair for other cases. Therefore, I added case 5/6 in java to align with it.

Environment 2

Server: 24C32G physical machine
Client:
- 4C8G vm * 2
- 8C16G vm * 1
- 24C32G physical machine * 1

Server	Server Throughput	Server CPU
spring-mvc	~120k /s	~1560%
spring-webflux	~180k /s	~2380%
go-gin	~380k /s	~2350%
go-gnet	560k ~ 580k /s	~1160%
netty-http	560k ~ 580k /s	~2350%
netty-tcp	560k ~ 580k /s	~1460%

Still room to give more throughput in go-gnet and netty-tcp cases. Not having so many idle systems for benchmarking now. The throughput should have a linear increment when more CPU is utilized, in both cases.

As a developer, spring-mvc or go-gin can still be the first choice, as they are easier to get started.

A Spring Cloud Toy Project

August 21, 2018 by gonwan·0 Comments

Recently played with the Spring/SpringBoot/SpringCloud stack with a toy project: https://github.com/gonwan/spring-cloud-demo. Just paste README.md here, and any pull request is welcome:

Introduction

The demo project is initialized from https://github.com/carnellj/spmia-chapter10. Additions are:

Code cleanup, bug fix, and better comments.
Java 9+ support.
Spring Boot 2.0 migration.
Switch from Postgres to MySQL, and from Kafka to RabbitMQ.
Easier local debugging by switching off service discovery and remote config file lookup.
Kubernetes support.
Swagger Integration.
Spring Boot Admin Integration.

The project includes:

[eureka-server]: Service for service discovery. Registered services are shown on its web frontend, running at 8761 port.
[config-server]: Service for config file management. Config files can be accessed via: http://${config-server}:8888/${appname}/${profile}. Where ${appname} is spring.application.name and ${profile} is something like dev, prd or default.
[zipkin-server]: Service to aggregate distributed tracing data, working with spring-cloud-sleuth. It runs at 9411 port. All cross service requests, message bus delivery are traced by default.
[zuul-server]: Gateway service to route requests, running at 5555 port.
[authentication-service]: OAuth2 enabled authentication service running at 8901. Redis is used for token cache. JWT support is also included. Spring Cloud Security 2.0 saves a lot when building this kind of services.
[organization-service]: Application service holding organization information, running at 8085. It also acts as an OAuth2 client to authentication-service for authorization.
[license-service]: Application service holding license information, running at 8080. It also acts as an OAuth2 client to authentication-service for authorization.
[config]: Config files hosted to be accessed by config-server.
[docker]: Docker compose support.
[kubernetes]: Kubernetes support.

NOTE: The new OAuth2 support in Spring is actively being developed. All functions are merging into core Spring Security 5. As a result, current implementation is suppose to change. See:

Tested Dependencies

Java 8+
Docker 1.13+
Kubernetes 1.11+

Building Docker Images

export BUILD_NAME=2.0.0
mvn clean package docker:build

1 2	export BUILD_NAME=2.0.0 mvn clean package docker:build

In case of running out of disk space, clean up unused images and volumes with:

docker rmi $(docker images -f "dangling=true" -q)
docker volume prune

1 2	docker rmi $(docker images -f "dangling=true" -q) docker volume prune

Running Docker Compose

export BUILD_NAME=2.0.0
docker-compose -f docker/docker-compose.yml up

1 2	export BUILD_NAME=2.0.0 docker-compose -f docker/docker-compose.yml up

Or with separate services:

docker-compose -f docker/docker-compose.yml up authentication-service organization-service license-service

1	docker-compose -f docker/docker-compose.yml up authentication-service organization-service license-service

Running Kubernetes

NOTE: Kubernetes does not support environment variable substitution by default.

kubectl create -f kubernetes/kubernetes.yml

1	kubectl create -f kubernetes/kubernetes.yml

Use Cases

Suppose you are using the kubernetes deployment.

Get OAuth2 token

curl is used here, and 31004 is the cluster-wide port of the Zuul gateway server:

# curl -u eagleeye:thisissecret http://172.16.87.12:31004/api/auth/oauth/token -X POST -d "grant_type=password&scope=webclient&username=user&password=password1"
{"access_token":"d3b817dc-fb7a-4e65-a080-d0e34c0dc4d5","token_type":"bearer","refresh_token":"a5d12d05-78ff-4170-ab4f-b9c4e9886358","expires_in":41496,"scope":"webclient"}

# curl -u eagleeye:thisissecret http://172.16.87.12:31004/api/auth/oauth/token -X POST -d "grant_type=password&scope=webclient&username=user&password=password1"

{"access_token":"d3b817dc-fb7a-4e65-a080-d0e34c0dc4d5","token_type":"bearer","refresh_token":"a5d12d05-78ff-4170-ab4f-b9c4e9886358","expires_in":41496,"scope":"webclient"}

Get organization info

Use the token returned from previous request.

# curl -H "Authorization: Bearer d3b817dc-fb7a-4e65-a080-d0e34c0dc4d5" http://172.16.87.12:31004/api/organization/v1/organizations/e254f8c-c442-4ebe-a82a-e2fc1d1ff78a
{"id":"e254f8c-c442-4ebe-a82a-e2fc1d1ff78a","name":"customer-crm-co","contactName":"Mark Balster","contactEmail":"mark.balster@custcrmco.com","contactPhone":"823-555-1212"}

# curl -H "Authorization: Bearer d3b817dc-fb7a-4e65-a080-d0e34c0dc4d5" http://172.16.87.12:31004/api/organization/v1/organizations/e254f8c-c442-4ebe-a82a-e2fc1d1ff78a

{"id":"e254f8c-c442-4ebe-a82a-e2fc1d1ff78a","name":"customer-crm-co","contactName":"Mark Balster","contactEmail":"mark.balster@custcrmco.com","contactPhone":"823-555-1212"}

Get license info associated with organization info

Use the token returned from previous request.

# curl -H "Authorization: Bearer d3b817dc-fb7a-4e65-a080-d0e34c0dc4d5" http://172.16.87.12:31004/api/license/v1/organizations/e254f8c-c442-4ebe-a82a-e2fc1d1ff78a/licenses/f3831f8c-c338-4ebe-a82a-e2fc1d1ff78a
{"id":"f3831f8c-c338-4ebe-a82a-e2fc1d1ff78a","organizationId":"e254f8c-c442-4ebe-a82a-e2fc1d1ff78a","organizationName":"customer-crm-co","contactName":"Mark Balster","contactPhone":"823-555-1212","contactEmail":"mark.balster@custcrmco.com","productName":"CustomerPro","licenseType":"user","licenseMax":100,"licenseAllocated":5,"comment":null}

# curl -H "Authorization: Bearer d3b817dc-fb7a-4e65-a080-d0e34c0dc4d5" http://172.16.87.12:31004/api/license/v1/organizations/e254f8c-c442-4ebe-a82a-e2fc1d1ff78a/licenses/f3831f8c-c338-4ebe-a82a-e2fc1d1ff78a

{"id":"f3831f8c-c338-4ebe-a82a-e2fc1d1ff78a","organizationId":"e254f8c-c442-4ebe-a82a-e2fc1d1ff78a","organizationName":"customer-crm-co","contactName":"Mark Balster","contactPhone":"823-555-1212","contactEmail":"mark.balster@custcrmco.com","productName":"CustomerPro","licenseType":"user","licenseMax":100,"licenseAllocated":5,"comment":null}

Distributed Tracing via Zipkin

Every response contains a correlation ID to help diagnose possible failures among service call. Run with curl -v to get it:

# curl -v ...
...
< sc-correlation-id: 3265b50156556c05
...

# curl -v ...

...

< sc-correlation-id: 3265b50156556c05

...

Search it in Zipkin to get all trace info, including latencies if you are interested in.
zipkin-1
zipkin-2

The license service caches organization info in Redis, prefixed with organizations:. So you may want to clear them to get a complete tracing of cross service invoke.

redis-cli -h 172.16.87.12 -c del $(redis-cli -h 172.16.87.12 -c keys organizations* | gawk '{ print $1 }')

1	redis-cli -h 172.16.87.12 -c del $(redis-cli -h 172.16.87.12 -c keys organizations* \| gawk '{ print $1 }')

Working with OAuth2

All OAuth2 tokens are cached in Redis, prefixed with oauth2:. There is also JWT token support. Comment/Uncomment @Configuration in AuthorizationServerConfiguration and JwtAuthorizationServerConfiguration classes to switch it on/off.

Swagger Integration

The organization service and license service have Swagger integration. Access via /swagger-ui.html.

Spring Boot Admin Integration

Spring Boot Admin is integrated into the eureka server. Access via: http://${eureka-server}:8761/admin.
sba-1

Batch Insert with MySQL

December 27, 2017 by gonwan·1 Comment

Adopting to using Spring Data JPA these day, there is a post saying: IDENTITY generator disables JDBC batch inserts. To figure out the impact, create a table with 10 data fields and an auto-increment id for testing. I am using MySQL 5.7.20 / MariaDB 10.3.3 / Spring Data JPA 1.11.8 / Hibernate 5.0.12.

CREATE TABLE `t_user` (
  `id` int(11) NOT NULL AUTO_INCREMENT,
  `field1` varchar(255) DEFAULT NULL,
  `field2` varchar(255) DEFAULT NULL,
  `field3` varchar(255) DEFAULT NULL,
  `field4` varchar(255) DEFAULT NULL,
  `field5` varchar(255) DEFAULT NULL,
  `field6` varchar(255) DEFAULT NULL,
  `field7` varchar(255) DEFAULT NULL,
  `field8` varchar(255) DEFAULT NULL,
  `field9` varchar(255) DEFAULT NULL,
  `field10` varchar(255) DEFAULT NULL,
  PRIMARY KEY (`id`)
) ENGINE=InnoDB DEFAULT CHARSET=utf8;

CREATE TABLE `t_user` (

`id` int(11) NOT NULL AUTO_INCREMENT,

`field1` varchar(255) DEFAULT NULL,

`field2` varchar(255) DEFAULT NULL,

`field3` varchar(255) DEFAULT NULL,

`field4` varchar(255) DEFAULT NULL,

`field5` varchar(255) DEFAULT NULL,

`field6` varchar(255) DEFAULT NULL,

`field7` varchar(255) DEFAULT NULL,

`field8` varchar(255) DEFAULT NULL,

`field9` varchar(255) DEFAULT NULL,

`field10` varchar(255) DEFAULT NULL,

PRIMARY KEY (`id`)

) ENGINE=InnoDB DEFAULT CHARSET=utf8;

And generate the persistence entity, add @GeneratedValue annotation:

package com.gonwan.spring.generated;

import javax.persistence.*;

@Entity
@Table(name = "t_user", schema = "test", catalog = "")
public class TUser {
    private int id;
    private String field1;
    private String field2;
    private String field3;
    private String field4;
    private String field5;
    private String field6;
    private String field7;
    private String field8;
    private String field9;
    private String field10;

    @Id
    @Column(name = "id", nullable = false)
    @GeneratedValue(strategy = GenerationType.IDENTITY)
    /* mysql / table */
    //@GeneratedValue(strategy = GenerationType.TABLE, generator = "tableGenerator")
    //@TableGenerator(name = "tableGenerator", allocationSize = 100, table = "t_generator", pkColumnName = "gen_name", valueColumnName = "gen_value", pkColumnValue = "SEQ_USER")
    /* mariadb / sequence  */
    //@GeneratedValue(strategy = GenerationType.SEQUENCE, generator = "sequenceGenerator")
    //@SequenceGenerator(name = "sequenceGenerator", allocationSize = 100, sequenceName = "s_user")
    public int getId() {
        return id;
    }

    public void setId(int id) {
        this.id = id;
    }

    /* field getters/setters omitted. */

}

package com.gonwan.spring.generated;

import javax.persistence.*;

@Entity

@Table(name = "t_user", schema = "test", catalog = "")

public class TUser {

private int id;

private String field1;

private String field2;

private String field3;

private String field4;

private String field5;

private String field6;

private String field7;

private String field8;

private String field9;

private String field10;

@Id

@Column(name = "id", nullable = false)

@GeneratedValue(strategy = GenerationType.IDENTITY)

/* mysql / table */

//@GeneratedValue(strategy = GenerationType.TABLE, generator = "tableGenerator")

//@TableGenerator(name = "tableGenerator", allocationSize = 100, table = "t_generator", pkColumnName = "gen_name", valueColumnName = "gen_value", pkColumnValue = "SEQ_USER")

/* mariadb / sequence */

//@GeneratedValue(strategy = GenerationType.SEQUENCE, generator = "sequenceGenerator")

//@SequenceGenerator(name = "sequenceGenerator", allocationSize = 100, sequenceName = "s_user")

public int getId() {

return id;

}

public void setId(int id) {

this.id = id;

}

/* field getters/setters omitted. */

}

My benchmark runs to batch insert 2000 records in 1/2/4/8/16/32 concurrent threads.

1. IDENTITY

When using GenerationType.IDENTITY, result looks like:

Finished: threads=1, records_per_threads=2000, duration_in_ms=823
Finished: threads=2, records_per_threads=2000, duration_in_ms=609
Finished: threads=4, records_per_threads=2000, duration_in_ms=1188
Finished: threads=8, records_per_threads=2000, duration_in_ms=2329
Finished: threads=16, records_per_threads=2000, duration_in_ms=4577
Finished: threads=32, records_per_threads=2000, duration_in_ms=9579

Finished: threads=1, records_per_threads=2000, duration_in_ms=823

Finished: threads=2, records_per_threads=2000, duration_in_ms=609

Finished: threads=4, records_per_threads=2000, duration_in_ms=1188

Finished: threads=8, records_per_threads=2000, duration_in_ms=2329

Finished: threads=16, records_per_threads=2000, duration_in_ms=4577

Finished: threads=32, records_per_threads=2000, duration_in_ms=9579

As mentioned, Hibernate/JPA disables batch insert when using IDENTITY. Look into org.hibernate.event.internal.AbstractSaveEventListener#saveWithGeneratedId() for details. To make it clear, it DOES run faster when insert multiple entities in one transaction than in separated transactions. It saves transaction overhead, not round-trip overhead.

The generated key is eventually retrieved from java.sql.Statement#getGeneratedKeys(). And datasource-proxy is used to display the underlining SQL generated.

2. TABLE

Now switch to GenerationType.TABLE. Just uncomment the corresponding @GeneratedValue and @TableGenerator annotation. Result looks like:

Finished: threads=1, records_per_threads=2000, duration_in_ms=830
Finished: threads=2, records_per_threads=2000, duration_in_ms=854
Finished: threads=4, records_per_threads=2000, duration_in_ms=1775
Finished: threads=8, records_per_threads=2000, duration_in_ms=3479
Finished: threads=16, records_per_threads=2000, duration_in_ms=6542
Finished: threads=32, records_per_threads=2000, duration_in_ms=13768

Finished: threads=1, records_per_threads=2000, duration_in_ms=830

Finished: threads=2, records_per_threads=2000, duration_in_ms=854

Finished: threads=4, records_per_threads=2000, duration_in_ms=1775

Finished: threads=8, records_per_threads=2000, duration_in_ms=3479

Finished: threads=16, records_per_threads=2000, duration_in_ms=6542

Finished: threads=32, records_per_threads=2000, duration_in_ms=13768

To fix Hibernate deprecation warning and get better performance, add the line to application.properties:

spring.jpa.hibernate.use-new-id-generator-mappings=true

1	spring.jpa.hibernate.use-new-id-generator-mappings=true

I began to think that was the whole story for batch, and the datasource-proxy interceptor also traced down the batch SQL. But after I looked into dumped TCP packages using wireshark, I found the final SQL was still not in batch format. Say, they were in:

insert into `t_user` (field1, ...) values ('value1_1', ...);
insert into `t_user` (field1, ...) values ('value1_2', ...);
insert into `t_user` (field1, ...) values ('value1_3', ...);

insert into `t_user` (field1, ...) values ('value1_1', ...);

insert into `t_user` (field1, ...) values ('value1_2', ...);

insert into `t_user` (field1, ...) values ('value1_3', ...);

Instead of:

insert into `t_user` (field1, ...) values ('value1_1', ...), ('value1_2', ...), ('value1_3', ...);

1	insert into `t_user` (field1, ...) values ('value1_1', ...), ('value1_2', ...), ('value1_3', ...);

The latter one saves client/server round-trips and is recommended by MySQL. After adding rewriteBatchedStatements=true to my connection string, MySQL generated batch statements and result was much improved:

Finished: threads=1, records_per_threads=2000, duration_in_ms=433
Finished: threads=2, records_per_threads=2000, duration_in_ms=409
Finished: threads=4, records_per_threads=2000, duration_in_ms=708
Finished: threads=8, records_per_threads=2000, duration_in_ms=1566
Finished: threads=16, records_per_threads=2000, duration_in_ms=2926
Finished: threads=32, records_per_threads=2000, duration_in_ms=6388

Finished: threads=1, records_per_threads=2000, duration_in_ms=433

Finished: threads=2, records_per_threads=2000, duration_in_ms=409

Finished: threads=4, records_per_threads=2000, duration_in_ms=708

Finished: threads=8, records_per_threads=2000, duration_in_ms=1566

Finished: threads=16, records_per_threads=2000, duration_in_ms=2926

Finished: threads=32, records_per_threads=2000, duration_in_ms=6388

3. SEQUENCE

Last switch to GenerationType.SEQUENCE. Sequence is a new feature added in MariaDB 10.3 series. Create a sequence in MariaDB with:

CREATE SEQUENCE `s_user` START WITH 1 INCREMENT BY 100;

1	CREATE SEQUENCE `s_user` START WITH 1 INCREMENT BY 100;

Generally, the increment should match the one specified in @SequenceGenerator, at least >= allocationSize. See org.hibernate.id.enhanced.PooledOptimizer#generate().

Hibernate apparently does not support the new feature, I dealt with it by adding a new dialect:

package com.gonwan.spring;

import org.hibernate.dialect.MySQL5Dialect;

/*
 * Copied from org.hibernate.dialect.PostgreSQL81Dialect.
 */
public class MariaDB103Dialect extends MySQL5Dialect {

    @Override
    public boolean supportsSequences() {
        return true;
    }

    @Override
    public boolean supportsPooledSequences() {
        return true;
    }

    @Override
    public String getSequenceNextValString(String sequenceName) {
        return "select " + getSelectSequenceNextValString(sequenceName);
    }

    @Override
    public String getSelectSequenceNextValString(String sequenceName) {
        return "nextval (`" + sequenceName + "`)";
    }

}

package com.gonwan.spring;

import org.hibernate.dialect.MySQL5Dialect;

* Copied from org.hibernate.dialect.PostgreSQL81Dialect.

public class MariaDB103Dialect extends MySQL5Dialect {

@Override

public boolean supportsSequences() {

return true;

}

@Override

public boolean supportsPooledSequences() {

return true;

}

@Override

public String getSequenceNextValString(String sequenceName) {

return "select " + getSelectSequenceNextValString(sequenceName);

}

@Override

public String getSelectSequenceNextValString(String sequenceName) {

return "nextval (`" + sequenceName + "`)";

}

And add configuration:

spring.jpa.properties.hibernate.dialect=com.gonwan.spring.MariaDB103Dialect

1	spring.jpa.properties.hibernate.dialect=com.gonwan.spring.MariaDB103Dialect

supportsSequences() adds the sequence support. supportsPooledSequences() adds some pool-like optimization both supported by MariaDB and Hibernate. Otherwise, Hibernate uses tables to mimic sequences. Refer to org.hibernate.id.enhanced.SequenceStyleGenerator#buildDatabaseStructure(). Result with and without batch:

# without batch
Finished: threads=1, records_per_threads=2000, duration_in_ms=723
Finished: threads=2, records_per_threads=2000, duration_in_ms=615
Finished: threads=4, records_per_threads=2000, duration_in_ms=1147
Finished: threads=8, records_per_threads=2000, duration_in_ms=2195
Finished: threads=16, records_per_threads=2000, duration_in_ms=4687
Finished: threads=32, records_per_threads=2000, duration_in_ms=9312
# with batch
Finished: threads=1, records_per_threads=2000, duration_in_ms=298
Finished: threads=2, records_per_threads=2000, duration_in_ms=155
Finished: threads=4, records_per_threads=2000, duration_in_ms=186
Finished: threads=8, records_per_threads=2000, duration_in_ms=356
Finished: threads=16, records_per_threads=2000, duration_in_ms=695
Finished: threads=32, records_per_threads=2000, duration_in_ms=1545

# without batch

Finished: threads=1, records_per_threads=2000, duration_in_ms=723

Finished: threads=2, records_per_threads=2000, duration_in_ms=615

Finished: threads=4, records_per_threads=2000, duration_in_ms=1147

Finished: threads=8, records_per_threads=2000, duration_in_ms=2195

Finished: threads=16, records_per_threads=2000, duration_in_ms=4687

Finished: threads=32, records_per_threads=2000, duration_in_ms=9312

# with batch

Finished: threads=1, records_per_threads=2000, duration_in_ms=298

Finished: threads=2, records_per_threads=2000, duration_in_ms=155

Finished: threads=4, records_per_threads=2000, duration_in_ms=186

Finished: threads=8, records_per_threads=2000, duration_in_ms=356

Finished: threads=16, records_per_threads=2000, duration_in_ms=695

Finished: threads=32, records_per_threads=2000, duration_in_ms=1545

Dramatically improved when compared to the table generator. A sequence generator uses cache in memory(default 1000), and is optimized to eliminate lock when generating IDs.

4. Summary

	1 thread	2 threads	4 threads	8 threads	16 threads	32 threads
IDENTITY	823	609	1188	2329	4577	9579
TABLE	830	854	1775	3479	6542	13768
TABLE with batch	433	409	708	1566	2926	6388
SEQUENCE	723	615	1147	2195	4687	9312
SEQUENCE with batch	298	155	186	356	695	1545

From the summary table, IDENTITY is simplest. TABLE is a compromise to support batch insert. And SEQUENCE yields the best performance. Find the entire project in Github.

Streaming MySQL Results Using Java 8 Streams

September 4, 2017 by gonwan·0 Comments

The article is inspired by the posts here and here.

There is a RESTful service as the infrastructure for data access in our team. It is based on Jersey/JAX-RS and runs fast. However, it consumes large memory when constructing large data set as response. Since it builds the entire response in memory before sending it.

As suggested in the above posts. Streaming is the solution. They integrated Hibernate or Spring Data for easy adoption. But I need a general purpose RESTful service, say, I do not know the schema of a table. So I decided to implement it myself using raw JDBC interface.

My class is so-called MysqlStreamTemplate:

It does not extend JdbcTemplate, since there is only one interface for streaming, not one series. I’m not writing a general purpose library.
It is MySQL only, I have no time to verify with other relation databases.
It does accept a DataSource as the parameter of the its constructor.
Staff like Hibernate session is not concerned, since it maintains Statement & Connection by itself.
Staff like @Transcational is not concerned, since we do not care about transactions. Actually, MySQL gives HOLD_CURSORS_OVER_COMMIT in StatementImpl#getResultSetHoldability() in its JDBC driver, saying that our ResultSet survives after commit.

So, here is my class. NOTE: closing our Statement & Connection requires explicit invoke of Stream#close():

import javax.sql.DataSource;
import java.io.Closeable;
import java.sql.Connection;
import java.sql.ResultSet;
import java.sql.SQLException;
import java.sql.Statement;
import java.util.HashMap;
import java.util.Map;
import java.util.Spliterator;
import java.util.Spliterators;
import java.util.function.Consumer;
import java.util.stream.Stream;
import java.util.stream.StreamSupport;

public class MysqlStreamTemplate {

    private DataSource dataSource;

    public MysqlStreamTemplate(DataSource dataSource) {
        this.dataSource = dataSource;
    }

    public Stream<Map> query(String sql) throws SQLException {
        return new MysqlStreamQuery().stream(sql);
    }

    class MysqlStreamQuery implements Closeable {

        private Connection connection;
        private Statement statement;

        public Stream<Map> stream(String sql) throws SQLException {
            connection = dataSource.getConnection();
            /*
             * MySQL ResultSets are completely retrieved and stored in memory (com.mysql.jdbc.RowDataStatic). Or
             * - Set useCursorFetch=true&defaultFetchSize=nnn in connection string (com.mysql.jdbc.RowDataCursor).
             * - Set resultSetType/resultSetConcurrency and fetchSize (Integer.MIN_VALUE) when creating statements (com.mysql.jdbc.RowDataDynamic).
             * See: https://dev.mysql.com/doc/connector-j/5.1/en/connector-j-reference-implementation-notes.html
             */
            /*
             * MySQL documents say nothing about cursor holdability, so not use it explicitly.
             */
            statement = connection.createStatement(ResultSet.TYPE_FORWARD_ONLY, ResultSet.CONCUR_READ_ONLY);
            statement.setFetchSize(Integer.MIN_VALUE);
            /* begin query */
            ResultSet rs = statement.executeQuery(sql);
            int columns = rs.getMetaData().getColumnCount();
            Map resultMap = new HashMap(columns);
            /* NOTE: Manually invoking of Stream.close() is required to close the MySQL statement and connection. */
            Stream<Map> resultStream = StreamSupport.stream(new Spliterators.AbstractSpliterator<Map>(Long.MAX_VALUE, Spliterator.ORDERED | Spliterator.NONNULL | Spliterator.IMMUTABLE) {
                @Override
                public boolean tryAdvance(Consumer<? super Map> action) {
                    try {
                        if (!rs.next()) {
                            return false;
                        }
                        resultMap.clear();
                        for (int i = 1; i <= columns; i++) {
                            resultMap.put(rs.getMetaData().getColumnLabel(i), rs.getObject(i));
                        }
                        action.accept(resultMap);
                        return true;
                    } catch (SQLException e) {
                        throw new RuntimeException(e);
                    }
                }
            }, false).onClose(() -> close());
            return resultStream;
        }

        @Override
        public void close() {
            if (statement != null) {
                try {
                    statement.close();
                } catch (SQLException e) {
                }
                statement = null;
            }
            if (connection != null) {
                try {
                    connection.close();
                } catch (SQLException e) {
                }
                connection = null;
            }
        }
    }

}

import javax.sql.DataSource;

import java.io.Closeable;

import java.sql.Connection;

import java.sql.ResultSet;

import java.sql.SQLException;

import java.sql.Statement;

import java.util.HashMap;

import java.util.Map;

import java.util.Spliterator;

import java.util.Spliterators;

import java.util.function.Consumer;

import java.util.stream.Stream;

import java.util.stream.StreamSupport;

public class MysqlStreamTemplate {

private DataSource dataSource;

public MysqlStreamTemplate(DataSource dataSource) {

this.dataSource = dataSource;

}

public Stream<Map> query(String sql) throws SQLException {

return new MysqlStreamQuery().stream(sql);

}

class MysqlStreamQuery implements Closeable {

private Connection connection;

private Statement statement;

public Stream<Map> stream(String sql) throws SQLException {

connection = dataSource.getConnection();

* MySQL ResultSets are completely retrieved and stored in memory (com.mysql.jdbc.RowDataStatic). Or

* - Set useCursorFetch=true&defaultFetchSize=nnn in connection string (com.mysql.jdbc.RowDataCursor).

* - Set resultSetType/resultSetConcurrency and fetchSize (Integer.MIN_VALUE) when creating statements (com.mysql.jdbc.RowDataDynamic).

* See: https://dev.mysql.com/doc/connector-j/5.1/en/connector-j-reference-implementation-notes.html

* MySQL documents say nothing about cursor holdability, so not use it explicitly.

statement = connection.createStatement(ResultSet.TYPE_FORWARD_ONLY, ResultSet.CONCUR_READ_ONLY);

statement.setFetchSize(Integer.MIN_VALUE);

/* begin query */

ResultSet rs = statement.executeQuery(sql);

int columns = rs.getMetaData().getColumnCount();

Map resultMap = new HashMap(columns);

/* NOTE: Manually invoking of Stream.close() is required to close the MySQL statement and connection. */

Stream<Map> resultStream = StreamSupport.stream(new Spliterators.AbstractSpliterator<Map>(Long.MAX_VALUE, Spliterator.ORDERED | Spliterator.NONNULL | Spliterator.IMMUTABLE) {

@Override

public boolean tryAdvance(Consumer<? super Map> action) {

try {

if (!rs.next()) {

return false;

}

resultMap.clear();

for (int i = 1; i <= columns; i++) {

resultMap.put(rs.getMetaData().getColumnLabel(i), rs.getObject(i));

}

action.accept(resultMap);

return true;

} catch (SQLException e) {

throw new RuntimeException(e);

}

}, false).onClose(() -> close());

return resultStream;

}

@Override

public void close() {

if (statement != null) {

try {

statement.close();

} catch (SQLException e) {

}

statement = null;

}

if (connection != null) {

try {

connection.close();

} catch (SQLException e) {

}

connection = null;

}

Read inline comments for additional details. Now the response entry and controller mapping:

import java.util.Map;
import java.util.stream.Stream;

public class ApiStreamResponse extends Response {

    /* requires jackson-datatype-jdk8 2.9.0 */
    private Stream<Map> result;

    public ApiStreamResponse(Stream<Map> result) {
        this.result = result;
    }

    public Stream<Map> getResult() {
        return result;
    }

    public void setResult(Stream<Map> result) {
        this.result = result;
    }

}

import java.util.Map;

import java.util.stream.Stream;

public class ApiStreamResponse extends Response {

/* requires jackson-datatype-jdk8 2.9.0 */

private Stream<Map> result;

public ApiStreamResponse(Stream<Map> result) {

this.result = result;

}

public Stream<Map> getResult() {

return result;

}

public void setResult(Stream<Map> result) {

this.result = result;

}

import org.slf4j.Logger;
import org.slf4j.LoggerFactory;
import org.springframework.beans.factory.annotation.Autowired;
import org.springframework.http.MediaType;
import org.springframework.web.bind.annotation.RequestMapping;
import org.springframework.web.bind.annotation.RequestMethod;
import org.springframework.web.bind.annotation.RestController;

import java.util.Map;
import java.util.concurrent.Callable;
import java.util.stream.Stream;

@RequestMapping(path = "/api")
@RestController
public class ApiController {

    private static final Logger logger = LoggerFactory.getLogger(ApiController.class);

    @Autowired
    private MysqlClient mysqlClient;

    @RequestMapping(path = "/v1", method = RequestMethod.GET, produces = MediaType.APPLICATION_JSON_VALUE)
    public Callable<ApiResponse> getV1() {
        return () -> {
            String r = mysqlClient.executeToJson(MysqlClient.SQL).getLeft();
            return new ApiResponse(r);
        };
    }

    @RequestMapping(path = "/v2", method = RequestMethod.GET, produces = MediaType.APPLICATION_JSON_VALUE)
    public Callable<ApiStreamResponse> getV2() {
        return () -> {
            Stream<Map> r = mysqlClient.executeToStream(MysqlClient.SQL);
            return new ApiStreamResponse(r);
        };
    }

}

import org.slf4j.Logger;

import org.slf4j.LoggerFactory;

import org.springframework.beans.factory.annotation.Autowired;

import org.springframework.http.MediaType;

import org.springframework.web.bind.annotation.RequestMapping;

import org.springframework.web.bind.annotation.RequestMethod;

import org.springframework.web.bind.annotation.RestController;

import java.util.Map;

import java.util.concurrent.Callable;

import java.util.stream.Stream;

@RequestMapping(path = "/api")

@RestController

public class ApiController {

private static final Logger logger = LoggerFactory.getLogger(ApiController.class);

@Autowired

private MysqlClient mysqlClient;

@RequestMapping(path = "/v1", method = RequestMethod.GET, produces = MediaType.APPLICATION_JSON_VALUE)

public Callable<ApiResponse> getV1() {

return () -> {

String r = mysqlClient.executeToJson(MysqlClient.SQL).getLeft();

return new ApiResponse(r);

};

}

@RequestMapping(path = "/v2", method = RequestMethod.GET, produces = MediaType.APPLICATION_JSON_VALUE)

public Callable<ApiStreamResponse> getV2() {

return () -> {

Stream<Map> r = mysqlClient.executeToStream(MysqlClient.SQL);

return new ApiStreamResponse(r);

};

}

Complete code can be find on my GitHub repository.

My simple benchmark script looks like:

# ab -c 30 -n 3000 http://localhost:5050/api

1	# ab -c 30 -n 3000 http://localhost:5050/api

Dramatic improvements in memory usage as shown in jconsole, especially Old Gen:
all_memory
old_gen_memory

Some raw data from jmap:

Jersey

Heap Usage:
PS Young Generation
Eden Space:
   capacity = 1529348096 (1458.5MB)
   used     = 28027008 (26.7286376953125MB)
   free     = 1501321088 (1431.7713623046875MB)
   1.8326114292295166% used
From Space:
   capacity = 124780544 (119.0MB)
   used     = 36331368 (34.648292541503906MB)
   free     = 88449176 (84.3517074584961MB)
   29.116212219751183% used
To Space:
   capacity = 127926272 (122.0MB)
   used     = 0 (0.0MB)
   free     = 127926272 (122.0MB)
   0.0% used
PS Old Generation
   capacity = 1499987968 (1430.5MB)
   used     = 946428384 (902.5844421386719MB)
   free     = 553559584 (527.9155578613281MB)
   63.09573171189597% used

12833 interned Strings occupying 1401840 bytes.

Heap Usage:

PS Young Generation

Eden Space:

capacity = 1529348096 (1458.5MB)

used = 28027008 (26.7286376953125MB)

free = 1501321088 (1431.7713623046875MB)

1.8326114292295166% used

From Space:

capacity = 124780544 (119.0MB)

used = 36331368 (34.648292541503906MB)

free = 88449176 (84.3517074584961MB)

29.116212219751183% used

To Space:

capacity = 127926272 (122.0MB)

used = 0 (0.0MB)

free = 127926272 (122.0MB)

0.0% used

PS Old Generation

capacity = 1499987968 (1430.5MB)

used = 946428384 (902.5844421386719MB)

free = 553559584 (527.9155578613281MB)

63.09573171189597% used

12833 interned Strings occupying 1401840 bytes.

Spring Boot

Heap Usage:
PS Young Generation
Eden Space:
   capacity = 1494745088 (1425.5MB)
   used     = 611063008 (582.7550964355469MB)
   free     = 883682080 (842.7449035644531MB)
   40.88075036377039% used
From Space:
   capacity = 135266304 (129.0MB)
   used     = 135146784 (128.88601684570312MB)
   free     = 119520 (0.113983154296875MB)
   99.91164096566133% used
To Space:
   capacity = 156762112 (149.5MB)
   used     = 0 (0.0MB)
   free     = 156762112 (149.5MB)
   0.0% used
PS Old Generation
   capacity = 1534066688 (1463.0MB)
   used     = 525509264 (501.16468811035156MB)
   free     = 1008557424 (961.8353118896484MB)
   34.25595954274447% used

21280 interned Strings occupying 2592280 bytes.

Heap Usage:

PS Young Generation

Eden Space:

capacity = 1494745088 (1425.5MB)

used = 611063008 (582.7550964355469MB)

free = 883682080 (842.7449035644531MB)

40.88075036377039% used

From Space:

capacity = 135266304 (129.0MB)

used = 135146784 (128.88601684570312MB)

free = 119520 (0.113983154296875MB)

99.91164096566133% used

To Space:

capacity = 156762112 (149.5MB)

used = 0 (0.0MB)

free = 156762112 (149.5MB)

0.0% used

PS Old Generation

capacity = 1534066688 (1463.0MB)

used = 525509264 (501.16468811035156MB)

free = 1008557424 (961.8353118896484MB)

34.25595954274447% used

21280 interned Strings occupying 2592280 bytes.

Spring Boot with Streams

Heap Usage:
PS Young Generation
Eden Space:
   capacity = 1787297792 (1704.5MB)
   used     = 127132192 (121.24270629882812MB)
   free     = 1660165600 (1583.2572937011719MB)
   7.1130951187344165% used
From Space:
   capacity = 1048576 (1.0MB)
   used     = 557056 (0.53125MB)
   free     = 491520 (0.46875MB)
   53.125% used
To Space:
   capacity = 1048576 (1.0MB)
   used     = 0 (0.0MB)
   free     = 1048576 (1.0MB)
   0.0% used
PS Old Generation
   capacity = 1515192320 (1445.0MB)
   used     = 34598904 (32.99608612060547MB)
   free     = 1480593416 (1412.0039138793945MB)
   2.2834661675159493% used

21326 interned Strings occupying 2597800 bytes.

Heap Usage:

PS Young Generation

Eden Space:

capacity = 1787297792 (1704.5MB)

used = 127132192 (121.24270629882812MB)

free = 1660165600 (1583.2572937011719MB)

7.1130951187344165% used

From Space:

capacity = 1048576 (1.0MB)

used = 557056 (0.53125MB)

free = 491520 (0.46875MB)

53.125% used

To Space:

capacity = 1048576 (1.0MB)

used = 0 (0.0MB)

free = 1048576 (1.0MB)

0.0% used

PS Old Generation

capacity = 1515192320 (1445.0MB)

used = 34598904 (32.99608612060547MB)

free = 1480593416 (1412.0039138793945MB)

2.2834661675159493% used

21326 interned Strings occupying 2597800 bytes.

0x2B|~0x2B

My broken wings still strong enough to cross the ocean with.

Tag: Spring

Benchmark for Web Frameworks

Environment 1

Environment 2

A Spring Cloud Toy Project

Introduction

Tested Dependencies

Building Docker Images

Running Docker Compose

Running Kubernetes

Use Cases

Get OAuth2 token

Get organization info

Get license info associated with organization info

Distributed Tracing via Zipkin

Working with OAuth2

Swagger Integration

Spring Boot Admin Integration

Batch Insert with MySQL

1. IDENTITY

2. TABLE

3. SEQUENCE

4. Summary

Streaming MySQL Results Using Java 8 Streams