[CONJ-26] [Feature Request] Implement configurable fetch size and fetch direction for Statement/ResultSet - Jira

Details

Type: Epic
Status: Closed (View Workflow)
Priority: Minor
Resolution: Fixed
Affects Version/s: 1.1.0
Fix Version/s: 1.4.0
Component/s: Other
Labels:
None

Epic Name:
Implement multiple row fetching
Sprint:
Sprint connector/j 1.3.0

Description

As discussed earlier, currently fetch size is "one or all". It would be good to implement it fully.
Also, setFetchDirection for ResultSet and Statement are currently just stubs.

Attachments

Issue Links

duplicates

CONJ-125 Optimize cached ResultSet memory footprint

Closed

is blocked by

CONJ-22 Java Client library does not support useServerPrepStmts

Closed

is duplicated by

CONJ-173 [Feature Request] Implement a mechanism to bypass result set caching for streamed results

Closed

Activity

Ascending order - Click to sort in descending order

Elena Stepanova created issue - 2013-02-14 22:17

Elena Stepanova made changes - 2013-02-14 22:17

Field	Original Value	New Value
Link		This issue relates to TODO-338 [ TODO-338 ]

Vladislav Vaintroub made changes - 2013-03-07 22:14

Fix Version/s		jdbc-1.1.2 [ 12801 ]
Fix Version/s	jdbc-1.1.1 [ 12500 ]

Vladislav Vaintroub added a comment - 2013-04-28 20:15

I think fetch direction can safely be ignored. after looking again at spec. this is a hint to the driver, which has no visible effects during runtime (i.e ResultSet.next() will still move forward, and previous() will move backward). We cannot use this hint - to move backwards with ResultSet.previous(), we have to read and cache the whole result, no way around it.

Vladislav Vaintroub added a comment - 2013-04-28 20:15 I think fetch direction can safely be ignored. after looking again at spec. this is a hint to the driver, which has no visible effects during runtime (i.e ResultSet.next() will still move forward, and previous() will move backward). We cannot use this hint - to move backwards with ResultSet.previous(), we have to read and cache the whole result, no way around it.

Elena Stepanova made changes - 2013-11-12 16:48

Assignee

Vladislav Vaintroub [ wlad ]

Georg Richter [ georg ]

Rasmus Johansson (Inactive) made changes - 2014-06-19 15:16

Workflow

defaullt [ 26218 ]

MariaDB v2 [ 47813 ]

Rasmus Johansson (Inactive) made changes - 2014-09-22 17:48

Workflow

MariaDB v2 [ 47813 ]

MariaDB connectors [ 54896 ]

Rasmus Johansson (Inactive) made changes - 2015-06-23 14:15

Workflow

MariaDB connectors [ 54896 ]

MariaDB v3 [ 70154 ]

Diego Dupin made changes - 2015-07-23 11:35

Assignee

Georg Richter [ georg ]

diego dupin [ diego dupin ]

Diego Dupin made changes - 2015-07-23 11:35

Fix Version/s

1.2.1 [ 19602 ]

Julien Fritsch made changes - 2015-07-23 11:49

Fix Version/s

1.1.2 [ 12801 ]

Diego Dupin made changes - 2015-07-29 16:44

Link

This issue is duplicated by ~~CONJ-173~~ [ ~~CONJ-173~~ ]

Julien Fritsch made changes - 2015-07-30 14:05

Link

This issue duplicates ~~CONJ-125~~ [ ~~CONJ-125~~ ]

Julien Fritsch made changes - 2015-07-30 14:06

Link

This issue is blocked by ~~CONJ-22~~ [ ~~CONJ-22~~ ]

Julien Fritsch made changes - 2015-07-30 14:06

Issue Type

Task [ 3 ]

Epic [ 5 ]

Julien Fritsch made changes - 2015-07-30 14:07

Epic Child

~~CONJ-138~~ [ 50119 ]

Diego Dupin made changes - 2015-08-04 17:13

Status

Open [ 1 ]

In Progress [ 3 ]

Diego Dupin made changes - 2015-08-06 18:55

Epic Name		Implement multiple row fetching
Sprint		Sprint 1 [ 11 ]

Diego Dupin made changes - 2015-09-15 16:10

Status

In Progress [ 3 ]

Stalled [ 10000 ]

Diego Dupin made changes - 2015-09-15 16:10

Fix Version/s		1.4.0 [ 19606 ]
Fix Version/s	1.3.0 [ 19602 ]

Paolo Bazzi added a comment - 2016-02-24 17:53

+1 for solving this issue....
We run into a similiar problem while reading a large set of data from a MariaDB and were forced to switch to the streamed mode, since we ran into OutOfMemory exceptions when fetching all records at once (~8m records). Other JDBC drivers (like Oracle) only fetch the records, when iterating over the result set and therefore require a lot less of memory if the records are processed in a loop and then discarded.
It would be great to support a configurable fetch size instead of force the user to decide wether to read all or nothing.

Paolo Bazzi added a comment - 2016-02-24 17:53 +1 for solving this issue.... We run into a similiar problem while reading a large set of data from a MariaDB and were forced to switch to the streamed mode, since we ran into OutOfMemory exceptions when fetching all records at once (~8m records). Other JDBC drivers (like Oracle) only fetch the records, when iterating over the result set and therefore require a lot less of memory if the records are processed in a loop and then discarded. It would be great to support a configurable fetch size instead of force the user to decide wether to read all or nothing.

Diego Dupin added a comment - 2016-02-24 17:56

Some good news Paolo : that's in the roadmap for next version 1.4.0.

Diego Dupin added a comment - 2016-02-24 17:56 Some good news Paolo : that's in the roadmap for next version 1.4.0.

Vladislav Vaintroub added a comment - 2016-02-24 17:57 - edited

bazzip ,alas, is no principal difference between streamed mode and configurable fetch size. streaming mode requires least memory though
Are you unhappy to be "forced" into this mode?

Vladislav Vaintroub added a comment - 2016-02-24 17:57 - edited bazzip ,alas, is no principal difference between streamed mode and configurable fetch size. streaming mode requires least memory though Are you unhappy to be "forced" into this mode?

Paolo Bazzi added a comment - 2016-02-24 18:10

@Diego very nice to hear!

@Vladislav
Two issues with the streamed mode

We use shared java code which is executed on both MariaDB and Oracle databases (some kind of data replication). The fetch size is set for each statement according to business logic and expected statement result size. With this setup we run into OutOfMemoryException problems with large data sets and the MariaDB JDBC driver, since the driver tried to load all data into memory. We were forced to implement a "if oracle then use fetchSize else use Integer.MIN_VALUE" Hack to solve the problem
I would expect a performance gain using an adequate fetch size instead of the streaming mode, which requires a JDBC driver <-> database server network round trip for each fetched result row

Paolo Bazzi added a comment - 2016-02-24 18:10 @Diego very nice to hear! @Vladislav Two issues with the streamed mode We use shared java code which is executed on both MariaDB and Oracle databases (some kind of data replication). The fetch size is set for each statement according to business logic and expected statement result size. With this setup we run into OutOfMemoryException problems with large data sets and the MariaDB JDBC driver, since the driver tried to load all data into memory. We were forced to implement a "if oracle then use fetchSize else use Integer.MIN_VALUE" Hack to solve the problem I would expect a performance gain using an adequate fetch size instead of the streaming mode, which requires a JDBC driver <-> database server network round trip for each fetched result row

Vladislav Vaintroub added a comment - 2016-02-24 18:32

I agree on portability, but I doubt you will gain any performance

the driver does exactly the same amount of network reads, and the server the same amount of writes.
The server writes whole result set, the client reads the whole result set, Oracle may and actually does perform very differently.

Vladislav Vaintroub added a comment - 2016-02-24 18:32 I agree on portability, but I doubt you will gain any performance the driver does exactly the same amount of network reads, and the server the same amount of writes. The server writes whole result set, the client reads the whole result set, Oracle may and actually does perform very differently.

Diego Dupin made changes - 2016-04-04 09:13

Fix Version/s		1.5.0 [ 19607 ]
Fix Version/s	1.4.0 [ 19606 ]

Diego Dupin made changes - 2016-04-04 15:34

Fix Version/s		1.4.0 [ 19606 ]
Fix Version/s	1.5.0 [ 19607 ]

Diego Dupin added a comment - 2016-04-04 20:43

This is now implemented on version 1.4.0.

Like Vladislav say, since all datas have to be read, performance doesn't change a lot,

JMH results (source https://codeshare.io/OlBRO) when streaming 100,000 rows

Bench.fetchSizeBy1000 : 50.274 ± 0.206 ms/op (read with fetch size 1000)
Bench.fetchSizeAll : 50.593 ± 0.252 ms/op (read all data)
Bench.fetchSizeOneByOne : 51.641 ± 0.299 ms/op (fetch one by one)

No big difference, but avoiding to create a big buffer permit to gain a small 1%, ( and avoid loading all in memory)

Diego Dupin added a comment - 2016-04-04 20:43 This is now implemented on version 1.4.0. Like Vladislav say, since all datas have to be read, performance doesn't change a lot, JMH results (source https://codeshare.io/OlBRO ) when streaming 100,000 rows Bench.fetchSizeBy1000 : 50.274 ± 0.206 ms/op (read with fetch size 1000) Bench.fetchSizeAll : 50.593 ± 0.252 ms/op (read all data) Bench.fetchSizeOneByOne : 51.641 ± 0.299 ms/op (fetch one by one) No big difference, but avoiding to create a big buffer permit to gain a small 1%, ( and avoid loading all in memory)

Diego Dupin made changes - 2016-04-04 20:43

Component/s		Other [ 12201 ]
Resolution		Fixed [ 1 ]
Status	Stalled [ 10000 ]	Closed [ 6 ]

Sergei Golubchik made changes - 2021-12-06 21:27

Workflow

MariaDB v3 [ 70154 ]

MariaDB v4 [ 134672 ]

People

Assignee:: Diego Dupin

Reporter:: Elena Stepanova

Votes:: 0 Vote for this issue

Watchers:: 5 Start watching this issue

Dates

Created:: 2013-02-14 22:17

Updated:: 2016-04-04 20:43

Resolved:: 2016-04-04 20:43

Git Integration

Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.

MariaDB Connector/J

Details

Description

Attachments

Issue Links

Activity

People

Dates

Git Integration