[MDEV-8100] Table functions (aka SQL functions returning tables) - Jira

Details

Type: Task
Status: Stalled (View Workflow)
Priority: Major
Resolution: Unresolved
Fix Version/s: None
Component/s: Data Definition - Procedure
Labels:
- gsoc15
- gsoc16

Description

This is a part of MDEV-5199 (Table functions) which addresses table functions that are defined in SQL (as opposed to table functions that are implemented in C++ and loaded as .so/.dlls).

~~This task is a GSOC '15 project of dj~~ not anymore

Proposed syntax

(note: the following differs from description in MDEV-5199, and this is the latest text).

We will use SQL Server's variant of syntax, where CREATE FUNCTION explicitly mentions function name. To define a table function one will use:

CREATE FUNCTION func_name (args)  RETURNS TABLE table_name (column type [, column type ...])

Inside function body, table table_name is visible as a temporary table. The table is initially empty. The code of the function is expected to insert rows into table_name. When the function finishes, the table will be returned to outside as the function's result.

Implementation overview

Table function will create and populate a temporary table (either HEAP or Aria/MyISAM, just like it is done for other kinds of temporary tables). Once the table function has finished populating the temporary table, it will be returned to the statement that invoked the table function. The caller statement will then read the table.

In other words, we will not support any kind of "streaming" for results of table function.

A possible optimization

Trivial functions (with the body of a single INSERT ... SELECT) can be treated as a VIEW. It is not clear whether this should be implemented in the scope of this WL entry.

Other details

Constant parameters

Table functions must have constant parameters. One can't use things like

select * from tbl1,  table_func(tbl1.column) where ...

This is because our implementation will pre-populate table function tables once per query execution. MySQL/MariaDB optimizer doesn't support tables that may change their contents during query execution.

Parser

The parser should support all proposed syntax.
note: the table can only be addressed through an alias. This syntax is not
allowed:

SELECT table_func(1,2).column FROM table_func(1,2)

one must use

SELECT T_ALIAS.column FROM table_func(1,2) as T_ALIAS

Name resolution/preparation in the upper query

When we find a table reference in the FROM clause, we should make
a lookup in mysql.proc, make appropriate checks, then open and parse
the function.
We should be able to take the "RETURNS ...." statement and produce a TABLE
object from it. This is needed for doing name resolution on the other parts
of the "caller" query .

Name resolution/preparation in the Stored Function

When one defines a stored function:

CREATE FUNCTION func_name (args)  RETURNS TABLE table_name (column type [, column type ...])

BEGIN

   stmt1;

   stmt2;

END

All statements inside the body of the function (i.e. stmt1, stmt2, etc) must use a modified name resolution process where table table_name resolves to the table that will be returned by the function. This may shadow the real table with name table_name.

mysql.proc contents

How does table functions should be stored in mysql.proc table. Are they a special kind of

Execution

"SELECT sp_func()" should return error for table functions

Attachments

Issue Links

duplicates

MDEV-10115 Processing Data Sets Within A stored Procedure

Closed

MDEV-15077 Insert from 'call proc', returning rows

Closed

includes

MDEV-8342 Support SELECT ... FROM table_func(...)" in the parser

Closed

MDEV-8343 Support table function declarations in the parser

Closed

MDEV-8344 Table functions: make name resolution see the return_table.

Closed

MDEV-8356 Table functions: Make sure CREATE FUNCTION works for table-returning functions

Closed

MDEV-8389 Table functions: create the result table and return it to the function caller

Closed

MDEV-8649 Table functions: SHOW FUNCTION STATUS gives an error

Closed

MDEV-8650 Table function: DROP FUNCTION doesn't work as expected

Closed

is part of

MDEV-10137 Providing compatibility to other databases

Open

MDEV-11070 Providing compatibility to other databases - Phase 2

Open

relates to

MDEV-5199 Table functions (a.k.a UDF returning arrays / result set)

Open

MDEV-7535 query the metadata of a dynamic column

Open

MDEV-15457 Pipe function technique

Stalled

(4 includes, 2 is part of, 3 relates to)

Activity

Ascending order - Click to sort in descending order

View 4 older comments

Sergei Petrunia added a comment - 2015-06-22 23:44

Updated the description

Sergei Petrunia added a comment - 2015-06-22 23:44 Updated the description

Sergei Petrunia added a comment - 2015-06-27 20:42

The tree for this project is here: https://github.com/djdij123/server.git

Sergei Petrunia added a comment - 2015-06-27 20:42 The tree for this project is here: https://github.com/djdij123/server.git

Sergei Petrunia added a comment - 2015-11-26 21:44 - edited

Closed the subtasks as nobody is working on the patch currently. This task itself remains open in case somebody wants to pick it up.

Sergei Petrunia added a comment - 2015-11-26 21:44 - edited Closed the subtasks as nobody is working on the patch currently. This task itself remains open in case somebody wants to pick it up.

Alexander Barkov added a comment - 2017-03-14 03:43

Please consider implementing tables as a new data type, so one can declare variables of the new TABLE data type and pass those variables as arguments to stored routines. This is needed for the Oracle compatibility project.

Something like this would be nice:

DELIMITER $$

CREATE PROCEDURE p1(t1 TABLE(a INT, b VARCHAR(32))

BEGIN

  SELECT * FROM t1;

  INSERT INTO t2 SELECT * FROM t1;

END;

$$

CREATE PROCEDURE p2

BEGIN

  DECLARE t1 TABLE(a INT, b VARCHAR(32));

  INSERT INTO t1 VALUES (1,'b1');

  CALL p1(t1);

END;

$$

CALL p2();

Alexander Barkov added a comment - 2017-03-14 03:43 Please consider implementing tables as a new data type, so one can declare variables of the new TABLE data type and pass those variables as arguments to stored routines. This is needed for the Oracle compatibility project. Something like this would be nice: DELIMITER $$ CREATE PROCEDURE p1(t1 TABLE (a INT , b VARCHAR (32)) BEGIN SELECT * FROM t1; INSERT INTO t2 SELECT * FROM t1; END ; $$ CREATE PROCEDURE p2 BEGIN DECLARE t1 TABLE (a INT , b VARCHAR (32)); INSERT INTO t1 VALUES (1, 'b1' ); CALL p1(t1); END ; $$ CALL p2();

Alexander Barkov added a comment - 2017-03-14 03:52

Note, the bb-10.2-compatibility tree has a support for the ROW data type. The ROW data type can be used for local variables and parameter.

See "MDEV-10914 ROW data type for stored routine variables" for details, as well as tests in mysql-test/t/sp-row.test.

Adding the TABLE data type can reuse ideas from the patch implementing ~~MDEV-10914~~.

Note, returning ROW variables from functions is not implemented yet. But this is mostly because of the parser related challenges, as as soon as we return ROW from a function, we need to support this:

SELECT f1().x FROM DUAL;

Otherwise adding ROW as a function return data type is easy. So there should not be any problems to implement TABLE as a function return value as soon as you have TABLE as a variable/parameter data type.

Alexander Barkov added a comment - 2017-03-14 03:52 Note, the bb-10.2-compatibility tree has a support for the ROW data type. The ROW data type can be used for local variables and parameter. See " MDEV-10914 ROW data type for stored routine variables " for details, as well as tests in mysql-test/t/sp-row.test . Adding the TABLE data type can reuse ideas from the patch implementing MDEV-10914 . Note, returning ROW variables from functions is not implemented yet. But this is mostly because of the parser related challenges, as as soon as we return ROW from a function, we need to support this: SELECT f1().x FROM DUAL; Otherwise adding ROW as a function return data type is easy. So there should not be any problems to implement TABLE as a function return value as soon as you have TABLE as a variable/parameter data type.

MariaDB Server

Table functions (aka SQL functions returning tables)