[MCOL-1805] Remote mcsimport tool is throwing warning message: Column size of input file is higher than the column size of the target table. Remaining csv columns won't be injected. Created: 2018-10-15 Updated: 2023-10-26 Resolved: 2018-11-14 |
|
| Status: | Closed |
| Project: | MariaDB ColumnStore |
| Component/s: | None |
| Affects Version/s: | 1.2.0 |
| Fix Version/s: | Icebox |
| Type: | Bug | Priority: | Major |
| Reporter: | Zdravelina Sokolovska (Inactive) | Assignee: | Jens Röwekamp (Inactive) |
| Resolution: | Not a Bug | Votes: | 0 |
| Labels: | None | ||
| Environment: |
linux |
||
| Issue Links: |
|
||||||||||||
| Description |
|
The remote mcsimport tool is throwing the warning message: Column size of input file is higher than the column size of the target table. Remaining csv columns won't be injected. A.
B. The same file loaded with the remote mcsimport tool
|
| Comments |
| Comment by Andrew Hutchings (Inactive) [ 2018-10-15 ] |
|
Can you please provide a test case for this so we can reproduce it? |
| Comment by Jens Röwekamp (Inactive) [ 2018-10-29 ] |
|
I don't think that's a bug in mcsimport but a bug in tpcds. tpcds creates a table with 9 columns via:
but wants to inject a csv file with 10 columns:
The last pipe is interpreted by mcsimport as a 10th column with only NULL values. Therefore, it warns the customer that the input file has more columns than the target ColumnStore table and only injects the first 9 columns. |
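The two readings of a trailing pipe can be illustrated with a small sketch. This is not the parsing code of mcsimport or cpimport; the function names and the sample row are hypothetical and only show why the same line can be counted as either 9 or 10 columns.

```python
def split_as_separator(line, delimiter="|"):
    """Treat the delimiter strictly as a separator between fields.

    A trailing delimiter therefore produces one extra, empty last field.
    """
    return line.rstrip("\n").split(delimiter)


def split_as_terminator(line, delimiter="|"):
    """Treat the delimiter as a field terminator.

    A single empty field after the last delimiter is dropped.
    """
    fields = split_as_separator(line, delimiter)
    if fields and fields[-1] == "":
        fields = fields[:-1]
    return fields


# Hypothetical row: 9 values, each followed by a pipe.
row = "1|2|3|4|5|6|7|8|9|\n"

print(len(split_as_separator(row)))   # 10
print(len(split_as_terminator(row)))  # 9
```

Under the separator reading the row has a 10th, empty field, which matches the warning described in this comment; under the terminator reading it has exactly 9 fields and fits the 9-column table.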
| Comment by Zdravelina Sokolovska (Inactive) [ 2018-10-30 ] |
|
When we use the local cpimport tool, i.e. the built-in cpimport in the MCS server, that issue is not observed; refer to point A. |
| Comment by Zdravelina Sokolovska (Inactive) [ 2018-10-30 ] |
|
It's not clear what the default values of the escape character and the enclosed-by character are.
Usage: ./mcsimport database table input_file [-m mapping_file] [-c Columnstore.xml] [-d delimiter] [-df date_format] [-default_non_mapped]
The other thing is that when the same file is imported via the cpimport tool from the CS server, no such problem is observed. |
| Comment by Jens Röwekamp (Inactive) [ 2018-10-30 ] |
|
Hi winstone, In my opinion this case is ambiguous: cpimport treats it one way without a warning and mcsimport the other. In both cases the CSV file gets injected successfully. But given an official specification stating otherwise, I'm happy to change it. Note that cpimport is itself inconsistent in its column counting. If you try to import a file with one more column and end it with a separator like in the case above, it reports 11 instead of 10 columns, e.g. with this input data:
These error messages are reported in the input_file.Job.err file.
The enclosed-by character and escape character are missing and are specified as a feature request in |
| Comment by Zdravelina Sokolovska (Inactive) [ 2018-10-30 ] |
|
The same behavior as with cpimport would be expected.
The import file used is one and the same for cpimport and the remote tool mcsimport. |
| Comment by David Thompson (Inactive) [ 2018-11-14 ] |
|
I think it's better for mcsimport to follow documented standards; really, we should have a bug on cpimport instead. |