Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How to select distinct values in a column in Talend

Tags:

talend

I am importing an excel file in Talend. I want to select all the distinct values in column "A" and then dump that data into the database. Is it possible to do that with Talend? If not, what are the alternatives available. Any help is appreciated

like image 511
devaki Avatar asked Nov 17 '13 02:11

devaki


2 Answers

Yes you can do that easily with Talend Open Studio.

Create a new job like this one:

enter image description here
You can replace the tOracleOutput component by the component corresponding to your database.
Then parameterize the tAggregateRow component like this :

enter image description here

Distinct values of ColumnA will be transfered to distinctColumnA in the output schema.
You can also get the number of occurences by adding a count of columnB in the operations table.

like image 113
Julien Boulay Avatar answered Sep 28 '22 10:09

Julien Boulay


Using tUniqRow in Talend Open Studio 6.3 works very well and you get to keep all your columns.

like image 30
netlander Avatar answered Sep 28 '22 09:09

netlander