I have an ARFF file containing 14 numerical columns. I want to perform a normalization on each column separately, that is modifying the values from each colum to (<code>actual_value - min(this_column)) / (max(this_column) - min(this_column)</code>). Hence, all values from a column will be in the range <code>[0, 1]</code>. The min and max values from a column might differ from those of another column. How can I do this with Weka filters? Thanks

This can be done using <pre class="prettyprint"><code>weka.filters.unsupervised.attribute.Normalize </code></pre> After applying this filter all values in each column will be in the range [0, 1]

Weka normalizing columns

Tags:

normalization

weka

I have an ARFF file containing 14 numerical columns. I want to perform a normalization on each column separately, that is modifying the values from each colum to (actual_value - min(this_column)) / (max(this_column) - min(this_column)). Hence, all values from a column will be in the range [0, 1]. The min and max values from a column might differ from those of another column.

How can I do this with Weka filters?

Thanks

493

asked Feb 16 '10 07:02

lmsasu

1 Answers

This can be done using

weka.filters.unsupervised.attribute.Normalize

After applying this filter all values in each column will be in the range [0, 1]

answered Sep 22 '22 00:09

George Dontas

Related questions
                            
                                Table Design For SystemSettings, Best Model
                            
                                Best way to store user-submitted item names (and their synonyms)
                            
                                SSE normalization slower than simple approximation?
                            
                                Why is ToUpperInvariant() faster than ToLowerInvariant()?
                            
                                MySQL: Eliminating duplicate rows without breaking a foreign key constraint
                            
                                Are these tables respect the 3NF Database Normalization?
                            
                                Min-Max normalization Layer in Caffe
                            
                                Normalizing URI to make it work correctly with MakeRelativeUri
                            
                                Unicode comparison of Cyrillic 'С' and Latin 'C'
                            
                                Normalizing functions without actually applying it in Haskell
                            
                                Normalizing this database: what would be ideal in this scenario?
                            
                                Which form of unicode normalization is appropriate for text mining?
                            
                                How to normalize deeply nested data with ngrx/entity (EntityState and EntityAdapter)
                            
                                Using IDs from multiple tables in a single column
                            
                                ElasticSearch incorrectly indexing and querying on non-alphanumeric characters
                            
                                How can I make the Wikipedia API normalize and redirect without knowing the exact case of all characters?
                            
                                Remove accents in string except "ñ"
                            
                                Normalizing data with binary and continuous variables for machine learning
                            
                                Normalization vs DeNormalization when using a JSON client with a JAVA/RDBMS stack
                            
                                Standardize dataset containing too large values