Scikitlearn Column Transformer Error: Column ordering must be equal for fit and for transform when using the remainder keyword

Tags:

1 Answers

You can use df_new =pd.DataFrame(df_origin, columns=df_train.columns to make sure the data to predict have same columns with training data.

And from the given example, it's obviously that ColumnTransformer will take the order number of a chosen column as a mark to process.(Although you can use exactly name to choose a column, but I think it will transform to number too)

>>> import numpy as np
>>> from sklearn.compose import ColumnTransformer
>>> from sklearn.preprocessing import Normalizer
>>> ct = ColumnTransformer(
...     [("norm1", Normalizer(norm='l1'), [0, 1]),
...      ("norm2", Normalizer(norm='l1'), slice(2, 4))])
>>> X = np.array([[0., 1., 2., 2.],
...               [1., 1., 0., 1.]])
>>> # Normalizer scales each row of X to unit norm. A separate scaling
>>> # is applied for the two first and two last elements of each
>>> # row independently.
>>> ct.fit_transform(X)
array([[0. , 1. , 0.5, 0.5],
       [0.5, 0.5, 0. , 1. ]])

answered Oct 18 '22 01:10

小笼包

Related questions
                            
                                In-place custom object unpacking different behavior with __getitem__ python 3.5 vs python 3.6
                            
                                networkx - meaning of weight in betwenness and current flow betweenness
                            
                                How can I remove sharp jumps in data?
                            
                                Getting some form of keras multi-processing/threading to work on Windows
                            
                                Does importing a Python file also import the imported files into shell?
                            
                                How to get all characters of an arbitrary encoding?
                            
                                Proper way to type hint a private property in python
                            
                                Django 2 namespace and app_name
                            
                                How to iterate over a slice?
                            
                                How to scan previous list values in order to add a new composite list value?
                            
                                How to multi-thread with "for" loop?
                            
                                How do I add a layer in a shape of a box to an altair plot?
                            
                                ValueError: When changing to a larger dtype, its size must be a divisor of the total size in bytes of the last axis of the array
                            
                                Unable to install Airflow even after setting SLUGIFY_USES_TEXT_UNIDECODE and AIRFLOW_GPL_UNIDECODE
                            
                                Search in Rotated Sorted Array in O(log n) time
                            
                                Why there is no UserSet class defined in Python?
                            
                                How to inspect clients that are connected to a GRPC server
                            
                                Checking of **kwargs in concrete implementation of abstract class method. Interface issue?
                            
                                How do I run a Python script from a subdirectory without breaking upper-level imports?
                            
                                Ctrl+C sends EOFError once after cancelling process [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Scikitlearn Column Transformer Error: Column ordering must be equal for fit and for transform when using the remainder keyword

Tags:

python-3.x

scikit-learn

tudou

People also ask

1 Answers

小笼包

Recent Activity

Donate For Us