ColumnDuplicator Class

Duplicates columns from the dataset

Inheritance
nimbusml.internal.core.preprocessing.schema._columnduplicator.ColumnDuplicator
ColumnDuplicator
nimbusml.base_transform.BaseTransform
ColumnDuplicator
sklearn.base.TransformerMixin
ColumnDuplicator

Constructor

ColumnDuplicator(columns=None, **params)

Parameters

Name Description
columns

a dictionary of key-value pairs, where key is the output column name and value is a list of input column names.

  • Only one key-value pair is allowed.

  • Input column type: numeric or string.

  • Output column type:

    Vector Type.

The << operator can be used to set this value (see Column Operator)

For example

  • ColumnDuplicator(columns={'age_copy': 'age', 'parity_copy':

'parity'})

  • ColumnDuplicator() << {'age_copy': 'age', 'parity_copy':

'parity'})

For more details see Columns.

params

Additional arguments sent to compute engine.

Examples


   ###############################################################################
   # ColumnDuplicator
   from nimbusml import FileDataStream
   from nimbusml.datasets import get_dataset
   from nimbusml.preprocessing.schema import ColumnDuplicator

   # data input (as a FileDataStream)
   path = get_dataset('infert').as_filepath()

   data = FileDataStream.read_csv(path, sep=',')

   # transform usage
   xf = ColumnDuplicator(
       columns={
           'education_copy': 'education',
           'age_copy': 'age'})

   # fit and transform
   features = xf.fit_transform(data)

   # print features
   print(features.head())
   #   age  age_copy  case education education_copy  induced  parity  ...
   # 0   26        26     1    0-5yrs         0-5yrs        1       6  ...
   # 1   42        42     1    0-5yrs         0-5yrs        1       1  ...
   # 2   39        39     1    0-5yrs         0-5yrs        2       6  ...
   # 3   34        34     1    0-5yrs         0-5yrs        2       4  ...
   # 4   35        35     1   6-11yrs        6-11yrs        1       3  ...

Methods

get_params

Get the parameters for this operator.

get_params

Get the parameters for this operator.

get_params(deep=False)

Parameters

Name Description
deep
Default value: False