ColumnDuplicator Class
Duplicates columns from the dataset
- Inheritance
-
nimbusml.internal.core.preprocessing.schema._columnduplicator.ColumnDuplicatorColumnDuplicatornimbusml.base_transform.BaseTransformColumnDuplicatorsklearn.base.TransformerMixinColumnDuplicator
Constructor
ColumnDuplicator(columns=None, **params)
Parameters
Name | Description |
---|---|
columns
|
a dictionary of key-value pairs, where key is the output column name and value is a list of input column names.
The << operator can be used to set this value (see Column Operator) For example
'parity'})
'parity'}) For more details see Columns. |
params
|
Additional arguments sent to compute engine. |
Examples
###############################################################################
# ColumnDuplicator
from nimbusml import FileDataStream
from nimbusml.datasets import get_dataset
from nimbusml.preprocessing.schema import ColumnDuplicator
# data input (as a FileDataStream)
path = get_dataset('infert').as_filepath()
data = FileDataStream.read_csv(path, sep=',')
# transform usage
xf = ColumnDuplicator(
columns={
'education_copy': 'education',
'age_copy': 'age'})
# fit and transform
features = xf.fit_transform(data)
# print features
print(features.head())
# age age_copy case education education_copy induced parity ...
# 0 26 26 1 0-5yrs 0-5yrs 1 6 ...
# 1 42 42 1 0-5yrs 0-5yrs 1 1 ...
# 2 39 39 1 0-5yrs 0-5yrs 2 6 ...
# 3 34 34 1 0-5yrs 0-5yrs 2 4 ...
# 4 35 35 1 6-11yrs 6-11yrs 1 3 ...
Methods
get_params |
Get the parameters for this operator. |
get_params
Get the parameters for this operator.
get_params(deep=False)
Parameters
Name | Description |
---|---|
deep
|
Default value: False
|