Data functions in the Factory

Data functions operate on the data types or values.

Rulex Platform formula

Function

Formula

Description

cast

cast(column, newtype, forced)

Casts a column to the specified data type. If the flag forced is set to false (true by default) only those operations which do not result in a loss of precision will be performed.

catNames

catNames(indatt, values, separator, negate)

Searches for values in specific attributes, and returns the headers of the attributes where the values were found. All the corresponding headers are concatenated.

decideType

decideType(column)

Assigns the correct data type to an attribute, depending on the values it contains.

disc

disc(column, cutoffs, rank)

Discretizes values of a selected attribute according to cutoff values.

discEqualFrequency

discEqualFrequency(column, nvalue, rank, quantile)

Discretizes values of a selected attribute into bins with the same number of values in each.

discEqualWidth

discEqualWidth(column, nvalue, rank, min, max)

Discretizes values of a selected attribute into bins of equal width.

discretize

discretize(column, nvalue, cutoffs, mode, rank, quantile, min, max)

Discretizes values of a selected attribute into bins of equal width, or with the same number of values or according to cutoff values.

isAttribute

isAttribute(name, binary)

Checks whether a specified attribute is present in the dataset. The result can either be returned as a Boolean (true/false) or binary (0/1) result.

isFloat

isFloat(string, binary)

Checks whether the data type of the attribute specified in the string parameter is continuous.

isType

isType(string, type, binary)

Checks whether the data type of the attribute specified in the string parameter corresponds to the data type specified in the type parameter.

type

type(column)

Returns the data type of the selected column as a nominal value.

Parameters in bold are mandatory.