Data functions in the Factory
Data functions operate on the data types or values of attributes: they cast columns, detect or check data types, and discretize values.
Rulex Platform formula | Description |
---|---|
cast(column, newtype, forced) | Casts a column to the specified data type. If the forced flag is set to false (it is true by default), only those operations that do not result in a loss of precision are performed. |
catNames(indatt, values, separator, negate) | Searches for values in specific attributes and returns the headers of the attributes where the values were found. All the corresponding headers are concatenated. |
decideType(column) | Assigns the correct data type to an attribute, depending on the values it contains. |
disc(column, cutoffs, rank) | Discretizes the values of a selected attribute according to cutoff values. |
discEqualFrequency(column, nvalue, rank, quantile) | Discretizes the values of a selected attribute into bins that each contain the same number of values. |
discEqualWidth(column, nvalue, rank, min, max) | Discretizes the values of a selected attribute into bins of equal width. |
discretize(column, nvalue, cutoffs, mode, rank, quantile, min, max) | Discretizes the values of a selected attribute into bins of equal width, into bins with the same number of values, or according to cutoff values. |
isAttribute(name, binary) | Checks whether a specified attribute is present in the dataset. The result can be returned either as a Boolean (true/false) or as a binary (0/1) value. |
isFloat(string, binary) | Checks whether the data type of the attribute specified in the string parameter is continuous. |
isType(string, type, binary) | Checks whether the data type of the attribute specified in the string parameter corresponds to the data type specified in the type parameter. |
type(column) | Returns the data type of the selected column as a nominal value. |
Parameters in bold are mandatory.
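
The three discretization strategies named in the table (cutoff values, equal width, equal frequency) can be illustrated outside Rulex with a short Python sketch. This is not Rulex code and does not reproduce the behavior of disc, discEqualWidth or discEqualFrequency exactly: the function names, the zero-based bin labels and the handling of boundary values and ties are all assumptions made for illustration, and the rank and quantile parameters are not modeled because the table does not describe them.

```python
from bisect import bisect_right


def disc_by_cutoffs(values, cutoffs):
    """Return, for each value, the index of the interval it falls into.

    Assumption: a value equal to a cutoff is placed in the upper bin.
    """
    cuts = sorted(cutoffs)
    return [bisect_right(cuts, v) for v in values]


def disc_equal_width(values, nbins, lo=None, hi=None):
    """Split the range [lo, hi] into nbins intervals of equal width.

    Assumption: lo and hi default to the observed minimum and maximum,
    and values outside [lo, hi] are clamped to the first or last bin.
    """
    if not values:
        return []
    lo = min(values) if lo is None else lo
    hi = max(values) if hi is None else hi
    if hi == lo:
        return [0 for _ in values]
    width = (hi - lo) / nbins
    bins = []
    for v in values:
        idx = int((v - lo) / width)
        bins.append(min(max(idx, 0), nbins - 1))
    return bins


def disc_equal_frequency(values, nbins):
    """Assign bins so that each holds (nearly) the same number of values.

    Assumption: bins are assigned by sorted position, so tied values
    may end up in different bins.
    """
    order = sorted(range(len(values)), key=lambda i: values[i])
    bins = [0] * len(values)
    for position, i in enumerate(order):
        bins[i] = position * nbins // len(values)
    return bins


ages = [18, 22, 25, 31, 40, 47, 58, 90]
print(disc_by_cutoffs(ages, [30, 50]))   # [0, 0, 0, 1, 1, 1, 2, 2]
print(disc_equal_width(ages, 3))         # [0, 0, 0, 0, 0, 1, 1, 2]
print(disc_equal_frequency(ages, 4))     # [0, 0, 1, 1, 2, 2, 3, 3]
```

On this sample the outlier 90 stretches the equal-width bins, so most values land in the first bin, while the equal-frequency split keeps two values per bin. Which strategy suits a given attribute depends on how its values are distributed.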