This article lists the functions in the RevoScaleR package alongside the nearest comparable functions in base R. A few of the comparable functions come from add-on packages such as foreign, tree, and rpart; these are shown with a package prefix.
rx function | Description | Nearest base R function |
rxGetInfo | Retrieves header information from an .XDF file or summary information from a data frame | str(), names(), colnames() |
rxGetVarInfo | Retrieves variable information from an .XDF file or data frame | names(), str(), nrow(), min(), max() |
RxSasData | Creates a SAS data source object | foreign::read.ssd() |
RxSpssData | Creates an SPSS data source object | foreign::read.spss() |
rxOpen | Opens a data source for reading | read.table(), etc. |
rxReadNext | Reads data from a data source | read.table(), etc. |
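As a rough illustration of the base R side of this table, the sketch below inspects a data frame using only the functions named in the right-hand column. It uses the built-in mtcars data set, which is an assumption chosen for convenience. The commented RevoScaleR line shows an assumed call shape and is not run here.

```r
# Base R only; mtcars ships with R and stands in for any data frame.
df <- mtcars

var_names <- names(df)                      # variable names
n_rows    <- nrow(df)                       # number of observations
mpg_range <- c(min(df$mpg), max(df$mpg))    # low and high values of one variable

str(df)                                     # compact per-variable type and value listing

# RevoScaleR counterpart (assumed call shape, requires the RevoScaleR package):
# rxGetInfo(df, getVarInfo = TRUE)
```

Several base R calls are needed to collect what the rx functions are described as returning in one step, which is why the table lists more than one comparable function per row.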
Data Manipulation and Chunking
rx function | Description | Nearest base R function |
rxDataStep | Transforms and subsets data in .XDF files or data frames | transform(), with(), within(), subset() |
rxFactors | Recodes a factor variable, or converts a non-factor variable into a factor | factor() |
rxSort | Performs multi-key sorting of the variables in an .XDF file or data frame | sort(), order() |
rxMerge | Merges two .XDF files or two data frames using a variety of merge types | merge(), rbind(), cbind() |
rxSplit | Splits an .XDF file or a data frame into multiple .XDF files or data frames | split() |
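The base R functions in this table can be chained into a small in-memory pipeline. The following is a minimal sketch on the built-in mtcars data set. The column names kpl, model, and size, the conversion factor, and the threshold are illustrative assumptions, not part of RevoScaleR.

```r
df <- mtcars
df$model <- rownames(df)

# transform() adds a derived column; subset() filters rows and keeps chosen columns.
df <- transform(df, kpl = mpg * 0.425144)          # miles per gallon to km per litre
efficient <- subset(df, kpl > 8, select = c(model, cyl, kpl))

# order() gives a multi-key sort: cyl ascending, then kpl descending.
sorted <- efficient[order(efficient$cyl, -efficient$kpl), ]

# factor() recodes a numeric variable into labelled categories.
sorted$cyl_f <- factor(sorted$cyl, levels = c(4, 6, 8),
                       labels = c("four", "six", "eight"))

# merge() joins two data frames on a shared key column.
labels <- data.frame(cyl = c(4, 6, 8), size = c("small", "medium", "large"))
joined <- merge(sorted, labels, by = "cyl")

# split() breaks a data frame into a list of data frames, one per group.
by_cyl <- split(joined, joined$cyl)
```

All of these operate on data held in memory, whereas the rx functions in the table are described as also working on .XDF files.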
Descriptive Statistics and Cross-Tabulation
rx function | Description | Nearest base R function |
rxSummary | Generates summary statistics for a data frame, including computations by group | summary(), lapply(x, …) |
rxQuantile | Computes approximate quantiles for an .XDF file or data frame without sorting | quantile() |
rxCrossTabs | Creates a cross-tabulation of data based on a formula provided as a parameter | xtabs() |
rxCube | Creates a cross-tabulation of data based on a formula provided as a parameter. This function is an alternative to rxCrossTabs and is designed for efficient representation. | xtabs() |
rxMarginals | Creates a marginal summary for an xtabs object | addmargins(), colSums(), rowSums() |
as.crosstabs | Converts cross-tabulation results to an xtabs object | xtabs() |
rxChiSquaredTest | Performs a chi-squared test on an xtabs object | chisq.test() |
rxFisherTest | Performs Fisher's Exact Test on an xtabs object | fisher.test() |
rxKendallCor | Computes Kendall's Tau Rank Correlation Coefficient using an xtabs object | cor(…, method="kendall") |
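For orientation, here is a short base R sketch of the summary and cross-tabulation workflow this table describes, again on the built-in mtcars data set (an assumption for illustration). The warning suppression around chisq.test() is also an illustrative choice, since small expected cell counts in this data set trigger an approximation warning.

```r
df <- mtcars

summary(df$mpg)                                       # minimum, quartiles, mean, maximum
quartiles <- quantile(df$mpg, probs = c(0.25, 0.5, 0.75))

# Per-group statistics with lapply(), here the mean mpg for each cylinder count.
mean_mpg_by_cyl <- lapply(split(df$mpg, df$cyl), mean)

# xtabs() builds a cross-tabulation from a formula.
tab <- xtabs(~ cyl + gear, data = df)

# addmargins() appends row and column totals; rowSums() gives one margin alone.
tab_with_totals <- addmargins(tab)
cars_per_cyl <- rowSums(tab)

# chisq.test() accepts the xtabs object directly.
chi <- suppressWarnings(chisq.test(tab))

# Kendall's tau on the underlying variables.
tau <- cor(df$cyl, df$gear, method = "kendall")
```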
Statistical Modeling
rx function | Description | Nearest base R function |
rxLinMod | Fits a linear model to data | lm() |
rxCovCor | Calculates the covariance, correlation, or sum of squares (cross-product) matrix for a set of variables | cor(), cov(), crossprod() |
rxCov | Calculates the covariance matrix for a set of variables | cov() |
rxCor | Calculates the correlation matrix for a set of variables | cor() |
rxLogit | Fits a logistic regression model to data | glm(…, family="binomial") |
rxGlm | Fits a generalized linear model to data | glm() |
rxDTree | Fits a classification or regression tree to data | tree::tree(), rpart::rpart() |
rxPredict | Calculates predictions for fitted models | predict() |
rxKmeans | Performs K-means clustering | stats::kmeans() |
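The modeling functions on the base R side share a common pattern: fit with a formula, then pass the fitted object to predict(). The sketch below shows that pattern on the built-in mtcars data set. The choice of variables, the new_cars data frame, and the three-cluster setting are assumptions made for illustration only.

```r
df <- mtcars

# Linear model and logistic regression, both specified with a formula.
fit_lm    <- lm(mpg ~ wt + hp, data = df)
fit_logit <- glm(am ~ wt, data = df, family = binomial)

# predict() works on both fits; type = "response" returns probabilities
# for the logistic model.
new_cars  <- data.frame(wt = c(2.5, 3.5), hp = c(100, 150))
pred_mpg  <- predict(fit_lm, newdata = new_cars)
pred_prob <- predict(fit_logit, newdata = new_cars, type = "response")

# Correlation and covariance matrices for a set of variables.
vars    <- df[, c("mpg", "wt", "hp")]
cor_mat <- cor(vars)
cov_mat <- cov(vars)

# kmeans() ships with the stats package; variables are scaled first so that
# no single variable dominates the distance calculation.
set.seed(1)
clusters <- kmeans(scale(vars), centers = 3)
```

A RevoScaleR fit such as rxLinMod(mpg ~ wt + hp, data = df) would take the same formula; that call shape is an assumption here and requires the RevoScaleR package.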
Basic Graphing
rx function | Description | Nearest base R function |
rxHistogram | Creates a histogram from data | hist() |
rxLinePlot | Creates a line plot from data | plot(), lines() |
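A minimal base R sketch of the two plot types follows, using the built-in mtcars data set. The titles, axis labels, and the added lowess() smoother are illustrative assumptions.

```r
# Histogram of one variable; hist() also returns the bin counts invisibly.
h <- hist(mtcars$mpg, breaks = 10,
          main = "Miles per gallon", xlab = "mpg")

# Line plot: sort by the x variable first so the line runs left to right.
ord <- order(mtcars$wt)
plot(mtcars$wt[ord], mtcars$mpg[ord], type = "l",
     main = "mpg by weight", xlab = "wt (1000 lbs)", ylab = "mpg")

# lines() adds a second series to the existing plot, here a smoothed trend.
lines(lowess(mtcars$wt, mtcars$mpg), lty = 2)
```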
See Also
SQL Server R Services Features and Tasks