summary.mlModel: Microsoft R Machine Learning 모델의 요약입니다.

2025-06-05

Microsoft R Machine Learning 모델의 요약입니다.

사용법

 ## S3 method for class `mlModel':
summary  (object, top = 20, ...)

주장들

`object`

MicrosoftML 분석에서 반환된 모델 개체입니다.

`top`

rxLogisticRegression 및 rxFastLinear와 같은 선형 모델에 대한 요약에 표시할 상위 계수의 수를 지정합니다. 바이어스가 먼저 나타나고 그 뒤에 다른 가중치가 나타나고 절대값이 내림차순으로 정렬됩니다. 이 값으로 NULL설정하면 0이 아닌 모든 계수가 표시됩니다. 그렇지 않으면 첫 번째 top 계수만 표시됩니다.

`...`

요약 메서드에 전달할 추가 인수입니다.

세부 정보

원래 함수 호출에 대한 요약 정보를 제공합니다.
모델을 학습하는 데 사용되는 데이터 집합 및 모델의 계수에 대한 통계입니다.

가치

summary 분석 개체의 메서드는 원래 함수 호출 및 사용된 기본 매개 변수를 포함하는 목록을 반환합니다. 메서드는 coef 모델 개체에서 정보를 처리하는 명명된 가중치 벡터를 반환합니다.

rxLogisticRegression의 경우 다음 통계가 요약에 있을 수도 showTrainingStatsTRUE있습니다.

`training.size`

모델 학습에 사용되는 데이터 집합의 크기(행 수)입니다.

`deviance`

모델 편차는 모델에 통합된 모든 기능을 사용하여 관찰을 얻을 가능성이 있는 위치에 -2 * ln(L) 의해 L 제공됩니다.

`null.deviance`

null 편차는 기능에서 영향을 주지 않고 관찰을 얻을 가능성이 있는 위치에 -2 * ln(L0) 의해 L0 지정됩니다. 모델에 바이어스(있는 경우)가 null 모델에 포함됩니다.

`aic`

AIC(Akaike 정보 기준)는 모델의 계수 수인 2 * k ``+ deviance 것으로 k정의됩니다. 바이어스는 계수 중 하나로 계산됩니다. AIC는 모델의 상대적 품질 측정값입니다. 모델의 적합성(편차로 측정됨)과 모델의 복잡성(계수 수로 측정)의 절충을 다룹니다.

`coefficients.stats`

모델의 각 계수에 대한 통계를 포함하는 데이터 프레임입니다. 각 계수에 대해 다음 통계가 표시됩니다. 바이어스는 첫 번째 행에 나타나고 나머지 계수는 p-값의 오름차순으로 표시됩니다.

Estimate 모델의 예상 계수 값입니다.
Std ErrorThis는 계수 추정값의 큰 샘플 분산의 제곱근입니다.
z-ScoreWe는 예측 비율과 표준 오차를 계산하여 계수의 중요성과 관련하여 계수가 0이어야 한다고 명시하는 null 가설을 테스트할 수 있습니다. null 가설에 따라 정규화가 적용되지 않은 경우 관련 계수의 추정값은 평균 0과 위에서 계산한 표준 오차와 동일한 표준 편차를 갖는 정규 분포를 따릅니다. z 점수는 계수의 예상값과 계수의 표준 오차 사이의 비율을 출력합니다.
Pr(>|z|) z 점수의 양면 테스트에 해당하는 p-값입니다. 중요도 수준에 따라 중요도 표시기가 p-값에 추가됩니다. 표준 정규 분포F(x)N(0, 1)의 CDF인 경우 P(>|z|) = 2 - ``2 * F(|z|) .

작성자

Microsoft Corporation Microsoft Technical Support

참고하십시오

rxFastTrees, rxFastForest, rxFastLinear, rxOneClassSvm, rxNeuralNet, rxLogisticRegression.

예시


 # Estimate a logistic regression model
 logitModel <- rxLogisticRegression(isCase ~ age + parity + education + spontaneous + induced,
                   transforms = list(isCase = case == 1),
                   data = infert)
 # Print a summary of the model
 summary(logitModel)

 # Score to a data frame
 scoreDF <- rxPredict(logitModel, data = infert, 
     extraVarsToWrite = "isCase")

 # Compute and plot the Radio Operator Curve and AUC
 roc1 <- rxRoc(actualVarName = "isCase", predVarNames = "Probability", data = scoreDF) 
 plot(roc1)
 rxAuc(roc1)

 #######################################################################################
 # Multi-class logistic regression  
 testObs <- rnorm(nrow(iris)) > 0
 testIris <- iris[testObs,]
 trainIris <- iris[!testObs,]
 multiLogit <- rxLogisticRegression(
     formula = Species~Sepal.Length + Sepal.Width + Petal.Length + Petal.Width,
     type = "multiClass", data = trainIris)

 # Score the model
 scoreMultiDF <- rxPredict(multiLogit, data = testIris, 
     extraVarsToWrite = "Species")    
 # Print the first rows of the data frame with scores
 head(scoreMultiDF)
 # Look at confusion matrix
 table(scoreMultiDF$Species, scoreMultiDF$PredictedLabel)

 # Look at the observations with incorrect predictions
 badPrediction = scoreMultiDF$Species != scoreMultiDF$PredictedLabel
 scoreMultiDF[badPrediction,]

다음을 통해 공유

summary.mlModel: Microsoft R Machine Learning 모델의 요약입니다.

사용법

주장들

object

top

...

세부 정보

가치

training.size

deviance

null.deviance

aic

coefficients.stats

작성자