Free Statistics

of Irreproducible Research!

Author's title

Author*The author of this computation has been verified*
R Software Modulerwasp_regression_trees1.wasp
Title produced by softwareRecursive Partitioning (Regression Trees)
Date of computationSat, 25 Dec 2010 21:13:58 +0000
Cite this page as followsStatistical Computations at FreeStatistics.org, Office for Research Development and Education, URL https://freestatistics.org/blog/index.php?v=date/2010/Dec/25/t1293311521dih2gq9k8qu3qj1.htm/, Retrieved Mon, 29 Apr 2024 07:01:56 +0000
Statistical Computations at FreeStatistics.org, Office for Research Development and Education, URL https://freestatistics.org/blog/index.php?pk=115459, Retrieved Mon, 29 Apr 2024 07:01:56 +0000
QR Codes:

Original text written by user:
IsPrivate?No (this computation is public)
User-defined keywords
Estimated Impact118
Family? (F = Feedback message, R = changed R code, M = changed R Module, P = changed Parameters, D = changed Data)
-     [Recursive Partitioning (Regression Trees)] [] [2010-12-05 19:50:12] [b98453cac15ba1066b407e146608df68]
-   PD  [Recursive Partitioning (Regression Trees)] [cross validation] [2010-12-24 14:19:42] [d6e648f00513dd750579ba7880c5fbf5]
-           [Recursive Partitioning (Regression Trees)] [] [2010-12-25 21:13:58] [15be6a7406d91c9912c2afdb984c54ee] [Current]
Feedback Forum

Post a new message
Dataseries X:
4	1	27	5	26	49	35
4	1	36	4	25	45	34
5	1	25	4	17	54	13
2	1	27	3	37	36	35
3	2	25	3	35	36	28
5	2	44	3	15	53	32
4	1	50	4	27	46	35
4	1	41	4	36	42	36
4	1	48	5	25	41	27
4	2	43	4	30	45	29
5	2	47	2	27	47	27
4	2	41	3	33	42	28
3	1	44	2	29	45	29
4	2	47	5	30	40	28
3	2	40	3	25	45	30
3	2	46	3	23	40	25
4	1	28	3	26	42	15
3	1	56	3	24	45	33
4	2	49	4	35	47	31
2	2	25	4	39	31	37
4	2	41	4	23	46	37
3	2	26	3	32	34	34
4	1	50	5	29	43	32
4	1	47	4	26	45	21
3	1	52	2	21	42	25
3	2	37	5	35	51	32
2	2	41	3	23	44	28
4	1	45	4	21	47	22
5	2	26	4	28	47	25
4	1	NA	3	30	41	26
2	1	52	4	21	44	34
5	1	46	2	29	51	34
4	1	58	3	28	46	36
3	1	54	5	19	47	36
4	1	29	3	26	46	26
2	2	50	3	33	38	26
3	1	43	2	34	50	34
3	2	30	3	33	48	33
3	2	47	2	40	36	31
5	1	45	3	24	51	33
NA	2	48	1	35	35	22
4	2	48	3	35	49	29
4	2	26	4	32	38	24
4	1	46	5	20	47	37
2	2	NA	3	35	36	32
4	2	50	3	35	47	23
3	1	25	4	21	46	29
4	1	47	2	33	43	35
1	2	47	2	40	53	20
2	1	41	3	22	55	28
2	2	45	2	35	39	26
4	2	41	4	20	55	36
3	2	45	5	28	41	26
4	2	40	3	46	33	33
3	1	29	4	18	52	25
3	2	34	5	22	42	29
5	1	45	5	20	56	32
3	2	52	3	25	46	35
2	2	41	4	31	33	24
1	2	48	3	21	51	31
2	2	45	3	23	46	29
5	1	54	2	26	46	27
4	2	25	3	34	50	29
4	2	26	4	31	46	29
3	1	28	4	23	51	27
4	2	50	4	31	48	34
4	2	48	4	26	44	32
2	2	51	3	36	38	31
3	2	53	3	28	42	31
4	1	37	3	34	39	31
3	1	56	2	25	45	16
2	1	43	3	33	31	25
4	1	34	3	46	29	27
4	1	42	3	24	48	32
3	2	32	3	32	38	28
5	2	31	5	33	55	25
1	1	46	3	42	32	25
3	2	30	5	17	51	36
3	2	47	4	36	53	36
5	2	33	4	40	47	36
2	1	25	4	30	45	27
3	1	25	5	19	33	29
3	2	21	4	33	49	32
4	2	36	5	35	46	29
2	2	50	3	23	42	31
4	2	48	3	15	56	34
3	2	48	2	38	35	27
3	1	25	3	37	40	28
3	1	48	4	23	44	32
2	2	49	5	41	46	33
3	1	27	5	34	46	29
2	1	28	3	38	39	32
4	2	43	2	45	35	35
4	2	48	3	27	48	33
2	2	48	4	46	42	27
1	1	25	1	26	39	16
5	2	49	4	44	39	32
4	1	26	3	36	41	26
4	1	51	3	20	52	32
4	2	25	4	44	45	38
3	1	29	3	27	42	24
3	1	29	4	27	44	26
1	1	43	2	41	33	19
5	2	46	3	30	42	37
3	1	44	3	33	46	25
3	1	25	3	37	45	24
2	1	51	2	30	40	23
4	1	42	5	20	48	28
4	2	53	5	44	32	38
3	1	25	4	20	53	28
4	2	49	2	33	39	28
4	1	51	3	31	45	26
2	2	20	3	23	36	21
3	2	44	3	33	38	35
3	2	38	4	33	49	31
3	1	46	5	32	46	34
4	2	42	4	25	43	30
5	1	29	NA	22	37	30
3	2	46	4	16	48	24
3	2	49	2	36	45	27
2	2	51	3	35	32	26
3	1	38	3	25	46	30
1	1	41	1	27	20	15
4	2	47	3	32	42	28
4	2	44	3	36	45	34
4	2	47	3	51	29	29
3	2	46	3	30	51	26
5	1	44	4	20	55	31
2	2	28	3	29	50	28
2	2	47	4	26	44	33
3	2	28	4	20	41	32
3	1	41	5	40	40	33
2	2	45	4	29	47	31
1	2	46	4	32	42	37
3	1	46	4	33	40	27
5	2	22	3	32	51	19
4	2	33	3	34	43	27
4	1	41	4	24	45	31
4	2	47	5	25	41	38
3	1	25	3	41	41	22
5	2	42	3	39	37	35
3	2	47	3	21	46	35
3	2	50	3	38	38	30
3	1	55	5	28	39	41
3	1	21	3	37	45	25
4	1	NA	3	26	46	28
2	1	52	3	30	39	45
2	2	49	4	25	21	21
4	2	46	4	38	31	33
3	1	NA	4	31	35	25
3	2	45	3	31	49	29
2	2	52	3	27	40	31
3	1	NA	3	21	45	29
3	2	40	4	26	46	31
4	2	49	4	37	45	31
1	1	38	5	28	34	25
1	1	32	5	29	41	27
5	2	46	4	33	43	26
4	2	32	3	41	45	26
3	2	41	3	19	48	23
3	2	43	3	37	43	27
4	1	44	4	36	45	24
3	1	47	5	27	45	35
2	2	28	3	33	34	24
1	1	52	1	29	40	32
1	1	27	2	42	40	24
5	2	45	5	27	55	24
4	1	27	4	47	44	38
3	1	25	4	17	44	36
4	1	28	4	34	48	24
5	1	25	3	32	51	18
4	1	52	4	25	49	34
4	1	44	3	27	33	23
2	2	43	3	37	43	35
3	2	47	4	34	44	22
4	2	52	4	27	44	34
3	2	40	2	37	41	28
4	1	42	3	32	45	34
3	1	45	5	26	44	32
4	1	45	2	29	44	24
1	1	50	5	28	40	34
2	1	49	3	19	48	33
3	1	52	2	46	49	33
3	2	48	3	31	46	29
5	2	51	3	42	49	38
4	2	49	4	33	55	24
3	2	31	4	39	51	25
3	2	43	3	27	46	37
3	2	31	3	35	37	33
3	2	28	4	23	43	30
4	2	43	4	32	41	22
3	2	31	3	22	45	28
2	2	51	3	17	39	24
4	2	58	4	35	38	33
2	2	25	5	34	41	37




Summary of computational transaction
Raw Inputview raw input (R code)
Raw Outputview raw output of R engine
Computing time6 seconds
R Server'Sir Ronald Aylmer Fisher' @ 193.190.124.24

\begin{tabular}{lllllllll}
\hline
Summary of computational transaction \tabularnewline
Raw Input & view raw input (R code)  \tabularnewline
Raw Output & view raw output of R engine  \tabularnewline
Computing time & 6 seconds \tabularnewline
R Server & 'Sir Ronald Aylmer Fisher' @ 193.190.124.24 \tabularnewline
\hline
\end{tabular}
%Source: https://freestatistics.org/blog/index.php?pk=115459&T=0

[TABLE]
[ROW][C]Summary of computational transaction[/C][/ROW]
[ROW][C]Raw Input[/C][C]view raw input (R code) [/C][/ROW]
[ROW][C]Raw Output[/C][C]view raw output of R engine [/C][/ROW]
[ROW][C]Computing time[/C][C]6 seconds[/C][/ROW]
[ROW][C]R Server[/C][C]'Sir Ronald Aylmer Fisher' @ 193.190.124.24[/C][/ROW]
[/TABLE]
Source: https://freestatistics.org/blog/index.php?pk=115459&T=0

Globally Unique Identifier (entire table): ba.freestatistics.org/blog/index.php?pk=115459&T=0

As an alternative you can also use a QR Code:  

The GUIDs for individual cells are displayed in the table below:

Summary of computational transaction
Raw Inputview raw input (R code)
Raw Outputview raw output of R engine
Computing time6 seconds
R Server'Sir Ronald Aylmer Fisher' @ 193.190.124.24







10-Fold Cross Validation
Prediction (training)Prediction (testing)
ActualC1C2CVC1C2CV
C18871140.886186130.8687
C26411230.1617150.0658
Overall--0.5722--0.52

\begin{tabular}{lllllllll}
\hline
10-Fold Cross Validation \tabularnewline
 & Prediction (training) & Prediction (testing) \tabularnewline
Actual & C1 & C2 & CV & C1 & C2 & CV \tabularnewline
C1 & 887 & 114 & 0.8861 & 86 & 13 & 0.8687 \tabularnewline
C2 & 641 & 123 & 0.161 & 71 & 5 & 0.0658 \tabularnewline
Overall & - & - & 0.5722 & - & - & 0.52 \tabularnewline
\hline
\end{tabular}
%Source: https://freestatistics.org/blog/index.php?pk=115459&T=1

[TABLE]
[ROW][C]10-Fold Cross Validation[/C][/ROW]
[ROW][C][/C][C]Prediction (training)[/C][C]Prediction (testing)[/C][/ROW]
[ROW][C]Actual[/C][C]C1[/C][C]C2[/C][C]CV[/C][C]C1[/C][C]C2[/C][C]CV[/C][/ROW]
[ROW][C]C1[/C][C]887[/C][C]114[/C][C]0.8861[/C][C]86[/C][C]13[/C][C]0.8687[/C][/ROW]
[ROW][C]C2[/C][C]641[/C][C]123[/C][C]0.161[/C][C]71[/C][C]5[/C][C]0.0658[/C][/ROW]
[ROW][C]Overall[/C][C]-[/C][C]-[/C][C]0.5722[/C][C]-[/C][C]-[/C][C]0.52[/C][/ROW]
[/TABLE]
Source: https://freestatistics.org/blog/index.php?pk=115459&T=1

Globally Unique Identifier (entire table): ba.freestatistics.org/blog/index.php?pk=115459&T=1

As an alternative you can also use a QR Code:  

The GUIDs for individual cells are displayed in the table below:

10-Fold Cross Validation
Prediction (training)Prediction (testing)
ActualC1C2CVC1C2CV
C18871140.886186130.8687
C26411230.1617150.0658
Overall--0.5722--0.52







Confusion Matrix (predicted in columns / actuals in rows)
C1C2
C11100
C2840

\begin{tabular}{lllllllll}
\hline
Confusion Matrix (predicted in columns / actuals in rows) \tabularnewline
 & C1 & C2 \tabularnewline
C1 & 110 & 0 \tabularnewline
C2 & 84 & 0 \tabularnewline
\hline
\end{tabular}
%Source: https://freestatistics.org/blog/index.php?pk=115459&T=2

[TABLE]
[ROW][C]Confusion Matrix (predicted in columns / actuals in rows)[/C][/ROW]
[ROW][C][/C][C]C1[/C][C]C2[/C][/ROW]
[ROW][C]C1[/C][C]110[/C][C]0[/C][/ROW]
[ROW][C]C2[/C][C]84[/C][C]0[/C][/ROW]
[/TABLE]
Source: https://freestatistics.org/blog/index.php?pk=115459&T=2

Globally Unique Identifier (entire table): ba.freestatistics.org/blog/index.php?pk=115459&T=2

As an alternative you can also use a QR Code:  

The GUIDs for individual cells are displayed in the table below:

Confusion Matrix (predicted in columns / actuals in rows)
C1C2
C11100
C2840



Parameters (Session):
par1 = 1 ; par2 = quantiles ; par3 = 2 ; par4 = yes ;
Parameters (R input):
par1 = 1 ; par2 = quantiles ; par3 = 2 ; par4 = yes ;
R code (references can be found in the software module):
library(party)
library(Hmisc)
par1 <- as.numeric(par1)
par3 <- as.numeric(par3)
x <- data.frame(t(y))
is.data.frame(x)
x <- x[!is.na(x[,par1]),]
k <- length(x[1,])
n <- length(x[,1])
colnames(x)[par1]
x[,par1]
if (par2 == 'kmeans') {
cl <- kmeans(x[,par1], par3)
print(cl)
clm <- matrix(cbind(cl$centers,1:par3),ncol=2)
clm <- clm[sort.list(clm[,1]),]
for (i in 1:par3) {
cl$cluster[cl$cluster==clm[i,2]] <- paste('C',i,sep='')
}
cl$cluster <- as.factor(cl$cluster)
print(cl$cluster)
x[,par1] <- cl$cluster
}
if (par2 == 'quantiles') {
x[,par1] <- cut2(x[,par1],g=par3)
}
if (par2 == 'hclust') {
hc <- hclust(dist(x[,par1])^2, 'cen')
print(hc)
memb <- cutree(hc, k = par3)
dum <- c(mean(x[memb==1,par1]))
for (i in 2:par3) {
dum <- c(dum, mean(x[memb==i,par1]))
}
hcm <- matrix(cbind(dum,1:par3),ncol=2)
hcm <- hcm[sort.list(hcm[,1]),]
for (i in 1:par3) {
memb[memb==hcm[i,2]] <- paste('C',i,sep='')
}
memb <- as.factor(memb)
print(memb)
x[,par1] <- memb
}
if (par2=='equal') {
ed <- cut(as.numeric(x[,par1]),par3,labels=paste('C',1:par3,sep=''))
x[,par1] <- as.factor(ed)
}
table(x[,par1])
colnames(x)
colnames(x)[par1]
x[,par1]
if (par2 == 'none') {
m <- ctree(as.formula(paste(colnames(x)[par1],' ~ .',sep='')),data = x)
}
load(file='createtable')
if (par2 != 'none') {
m <- ctree(as.formula(paste('as.factor(',colnames(x)[par1],') ~ .',sep='')),data = x)
if (par4=='yes') {
a<-table.start()
a<-table.row.start(a)
a<-table.element(a,'10-Fold Cross Validation',3+2*par3,TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'',1,TRUE)
a<-table.element(a,'Prediction (training)',par3+1,TRUE)
a<-table.element(a,'Prediction (testing)',par3+1,TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'Actual',1,TRUE)
for (jjj in 1:par3) a<-table.element(a,paste('C',jjj,sep=''),1,TRUE)
a<-table.element(a,'CV',1,TRUE)
for (jjj in 1:par3) a<-table.element(a,paste('C',jjj,sep=''),1,TRUE)
a<-table.element(a,'CV',1,TRUE)
a<-table.row.end(a)
for (i in 1:10) {
ind <- sample(2, nrow(x), replace=T, prob=c(0.9,0.1))
m.ct <- ctree(as.formula(paste('as.factor(',colnames(x)[par1],') ~ .',sep='')),data =x[ind==1,])
if (i==1) {
m.ct.i.pred <- predict(m.ct, newdata=x[ind==1,])
m.ct.i.actu <- x[ind==1,par1]
m.ct.x.pred <- predict(m.ct, newdata=x[ind==2,])
m.ct.x.actu <- x[ind==2,par1]
} else {
m.ct.i.pred <- c(m.ct.i.pred,predict(m.ct, newdata=x[ind==1,]))
m.ct.i.actu <- c(m.ct.i.actu,x[ind==1,par1])
m.ct.x.pred <- c(m.ct.x.pred,predict(m.ct, newdata=x[ind==2,]))
m.ct.x.actu <- c(m.ct.x.actu,x[ind==2,par1])
}
}
print(m.ct.i.tab <- table(m.ct.i.actu,m.ct.i.pred))
numer <- 0
for (i in 1:par3) {
print(m.ct.i.tab[i,i] / sum(m.ct.i.tab[i,]))
numer <- numer + m.ct.i.tab[i,i]
}
print(m.ct.i.cp <- numer / sum(m.ct.i.tab))
print(m.ct.x.tab <- table(m.ct.x.actu,m.ct.x.pred))
numer <- 0
for (i in 1:par3) {
print(m.ct.x.tab[i,i] / sum(m.ct.x.tab[i,]))
numer <- numer + m.ct.x.tab[i,i]
}
print(m.ct.x.cp <- numer / sum(m.ct.x.tab))
for (i in 1:par3) {
a<-table.row.start(a)
a<-table.element(a,paste('C',i,sep=''),1,TRUE)
for (jjj in 1:par3) a<-table.element(a,m.ct.i.tab[i,jjj])
a<-table.element(a,round(m.ct.i.tab[i,i]/sum(m.ct.i.tab[i,]),4))
for (jjj in 1:par3) a<-table.element(a,m.ct.x.tab[i,jjj])
a<-table.element(a,round(m.ct.x.tab[i,i]/sum(m.ct.x.tab[i,]),4))
a<-table.row.end(a)
}
a<-table.row.start(a)
a<-table.element(a,'Overall',1,TRUE)
for (jjj in 1:par3) a<-table.element(a,'-')
a<-table.element(a,round(m.ct.i.cp,4))
for (jjj in 1:par3) a<-table.element(a,'-')
a<-table.element(a,round(m.ct.x.cp,4))
a<-table.row.end(a)
a<-table.end(a)
table.save(a,file='mytable3.tab')
}
}
m
bitmap(file='test1.png')
plot(m)
dev.off()
bitmap(file='test1a.png')
plot(x[,par1] ~ as.factor(where(m)),main='Response by Terminal Node',xlab='Terminal Node',ylab='Response')
dev.off()
if (par2 == 'none') {
forec <- predict(m)
result <- as.data.frame(cbind(x[,par1],forec,x[,par1]-forec))
colnames(result) <- c('Actuals','Forecasts','Residuals')
print(result)
}
if (par2 != 'none') {
print(cbind(as.factor(x[,par1]),predict(m)))
myt <- table(as.factor(x[,par1]),predict(m))
print(myt)
}
bitmap(file='test2.png')
if(par2=='none') {
op <- par(mfrow=c(2,2))
plot(density(result$Actuals),main='Kernel Density Plot of Actuals')
plot(density(result$Residuals),main='Kernel Density Plot of Residuals')
plot(result$Forecasts,result$Actuals,main='Actuals versus Predictions',xlab='Predictions',ylab='Actuals')
plot(density(result$Forecasts),main='Kernel Density Plot of Predictions')
par(op)
}
if(par2!='none') {
plot(myt,main='Confusion Matrix',xlab='Actual',ylab='Predicted')
}
dev.off()
if (par2 == 'none') {
detcoef <- cor(result$Forecasts,result$Actuals)
a<-table.start()
a<-table.row.start(a)
a<-table.element(a,'Goodness of Fit',2,TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'Correlation',1,TRUE)
a<-table.element(a,round(detcoef,4))
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'R-squared',1,TRUE)
a<-table.element(a,round(detcoef*detcoef,4))
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'RMSE',1,TRUE)
a<-table.element(a,round(sqrt(mean((result$Residuals)^2)),4))
a<-table.row.end(a)
a<-table.end(a)
table.save(a,file='mytable1.tab')
a<-table.start()
a<-table.row.start(a)
a<-table.element(a,'Actuals, Predictions, and Residuals',4,TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'#',header=TRUE)
a<-table.element(a,'Actuals',header=TRUE)
a<-table.element(a,'Forecasts',header=TRUE)
a<-table.element(a,'Residuals',header=TRUE)
a<-table.row.end(a)
for (i in 1:length(result$Actuals)) {
a<-table.row.start(a)
a<-table.element(a,i,header=TRUE)
a<-table.element(a,result$Actuals[i])
a<-table.element(a,result$Forecasts[i])
a<-table.element(a,result$Residuals[i])
a<-table.row.end(a)
}
a<-table.end(a)
table.save(a,file='mytable.tab')
}
if (par2 != 'none') {
a<-table.start()
a<-table.row.start(a)
a<-table.element(a,'Confusion Matrix (predicted in columns / actuals in rows)',par3+1,TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'',1,TRUE)
for (i in 1:par3) {
a<-table.element(a,paste('C',i,sep=''),1,TRUE)
}
a<-table.row.end(a)
for (i in 1:par3) {
a<-table.row.start(a)
a<-table.element(a,paste('C',i,sep=''),1,TRUE)
for (j in 1:par3) {
a<-table.element(a,myt[i,j])
}
a<-table.row.end(a)
}
a<-table.end(a)
table.save(a,file='mytable2.tab')
}