Home » date » 2010 » Dec » 12 »

Depression Tree no categorisation

*The author of this computation has been verified*
R Software Module: /rwasp_regression_trees1.wasp (opens new window with default values)
Title produced by software: Recursive Partitioning (Regression Trees)
Date of computation: Sun, 12 Dec 2010 18:04:18 +0000
 
Cite this page as follows:
Statistical Computations at FreeStatistics.org, Office for Research Development and Education, URL http://www.freestatistics.org/blog/date/2010/Dec/12/t1292176999xm90gvpoqgyvggh.htm/, Retrieved Sun, 12 Dec 2010 19:03:19 +0100
 
BibTeX entries for LaTeX users:
@Manual{KEY,
    author = {{YOUR NAME}},
    publisher = {Office for Research Development and Education},
    title = {Statistical Computations at FreeStatistics.org, URL http://www.freestatistics.org/blog/date/2010/Dec/12/t1292176999xm90gvpoqgyvggh.htm/},
    year = {2010},
}
@Manual{R,
    title = {R: A Language and Environment for Statistical Computing},
    author = {{R Development Core Team}},
    organization = {R Foundation for Statistical Computing},
    address = {Vienna, Austria},
    year = {2010},
    note = {{ISBN} 3-900051-07-0},
    url = {http://www.R-project.org},
}
 
Original text written by user:
 
IsPrivate?
No (this computation is public)
 
User-defined keywords:
 
Dataseries X:
» Textbox « » Textfile « » CSV «
15 10 77 46 15 12 13 6 11 6 4 15 16 0 9 20 63 37 12 7 11 4 26 5 4 23 24 1 12 16 73 45 15 13 14 6 26 20 10 26 22 1 15 10 76 46 12 11 12 5 15 12 6 19 21 1 17 8 90 55 14 16 12 5 10 11 5 19 23 1 14 14 67 40 8 10 6 4 21 12 8 16 23 1 9 19 69 43 11 15 10 5 27 11 9 23 21 0 11 23 54 33 4 4 10 2 21 13 8 19 22 1 13 9 54 33 13 7 12 5 21 9 11 24 20 1 16 12 76 47 19 15 15 6 22 14 6 19 12 0 16 14 75 44 10 5 13 6 29 12 8 25 23 0 15 13 76 47 15 16 18 8 29 18 11 23 23 0 10 11 80 49 6 15 11 6 29 9 5 31 30 1 16 11 89 55 7 13 12 3 30 15 10 29 22 0 12 10 73 43 14 13 13 6 19 12 7 18 21 1 15 12 74 46 16 15 14 6 19 12 7 17 21 1 13 18 78 51 16 15 16 7 22 12 13 22 15 0 18 12 76 47 14 10 16 8 18 15 10 21 22 0 13 10 69 42 15 17 16 6 28 11 8 24 24 1 17 15 74 42 14 14 15 7 17 13 6 22 23 0 14 15 82 48 12 9 13 4 18 10 8 16 15 0 13 12 77 45 9 6 8 4 20 17 7 22 24 1 13 9 84 51 12 11 14 2 16 13 5 21 24 0 15 11 75 46 14 13 15 6 17 17 9 25 21 0 15 16 79 47 14 10 16 6 25 15 11 22 21 0 13 17 79 47 10 4 13 6 22 13 11 24 18 0 etc...
 
Output produced by software:

Enter (or paste) a matrix (table) containing all data (time) series. Every column represents a different variable and must be delimited by a space or Tab. Every row represents a period in time (or category) and must be delimited by hard returns. The easiest way to enter data is to copy and paste a block of spreadsheet cells. Please, do not use commas or spaces to seperate groups of digits!


Summary of computational transaction
Raw Inputview raw input (R code)
Raw Outputview raw output of R engine
Computing time6 seconds
R Server'Sir Ronald Aylmer Fisher' @ 193.190.124.24


Goodness of Fit
Correlation0.5376
R-squared0.289
RMSE2.7063


Actuals, Predictions, and Residuals
#ActualsForecastsResiduals
11011.4769230769231-1.47692307692308
22016.52380952380953.47619047619047
31613.03921568627452.96078431372549
41011.4769230769231-1.47692307692308
5811.4769230769231-3.47692307692308
61413.03921568627450.96078431372549
71916.52380952380952.47619047619047
82316.52380952380956.47619047619047
9913.0392156862745-4.03921568627451
101211.47692307692310.523076923076923
111411.47692307692312.52307692307692
121311.47692307692311.52307692307692
131116.5238095238095-5.52380952380953
141111.4769230769231-0.476923076923077
151013.0392156862745-3.03921568627451
161211.47692307692310.523076923076923
171813.03921568627454.96078431372549
181211.47692307692310.523076923076923
191013.0392156862745-3.03921568627451
201511.47692307692313.52307692307692
211513.03921568627451.96078431372549
221213.0392156862745-1.03921568627451
23913.0392156862745-4.03921568627451
241111.4769230769231-0.476923076923077
251611.47692307692314.52307692307692
261713.03921568627453.96078431372549
271113.0392156862745-2.03921568627451
281311.47692307692311.52307692307692
29913.0392156862745-4.03921568627451
301111.4769230769231-0.476923076923077
312016.52380952380953.47619047619047
32811.4769230769231-3.47692307692308
331211.47692307692310.523076923076923
341011.4769230769231-1.47692307692308
351113.0392156862745-2.03921568627451
361311.47692307692311.52307692307692
371311.47692307692311.52307692307692
381313.0392156862745-0.0392156862745097
391516.5238095238095-1.52380952380953
401213.0392156862745-1.03921568627451
411316.5238095238095-3.52380952380953
421416.5238095238095-2.52380952380953
43911.4769230769231-2.47692307692308
44911.4769230769231-2.47692307692308
451513.03921568627451.96078431372549
461011.4769230769231-1.47692307692308
471311.47692307692311.52307692307692
48811.4769230769231-3.47692307692308
491513.03921568627451.96078431372549
501311.47692307692311.52307692307692
512416.52380952380957.47619047619047
521116.5238095238095-5.52380952380953
531311.47692307692311.52307692307692
541211.47692307692310.523076923076923
552216.52380952380955.47619047619047
561111.4769230769231-0.476923076923077
571513.03921568627451.96078431372549
58711.4769230769231-4.47692307692308
591413.03921568627450.96078431372549
601013.0392156862745-3.03921568627451
61916.5238095238095-7.52380952380953
621213.0392156862745-1.03921568627451
631616.5238095238095-0.523809523809526
641311.47692307692311.52307692307692
651111.4769230769231-0.476923076923077
661113.0392156862745-2.03921568627451
671311.47692307692311.52307692307692
681011.4769230769231-1.47692307692308
691111.4769230769231-0.476923076923077
70911.4769230769231-2.47692307692308
711316.5238095238095-3.52380952380953
721413.03921568627450.96078431372549
731413.03921568627450.96078431372549
741111.4769230769231-0.476923076923077
751011.4769230769231-1.47692307692308
761111.4769230769231-0.476923076923077
771211.47692307692310.523076923076923
781411.47692307692312.52307692307692
791413.03921568627450.96078431372549
802111.47692307692319.52307692307692
811311.47692307692311.52307692307692
821111.4769230769231-0.476923076923077
831213.0392156862745-1.03921568627451
841211.47692307692310.523076923076923
851113.0392156862745-2.03921568627451
861413.03921568627450.96078431372549
871313.0392156862745-0.0392156862745097
881313.0392156862745-0.0392156862745097
891213.0392156862745-1.03921568627451
901413.03921568627450.96078431372549
911213.0392156862745-1.03921568627451
921211.47692307692310.523076923076923
931816.52380952380951.47619047619047
941111.4769230769231-0.476923076923077
951513.03921568627451.96078431372549
961311.47692307692311.52307692307692
971111.4769230769231-0.476923076923077
982213.03921568627458.96078431372549
991011.4769230769231-1.47692307692308
1001113.0392156862745-2.03921568627451
1011513.03921568627451.96078431372549
1021413.03921568627450.96078431372549
1031113.0392156862745-2.03921568627451
1041011.4769230769231-1.47692307692308
1051413.03921568627450.96078431372549
1061411.47692307692312.52307692307692
1071516.5238095238095-1.52380952380953
1081113.0392156862745-2.03921568627451
1091011.4769230769231-1.47692307692308
1101011.4769230769231-1.47692307692308
1111213.0392156862745-1.03921568627451
1121513.03921568627451.96078431372549
1131011.4769230769231-1.47692307692308
1141213.0392156862745-1.03921568627451
1151511.47692307692313.52307692307692
1161113.0392156862745-2.03921568627451
1171011.4769230769231-1.47692307692308
1182016.52380952380953.47619047619047
1191916.52380952380952.47619047619047
1201713.03921568627453.96078431372549
121811.4769230769231-3.47692307692308
1221716.52380952380950.476190476190474
1231111.4769230769231-0.476923076923077
1241316.5238095238095-3.52380952380953
125911.4769230769231-2.47692307692308
1261011.4769230769231-1.47692307692308
1271311.47692307692311.52307692307692
1281613.03921568627452.96078431372549
1291211.47692307692310.523076923076923
1301411.47692307692312.52307692307692
1311113.0392156862745-2.03921568627451
1321313.0392156862745-0.0392156862745097
1331516.5238095238095-1.52380952380953
1341411.47692307692312.52307692307692
1351413.03921568627450.96078431372549
1361013.0392156862745-3.03921568627451
137811.4769230769231-3.47692307692308
 
Charts produced by software:
http://www.freestatistics.org/blog/date/2010/Dec/12/t1292176999xm90gvpoqgyvggh/2o8vu1292177049.png (open in new window)
http://www.freestatistics.org/blog/date/2010/Dec/12/t1292176999xm90gvpoqgyvggh/2o8vu1292177049.ps (open in new window)


http://www.freestatistics.org/blog/date/2010/Dec/12/t1292176999xm90gvpoqgyvggh/3o8vu1292177049.png (open in new window)
http://www.freestatistics.org/blog/date/2010/Dec/12/t1292176999xm90gvpoqgyvggh/3o8vu1292177049.ps (open in new window)


http://www.freestatistics.org/blog/date/2010/Dec/12/t1292176999xm90gvpoqgyvggh/4sqti1292177049.png (open in new window)
http://www.freestatistics.org/blog/date/2010/Dec/12/t1292176999xm90gvpoqgyvggh/4sqti1292177049.ps (open in new window)


 
Parameters (Session):
par1 = 2 ; par2 = none ; par3 = 6 ; par4 = no ;
 
Parameters (R input):
par1 = 2 ; par2 = none ; par3 = 6 ; par4 = no ;
 
R code (references can be found in the software module):
library(party)
library(Hmisc)
par1 <- as.numeric(par1)
par3 <- as.numeric(par3)
x <- data.frame(t(y))
is.data.frame(x)
x <- x[!is.na(x[,par1]),]
k <- length(x[1,])
n <- length(x[,1])
colnames(x)[par1]
x[,par1]
if (par2 == 'kmeans') {
cl <- kmeans(x[,par1], par3)
print(cl)
clm <- matrix(cbind(cl$centers,1:par3),ncol=2)
clm <- clm[sort.list(clm[,1]),]
for (i in 1:par3) {
cl$cluster[cl$cluster==clm[i,2]] <- paste('C',i,sep='')
}
cl$cluster <- as.factor(cl$cluster)
print(cl$cluster)
x[,par1] <- cl$cluster
}
if (par2 == 'quantiles') {
x[,par1] <- cut2(x[,par1],g=par3)
}
if (par2 == 'hclust') {
hc <- hclust(dist(x[,par1])^2, 'cen')
print(hc)
memb <- cutree(hc, k = par3)
dum <- c(mean(x[memb==1,par1]))
for (i in 2:par3) {
dum <- c(dum, mean(x[memb==i,par1]))
}
hcm <- matrix(cbind(dum,1:par3),ncol=2)
hcm <- hcm[sort.list(hcm[,1]),]
for (i in 1:par3) {
memb[memb==hcm[i,2]] <- paste('C',i,sep='')
}
memb <- as.factor(memb)
print(memb)
x[,par1] <- memb
}
if (par2=='equal') {
ed <- cut(as.numeric(x[,par1]),par3,labels=paste('C',1:par3,sep=''))
x[,par1] <- as.factor(ed)
}
table(x[,par1])
colnames(x)
colnames(x)[par1]
x[,par1]
if (par2 == 'none') {
m <- ctree(as.formula(paste(colnames(x)[par1],' ~ .',sep='')),data = x)
}
load(file='createtable')
if (par2 != 'none') {
m <- ctree(as.formula(paste('as.factor(',colnames(x)[par1],') ~ .',sep='')),data = x)
if (par4=='yes') {
a<-table.start()
a<-table.row.start(a)
a<-table.element(a,'10-Fold Cross Validation',3+2*par3,TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'',1,TRUE)
a<-table.element(a,'Prediction (training)',par3+1,TRUE)
a<-table.element(a,'Prediction (testing)',par3+1,TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'Actual',1,TRUE)
for (jjj in 1:par3) a<-table.element(a,paste('C',jjj,sep=''),1,TRUE)
a<-table.element(a,'CV',1,TRUE)
for (jjj in 1:par3) a<-table.element(a,paste('C',jjj,sep=''),1,TRUE)
a<-table.element(a,'CV',1,TRUE)
a<-table.row.end(a)
for (i in 1:10) {
ind <- sample(2, nrow(x), replace=T, prob=c(0.9,0.1))
m.ct <- ctree(as.formula(paste('as.factor(',colnames(x)[par1],') ~ .',sep='')),data =x[ind==1,])
if (i==1) {
m.ct.i.pred <- predict(m.ct, newdata=x[ind==1,])
m.ct.i.actu <- x[ind==1,par1]
m.ct.x.pred <- predict(m.ct, newdata=x[ind==2,])
m.ct.x.actu <- x[ind==2,par1]
} else {
m.ct.i.pred <- c(m.ct.i.pred,predict(m.ct, newdata=x[ind==1,]))
m.ct.i.actu <- c(m.ct.i.actu,x[ind==1,par1])
m.ct.x.pred <- c(m.ct.x.pred,predict(m.ct, newdata=x[ind==2,]))
m.ct.x.actu <- c(m.ct.x.actu,x[ind==2,par1])
}
}
print(m.ct.i.tab <- table(m.ct.i.actu,m.ct.i.pred))
numer <- 0
for (i in 1:par3) {
print(m.ct.i.tab[i,i] / sum(m.ct.i.tab[i,]))
numer <- numer + m.ct.i.tab[i,i]
}
print(m.ct.i.cp <- numer / sum(m.ct.i.tab))
print(m.ct.x.tab <- table(m.ct.x.actu,m.ct.x.pred))
numer <- 0
for (i in 1:par3) {
print(m.ct.x.tab[i,i] / sum(m.ct.x.tab[i,]))
numer <- numer + m.ct.x.tab[i,i]
}
print(m.ct.x.cp <- numer / sum(m.ct.x.tab))
for (i in 1:par3) {
a<-table.row.start(a)
a<-table.element(a,paste('C',i,sep=''),1,TRUE)
for (jjj in 1:par3) a<-table.element(a,m.ct.i.tab[i,jjj])
a<-table.element(a,round(m.ct.i.tab[i,i]/sum(m.ct.i.tab[i,]),4))
for (jjj in 1:par3) a<-table.element(a,m.ct.x.tab[i,jjj])
a<-table.element(a,round(m.ct.x.tab[i,i]/sum(m.ct.x.tab[i,]),4))
a<-table.row.end(a)
}
a<-table.row.start(a)
a<-table.element(a,'Overall',1,TRUE)
for (jjj in 1:par3) a<-table.element(a,'-')
a<-table.element(a,round(m.ct.i.cp,4))
for (jjj in 1:par3) a<-table.element(a,'-')
a<-table.element(a,round(m.ct.x.cp,4))
a<-table.row.end(a)
a<-table.end(a)
table.save(a,file='mytable3.tab')
}
}
m
bitmap(file='test1.png')
plot(m)
dev.off()
bitmap(file='test1a.png')
plot(x[,par1] ~ as.factor(where(m)),main='Response by Terminal Node',xlab='Terminal Node',ylab='Response')
dev.off()
if (par2 == 'none') {
forec <- predict(m)
result <- as.data.frame(cbind(x[,par1],forec,x[,par1]-forec))
colnames(result) <- c('Actuals','Forecasts','Residuals')
print(result)
}
if (par2 != 'none') {
print(cbind(as.factor(x[,par1]),predict(m)))
myt <- table(as.factor(x[,par1]),predict(m))
print(myt)
}
bitmap(file='test2.png')
if(par2=='none') {
op <- par(mfrow=c(2,2))
plot(density(result$Actuals),main='Kernel Density Plot of Actuals')
plot(density(result$Residuals),main='Kernel Density Plot of Residuals')
plot(result$Forecasts,result$Actuals,main='Actuals versus Predictions',xlab='Predictions',ylab='Actuals')
plot(density(result$Forecasts),main='Kernel Density Plot of Predictions')
par(op)
}
if(par2!='none') {
plot(myt,main='Confusion Matrix',xlab='Actual',ylab='Predicted')
}
dev.off()
if (par2 == 'none') {
detcoef <- cor(result$Forecasts,result$Actuals)
a<-table.start()
a<-table.row.start(a)
a<-table.element(a,'Goodness of Fit',2,TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'Correlation',1,TRUE)
a<-table.element(a,round(detcoef,4))
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'R-squared',1,TRUE)
a<-table.element(a,round(detcoef*detcoef,4))
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'RMSE',1,TRUE)
a<-table.element(a,round(sqrt(mean((result$Residuals)^2)),4))
a<-table.row.end(a)
a<-table.end(a)
table.save(a,file='mytable1.tab')
a<-table.start()
a<-table.row.start(a)
a<-table.element(a,'Actuals, Predictions, and Residuals',4,TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'#',header=TRUE)
a<-table.element(a,'Actuals',header=TRUE)
a<-table.element(a,'Forecasts',header=TRUE)
a<-table.element(a,'Residuals',header=TRUE)
a<-table.row.end(a)
for (i in 1:length(result$Actuals)) {
a<-table.row.start(a)
a<-table.element(a,i,header=TRUE)
a<-table.element(a,result$Actuals[i])
a<-table.element(a,result$Forecasts[i])
a<-table.element(a,result$Residuals[i])
a<-table.row.end(a)
}
a<-table.end(a)
table.save(a,file='mytable.tab')
}
if (par2 != 'none') {
a<-table.start()
a<-table.row.start(a)
a<-table.element(a,'Confusion Matrix (predicted in columns / actuals in rows)',par3+1,TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'',1,TRUE)
for (i in 1:par3) {
a<-table.element(a,paste('C',i,sep=''),1,TRUE)
}
a<-table.row.end(a)
for (i in 1:par3) {
a<-table.row.start(a)
a<-table.element(a,paste('C',i,sep=''),1,TRUE)
for (j in 1:par3) {
a<-table.element(a,myt[i,j])
}
a<-table.row.end(a)
}
a<-table.end(a)
table.save(a,file='mytable2.tab')
}
 





Copyright

Creative Commons License

This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 License.

Software written by Ed van Stee & Patrick Wessa


Disclaimer

Information provided on this web site is provided "AS IS" without warranty of any kind, either express or implied, including, without limitation, warranties of merchantability, fitness for a particular purpose, and noninfringement. We use reasonable efforts to include accurate and timely information and periodically update the information, and software without notice. However, we make no warranties or representations as to the accuracy or completeness of such information (or software), and we assume no liability or responsibility for errors or omissions in the content of this web site, or any software bugs in online applications. Your use of this web site is AT YOUR OWN RISK. Under no circumstances and under no legal theory shall we be liable to you or any other person for any direct, indirect, special, incidental, exemplary, or consequential damages arising from your access to, or use of, this web site.


Privacy Policy

We may request personal information to be submitted to our servers in order to be able to:

  • personalize online software applications according to your needs
  • enforce strict security rules with respect to the data that you upload (e.g. statistical data)
  • manage user sessions of online applications
  • alert you about important changes or upgrades in resources or applications

We NEVER allow other companies to directly offer registered users information about their products and services. Banner references and hyperlinks of third parties NEVER contain any personal data of the visitor.

We do NOT sell, nor transmit by any means, personal information, nor statistical data series uploaded by you to third parties.

We carefully protect your data from loss, misuse, alteration, and destruction. However, at any time, and under any circumstance you are solely responsible for managing your passwords, and keeping them secret.

We store a unique ANONYMOUS USER ID in the form of a small 'Cookie' on your computer. This allows us to track your progress when using this website which is necessary to create state-dependent features. The cookie is used for NO OTHER PURPOSE. At any time you may opt to disallow cookies from this website - this will not affect other features of this website.

We examine cookies that are used by third-parties (banner and online ads) very closely: abuse from third-parties automatically results in termination of the advertising contract without refund. We have very good reason to believe that the cookies that are produced by third parties (banner ads) do NOT cause any privacy or security risk.

FreeStatistics.org is safe. There is no need to download any software to use the applications and services contained in this website. Hence, your system's security is not compromised by their use, and your personal data - other than data you submit in the account application form, and the user-agent information that is transmitted by your browser - is never transmitted to our servers.

As a general rule, we do not log on-line behavior of individuals (other than normal logging of webserver 'hits'). However, in cases of abuse, hacking, unauthorized access, Denial of Service attacks, illegal copying, hotlinking, non-compliance with international webstandards (such as robots.txt), or any other harmful behavior, our system engineers are empowered to log, track, identify, publish, and ban misbehaving individuals - even if this leads to ban entire blocks of IP addresses, or disclosing user's identity.


FreeStatistics.org is powered by