Repository of Reproducible Computations

Free Statistics

of Irreproducible Research!

Author's title

Author

*The author of this computation has been verified*

R Software Module

rwasp_hierarchicalclustering.wasp

Title produced by software

Hierarchical Clustering

Date of computation

Wed, 12 Nov 2008 12:48:41 -0700

Cite this page as follows

Statistical Computations at FreeStatistics.org, Office for Research Development and Education, URL https://freestatistics.org/blog/index.php?v=date/2008/Nov/12/t12265193598f085vbcus1w1kz.htm/, Retrieved Sun, 19 May 2024 10:20:57 +0000

Statistical Computations at FreeStatistics.org, Office for Research Development and Education, URL https://freestatistics.org/blog/index.php?pk=24417, Retrieved Sun, 19 May 2024 10:20:57 +0000

QR Codes:

Paste this QR Code to cite your computation.

Original text written by user:

IsPrivate?

No (this computation is public)

User-defined keywords

Estimated Impact

158

Family? (F = Feedback message, R = changed R code, M = changed R Module, P = changed Parameters, D = changed Data)

F     [Hierarchical Clustering] [Dendogram] [2008-11-11 16:52:48] [12d343c4448a5f9e527bb31caeac580b]
F   PD    [Hierarchical Clustering] [task 8 dendogram] [2008-11-12 19:48:41] [0458bd763b171003ec052ce63099d477] [Current]

Feedback Forum

2008-11-23 14:46:06 [Nathalie Daneels] [reply] 
Evaluatie opdracht 3 - Blok 8 (Q2) 
 
De student had hier geen conclusie bij vermeld. Dit zou een mogelijke conclusie kunnen zijn: 
 
We produceren een dendrogram om een antwoord te kunnen formuleren op de vraag: Kunnen er in periodes groepen gemaakt worden die gelijkaardig zijn?  
Een dendogram wordt op de volgende manier 'gemaakt': De tijdreeks wordt eerst opgesplitst in 2 delen (2 vertakkingen vanuit 1 knooppunt) en wordt dan telkens verderop gesplitst totdat er telkens nog maar 1 maand overblijft. Elke vertakking/cluster geeft de periodes weer die gelijkaardig zijn.  
Als we deze informatie toepassen op de gegevens van de student, kunnen we vaststellen dat de tijdreeks (de perioden) eerder random verdeeld zijn. Dit betekent dat we geen 'vast' patroon kunnen waarnemen tussen de clusters of met andere woorden dat er niet echt periodes zijn die gelijkaardig zijn en dus allemaal in dezelfde cluster zitten. Bijvoorbeeld als men een tijdreeks heeft van 18 maanden. En de eerste 7 maanden bevinden zich allemaal in de eerste cluster en de overige maanden in de tweede cluster. Dan kunnen we wel vaststellen dat er een verandering heeft opgetreden na de zevende maand en kunnen we op zoek gaan naar een patroon voor de periodes die gelijkaardig zijn en dus in dezelfde cluster zitten.  
We kunnen wel opmerken bij de dendrogram van de student dat de periodes die beginnen met het cijfer 7 enkel in de tweede cluster voorkomen. Voor de rest zijn de periodes eigenlijk willekeurig verdeeld. Voor deze periodes, beginnend met het cijfer 7, kunnen we dus stellen dat ze gelijkaardig zijn.
2008-11-24 12:40:12 [Anouk Greeve] [reply] 
Interpretatie ontbreekt!  
Het nut van een dendrogram is dat we kunnen zien welke gegevens in 1 groep zitten en welke we dus anders moeten behandelen. We kunnen vaststellen dat de waarden van de periodes die indezelfde kluster liggen, rond dezelfde hoogte liggen en dus dezelfde gegevens bevatten.
2008-11-24 18:41:35 [Jan De Vleeschauwer] [reply] 
geen conclusie

Post a new message

Dataseries X:

Download CSV

Histogram

Boxplots

90.7	78.4	97.8
94.3	114.6	107.4
104.6	113.3	117.5
111.1	117	105.6
110.8	99.6	97.4
107.2	99.4	99.5
99	101.9	98
99	115.2	104.3
91	108.5	100.6
96.2	113.8	101.1
96.9	121	103.9
96.2	92.2	96.9
100.1	90.2	95.5
99	101.5	108.4
115.4	126.6	117
106.9	93.9	103.8
107.1	89.8	100.8
99.3	93.4	110.6
99.2	101.5	104
108.3	110.4	112.6
105.6	105.9	107.3
99.5	108.4	98.9
107.4	113.9	109.8
93.1	86.1	104.9
88.1	69.4	102.2
110.7	101.2	123.9
113.1	100.5	124.9
99.6	98	112.7
93.6	106.6	121.9
98.6	90.1	100.6
99.6	96.9	104.3
114.3	125.9	120.4
107.8	112	107.5
101.2	100	102.9
112.5	123.9	125.6
100.5	79.8	107.5
93.9	83.4	108.8
116.2	113.6	128.4
112	112.9	121.1
106.4	104	119.5
95.7	109.9	128.7
96	99	108.7
95.8	106.3	105.5
103	128.9	119.8
102.2	111.1	111.3
98.4	102.9	110.6
111.4	130	120.1
86.6	87	97.5
91.3	87.5	107.7
107.9	117.6	127.3
101.8	103.4	117.2
104.4	110.8	119.8
93.4	112.6	116.2
100.1	102.5	111
98.5	112.4	112.4
112.9	135.6	130.6
101.4	105.1	109.1
107.1	127.7	118.8
110.8	137	123.9
90.3	91	101.6
95.5	90.5	112.8
111.4	122.4	128
113	123.3	129.6
107.5	124.3	125.8
95.9	120	119.5
106.3	118.1	115.7
105.2	119	113.6
117.2	142.7	129.7
106.9	123.6	112
108.2	129.6	116.8
113	151.6	127
97.2	110.4	112.1
99.9	99.2	114.2
108.1	130.5	121.1
118.1	136.2	131.6
109.1	129.7	125
93.3	128	120.4
112.1	121.6	117.7
111.8	135.8	117.5
112.5	143.8	120.6
116.3	147.5	127.5
110.3	136.2	112.3
117.1	156.6	124.5
103.4	123.3	115.2
96.2	100.4	105.4

Summary of computational transaction
Raw Input	view raw input (R code)
Raw Output	view raw output of R engine
Computing time	1 seconds
R Server	'George Udny Yule' @ 72.249.76.132
R Engine error message	Error in dimnames(data) <- dimnames : length of 'dimnames' [1] not equal to array extent Calls: array Execution halted

\begin{tabular}{lllllllll}
\hline
Summary of computational transaction \tabularnewline
Raw Input & view raw input (R code)  \tabularnewline
Raw Output & view raw output of R engine  \tabularnewline
Computing time & 1 seconds \tabularnewline
R Server & 'George Udny Yule' @ 72.249.76.132 \tabularnewline
R Engine error message & Error in dimnames(data) <- dimnames : 
  length of 'dimnames' [1] not equal to array extent
Calls: array
Execution halted
 \tabularnewline
\hline
\end{tabular}
%Source: https://freestatistics.org/blog/index.php?pk=24417&T=0

[TABLE]
[ROW][C]Summary of computational transaction[/C][/ROW]
[ROW][C]Raw Input[/C][C]view raw input (R code) [/C][/ROW]
[ROW][C]Raw Output[/C][C]view raw output of R engine [/C][/ROW]
[ROW][C]Computing time[/C][C]1 seconds[/C][/ROW]
[ROW][C]R Server[/C][C]'George Udny Yule' @ 72.249.76.132[/C][/ROW]
[ROW][C]R Engine error message[/C][C]Error in dimnames(data) <- dimnames : 
  length of 'dimnames' [1] not equal to array extent
Calls: array
Execution halted
[/C][/ROW]
[/TABLE]
Source: https://freestatistics.org/blog/index.php?pk=24417&T=0

Globally Unique Identifier (entire table): ba.freestatistics.org/blog/index.php?pk=24417&T=0

As an alternative you can also use a QR Code:

The GUIDs for individual cells are displayed in the table below:

Summary of computational transaction
Raw Input	view raw input (R code)
Raw Output	view raw output of R engine
Computing time	1 seconds
R Server	'George Udny Yule' @ 72.249.76.132
R Engine error message	Error in dimnames(data) <- dimnames : length of 'dimnames' [1] not equal to array extent Calls: array Execution halted

Parameters (Session):

Parameters (R input):

par1 = ward ; par2 = ALL ; par3 = FALSE ; par4 = FALSE ;

R code (references can be found in the software module):

par3 <- as.logical(par3)
par4 <- as.logical(par4)
if (par3 == 'TRUE'){
dum = xlab
xlab = ylab
ylab = dum
}
x <- t(y)
hc <- hclust(dist(x),method=par1)
d <- as.dendrogram(hc)
str(d)
mysub <- paste('Method: ',par1)
bitmap(file='test1.png')
if (par4 == 'TRUE'){
plot(d,main=main,ylab=ylab,xlab=xlab,horiz=par3, nodePar=list(pch = c(1,NA), cex=0.8, lab.cex = 0.8),type='t',center=T, sub=mysub)
} else {
plot(d,main=main,ylab=ylab,xlab=xlab,horiz=par3, nodePar=list(pch = c(1,NA), cex=0.8, lab.cex = 0.8), sub=mysub)
}
dev.off()
if (par2 != 'ALL'){
if (par3 == 'TRUE'){
ylab = 'cluster'
} else {
xlab = 'cluster'
}
par2 <- as.numeric(par2)
memb <- cutree(hc, k = par2)
cent <- NULL
for(k in 1:par2){
cent <- rbind(cent, colMeans(x[memb == k, , drop = FALSE]))
}
hc1 <- hclust(dist(cent),method=par1, members = table(memb))
de <- as.dendrogram(hc1)
bitmap(file='test2.png')
if (par4 == 'TRUE'){
plot(de,main=main,ylab=ylab,xlab=xlab,horiz=par3, nodePar=list(pch = c(1,NA), cex=0.8, lab.cex = 0.8),type='t',center=T, sub=mysub)
} else {
plot(de,main=main,ylab=ylab,xlab=xlab,horiz=par3, nodePar=list(pch = c(1,NA), cex=0.8, lab.cex = 0.8), sub=mysub)
}
dev.off()
str(de)
}
load(file='createtable')
a<-table.start()
a<-table.row.start(a)
a<-table.element(a,'Summary of Dendrogram',2,TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'Label',header=TRUE)
a<-table.element(a,'Height',header=TRUE)
a<-table.row.end(a)
num <- length(x[,1])-1
for (i in 1:num)
{
a<-table.row.start(a)
a<-table.element(a,hc$labels[i])
a<-table.element(a,hc$height[i])
a<-table.row.end(a)
}
a<-table.end(a)
table.save(a,file='mytable1.tab')
if (par2 != 'ALL'){
a<-table.start()
a<-table.row.start(a)
a<-table.element(a,'Summary of Cut Dendrogram',2,TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'Label',header=TRUE)
a<-table.element(a,'Height',header=TRUE)
a<-table.row.end(a)
num <- par2-1
for (i in 1:num)
{
a<-table.row.start(a)
a<-table.element(a,i)
a<-table.element(a,hc1$height[i])
a<-table.row.end(a)
}
a<-table.end(a)
table.save(a,file='mytable2.tab')
}

Free Statistics

Description of Statistical Computation

Tree of Dependent Computations

Dataset

Tables (Output of Computation)

Figures (Output of Computation)

Input Parameters & R Code