Repository of Reproducible Computations

Free Statistics

of Irreproducible Research!

*The author of this computation has been verified*

R Software Module

rwasp_cloud.wasp

Title produced by software

Trivariate Scatterplots

Date of computation

Tue, 11 Nov 2008 08:14:26 -0700

Cite this page as follows

Statistical Computations at FreeStatistics.org, Office for Research Development and Education, URL https://freestatistics.org/blog/index.php?v=date/2008/Nov/11/t1226416774wri0vcdbuhvfa8t.htm/, Retrieved Sun, 19 May 2024 09:19:29 +0000

Statistical Computations at FreeStatistics.org, Office for Research Development and Education, URL https://freestatistics.org/blog/index.php?pk=23587, Retrieved Sun, 19 May 2024 09:19:29 +0000

IsPrivate?

No (this computation is public)

User-defined keywords

gdm

Estimated Impact

144

Family? (F = Feedback message, R = changed R code, M = changed R Module, P = changed Parameters, D = changed Data)

F       [Trivariate Scatterplots] [WS3 Task 1 - Triv...] [2008-11-11 15:14:26] [99f79d508deef838ee89a56fb32f134e] [Current]

Feedback Forum

2008-11-18 09:22:27 [Evelyn Gabriel] [reply] 
The student apparently left the Trivariate Scatterplots out of consideration. Yet I find this the best method for getting a clear overview of the relationships between the variables. The advantage is that you can compare 3 variables with one another and that the Bivariate Kernel Density Plot is displayed right away as well.
2008-11-20 10:14:27 [Angelique Van de Vijver] [reply] 
There is indeed a more or less positive correlation between the different variables. You can see this in the bivariate kernel density plots, where the contour lines form an ellipse that points towards the upper right and lies around the diagonal.
There is, however, a difference between the ordinary correlations and the partial correlations (see table), which indicates that the correlation is influenced by a third variable.
These bivariate kernel density plots let you visualise the correlation between 2 variables. With this method the correlation may be influenced by a third variable, so you may get a distorted picture and a spurious correlation. The partial correlation solves this problem: it first removes the effects of the third variable and then computes the correlation between the 2 variables.
Incorrect conclusion by the student: 0.64 is not the partial correlation of x and y but the ordinary correlation (see table).
The correlation values are fairly low, however, so there is no strong correlation; the partial correlations in particular are very low (around 0.40). One therefore cannot really speak of a relationship between the variables.
The student did not mention the trivariate scatterplots, although they can provide useful information. They let you examine the correlation between 3 variables at the same time, each plot looking from a different perspective. With this method you can sometimes notice things that the kernel density plots do not show.
Each trivariate scatterplot is a projection of a cube. The plots can give a distorted picture because one dimension is left out each time.
On the trivariate scatterplots we see more or less a pattern in the point cloud. It lies roughly along the diagonal, but the points are very widely scattered.
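
The distinction the comment above draws between ordinary and partial correlation can be illustrated with a short R sketch. This is not the computation behind the table mentioned above; the data are simulated, and the names `partial_cor`, `a`, `b` and `ctrl` are hypothetical. It removes the linear effect of a third variable from two series and correlates what is left, then checks the result against the usual closed-form expression.

```r
# Partial correlation of a and b given ctrl, computed from the residuals
# of two simple regressions. All names and data here are illustrative.
partial_cor <- function(a, b, ctrl) {
  res_a <- resid(lm(a ~ ctrl))  # part of a not linearly explained by ctrl
  res_b <- resid(lm(b ~ ctrl))  # part of b not linearly explained by ctrl
  cor(res_a, res_b)
}

set.seed(1)
ctrl <- rnorm(200)         # hypothetical third variable
a <- ctrl + rnorm(200)     # a and b are related only through ctrl
b <- ctrl + rnorm(200)

r_ab   <- cor(a, b)                # ordinary correlation
r_ab_c <- partial_cor(a, b, ctrl)  # partial correlation given ctrl

# The same quantity from the closed-form formula
r_ac <- cor(a, ctrl)
r_bc <- cor(b, ctrl)
r_formula <- (r_ab - r_ac * r_bc) / sqrt((1 - r_ac^2) * (1 - r_bc^2))
```

In this simulated setting the ordinary correlation comes entirely from the shared third variable, so the partial correlation should be much closer to zero than `r_ab`.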
2008-11-22 15:32:16 [An Knapen] [reply] 
A trivariate scatterplot shows the simultaneous relationship between 3 variables. In this example it concerns the relationship between intermediate goods, investment goods and consumer goods. Since we only have a 2-dimensional representation, we have to rotate the cube (in which the relationship between the 3 variables is shown) several times. This gives a distorted picture because one dimension is lost each time.
I find that you cannot clearly read off which variables have the strongest relationship. In every drawing the values are distributed/scattered in more or less the same way. The relationship is therefore similar everywhere.
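
The remarks above about rotating the cube and losing a dimension can be made concrete with a small base R sketch. It does not reproduce how `cloud()` draws its output; it only shows, under simplifying assumptions (simulated points, a rotation about one axis, no perspective), that a 2-D view keeps two rotated coordinates and drops the third. The helper `rotate_z` and the variables `pts`, `view_a` and `view_b` are hypothetical names.

```r
# Rotate a 3-D point cloud about the z axis and keep two screen
# coordinates; the remaining coordinate (depth) is what a flat picture drops.
rotate_z <- function(pts, degrees) {
  th <- degrees * pi / 180
  R <- matrix(c(cos(th), -sin(th), 0,
                sin(th),  cos(th), 0,
                0,        0,       1), nrow = 3, byrow = TRUE)
  pts %*% t(R)  # apply R to every row of pts
}

set.seed(2)
pts <- cbind(x = rnorm(100), y = rnorm(100), z = rnorm(100))  # simulated data

view_a <- rotate_z(pts, 0)[, c(1, 3)]   # screen shows (x, z); y is hidden
view_b <- rotate_z(pts, 90)[, c(1, 3)]  # screen shows (-y, z); x is hidden
```

Each view is faithful to the two coordinates it keeps, but no single view shows all three, which is why several viewing angles are plotted.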

Summary of computational transaction
Raw Input	view raw input (R code)
Raw Output	view raw output of R engine
Computing time	5 seconds
R Server	'Herman Ole Andreas Wold' @ 193.190.124.10:1001

Figures 1 to 7 (images not reproduced; each was available as PNG, Postscript and PDF)

Parameters (Session):

par1 = 50 ; par2 = 50 ; par3 = Y ; par4 = Y ; par5 = Intermediairegoederen ; par6 = Investeringsgoederen ; par7 = Consumptiegoederen ;

Parameters (R input):

par1 = 50 ; par2 = 50 ; par3 = Y ; par4 = Y ; par5 = Intermediairegoederen ; par6 = Investeringsgoederen ; par7 = Consumptiegoederen ;

R code (references can be found in the software module):

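# x, y, z (the data series) and par1..par7 (the parameters) are not defined
# here; they are expected to exist before this code runs.
# Store each series as a one-column array labelled with its variable name.
x <- array(x,dim=c(length(x),1))
colnames(x) <- par5
y <- array(y,dim=c(length(y),1))
colnames(y) <- par6
z <- array(z,dim=c(length(z),1))
colnames(z) <- par7
# Data frame used for the scatterplot matrix
d <- data.frame(cbind(z,y,x))
colnames(d) <- c(par7,par6,par5)
# Grid sizes for the kernel density estimates, clamped to the range 10..500
par1 <- as.numeric(par1)
par2 <- as.numeric(par2)
if (par1>500) par1 <- 500
if (par2>500) par2 <- 500
if (par1<10) par1 <- 10
if (par2<10) par2 <- 10
library(GenKern)     # KernSur()
library(KernSmooth)  # dpik()
library(lattice)     # cloud()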
# Diagonal panel for pairs(): histogram of each variable, scaled to unit height
panel.hist <- function(x, ...)
{
  usr <- par('usr'); on.exit(par(usr))
  par(usr = c(usr[1:2], 0, 1.5) )
  h <- hist(x, plot = FALSE)
  breaks <- h$breaks; nB <- length(breaks)
  y <- h$counts; y <- y/max(y)
  rect(breaks[-nB], 0, breaks[-1], y, col='black', ...)
}
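# Trivariate scatterplots from three viewing angles.
# lattice plots are only drawn when printed, so print() is called explicitly;
# this also works when the code is sourced rather than run at top level.
bitmap(file='cloud1.png')
print(cloud(z~x*y, screen = list(x=-45, y=45, z=35),xlab=par5,ylab=par6,zlab=par7))
dev.off()
bitmap(file='cloud2.png')
print(cloud(z~x*y, screen = list(x=35, y=45, z=25),xlab=par5,ylab=par6,zlab=par7))
dev.off()
bitmap(file='cloud3.png')
print(cloud(z~x*y, screen = list(x=35, y=-25, z=90),xlab=par5,ylab=par6,zlab=par7))
dev.off()
# Scatterplot matrix with histograms on the diagonal
bitmap(file='pairs.png')
pairs(d,diag.panel=panel.hist)
dev.off()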
# The density and regression functions below expect plain vectors
x <- as.vector(x)
y <- as.vector(y)
z <- as.vector(z)
# Bivariate kernel density of (x,y): density image, optional contour lines
# (par3) and data points (par4), plus the least-squares line of y on x
bitmap(file='bidensity1.png')
op <- KernSur(x,y, xgridsize=par1, ygridsize=par2, correlation=cor(x,y), xbandwidth=dpik(x), ybandwidth=dpik(y))
image(op$xords, op$yords, op$zden, col=terrain.colors(100), axes=TRUE,main='Bivariate Kernel Density Plot (x,y)',xlab=par5,ylab=par6)
if (par3=='Y') contour(op$xords, op$yords, op$zden, add=TRUE)
if (par4=='Y') points(x,y)
(r<-lm(y ~ x))  # outer parentheses print the fitted model
abline(r)
box()
dev.off()
# Same plot for (y,z), with the regression of z on y
bitmap(file='bidensity2.png')
op <- KernSur(y,z, xgridsize=par1, ygridsize=par2, correlation=cor(y,z), xbandwidth=dpik(y), ybandwidth=dpik(z))
op  # print the estimated density surface to the raw output
image(op$xords, op$yords, op$zden, col=terrain.colors(100), axes=TRUE,main='Bivariate Kernel Density Plot (y,z)',xlab=par6,ylab=par7)
if (par3=='Y') contour(op$xords, op$yords, op$zden, add=TRUE)
if (par4=='Y') points(y,z)
(r<-lm(z ~ y))
abline(r)
box()
dev.off()
# Same plot for (x,z), with the regression of z on x
bitmap(file='bidensity3.png')
op <- KernSur(x,z, xgridsize=par1, ygridsize=par2, correlation=cor(x,z), xbandwidth=dpik(x), ybandwidth=dpik(z))
op  # print the estimated density surface to the raw output
image(op$xords, op$yords, op$zden, col=terrain.colors(100), axes=TRUE,main='Bivariate Kernel Density Plot (x,z)',xlab=par5,ylab=par7)
if (par3=='Y') contour(op$xords, op$yords, op$zden, add=TRUE)
if (par4=='Y') points(x,z)
(r<-lm(z ~ x))
abline(r)
box()
dev.off()
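
The R code above uses `x`, `y`, `z` and `par1` to `par7` without defining them, so they are presumably supplied by the server before the code runs. To try the code elsewhere, stand-in inputs along the following lines might be defined first. This is only a sketch: the parameter values are copied from the listing above and are assumed to arrive as character strings, while the three series are simulated placeholders and not the original data.

```r
# Stand-in inputs for running the code outside the server.
# Parameter values are taken from the "Parameters" listing; passing them
# as strings is an assumption suggested by the as.numeric() calls.
par1 <- "50"                      # x grid size for the density estimate
par2 <- "50"                      # y grid size for the density estimate
par3 <- "Y"                       # draw contour lines
par4 <- "Y"                       # draw the data points
par5 <- "Intermediairegoederen"   # label for x
par6 <- "Investeringsgoederen"    # label for y
par7 <- "Consumptiegoederen"      # label for z

# Simulated placeholder series, NOT the original data set
set.seed(42)
n <- 60
common <- rnorm(n)                # shared component so the series correlate
x <- 100 + 10 * common + rnorm(n, sd = 8)
y <- 100 + 10 * common + rnorm(n, sd = 8)
z <- 100 + 10 * common + rnorm(n, sd = 8)
```

Running the remaining code additionally requires the GenKern, KernSmooth and lattice packages, and `bitmap()` needs Ghostscript to be installed.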
