Repository of Reproducible Computations

Free Statistics

of Irreproducible Research!

Author's title

Author

*The author of this computation has been verified*

R Software Module

rwasp_bidensity.wasp

Title produced by software

Bivariate Kernel Density Estimation

Date of computation

Wed, 12 Nov 2008 03:45:31 -0700

Cite this page as follows

Statistical Computations at FreeStatistics.org, Office for Research Development and Education, URL https://freestatistics.org/blog/index.php?v=date/2008/Nov/12/t12264868179ni2ipawh809xqq.htm/, Retrieved Sun, 19 May 2024 10:05:20 +0000

Statistical Computations at FreeStatistics.org, Office for Research Development and Education, URL https://freestatistics.org/blog/index.php?pk=24088, Retrieved Sun, 19 May 2024 10:05:20 +0000

QR Codes:

Paste this QR Code to cite your computation.

Original text written by user:

IsPrivate?

No (this computation is public)

User-defined keywords

Estimated Impact

171

Family? (F = Feedback message, R = changed R code, M = changed R Module, P = changed Parameters, D = changed Data)

F       [Bivariate Kernel Density Estimation] [Bivariate density...] [2008-11-12 10:45:31] [a16dfd7e948381d8b6391003c5d09447] [Current]

Feedback Forum

2008-11-16 17:26:45 [006ad2c49b6a7c2ad6ab685cfc1dae56] [reply] 
De grafiek kan gebruikt worden om na te gaan of er sprake is van correlatie tussen de 2 variabelen. Het voordeel dat deze grafiek heeft ten opzichte van de gewone scatterplot is dat door de hoogtelijnen een soort van derde dimensie wordt weergegeven. Je kan op de grafiek zien dat er een positief verband door de ellipsvorm van de cluster die naar boven is gericht.
2008-11-20 12:15:43 [Carole Thielens] [reply] 
Op deze bivariate density plot worden de totale werkloosheid bij personen jonger dan 25 jaar vergeleken met de totale werkloosheid bij personen ouder dan 25 jaar. Het is voor mij niet duidelijk welke gegevensreeks op welke as staat, gezien de assen niet benoemd zijn. 
De studente merkt op dat de correlatie 0.63 bedraagt, wat wijst op een positief verband tussen beide variabelen. Naar mijn mening is dit een uiterst beperkte analyse, waar nog heel veel uitleg bij vereist is.  
Vooreerst zou wat bijkomende uitleg over de bivariate density plot een beter inzicht geven om deze analyse uit te voeren. Bivariate kernel density plots worden gebruikt om scatterplots op een beter manier te bekijken. Hoe hoger deze gelegen zijn, des de hoger het niveau. Bovendien zijn er hoogtelijnen waarneembaar, welke de verzamelingen van punten met eenzelfde concentratie voorstellen. Hoe hoger de correlatie, hoe smaller de ellipsen die worden gevormd door de hoogtelijnen. Smallere ellipsen wijzen ook op een grotere concentratie. Hoe hoger de concentratie, hoe hoger de correlatie, hoe zekerder we kunnen spreken van een lineair verband ! De rechte die doorheen de plot loopt tenslotte, stelt het gemiddelde van alle gegevens voor. 
Op deze bivariate density plot is het positief verband zeer duidelijk, gezien er vanaf de linkerbenedenhoek tot aan de rechterbovenhoek stijgende ellipsen te zien zijn. 
2008-11-22 10:09:08 [Stephanie Vanderlinden] [reply] 
2008-11-22 10:13:11 [Stephanie Vanderlinden] [reply] 
De bivariate density geeft de correlatie weer tussen verschillende variabelen dit aan de hand van regressierechten en hoogtelijnen. Als je toppen ontdekt in de bivariate density plot, kan dit wijzen op clusters. Je moet naar de hoogtelijnen kijken om een verband tussen de variabelen te ontdekken. Als deze ellipsvormig zijn, kan je spreken van een verband. Als de ellips een stijgend verloop vertoont is er een positief verband, als de ellips een dalend verloop vertoont is er een negatief verband. In deze bivariate density is er een positief verband, aangezien er vanaf de linkerbenedenhoek tot aan de rechterbovenhoek stijgende ellipsen te zien zijn.

Post a new message

Dataseries X:

Download CSV

Histogram

Boxplots

Dataseries Y:

Download CSV

Histogram

6,6
6,4
6
5,8
5,5
5,4
5,4
5,6
5,5
5,8
5,8
5,7
5,5
5,3
5,2
5,3
5,3
5
4,8
4,9
5,3
6
6,2
6,4
6,4
6,4
6,2
6,1
6
5,9
6,2
6,2
6,4
6,8
6,9
7
7
6,9
6,7
6,6
6,5
6,4
6,5
6,5
6,6
6,7
6,8
7,2
7,6
7,6
7,3
6,4
6,1
6,3
7,1
7,5
7,4
7,1
6,8
6,9
7,2
7,4
7,3
6,9
6,9
6,8
7,1
7,2
7,1
7
6,9
7
7,4
7,5
7,5
7,4
7,3
7
6,7
6,5
6,5
6,5
6,6
6,8
6,9
6,9
6,8
6,8
6,5
6,1
6
5,9
5,8
5,9
5,9
6,2
6,3
6,2
6
5,8
5,5
5,5
5,7
5,8
5,7

Summary of computational transaction
Raw Input	view raw input (R code)
Raw Output	view raw output of R engine
Computing time	2 seconds
R Server	'George Udny Yule' @ 72.249.76.132

\begin{tabular}{lllllllll}
\hline
Summary of computational transaction \tabularnewline
Raw Input & view raw input (R code)  \tabularnewline
Raw Output & view raw output of R engine  \tabularnewline
Computing time & 2 seconds \tabularnewline
R Server & 'George Udny Yule' @ 72.249.76.132 \tabularnewline
\hline
\end{tabular}
%Source: https://freestatistics.org/blog/index.php?pk=24088&T=0

[TABLE]
[ROW][C]Summary of computational transaction[/C][/ROW]
[ROW][C]Raw Input[/C][C]view raw input (R code) [/C][/ROW]
[ROW][C]Raw Output[/C][C]view raw output of R engine [/C][/ROW]
[ROW][C]Computing time[/C][C]2 seconds[/C][/ROW]
[ROW][C]R Server[/C][C]'George Udny Yule' @ 72.249.76.132[/C][/ROW]
[/TABLE]
Source: https://freestatistics.org/blog/index.php?pk=24088&T=0

Globally Unique Identifier (entire table): ba.freestatistics.org/blog/index.php?pk=24088&T=0

As an alternative you can also use a QR Code:

The GUIDs for individual cells are displayed in the table below:

Summary of computational transaction
Raw Input	view raw input (R code)
Raw Output	view raw output of R engine
Computing time	2 seconds
R Server	'George Udny Yule' @ 72.249.76.132

Bandwidth
x axis	0.95322508561578
y axis	0.313611988309568
Correlation
correlation used in KDE	0.633574745443795
correlation(x,y)	0.633574745443795

\begin{tabular}{lllllllll}
\hline
Bandwidth \tabularnewline
x axis & 0.95322508561578 \tabularnewline
y axis & 0.313611988309568 \tabularnewline
Correlation \tabularnewline
correlation used in KDE & 0.633574745443795 \tabularnewline
correlation(x,y) & 0.633574745443795 \tabularnewline
\hline
\end{tabular}
%Source: https://freestatistics.org/blog/index.php?pk=24088&T=1

[TABLE]
[ROW][C]Bandwidth[/C][/ROW]
[ROW][C]x axis[/C][C]0.95322508561578[/C][/ROW]
[ROW][C]y axis[/C][C]0.313611988309568[/C][/ROW]
[ROW][C]Correlation[/C][/ROW]
[ROW][C]correlation used in KDE[/C][C]0.633574745443795[/C][/ROW]
[ROW][C]correlation(x,y)[/C][C]0.633574745443795[/C][/ROW]
[/TABLE]
Source: https://freestatistics.org/blog/index.php?pk=24088&T=1

Globally Unique Identifier (entire table): ba.freestatistics.org/blog/index.php?pk=24088&T=1

As an alternative you can also use a QR Code:

The GUIDs for individual cells are displayed in the table below:

Bandwidth
x axis	0.95322508561578
y axis	0.313611988309568
Correlation
correlation used in KDE	0.633574745443795
correlation(x,y)	0.633574745443795

Figure 1

PNG link

Postscript link

PDF link

Parameters (Session):

par1 = 50 ; par2 = 50 ; par3 = 0 ; par4 = 0 ; par5 = 0 ; par6 = Y ; par7 = Y ;

Parameters (R input):

par1 = 50 ; par2 = 50 ; par3 = 0 ; par4 = 0 ; par5 = 0 ; par6 = Y ; par7 = Y ;

R code (references can be found in the software module):

par1 <- as(par1,'numeric')
par2 <- as(par2,'numeric')
par3 <- as(par3,'numeric')
par4 <- as(par4,'numeric')
par5 <- as(par5,'numeric')
library('GenKern')
if (par3==0) par3 <- dpik(x)
if (par4==0) par4 <- dpik(y)
if (par5==0) par5 <- cor(x,y)
if (par1 > 500) par1 <- 500
if (par2 > 500) par2 <- 500
bitmap(file='bidensity.png')
op <- KernSur(x,y, xgridsize=par1, ygridsize=par2, correlation=par5, xbandwidth=par3, ybandwidth=par4)
image(op$xords, op$yords, op$zden, col=terrain.colors(100), axes=TRUE,main=main,xlab=xlab,ylab=ylab)
if (par6=='Y') contour(op$xords, op$yords, op$zden, add=TRUE)
if (par7=='Y') points(x,y)
(r<-lm(y ~ x))
abline(r)
box()
dev.off()
load(file='createtable')
a<-table.start()
a<-table.row.start(a)
a<-table.element(a,'Bandwidth',2,TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'x axis',header=TRUE)
a<-table.element(a,par3)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'y axis',header=TRUE)
a<-table.element(a,par4)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'Correlation',2,TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'correlation used in KDE',header=TRUE)
a<-table.element(a,par5)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'correlation(x,y)',header=TRUE)
a<-table.element(a,cor(x,y))
a<-table.row.end(a)
a<-table.end(a)
table.save(a,file='mytable.tab')

Free Statistics

Description of Statistical Computation

Tree of Dependent Computations

Dataset

Tables (Output of Computation)

Figures (Output of Computation)

Input Parameters & R Code