Repository of Reproducible Computations

Free Statistics

of Irreproducible Research!

Author's title

Author

*Unverified author*

R Software Module

rwasp_bidensity.wasp

Title produced by software

Bivariate Kernel Density Estimation

Date of computation

Tue, 11 Nov 2008 04:26:03 -0700

Cite this page as follows

Statistical Computations at FreeStatistics.org, Office for Research Development and Education, URL https://freestatistics.org/blog/index.php?v=date/2008/Nov/11/t1226402831jfojx6mu9ugj3x4.htm/, Retrieved Sun, 19 May 2024 08:51:25 +0000

Statistical Computations at FreeStatistics.org, Office for Research Development and Education, URL https://freestatistics.org/blog/index.php?pk=23328, Retrieved Sun, 19 May 2024 08:51:25 +0000

QR Codes:

Paste this QR Code to cite your computation.

Original text written by user:

IsPrivate?

No (this computation is public)

User-defined keywords

Estimated Impact

130

Family? (F = Feedback message, R = changed R code, M = changed R Module, P = changed Parameters, D = changed Data)

F       [Bivariate Kernel Density Estimation] [8.3 various EDA T...] [2008-11-11 11:26:03] [0cebda6bbc99948f606f5db2560512ab] [Current]

Feedback Forum

2008-11-15 12:07:05 [58d427c57bd46519a715a3a7fea6a80f] [reply] 
Op de bivariate Kernel zie je naast de puntenwolk een lijn die zo dicht mogelijk de puntenwolk benadert (=regressielijn), de hogelijnen zijn de omtrek van de schijfjes. In dit geval zijn de omtreklijnen niet cirkelvormig dus de correlatie is niet gelijk aan 0. Als je de tabel er zou bijzetten zou je kunnen zeggen hoeveel de correlatie is.
  2008-11-24 13:50:37 [58d427c57bd46519a715a3a7fea6a80f] [reply] 
Ter aanvulling: 
* A.h.v. de puntenwolk van de scatterplot gaat men de bivariate density tekenen. Als er in de puntenwolk ergens heel veel punten zich samen bevinden, is er daar een hoge concentratie van punten. Je gaat daar dan ook de hoogste hoogtelijn vinden.  
* Hoogtelijnen hebben te maken met de concentratie van punten: de waarschijnlijkheid dat de punten zich daar bevinden.  
De rechte lijn in de bivariate density is het gemiddelde van alle gegevens. 
2008-11-15 14:06:31 [Philip Van Herck] [reply] 
  2008-11-15 14:15:06 [Philip Van Herck] [reply] 
Bij de Bivariate Kernel Density analyse vinden we een regressielijn terug. Deze lijn benadert de puntenwolken zo dicht mogelijk. In dien alle waarnemingen op deze regressierechte zouden liggen, zouden we kunnen spreken van een perfect lineaire correlatie. We zien ook de hoogtelijnen op de figuur die enige verduidelijking geven omtrent de de dichtheid van de waarnemingen ten op zichte van elkaar. Zo kunnen we zien dat waar de oppervlaktes binnen de hoogtelijnen roze gekleurd zijn, er een zeer hoge concentratie van waarnemingen voorkomt. Indien we, maar dit is hier nu niet het geval, verschillende clusters van hoge concentratie zouden hebben, kunnen we besluiten dat er pieken van concentratie zijn met betrekking tot de tijd. Dit is vooral zo wanneer de hoogtelijnen echt cirkels vormen, en niet wanneer zij ellipsen zijn.
2008-11-18 11:19:31 [407693b66d7f2e0b350979005057872d] [reply] 
Q1 
 
De student heeft hier zijn data bestanden niet bijgevoegd.  Dit antwoord is gedeeltelijk goed beantwoord.  Bij de density plot als de kleuren lichter worden wordt de dichtheid tussen de punten groter deze grafiek wordt gebruikt om de scatterplot op een andere manier te bekijken.  De bivariate density geeft ook hoogtelijnen en regressielijnen weer als de knooppunten op een lijn liggen kunnen we zijn of het een lineair verband is of niet tussen de punten.  Met de trivariate scatterplot kunnen we  de correlatie van 3 variabelen tegelijkertijd bekijken.  Er worden verschillende perspectieven getoond wen maken gebruik van gestandaardiseerde projecties met histogrammen en scatterplots 
 

Post a new message

Dataseries X:

Download CSV

Histogram

Boxplots

Dataseries Y:

Download CSV

Histogram

Summary of computational transaction
Raw Input	view raw input (R code)
Raw Output	view raw output of R engine
Computing time	1 seconds
R Server	'George Udny Yule' @ 72.249.76.132

\begin{tabular}{lllllllll}
\hline
Summary of computational transaction \tabularnewline
Raw Input & view raw input (R code)  \tabularnewline
Raw Output & view raw output of R engine  \tabularnewline
Computing time & 1 seconds \tabularnewline
R Server & 'George Udny Yule' @ 72.249.76.132 \tabularnewline
\hline
\end{tabular}
%Source: https://freestatistics.org/blog/index.php?pk=23328&T=0

[TABLE]
[ROW][C]Summary of computational transaction[/C][/ROW]
[ROW][C]Raw Input[/C][C]view raw input (R code) [/C][/ROW]
[ROW][C]Raw Output[/C][C]view raw output of R engine [/C][/ROW]
[ROW][C]Computing time[/C][C]1 seconds[/C][/ROW]
[ROW][C]R Server[/C][C]'George Udny Yule' @ 72.249.76.132[/C][/ROW]
[/TABLE]
Source: https://freestatistics.org/blog/index.php?pk=23328&T=0

Globally Unique Identifier (entire table): ba.freestatistics.org/blog/index.php?pk=23328&T=0

As an alternative you can also use a QR Code:

The GUIDs for individual cells are displayed in the table below:

Summary of computational transaction
Raw Input	view raw input (R code)
Raw Output	view raw output of R engine
Computing time	1 seconds
R Server	'George Udny Yule' @ 72.249.76.132

Bandwidth
x axis	0.000360840275187140
y axis	4.64890339140664
Correlation
correlation used in KDE	-0.177693041385040
correlation(x,y)	-0.177693041385040

\begin{tabular}{lllllllll}
\hline
Bandwidth \tabularnewline
x axis & 0.000360840275187140 \tabularnewline
y axis & 4.64890339140664 \tabularnewline
Correlation \tabularnewline
correlation used in KDE & -0.177693041385040 \tabularnewline
correlation(x,y) & -0.177693041385040 \tabularnewline
\hline
\end{tabular}
%Source: https://freestatistics.org/blog/index.php?pk=23328&T=1

[TABLE]
[ROW][C]Bandwidth[/C][/ROW]
[ROW][C]x axis[/C][C]0.000360840275187140[/C][/ROW]
[ROW][C]y axis[/C][C]4.64890339140664[/C][/ROW]
[ROW][C]Correlation[/C][/ROW]
[ROW][C]correlation used in KDE[/C][C]-0.177693041385040[/C][/ROW]
[ROW][C]correlation(x,y)[/C][C]-0.177693041385040[/C][/ROW]
[/TABLE]
Source: https://freestatistics.org/blog/index.php?pk=23328&T=1

Globally Unique Identifier (entire table): ba.freestatistics.org/blog/index.php?pk=23328&T=1

As an alternative you can also use a QR Code:

The GUIDs for individual cells are displayed in the table below:

Bandwidth
x axis	0.000360840275187140
y axis	4.64890339140664
Correlation
correlation used in KDE	-0.177693041385040
correlation(x,y)	-0.177693041385040

Figure 1

PNG link

Postscript link

PDF link

Parameters (Session):

par1 = 50 ; par2 = 50 ; par3 = 0 ; par4 = 0 ; par5 = 0 ; par6 = Y ; par7 = Y ;

Parameters (R input):

par1 = 50 ; par2 = 50 ; par3 = 0 ; par4 = 0 ; par5 = 0 ; par6 = Y ; par7 = Y ;

R code (references can be found in the software module):

par1 <- as(par1,'numeric')
par2 <- as(par2,'numeric')
par3 <- as(par3,'numeric')
par4 <- as(par4,'numeric')
par5 <- as(par5,'numeric')
library('GenKern')
if (par3==0) par3 <- dpik(x)
if (par4==0) par4 <- dpik(y)
if (par5==0) par5 <- cor(x,y)
if (par1 > 500) par1 <- 500
if (par2 > 500) par2 <- 500
bitmap(file='bidensity.png')
op <- KernSur(x,y, xgridsize=par1, ygridsize=par2, correlation=par5, xbandwidth=par3, ybandwidth=par4)
image(op$xords, op$yords, op$zden, col=terrain.colors(100), axes=TRUE,main=main,xlab=xlab,ylab=ylab)
if (par6=='Y') contour(op$xords, op$yords, op$zden, add=TRUE)
if (par7=='Y') points(x,y)
(r<-lm(y ~ x))
abline(r)
box()
dev.off()
load(file='createtable')
a<-table.start()
a<-table.row.start(a)
a<-table.element(a,'Bandwidth',2,TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'x axis',header=TRUE)
a<-table.element(a,par3)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'y axis',header=TRUE)
a<-table.element(a,par4)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'Correlation',2,TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'correlation used in KDE',header=TRUE)
a<-table.element(a,par5)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'correlation(x,y)',header=TRUE)
a<-table.element(a,cor(x,y))
a<-table.row.end(a)
a<-table.end(a)
table.save(a,file='mytable.tab')

Free Statistics

Description of Statistical Computation

Tree of Dependent Computations

Dataset

Tables (Output of Computation)

Figures (Output of Computation)

Input Parameters & R Code