Free Statistics

of Irreproducible Research!

Author's title

Author*The author of this computation has been verified*
R Software Modulerwasp_boxcoxlin.wasp
Title produced by softwareBox-Cox Linearity Plot
Date of computationSun, 09 Nov 2008 10:39:29 -0700
Cite this page as followsStatistical Computations at FreeStatistics.org, Office for Research Development and Education, URL https://freestatistics.org/blog/index.php?v=date/2008/Nov/09/t1226252409opizea439eadgbw.htm/, Retrieved Sun, 19 May 2024 12:15:52 +0000
Statistical Computations at FreeStatistics.org, Office for Research Development and Education, URL https://freestatistics.org/blog/index.php?pk=22792, Retrieved Sun, 19 May 2024 12:15:52 +0000
QR Codes:

Original text written by user:
IsPrivate?No (this computation is public)
User-defined keywords
Estimated Impact145
Family? (F = Feedback message, R = changed R code, M = changed R Module, P = changed Parameters, D = changed Data)
F       [Box-Cox Linearity Plot] [Various EDA Topic...] [2008-11-09 17:39:29] [0831954c833179c36e9320daee0825b5] [Current]
Feedback Forum
2008-11-14 12:37:14 [Ken Van den Heuvel] [reply
Je stelt: 'We hebben een correlatie van 0.99 waardoor het verschil tussen de “original” en de “transformed” maar 0.01 bedraagt.'

Zo werkt het niet. 0,99 is gewoon de correlatie tussen de 2 reeksen. Er is inderdaad wel weinig verschil met voor en na de transformatie. Kijk naar de y-as box-cox linearity plot, de correlatie waarden verschillen niet al te veel wanneer met verschillende lambda-waarden word getransformeerd. Je kan je dus afvragen om de techniek wel geschikt/noodzakelijk is.
2008-11-19 12:24:52 [Bob Leysen] [reply
De link is correct.

Er is een correlatie van 0,99819 tussen de 2 reeksen (niet tussen de originale en transformed zoals ik had gezegd).

Er is zeer weinig verschil met voor en na de transformatie.

Voor (original): 114,45358
Na (transformed): 114,23342

Als je naar de y-as van de box-cox linearity plot kijkt, zie je dat de correlatiewaarden niet al te veel verschilen wanneer de lambda-waarden worden getransformeerd. Is deze techniek dan wel nuttig?
2008-11-21 15:36:33 [Stijn Van de Velde] [reply
We gaan kijken of er een lambda bestaat zodanig dat je de x-variabele kan transformeren om een lineair verband te maken van de scatterplot. Deze lambda wordt diegene met de grootste correlatie, in dit geval is dat een lambda van 1,04.

Je stelling dat: We hebben een correlatie van 0.99 waardoor het verschil tussen de “original” en de “transformed” maar 0.01 bedraagt. klot wel niet.

Post a new message
Dataseries X:
15107
15024
12083
15761
16943
15070
13660
14769
14725
15998
15371
14957
15470
15102
11704
16284
16727
14969
14861
14583
15306
17904
16379
15420
17871
15913
13867
17823
17872
17422
16705
15991
16584
19124
17839
17209
18587
16258
15142
19202
17747
19090
18040
17516
17752
21073
17170
19440
19795
17575
16165
19465
19932
19961
17343
18924
18574
21351
18595
19823
20844
19640
17735
19814
22239
20682
17819
21872
22117
21866
Dataseries Y:
12055
12113
9617
12646
13581
12162
10970
11880
11888
12927
12300
12093
12381
12197
9455
13168
13428
11981
11885
11692
12234
14341
13131
12421
14286
12865
11160
14316
14389
14014
13419
12770
13316
15333
14243
13824
14963
13203
12199
15509
14200
15170
14058
13786
14148
16542
13588
15582
15803
14131
12923
15612
16034
16037
14038
15331
15038
17402
14993
16044
16930
15921
14417
15961
17852
16484
14216
17430
17840
17629




Summary of computational transaction
Raw Inputview raw input (R code)
Raw Outputview raw output of R engine
Computing time3 seconds
R Server'Gwilym Jenkins' @ 72.249.127.135

\begin{tabular}{lllllllll}
\hline
Summary of computational transaction \tabularnewline
Raw Input & view raw input (R code)  \tabularnewline
Raw Output & view raw output of R engine  \tabularnewline
Computing time & 3 seconds \tabularnewline
R Server & 'Gwilym Jenkins' @ 72.249.127.135 \tabularnewline
\hline
\end{tabular}
%Source: https://freestatistics.org/blog/index.php?pk=22792&T=0

[TABLE]
[ROW][C]Summary of computational transaction[/C][/ROW]
[ROW][C]Raw Input[/C][C]view raw input (R code) [/C][/ROW]
[ROW][C]Raw Output[/C][C]view raw output of R engine [/C][/ROW]
[ROW][C]Computing time[/C][C]3 seconds[/C][/ROW]
[ROW][C]R Server[/C][C]'Gwilym Jenkins' @ 72.249.127.135[/C][/ROW]
[/TABLE]
Source: https://freestatistics.org/blog/index.php?pk=22792&T=0

Globally Unique Identifier (entire table): ba.freestatistics.org/blog/index.php?pk=22792&T=0

As an alternative you can also use a QR Code:  

The GUIDs for individual cells are displayed in the table below:

Summary of computational transaction
Raw Inputview raw input (R code)
Raw Outputview raw output of R engine
Computing time3 seconds
R Server'Gwilym Jenkins' @ 72.249.127.135







Box-Cox Linearity Plot
# observations x70
maximum correlation0.998190870633694
optimal lambda(x)1.04
Residual SD (orginial)114.453582764925
Residual SD (transformed)114.233421359102

\begin{tabular}{lllllllll}
\hline
Box-Cox Linearity Plot \tabularnewline
# observations x & 70 \tabularnewline
maximum correlation & 0.998190870633694 \tabularnewline
optimal lambda(x) & 1.04 \tabularnewline
Residual SD (orginial) & 114.453582764925 \tabularnewline
Residual SD (transformed) & 114.233421359102 \tabularnewline
\hline
\end{tabular}
%Source: https://freestatistics.org/blog/index.php?pk=22792&T=1

[TABLE]
[ROW][C]Box-Cox Linearity Plot[/C][/ROW]
[ROW][C]# observations x[/C][C]70[/C][/ROW]
[ROW][C]maximum correlation[/C][C]0.998190870633694[/C][/ROW]
[ROW][C]optimal lambda(x)[/C][C]1.04[/C][/ROW]
[ROW][C]Residual SD (orginial)[/C][C]114.453582764925[/C][/ROW]
[ROW][C]Residual SD (transformed)[/C][C]114.233421359102[/C][/ROW]
[/TABLE]
Source: https://freestatistics.org/blog/index.php?pk=22792&T=1

Globally Unique Identifier (entire table): ba.freestatistics.org/blog/index.php?pk=22792&T=1

As an alternative you can also use a QR Code:  

The GUIDs for individual cells are displayed in the table below:

Box-Cox Linearity Plot
# observations x70
maximum correlation0.998190870633694
optimal lambda(x)1.04
Residual SD (orginial)114.453582764925
Residual SD (transformed)114.233421359102



Parameters (Session):
Parameters (R input):
R code (references can be found in the software module):
n <- length(x)
c <- array(NA,dim=c(401))
l <- array(NA,dim=c(401))
mx <- 0
mxli <- -999
for (i in 1:401)
{
l[i] <- (i-201)/100
if (l[i] != 0)
{
x1 <- (x^l[i] - 1) / l[i]
} else {
x1 <- log(x)
}
c[i] <- cor(x1,y)
if (mx < abs(c[i]))
{
mx <- abs(c[i])
mxli <- l[i]
}
}
c
mx
mxli
if (mxli != 0)
{
x1 <- (x^mxli - 1) / mxli
} else {
x1 <- log(x)
}
r<-lm(y~x)
se <- sqrt(var(r$residuals))
r1 <- lm(y~x1)
se1 <- sqrt(var(r1$residuals))
bitmap(file='test1.png')
plot(l,c,main='Box-Cox Linearity Plot',xlab='Lambda',ylab='correlation')
grid()
dev.off()
bitmap(file='test2.png')
plot(x,y,main='Linear Fit of Original Data',xlab='x',ylab='y')
abline(r)
grid()
mtext(paste('Residual Standard Deviation = ',se))
dev.off()
bitmap(file='test3.png')
plot(x1,y,main='Linear Fit of Transformed Data',xlab='x',ylab='y')
abline(r1)
grid()
mtext(paste('Residual Standard Deviation = ',se1))
dev.off()
load(file='createtable')
a<-table.start()
a<-table.row.start(a)
a<-table.element(a,'Box-Cox Linearity Plot',2,TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'# observations x',header=TRUE)
a<-table.element(a,n)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'maximum correlation',header=TRUE)
a<-table.element(a,mx)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'optimal lambda(x)',header=TRUE)
a<-table.element(a,mxli)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'Residual SD (orginial)',header=TRUE)
a<-table.element(a,se)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'Residual SD (transformed)',header=TRUE)
a<-table.element(a,se1)
a<-table.row.end(a)
a<-table.end(a)
table.save(a,file='mytable.tab')