Repository of Reproducible Computations

Free Statistics

of Irreproducible Research!

Author's title

Author

*The author of this computation has been verified*

R Software Module

rwasp_boxcoxnorm.wasp

Title produced by software

Box-Cox Normality Plot

Date of computation

Thu, 13 Nov 2008 03:40:54 -0700

Cite this page as follows

Statistical Computations at FreeStatistics.org, Office for Research Development and Education, URL https://freestatistics.org/blog/index.php?v=date/2008/Nov/13/t1226572922id854e103g4payo.htm/, Retrieved Sun, 19 May 2024 10:05:15 +0000

Statistical Computations at FreeStatistics.org, Office for Research Development and Education, URL https://freestatistics.org/blog/index.php?pk=24558, Retrieved Sun, 19 May 2024 10:05:15 +0000

QR Codes:

Paste this QR Code to cite your computation.

Original text written by user:

IsPrivate?

No (this computation is public)

User-defined keywords

box cox normality plot

Estimated Impact

155

Family? (F = Feedback message, R = changed R code, M = changed R Module, P = changed Parameters, D = changed Data)

F       [Box-Cox Normality Plot] [workshop 2 Q4 Box...] [2008-11-13 10:40:54] [74c7506a1ea162af3aa8be25bcd05d28] [Current]
F R  D    [Box-Cox Normality Plot] [Box cox normality] [2008-11-13 19:22:02] [3ffd109c9e040b1ae7e5dbe576d4698c] 

Feedback Forum

2008-11-15 12:00:18 [58d427c57bd46519a715a3a7fea6a80f] [reply] 
Theorie: De Box-Cox transformatie is een bijzonder nuttige familie van transformaties. Het wordt gedefinieerd als:  
 waar Y de reactievariabele/antwoordvariabele is en   de transformatieparameter.  
Ook hier geldt: Als   = 0, dan wordt de transformatie ‘log’ toegepast op de gegevens en wordt de bovengenoemde formule buiten beschouwing gelaten.  
Gezien het gebruik van een bepaalde transformatie, zoals de hierboven genoemde Box-Cox trasformatie, kan het erg nuttig zijn om een maatstaf van de normaliteit van de resulterende transformatie te definiëren. Een van de mogelijke maatstaven is het berekenen van de correlatiecoëfficiënt van een Normal Probability Plot. De correlatie wordt berekend tussen de variabelen van de verticale en horizontale as van de propablity plot en is een geschikte maatstaf voor het bepalen van het lineair verband van de probability plot. (Hoe meer lineair de probability plot is, hoe beter een normale verdeling past bij de gegevens). 
Het Box-Cox normality plot is een plot/grafiek van deze correlatiecoëfficiënten voor diverse waarden van de  -parameter. De optimale keuze voor de waarde van   is die waarde die overeenstemt met de maximumcorrelatie op de plot/grafiek. 
 
Definitie van de Box-Cox normality plot: Deze plots worden gevormd door:  
- Op de verticale (y-) as: De correlatiecoëfficiënt van de normal probability plot, na het toepassen van de Box-Cox transformatie.  
- Op de horizontale (x-) as: De gekozen waarde voor  . 
2008-11-19 19:03:09 [Bénédicte Soens] [reply] 
Bij de vorige vraag werd de correlatie gezocht tussen 2 verschillende variabelen en werd er vervolgens een transformatie gevormd. Hier wordt er ook een transformatie toegepast (opnieuw een logarite (te zien in R-code)), met het doel om een normaal verdeling te bekomen. Zoals duidelijk zichtbaar is, was er voor de transformatie geen sprake van een normaalverdeling.
2008-11-24 12:01:34 [Julian De Ruyter] [reply] 
Bij de Box Cox normality plot wordt y getransformeerd om de normaalverdeling te optimaliseren. We gaan lambda tussen 2 en -2 laten verschuiven om een maximum in de functie te bekomen.  
De transformatie van de tijdsreeks heeft echter geen verbeterend effect op de normaalverdeling en op de correlatie.   
Na de transformatie bekomen we net zoals ervoor, geen normaalverdeling...

Post a new message

Dataseries X:

Download CSV

Histogram

Boxplots

8,4
8,4
8,4
8,6
8,9
8,8
8,3
7,5
7,2
7,5
8,8
9,3
9,3
8,7
8,2
8,3
8,5
8,6
8,6
8,2
8,1
8
8,6
8,7
8,8
8,5
8,4
8,5
8,7
8,7
8,6
8,5
8,3
8,1
8,2
8,1
8,1
7,9
7,9
7,9
8
8
7,9
8
7,7
7,2
7,5
7,3
7
7
7
7,2
7,3
7,1
6,8
6,6
6,2
6,2
6,8
6,9
6,8

Summary of computational transaction
Raw Input	view raw input (R code)
Raw Output	view raw output of R engine
Computing time	2 seconds
R Server	'Gwilym Jenkins' @ 72.249.127.135

\begin{tabular}{lllllllll}
\hline
Summary of computational transaction \tabularnewline
Raw Input & view raw input (R code)  \tabularnewline
Raw Output & view raw output of R engine  \tabularnewline
Computing time & 2 seconds \tabularnewline
R Server & 'Gwilym Jenkins' @ 72.249.127.135 \tabularnewline
\hline
\end{tabular}
%Source: https://freestatistics.org/blog/index.php?pk=24558&T=0

[TABLE]
[ROW][C]Summary of computational transaction[/C][/ROW]
[ROW][C]Raw Input[/C][C]view raw input (R code) [/C][/ROW]
[ROW][C]Raw Output[/C][C]view raw output of R engine [/C][/ROW]
[ROW][C]Computing time[/C][C]2 seconds[/C][/ROW]
[ROW][C]R Server[/C][C]'Gwilym Jenkins' @ 72.249.127.135[/C][/ROW]
[/TABLE]
Source: https://freestatistics.org/blog/index.php?pk=24558&T=0

Globally Unique Identifier (entire table): ba.freestatistics.org/blog/index.php?pk=24558&T=0

As an alternative you can also use a QR Code:

The GUIDs for individual cells are displayed in the table below:

Summary of computational transaction
Raw Input	view raw input (R code)
Raw Output	view raw output of R engine
Computing time	2 seconds
R Server	'Gwilym Jenkins' @ 72.249.127.135

Box-Cox Normality Plot
# observations x	61
maximum correlation	0
optimal lambda	-999

\begin{tabular}{lllllllll}
\hline
Box-Cox Normality Plot \tabularnewline
# observations x & 61 \tabularnewline
maximum correlation & 0 \tabularnewline
optimal lambda & -999 \tabularnewline
\hline
\end{tabular}
%Source: https://freestatistics.org/blog/index.php?pk=24558&T=1

[TABLE]
[ROW][C]Box-Cox Normality Plot[/C][/ROW]
[ROW][C]# observations x[/C][C]61[/C][/ROW]
[ROW][C]maximum correlation[/C][C]0[/C][/ROW]
[ROW][C]optimal lambda[/C][C]-999[/C][/ROW]
[/TABLE]
Source: https://freestatistics.org/blog/index.php?pk=24558&T=1

Globally Unique Identifier (entire table): ba.freestatistics.org/blog/index.php?pk=24558&T=1

As an alternative you can also use a QR Code:

The GUIDs for individual cells are displayed in the table below:

Box-Cox Normality Plot
# observations x	61
maximum correlation	0
optimal lambda	-999

Figure 1

PNG link

Postscript link

PDF link

Figure 2

PNG link

Postscript link

PDF link

Figure 3

PNG link

Postscript link

PDF link

Figure 4

PNG link

Postscript link

PDF link

Figure 5

PNG link

Postscript link

PDF link

Parameters (Session):

Parameters (R input):

R code (references can be found in the software module):

n <- length(x)
c <- array(NA,dim=c(401))
l <- array(NA,dim=c(401))
mx <- 0
mxli <- -999
for (i in 1:401)
{
l[i] <- (i-201)/100
if (l[i] != 0)
{
x1 <- (x^l[i] - 1) / l[i]
} else {
x1 <- log(x)
}
c[i] <- cor(qnorm(ppoints(x), mean=0, sd=1),x1)
if (mx < c[i])
{
mx <- c[i]
mxli <- l[i]
}
}
c
mx
mxli
if (mxli != 0)
{
x1 <- (x^mxli - 1) / mxli
} else {
x1 <- log(x)
}
bitmap(file='test1.png')
plot(l,c,main='Box-Cox Normality Plot',xlab='Lambda',ylab='correlation')
mtext(paste('Optimal Lambda =',mxli))
grid()
dev.off()
bitmap(file='test2.png')
hist(x,main='Histogram of Original Data',xlab='X',ylab='frequency')
grid()
dev.off()
bitmap(file='test3.png')
hist(x1,main='Histogram of Transformed Data',xlab='X',ylab='frequency')
grid()
dev.off()
bitmap(file='test4.png')
qqnorm(x)
qqline(x)
grid()
mtext('Original Data')
dev.off()
bitmap(file='test5.png')
qqnorm(x1)
qqline(x1)
grid()
mtext('Transformed Data')
dev.off()
load(file='createtable')
a<-table.start()
a<-table.row.start(a)
a<-table.element(a,'Box-Cox Normality Plot',2,TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'# observations x',header=TRUE)
a<-table.element(a,n)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'maximum correlation',header=TRUE)
a<-table.element(a,mx)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'optimal lambda',header=TRUE)
a<-table.element(a,mxli)
a<-table.row.end(a)
a<-table.end(a)
table.save(a,file='mytable.tab')

Free Statistics

Description of Statistical Computation

Tree of Dependent Computations

Dataset

Tables (Output of Computation)

Figures (Output of Computation)

Input Parameters & R Code