Repository of Reproducible Computations

Free Statistics

of Irreproducible Research!

Author's title

Author

*Unverified author*

R Software Module

rwasp_bootstrapplot.wasp

Title produced by software

Blocked Bootstrap Plot - Central Tendency

Date of computation

Thu, 06 Nov 2008 12:21:47 -0700

Cite this page as follows

Statistical Computations at FreeStatistics.org, Office for Research Development and Education, URL https://freestatistics.org/blog/index.php?v=date/2008/Nov/06/t1225999349hn1kduyxv7lmzkh.htm/, Retrieved Tue, 14 May 2024 18:56:25 +0000

Statistical Computations at FreeStatistics.org, Office for Research Development and Education, URL https://freestatistics.org/blog/index.php?pk=22379, Retrieved Tue, 14 May 2024 18:56:25 +0000

QR Codes:

Paste this QR Code to cite your computation.

Original text written by user:

IsPrivate?

No (this computation is public)

User-defined keywords

Estimated Impact

213

Family? (F = Feedback message, R = changed R code, M = changed R Module, P = changed Parameters, D = changed Data)

F     [Notched Boxplots] [workshop 3] [2007-10-26 13:31:48] [e9ffc5de6f8a7be62f22b142b5b6b1a8]
- R  D  [Notched Boxplots] [Q1] [2008-10-28 14:07:27] [3d2d096cc21c6f80db3dd7b8e12effce]
- RM D    [Blocked Bootstrap Plot - Central Tendency] [Workshop 1, Q4 (Ba2)] [2008-11-06 13:03:43] [488d9a19d3c63b747ac6ad96017e55c8]
F             [Blocked Bootstrap Plot - Central Tendency] [] [2008-11-06 19:21:47] [ffe1355fa7fe5626118ee2c4cacbba88] [Current]

Feedback Forum

2008-11-09 16:12:41 [Steven Vercammen] [reply] 
Q4: Deze vraag werd niet volledig correct opgelost: Ten eerste zou op z'n minst de bootstrap simulation-central tendency grafiek moeten worden opgenomen in het word-document Bij bootstrapping wordt er 500 keer het gemiddelde (mediaan/midrange) berekend maar er wordt telkens 1 observatie weggelaten. De student vermeldt ook dat dat de spreiding van geen belang is. Dit is echter niet het geval: zowel de outliers als de spreiding zijn van belang. Men moet afwegen met welk van de 2 men het meest rekening houdt. In principe is de boxplot met de kleinste spreiding de beste maatstaf omdat hij het nauwkeurigst is (in dit geval zou het antwoord dus de mid-range zijn), maar wanneer er teveel outliers zijn (en dit is het geval bij de mid-range)is de kans dat misloopt veel groter. Het gemiddelde heeft hier een grotere spreiding maar weinig outliers en is bijgevolg de beste oplossing.
2008-11-09 19:19:57 [Leen Van Dooren] [reply] 
Bij Q4 kan je zeggen dat de midrange wel degelijk de beste schatter is maar deze heeft wel veel outliers. Je moet je afvragen wat het beste is: weinig outliers of kleine spreiding. 
2008-11-10 21:07:07 [Jonas Scheltjens] [reply] 
Het is niet juist om te stellen dat men bij de bootstrap niet uitmaakt hoe groot de spreiding is. Dit is volledig verkeerd: hoe groter de spreiding hoe minder efficiënt de berekening. Dit heeft dus wel degelijk invloed en is wél relevant. Het is wel juist dat het tevens ook belangrijk is rekening te houden met de outliers. Men moet de voordelen en nadelen van de spreiding te verkleinen en ouliers vermeerderen (mediaan en midrange) vergelijken met het tegenovergestelde (gemiddelde)en dan oordelen naargelang wat het relevantste beeld weergeeft. Hier is dat het gemiddelde.  
Wat vreemd is, is dat er geen bootstrapgrafiek is weergegeven en ook dat er dan wel een bespreking van is.
2008-11-12 10:10:14 [407693b66d7f2e0b350979005057872d] [reply] 
Het geven antwoordt is niet correct omdat: 
 
De punten in de scatterplots van de Mean liggen ver uit elkaar, van de median liggen ze dichter bij elkaar dan de mean en van de midrange liggen de punten op 2 lijnen.  Als we kijken naar de Boxplots dan zien we dat de midrange de kleinste variantie heeft dit geeft normaal ook het beste resultaat maar we zien dat de midrange tov de mean veel meer outliers heeft en is daarom minder betrouwbaar dan de mean die een relatief grote spreiding heeft.  Spreiding is wel belangrijk maar je moet kijken ook of er veel utliers zijn bij de andere plots. 
 

Post a new message

Dataseries X:

Download CSV

Histogram

Boxplots

Summary of computational transaction
Raw Input	view raw input (R code)
Raw Output	view raw output of R engine
Computing time	3 seconds
R Server	'Gwilym Jenkins' @ 72.249.127.135

\begin{tabular}{lllllllll}
\hline
Summary of computational transaction \tabularnewline
Raw Input & view raw input (R code)  \tabularnewline
Raw Output & view raw output of R engine  \tabularnewline
Computing time & 3 seconds \tabularnewline
R Server & 'Gwilym Jenkins' @ 72.249.127.135 \tabularnewline
\hline
\end{tabular}
%Source: https://freestatistics.org/blog/index.php?pk=22379&T=0

[TABLE]
[ROW][C]Summary of computational transaction[/C][/ROW]
[ROW][C]Raw Input[/C][C]view raw input (R code) [/C][/ROW]
[ROW][C]Raw Output[/C][C]view raw output of R engine [/C][/ROW]
[ROW][C]Computing time[/C][C]3 seconds[/C][/ROW]
[ROW][C]R Server[/C][C]'Gwilym Jenkins' @ 72.249.127.135[/C][/ROW]
[/TABLE]
Source: https://freestatistics.org/blog/index.php?pk=22379&T=0

Globally Unique Identifier (entire table): ba.freestatistics.org/blog/index.php?pk=22379&T=0

As an alternative you can also use a QR Code:

The GUIDs for individual cells are displayed in the table below:

Summary of computational transaction
Raw Input	view raw input (R code)
Raw Output	view raw output of R engine
Computing time	3 seconds
R Server	'Gwilym Jenkins' @ 72.249.127.135

Estimation Results of Blocked Bootstrap
statistic	Q1	Estimate	Q3	S.D.	IQR
mean	85.7688524590164	86.8934426229508	88.1184426229508	1.69128159149721	2.34959016393441
median	86.4	87.3	88	1.93211338286564	1.59999999999999
midrange	88.1	88.1	88.85	0.93459051399943	0.75

\begin{tabular}{lllllllll}
\hline
Estimation Results of Blocked Bootstrap \tabularnewline
statistic & Q1 & Estimate & Q3 & S.D. & IQR \tabularnewline
mean & 85.7688524590164 & 86.8934426229508 & 88.1184426229508 & 1.69128159149721 & 2.34959016393441 \tabularnewline
median & 86.4 & 87.3 & 88 & 1.93211338286564 & 1.59999999999999 \tabularnewline
midrange & 88.1 & 88.1 & 88.85 & 0.93459051399943 & 0.75 \tabularnewline
\hline
\end{tabular}
%Source: https://freestatistics.org/blog/index.php?pk=22379&T=1

[TABLE]
[ROW][C]Estimation Results of Blocked Bootstrap[/C][/ROW]
[ROW][C]statistic[/C][C]Q1[/C][C]Estimate[/C][C]Q3[/C][C]S.D.[/C][C]IQR[/C][/ROW]
[ROW][C]mean[/C][C]85.7688524590164[/C][C]86.8934426229508[/C][C]88.1184426229508[/C][C]1.69128159149721[/C][C]2.34959016393441[/C][/ROW]
[ROW][C]median[/C][C]86.4[/C][C]87.3[/C][C]88[/C][C]1.93211338286564[/C][C]1.59999999999999[/C][/ROW]
[ROW][C]midrange[/C][C]88.1[/C][C]88.1[/C][C]88.85[/C][C]0.93459051399943[/C][C]0.75[/C][/ROW]
[/TABLE]
Source: https://freestatistics.org/blog/index.php?pk=22379&T=1

Globally Unique Identifier (entire table): ba.freestatistics.org/blog/index.php?pk=22379&T=1

As an alternative you can also use a QR Code:

The GUIDs for individual cells are displayed in the table below:

Estimation Results of Blocked Bootstrap
statistic	Q1	Estimate	Q3	S.D.	IQR
mean	85.7688524590164	86.8934426229508	88.1184426229508	1.69128159149721	2.34959016393441
median	86.4	87.3	88	1.93211338286564	1.59999999999999
midrange	88.1	88.1	88.85	0.93459051399943	0.75

Figure 1

PNG link

Postscript link

PDF link

Figure 2

PNG link

Postscript link

PDF link

Figure 3

PNG link

Postscript link

PDF link

Figure 4

PNG link

Postscript link

PDF link

Figure 5

PNG link

Postscript link

PDF link

Figure 6

PNG link

Postscript link

PDF link

Figure 7

PNG link

Postscript link

PDF link

Parameters (Session):

par1 = grey ;

Parameters (R input):

par1 = 500 ; par2 = 12 ; par3 = ; par4 = ; par5 = ; par6 = ; par7 = ; par8 = ; par9 = ; par10 = ; par11 = ; par12 = ; par13 = ; par14 = ; par15 = ; par16 = ; par17 = ; par18 = ; par19 = ; par20 = ;

R code (references can be found in the software module):

par1 <- as.numeric(par1)
par2 <- as.numeric(par2)
if (par1 < 10) par1 = 10
if (par1 > 5000) par1 = 5000
if (par2 < 3) par2 = 3
if (par2 > length(x)) par2 = length(x)
library(lattice)
library(boot)
boot.stat <- function(s)
{
s.mean <- mean(s)
s.median <- median(s)
s.midrange <- (max(s) + min(s)) / 2
c(s.mean, s.median, s.midrange)
}
(r <- tsboot(x, boot.stat, R=par1, l=12, sim='fixed'))
bitmap(file='plot1.png')
plot(r$t[,1],type='p',ylab='simulated values',main='Simulation of Mean')
grid()
dev.off()
bitmap(file='plot2.png')
plot(r$t[,2],type='p',ylab='simulated values',main='Simulation of Median')
grid()
dev.off()
bitmap(file='plot3.png')
plot(r$t[,3],type='p',ylab='simulated values',main='Simulation of Midrange')
grid()
dev.off()
bitmap(file='plot4.png')
densityplot(~r$t[,1],col='black',main='Density Plot',xlab='mean')
dev.off()
bitmap(file='plot5.png')
densityplot(~r$t[,2],col='black',main='Density Plot',xlab='median')
dev.off()
bitmap(file='plot6.png')
densityplot(~r$t[,3],col='black',main='Density Plot',xlab='midrange')
dev.off()
z <- data.frame(cbind(r$t[,1],r$t[,2],r$t[,3]))
colnames(z) <- list('mean','median','midrange')
bitmap(file='plot7.png')
boxplot(z,notch=TRUE,ylab='simulated values',main='Bootstrap Simulation - Central Tendency')
grid()
dev.off()
load(file='createtable')
a<-table.start()
a<-table.row.start(a)
a<-table.element(a,'Estimation Results of Blocked Bootstrap',6,TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'statistic',header=TRUE)
a<-table.element(a,'Q1',header=TRUE)
a<-table.element(a,'Estimate',header=TRUE)
a<-table.element(a,'Q3',header=TRUE)
a<-table.element(a,'S.D.',header=TRUE)
a<-table.element(a,'IQR',header=TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'mean',header=TRUE)
q1 <- quantile(r$t[,1],0.25)[[1]]
q3 <- quantile(r$t[,1],0.75)[[1]]
a<-table.element(a,q1)
a<-table.element(a,r$t0[1])
a<-table.element(a,q3)
a<-table.element(a,sqrt(var(r$t[,1])))
a<-table.element(a,q3-q1)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'median',header=TRUE)
q1 <- quantile(r$t[,2],0.25)[[1]]
q3 <- quantile(r$t[,2],0.75)[[1]]
a<-table.element(a,q1)
a<-table.element(a,r$t0[2])
a<-table.element(a,q3)
a<-table.element(a,sqrt(var(r$t[,2])))
a<-table.element(a,q3-q1)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'midrange',header=TRUE)
q1 <- quantile(r$t[,3],0.25)[[1]]
q3 <- quantile(r$t[,3],0.75)[[1]]
a<-table.element(a,q1)
a<-table.element(a,r$t0[3])
a<-table.element(a,q3)
a<-table.element(a,sqrt(var(r$t[,3])))
a<-table.element(a,q3-q1)
a<-table.row.end(a)
a<-table.end(a)
table.save(a,file='mytable.tab')

Free Statistics

Description of Statistical Computation

Tree of Dependent Computations

Dataset

Tables (Output of Computation)

Figures (Output of Computation)

Input Parameters & R Code