Repository of Reproducible Computations

Free Statistics

of Irreproducible Research!

Author's title

Author

*The author of this computation has been verified*

R Software Module

rwasp_bootstrapplot.wasp

Title produced by software

Blocked Bootstrap Plot - Central Tendency

Date of computation

Wed, 29 Oct 2008 07:05:50 -0600

Cite this page as follows

Statistical Computations at FreeStatistics.org, Office for Research Development and Education, URL https://freestatistics.org/blog/index.php?v=date/2008/Oct/29/t1225285682w7wyk0prebq1hmz.htm/, Retrieved Tue, 14 May 2024 17:59:45 +0000

Statistical Computations at FreeStatistics.org, Office for Research Development and Education, URL https://freestatistics.org/blog/index.php?pk=19835, Retrieved Tue, 14 May 2024 17:59:45 +0000

QR Codes:

Paste this QR Code to cite your computation.

Original text written by user:

IsPrivate?

No (this computation is public)

User-defined keywords

Estimated Impact

209

Family? (F = Feedback message, R = changed R code, M = changed R Module, P = changed Parameters, D = changed Data)

F     [Blocked Bootstrap Plot - Central Tendency] [workshop 3] [2007-10-26 12:36:24] [e9ffc5de6f8a7be62f22b142b5b6b1a8]
F    D    [Blocked Bootstrap Plot - Central Tendency] [Herberekening tas...] [2008-10-29 13:05:50] [e08fee3874f3333d6b7a377a061b860d] [Current]

Feedback Forum

2008-11-08 08:41:11 [Astrid Sniekers] [reply] 
Het antwoord van de student is niet echt eenduidig. Als gemiddelde kunnen we het beste de midrange nemen, omdat hier de spreiding/variantie het kleinst is. De midrange is echter zeer gevoelig voor outliers. De outliers wijken ver van het gemiddelde af en zijn zeker relevant! Omdat deze outliers een probleem zijn, zouden we ook kunnen kiezen voor de mean. Hier is de spreiding dan wel groter, maar omdat hier geen outliers zijn, kunnen we zekerder zijn van het gemiddelde. Meestal wordt het rekenkundig gemiddelde genomen. 
2008-11-10 10:48:26 [Kevin Neelen] [reply] 
Er kan hier gebruik gemaakt worden van de midrange, aangezien daar de varantie het kleinst is (ondanks de aanwezigheid van enkele outliers). 
Echter in sommige gevallen is het misschien raadzamer om gebruik te maken van een methode waarbij minder outliers aanwezig zijn om zo tot de beste estimator te komen. Vandaar dat mean ook zeker niet een volledig foutieve oplossing zou zijn.
2008-11-10 16:55:05 [Michael Van Spaandonck] [reply] 
We zien dat de midrange de kleinste spreiding vertoont in vergelijking met median en mean. Er kan dus besloten worden dat de midrange de beste keuze is, en niet de arithmetic mean. 
 
Soms kan het echter beter zijn (bijvoorbeeld in geval van medicatie) om een “estimator” te nemen zonder outliers, zoals de oude student in het voorbeelddocument gesteld heeft en zoals Astrid en Kevin dit ook doen.
2008-11-10 19:55:52 [Inge Meelberghs] [reply] 
De student maakt een correcte berekening en opteert voor de midrange. Deze zouden volgens hem de beste benadering zijn voor de geschatte waarde van de kledingproductie. Dit is correct al kunnen we dit antwoord nog wat nuanceren.  
 
De midrange geeft de beste benadering omdat de spreiding hier het kleinst is. Een nadeel hier is dat er wel wat outliers zijn die we zeker niet als irrelevant mogen beschouwen! Dit is dan weer niet het geval bij het gemiddelde, hier heb je geen outliers maar is de spreiding wel het grootst. Ook is er hier meer variantie dan bij de midrange. Bij het kiezen van de mean of de midrange moet je kijken naar welk van de twee het beste is afhankelijk van de spreiding en de outliers.  
  
Het antwoord is dus eigenlijk een persoonlijke keuze van wat je belangrijk acht in je voorspellingen.

Post a new message

Dataseries X:

Download CSV

Histogram

Boxplots

Summary of computational transaction
Raw Input	view raw input (R code)
Raw Output	view raw output of R engine
Computing time	4 seconds
R Server	'George Udny Yule' @ 72.249.76.132

\begin{tabular}{lllllllll}
\hline
Summary of computational transaction \tabularnewline
Raw Input & view raw input (R code)  \tabularnewline
Raw Output & view raw output of R engine  \tabularnewline
Computing time & 4 seconds \tabularnewline
R Server & 'George Udny Yule' @ 72.249.76.132 \tabularnewline
\hline
\end{tabular}
%Source: https://freestatistics.org/blog/index.php?pk=19835&T=0

[TABLE]
[ROW][C]Summary of computational transaction[/C][/ROW]
[ROW][C]Raw Input[/C][C]view raw input (R code) [/C][/ROW]
[ROW][C]Raw Output[/C][C]view raw output of R engine [/C][/ROW]
[ROW][C]Computing time[/C][C]4 seconds[/C][/ROW]
[ROW][C]R Server[/C][C]'George Udny Yule' @ 72.249.76.132[/C][/ROW]
[/TABLE]
Source: https://freestatistics.org/blog/index.php?pk=19835&T=0

Globally Unique Identifier (entire table): ba.freestatistics.org/blog/index.php?pk=19835&T=0

As an alternative you can also use a QR Code:

The GUIDs for individual cells are displayed in the table below:

Summary of computational transaction
Raw Input	view raw input (R code)
Raw Output	view raw output of R engine
Computing time	4 seconds
R Server	'George Udny Yule' @ 72.249.76.132

Estimation Results of Blocked Bootstrap
statistic	Q1	Estimate	Q3	S.D.	IQR
mean	85.7446721311476	86.8934426229508	87.9959016393443	1.60288135600116	2.25122950819672
median	86.4	87.3	88	1.81259339887889	1.59999999999999
midrange	87.85	88.1	88.85	1.08083247438044	1

\begin{tabular}{lllllllll}
\hline
Estimation Results of Blocked Bootstrap \tabularnewline
statistic & Q1 & Estimate & Q3 & S.D. & IQR \tabularnewline
mean & 85.7446721311476 & 86.8934426229508 & 87.9959016393443 & 1.60288135600116 & 2.25122950819672 \tabularnewline
median & 86.4 & 87.3 & 88 & 1.81259339887889 & 1.59999999999999 \tabularnewline
midrange & 87.85 & 88.1 & 88.85 & 1.08083247438044 & 1 \tabularnewline
\hline
\end{tabular}
%Source: https://freestatistics.org/blog/index.php?pk=19835&T=1

[TABLE]
[ROW][C]Estimation Results of Blocked Bootstrap[/C][/ROW]
[ROW][C]statistic[/C][C]Q1[/C][C]Estimate[/C][C]Q3[/C][C]S.D.[/C][C]IQR[/C][/ROW]
[ROW][C]mean[/C][C]85.7446721311476[/C][C]86.8934426229508[/C][C]87.9959016393443[/C][C]1.60288135600116[/C][C]2.25122950819672[/C][/ROW]
[ROW][C]median[/C][C]86.4[/C][C]87.3[/C][C]88[/C][C]1.81259339887889[/C][C]1.59999999999999[/C][/ROW]
[ROW][C]midrange[/C][C]87.85[/C][C]88.1[/C][C]88.85[/C][C]1.08083247438044[/C][C]1[/C][/ROW]
[/TABLE]
Source: https://freestatistics.org/blog/index.php?pk=19835&T=1

Globally Unique Identifier (entire table): ba.freestatistics.org/blog/index.php?pk=19835&T=1

As an alternative you can also use a QR Code:

The GUIDs for individual cells are displayed in the table below:

Estimation Results of Blocked Bootstrap
statistic	Q1	Estimate	Q3	S.D.	IQR
mean	85.7446721311476	86.8934426229508	87.9959016393443	1.60288135600116	2.25122950819672
median	86.4	87.3	88	1.81259339887889	1.59999999999999
midrange	87.85	88.1	88.85	1.08083247438044	1

Figure 1

PNG link

Postscript link

PDF link

Figure 2

PNG link

Postscript link

PDF link

Figure 3

PNG link

Postscript link

PDF link

Figure 4

PNG link

Postscript link

PDF link

Figure 5

PNG link

Postscript link

PDF link

Figure 6

PNG link

Postscript link

PDF link

Figure 7

PNG link

Postscript link

PDF link

Parameters (Session):

par1 = 500 ; par2 = 12 ;

Parameters (R input):

par1 = 500 ; par2 = 12 ;

R code (references can be found in the software module):

par1 <- as.numeric(par1)
par2 <- as.numeric(par2)
if (par1 < 10) par1 = 10
if (par1 > 5000) par1 = 5000
if (par2 < 3) par2 = 3
if (par2 > length(x)) par2 = length(x)
library(lattice)
library(boot)
boot.stat <- function(s)
{
s.mean <- mean(s)
s.median <- median(s)
s.midrange <- (max(s) + min(s)) / 2
c(s.mean, s.median, s.midrange)
}
(r <- tsboot(x, boot.stat, R=par1, l=12, sim='fixed'))
bitmap(file='plot1.png')
plot(r$t[,1],type='p',ylab='simulated values',main='Simulation of Mean')
grid()
dev.off()
bitmap(file='plot2.png')
plot(r$t[,2],type='p',ylab='simulated values',main='Simulation of Median')
grid()
dev.off()
bitmap(file='plot3.png')
plot(r$t[,3],type='p',ylab='simulated values',main='Simulation of Midrange')
grid()
dev.off()
bitmap(file='plot4.png')
densityplot(~r$t[,1],col='black',main='Density Plot',xlab='mean')
dev.off()
bitmap(file='plot5.png')
densityplot(~r$t[,2],col='black',main='Density Plot',xlab='median')
dev.off()
bitmap(file='plot6.png')
densityplot(~r$t[,3],col='black',main='Density Plot',xlab='midrange')
dev.off()
z <- data.frame(cbind(r$t[,1],r$t[,2],r$t[,3]))
colnames(z) <- list('mean','median','midrange')
bitmap(file='plot7.png')
boxplot(z,notch=TRUE,ylab='simulated values',main='Bootstrap Simulation - Central Tendency')
grid()
dev.off()
load(file='createtable')
a<-table.start()
a<-table.row.start(a)
a<-table.element(a,'Estimation Results of Blocked Bootstrap',6,TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'statistic',header=TRUE)
a<-table.element(a,'Q1',header=TRUE)
a<-table.element(a,'Estimate',header=TRUE)
a<-table.element(a,'Q3',header=TRUE)
a<-table.element(a,'S.D.',header=TRUE)
a<-table.element(a,'IQR',header=TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'mean',header=TRUE)
q1 <- quantile(r$t[,1],0.25)[[1]]
q3 <- quantile(r$t[,1],0.75)[[1]]
a<-table.element(a,q1)
a<-table.element(a,r$t0[1])
a<-table.element(a,q3)
a<-table.element(a,sqrt(var(r$t[,1])))
a<-table.element(a,q3-q1)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'median',header=TRUE)
q1 <- quantile(r$t[,2],0.25)[[1]]
q3 <- quantile(r$t[,2],0.75)[[1]]
a<-table.element(a,q1)
a<-table.element(a,r$t0[2])
a<-table.element(a,q3)
a<-table.element(a,sqrt(var(r$t[,2])))
a<-table.element(a,q3-q1)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'midrange',header=TRUE)
q1 <- quantile(r$t[,3],0.25)[[1]]
q3 <- quantile(r$t[,3],0.75)[[1]]
a<-table.element(a,q1)
a<-table.element(a,r$t0[3])
a<-table.element(a,q3)
a<-table.element(a,sqrt(var(r$t[,3])))
a<-table.element(a,q3-q1)
a<-table.row.end(a)
a<-table.end(a)
table.save(a,file='mytable.tab')

Free Statistics

Description of Statistical Computation

Tree of Dependent Computations

Dataset

Tables (Output of Computation)

Figures (Output of Computation)

Input Parameters & R Code