Repository of Reproducible Computations

Free Statistics

of Irreproducible Research!

Author's title

Author

*The author of this computation has been verified*

R Software Module

rwasp_edauni.wasp

Title produced by software

Univariate Explorative Data Analysis

Date of computation

Sat, 25 Oct 2008 06:38:27 -0600

Cite this page as follows

Statistical Computations at FreeStatistics.org, Office for Research Development and Education, URL https://freestatistics.org/blog/index.php?v=date/2008/Oct/25/t1224938383rh972qjr7t15tm2.htm/, Retrieved Sun, 19 May 2024 13:35:34 +0000

Statistical Computations at FreeStatistics.org, Office for Research Development and Education, URL https://freestatistics.org/blog/index.php?pk=18696, Retrieved Sun, 19 May 2024 13:35:34 +0000

QR Codes:

Paste this QR Code to cite your computation.

Original text written by user:

IsPrivate?

No (this computation is public)

User-defined keywords

Estimated Impact

189

Family? (F = Feedback message, R = changed R code, M = changed R Module, P = changed Parameters, D = changed Data)

F       [Univariate Explorative Data Analysis] [] [2008-10-25 12:38:27] [5f3e73ccf1ddc75508eed47fa51813d3] [Current]
-   PD    [Univariate Explorative Data Analysis] [Q2] [2008-11-03 18:59:05] [a0d819c22534897f04a2f0b92f1eb36a] 
-   P       [Univariate Explorative Data Analysis] [Q2] [2008-11-03 19:02:09] [a0d819c22534897f04a2f0b92f1eb36a] 

Feedback Forum

2008-11-02 11:21:15 [Ciska Tanghe] [reply] 
Eerst en vooral moet Q2 opgelost worden aan de hand van vier assumpties, die terug te vinden zijn in EDA. De student heeft dit niet gedaan. Daarnaast heeft de student niet alle grafieken berekend die nodig zijn om de vier assumpties te testen. Bij opties moet het aantal lags ingevuld worden, 12 (= 1 jaar) of 38 zijn het best.
2008-11-02 16:44:29 [Stijn Loomans] [reply] 
De student heeft q2 niet berekent en beoordeeld aan de hand van de 4 assumptions. Hij heeft ook niet de benodigde grafieken gemaakt. 
 
De student had ook de lags moeten ingeven ( namelijk 12 of 36) 
 
Assumptie 1:Are the data autocorrelated? (The model assumes no autocorrelation)  
We kunnen dit gaan uittesten door de autocorrelatie of het lagplot te gebruiken. We kunnen uit de lagplot afleiden dat de autocorrelatie heel dicht bij 0 ligt . Als we met lags gaan werken kunnen we zien dat bij lag 12 , lag 24 er beide keren een grote correlatie voorkomt. Dus een seizonale correlatie optreed in deze reeks. 
Hierdoor kun je ook een voorspelling voor de toekomst maken. De 2 stippenlijnen geven de betrouwbaarheidsintervallen weer met een waarschijnlijkheid van 95%. Waardoor er dus een kans van 5% is dat de autocorrelatie erbuiten valt 
 
Conclusie: De tijdreeks is niet random want bevat seizoensgebonden correlatie. 
 
Assumption 2: Is the random component generated by a fixed distribution? (The model assumes a fixed distribution) 
 
Voor deze assumption moeten we gaan zien naar het histogram en density plot 
Je kan hier op zien dat het een normaalverdeling is( met uitschieter aan beide kanten) .  
Ook kunnen we gaan zien naar het Q-Q plot. 
Hier kan je dan een denkbeeldige lijn door de punten trekken. Hier blijkt dat de meeste punten dan op die denkbeeldige lijn zullen liggen dus normaalverdeling.   
 
Assumption 3: Is the deterministic component constant? (The model assumes that the distribution has a fixed location) 
 
We kijken hierbij naar het run sequence plot. Op lange termijn is deze reeks niet constant .  Maar we kunnen ook gaan zien of het gemiddelde constant is , via de central tendency's. 
Bij de robustness van central tendency zien we toch dat het verloop vrij constant is. 
 
Concluse : Maar toch vermoeden we dat het verloop op lange termijn dalend is. 
 
assumtion 4: 
Bij deze assumtie maken we nogmaals gebruik van de run sequence plot. 
we kijken hierbij naar de spreiding van de reeks over de tijd heen. 
 
Om deze grafiek te bekijken splits je hem in twee delen.  De spreiding van het eerste deel  is groter dan die van het tweede deel. 
Waardoor we kunnen afleiden dat de reeks geen fixed variation heeft . Want ze schommeld te hard. 
 
Conlcusion : De tijdreeks voldoet niet aan het model

Post a new message

Dataseries X:

Download CSV

Histogram

Boxplots

Summary of computational transaction
Raw Input	view raw input (R code)
Raw Output	view raw output of R engine
Computing time	2 seconds
R Server	'George Udny Yule' @ 72.249.76.132

\begin{tabular}{lllllllll}
\hline
Summary of computational transaction \tabularnewline
Raw Input & view raw input (R code)  \tabularnewline
Raw Output & view raw output of R engine  \tabularnewline
Computing time & 2 seconds \tabularnewline
R Server & 'George Udny Yule' @ 72.249.76.132 \tabularnewline
\hline
\end{tabular}
%Source: https://freestatistics.org/blog/index.php?pk=18696&T=0

[TABLE]
[ROW][C]Summary of computational transaction[/C][/ROW]
[ROW][C]Raw Input[/C][C]view raw input (R code) [/C][/ROW]
[ROW][C]Raw Output[/C][C]view raw output of R engine [/C][/ROW]
[ROW][C]Computing time[/C][C]2 seconds[/C][/ROW]
[ROW][C]R Server[/C][C]'George Udny Yule' @ 72.249.76.132[/C][/ROW]
[/TABLE]
Source: https://freestatistics.org/blog/index.php?pk=18696&T=0

Globally Unique Identifier (entire table): ba.freestatistics.org/blog/index.php?pk=18696&T=0

As an alternative you can also use a QR Code:

The GUIDs for individual cells are displayed in the table below:

Summary of computational transaction
Raw Input	view raw input (R code)
Raw Output	view raw output of R engine
Computing time	2 seconds
R Server	'George Udny Yule' @ 72.249.76.132

Descriptive Statistics
# observations	61
minimum	66.5
Q1	80.6
median	87.3
mean	86.8934426229508
Q3	94.1
maximum	109.7

\begin{tabular}{lllllllll}
\hline
Descriptive Statistics \tabularnewline
# observations & 61 \tabularnewline
minimum & 66.5 \tabularnewline
Q1 & 80.6 \tabularnewline
median & 87.3 \tabularnewline
mean & 86.8934426229508 \tabularnewline
Q3 & 94.1 \tabularnewline
maximum & 109.7 \tabularnewline
\hline
\end{tabular}
%Source: https://freestatistics.org/blog/index.php?pk=18696&T=1

[TABLE]
[ROW][C]Descriptive Statistics[/C][/ROW]
[ROW][C]# observations[/C][C]61[/C][/ROW]
[ROW][C]minimum[/C][C]66.5[/C][/ROW]
[ROW][C]Q1[/C][C]80.6[/C][/ROW]
[ROW][C]median[/C][C]87.3[/C][/ROW]
[ROW][C]mean[/C][C]86.8934426229508[/C][/ROW]
[ROW][C]Q3[/C][C]94.1[/C][/ROW]
[ROW][C]maximum[/C][C]109.7[/C][/ROW]
[/TABLE]
Source: https://freestatistics.org/blog/index.php?pk=18696&T=1

Globally Unique Identifier (entire table): ba.freestatistics.org/blog/index.php?pk=18696&T=1

As an alternative you can also use a QR Code:

The GUIDs for individual cells are displayed in the table below:

Descriptive Statistics
# observations	61
minimum	66.5
Q1	80.6
median	87.3
mean	86.8934426229508
Q3	94.1
maximum	109.7

Figure 1

PNG link

Postscript link

PDF link

Figure 2

PNG link

Postscript link

PDF link

Figure 3

PNG link

Postscript link

PDF link

Figure 4

PNG link

Postscript link

PDF link

Figure 5

PNG link

Postscript link

PDF link

Figure 6

PNG link

Postscript link

PDF link

Figure 7

PNG link

Postscript link

PDF link

Parameters (Session):

par1 = 0 ; par2 = 0 ;

Parameters (R input):

par1 = 0 ; par2 = 0 ;

R code (references can be found in the software module):

par1 <- as.numeric(par1)
par2 <- as.numeric(par2)
x <- as.ts(x)
library(lattice)
bitmap(file='pic1.png')
plot(x,type='l',main='Run Sequence Plot',xlab='time or index',ylab='value')
grid()
dev.off()
bitmap(file='pic2.png')
hist(x)
grid()
dev.off()
bitmap(file='pic3.png')
if (par1 > 0)
{
densityplot(~x,col='black',main=paste('Density Plot   bw = ',par1),bw=par1)
} else {
densityplot(~x,col='black',main='Density Plot')
}
dev.off()
bitmap(file='pic4.png')
qqnorm(x)
qqline(x)
grid()
dev.off()
if (par2 > 0)
{
bitmap(file='lagplot1.png')
dum <- cbind(lag(x,k=1),x)
dum
dum1 <- dum[2:length(x),]
dum1
z <- as.data.frame(dum1)
z
plot(z,main='Lag plot (k=1), lowess, and regression line')
lines(lowess(z))
abline(lm(z))
dev.off()
if (par2 > 1) {
bitmap(file='lagplotpar2.png')
dum <- cbind(lag(x,k=par2),x)
dum
dum1 <- dum[(par2+1):length(x),]
dum1
z <- as.data.frame(dum1)
z
mylagtitle <- 'Lag plot (k='
mylagtitle <- paste(mylagtitle,par2,sep='')
mylagtitle <- paste(mylagtitle,'), and lowess',sep='')
plot(z,main=mylagtitle)
lines(lowess(z))
dev.off()
}
bitmap(file='pic5.png')
acf(x,lag.max=par2,main='Autocorrelation Function')
grid()
dev.off()
}
summary(x)
load(file='createtable')
a<-table.start()
a<-table.row.start(a)
a<-table.element(a,'Descriptive Statistics',2,TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'# observations',header=TRUE)
a<-table.element(a,length(x))
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'minimum',header=TRUE)
a<-table.element(a,min(x))
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'Q1',header=TRUE)
a<-table.element(a,quantile(x,0.25))
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'median',header=TRUE)
a<-table.element(a,median(x))
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'mean',header=TRUE)
a<-table.element(a,mean(x))
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'Q3',header=TRUE)
a<-table.element(a,quantile(x,0.75))
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'maximum',header=TRUE)
a<-table.element(a,max(x))
a<-table.row.end(a)
a<-table.end(a)
table.save(a,file='mytable.tab')

Free Statistics

Description of Statistical Computation

Tree of Dependent Computations

Dataset

Tables (Output of Computation)

Figures (Output of Computation)

Input Parameters & R Code