Minimizing the Negative Log Likelihood for BMD and ABD
Source:vignettes/BMDs_and_ABDs_EstMLExxxBin.Rmd
BMDs_and_ABDs_EstMLExxxBin.Rmd
IT WOULD BE CLEARLY BENEFICIAL FOR YOU BY USING THE RMD FILES IN THE GITHUB DIRECTORY FOR FURTHER EXPLANATION OR UNDERSTANDING OF THE R CODE FOR THE RESULTS OBTAINED IN THE VIGNETTES.
Negative Log Likelihood Estimation or Maximum Likelihood Estimation Technique
All of the Binomial Mixture and Alternate Binomial distributions in this package use this technique. This is very simple, by using the probability mass function we can develop the Likelihood function and then maximize it or develop the Negative Log Likelihood function and minimize it. Maximizing or minimizing means finding appropriate values for the parameters in question which will provide a most maximum Likelihood value or least minimum Negative Log Likelihood value.
If we have one parameter to estimate it is possible to use brute force technique. When the number of parameters need to be estimated increases this is very time consuming. Similarly, if the parameter in estimation has a very broad domain also it would be difficult.
In situations with difficulty of such we can use functions such as
optim
,mle
, mle2
,
nlm
, nlminb
. fitODBOD
is using the
mle2
function of the bbmle
package. There are situations where user defined functions without the
help of mle2
.
Functions which does not need bbmle
package
These situations occurring are very less in comparing to the number
of distributions we have in fitODBOD
. They genuinely occur
when the parameter in need of estimation has a closed domain and that
domain is very small. Here by small it is indicating a scenario of such
as in-between zero and one or greater than zero.
# estimating parameter for p and alpha using Additive Binomial Distribution of Chromosome data
Chromosome_data
|> No.of.Asso fre
|> 1 0 32
|> 2 1 103
|> 3 2 122
|> 4 3 80
The distributions in relation are
- from Alternate Binomial Distribution - Additive Binomial Distribution
There are two parameters to estimate, which are p and alpha. These
parameters are unique values, therefore estimating them once is enough.
Time related to estimation is dependent on the data. The outputs are of
class mlAB
and ml
, where the outputs can be
extracted by the methods print
, summary
,
coef
and AIC
.
|> Estimated parameter p for Chromosome data= 0.580941
|> Estimated parameter alpha for Chromosome data= 0.088305
- from Binomial Mixture Distribution - Triangular Binomial Distribution
There is only one parameter that needs to be estimated, which is mode
or c. This is in-between zero and one. The outputs are of class
mlTB
and ml
, where the outputs can be
extracted by the methods print
, summary
,
coef
and AIC
.
|> Estimated parameter mode or c for Chromosome data= 0.707276
Functions which does need bbmle
package
By Using the vignette file or manual pdf file we
can get to know how the bbmle
package works. Specifically
looking into mle2
function is necessary and also class of
mle2
. Notable points related are in point form as
- coef
- min
- nobs
- minuslogl
- method
It is preferable to try them and understand the functions capability and process. You can achieve proper results by looking at how the inputs should mentioned, because there is a specific format to follow.
Below are instances of using the mle2
function.
The distributions in relation are
from Alternate Binomial distributions
- Beta-Correlated Binomial distribution
Here there are three parameters to be estimated which are cov, a and b. It should be mentioned that cov is a unique value but alpha and beta can take specific combinations.
- cov is in the domain region of -infinity and +infinity.
- a, b are domain in the region of greater than zero.
|> Estimated parameter covariance for Chromosome data= 0.01107002
|> Estimated parameter a for Chromosome data= 12.95957
|> Estimated parameter b for Chromosome data= 9.350353
- Correlated Binomial distribution
Two parameters that needs to be estimated which are cov and p. They are unique values for the distribution.
- cov is in the domain region of -infinity and +infinity.
- p is in the domain region of zero and one.
|> Estimated parameter p for Chromosome data= 0.5809409
|> Estimated parameter covariance for Chromosome data= 0.02149907
- COM Poisson Binomial distribution
Two parameters that needs to be estimated which are p and cov. They are unique values for the distribution.
- cov is in the domain region of -infinity and +infinity.
- p is in the domain region of zero and one.
|> Estimated parameter p for Chromosome data= 0.5687794
|> Estimated parameter covariance for Chromosome data= 0.7025567
- Multiplicative Binomial distribution
Two parameters that needs to be estimated which are p and theta. Only p value is unique.
- p is in the domain region of zero and one.
- theta is in the domain region of greater than zero.
|> Estimated parameter p for Chromosome data= 0.5687846
|> Estimated parameter theta for Chromosome data= 0.8493023
- Lovinson Multiplicative Binomial distribution
Two parameters that needs to be estimated which are p and phi. Only p value is unique.
- p is in the domain region of zero and one.
- phi is in the domain region of greater than zero.
|> Estimated parameter p for Chromosome data= 0.5687846
|> Estimated parameter phi for Chromosome data= 0.8493023
from Binomial Mixture distributions
- Beta-Binomial distribution
Two parameters to be estimated which are a and b. Together they have to represent a unique combination.
- a,b in the domain region of greater than zero.
|> Estimated parameter a for Chromosome data= 6.131277
|> Estimated parameter b for Chromosome data= 4.426927
- Kumaraswamy Binomial distribution
Two parameters to be estimated which are a and b. Together they have to represent a unique combination.
- a,b in the domain region of greater than zero.
- it value or iteration value as a variable replacing the infinity in the summation.
Here the number of iterations has to be estimated as well, which is time consuming. therefore I haven’t estimated it.
|> Estimated parameter a for Chromosome data= 4.220482
|> Estimated parameter b for Chromosome data= 6.008013
|> Estimated parameter iteration for Chromosome data= 7500
- GHGBB distribution
Three parameters to be estimated which are a,b and c. They have to represent a unique combination as a whole.
- a,b,c in the domain region of greater than zero.
|> Estimated parameter a for Chromosome data= 2.774708
|> Estimated parameter b for Chromosome data= 23.16726
|> Estimated parameter c for Chromosome data= 8.626376
- McGBB distribution
Three parameters to be estimated which are a,b and c. They have to represent a unique combination as a whole.
- a,b,c in the domain region of greater than zero.
|> Estimated parameter a for Chromosome data= 0.3867209
|> Estimated parameter b for Chromosome data= 16.18944
|> Estimated parameter c for Chromosome data= 9.358337
- Gamma Binomial distribution
Two parameters to be estimated which are a and b. Together they have to represent a unique combination.
- c,l in the domain region of greater than zero.
|> Estimated parameter c for Chromosome data= 7.470102
|> Estimated parameter l for Chromosome data= 4.326182
- Grassia II Binomial distribution
Two parameters to be estimated which are a and b. Together they have to represent a unique combination.
- a,b in the domain region of greater than zero.
|> Estimated parameter a for Chromosome data= 13.88833
|> Estimated parameter b for Chromosome data= 0.06473275