From: Robin R High on
Tobias,

It is somewhat of a multi-step process, namely first determine if the
cumulative logit model assumption is OK with LOGISTIC:

PROC LOGISTIC DATA=indat;
CLASS xvar / PARAM=glm;
FREQ count;
MODEL y = xvar / link=clogit ;
run;

Near the top you'll see:


The LOGISTIC Procedure

Score Test for the Proportional Odds Assumption

Chi-Square DF Pr > ChiSq

3.9106 3 0.2713


which the relatively "large" pvalue indicates the cumulative logit model
works



To assess goodness of fit for the clogit model and a categorical
explanatory variable, you can run GENMOD with the aggregate= option

PROC genmod DATA=indat;
CLASS xvar ;
FREQ count;
MODEL y = xvar / dist=multinomial link=clogit aggregate=(xvar) ;
RUN;

that produces the Criteria For Assessing Goodness Of Fit

Criterion DF Value Value/DF

Deviance 3 3.6877 1.2292
Pearson Chi-Square 3 3.6630 1.2210

which are then compared to a chi-square distribution with the indicated
degrees of freedom (DF).

One can verify these fit statistics by computing the expected counts for
each cell of the table (takes a few programming steps) and connect them
with the observed counts entering them into the chisq and dev formulas:

expected = total * prob ; where prob is the estimated probability for
each response level for each level of xvar
total is the sum of all observed counts
for each level of the xvar

sum these values across all cells:

chisq = ( count - expected)**2/expected;
dev = 2*count*(log( count / expected));

Note that if the clogit doesn't work (bad fit or proportional odds not
satisfied) then enter link=glogit (in LOGISTIC or GLIMMIX)' the expected
and observed counts are the same for a categorical xvar

Robin High
UNMC






From:
Tobias Jeppsson <tobias.jeppsson(a)EKOL.SLU.SE>
To:
SAS-L(a)LISTSERV.UGA.EDU
Date:
02/11/2010 05:49 AM
Subject:
Model fit for ordinal multinomial models in GENMOD?
Sent by:
"SAS(r) Discussion" <SAS-L(a)LISTSERV.UGA.EDU>



Hi

I have been fitting models for two response variables, one ordinal and one
normal (using AIC to choose the "best" model). For the normal data I can
evaluate model fit and assumptions as usual, and by looking at residual
plots. However, as I understand it, residuals are not available for
ordinal
multinomial models in GENMOD. How do I then best assess fit and if I meet
the model assumptions? This is the first time that I'm analyzing ordinal
multinomial models, so maybe I'm missing something obvious.

thanks
Tobias Jeppsson
From: Tobias Jeppsson on
Thank you both Robin & Peter for your helpful responses.

When looking at the Score Test for the Proportional Odds Assumption (as Robin suggested) it is indeed significant, indicating a violation of the assumptions. However, when trying Peter's suggestion of comparing the nominal and ordinal formulations I find a relatively good correspondence between predictions, when plotting the predictions for all response classes (R2=0.93).

To test the "robustness" of the results I also ran the same model assuming dist=poisson (in genmod), even though the assumption of an ordinal response feels more natural. The response categories are (0, 1, 2, 3), with a decreasing number of observation in each category going from 0-3. This analysis come to the same basic conclusion about the effects of my predictors (direction and significance).

So basically I feel quite confident in the overall results/patterns in my data, but uneasy about violating the basic assumptions in the ordinal model. What do you think is the best way to proceed?

I'm not certain on how to use the AGGREGATE option to assess fit since I have several predictor variables, both class and continuous. Should I assess fit sequentially by looking at the predictor variables one by one?

And just to clarify, my response variable is threat classifications of species, with most species considered non-threatened and less species in the higher threat categories.

Thanks again

Tobias



-----Original Message-----
From: SAS(r) Discussion [mailto:SAS-L(a)LISTSERV.UGA.EDU] On Behalf Of Robin R High
Sent: den 12 februari 2010 19:51
To: SAS-L(a)LISTSERV.UGA.EDU
Subject: Re: Model fit for ordinal multinomial models in GENMOD?

Tobias,

It is somewhat of a multi-step process, namely first determine if the
cumulative logit model assumption is OK with LOGISTIC:

PROC LOGISTIC DATA=indat;
CLASS xvar / PARAM=glm;
FREQ count;
MODEL y = xvar / link=clogit ;
run;

Near the top you'll see:


The LOGISTIC Procedure

Score Test for the Proportional Odds Assumption

Chi-Square DF Pr > ChiSq

3.9106 3 0.2713


which the relatively "large" pvalue indicates the cumulative logit model
works



To assess goodness of fit for the clogit model and a categorical
explanatory variable, you can run GENMOD with the aggregate= option

PROC genmod DATA=indat;
CLASS xvar ;
FREQ count;
MODEL y = xvar / dist=multinomial link=clogit aggregate=(xvar) ;
RUN;

that produces the Criteria For Assessing Goodness Of Fit

Criterion DF Value Value/DF

Deviance 3 3.6877 1.2292
Pearson Chi-Square 3 3.6630 1.2210

which are then compared to a chi-square distribution with the indicated
degrees of freedom (DF).

One can verify these fit statistics by computing the expected counts for
each cell of the table (takes a few programming steps) and connect them
with the observed counts entering them into the chisq and dev formulas:

expected = total * prob ; where prob is the estimated probability for
each response level for each level of xvar
total is the sum of all observed counts
for each level of the xvar

sum these values across all cells:

chisq = ( count - expected)**2/expected;
dev = 2*count*(log( count / expected));

Note that if the clogit doesn't work (bad fit or proportional odds not
satisfied) then enter link=glogit (in LOGISTIC or GLIMMIX)' the expected
and observed counts are the same for a categorical xvar

Robin High
UNMC






From:
Tobias Jeppsson <tobias.jeppsson(a)EKOL.SLU.SE>
To:
SAS-L(a)LISTSERV.UGA.EDU
Date:
02/11/2010 05:49 AM
Subject:
Model fit for ordinal multinomial models in GENMOD?
Sent by:
"SAS(r) Discussion" <SAS-L(a)LISTSERV.UGA.EDU>



Hi

I have been fitting models for two response variables, one ordinal and one
normal (using AIC to choose the "best" model). For the normal data I can
evaluate model fit and assumptions as usual, and by looking at residual
plots. However, as I understand it, residuals are not available for
ordinal
multinomial models in GENMOD. How do I then best assess fit and if I meet
the model assumptions? This is the first time that I'm analyzing ordinal
multinomial models, so maybe I'm missing something obvious.

thanks
Tobias Jeppsson
From: Shawn Haskell on
On Feb 17, 12:18 pm, tobias.jepps...(a)EKOL.SLU.SE (Tobias Jeppsson)
wrote:
> Thank you both Robin & Peter for your helpful responses.
>
> When looking at the Score Test for the Proportional Odds Assumption (as Robin suggested) it is indeed significant, indicating a violation of the assumptions. However, when trying Peter's suggestion of comparing the nominal and ordinal formulations I find a relatively good correspondence between predictions, when plotting the predictions for all response classes (R2=0.93).
>
> To test the "robustness" of the results I also ran the same model assuming dist=poisson (in genmod), even though the assumption of an ordinal response feels more natural. The response categories are (0, 1, 2, 3), with a decreasing number of observation in each category going from 0-3. This analysis come to the same basic conclusion about the effects of my predictors (direction and significance).
>
> So basically I feel quite confident in the overall results/patterns in my data, but uneasy about violating the basic assumptions in the ordinal model. What do you think is the best way to proceed?
>
> I'm not certain on how to use the AGGREGATE option to assess fit since I have several predictor variables, both class and continuous. Should I assess fit sequentially by looking at the predictor variables one by one?
>
> And just to clarify, my response variable is threat classifications of species, with most species considered non-threatened and less species in the higher threat categories.
>
> Thanks again
>
> Tobias
>
>
>
> -----Original Message-----
> From: SAS(r) Discussion [mailto:SA...(a)LISTSERV.UGA.EDU] On Behalf Of Robin R High
> Sent: den 12 februari 2010 19:51
> To: SA...(a)LISTSERV.UGA.EDU
> Subject: Re: Model fit for ordinal multinomial models in GENMOD?
>
> Tobias,
>
> It is somewhat of a multi-step process, namely first determine if the
> cumulative logit model assumption is OK with LOGISTIC:
>
> PROC LOGISTIC DATA=indat;
> CLASS xvar / PARAM=glm;
> FREQ count;
> MODEL y = xvar / link=clogit ;
> run;
>
> Near the top you'll see:
>
> The LOGISTIC Procedure
>
> Score Test for the Proportional Odds Assumption
>
> Chi-Square       DF     Pr > ChiSq
>
>     3.9106        3         0.2713
>
> which the relatively "large" pvalue indicates the cumulative logit model
> works
>
> To assess goodness of fit for the clogit model and a categorical
> explanatory variable, you can run GENMOD with the aggregate= option
>
> PROC genmod DATA=indat;
> CLASS xvar ;
> FREQ count;
> MODEL y = xvar / dist=multinomial link=clogit aggregate=(xvar) ;
> RUN;
>
> that produces the Criteria For Assessing Goodness Of Fit
>
> Criterion                     DF           Value        Value/DF
>
> Deviance                       3          3.6877          1.2292
> Pearson Chi-Square             3          3.6630          1.2210
>
> which are then compared to a chi-square distribution with the indicated
> degrees of freedom (DF).
>
> One can verify these fit statistics by computing the expected counts for
> each cell of the table (takes a few programming steps) and connect them
> with the observed counts entering them into the chisq and dev formulas:
>
> expected = total * prob ;  where prob is the estimated probability for
> each response level for each level of xvar
>                                  total is the sum of all observed counts
> for each level of the xvar
>
> sum these values across all cells:
>
> chisq = ( count - expected)**2/expected;
> dev   = 2*count*(log( count / expected));
>
> Note that if the clogit doesn't work (bad fit or proportional odds not
> satisfied) then enter link=glogit (in LOGISTIC or GLIMMIX)' the expected
> and observed counts are the same for a categorical xvar
>
> Robin High
> UNMC
>
> From:
> Tobias Jeppsson <tobias.jepps...(a)EKOL.SLU.SE>
> To:
> SA...(a)LISTSERV.UGA.EDU
> Date:
> 02/11/2010 05:49 AM
> Subject:
> Model fit for ordinal multinomial models in GENMOD?
> Sent by:
> "SAS(r) Discussion" <SA...(a)LISTSERV.UGA.EDU>
>
> Hi
>
> I have been fitting models for two response variables, one ordinal and one
> normal (using AIC to choose the "best" model). For the normal data I can
> evaluate model fit and assumptions as usual, and by looking at residual
> plots. However, as I understand it, residuals are not available for
> ordinal
> multinomial models in GENMOD. How do I then best assess fit and if I meet
> the model assumptions? This is the first time that I'm analyzing ordinal
> multinomial models, so maybe I'm missing something obvious.
>
> thanks
> Tobias Jeppsson- Hide quoted text -
>
> - Show quoted text -

I had never used ordinal regression until recently, but did something
similar recently with moose corpora lutea data (assessing reproductive
rates). My ordinal response was identical to yours (0-3). My first
choice was Poisson, but the models turned out to be tremendously under-
dispersed, and zero-inflated models were not really appropriate. Then
I ran the ordinal models - AICc was much improved. Overall, the
effects of interest did not really change between types of models, but
the ordinal response was a much better fit as evidenced by AICc. If
your case is the same, do folks on this listserv agree that this is
worth reporting? Shawn