ÇÊ »çÀ̾𽺠| Systat | CambridgeSoft | EndNote

History
Adivisory Board
Research
Lisrel General
New
Lisrel Download
Lisrel Resource
Lisrel Example
Advanced Topic
Lisrel FAQ
Lisrel Reference
HLM General
HLM New
HLM Download
HLM Resource
HLM Example
HLM FAQ
HLM Reference
°¡°Ý ¹× ÁÖ¹®
±³À°±â°ü °¡°Ý
¼­Àû
±âŸ ¼ÒÇÁÆ®¿þ¾î
¹®ÀÇ
±â¼ú Áö¿ø
IRT General
IRT Resource
IRT Example
IRT Reference
SUperMix General
  TESTFACT: Item bifactor analysis of a twelfth-grade science assessment test

Data for this example are based on 32 items from a science assessment test in the subjects of biology, chemistry, and physics administered to twelfth-grade students near the end of the school year. The items were classified by subject matter for purposes of the bifactor analysis.

The first five cases from the data file exampl04.dat are shown below. The FILE keyword on the INPUT command denotes this file as the data source and the SCORES keyword indicates that it contains item scores.

Case001   14523121312421534414334135131545
Case002   34283328312821524114338184145848
Case003   14543223322131554134331134134441
Case004   24423324322421524134315254134242
Case005   24523221122421544514333115131241

The case identification is given in the first 7 columns, and is listed first in the variable format statement. The length of this field is also indicated by the NIDW keyword on the INPUT command. After using the 'T' operator to tab to column 11, the 32 item responses are read as single characters (32A1).

(7A1,T11,32A1)

32 items from the science test are used as indicated by the NITEM keyword on the PROBLEM command, and the RESPONSE keyword denotes the number of possible responses. The six responses are listed in the RESPONSE command. Naming of the items is done using the NAMES command, while the KEY command lists the correct response to each item.

The BIFACTOR command is used to request full-information estimation of loadings on a general factor in the presence of item-group factors. Three item-group factors are present (NIGROUP = 3), with assignation of the items to these groups as specified with the IGROUP keyword. The CPARMS keyword lists the probabilities of chance success on each item. By setting the LIST keyword to 3, the bifactor loadings will be printed in both item and in item-group order in the output file. A total of 30 EM cycles (CYCLES = 30) will be performed in the bifactor solution.

The SCORE command is used to obtain, for each distinct pattern, the EAP score of the general factor of the bifactor model and to obtain the standard error estimate of the general factor score allowing for conditional dependence introduced by the group factors. Factor scores for the first 10 cases will be printed to the output file (LIST = 10) and the guessing model will be used in the computation of the factor scores (CHANCE option).

>TITLE
  ITEM BIFACTOR ANALYSIS OF A TWELFTH-GRADE SCIENCE ASSESSMENT TEST
    THE GENERAL FACTOR WILL BE SCORED
>PROBLEM NITEM=32,RESPONSE=6;
>NAMES
  CHEM01 PHYS02,CHEM03,PHYS04,PHYS05,CHEM06,BIOL07,CHEM08,BIOL09,BIOL10,BIOL11, PHYS12,BIOL13,PHYS14,BIOL15,CHEM16,BIOL17,BIOL18,PHYS19,PHYS20,BIOL21,BIOL22,
PHYS23,BIOL24,PHYS25,PHYS26,BIOL27,PHYS29,CHEM29,PHYS30,BIOL31,CHEM32;
>RESPONSE 8,1,2,3,4,5;
>KEY  14523121312421534414334135131545;
>BIFACTOR NIGROUP=3,LIST=3,CYCLES=30,
    IGROUPS=(2,3,2,3,3,2,1,2,1,1,1,3,1,3,1,2,1,1,3,3,1,1,
       3,1,3,3,1,3,2,3,1,2),
   CPARMS=(0.1,0.1,0.1,0.1,0.1,0.1,0.1,0.1,0.1,0.1,0.1,
            0.1,0.1,0.1,0.1,0.1,0.1,0.1,0.1,0.1,0.1,0.1,
           0.1,0.1,0.1,0.1,0.1,0.1,0.1,0.1,0.1,0.1);
>SCORE LIST=10,CHANCE;
>SAVE PARM,FSCORES;
>INPUT NIDW=4,SCORES,FILE='EXAMPL04.DAT';
(7A1,T11,32A1)
>STOP;

  Discussion of bifactor analysis output

Exampl04.tsf illustrates the extension of a one-factor model to a so-called bifactor model by the inclusion of group factors. The bifactor model is applicable when an achievement test contains more than one subject matter content area. The data set exampl04.dat consists of the results of a 32-item science assessment test in the subjects biology, chemistry, and physics. Items are classified according to subject matter where 1 = biology, 2 = chemistry and 3 = physics. Note that TESTFACT does not estimate guessing parameters, but does allow the user to specify the values (see CPARMS keyword) in which case a 3-parameter model that provides for the effect of guessing is fitted to the data.

The analysis specified in exampl04.tsf produces Phase 0, Phase 1, Phase 2, Phase 6 and Phase 7 output. The interpretation of Phases 0, 1, and 2 are omitted here since a detailed discussion of these parts of the output is given before.

  Phase 6: Bifactor analysis

Display 1 lists the chance and initial intercept and slope estimates. Note that the initial intercept estimates are set equal to zero, the initial slope estimates are set to 1.414 for the general factor and 1.00 for the group factors. These initial values are routinely used in TESTFACT for bifactor models.

DISPLAY   1.    CHANCE AND INITIAL INTERCEPT AND SLOPE ESTIMATES
                   CHANCE  INTERCEPT SLOPES
                                             1        2
    1  CHEM01         0.100     0.000     1.414     1.000
    2  PHYS02         0.100     0.000     1.414     1.000
    3  CHEM03         0.100     0.000     1.414     1.000
    4  PHYS04         0.100     0.000     1.414     1.000
    5  PHYS05         0.100     0.000     1.414     1.000
    6  CHEM06         0.100     0.000     1.414     1.000
          …
   31  BIOL31         0.100     0.000     1.414     1.000
   32  CHEM32         0.100     0.000     1.414     1.000

One may optionally include the TETRACHORIC command (see exampl03.tsf) when fitting a bifactor model. This command is required if a printout of residuals is requested. If a TETRACHORIC command is used, tetrachoric correlations are computed pairwise for the  = 496 pairs of items. There are a total of 20 item pairs that cannot be used since their corresponding  frequency tables contain zero or near-zero off-diagonal or marginal frequencies. In these cases, a tetrachoric correlation of 1 is substituted in the matrix of tetrachoric correlations.

The inclusion or exclusion of the TETRACHORIC command has no effect on the estimation procedure, since the starting values for the marginal maximum likelihood procedure are fixed, and do not depend on the matrix of tetrachoric coefficients.

Display 2-3: EM estimation and quadrature points and weights

The bifactor procedure uses the 9 quadrature points and weights listed below. MML estimation for the bifactor model requires quadrature in only two dimensions. For a more detailed discussion, see the Phase 7 part of the output.

DISPLAY   2.     THE EM ESTIMATION OF PARAMETERS

9 QUADRATURE POINTS

DISPLAY   3.      9 QUADRATURE POINTS AND WEIGHTS

      1        -4.000000         0.000134
      2        -3.000000         0.004432
      3        -2.000000         0.053991
      4        -1.000000         0.241971
      5         0.000000         0.398942
      6         1.000000         0.241971
      7         2.000000         0.053991
      8         3.000000         0.004432
      9         4.000000         0.000134

The number of cycles for the EM algorithm is set equal to 30 (CYCLES = 30 in the BIFACTOR command). At each cycle, the value of ? log L (see Example 3 output) as well as the maximum change in the intercept and slope parameters are given. At cycle 30 the maximum change in intercept is 0.0050. The general factor slope estimates for the 32 items changed at most by 0.0047 while the corresponding value for the group factor equals 0.0095. These values indicate that, although convergence was not attained within the specified 30 cycles, the solution after 30 cycles is probably acceptable for all practical purposes.

CYCLE 30  -2 X MARGINAL LOG-LIKELIHOOD =    0.1882667932D+05
                                    CHANGE =     0.4390039691D-01

MAXIMUM CHANGE OF ESTIMATES
             INTERCEPT =   0.004952 SLOPE =   0.004758

Display 4: Chi-square and degrees of freedom

DISPLAY   4.  CHI-SQUARE = 11150.36  DF = 503.00  P = 0.000

The -value is 1150.36 with degrees of freedom equal to 503. The number of degrees of freedom is calculated as

where N is the number of distinct patterns, n is the number of items, and  is the number of items assigned to group factors. For this example, N = 600, n = 32 and, since all the items are assigned to group factors,  = 32.

The -statistic is only correct when all possible  patterns are observed. For the present sample, since , the -statistic is too inaccurate to be used as a goodness-of-fit test statistic. The difference in the -statistics for alternative models, however, yields a valid test statistic for judging whether the inclusion of additional parameters results in a significant improvement of model fit.

Example

It is hypothesized that the 12 physics items are indicators of a general factor only, while the biology and chemistry items are indicators of a general and two uncorrelated group factors. We wish to test

  • : The 32 items are indicators of a general factor, but the 13 biology and 7 chemistry items are also indicators of two uncorrelated group factors.
  • : The 32 items are indicators of a general as well as three uncorrelated group factors.

To obtain the -statistic and degrees of freedom under , the BIFACTOR command is modified as follows:

>BIFACTOR NIGROUP=2, LIST=3, CYCLES=30,
IGROUPS=(2,0,2,0,0,2,1,2,1,1,1,0,1,0,1,2,1,1,0,0,1,1,0,1,0,0,1,0,2,0,1,2), CPARMS=(0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.1,
0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.1,
0.1, 0.1, 0.1, 0.1);

Note that the NIGROUP keyword is set equal to 2 and that each value of 3, corresponding to the position of the physics items in the data set, is substituted by a value of 0 in the IGROUPS keyword. A symbol indicates that the corresponding items are not assigned to any group factors. A graphical presentation of the  model is shown below.

 

If we run exampl04.tsf with the changes to the BIFACTOR command as discussed above, the -statistic value and degrees of freedom shown below is obtained.

DISPLAY 4.  CHI-SQUARE = 11179.71  DF=515.00 P=0.000

To test  against , one computes the difference in the corresponding s and degrees of freedom. Hence  = 11179.71 ?11150.36 = 29.35 with degrees of freedom equal to 515 ?503 = 12.

Since ,  is rejected and it is concluded that items from all 3 subjects should be used for the group factors.

Display 5: Untransformed item parameters

The estimates for the intercept and slope parameters are listed below.

DISPLAY   5.    UNTRANSFORMED ITEM PARAMETERS
               CHANCE INTERCEPT SLOPES

                                 1      2
   1 CHEM01        0.100     -1.054     0.709     0.417
   2 PHYS02        0.100     0.126     1.019     0.548
   3 CHEM03        0.100     -1.360     1.265     -0.182
   4 PHYS04        0.100     -0.578     0.469     0.377
   5 PHYS05        0.100     0.263     0.635     0.337
   6 CHEM06        0.100     -2.729     1.706     0.308
      …
  31 BIOL31        0.100     1.608     1.447     -0.005
  32 CHEM32        0.100     -1.522     0.190     0.066

An alternative way to present these estimated parameters are shown below for the first 10 items.

   Item     Chance    Intercept   General    Group1    Group2    Group3
   --------------------------------------------------------------------
 1 CHEM01    0.100     -1.054      0.709      0.000     0.417     0.000
 2 PHYS02    0.100      0.126      1.019      0.000     0.000     0.548
 3 CHEM03    0.100     -1.360      1.265      0.000    -0.182     0.000
 4 PHYS04    0.100     -0.578      0.469      0.000     0.000     0.377
 5 PHYS05    0.100      0.263      0.635      0.000     0.000     0.337
 6 CHEM06    0.100     -2.729      1.706      0.000     0.308     0.000
 7 BIOL07    0.100      0.839      0.586      0.636     0.000     0.000
 8 CHEM08    0.100     -2.220      1.144      0.000     0.929     0.000
 9 BIOL09    0.100      1.287      0.212      0.476     0.000     0.000
10 BIOL10    0.100     -0.464      0.762      0.444     0.000     0.000

Display 6: Percent of variance

DISPLAY   6. PERCENT OF VARIANCE
 ----------------------------
   GENERAL     0  31.7580
   ITEM GROUP  1   3.8018
   ITEM GROUP  2   2.7716
   ITEM GROUP  3   2.9551
   UNIQUENESS     58.7134
 ----------------------------

The percentage variance explained by each of the four factors is calculated as follows:

Let  denote the j-th slope parameter for item i, i = 1, 2, ? 32. If we define

then slopes are transformed to factor loadings (see Display 9 in the discussion of the Example 3 output) using the relationship

.

Example

For item 7,

The item 7 loadings are therefore 0.586/1.322 = 0.443 and 0.636/1.322 = 0.481 respectively.

Let  be a  matrix of factor loadings with elements (see Display 7)

The percentage variance explained by factor j is calculated as

where n = 32 and  the j-th characteristic root of  (See also the discussion of the Example 3 output, Display 15). The uniqueness component is calculated as

Display 7: Standardized difficulties, communalities and bifactor loadings

The bifactor loadings are derived from the slope estimates using the formula  (see Display 6 above). The standardized item i difficulty equals .

Example

For item 7,  = 1.322 so that the standardized difficulty is

 

Communalities are equal to the sum of the squares of the factor loadings. For example, the item 1  communality is equal to

DISPLAY   7. BIFACTOR RESULTS IN ITEM SEQUENTIAL ITEM ORDER
     ITEM    GROUP DIFFICULTY COMMUNALITY  GENERAL   SPECIFIC
 ----------------------------------------------------------------
    1 CHEM01    2     0.8138    0.4036     0.5475    0.3222
    2 PHYS02    3   -0.0821     0.5725    0.6663     0.3585
    3 CHEM03    2     0.8380    0.6204     0.7797   -0.1120
    4 PHYS04    3     0.4952    0.2659     0.4022    0.3226
    5 PHYS05    3    -0.2137    0.3408     0.5156    0.2739
    6 CHEM06    2    1.3636    0.7504     0.8524    0.1540
       …
   31 BIOL31    1    -0.9140    0.6768     0.8227   -0.0029
   32 CHEM32    2     1.4923    0.0388     0.1859    0.0649
----------------------------------------------------------------

Display 8: Bifactor results in item group order

The printout below shows the same information as for Display 7, except that the items are re-ordered by group number. All 32 items have positive loadings on the general factor, while the group factor loadings for BIOL31, CHEM03 and PHYS30 are negative, but relatively small.

DISPLAY   8. BIFACTOR RESULTS IN ITEM GROUPORDER
     ITEM    GROUP DIFFICULTY COMMUNALITY  GENERAL   SPECIFIC
 ----------------------------------------------------------------
    7 BIOL07    1    -0.6347    0.4277     0.4430    0.4812
    9 BIOL09    1    -1.1417    0.2136     0.1882    0.4221
   10 BIOL10    1     0.3483    0.4372     0.5713    0.3330
   11 BIOL11    1    -2.0915    0.3887     0.5390    0.3133
   13 BIOL13    1    -0.3328    0.3918     0.5249    0.3411
   15 BIOL15    1    -0.8412    0.4683     0.5493    0.4081
   17 BIOL17    1    -1.7506    0.3391     0.5395    0.2192
   18 BIOL18    1     0.5366    0.7089     0.8358    0.1022
   21 BIOL21    1    -1.3192     0.2177    0.2152     0.4140
   22 BIOL22    1    -1.4669    0.3685     0.5956    0.1175
   24 BIOL24    1    -0.5344    0.3935     0.5922    0.2069
   27 BIOL27    1    -1.0345    0.5244     0.7042    0.1686
   31 BIOL31    1    -0.9140    0.6768     0.8227   -0.0029

    1 CHEM01     2    0.8138     0.4036    0.5475     0.3222
    3 CHEM03    2     0.8380    0.6204     0.7797   -0.1120
    6 CHEM06    2     1.3636    0.7504     0.8524    0.1540
    8 CHEM08    2     1.2465    0.6847     0.6422    0.5219
   16 CHEM16    2     0.3761    0.3262     0.4820    0.3065
   29 CHEM29    2     0.6132    0.6825     0.5533    0.6135
   32 CHEM32    2     1.4923    0.0388     0.1859    0.0649
   

    2 PHYS02     3   -0.0821     0.5725    0.6663     0.3585
    4 PHYS04    3     0.4952    0.2659     0.4022    0.3226
    5 PHYS05    3    -0.2137    0.3408     0.5156    0.2739
   12 PHYS12    3     0.3828    0.0437     0.0999    0.1836
   14 PHYS14    3    -0.5195    0.4537     0.4988    0.4527
   19 PHYS19    3    -0.0128    0.2476     0.4967    0.0289
   20 PHYS20    3    -1.0889    0.4226     0.6200    0.1954
   23 PHYS23    3     0.7102    0.3164     0.4571    0.3279
   25 PHYS25    3     0.5005    0.4205     0.4814    0.4345
   26 PHYS26    3     0.2243    0.6128     0.7500    0.2242
   28 PHYS29    3     0.0337    0.3544     0.5948    0.0255
   30 PHYS30    3     0.3034    0.0979     0.2912   -0.1146
 ----------------------------------------------------------------

  Phase 7: General bifactor score: EAP estimate

The factor scores are so-called expected a-posteriori estimates of the general ability factor under the assumption of normality (see Phase 7, exampl03.out).

Let  denote the general ability for examinee i. The EAP score is the conditional expectation  where  is the item j score for examinee i. From well-known results for conditional distributions it follows that

where

The marginal probability function  is obtained in the EM step using a two-dimensional quadrature formula. Suppose  denotes the set of ordered item scores for the three groups. Under the assumption of uncorrelated group factors, it follows for Example 4 that

where  denotes the general ability for examinee i,  is the group 1 (biology),  the group 2 (chemistry) and  the group 3 (physics) ability respectively. Note that (see Display 9) , , ? .

Under the independence assumption, it follows that

.

Each term in this product can be expressed as a two-dimensional integral. The first term, for example, can be evaluated from

This integral can be approximated by

where  and  are the weights and  and  the points shown as Display 9 of the output.

Display 10: General factor scores and standard errors

The ability scores for each case and corresponding standard error estimates are tabulated. Examinee 5, for example, selected the correct answers to 22 of the 32 items. Therefore, the percentage correct is

The estimated ability score for this candidate is 0.8 with a standard error of 0.411. Candidate 7 also obtained correct answers to 22 of the 32 items, but the ability estimate is 0.691. It is evident that the ability estimated depend on the number of correct items as well as which subset of items was answered correctly.

DISPLAY   10. GENERAL FACTOR SCORE AND STANDARD ERROR (S.E.)

CASE HEADER:
 CASE   NUMBER   PERCENT PERCENT  CASE ID
       PRESENTED CORRECT OMITTED
         SCORE AND S.E.
============================================================================
    1     32     100.0   0.0  Case001
                 2.507  0.591
    2     32      53.1   0.0  Case002
                -0.066  0.323
    3     32      56.2   0.0  Case003
                 0.032  0.380
    4     32      50.0   0.0  Case004
                -0.559  0.505
    5     32     68.8    0.0 Case005
                 0.800  0.411
    6     32      62.5   0.0  Case006
                 0.342  0.491
    7     32      68.8   0.0  Case007
                 0.691  0.469
    8     32      65.6   0.0  Case008
                 0.286 0.463
    9     32      28.1   0.0  Case009
                -1.434  0.518
   10     32      46.9   0.0  Case010
                -0.965  0.342

Summary statistics for score estimates

The number of cases scored is equal to 600, with a mean of ?.0258 and standard deviation of 0.9011. Note that the ability scores are estimated under the assumption that the general factor ability has a normal distribution with mean 0 and standard deviation 1. For large data sets, one ideally wants the estimated ability scores to have mean 0 and standard deviation 1.

The root-mean-square posterior standard deviations are calculated as follows.

where ,  etc.

The RMS value of 0.4394 is relatively large, and indicates that, in general, 95% confidence intervals for the estimated scores will be wide. For example, a 95% confidence interval for examinee 5 is

The empirical reliability is a measure of how close the observed scores are to the true, but unobserved, scores. A reliability of 1, for example, implies that one can safely substitute the observed test scores for the unknown true scores.

SUMMARY STATISTICS FOR SCORE ESTIMATES
======================================

CASES SCORED         600
 MEAN:            -0.0258
 S.D.:             0.9011
 VARIANCE:         0.8119

ROOT-MEAN-SQUARE POSTERIOR STANDARD DEVIATIONS

RMS:              0.4344
 VARIANCE:         0.1887

EMPIRICAL
   RELIABILITY:    0.8114

 

Copyright ¢ç 2008, Scientific Software International, Inc., All rights reserved.
ÇÊ »çÀÌ¾ð½º¢ß °æ±âµµ °í¾ç½Ã Àϻ굿±¸ ¹é¼®µ¿1141-2 À¯´ÏÅ×Å©ºô 825
ÀüÈ­ 031-905-7754 ÆÑ½º 031-905-7724 À̸ÞÀÏ: