What is the SPSS Annotated Output for Canonical Correlation Analysis?

What is the SPSS Annotated Output for Canonical Correlation Analysis?

The SPSS Annotated Output for Canonical Correlation Analysis is a statistical tool used to analyze the relationship between two sets of variables. It provides a detailed summary of the results from the canonical correlation analysis, including the correlation coefficients, eigenvalues, and significance levels. The output also includes graphical representations, such as scatterplots and biplots, to visually illustrate the relationship between the two sets of variables. This annotated output serves as a comprehensive and organized report, aiding researchers in interpreting and understanding the results of their analysis.

Canonical Correlation Analysis | SPSS Annotated Output

This page shows an example of a canonical correlation analysis with footnotes
explaining the output in SPSS.  A researcher has collected data on three
psychological variables, four academic variables (standardized test scores) and
gender for 600 college freshman.  She is interested in how the set of
psychological variables relates to the academic variables and gender.  In
particular, the researcher is interested in how many dimensions are necessary to
understand the association between the two sets of variables.

We have a data file,
https://stats.idre.ucla.edu/wp-content/uploads/2016/02/mmr.sav, with 600 observations on eight
variables.  The psychological variables are locus of control,
self-concept
and motivation.  The academic variables are standardized
test scores in reading, writing, math and science.  Additionally, the variable female is a zero-one indicator variable with
the one indicating a female student.  The researcher is interested in the
relationship between the psychological variables and the academic variables,
with gender considered as well.  Canonical correlation analysis aims to
find pairs of linear combinations of each group of variables that are highly
correlated.  These linear combinations are called canonical variates.  Each
canonical variate is orthogonal to the other canonical variates except for the
one with which its correlation has been maximized.  The possible number of such
pairs is limited to the number of variables in the smallest group.  In our
example, there are three psychological variables and more than three academic
variables.  Thus, a canonical correlation analysis on these sets of variables
will generate three pairs of canonical variates.

To begin, let’s read in and summarize the dataset.

get file='d:datammr.sav'.

descriptives
  variables=locus_of_control self_concept motivation 
  read write math science female
  /statistics=mean stddev min max.

 

Image SPSS_CCA1

 

These descriptives indicate that there are not any missing values in the data
and suggest the different scales the different variables. We can proceed with
the canonical correlation analysis without worries of missing data, keeping in
mind that our variables differ widely in scale.

SPSS performs canonical correlation using the manova command with the discrim
option.  The
manova command is one of the SPSS commands that can only be accessed via
syntax; there is not a sequence of pull-down menus or point-and-clicks that
could arrive at this analysis.

Due to the length of the output, we will be omitting some of the output that
is extraneous to our canonical correlation analysis and making comments in
several places along the way.

In the manova command, we first list the variables in our
psychological group (locus_of_control, self_concept and
motivation
).  Then, after the SPSS keyword with, we list the variables in our academic group
(read, write, math, science and female).  SPSS refers to the first group of variables as the “dependent variables” and the
second group of variables as the “covariates”.  This follows manova
convention.

manova locus_of_control self_concept motivation with read write math science female
/ discrim all alpha(1) 
/ print=sig(eigen dim).
...[additional output omitted]...
* * * * * * A n a l y s i s   o f   V a r i a n c e -- design   1 * * * * * *

 EFFECT .. WITHIN CELLS Regression
 Multivariate Tests of Significance (S = 3, M = 1/2, N = 295 )

 Test Name       Value  Approx. F Hypoth. DF   Error DF  Sig. of F

 Pillais          .25425   11.00057      15.00    1782.00       .000
 Hotellings       .31430   12.37633      15.00    1772.00       .000
 Wilks            .75436   11.71573      15.00    1634.65       .000
 Roys             .21538

 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
 Eigenvalues and Canonical Correlations

 Root No.    Eigenvalue        Pct.   Cum. Pct.  Canon Cor.     Sq. Cor

        1          .274      87.336      87.336        .464        .215
        2          .029       9.185      96.522        .168        .028
        3          .011       3.478     100.000        .104        .011

 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
 Dimension Reduction Analysis

 Roots        Wilks L.          F Hypoth. DF   Error DF  Sig. of F

 1 TO 3         .75436   11.71573      15.00    1634.65       .000
 2 TO 3         .96143    2.94446       8.00    1186.00       .003
 3 TO 3         .98919    2.16461       3.00     594.00       .091

 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
...[additional output omitted]...
* * * * * * A n a l y s i s   o f   V a r i a n c e -- design   1 * * * * * *

 Raw canonical coefficients for DEPENDENT variables
           Function No.

 Variable            1          2          3

 locus_of        1.254      -.621       .662
 self_con        -.351     -1.188      -.827
 motivati        1.262      2.027     -2.000

 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
 Standardized canonical coefficients for DEPENDENT variables
           Function No.

 Variable            1          2          3

 locus_of         .840      -.417       .444
 self_con        -.248      -.838      -.583
 motivati         .433       .695      -.686

 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
 Correlations between DEPENDENT and canonical variables
           Function No.

 Variable            1          2          3

 locus_of         .904      -.390       .176
 self_con         .021      -.709      -.705
 motivati         .567       .351      -.745

 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
 Variance in dependent variables explained by canonical variables

 CAN. VAR.  Pct Var DE Cum Pct DE Pct Var CO Cum Pct CO

        1       37.980     37.980      8.180      8.180
        2       25.910     63.889       .727      8.907
        3       36.111    100.000       .391      9.297

 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
 Raw canonical coefficients for COVARIATES
           Function No.

 COVARIATE           1          2          3

 read             .045      -.005      -.021
 write            .036       .042      -.091
 math             .023       .004      -.009
 science          .005      -.085       .110
 female           .632      1.085      1.795


* * * * * * A n a l y s i s   o f   V a r i a n c e -- design   1 * * * * * *

 Standardized canonical coefficients for COVARIATES
           CAN. VAR.

 COVARIATE           1          2          3

 read             .451      -.050      -.216
 write            .349       .409      -.888
 math             .220       .040      -.088
 science          .049      -.827      1.066
 female           .315       .541       .894

 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
 Correlations between COVARIATES and canonical variables
           CAN. VAR.

 Covariate           1          2          3

 read             .840      -.359      -.135
 write            .877       .065      -.255
 math             .764      -.298      -.148
 science          .658      -.677       .230
 female           .364       .755       .543

 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
 Variance in covariates explained by canonical variables

 CAN. VAR.  Pct Var DE Cum Pct DE Pct Var CO Cum Pct CO

        1       11.305     11.305     52.488     52.488
        2         .701     12.006     24.994     77.482
        3         .098     12.104      9.066     86.548

 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
...[additional output omitted]...

Data Summary, Eigenvalues and Hypothesis Tests

* * * * * * A n a l y s i s   o f   V a r i a n c e -- design   1 * * * * * *

 EFFECT .. WITHIN CELLS Regression
 Multivariate Tests of Significance (S = 3, M = 1/2, N = 295 )

 Test Name          Valuee  Approx. Ff Hypoth. DFg   Error DFg  Sig. of Fh

 Pillaisa          .25425   11.00057      15.00    1782.00       .000
 Hotellingsb       .31430   12.37633      15.00    1772.00       .000
 Wilksc            .75436   11.71573      15.00    1634.65       .000
 Roysd             .21538

 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
 Eigenvalues and Canonical Correlations

 Root No.i     Eigenvaluej     Pct.k   Cum. Pct.l  Canon Cor.m     Sq. Corn

        1          .274      87.336      87.336        .464        .215
        2          .029       9.185      96.522        .168        .028
        3          .011       3.478     100.000        .104        .011

 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
 Dimension Reduction Analysis

 Rootso        Wilks L.p     Ff    Hypoth. DFg  Error DFg  Sig. of Fh

 1 TO 3         .75436   11.71573      15.00    1634.65       .000
 2 TO 3         .96143    2.94446       8.00    1186.00       .003
 3 TO 3         .98919    2.16461       3.00     594.00       .091

a. Pillais – This is Pillai’s trace, one of the four multivariate
statistics calculated by SPSS to test the null hypothesis that the canonical
correlations are zero (which, in turn, means that there is no linear
relationship between the two specified groups of variables).  Pillai’s trace is the sum of the squared canonical
correlations, which can be found in the next section of output (see superscript
n): 0.4642 + 0.1682 + 0.1042 =
0.25425.

b. Hotellings – This is the Hotelling-Lawley trace.  It is very similar
to Pillai’s trace and can be calculated as the sum
of the values of (canonical correlation2/(1-canonical correlation2)).  We can calculate 0.4642
/(1- 0.4642) + 0.1682/(1-0.1682) + 0.1042/(1-0.1042) = 0.31430.

c. Wilks – This is Wilks’ lambda, another multivariate
statistic calculated by SPSS.  It is the product of the values of
(1-canonical correlation2).  In this example, our canonical
correlations are 0.4641,  0.1675, and 0.1040 so the Wilks’ Lambda is (1- 0.4642)*(1-0.1682)*(1-0.1042)
= 0.75436.

d. Roys – This is Roy’s greatest root.  It can be calculated from
the largest eigenvalue: largest eigenvalue/(1 + largest eigenvalue).  Because it is
based on a maximum, it can behave differently from the other three test
statistics.  In instances where the other three are not statistically significant and Roy’s is
statistically significant, the effect should be considered to be not statistically significant.

e. Value – This is the value of the multivariate test
listed in the prior column.

f.
(Approx.) F

These are the F values associated with the various tests that are included in
SPSS’s output.  For the multivariate tests, the F values are approximate.

g. Hypoth. DF, Error DF – These are the degrees of freedom used in
determining the F values.  Note that there are instances in which the
degrees of freedom may be a non-integer because these degrees of freedom are calculated using the mean
squared errors, which are often non-integers.

h. Sig. of F – This is the p-value associated with the F value of a
given test statistic.  The null hypothesis that our two sets of variables are not
linearly related is evaluated with regard to this p-value.  For a given alpha
level, such as 0.05, if the p-value is less than alpha, the null hypothesis is rejected.  If
not, then we fail to reject the null hypothesis.

i. Root No. – This is the rank of the given eigenvalue (largest to
smallest).  There are as many roots as there were variables in the smaller
of the two variable sets.  In this example, our set of psychological
variables contains three variables and our set of academic variables contains
five variables. Thus the smaller variable set contains three variables and the
analysis generates three roots.

j. Eigenvalue – These are the eigenvalues of the product of the model matrix and the inverse of
the error matrix.  These eigenvalues can also be calculated using the squared
canonical correlations.  The largest eigenvalue is equal to largest squared
correlation /(1- largest squared correlation); 0.215/(1-0.215) =
0.274.  These calculations can be completed for each correlation to find
the corresponding eigenvalue.  The relative size of the eigenvalues reflect how
much of the variance in the canonical variates can be explained by the
corresponding canonical correlation.  Thus, the eigenvalue corresponding to
the first correlation is greatest, and all subsequent eigenvalues are smaller.

k. Pct. – This is the percent of the sum of the eigenvalues represented by a given
eigenvalue.  The sum of the three eigenvalues is (0.2745+0.0289+0.0109) =
0.3143.  Then, the proportions can be calculated: 0.2745/0.3143 = 0.8734,
0.0289/0.3143 = 0.0919, and 0.0109/0.3143 = 0.0348.  This is the proportion of explained variance in the canonical variates attributed to
a given canonical correlation.

l. Cum. Pct. – This is the cumulative sum of the percents.

m. Canon Cor. – These are the Pearson correlations of the pairs of
canonical variates.  The first
pair of variates, a linear combination of the psychological measurements and
a linear combination of the academic measurements, has a correlation
coefficient of 0.464.  The second pair has a correlation coefficient of
0.168, and the third pair 0.104. Each subsequent pair of canonical variates is
less correlated.  These can be interpreted as any other Pearson
correlations.  That is, the square of the correlation represents the
proportion of the variance in one group’s variate explained by the other group’s
variate.

n. Sq. Cor – These are the squares of the canonical correlations.
For example, (0.464*0.464) = 0.215.

o. Roots – This is the set of roots included in the null hypothesis
being tested.  The null hypothesis is that all of the correlations
associated with the roots in the given set are equal to zero in the population.  By testing these different sets of roots, we are determining how many dimensions
are required to describe the relationship between the two groups of variables.  Because each root is less informative than the one before it, unnecessary
dimensions will be associated with the smallest eigenvalues.  Thus, we
start our test with the full set of roots and then test subsets generated by
omitting the greatest root in the previous set.  Here, we first tested all three
roots, then roots two and three, and then root three alone.

p. Wilks L. – Here, the Wilks lambda test statistic is used for
testing the null hypothesis that the given canonical correlation and all smaller
ones are equal to zero in the population.  Each value can be calculated as the product of the values of
(1-canonical correlation2) for the set of canonical correlations
being tested.  In this example, our canonical
correlations are 0.464, 0.168 and 0.104, so the value for testing
that all three of the correlations are zero is (1- 0.4642)*(1-0.1682)*(1-0.1042)
= 0.75436.  To test that the two smaller canonical correlations, 0.168
and 0.104, are zero in the population, the value is (1-0.1682)*(1-0.1042)
= 0.96143.  The value for testing that the smallest canonical correlation is zero is (1-0.1042) = 0.98919.


Canonical Coefficients, Correlations, and Variance Explained

* * * * * * A n a l y s i s   o f   V a r i a n c e -- design   1 * * * * * *

 Raw canonical coefficients for DEPENDENT variablesq
           Function No.

 Variable            1          2          3

 locus_of        1.254      -.621       .662
 self_con        -.351     -1.188      -.827
 motivati        1.262      2.027     -2.000

 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
 Standardized canonical coefficients for DEPENDENT variablesr
           Function No.

 Variable            1          2          3

 locus_of         .840      -.417       .444
 self_con        -.248      -.838      -.583
 motivati         .433       .695      -.686

 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
 Correlations between DEPENDENT and canonical variabless
           Function No.

 Variable            1          2          3

 locus_of         .904      -.390       .176
 self_con         .021      -.709      -.705
 motivati         .567       .351      -.745

 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
 Variance in dependent variables explained by canonical variablest

 CAN. VAR.    Pct Var DE Cum Pct DE Pct Var CO Cum Pct CO

        1       37.980     37.980      8.180      8.180
        2       25.910     63.889       .727      8.907
        3       36.111    100.000       .391      9.297

 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
 Raw canonical coefficients for COVARIATESq
           Function No.

 COVARIATE           1          2          3

 read             .045      -.005      -.021
 write            .036       .042      -.091
 math             .023       .004      -.009
 science          .005      -.085       .110
 female           .632      1.085      1.795


* * * * * * A n a l y s i s   o f   V a r i a n c e -- design   1 * * * * * *

 Standardized canonical coefficients for COVARIATESr
           CAN. VAR.

 COVARIATE           1          2          3

 read             .451      -.050      -.216
 write            .349       .409      -.888
 math             .220       .040      -.088
 science          .049      -.827      1.066
 female           .315       .541       .894

 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
 Correlations between COVARIATES and canonical variabless
           CAN. VAR.

 Covariate           1          2          3

 read             .840      -.359      -.135
 write            .877       .065      -.255
 math             .764      -.298      -.148
 science          .658      -.677       .230
 female           .364       .755       .543

 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
 Variance in covariates explained by canonical variablesu

 CAN. VAR.   Pct Var DE Cum Pct DE Pct Var CO Cum Pct CO

        1       11.305     11.305     52.488     52.488
        2         .701     12.006     24.994     77.482
        3         .098     12.104      9.066     86.548

 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

q. Raw canonical coefficients for DEPENDENT/COVARIATE variables
These are the raw canonical coefficients.  They define the linear relationship
between the variables in a given group and the canonical variates.  They can be interpreted in the same
manner as regression coefficients,
assuming the canonical variate as the outcome variable.  For example, a one
unit increase in locus_of_control leads to a 1.254 unit increase in
the first variate of the psychological measurements, and a one unit
increase in read
score leads to a 0.045 unit increase in the first variate of the academic
measurements. Recall that our variables varied in scale. This is reflected in
the varied scale of these raw coefficients.

r.
Standardized canonical coefficients for DEPENDENT/COVARIATE variables

– These are the standardized canonical coefficients.  This means that, if all of
the variables in the analysis are rescaled to have a mean of zero and a standard
deviation of 1, the coefficients generating the canonical variates would
indicate how a one standard deviation increase in the variable would change the
variate.  For example, an increase of one standard deviation in
locus_of_control

would lead to a 0.840 standard deviation increase in the first variate of the psychological
measurements, and an increase of one standard deviation in
read

would lead to a 0.451 standard deviation increase in the first variate of the academic
measurements.

s.
Correlations between DEPENDENT/COVARIATE variables and canonical
variables
– These are the correlations between each variable in a group and the group’s
canonical variates.  For example, we can see in the “dependent” variables that
locus_of_control

has a Pearson correlation of 0.904 with
the first psychological variate, -0.390 with the second psychological variate,
and 0.176 with the third psychological variate.  In the “covariates” section, we
can see that read
has a Pearson correlation of 0.840 with the first academic variate, -0.359 with
the second academic variate, and -0.135 with the third academic variate.

t.
Variance in dependent variables explained by canonical variables

This is the degree to which the canonical variates of both the dependent
variables (DE)
and covariates (CO) can explain the
standardized variability in the dependent variables.  For both sets of
canonical variates, the percent and cumulative percent of variability explained
by each variate is displayed.

u.
Variance in covariates explained by canonical variables

This is the degree to which the canonical variates of both the dependent
variables (DE)
and covariates (CO) can explain the
standardized variability in the covariates.  For both sets of canonical
variates, the percent and cumulative percent of variability explained by each
variate is displayed.

For further information on canonical correlation analysis in SPSS, see the
corresponding
Data Analysis Example page.

 

 

Cite this article

stats writer (2024). What is the SPSS Annotated Output for Canonical Correlation Analysis?. PSYCHOLOGICAL SCALES. Retrieved from https://scales.arabpsychology.com/stats/what-is-the-spss-annotated-output-for-canonical-correlation-analysis/

stats writer. "What is the SPSS Annotated Output for Canonical Correlation Analysis?." PSYCHOLOGICAL SCALES, 30 Jun. 2024, https://scales.arabpsychology.com/stats/what-is-the-spss-annotated-output-for-canonical-correlation-analysis/.

stats writer. "What is the SPSS Annotated Output for Canonical Correlation Analysis?." PSYCHOLOGICAL SCALES, 2024. https://scales.arabpsychology.com/stats/what-is-the-spss-annotated-output-for-canonical-correlation-analysis/.

stats writer (2024) 'What is the SPSS Annotated Output for Canonical Correlation Analysis?', PSYCHOLOGICAL SCALES. Available at: https://scales.arabpsychology.com/stats/what-is-the-spss-annotated-output-for-canonical-correlation-analysis/.

[1] stats writer, "What is the SPSS Annotated Output for Canonical Correlation Analysis?," PSYCHOLOGICAL SCALES, vol. X, no. Y, ص Z-Z, June, 2024.

stats writer. What is the SPSS Annotated Output for Canonical Correlation Analysis?. PSYCHOLOGICAL SCALES. 2024;vol(issue):pages.

Download Post (.PDF)
Slide Up
x
PDF
Scroll to Top