<- file stat 97canon.html -> Canonical correlation (1977) Here are exchanges about why canonical correlation is the generic label that includes MANOVA and discriminant function and "multivariate multiple regression"; that last one is not necessarily a standard term. *----------------
  • Canonical corr, etc: naming
  • =======================Rich Ulrich, 6 May 1997==========ssc,sse,ssm Subject: Re: Canonical correlation with categorical variables Message-ID: <5knju8$dro@usenet.srv.cis.pitt.edu> A few days ago, : >Kerry W Go (kgo@ix.netcom.com) wrote: : >: Is it possible to perform a canonical correlation if some of the : >: variables are continuous and some are categorial? I replied, : > -- That's what MANOVA does. Now, Kerry says, kgo: I was under the impression that MANOVA is used when the dependent : variables are continuous and the independent variables are : categorical. -- Yes, that is the case. I thought it answered your question, in short, since it should remind you what canonical correlation consists of. And what MANOVA consists of. Here is a much longer answer: For MANOVA, the categorical variables are coded into k-1 (scored 0/1, say) dummy variables, and then canonical correlation is obtained on the covariance matrix between the left-side and the right-side variables. So, express your categorical variables the way you want to: treat them as interval, or use 0/1 dummy codes; append them to the list of your continuous variables, on either side of the canonical equation; and carry out your canonical correlation. The only problem then is figuring out what that messy solution can tell you.... kgo: What if, for example, one of the dependent variables is : categorical and one is continuous, and one of the independent : variables is catagorical and one is continuous? Would MANOVA still : work? No, not until you have a handier computer program than I have seen. Discriminant function is the subset of MANOVA which may be used to display can-corr results when you have just one categorical variable on the left side vs continuous variables. The testing is fairly straightforward, and data-displays include groups centroids, and the 'classification table'. MANOVA displays can-corr tests when there are multiple-category-variables on the left, vs continuous variables; there may be numerous tests provided, both multivariate and univariate, to try to cover the problems that may be intended. A simple Correspondence analysis shows some can-corr results when you have one categorical variable on each side; the testing is often only incidental to getting the graphical data display, and the ordering obtained for categories. (And it is possible to use more than one variable per side, in some fashion.) In sum - You can carry out a canonical correlation of whatever variables you have on hand, and you can include dummy variables to encode each categorical variable, to add to continuous variables. But, presenting your results becomes increasing difficult. I can imagine that someone might set up a "multivariate-multiple regression" with just a couple of dummy variables. But I don't think I would be very happy with any such analysis, unless I could read good sense into the dummy variables themselves. I hope this helps with your question....
  • Canonical correlation (what)
  • =======================Rich Ulrich, 07 May 1997==========ssc,sse,ssm Subject: Re: Canonical correlation with categorical variables Message-ID: <5kqm46$sc2@usenet.srv.cis.pitt.edu> << ...in answer to a Q. about canonical correlation and categorical variables, I wrote ... >> : > -- That's what MANOVA does. Paige Miller (paigem@kodak.com) wrote: : Let me chime in with a comment/question. I don't quite think that : canonical correlation and MANOVA are equivalent when some of the : independent variables are categorical. At least, that's my : understanding. -- sorry, you are wrong. Discriminant function is the *simple* case of MANOVA, where you just do *one* eigen-problem/ canonical correlation. A MANOVA may perform several analyses (one for each main-effect variable, for instance), but each one is a canonical correlation. : The reason I say that is: if all of your independent and dependent : variables are continuous, you could perform multiple regression or you : could perform canonical correlation, and the two procedures do not give : the same results. They do not optimize the same objective function. Now, << ... stuff deleted >> ??? "multiple regression" usually is used to label an analysis with many predictors and just one outcome. "Multivariate multiple-regression" is a name sometimes used for many-to-many prediction, with continuous variables, and so far as I know, it is precisely a canonical correlation - maximizing the correlation of the left side variables with the right side. So I am curious as to what analyses you are describing, above, where you think there are different results... *--------FAQ edit: one week later: Frank Ivis (FIVIS@ARF.ORG) wrote: : Rich, : The SPSS Reference Guide (1990) (p. 801) has an example of : 'multivariate multiple regression', including syntax and output using the : MANOVA procedure. : Hope this helps, -- THANKS - that is probably where I saw it, years ago. But if we don't find it other places, I guess we should not assume that people know what we mean by it. *--------
  • Canonical//multiv. multiple regr
  • =======================Rich Ulrich, 08 May 1997==========ssc,sse,ssm Subject: Re: Canonical correlation with categorical variables Message-ID: <5ktg4s$a67@usenet.srv.cis.pitt.edu> Paige Miller (paigem@kodak.com) wrote: : Richard F Ulrich wrote: << ... snip>> : > ??? "multiple regression" usually is used to label an analysis with : > many predictors and just one outcome. "Multivariate : > multiple-regression" is a name sometimes used for many-to-many prediction, : > with continuous variables, and so far as I know, it is precisely a : > canonical correlation - maximizing the correlation of the left side : > variables with the right side. : > : > So I am curious as to what analyses you are describing, above, where : > you think there are different results... : Multivariate multiple regression optimizes a least squares criterion. It : actually performs a multiple regression for each dependent variable. Okay, Paige, I admit I am surprised a bit: I have just checked 7 MV-text books and two stat manuals, and I STILL cannot find anything labeled, precisely, "multivariate multiple regression". Tabachnick and Fidell's text discusses the canonical analysis that I had in mind, under the name, "multivariate analysis of covariance". The BMDP manual offers (it looks like) just what you are describing, as "multivariate regression." Well, Multivariate ANOVA has multiple criteria and uses canonical roots; so it seems to me that prefacing Multiple regression with "multivariate" ought to be the same generalization. But most of my texts are not new, and don't discuss either model. * * * * * * * * * * * * * * * * * * * * * * * * * * * * *
  • Document by Rich Ulrich. E-mail to wpilib+@pitt.edu
  • FAQ top.
  • Ulrich home page.
  • Ulrich FAQ. http://www.pitt.edu/~wpilib/stats99.html