• J.J. Jansen
  • R. Bro
  • H.C.J. Hoefsloot
  • F.W.J. van den Berg
  • J.A. Westerhuis
  • A.K. Smilde
Novel post-genomics experiments such as metabolomics provide datasets that are highly multivariate and often reflect an underlying experimental design, developed with a specific experimental question in mind. ANOVA-simultaneous component analysis (ASCA) can be used for the analysis of multivariate data obtained from an experimental design instead of the widely used principal component analysis (PCA). This increases the interpretability of the model in terms of the experimental question. Aside from the levels of individual factors, variation that can be described by the experimental design may also depend on levels of multiple (crossed) factors simultaneously, e.g. the interactions. ASCA describes each contribution with a PCA model, but a contribution depending on crossed factors may be described more parsimoniously by multiway models like parallel factor analysis (PARAFAC). The combination of PARAFAC and ASCA, named PARAFASCA, provides a view on the data that is both parsimonious and focused on the experimental question. The novel method is used to analyze a dataset in which the effect of two doses of hydrazine on the urinary chemical composition of rats is investigated by time-resolved metabolic fingerprinting with nuclear magnetic resonance (NMR) spectroscopy. This experiment has been conducted to monitor the dose-specific urine composition changes in time upon hydrazine administration. Comparison of the PCA, the ASCA and the PARAFASCA models shows that ASCA and PARAFASCA describe the data more dedicated to the experimental question than PCA, but that PARAFASCA is more parsimonious than ASCA, and separates the variation underlying different effects better.
Original languageEnglish
JournalJournal of Chemometrics
Journal publication date2008

ID: 368740