Chemometrics in Metabonomics
Chemometrics in Metabonomics
Chemometrics in Metabonomics
Research group for Chemometrics, Institute of Chemistry, Umeå University, Sweden, Biological Chemistry,
Biomedicinal Sciences Division, Faculty of Medicine, Imperial College London, Sir Alexander Fleming Building,
London SW7 2AZ, United Kingdom, Department of Pharmaceutical Chemistry, Uppsala University, Sweden,
and AcurePharma, Uppsala, Sweden
Keywords: Statistical Experimental Design (SED) • PCA • OPLS • Class-specific studies • Dynamic studies •
Multivariate Design
averaging, the functional space is efficiently mapped, and sents a multivariate profile (with K-descriptors) for each object
interactions and synergisms are seen. Antti et al.8 have applied that is a fingerprint of its inherent properties.
a statistical experimental design to investigate the effect of the Geometrically, the multivariate profile represents one point
dose of hydrazine and time on liver toxicity. The result from in K-dimensional space, whose position (coordinates) in this
the NMR and clinical chemistry was evaluated by PLS. The PLS space is given by the values in each descriptor. For multiple
analysis could also reveal the correlation pattern between the profiles, it is possible to construct a two-dimensional data table,
different blocks as well as within blocks according to dose, time, an X matrix, by stacking each multivariate profile on top of
and the interaction between time and dose. each other. The N rows then produce a swarm of points in
Extract Information from Data. In metabonomic studies, K-dimensional space,
the observations and samples are often characterized using 2.1. Projection-Based Methods. The main, underlying as-
modern instrumentation such as GC-MS, LC-MS, and LC- sumption of projection-based methods is that the system or
NMR spectroscopy. The analytical platform is important and process under consideration is driven by a small number of
largely determined by the biological system and the scientific latent variables (LVs).20 Thus, projection-based methods can
question. be regarded as a data analysis toolbox, for indirect observation
Multivariate analyses based on projection methods represent of these LVs. This class of models is conceptually very different
a number of efficient and useful methods for the analysis and from traditional regression models with independent predictor
modeling of these complex data. Principal Component Analy- variables. They are able to handle many, incomplete, and
sis2 (PCA) is the workhorse in chemometrics. PCA makes it is correlated predictor variables in a simple and straightforward
possible to extract and display the systematic variation in the way, hence their wide use.
data. A PCA model provides a summary, or overview, of all Projection methods convert the multidimensional data table
observations or samples in the data table. In addition, group- into a low-dimensional model plane that approximates all rows
ings, trends, and outliers can also be found. Hence, projection- (e.g., objects or observations) in X, that is, the swarm of points.
based methods represent a solid basis for metabonomic The first PCA model component (t1p1T) describes the largest
analysis. Canonical correlation,9 correspondence analysis,10 variation in the swarm of points. The second component
neural networks,11-12 Bayesian modeling,13 and hidden Markov models the second largest variation and so on. All PCA
models14 represent additional modeling methods but are components are mutually linearly orthogonal to each other see
outside the scope of this review. Figure 1. The scores (T) represent a low-dimensional plane that
closely approximates X, that is, the swarm of points. A scatter
Metabonomic studies typically constitute a set of controls
plot of the first two score vectors (t1-t2) provides a summary,
and treated samples, including additional knowledge of the
or overview, of all observations or samples in the data table.
samples, for example, dose, age, gender, and diet. In these
Groupings, trends, and outliers are revealed. The position of
situations, a more focused evaluation and analysis of the data
each object in the model plane is used to relate objects to each
is possible. That is, rather than asking the question “What is
other. Hence, objects that are close to each other have a similar
there?”, one can start to ask, “What is its relation to?” or “What
multivariate profile, given the K-descriptors. Conversely, objects
is the difference between?”. In modeling, this additional
that lie far from each other have dissimilar properties.
knowledge constitutes an extra data table, that is, a Y matrix.
Analogous to the scores, the loading vectors (p1,p2) define
Partial least-squares3 (PLS) and Orthogonal-PLS15-19 (OPLS)
the relation among the measured variables, that is, the columns
represent two modeling methods for relating two data tables.
in the X matrix. A scatter plot, also known as the loading plot
The Y data table can be both quantitative (e.g., age, dose
shows the influence (weight) of the individual X-variables in
concentration) and qualitative (e.g., control/treated) data.
the model. An important feature is that directions in the score
plot correspond to directions in the loading plot, for example,
Chemometric Approach to Metabonomic Studies
for identifying which variables (loadings) separate different
1. Step 1: Define the Aim. It is important to formulate the groups of objects (the scores). This is a powerful tool for
objectives and goals of the metabonomic study. A number of understanding the underlying patterns in the data. Hence,
questions have to be answered and/or taken into consideration projection-based methods represent a solid basis for metabo-
in both the design of study as well as in the evaluation of the nomic analysis.
outcome. For example, What is previously known? What The part of X that is not explained by the model forms the
additional information is needed? How to reach the objectives; residuals (E) and represents the distance between each point
thta is, what experiments are needed and how to perform them? in K-space and its projection on the plane. The scores, loadings,
If these questions cannot be answered, there is no point to and residuals together describe all of the variation in X.
continue.
2. Step 2: Selection of Objects. The selection of the objects X ) TPT + E ) t1p1T + t2p2T + E
(e.g., samples, individuals) needs to span the experimental
domain in a balanced and systematic manner. To be able to 2.2. Multivariate Design. The need and usefulness of
do this, we have to characterize the objects with both measured experimental design in complex systems should be emphasized,
and observed descriptors. This often includes setting up specific because it creates a controlled setting of the environment, even
inclusion and exclusion criteria for the study, such as age span though most of the variation between the different objects is
(e.g., 18-45 years), body mass index (e.g., 20-30), medicinal uncontrolled. Multivariate design (MVD)21,22 is a combination
chemistry profiles (e.g., lipids, glucose), gender, tobacco habits, of multivariate characterization (MVC),23-25 principal compo-
and use of drugs. In addition to those criteria, additional nent analysis (PCA), and Statistical Experimental Design (SED)
information regarding each object is collected by question- to select a diverse set of objects that represents all objects, that
naires that include life style factors, food and drinking habits, is, spans the variation. There is a number of different experi-
social situation, and so on. This collected information repre- mental designs that can be applied to span the variation in a
Figure 1. A principal component analysis (PCA) model approximates the variation in a data table by a low dimensional model plane.
This model plane provides a score plot, where the relation among the observations or samples in the model plane is visualized, for
example, if there are any groupings, trends, or outliers. The loading plot describes the influence of the variables in the model plane,
and the relation among them. An important feature is that directions in the score plot correspond to directions in the loading plot, and
vice versa.
Figure 2. Four objects are selected according to a multivariate design that spans the biological variation.
systematic way and obtain well-balanced data. The most Dumas et al.30assessed the analytical reproducibility of
commonly used are factorial designs6 and D-optimal design26 human urine samples characterized by H-NMR in the INTER-
that fulfill the criteria of balanced data and orthogonality. In MAP study. INTERMAP was launched in 1996 to investigate
MVD, the principal component model scores, for example, t1 the relationship of multiple dietary variables to blood pressure.
and t2 are used to select the objects, see Figure 2. The selection The conclusion was that most errors are due to specimen
is based on diversity between the objects. handling inhomogeneity.
3. Step 3: Sample Preparation and Characterization. In Multiple factor analysis (MFA) was used by Dumas et al.31
metabonomics, it is important to keep the experimental and to integrate NMR and MS data for metabolic fingerprinting on
biological variation at a minimum. At the same time, the cattle treated with anabolic steroids. Only minor overlap was
metabolic analysis should be global, quantitative, robust, found in the correlation structure between the MS and NMR
reproducible, accurate, and interpretable. In addition, the variables. They underline the relation of a multivariate profile
physicochemical diversity of metabolites (amino acids, fatty not only to the biological information content, but also to its
acids, carbohydrates, and organic acids) raises problems for inherent signature from the analytical instrument used.
extraction and working procedures for different analytical 4. Step 4: Evaluation of the Collected Data. In contrast to
techniques. Here, statistical design of experiments represents an 1H-NMR spectrum, data collected from hyphenated instru-
an important strategy to systematically investigate factors and ments such as GC-MS, LC-MS, and UPLC-NMR must be
optimize the experimental protocols. Typical working proce- processed before multivariate analysis. The reason is the two-
dures for NMR spectroscopy for biofluids and tissue extraction dimensional nature (e.g., chromatogram/mass spectra) of the
are found in Appendix 4, in the SMRS Policy document.27 For data for each sample. Curve resolution or deconvolution
GC-MS, see refs 28 and 29. methods are mainly applied for data processing32-36 that result
Figure 3. In the score plot (left panel), the confidence interval is defined by the Hotelling’s T2 ellipse (95% confidence interval), and
observations outside the confidence ellipse are considered outliers. Outliers can also be detected by the distance to model parameter,
DModX, based on the model residuals (right panel).
in a multivariate profile for each sample. Since a variable in a Interesting individual observations, such as outliers, can be
data table should define the same property over all samples, examined and interpreted by the contribution plot.46 It displays
variability in NMR peak shifts cause problems for statistical the weighted difference between the observation and the model
modeling. Because of this, a multitude of different peak center. Hence, we can identify what is unique (deviating) for
alignment methods have been developed.37,38 Variability of an observation compared to “normality”. Similarly, the con-
chemical shifts in H-NMR spectra of biofluids, for example, due tribution plot can also be used for comparing different obser-
to pH variation, metal-ion concentrations, and chemical ex- vations.
change phenomena, has spurred the development of bucketing PCA modeling was used to assess the statistical differentia-
and peak alignment methods. A commonly used method tion between the groups, and the covariance loadings plot for
involves bucketing the data, where signal integration within biochemical interpretation. One example is the paper by Akira
spectral regions is performed.39 An alternative approach has et al.47 wherein the biochemical changes between hypertensive
been to use automatic peak alignment methods to resolve the rats and their normotensive controls to provide insight into
problem of peak position variation. Stoyanova et al.40 removed blood pressure regulation was investigated. The design study
the positional noise using PCA. Forshed et al.41,42 and Lee et included six male rats from each class, and urine was sampled
al.43 have applied genetic algorithms to align segments of twice at 12 and 26 weeks of age. PCA have been frequently
spectra to determine the misalignment across a series of NMR applied in the evaluation of metabonomic data and should be
spectra. Cloarec et al.18 applied the OPLS method to evaluate the method of choice for obtaining an overview, find clusters,
the influence of typical peak position variation on the robust- and to identify outliers. For a few different examples on
ness of pattern recognition methods and demonstrated that applications see refs 48-55.
the inclusion of variable peak position can be beneficial and Haluska and Powers56 have discussed the draw back of
lead to useful biochemical information. Typically, alignment spectral noise when evaluating the data by PCA and suggest
methods rely upon having a master or reference profile. simply removing the noise regions by setting a threshold and
Projection-based methods are sensitive to scaling of the only use the signals above the spectral noise in the PCA.
variables. Scaling of variables changes the length of each axis Another approach to this would be to utilize the prior knowl-
in the K-dimensional space. The primary objective of scaling edge gained in “Study Design”, which gives us the ability to
is to reduce the noise in the data, and thereby enhance the separate the observations in at least two different classes, and
information content and quality. Column centring, whereby thereby use more advanced multivariate methods such as
the mean trajectory is removed from the data, is followed by SIMCA, PLS-DA, and/or OPLS-DA.
either no scaling or pareto scaling of the variables. Pareto
scaling is recommended for metabonomic data and is done Class Specific Studies
by dividing each variable by the square root of its standard Most of the published papers within the field are dealing
deviation. with classification problems such as disease diagnosis or treated
Principal component analysis is used to get an overview of versus control, that is, to identify a group of control observa-
the multivariate profiles. Examining the scatter plot of the first tions and another group of observations known to have a
two score vectors (t1-t2) reveals the homogeneity of the data, specific disease. In a number of papers, several classes can be
any groupings, outliers, and trends. Strong outliers are found identified, but in all of these papers, the evaluation has been
as deviating points in the scatter plot. The Hotelling’s T2 region, made as a two-class case.
shown as an ellipse in Figure 3 (left panel), defines the 95% Two-class problem: Disease and control observations define
confidence interval of the modeled variation.44 Outliers may two separate classes.
also be detected in the model residuals. The distance to model One-class problem: Only disease observations define a class;
plot45 (DModX) can be used and is a statistical test for detecting control samples are too heterogeneous, for example, due to
outliers based on the model residual variance; see Figure 3 other variations caused by diseases, gender, age, diet, lifestyle,
(right panel). genes, unknown factors, and so on.
Figure 4. Illustration of SIMCA classification. In the left panel, the one-class classifier is shown, referred to as the asymmetric case. In
the right panel, the SIMCA classification is shown with two classes, separately modeled by PCA.
Figure 6. A geometrical illustration of the difference between the PLS-DA and OPLS-DA models. In the left panel, the PLS components
cannot separate the between-class variation from the within-class variation, and the resulting PLS component loadings mixes both
types of variations. In the right panel, the OPLS components are able to separate these two different variations. Component 1 (t1p) is
the predictive component and displays the between-class ([blue circles], [yellow squares]) variation of the samples. The corresponding
loading profile can be used for identifying variables important for the class separation. Component 2 (t2o) is the Y-orthogonal component
and models the within group (within-class) variation.
In Brindle et al.,71 PLS-DA was applied together with a Stella et al.74 have illustrated the use OPLS for characteriza-
multivariate preprocessing filter called orthogonal signal cor- tion of metabolic profile due to different diets and thereby
rection (OSC) for developing a diagnostic tool for predicting identified difference in metabolic pattern between low-meat
the severity of coronary heart disease based on NMR spectral diet and vegetarian diet. This is the first systematic study
profiles of human serum. The OSC filter removes the uncor- reported on the dietary effects on the metabolism.
related signals resulting in information of the within-class
variation. Wagner et al. report the use of this, in a paper72 Dynamic Studies
wherein OSC was applied to investigate the background Metabonomic studies that involve the quantification of the
information, which was not due to the exposure of the dynamic metabolic response are best evaluated using sequen-
compound acetaminophen. As stated by Wagner et al., the OSC tial sampling over an appropriate time course. The evaluation
component surprisingly provided an additional classification of human biofluid samples is further complicated by a high
of male and females. These observations lead us to discuss the degree of normal physiological variation caused by genetic and
OPLS method. lifestyle differences. Dynamic sampling makes it possible to
The Orthogonal-PLS Method (OPLS). The OPLS15 method evaluate and handle the different types of variations such as
is a recent modification of the PLS method.3 The main idea of individual differences in metabolic kinetics, circadian rhythm,
OPLS is to separate the systematic variation in X into two parts, and fast and slow responders.
one that is linearly related to Y and one that is unrelated Dynamic Sampling. Biological processes are dynamic by
(orthogonal) to Y. This partitioning of the X-data facilitates nature, that is, there is a temporal progression. Some problems
model interpretation and model execution on new samples.15,19 are caused by quick and slow responders following intervention
The OPLS model comprises two modeled variations, the or treatment.
Y-predictive (TpPpT) and the Y-orthogonal (ToPoT) components. For this reason, the study design is laid out as sequential
Only the Y-predictive variation is used for the modeling of Y samples over an appropriate time course to capture individual
(TpCpT). trajectories. Sampling period and interval are based on the
expected or known pharmaco-kinetics of the expected effect.
Model of X: X ) TpPpT + ToPoT + E
In other words, statistical experimental design is used to
Model of Y: Y ) TpCpT + F maximize the information content and increase the chances
of capturing all possible variations of responses. This allows
E and F are the residual matrices of X and Y, respectively. OPLS flexibility to the subsequent analysis and an unbiased evalu-
can, analogously to PLS-DA, be used for discrimination (OPLS- ation of each individual’s kinetic profile. This also implies that
DA); see, for instance, ref 19. In Figure 6, the advantages with the often assumed control (or pre-dose) and treated modeling
OPLS-DA compared to PLS-DA are exemplified. The between- approach is not optimal, as it fails to take into account the
class variation and the within-class variation are separated by individual dynamics, for example, slow and fast responders.
OPLS-DA but not by PLS-DA, and this facilitates the OPLS-DA In addition, for dynamic studies, the traditional control group
model interpretation. does not exist. Instead, each individual (object) is its own
In Cloarec et al.,73 a combined covariation and correlation- reference control.
loading plot provided additional information on the physico- Coen et al.75 retrieved gene expression data and performed
chemical variations in the data. This was done by means of different types of NMR experiments of liver tissue from rats
coloring each covariance loading by its correlation value to class dosed with acetaminophen. The design study included 70 rats,
separation. 5 time points, and 4 different dose levels. Statistical analysis
Figure 8. Batch control charts can be constructed from a PLS or OPLS batch model score vectors. The average score trajectory (for
each component) with upper and lower control limits (based on standard deviations) indicates the normal dynamic trajectory for a
batch. The control chart can be used for detecting deviations from normality.
(50) Wang, Y.; Bollard, M. E.; Keun, H.; Antti, H.; Beckonert, O.; Ebbels, (69) Constantinou, M. A.; Papakonstantinou, E.; Spraul, M.; Sevastia-
T. M.; Lindon, J. C.; Holmes, E.; Tang, H.; Nicholson, J. K. Spectral dou, S.; Costalos, C.; Koupparis, M. A.; Shulpis, K.; Tsantili-
editing and pattern recognition methods applied to high-resolu- Kakoulidou, A.; Mikros, E. H-1 NMR-based metabonomics for the
tion magic-angle spinning 1H nuclear magnetic resonance diagnosis of inborn errors of metabolism in urine. Anal. Chim.
spectroscopy of liver tissues. Anal. Biochem. 2003, 323, 26-32. Acta 2005, 542, 169-177.
(51) Plumb, R. S.; Stumpf, C. L.; Gorenstein, M. V.; Castro-Perez, J. (70) Yang, J.; Zhao, X.; Liu, X.; Wang, C.; Gao, P.; Wang, J.; Li, L.; Gu,
M.; Dear, G. J.; Anthony, M.; Sweatman, B. C.; Connor, S. C.; J.; Yang, S.; Xu, G. High performance liquid chromatography-
Haselden, J. N. Metabonomics: The use of electrospray mass mass spectrometry for metabonomics: potential biomarkers for
spectrometry coupled to reversed-phase liquid chromatography acute deterioration of liver function in chronic hepatitis B. J.
shows potential for the screening of rat urine in drug develop- Proteome Res. 2006, 5, 554-561.
ment. Rapid. Commun. Mass. Spectrom. 2002, 16, 1991-1996. (71) Brindle, J. T.; Antti, H.; Holmes, E.; Tranter, G.; Nicholson, J. K.;
(52) Robertson, D. G.; Reily, M. D.; Sigler, R. E.; Wells, D. F.; Paterson, Bethell, H. W.; Clarke, S.; Schofield, P. M.; McKilligin, E.;
D. A.; Braden, T. K. Metabonomics: Evaluation of nuclear Mosedale, D. E.; Grainger, D. J. Rapid and noninvasive diagnosis
magnetic resonance (NMR) and pattern recognition technology of the presence and severity of coronary heart disease using 1H-
for rapid in vivo screening of liver and kidney toxicants. Toxicol. NMR-based metabonomics. Nat. Med. 2002, 8, 1439-1444.
Sci. 2000, 57, 326-337. (72) Wagner, S.; Scholz, K.; Donegan, M.; Burton, L.; Wingate, J.; Volkel,
(53) Kim, S. W.; Ban, S. H.; Ahn, C. Y.; Oh, H. M.; Chung, H.; Cho, S. W. Metabonomics and biomarker discovery: LC-MS metabolic
H.; Park, Y. M.; Liu, J. R. Taxonomic discrimination of cyano- profiling and constant neutral loss scanning combined with
bacteria by metabolic fingerprinting using proton nuclear mag- multivariate data analysis for mercapturic acid analysis. Anal.
netic resonance spectra and multivariate statistical analysis. J. Chem. 2006, 78, 1296-1305.
Plant Biol. 2006, 49, 271-275. (73) Cloarec, O. M.; Dumas, E.; Craig, A.; Barton, R. H.; Trygg, J.;
(54) Rasmussen, B.; Cloarec, O.; Tang, H. R.; Staerk, D.; Jaroszewski, Hudson, J.; Blancher, C.; Gauguier, D.; Lindon, J. C.; Holmes, E.;
J. W. Multivariate analysis of integrated and full-resolution H-1- Nicholson, J. Statistical total correlation spectroscopy: An ex-
NMR spectral data from complex pharmaceutical preparations: ploratory approach for latent biomarker identification from
St. John’s wort. Planta Med. 2006, 72, 556-563. metabolic H-1 NMR data sets. Anal. Chem. 2005, 77, 1282-1289.
(55) Chen, H. W.; Pan, Z. Z.; Talaty, N.; Raftery, D.; Cooks, R. G. (74) Stella, C.; Beckwith-Hall, B.; Cloarec, O.; Holmes, E.; Lindon, J.
Combining desorption electrospray ionization mass spectrometry C.; Powell, J.; van der Ouderaa, F.; Bingham, S.; Cross, A. J.;
and nuclear magnetic resonance for differential metabolomics Nicholson, J. K. Susceptibility of human metabolic phenotypes
without sample preparation. Rapid. Commun. Mass Spectrom. to dietary modulation. J. Proteome Res. 2006, 5, 2780-2788.
2006, 20, 1577-1584. (75) Coen, M.; Ruepp, S. U.; Lindon, J. C.; Nicholson, J. K.; Pognan,
(56) Halouska, S.; Powers, R. Negative impact of noise on the principal F.; Lenz, E. M.; Wilson, I. D. Integrated application of transcrip-
component analysis of NMR data. J. Magn. Reson. 2006, 178, 88- tomics and metabonomics yields new insight into the toxicity
95. due to paracetamol in the mouse. J. Pharm. Biomed. Anal. 2004,
(57) Wold, S. Pattern recognition by means of disjoint principal 35, 93-105.
components models. Pattern Recognit. 1976, 8, 127-139. (76) Smilde, A. K.; Jansen, J. J.; Hoefsloot, H. C. J.; Lamers, R. J. A. N.;
(58) Dumas, M. E.; Canlet, C.; Vercauteren, J.; Andre, F.; Paris, A.
van der Greef, J.; Timmerman, M. E. ANOVA-simultaneous,
Homeostatic signature of anabolic steroids in cattle using H-1-
component analysis (ASCA): A new tool for analyzing designed
C-13 HMBC NMR metabonomics. J. Proteome Res. 2005, 4, 1493-
metabolomics data. Bioinformatics 2005, 21, 3043-3048.
1502.
(77) Dieterle, F.; Schlotterbeck, G. T.; Ross, A.; Niederhauser, U.; Senn,
(59) Odunsi, K. R.; Wollman, M.; Ambrosone, C. B.; Hutson, A.;
H. Application of metabonomics in a compound ranking study
McCann, S. E.; Tammela, J.; Geisler, J. P.; Miller, G.; Sellers, T.;
in early drug development revealing drug-induced excretion of
Cliby, W.; Qian, F.; Keitz, B.; Intengan, M.; Lele, S.; Alderfer, J. L.
choline into urine. Chem. Res. Toxicol. 2006, 19, 1175-1181.
Detection of epithelial ovarian cancer using 1H-NMR-based
(78) Bollard, M. E.; Keun, H. C.; Beckonert, O.; Ebbels, T. M.; Antti,
metabonomics. Int. J. Cancer 2005, 113, 782-788.
H.; Nicholls, A. W.; Shockcor, J. P.; Cantor, G. H.; Stevens, G.;
(60) Holmes, E.; Nicholls, A. W.; Lindon, J. C.; Connor, S. C.; Connelly,
Lindon, J. C.; Holmes, E.; Nicholson, J. K. Comparative metabo-
J. C.; Haselden, J. N.; Damment, S. J.; Spraul, M.; Neidig, P.;
Nicholson, J. K. Chemometric models for toxicity classification nomics of differential hydrazine toxicity in the rat and mouse.
based on NMR spectra of biofluids. Chem. Res. Toxicol. 2000, 13, Toxicol. Appl. Pharmacol. 2005, 204, 135-151.
471-478. (79) Keun, H. C.; Ebbels, T. M.; Bollard, M. E.; Beckonert, O.; Antti,
(61) McKee, C. L. G.; Wilson, I. D.; Nicholson, J. K. Metabolic H.; Holmes, E.; Lindon, J. C.; Nicholson, J. K. Geometric trajectory
phenotyping of nude and normal (Alpk : ApfCD, C57BL10J) mice. analysis of metabolic responses to toxicity can define treatment
J. Proteome Res. 2006, 5, 378-384. specific profiles. Chem. Res. Toxicol. 2004, 17, 579-587.
(62) Wold, S.; Eriksson, L.; Sjöström, M. PLS in Chemistry. Encyclo- (80) Ishihara, K.; Katsutani, N.; Aoki, T. A metabonomics study of the
pedia of Computational Chemistry; Schleyer, P. V. R., Ed.; John hepatotoxicants galactosamine, methylene dianiline and clofi-
Wiley & Sons: New York, 1998; pp2006-2016. brate in rats. Basic Clin. Pharmacol. Toxicol. 2006, 99, 251-260.
(63) Wold, S.; Albano, C.; Dunn, W. J., III; Edlund, U.; Esbensen, K.; (81) Williams, R. E.; Lenz, E. M.; Lowden, J. S.; Rantalainen, M.; Wilson,
Geladi, P.; Hellberg, S.; Johansson. E.; Lindberg, W.; Sjöström, I. D. The metabonomics of aging and development in the rat:
M. In Multivariate Data Analysis in Chemistry; NATO ASI Series an investigation into the effect of age on the profile of endog-
C 138, D; Reidel Publ. Co.: Dordrecht, Holland, 1984. enous metabolites in the urine of male rats using 1H NMR and
(64) Wang, C. H.; Kong, W.; Guan, Y. F.; Yang, J.; Gu, J. R.; Yang, S. L.; HPLC-TOF MS. Mol. Biosyst. 2005, 1, 166-175.
Xu, G. W. Plasma phospholipid metabolic profiling and biom- (82) Williams, R. E.; Lenz, E. M.; Rantalainen, M.; Wilson, I. D. The
arkers of type 2 diabetes mellitus based on high-performance comparative metabonomics of age-related changes in the urinary
liquid chromatography/electrospray mass spectrometry and mul- composition of male Wistar-derived and Zucker (fa/fa) obese rats.
tivariate statistical analysis. Anal. Chem. 2005, 77, 4108-4116. Mol. Biosyst. 2006, 2, 193-202.
(65) Wang, C.; Kong, H.; Guan, Y.; Yang, J.; Gu, J.; Yang, S.; Xu, G. (83) Schnackenberg, L. K.; Jones, R. C.; Thyparambil, S.; Taylor, J. T.;
Plasma phospholipid metabolic profiling and biomarkers of type Han, T.; Tong, W.; Hansen, D. K.; Fuscoe, J. C.; Edmondson, R.
2 diabetes mellitus based on high-performance liquid chroma- D.; Beger, R. D.; Dragan, Y. P. An integrated study of acute effects
tography/electrospray mass spectrometry and multivariate sta- of valproic acid in the liver using metabonomics, proteomics, and
tistical analysis. Anal. Chem. 2005, 77, 4108-4116. transcriptomics platforms. OMICS 2006, 10, 1-14.
(66) Dieterle, F.; Ross, A.; Schlotterbeck, G.; Senn, H. Metabolite (84) Bollard, M. E.; Keun, H. C.; Beckonert, O.; Ebbels, T. M. D.; Antti,
projection analysis for fast identification of metabolites in me- H.; Nicholls, A. W.; Shockcor, J. P.; Cantor, G. H.; Stevens, G.;
tabonomics. Application in an amiodarone study. Anal. Chem. Lindon, J. C.; Holmes, E.; Nicholson, J. K. Comparative metabo-
2006, 78, 3551-3561. nomics of differential hydrazine toxicity in the rat and mouse.
(67) Yin, P.; Zhao, X.; Li, Q.; Wang, J.; Li, J.; Xu, G. Metabonomics study Toxicol. Appl. Pharmacol. 2005, 204, 135-151.
of intestinal fistulas based on ultraperformance liquid chroma- (85) Antti, H.; Bollard, M. E.; Ebbels, T.; Keun, H.; Lindon, J. C.;
tography coupled with Q-TOF mass spectrometry (UPLC/Q-TOF Nicholson, J. K.; Holmes, E. Batch statistical processing of H-1
MS). J. Proteome Res. 2006, 5, 2135-2143. NMR-derived urinary spectral data. J. Chemom. 2002, 16, 461-
(68) Ramadan, Z.; Jacobs, D.; Grigorov, M.; Kochhar, S. Metabolic 468.
profiling using principal component analysis, discriminant partial (86) Wold, S.; Kettaneh, N.; Friden, H.; Holmberg, A. Modelling and
least squares, and genetic algorithms. Talanta 2006, 68, 1683- diagnostics of batch processes and analogous kinetic experi-
1691. ments. Chemom. Intell. Lab. Syst. 1998, 44, 331-340.