Document details

A machine learning and chemometrics assisted interpretation of spectroscopic da...

Author(s): Maraschin, Marcelo cv logo 1 ; Somensi-Zeggio, A. cv logo 2 ; Oliveira, S. K. cv logo 3 ; Kuhnen, S. cv logo 4 ; Tomazzoli, M. M. cv logo 5 ; Zeri, A. C. M. cv logo 6 ; Carreira, Rafael cv logo 7 ; Rocha, Miguel cv logo 8

Date: 2012

Persistent ID: http://hdl.handle.net/1822/23883

Origin: RepositóriUM - Universidade do Minho

Subject(s): Supervised classification techniques; Evolutionary algorithms; Random forest; PLS-DA; Wrapper methods; NMR-based metabolomics


Description
In this work, a metabolomics dataset from 1H nuclear magnetic resonance spectroscopy of Brazilian propolis was analyzed using machine learning algorithms, including feature selection and classification methods. Partial least square-discriminant analysis (PLS-DA), random forest (RF), and wrapper methods combining decision trees and rules with evolutionary algorithms (EA) showed to be complementary approaches, allowing to obtain relevant information as to the importance of a given set of features, mostly related to the structural fingerprint of aliphatic and aromatic compounds typically found in propolis, e.g., fatty acids and phenolic compounds. The feature selection and decision tree-based algorithms used appear to be suitable tools for building classification models for the Brazilian propolis metabolomics regarding its geographic origin, with consistency, high accuracy, and avoiding redundant information as to the metabolic signature of relevant compounds.
Document Type Article
Language English
delicious logo  facebook logo  linkedin logo  twitter logo 
degois logo
mendeley logo

Related documents



    Financiadores do RCAAP

Fundação para a Ciência e a Tecnologia Universidade do Minho   Governo Português Ministério da Educação e Ciência Programa Operacional da Sociedade do Conhecimento EU