TY - JOUR
T1 - Data clustering for classification of vegetable biomass from compositional data
T2 - A tool for biomass valorization
AU - Durán-Aranguren, Daniel D.
AU - Toro-Delgado, Juan
AU - Núñez-Barrero, Valentina
AU - Florez-Bulla, Valentina
AU - Sierra, Rocío
AU - Posada, John A.
AU - Mussatto, Solange I.
PY - 2024
Y1 - 2024
N2 - Compositional data on vegetable biomass is widely available from research papers and online databases. However, the high diversity of biomass characteristics and composition represents a challenge for researchers and companies willing to produce novel substances from residues, and that should decide on the best and most feasible options for their use as feedstocks. The present study constructed a database with information gathered from the proximate, ultimate, and chemical composition of different biomass residues that can be used for data analysis and classification to elucidate better how they can be valorized. Different data clustering techniques were implemented to determine how compositional data can be segmented. The identified groups, that contained residues with similar characteristics, allowed to have an insight into the valorization of these biomasses, which can be used as an initial tool for biorefinery design. The use of data clustering facilitated the identification of different types of biomasses in a systematic way, which until now has not been reported in the literature.
AB - Compositional data on vegetable biomass is widely available from research papers and online databases. However, the high diversity of biomass characteristics and composition represents a challenge for researchers and companies willing to produce novel substances from residues, and that should decide on the best and most feasible options for their use as feedstocks. The present study constructed a database with information gathered from the proximate, ultimate, and chemical composition of different biomass residues that can be used for data analysis and classification to elucidate better how they can be valorized. Different data clustering techniques were implemented to determine how compositional data can be segmented. The identified groups, that contained residues with similar characteristics, allowed to have an insight into the valorization of these biomasses, which can be used as an initial tool for biorefinery design. The use of data clustering facilitated the identification of different types of biomasses in a systematic way, which until now has not been reported in the literature.
KW - Biomass classification
KW - Biomass composition
KW - Biorefinery design
KW - Data clustering
KW - Machine learning
KW - Principal component analysis
UR - http://www.scopus.com/inward/record.url?scp=85206922123&partnerID=8YFLogxK
U2 - 10.1016/j.biombioe.2024.107447
DO - 10.1016/j.biombioe.2024.107447
M3 - Article
AN - SCOPUS:85206922123
SN - 0961-9534
VL - 191
JO - Biomass and Bioenergy
JF - Biomass and Bioenergy
M1 - 107447
ER -