New Journal Article in MDPI’s Nitrogen
Our colleague Sebastian Raubitzek, researcher at SBA Research and a member of the Security and Privacy Research Group at the University of Vienna, has published a journal article titled “Multi-Class Machine Learning to Quantify the Impact of Nitrogen Management Practices on Grassland Biomass” in MDPI’s Journal Nitrogen in collaboration with the CD lab AsTra.
© Niklas Schnaubelt
Abstract
Grassland biomass yield reflects a complex interaction of management intensity and environmental factors, yet quantifying the relative role of practices such as mowing and fertilization remains challenging. In this study, we introduce a multi-class machine learning framework to predict above-ground biomass on 150 permanent grassland plots across eight years (2009–2016) in Germany’s Biodiversity Exploratories and to evaluate the influence of key management variables. Following rigorous data cleaning, imputation of missing nitrogen values, feature standardization, and encoding of categorical practices, we trained CatBoost classifiers optimized via Bayesian hyperparameter search and mitigated class imbalance with ADASYN oversampling. We assessed model performance under binary, three-class, four-class, and five-class quantile-based categorizations, achieving test accuracies of 0.76, 0.57, 0.42, and 0.38, respectively. Across all schemes, mowing frequency and mineral nitrogen input emerged as the dominant predictors, while secondary variables such as drainage and conditioner use contributed as well.
These results demonstrate that broad biomass categories can be forecast reliably from standardized management records, whereas finer distinctions necessitate additional environmental information or automated sensing to capture nonlinear effects and reduce reporting bias. This work shows both the potential and the limits of machine learning for informing sustainable grassland management and explainability thereof. Frequent mowing and higher mineral nitrogen inputs explained most of the predictable variation, enabling a 76% accurate separation of low and high biomass categories. Predictive accuracy fell below 60% for finer class resolutions, indicating that management records alone are insufficient for detailed yield forecasts without complementary environmental data.
Authors: Sebastian Raubitzek, Margarita Hartlieb, Philip König, Judith Hinderling, and Kevin Mallinger.
