Issue 01/2026

Regression-based estimates for sectional stem volume of Scots pine (Pinus sylvestris L.) in Greece considering the mean diameter reduction factor

Article by Athanasios A. Fallias, Maria J. Diamantopoulou | 10.04.2026 - 10:26

Schlüsselbegriffe: Biometrie, Vorrat, Teilvolumenmessung, Allometrie, Pieria-Gebirge, Smalian-Formel

Available at https://doi.org/10.53203/fs.2601.2

Download as PDF

See below the issue 1/2026 as E-Paper or have a look at our E-Paper archive dating back to 1955.

Abstract

This research presents the development of a reliable system for predicting the total stem volume of Scots pine (Pinus sylvestris L.) trees by building models for estimating the mean diameter reduction factor (α). Using easily measured in-field variables as independent variables, the construction of regression models that best fit a representative sample of trees from the mountainous complex of the Pieria Mountains was investigated. The evaluation of these models was based on a set of fit criteria, including R²adj, SEE, Bias, AIC, BIC, VIF, and residual distribution. The results showed that the proposed model estimates (α) with acceptable accuracy (SEE = 0.187, R²adj = 0.839). Ultimately, the proposed system for predicting total stem volume will support more effective and sustainable forest management. By providing accurate estimates of standing wood volume based solely on stump diameter, it offers a practical, efficient tool for reliable volumetric assessment.

Zusammenfassung

Diese Forschungsarbeit präsentiert die Entwicklung eines zuverlässigen Systems zur Schätzung des Gesamtstammvolumens von Waldkiefer (Pinus sylvestris L.) durch die Erstellung von Modellen zur Schätzung des mittleren Durchmesserreduktionsfaktors (α). Unter Verwendung von leicht im Feld messbaren Variablen als unabhängige Variablen wurde die Erstellung von Regressionsmodellen untersucht, die am besten zu einer repräsentativen Stichprobe von Bäumen aus dem Gebirgskomplex der Pieria-Berge passen. Die Bewertung dieser Modelle basierte auf einer Reihe von Anpassungskriterien, darunter R²adj, SEE, Bias, AIC, BIC, VIF und Residuenverteilung. Die Ergebnisse zeigten, dass das vorgeschlagene Modell (α) mit akzeptabler Genauig­keit schätzt (SEE = 0.187, R²adj= 0.839). Letztendlich wird das vorgeschlagene System zur Vorhersage des Gesamtstammvolumens eine effektivere und nachhaltigere Waldbewirtschaftung unterstützen. Durch die Bereitstellung genauer Schätzungen des Holzvolumens eines Bestandes, die ausschließlich auf dem Baumstumpfdurchmesser basiert, bietet es ein praktisches, effizientes Werkzeug für eine zuverlässige Volumenbewertung.

1 Introduction

Tree taper is a critical characteristic of trees that is considered when estimating their volume, biomass, or the carbon they can store (Polglase et al., 2003; Jones et al., 2022). Tree taper refers to the rate at which stem diameters decrease from the ground to the tree top (Burkhart & Tomé, 2012). The external profile of a tree typically shows a clear bend near the base. Then, it becomes approximately linear through the central portion of the bole, and it exhibits greater variability in the upper stem (Shaw et al., 2003). Because tree boles have irregular geometric shapes, accurately estimating and predicting their volume and biomass has been a persistent challenge.

The stem of a tree can be conceptualized as the surface generated by rotating a curve that is convex outward near the lower stem and concave inward toward the upper stem. Its overall form generally lies between that of a cone and a paraboloid. Given the substantial economic implications of accurate estimates of commercial wood products, particularly stem volume, research in forest biometrics has increasingly focused on factors that improve the precision of such estimates. Usually, the largest share of tree volume and biomass is in the stem (Poorter et al., 2012), and therefore, changes in taper lead to changes in the total volume and biomass of trees that are otherwise of the same size (e.g., same height and diameter). For this reason, the investigation of conicity is a significant interest for most researchers involved in model development (Eckmüllner et al., 2007; Weiskittel et al., 2011).

Several factors have been studied and proposed to describe the shape of the stem, such as the form factor, form height, form ratio, and diameter reduction factor, along with various taper equations (Prodan, 1965; Matis, 2004; Van Laar & Akça, 2007). Nevertheless, the ability to mathematically describe the shape of tree stems has been the subject of ongoing research for more than a hundred years (Kublin et al., 2013; McTague & Weiskittel, 2021).

One form factor, which has not yet been extensively investigated and serves as an indicator of stem form, is the mean diameter reduction factor (α) of the tree’s stem. This factor quantifies the average decrease in stem diameter per unit increase in height. It is calculated as the difference between two specified diameters, typically the stump diameter (d0.3), measured 0.3 m above the ground, or the breast-height diameter (d1.3), measured 1.3 m above the ground, and the diameter measured at a selected upper point on the stem, divided by the vertical distance between the measurement points (Matis, 2004). It is not constant along the stem, but it varies systematically. It is relatively high near the base, decreases toward the mid-height of the stem, and subsequently increases again as it approaches the upper portions of the stem (Matis, 2004).

Knowledge of (α), derived from appropriate taper models, is crucial for precise estimation of stem volume, which depends on the availability of diameter measurements at multiple heights. In addition, understanding taper contributes indirectly to characterizing stand structure (West, 2015) and provides essential information for effective forest management. To the best of the authors’ knowledge, the existing body of research on this topic remains limited. Indicatively, Pérez et al. (2003) introduce a diameter reduction factor (cm of diameter per m of height) derived from stem analysis of teak (Tectona grandis) trees, Maraseni et al. (2007) estimated the taper rates of basal and distal parts through mean diameter reduction factor as well as volume of smaller-sized logs in Spotted gum (Corymbia citriodora subsp. variegata) trees, Fuentes et al. (2010) used a stem-diameter reduction factor for stem diameters calculation in European beech (Fagu­­s sylvatica L.) trees, while Diamantopoulou (2022) and lately Diamantopoulou (2025) compared non-linear regression and advanced machine learning modeling approaches for fir (Abies borisii-regis Matff.) trees mean diameter reduction factor reliable prediction.

For many decades, despite its limitations, regression modeling (Draper & Smith, 1998) has been widely used in forest research, providing solutions for both research and practice. For this reason, developing a volumetric modeling system that is easy to understand and apply by forest practitioners remains essential.

The aim of this study is to develop a robust regression-based model to predict the mean diameter reduction factor (α) using field measurements from Scots pine (Pinus sylvestris L.) stands in the Pieria Mountains. A further objective is to assess whether the model’s predictions of each tree’s mean diameter reduction factor can be reliably applied to estimate total stem volume accurately. Finally, we aim to provide a precise and trustworthy system for stem-volume prediction of Scots pine trees, easily understandable and usable, based solely on d0.3, an easily obtainable field stem diameter that remains accessible even after a tree has been felled.

2 Materials and Methods

Fallias_Fig1.jpg

Figure 1: The study area of the Public Forest Complex of Skoteina - Livadi - Fteri - Platanorema in the Pieria Mountains.

Abbildung 1: Das Untersuchungsgebiet im Gebiet Skoteina–Livadi–Fteri–Platanorema in den Pieria-Bergen.

2.1 Study area and sample data

The study area consists of the Public Forest Complex of Skoteina-Livadi-Fteri-Platanorema in the Pieria Mountains, with a total area of 9957.68 ha, of which 1272.08 ha are in the pine (Black pine and Scots pine) management class, of which980.73ha are forested (Pieria Forest Directorate, 2013) (Figure1). The local population of Pinus sylvestris L. constitutes the southernmost forest boundary of its geographical distribution in Eastern Europe (Zagas, 1990; Gerasimidis et al., 2006; Grigoriadis et al., 2014; Oikonomakis & Ganatsas, 2020).

The specific description of this forest complex enters into a more detailed ecological description of the individual forest stands. As regards the specific description of the stands from which the sample was taken, there is some information through the description sheets regarding the existence of silvicultural species, silvicultural shape, tree‘s age and regeneration dynamics (Pieria Forest Directorate, 2013), which indicatively attempt to analyze the assessment of the current situation in terms of silvicultural-management measures and proposed adaptation strategies to the current conditions (Peterson St-Laurent et al., 2021), as well as the prediction of their future evolution over time in the context of climate change (Mauri et al., 2022). Mainly, mixed forest stands of Scots pine (Pinus sylvestris L.) with Black pine (Pinus nigra Arn.) and sporadic European beech (Fagus sylvatica L.) trees are observed, a fairly common occurrence for mixed forests in Greece (Zagas et al., 2001), creating uneven-aged stands with a sub-forested silvicultural shape and a range of pine ages of approximately 1-100 (120) years. The regeneration dynamics of pine are low to moderate by positions, as a gradual ingress of beech is observed (Pieria Forest Directorate, 2013).

In the context of this study, a simple random pilot sample of 10 trees was collected from the study area, through which a pilot regression equation was developed. The standard error of estimate of the pilot equation (Brooks & Barcikowski, 1994) led to the simple random sample of 100 Scots pine (Pinus sylvestris L.) trees to be measured in the field, which fully satisfied the requirements for the accuracy of the regression equations developed. 

For the trees in the sample, the d0.3 and the d1.3 were measured to the nearest 0.1 cm using a Finnish caliper. Total tree height (tht) was recorded to the nearest 0.1 m with a Blume–Leiss hypsometer. Stem diameters were then measured at 2-m intervals along the stem, beginning at d1.3, using a Spiegel relascope. All primary data were collected between early and late May.

The most commonly used general formula, and also applied in this study, refers to the total stem. Accordingly, the mean diameter reduction factor (α) for each tree was calculated using the formula (Prodan, 1965; Matis, 2004):

where: d0.3 is the stump diameter, dn is the last upper diameter of the stem, L is the distance between the diameters d0.3 and dn.

The specific limits on the diameters in the formula depend on the research context. The mean diameter reduction factor (α) for the entire tree stem was calculated to estimate the total tree stem volume. In this context, the largest diameter considered was the stump diameter, which remains in the forest after logging, while the upper diameter approached zero. Thus, this study addresses the mean diameter reduction factor (α) for the whole tree stem.

The sample data were explored through Exploratory Data Analysis (EDA), in IBM SPSS Statistics v. 29.0 statistical package (IBM SPSS, 2023; Diamantopoulou, 2024). EDA was used to assess the structure and composition of the data set under study, using graphical representations and statistical tests (Tukey, 1977; Hartwing & Dearing 1979; Hoaglin et al., 1985).

The graphical representations provided a rough indication of whether the population followed a normal distribution. Several more objective formal statistical methods and tests are available to evaluate the same question. One of the most commonly used is the Lilliefors test (Lilliefors, 1967), which is a modified version of the Kolmogorov–Smirnov test designed for situations in which the mean and variance of the normal distribution population are unknown. This modification was used as a non-parametric goodness-of-fit test to assess how well the α, which served as the dependent variable in the regression models, matches the normal distribution (H0) while exploratory graphing was used to assess homogeneity or non-homogeneity of variance for α.

Furthermore, collinearity among the independent variables (d0.3, h) was examined using the variance inflation factor (VIF):

where: R2j is the coefficient of determination from regressing Χj on all the other predictors.

2.2 Model development 

To develop models of the a, standard regression methods (simple, multiple, and non-linear) were used, applying the available procedures in IBM SPSS Statistics v. 29.0 (IBM SPSS, 2023).

The Levenberg-Marquardt iterative method (Levenberg, 1944; Marquardt, 1963; Ratkowsky, 1990; Murthy, 2014) was used to fit the non-linear equations to the data. The selection of this methodology is based on combining the best features of two methods: the Gauss-Newton method and the steepest descent method, avoiding their most serious limitations. It is quite likely to converge almost always (IBM SPSS, 2023), while it is rarely found to recalculate the model coefficients indefinitely, as is often the case with the steepest descent method. Overall, this method seems to work well in many instances of forest biometric variable estimation (Diamantopoulou, 2022) making it a reasonable choice in practice (Draper & Smith, 1998). The sum of squared errors was utilized as the loss function. Finally, initial parameter values for the non-linear model estimation were determined following the procedures outlined in Draper and Smith (1998).

 

2.3 Measures for evaluating regression models

The evaluation of the regression models were based on the following adjustment criteria (Raptis et al., 2024):

a) the adjusted coefficient of determination (R²adj)

b) the standard error of estimate (SEE)

c) the Bias

d) Akaike information criterion (AIC) for least-squares regression with normally distributed errors

e) Bayesian information criterion (BIC) for least-squares regression

where: R² is the coefficient of determination, yi and ŷi are the values of the observed and predicted values of the dependent variable, respectively, n is the number of observations, p is the number of independent variables in the equation.

2.4 Volume estimation 

Segmented volume was calculated using Smalian‘s formula (Equation 8). This formula is a widely recognized one in forest biometrics for estimating the volume of logs or tree stems with irregular shapes (Philip, 1994; Matis, 2004; Van Laar & Akça, 2007). This method ensures accurate stem volume calculation, accounting for the conical shape of tree stems, which is significant for irregularly shaped trees (Diamantopoulou & Georgakis, 2024):

where: d0.3 is the stump diameter measured 0.3 m above the ground, d1.3 is the breast height diameter measured at 1.3 m from the ground, dn is the last upper diameter of the stem, d1,2,3,…n, are the sequential stem diameters with 2 m step, lk is the length of the remaining segment between the last observed diameter and the tree top.

3 Results

Based on the exploratory data analysis (Figure 2), the box plot of the dependent variable α (Figure 2a) shows no isolated or extreme outliers. The median is centered within the interquartile range, suggesting an approximately symmetrical distribution. The data spread is relatively large, ranging from about 0.3 cm/m to 2.5 cm/m. The histogram of the observations (Figure 2b) satisfactorily approximates the normal distribution, as indicated by the black continuous line. The normal probability plot (Figure 2c) shows the adjustment of the mean diameter reduction factor values to the normal distribution, as the observations are shown to coincide or tend towards the straight line of the graph (Figure 2c), with some minor losses at the ends. Finally, the normal deviation plot (Figure 2d) complements the Q–Q plot by explicitly illustrating the deviations of the observed values from those expected under normality.

Fallias_Fig2.jpg

Figure 2: a) Box plot, b) Histogram, c) Normal probability plot and d) Deviation from normal (Q-Q) plot, as resulted from the EDA of the mean diameter reduction factor(α).

Abbildung 2: a) Boxplot, b) Histogramm, c) Normalverteilungsdiagramm und d) Abweichung vom Normalverteilungsdiagramm (Q-Q-Diagramm), wie sie sich aus der EDA des mittleren Durchmesserreduktionsfaktors (α) ergeben.

Fallias_Tab1.jpg

Table 1: Kolmogorov-Smirnov normality test of the dependent variable of the mean diameter reduction factor (α). 

Tabelle 1: Kolmogorov-Smirnov-Test zur Überprüfung der Normalverteilung der abhängigen Variable des mittleren Durchmesserreduktionsfaktors (α).

In order to have an objective evaluation of whether the α follows a normal distribution at the 5% significance level, the modified Kolmogorov-Smirnov test of normality was applied.  According to the results presented in Table 1, the significance value (Sig.) is 0.200, which exceeds the 0.05 threshold. Therefore, there is insufficient evidence to reject the null hypothesis of normality at α = 0.05, suggesting that the variable is normally distributed (Table 1).

The size of diameters at sequential heights, the total height and the mean diameter reduction factor, the range of the variable values, the minimum and the maximum values the arithmetical mean and the standard error of the mean, the standard deviation and the variance of the sample variables are given in Table 2.

Fallias_Tab2.jpg

Table 2: Descriptive statistics of the variables.

Tabelle 2: Deskriptive Statistiken der Variablen.

Fallias_Fig3.jpg

Figure 3: Scatter plot of the mean diameter reduction factor- stump diameter-total height.

Abbildung 3: Streudiagramm des mittleren Reduktionskoeffizient von Durchmesser – Baum­stumpf­durchmesser – Gesamthöhe.

The relationship between the mean diameter reduction factor and the stump diameter and the total tree height (Figure 3) cannot be reliably classified as linear or non-linear. For this reason, a wide range of equations was fitted to the data using regression analysis (linear, multiple, and non-linear), to select the equation that best fit the data with the greatest possible accuracy.

The models that provided the best fit to the data for each regression category are given in Table 3.

The three best models of simple linear (Table 3, Equation 9), multiple linear (Table 3, Equation 10), and non-linear regression (Table 3, Equation 11) gave similar results in terms of their evaluation measures.

Fallias_Tab3.jpg

Table 3: Fitting criteria of the regression models.

Tabelle 3: Bewertungskriterien der Regressionsmodelle.

The multiple linear model that was examined and ultimately selected as the best fit for the field data (Table 3, Equation10) produced an adjusted coefficient of determination R²adj= 0.839 and a standard error of estimation SEE = 0.1868. The VIF values were also calculated for the coefficients b1, b2, and b3 of Equation 10. Their VIF values were less than 5, indicating no serious multicollinearity in the model (Marquardt, 1974; Neter et al., 1990). This condition can be acceptable in the sense that the model coefficient estimates are not significantly affected by limited multicollinearity and are therefore considered satisfactorily stable.

Fallias_Tab4.jpg

Table 4: Parameter estimates of the multiple linear regression model (10).

Tabelle 4: Schätzungen der Parameter des multiplen linearen Regressionsmodell (10).

The histogram of the residuals of the best-performing multiple linear equation (Table 3, Equation 10) satisfactorily approximates the normal distribution (Figure 4a). Furthermore, the distribution of the residuals of the estimated mean diameter reduction factor showed a stable variance across the range of predicted values (Figure 4b). Figure 4c further supported a very good fit of the model to the data, yielding estimates that explain approximately 83% of the total variance of the observed values of α.

Fallias_Fig4.jpg

Figure 4: Measures of the multiple linear regression model (10), evaluation in the form of a) histogram, b) residual distribution graph and c) 45-degree line plot.

Abbildung 4: Bewertungsmaßnahmen des multiplen linearen Regressionsmodells (10), in Form von a) Histogramm, b) Residuenverteilungsdiagramm und c) 45°-Linien-Diagramm.

Fallias_Fig5.jpg

Figure 5: Scatterplot of the actual volume via the Smalian’s formula with the estimated volume via the Smalian’s formula (8).

Abbildung 5: Streudiagramm der 45°-Linie des tatsächlichen Volumens mit der Smalian-Formel mit des geschätzten Volumens mit der Smalian-Formel mit Durchmesserdaten, die sich aus der Anwendung der Gleichung (8) ergeben haben.

3.1 Volume prediction

To further evaluate the effect of the α estimation by the model of Table 4 on the configuration of the stem diameter estimates, these diameters were used as data for the segmental calculation of stem volume using Smalian‘s formula (8). The observed total stem volume, calculated from the observed stem diameters, was compared with the estimated volume calculated with Smalian‘s formula (8) using the stem diameters produced from the estimated α.

Based on the 45-degree line scatter plot (Figure 5), the very good indirect estimation of the segmental total stem volume is highlighted. The coefficient of determination for the multiple linear relationship between the actual and indirectly calculated total stem volumes was particularly high, at 0.955. The standard error of the estimate for this relationship between total stem volumes was found to be 0.271, indicating that the estimated total volume is slightly overestimated and follows the dispersion of actual volume values.

From the comparative evaluation of the estimated versus actual stem volume, by stump diameter per tree, it was concluded that the estimates of total stem volume are highly accurate in all stump diameter tree-by-tree examined (Figure 6).

Fallias_Fig6.jpg

Figure 6: Comparative evaluation of actual volume using the Smalian’s formula (8) in relation to the indirectly calculated volume using the multiple linear equation (10), based on the stump diameter of a) tree-by-tree: 12-33 cm, b) tree-by-tree: 34-48 cm, c) tree-by-tree: 50-65 cm, d) tree-by-tree: 66-96 cm.

Abbildung 6: Vergleichende Bewertung des tatsächlichen Volumens unter Verwendung der Smalian-Formel (8) im Verhältnis zum indirect berechneten Volumen unter Verwendung der multiplen linearen Gleichung (10), basiert auf dem Baumstumpfdurchmesser von a) Baum zu Baum: 12–33 cm, b) Baum zu Baum: 34–48 cm, c) Baum zu Baum: 50–65 cm, d) Baum zu Baum: 66–96 cm.

The statistical analysis of the actual volume values yielded a mean value of 1.078 m³, with a standard error of the mean equal to 0.104, while the standard deviation of the values was found to be equal to 1.041 (Table 5), whereas the estimated volume values yielded a mean value of 1.242 m³, with a standard error of the mean equal to 0.121, while the standard deviation of the values was found to be equal to 1.211 (Table 5).

Fallias_Tab5.jpg

Table 5: Descriptive statistics of actual and estimated volume values.

Tabelle 5: Deskriptive Statistiken der tatsächlichen und geschätzten Volumenwerte.

Fallias_Fig7.jpg

Figure 7: Scatter plot of the actual volume via the Smalian’s formula (8) with the estimated volume via the Smalian’s formula (8) for each stump diameter of a) tree-by-tree: 12-33 cm, b) tree-by-tree: 34-48 cm, c) tree-by-tree: 50-65 cm, d) tree-by-tree: 66-96 cm.

Abbildung 7: Streudiagramm des tatsächlichen Volumens nach der Smalian-Formel (8) mit dem ge­schätzten Volumen nach der Smalian-Formel (8) für jeden Baumstumpfdurchmesser von a) Baum zu Baum: 12–33 cm, b) Baum zu Baum: 34–48 cm, c) Baum zu Baum: 50-65 cm, d) Baum zu Baum: 66-96 cm.

After applying the mean diameter reduction factor, which resulted from the development of the best-performing multiple linear model (Equation 10) to estimate the total stem volume, the corresponding means (standard error of the mean) of tree-by-tree were calculated for d0.3 of 12 to 33 cm as 0.233 (0.040), for d0.3 of 34 to 48 cm as 0.534 (0.070), for d0.3 of 50 to 65 cm as 1.618 (0.151) and for d0.3 of 66 to 96 cm as 2.623 (0.265), respectively (Table 5). In absolute values, the incorporation of the multiple linear equation (Equation 10) in the estimation of the stump diameters of 66 to 96 cm appears to overestimate the mean of the actual volume. Less overestimation was found in the estimation of stump diameters of 50 to 65 cm. In comparison, even less overestimation was observed when the multiple linear model (Equation 10) was incorporated into the estimation of stump diameters of 34 to 48 cm and 12 to 33 cm.

The coefficient of determination (standard error of the estimate) of the estimated volume values for d0.3 of 12 to 33 cm with the incorporation of the multiple linear model (Equation 10) was found to be equal to 94.9% (0.046), the corresponding coefficient of determination of the estimated values for d0.3 of 34 to 48 cm was found to be equal to 98.3% (0.046), the estimated volume values for d0.3 of 50 to 65 cm it was found to be equal to 91.9% (0.223) and finally, the estimated volume values for d0.3 of 66 to 96 cm, it was found to be equal to 85.2% (0.511), which shows that the estimation of the of the stump diameters values followed the dispersion of the actual volume values (Figure 7) after incorporating of the multiple linear equation (Equation 10). 

4 Discussion

In recent years, advances in the combined use of non-linear mixed effects models and segmented polynomial taper functions (Saygili & Kahriman, 2023), as well as the integration of advanced machine learning techniques such as generalized regression, Bayesian regularization, and support vector regression (Diamantopoulou, 2025), have significantly improved the accuracy of volume, diameter, and taper profile estimates. Overall, these approaches allow the development of more accurate and adaptive models with the ability to reliably estimate volume at different heights along the stem segments and by diameter classes, ensuring a more comprehensive understanding of the stem structure and the productive potential of the forest, providing essential tools for forestry applications such as logging planning, growth modeling, and timber utilization. However, forestry practice needs to acquire skills, expertise, and qualifications that it has not yet developed in order to apply and utilize these advanced modeling methodologies. This study aims to provide a system for stem-volume prediction of Scots pine (Pinus sylvestris L.) trees, with acceptable precision, easily understandable and usable, based on well-known regression modeling.

 

4.1 Model development 

Tree stump diameter and total tree height were the independent variables used to develop the stem mean diameter reduction factor model. As Figure 3 revealed, the relationship between the mean diameter reduction factor and the stump diameter and the total tree height could not be reliably classified as linear or non-linear. Therefore, numerous equations were adjusted to the data using regression analysis, including simple, multiple, and non-linear models, to select the most appropriate model. Finally, the multiple linear model (Equation 10) estimated the mean diameter reduction factor more accurately, as it was well adjusted to the data, with an adjusted coefficient of determination of 0.839 and a standard error of estimate equal to 0.1868 (Table 3). 

Accurate estimation of the mean diameter reduction factor enables the accurate prediction of diameters along the tree stem. The most valuable product in Greece, based on the selling price, is telecommunications and electricity poles, which, when they appear in the shaped stem and in any diameter (from 10 to 50 cm), maximize the commercial value of the Scots pine (Pinus sylvestris L.) tree (Kontos, 2002). This is a critical factor, as accuracy in this range of diameters directly affects estimates of productivity, economic performance, and forest management planning (Avery & Burkhart, 2015; Ulak et al., 2022), i.e., in the lower sections where the largest percentage of marketable wood volume is concentrated (Saygili & Kahriman 2023).

 

4.2 Volume estimation 

The results of the mean diameter reduction factor of Equation (10) were used in Smalian‘s segmental volumetry (Equation 8) for the indirect calculation of the total stem volume of Scots pine (Pinus sylvestris L.) in the Pieria Mountains of Greece, using a sample size that exceeded the minimum sampling requirements by approximately 1.5 times.

The results of the estimation of stem volume at all diameters of the stem per tree (Figure 6) indicate relatively high accuracy (Figure 7) with the incorporation of the applied multiple linear model (Equation 10), which estimates α quite well, allowing such equations to be reliably used in forest management (McTague & Weiskittel, 2021; Hansen et al., 2023). A comparison of the standard errors of the mean of their estimated volumes per tree (Table 5) shows that the values are slightly overestimated for stump diameters from 12 to 33 cm with SEE = 0.040 and from 34 to 48 cm with SEE = 0.070, compared to the corresponding standard errors of the means of their actual volumes at the same stump diameters. 

Based on the 45-degree line scatter plots (Figure 7), which were compiled for the overall comparative evaluation of the actual versus estimated total volume overall and tree-by-tree, as well as the calculation of the standard error of their estimates, it is found that the stem volume estimates obtained by incorporating the information from the multiple linear equation (Equation 10) have an SEE = 0.271 m3, slightly overestimating the actual volume values from the smallest to the largest stump diameters tree-by-tree. To predict the stem volume for stump diameters from 12 to 33 cm and from 34 to 48 cm per tree, were calculated SEE = 0.046 m3, followed by a significant and smaller overestimation of values across the entire range of stump diameters in the sample, thus contributing to the most reliable and accurate indirect determination of the total volume per tree of all these stump diameters. Also, a slightly greater overestimation was found in stump diameters from 50 to 65 cm and from 66 to 96 cm per tree, presenting SEE = 0.223 m3 and SEE = 0.511 m3, respectively, which values are considered equally acceptable. This distinction between stem dimensions highlights differences in morphology, conicity, and standing wood volume (Van Laar & Akça, 2007). The qualitative distribution of volume per tree provides a more targeted description of the structure of the forest and a more accurate representation of production, which is particularly useful for planning logging operations and economic evaluation of the forest (Pretzsch, 2009).

The importance of low standard errors of the volume estimates in the lower stem cannot be misrecognized. Given that most of the merchantable wood volume and economic value is concentrated in the first part of the stem, high accuracy in this area makes the multiple linear model (Equation 10) reliable and applicable to forest management decisions (Husch et al., 2003). In contrast, higher standard errors of the estimates in the upper parts of the stem have a limited effect, as the wood in these parts has a smaller diameter, lower quality, and limited commercial value. Based on this, high accuracy in the lower stem is considered desirable and contributes to increased confidence in total volume estimates (Kozak, 2004). The first section of the stem (the lower stem) accounts for most of the marketable wood volume and economic value, as confirmed by Saygili and Kahriman (2023). Thus, reducing the standard error of the mean (Table 5) in this section is crucial, as lower deviations lead to more accurate income forecasts, optimized logging operations, and reduced economic losses in timber marketing (Avery & Burkhart, 2015). In contrast, the higher standard error of the mean (Table 5) in the upper parts of the stem has a reduced impact, as these usually yield smaller diameters and lower-value products (McTague and Weiskittel, 2021).

 

4.3 Need for reliable stem-volume prediction system

Standing wood volume is one of the most basic characteristics of forest stands, as it is the main parameter described in forest inventories and management studies. The volume of a tree depends on its height, its basal area, its shape, and the thickness of its bark (Van Laar & Akça, 2007). Therefore, estimating volume is a difficult process in-field conditions, and any error in estimating one of the above factors results in an error in the estimated volume of the stem. Consequently, field work involves calculating the three basic parameters of the geometric formula (FAO, 1981), which may lead to errors. 

To avoid such errors in forestry practice, the use of volume tables is preferred. The creation of volume tables is based on the assumption that a tree‘s volume can be estimated using equations that relate it to specific tree characteristics. The breast height diameter, height, and some characteristics of the stem’s shape are usually the independent variables used to determine the volume (dependent variable) of the tree. However, research shows that adding a third independent variable, such as breast height diameter and total stem height, e.g., the crown base height (cbh) (Näslund, 1947) or the stump diameter (d0.3) (Wagner, 1982) or the diameter at 7 m, reduces the percentage of unexplained variance and enables a more accurate estimate of tree volume. According to Van Laar and Akça (2007), research regarding the form of trees in Finland showed that a double-entry volume table for Scots pine (Pinus sylvestris L.) produced biased volume estimates. The addition of an extra diameter to the upper part of the tree was necessary to obtain unbiased volume estimates for these areas. Van Laar and Akca (2007) further report that forest inventory research in Germany showed that adding an additional diameter of the upper part of the stem does not significantly improve the accuracy of tree estimates.

Volume tables for Scots pine (Pinus sylvestris L.) of the Public Forest Complex of Skoteina-Livadi-Fteri-Platanorema in the study area of the Pieria Mountains of Greece, and for similar neighboring forests, do not exist, neither local nor general. Therefore, the need for estimates for sectional stem volume of Scots pine (Pinus sylvestris L.) is of vital importance. 

Pieria Forest Directorate (2013) estimated standing wood volume by applying systematic sampling with sample plots across the forested area of all stands. Then, for the trees in the sample plots, diameter was measured to the nearest 0.1 cm with a caliper, and total tree height was recorded to the nearest 0.1 m with a relascope. The form factor, which was obtained according to the form factor tables provided by the Ministry of Agriculture for each diameter class. Finally, the volume of each stem is calculated based on the geometric formula. 

The suggested stem-volume prediction system that can reliably predict the total and sectional tree-by-tree volume based on the mean diameter reduction factor predictions (Equation 10) for deriving stem diameters along the stem which finally used for the segmented and total stem volume calculation using the Smalian’s formula, allows for more detailed and targeted silvicultural treatment of the stem. 

To further compare the effect produced from the α estimation (Equation 10) by the multiple linear model of Table 4 on the configuration of the stem diameter estimates, were used as data for the Smalian‘s segmental volumetry, the estimated volume calculated with Smalian‘s formula (Equation 8), was compared with the observed total stem volume, calculated with geometric formula from the observed stem diameters.

Fallias_Fig8.jpg

Figure 8: Scatter plot of the actual volume versus the volume via a) the Smalian’s formula (8) and b) the geometric formula.

Abbildung 8: Streudiagramm des beobachteten Volumens im Vergleich zum Volumen gemäß a) der Smalian-Formel (8) und b) der geometrischen Formel.

Based on the 45-degree line scatter plots (Figure 8), the very good indirect estimation of the total stem volume is achieved with the calculated volume using Smalian‘s Formula (Figure 8a), compared with the geometric formula (Figure 8b), which is less accurate. The coefficient of determination for the relationship between actual volume and stem volume using Smalian‘s formula (Equation 8) with diameter values derived by equation (10) embedded, was particularly high at 0.950, as the standard error of the estimate for this relationship between total stem volumes was found to be 0.271, indicating that the estimated volume is slightly overestimated. On the other hand, the coefficient of determination for the relationship between actual volume and stem volume using the geometric formula was lower at 0.850, while the standard error of estimation for the relationship between total stem volumes was 0.653, indicating that the estimated volume is significantly overestimated. This finding is of high practical significance since the management of the study area is currently based on the geometric formula.

Stump-based models are essential for reconstructing tree dimensions when d1.3 measurements are unavailable due to illegal logging or natural disturbances (Di Cosmo & Gasparini, 2020). A model to estimate d1.3 of felled trees was proposed by the first Italian National Forest Inventory in 1985 (MAF-ISAFA 1988), which classified the two broad groups of forest types and used d0.3 as the only quantitative variable. Subsequent analyses using data from multiple species demonstrated that incorporating both species specification and d0.3 substantially improves d1.3 prediction, with accuracy gains exceeding 30% (Di Cosmo & Gasparini, 2020). The competition of individual trees for natural resources leads in growth of their height as well as their basal area (Canham & Uriarte, 2006), while height growth is mainly determined by environmental conditions and growth of basal area is effectively constrained by growing space (Tomé & Burkhart, 1989). According Garlemos et al. (2019) in three study areas in Northern Greece using generalized linear modeling that integrates stump height diameter distance to neighboring trees, and tree species significantly enhances basal area predictions in mixed pine and spruce forests. However, incorporating neighbor effects reduced prediction precision in pure beech forests. This contrast suggests that stand composition determines the effectiveness of competition-based models.

The approach for predicting the mean diameter reduction factor can offer a solution of high practical importance. Once the (α) is determined, then the stump diameter measurement is the only diameter measurement required for assessing both the segmental and the total stem volume. This approach reduces the error of the geometric formula currently used in the forest by 0.382 m3 (58.4%), thereby contributing to the most reliable and accurate indirect determination of total stem volume tree-by-tree. The proposed strategy can provide the most precise information about the total stem volume, utilizing the easiest to obtain stem diameter, which is the stump diameter.

5 Conclusions

The proposed model for estimating the mean diameter reduction factor provides a simple, easy-to-use solution for accurately predicting the total stem volume of Scots pine (Pinus sylvestris L.). This knowledge can, in many ways, lead to optimal forest management, constituting a valuable tool in broader management and forestry contexts and offering a flexible solution, such as the estimation of the stem wood volume using easily obtained variables in the field, saving effort, and the prediction of the stem wood volume of trees that are no longer in the forest.

The suggested volume estimation system reduces the total stem volume estimation by 0.382 m3 (58.4%), as compared to the current methodology used in this forest area. The proposed strategy is hoped to contribute to the rational and sustainable management of the study forest and to the conservation of the mountain forest ecosystem. However, this system should be further tested across different Scots pine (Pinus sylvestris L.) forests to ensure its safe use for stem volume estimation and prediction.

References

Avery, T. E., Burkhart, H. E. (2015). Forest Measurements. Waveland Press. 448p.

Brooks, G. P., Barcikowski R. S. (1994). A New Sample Size for Regression. Paper presented at the Annual Meeting of the American Educational Research Association. New Orleans. ERIC Document Reproduction Service No. ED 412247. 55p.

Burkhart, H. E., Tomé, M. (2012). Tree Form and Stem Taper. In: Modeling Forest Trees and Stands. Springer, Dordrecht. 9-41pp. 

Canham, C. D., Uriarte, M. (2006). Analysis of neighborhood dynamics of forest ecosystems using likelihood methods and modeling. Ecological Applications, 16 (1): 62-73

Di Cosmo, L., Gasparini, P. (2020). Predicting diameter at breast height from stump measurements of removed trees to estimate cuttings, illegal loggings and natural disturbances. South-East European Forestry, 11 (1): 41-49

Diamantopoulou, M. J. (2022). Tree bole form simulation through support vector machine modeling: an alternative approach. Conference proceedings: Protection and Restoration of the Environment XVΙ, Kalamata, Greece, July 5-8-2022. 38-45pp.

Diamantopoulou, M. J. (2024). Forest Statistics. Theory and Forest Biometric Applications using IBM-SPSS and R. Ziti Publications. 524p. [in Greek]

Diamantopoulou, M. J. (2025). Tree stem mean diameter reduction factor prediction through advanced modeling approaches. Annals of Forest Research, 68 (1): 39-54

Diamantopoulou, M. J., Georgakis, A. (2024). Improving European Black Pine Stem Volume Prediction Using Machine Learning Models with Easily Accessible Field Measurements. Forests, 15 (12): 2251

Draper N.R., Smith H.(1998). Applied Regression Analysis. John Wiley & Sons. 706p.

Eckmüllner, O., Schedl, P., Sterba, H. (2007). Neue Schaftkurven für die Hauptbaumarten Österreichs und deren Ausformung in marktkonforme Sortimente. Austrian Journal of Forest Science, 124 (3-4¬): 215–236

FAO: Food and Agriculture Organization of the United Nations (1981). Manual of Forest Inventory: With Special Reference to Mixed Tropical Forests. FAO Forestry Paper. Vol. 27, Rome. 200p.

Fuentes, M., Niklasson, M., Drobyshev, I., Karlsson, M. (2010). Tree mortality in a semi-natural beech forest in SW Sweden. In: Editors Löf, M., Brunet, J., Mattsson, L., Nylinder, M. (eds) Broadleaved forests in southern Sweden: Management for multiple goals. Ecological Bulletins, 53: 117-129, 248p. 

Garlemos, A., Karanikolas, A., Psilidou, K., Pipinis, E., Stampoulidis, A., Kitikidou, K., Milios, E. (2019). Distance-dependent basal area models using stump height diameters for three tree species in Greece. Austrian Journal of Forest Science, 136 (4): 373–386

Gerasimidis, A., Panajiotidis, S., Hicks, S., Athanasiadis, N. (2006). An eight-year record of pollen deposition in the Pieria mountains (N. Greece) and its significance for interpreting fossil pollen assemblages. Review of Palaeobotany and Palynology, 141 (3-4): 231-243

Grigoriadis, N., Spyroglou, S., Grigoriadis, S., Klapanis, P. (2014). Effect of soil scarification on natural regeneration of mature Scots pine (Pinus sylvestris L.) stands in Greece. Global NEST Journal, 16 (4): 732-742

Hansen, E., Rahlf, J., Astrup, R., Gobakken, T. (2023). Taper, volume, and bark thickness models for spruce, pine, and birch in Norway. Scandinavian Journal of Forest Research, 38 (6): 413-428

Hartwing, F., Dearking, B. E. (1979). Exploratory data analysis. Series: Quantinative Applications in the Social Sciences No 07-016. Sage Publications. London. 83p.

Hoaglin, D., Mosteller, F., Tukey, J. W.(1985). Exploring data tables, trends and shapes. John Wiley and Sons, Inc. New York. 527p.

Husch, B., Beers, T. W., Kershaw Jr, J. A. (2002). Forest Mensuration. 4th Edition. John Wiley & Sons. 433p.

IBM SPSS CORP. (2023). IBM SPSS Statistics for Windows (version 29.0). [Computer software]. IBM Corp. Available at: https://www.ibm.com/docs/en/spss-statistics/29.0.0?topic=regression-nonlinear-options

Jones, D. A., Harrington, C. A., St. Clair, J. B. (2022). Influence of climate on annual changes in Douglas-fir stem taper. Trees, 36 (2): 849-861

Kontos, N. (2002). Maximizing the commercial value of forest pine log production. Doctoral thesis. Forest Economics Laboratory. Department of Forestry and Natural Environment. Aristotle University of Thessaloniki. Thessaloniki. 145p. [in Greek]

Kozak, A. (2004). My last words on Taper Equations. The Forestry Chronicle, 80 (4): 507-515

Kublin, E., Breidenbach, J., Kändler, G. (2013). A flexible stem taper and volume prediction method based on mixed-effects B-spline regression. European Journal of Forest Research, 132 (5):983-997

Levenberg, K. (1944). A method for the solution of certain non-linear problems in least squares. Quarterly of applied mathematics, 2 (2):164-168

Lilliefors, H. W. (1967). On the Kolmogorov-Smirnov test for normality with mean and variance unknown. Journal of the American statistical Association, 62 (318): 399-402 

MAF-ISAFA (1988). Criteri di stima delle utilizzazioni legnose pregresse. In: Inventario forestale nazionale-IFNI1985. Sintesi metodologia e risultati. Ministero dell’Agricoltura e delle Foreste, Corpo forestale dello Stato. Istituto Sperimentale per l’Assestamento Forestale e per l’Alpicoltura, Trento. 463p.

Maraseni, T. N., Cockfield, G., Apan, A. (2007). Estimation of taper rates and volume of smaller-sized logs in spotted gum saw timber plantations in Southeast Queensland, Australia. Southern Hemisphere Forestry Journal, 69 (3): 169-173

Marquardt, D. W. (1963). An algorithm for least-squares estimation of nonlinear parameters. Journal of the society for Industrial and Applied Mathematics, 11 (2): 431-441

Marquardt, D. W. (1974). Biased Estimators in Regression. SREB Summer Research Conference, Winter Park. Florida.

Matis, K. G. (2004). Forest Biometrics II. Dendrometry. Pegasos Publications. Thessaloniki. 674p. [in Greek]

Mauri, A., Girardello, M., Strona, G., Beck, P. S., Forzieri, G., Caudullo, G., Manca, F., Cescatti, A. (2022). EU-Trees4F, a dataset on the future distribution of European tree species. Scientific data, 9 (1): 37

McTague, J. P., Weiskittel, A. (2021). Evolution, history, and use of stem taper equations: a review of their development, application, and implementation. Canadian Journal of Forest Research, 51 (2):210-235

Murthy, Z. V. P. (2014). Nonlinear Regression: Levenberg-Marquardt Method. In: Drioli, E., Giorno, L. (eds) Encyclopedia of Membranes. Springer, Berlin, Heidelberg. 1-3pp.

Näslund, M. (1947). Functions and tables for computing the cubic volume of standing trees: Pine, spruce and birch in southern Sweden and in the whole of Sweden. Reports of the Forest Research Institute of Sweden. Meddelandenfrån Statens Skogsforskningsinstitut, 36 (3): 41-53

Neter, J., Wasserman, W., Kutner, M. H.(1990). Applied Linear Statistical Models. 3rd Edition R. D. Irwin, Inc Homewood, Illinois 60430. 1181p.

Oikonomakis, N. G., Ganatsas, P. (2020). Secondary forest succession in Silver birch (Betula pendula Roth.) and Scots pine (Pinus sylvestris L.) southern limits in Europe, in a site of Natura 2000 network–an ecogeographical approach. Forest systems, 29 (2): 81-96

Pérez, L. D., Viquez, E., Kanninen, M. (2003). Preliminary pruning programme for tectona grandis plantations in Costa Rica. Journal of Tropical Forest Science, 15 (4): 557-569

Peterson St-Laurent, G., Oakes, L. E., Cross, M., Hagerman, S. (2021). R–R–T (resistance–resilience–transformation) typology reveals differential conservation approaches across ecosystems and time. Communications biology, 4 (1): 39

Philip, M.(1994). Measuring Trees and Forests. 2nd Edition. CAB International, Wallingford. 310p.

Pieria Forest Directorate (2013). Forest Management Study of the Public Forest Complex of Skoteina-Livadi-Fteri-Platanorema. Forest Management Period 2014-2023. Katerini, May. 80p. [in Greek]

Polglase, P. J., Snowdon, P., Theiveyanathan, T., Paul, K. I., Raison, R. J., Grove, T., Rance, S. J. (2003). Calibration of the Full CAM model to Eucalyptus globulus and Pinus radiata and uncertainty analysis. National Carbon Accounting System Technical Report, Australian Greenhouse Office, Canberra (41):36

Poorter, H., Niklas, K. J., Reich, P. B., Oleksyn, J., Poot, P., Mommer, L. (2012). Biomass allocation to leaves, stems and roots: meta‐analyses of interspecific variation and environmental control. New Phytologist, 193 (1): 30-50

Pretzsch, H.(2009). Forest Dynamics, Growth and Yield. Springer. Berlin, Heidelberg. 664p.

Prodan, M.(1965). Holzmesslehre, J. D. Sauerlander’s Verlag. Frankfurt am Main. 644p.

Raptis, D. I., Papadopoulou, D., Psarra, A., Fallias, A. A., Tsitsanis, A. G., Kazana, V. (2024). Height-diameter models for King Boris fir (Abies borisii regis Mattf.) and Scots pine (Pinus sylvestris L.) in Olympus and Pieria Mountains, Greece. Journal of Mountain Science, 21 (5): 1475-1490

Ratkowsky, D. A. (1990). Handbook of Nonlinear Regression Models. Marcel Dekker, Inc. New York. 241p.

Saygili, B., Kahriman, A. (2023). Modeling compatible taper and stem volume of pure Scots pine stands in Northeastern Turkey. iForest, 16 (1): 38-46

Shaw, D. J., Meldahl, R. S., Kush, J. S., Somers, G. L. (2003). A tree taper model based on similar triangles and the use of crown ratio as a measure of form in taper equations for longleaf pine. Gen. Tech. Rep. SRS-66. Asheville, NC: U.S. Department of Agriculture, Forest Service, Southern Research Station. 8p.

Tome, M., Burkhart, H. E.(1989). Distance-dependent competition measures for predicting growth of individual trees. Forest Science, 35 (3): 816-831

Tukey, J. W.(1977). Exploratory Data Analysis. Reading/Addison-Wesley. 61p.

Ulak, S., Ghimire, K., Gautam, R., Bhandari, S. K., Poudel, K. P., Timilsina, Y. P., Pradhan, D., Subedi, T. (2022). Predicting the upper stem diameters and volume of a tropical dominant tree species. Journal of Forestry Research, 33 (6): 1725-1737

Van Laar, Α., Akça, A. (2007). Forest Mensuration. Springer, Berlin. 390p.

Wagner, M. (1982). Ermittlung von Einzelstamm – Volumen mit D-H und oberen Durchmessern. Allgemeine Forst und Jagdzeitung, 153:72–75

Weiskittel, A. R., Hann, D. W., Kershaw Jr, J. A., Vanclay, J. K. (2011). Forest Growth and Yield Modeling. John Wiley & Sons, London. 405p.

West, P. W. (2015). Tree and Forest Measurement. 3rd Edition, Springer, Cham, 214p.

Zagas, T., Tsitsoni, T., Hatzistathis, A. (2001). The mixed forests of Greece. Silva Gandavensis, 66: 68-75

Zagas, Th. D. (1990). Conditions for the natural establishment of Scots pine in the Rhodope region. Doctoral Thesis. Department of Forest Production – Forest Protection and Natural Environment. Department of Forestry and Natural Environment. Aristotle University of Thessaloniki. Thessaloniki. 170p. [in Greek]