Risk Factor Analysis of Cryopreserved Autologous Bone Flap Resorption in Adult Patients Undergoing Cranioplasty with Volumetry Measurement Using Conventional Statistics and Machine-Learning Technique
Article information
Abstract
Objective
Decompressive craniectomy (DC) with duroplasty is one of the common surgical treatments for life-threatening increased intracranial pressure (ICP). Once ICP is controlled, cranioplasty (CP) with reinsertion of the cryopreserved autologous bone flap or a synthetic implant is considered for protection and esthetics. Although with the risk of autologous bone flap resorption (BFR), cryopreserved autologous bone flap for CP is one of the important material due to its cost effectiveness. In this article, we performed conventional statistical analysis and the machine learning technique understand the risk factors for BFR.
Methods
Patients aged >18 years who underwent autologous bone CP between January 2015 and December 2021 were reviewed. Demographic data, medical records, and volumetric measurements of the autologous bone flap volume from 94 patients were collected. BFR was defined with absolute quantitative method (BFR-A) and relative quantitative method (BFR%). Conventional statistical analysis and random forest with hyper-ensemble approach (RF with HEA) was performed. And overlapped partial dependence plots (PDP) were generated.
Results
Conventional statistical analysis showed that only the initial autologous bone flap volume was statistically significant on BFR-A. RF with HEA showed that the initial autologous bone flap volume, interval between DC and CP, and bone quality were the factors with most contribution to BFR-A, while, trauma, bone quality, and initial autologous bone flap volume were the factors with most contribution to BFR%. Overlapped PDPs of the initial autologous bone flap volume on the BRF-A crossed at approximately 60 mL, and a relatively clear separation was found between the non-BFR and BFR groups. Therefore, the initial autologous bone flap of over 60 mL could be a possible risk factor for BFR.
Conclusion
From the present study, BFR in patients who underwent CP with autologous bone flap might be inevitable. However, the degree of BFR may differ from one to another. Therefore, considering artificial bone flaps as implants for patients with large DC could be reasonable. Still, the risk factors for BFR are not clearly understood. Therefore, chronological analysis and pathophysiologic studies are needed.
INTRODUCTION
Decompressive craniectomy (DC) with duroplasty is one of the common surgical treatments for life-threatening increased intracranial pressure (ICP) in patients with traumatic brain injury and stroke [2,42,47,48,57]. Once ICP is controlled, cranioplasty (CP) with reinsertion of the cryopreserved autologous bone flap or a synthetic implant is considered for protection and esthetics. The use of the elaborately crafted patient-specific synthetic implants for CP with various materials such as polyetheretherketone (PEEK) or titanium alloy is becoming popular recently, and the results are promising [14,23,28,31,41,49,53,59-61]. However, the reinsertion of the cryopreserved autologous bone flap is still a commonly used method for its cost-effectiveness. Although most CPs with cryopreserved autologous bone flap is performed without incidence, complications including surgical site infections and bone flap resorption (BFR) remains [10,12,19,29,35,45]. In this article, we performed conventional statistical analysis and the machine learning technique understand the risk factors for BFR.
MATERIALS AND METHODS
This retrospective study was approved by the Institutional Review Board before data collection (2021-07-024), and the need for patient consent was waived by the relevant ethics board because of the retrospective study design.
In this study, data of patients aged >18 years who underwent autologous bone CP between January 2015 and December 2021 were reviewed. Medical records and images of 202 patients were reviewed. Patients who had multiple follow up brain computed tomography (CT) images, which are adequate for three-dimensional (3D) reconstruction, and were followed for at least 6 months after autologous bone CP were included. Therefore, demographic data, medical records, and volumetric measurements of the autologous bone flap volume from 94 patients were collected. CT scans with abnormally storage Digital Imaging and Communications in Medicine (DICOM) data were excluded from this study.
Bone flap preparation and preservation
Once bone flaps were obtained during DC, soft tissues were meticulously removed, packed with two sterile layers, sealed in a sterile Ziploc bag, and then stored at -80°C without the application of local antibiotics or autoclaving. Only large fragments were preserved without fixation. Fragments or fractures were fixed during CP.
Volumetric analysis of the autograft bone flap
Our institute has used 1-mm slice bone setting images in default brain CT protocol since 2015. We used the first postoperative CT scan as the control point and the following CT scans for the BFR analysis. DICOM data of the brain CT was exported to generate a 3D model of the autologous bone flap of the CP using open-source 3D Slicer version 4.10.2 (http://www.slicer.org; Isomics, Inc, Cambridge, MA, USA). The same threshold level was used for each patient to render the whole skull 3D model and the autologous bone flap was precisely segmented using editing tools. Extracted bone flaps from each CT scans were exported in stereolithography file format and then imported to open-source Meshlab version 2016 (Visual Computing Lab; ISTI - CNR, Pisa, Italy) for volume measurement. If the 3D model of the autologous bone flap cannot be trimmed clearly from the 3D slicer, Meshmixer version 3.5 (http://www.meshmixer.com; Autodesk, Inc., San Rafael, CA, USA) was used for additional modification of the autologous bone flap 3D models.
BFR
As no standard method of defining BFR has been established for quantitative analysis, BFR was defined in two criteria : absolute quantitative method (BFR-A) and relative quantitative method (BFR%). BFR-A and BFR% are calculated using the following equation :
BFR-A = volume at first postoperative CT − volume at last CT
BFR% = (volume at first postoperative CT − volume at last CT)/volume at first postoperative CT [4].
For the statistical analysis and machine learning analysis, a positive BFR was defined as BFR-A is >20 mL or the BFR% is >30% which were approximately 20% of the dataset.
Estimation of bone quality
Hounsfield unit (HU) values were measured from the frontal bone of the non-CP site to estimate the current status of bone quality [36].
Statistical analysis
The Pearson test, Spearman rho test, and univariate and multivariate logistic regression analyses were employed in the risk factor analysis using R version 4.1.2 (R Project for Statistical Computing, Vienna, Austria). A p-value of <0.05 and 95% confidence intervals were considered statistically significant. The chi-squared test, Spearman rho test, and univariate and multivariate regression analyses were used to determine the correlation between risk factors and BFR.
Machine-learning analysis using random forest (RF) with a hyper-ensemble approach (HEA)
Machine-learning analysis using RF with HEA was employed to analyze the contribution of each variable to BFR [11,39,46].
Step 1 : generating multiple balanced datasets. Multiple balanced datasets were generated from the original dataset. A hybrid random sampling technique using the ROSE package was utilized to generate multiple balanced datasets from the original [33]. A total of 500 balanced datasets were generated for RF training using 500 randomly generated seed numbers.
Step 2 : RF training (ensemble methods). The RF algorithm was performed with R software (R version 4.1.2; R Project for Statistical Computing) using the randomForest package [32,56]. Each balanced dataset was subjected to RF training by building 50000 decision trees. Approximately 70% of observations were selected randomly with replacement (bootstrap aggregation; bagging) for RF training, and the remaining observations (out of bag [OOB]) were used as test datasets for the evaluation of the RF models [8]. Four randomly selected variables were used to split each node [8]. In handling, absent data were not filled with the na.roughfix function from the randomForest package because menopause should remain absent in male patients. The mean decrease accuracy (MDA) and mean decrease Gini (MDG) were obtained to assess the importance of variables. The mean and standard deviation of the OOB error rate, area under the receiver operating characteristic curve (AUC), accuracy, sensitivity, and specificity were also calculated to validate the 500 RF models.
Step 3 : HEA. The results from 500 RF models were combined with the HEA (ensemble of ensembles) [11,39,46]. The mean and standard deviation of the OOB error rate, AUC, accuracy, sensitivity, and specificity were calculated from 500 RF models [11]. Variable importance was visualized with box plots of MDAs and MDGs to present various ranges of results exported from multiple RF models [11].
Overlapped partial dependence plots (PDP) were generated from the 500 RF models for the five most contributing factors using the edarf package to visualize the relationship between each factor and the BFR-A and BFR% [11,18].
RESULTS
Conventional statistical analysis
Among 94 patients, approximately 20% were classified as having positive BFR in both definitions BFR-A and BFR%. Despite the definition of BFR, there was no difference between the non-BFR group and the BFR group in demographics and medical histories. In BFR-A, the initial autologous bone flap volume was statistically significant and correlated with BFR. However, no other risk factors showed statistical significance on both BFR definitions (Table 1). Univariate and multivariate regression analyses also failed to show any statistical significance. The Cohen’s kappa between BFR-A and BFR % was 0.86.
Chronological analysis
The relative volume of the autologous bone flap was visualized with a spaghetti plot (Fig. 1). The volume of the autologous bone flap decreased over time.
Machine-learning analysis
The OBB error rate of each RF model on both BFR-A and BFR% was mostly stabilized within the first 20000 decision trees. The hyper-ensembled OOB error rates on BFR-A and BFR% were 13.44%±5.64% and 10.19%±3.99%, and the AUCs were 0.86±0.06 and 0.89±0.04, respectively. The mean accuracy, sensitivity, and specificity of the BFR-A were 0.88±0.11, 0.91±0.16, and 0.84±0.20 and those of BFR% were 0.90±0.09, 0.94±0.11, and 0.86±0.15, respectively.
The box plot shows that the individual RF models show various ranges in both MDA and MDG. The initial autologous bone flap volume, interval between DC and CP, and bone quality were the factors with most contribution to BFRA prediction according to both MDA and MDG. According to the MDA, trauma, bone quality, and initial autologous bone flap volume were the factors with most contribution to BFR% prediction. However, the ranking of importance of BFR% according to MDG differed with MDA, and trauma dropped to third place (Fig. 2).
Overlapped PDPs of the initial autologous bone flap volume on the BRF-A crossed at approximately 60 mL, and a relatively clear separation was found between the non-BFR and BFR groups. Therefore, the initial autologous bone flap of over 60 mL could be a possible risk factor for BFR. Most PDPs of the interval between DC and CP on BFR-A showed crossing of the predicted probability in approximately 60 days; however, results of PDPs compared with that of the initial autologous bone flap volume were debated. PDPs of bone quality, age, and Glasgow coma scale (GCS) also showed debated results on predicting BFR. Regarding BFR%, none of the variables showed a clear separation between the non-BFR and BFR groups.
DISCUSSION
CP aims to restore cerebral protection and cosmesis. Benefits reported included the improvement of cerebral perfusion, cerebrospinal fluid dynamics, and neurological function and the relief of the syndrome of the trephined [3,24,34,37,50,58]. Autologous bone flaps are widely used for CP for various reasons such as price competitiveness, biocompatibility, strength, and ideal contour [15,28]. Despite its advantages, the use of autologous bone flaps is at risk of BFR, which is reported in 1.96–22.8% of adult patients and 50% of pediatric patients [16]. Autologous bone flap requires revascularization, osteoconduction, osteoinduction, and osteogenesis for reintegration [20-22,29]. However, autologous bone flaps are devascularized during the harvesting process in DC; therefore, it has limited revascularization ability after CP. The fatty bone marrow also acts as a barrier for osteoblasts; therefore, increased signs of osteoclastic resorption became prominent [21]. In the present study, a large portion of the autologous bone flap had gradual volume loss (Fig. 1). A previous study also showed similar results, that is, the decrease in the autologous bone flap volume is a very common phenomenon, which is observed in up to 90% of patients who underwent CP [4,29].
The pathophysiology indicates that BFR is inevitable with only differences in the extent, and no study has provided a generalized definition of BFR [4,13,16,25,29,38,43] because defining BFR as an autologous bone flap that needs reinforcement or replacement is subjective and the indication of CP re-operation differed among clinicians. Therefore, this study used two criteria for BFR using an absolute quantitative method (BFR-A) and a relative quantitative method (BFR%). BFR-A shows that the amount of BFR corresponded to the autologous bone flap.
Suggested risk factors for BFR include sex, age, interval time between DC and CP, presence of systemic factors, autologous bone flap size, number of bone fragments, and shunt operation [5,7,9,22,38,40,44,45,51]. As much clinical data were collected from the cohort; however, in the conventional statistical analysis of risk factors for BFR, only the initial autologous bone flap volume from BFR-A was found to be significant. For a better understanding of BFR, RF with HEA was applied [7,11].
A decision tree is a classification algorithm that was used to generate and ideally classify the target in the present study, BFR. Generally, only a single decision tree can be derived from a single dataset. RF is an algorithm that assembles the results from multiple decision trees [8]. To generate multiple datasets, it uses bootstrap aggregation (bagging), which is randomly selecting observations (clinical cases) with replacements. In building a decision tree, randomly selected variables are applied to each node, that is, each decision tree gets unique features. This randomness gives RF flexibility and robustness in dealing with high-dimensional data and reduces overfitting [8]. Being a simple method, it is fast compared with other machine-learning methods. RF with HEA takes one step forward by assembling the results of multiple RFs (ensemble of ensembles). This method was first developed to analyze rare genetic variants, and recently, it is used in various clinical studies [1,11,46,52]. RF with HEA minimizes the human error from a single RF, and the range of results from multiple RFs can reveal the tendency of the effect of variables on the target [11]. For easier understanding, multiple studies with the same study design can show various possibilities by generating multiple dataset combinations. As all analyses are based on randomness, results are closer to general expectations presented in multiple studies.
The box plot of the MDA and MDG visualizes the effect of variables on BFR prediction from the RF with HEA. The initial autologous bone flap volume was the variable with the most influence on BFR-A prediction, followed by the interval between DC and CP, bone quality, age, and GCS. In BFR%, trauma as the reason for DC had the most influence, followed by bone quality, initial autologous bone flap volume, interval between DC and CP, and age. In the comparison of box plots of BFR-A and BFR%, the influence between variables in BFR-A was prominent compared with that in BFR%. In BFR%, the top-ranking variable from the MDA was discordant with that of MDG, whereas the ranks in both MDA and MDG from BFR-A were constant. This implies that the results of RF with HEA according to the definition of BFR-A in more reliable.
Overlapped PDPs from the top five variables for RF with HEA of BFR-A and BFR% were obtained to reveal how each variable affects BFR. Each pair of PDP plots represents the influence of the variables on BFR in a single RF, and the overlapped 500 pairs of PDPs provide an overview of the overall RF results. Moreover, each pair of PDPs is separated, and the variable is more capable of classifying BFR. Among overlapped PDPs, the initial autologous bone flap in BFR-A shows the most significant separation. A bone flap over 60 mL is a break-even point in predicting BFR. Other than the initial autologous bone flap volume from the BFR-A, the interval between DC and CP in the BFR-A is relatively less controversial than the rest of the overlapped PDPs. According to the results of RF with HEA, the initial autologous bone flap volume and interval between DC and CP are possible risk factors according to the definition of BFR-A.
Risk factors for BFR
Bone flap size
As mentioned above, the initial autologous bone flap volume was statistically significant and pointed out as the most potent risk factor in predicting BFR-A (Table 1). Overlapped PDPs in the initial autologous bone flap volume in the BFR-A showed that the larger bone flap is a potential risk factor of BFR (Fig. 3). The influence of the initial autologous bone flap volume is dampened in the definition of BFR% as it is leveled. Therefore, the initial autologous bone flap volume failed to show statistical significance in BFR%; however, it still ranked third in the BFR% from the results of RF with HEA (Fig. 3). This result may represent that the autologous bone flap size is a strong risk factor. Previous studies also showed that larger bone flap is a risk factor of BFR [22,27,35]. Unfortunately, the main goal of DC is to obtain as much space as possible by removing the bone flap and performing duroplasty. Therefore, reducing the size of the autologous bone flap to prevent BFR is rarely feasible [25].
Interval between DC and CP
Some previous studies have proposed that cryopreservation of the autologous bone flap impairs viability and graft revascularization, which leads to BFR [6,27,40,54,55]. The present study also shows similar results from BFR-A. According to the overlapped PDP of the interval between DC and CP in BFR-A, CP after 3 months is expected to be a risk factor for BFR, whereas CP within 2 months shows protective effects on BFR (Fig. 3). Still, some aspects of the RF show debated results, and the results of the overlapped PDPs should be carefully interpreted. However, some studies have shown that prolonged cryopreservation does not affect BFR [26].
Age, menopause, and bone quality
BFR in pediatric patients is a well-known risk factor and is reported in up to 50% of cases [7,16,30,35]. According to the idea that BFR is caused by an imbalance between bone formation and bone resorption, we assumed that younger age and osteoporosis could be risk factors for BFR. With further assumption, as menopause is one of the risk factors of osteoporosis, it was also included as a potential risk factor in the present study. During the medical record review, age and menopause data were available; however, owing to the retrospective design, the majority of the patients have not undergone osteoporosis study. Instead gathering osteoporosis data, the skull bone was measured in HUs, as it is a potentially opportunistic osteoporosis screening test [36]. However, unlike our expectations, age, menopause, and bone quality were not statistically significant. In RF with HEA, bone quality and age were among the top five variables in according to MDA and MDG in both definitions (Fig. 2). Overlapped PDPs could not reveal how the variables affect BFR because mixed results are shown (Fig. 3).
Trauma and number of bone fragments
In previous studies, trauma and the number of bone fragments were pointed out to be risk factors for BFR [7,17,51]. In the present study, whether trauma is a risk factor of BFR is quite unclear, particularly on the results of the RF with HEA. Trauma only appears as a risk factor with strong effects on BFR% (Fig. 2). However, the ranking of trauma reverses on MDA and MDG. The boxplot also shows that the RF with HEA had a wide range of results compared with other top-ranked risk factors. Owing to the binary nature of variables, overlapped PDPs should be more clearly separated, if the results of the RF with HEA are consistent. However, the overlapped PDPs of trauma show mixed results, and the explanation of the effect of trauma on BFR is unclear (Fig. 3).
Limitations
Although quantitative methods were applied to determine BFR objectively, few limitations were noted. First, owing to the retrospective design of the study, follow-up intervals and period were not unified. Therefore, chronologic analyses of BFR were unavailable. The annual decrease rate to define BFR was also considered, and the annual decrease rate was not constant throughout time; thus, the annual decrease rate calculated from the large gap between CT scans may not be reliable. To understand the chronologic characteristics of BFR, a well-designed follow-up CT protocol is necessary. Second, the quality of CT could be different. Our institute used the same CT protocol with the same CT machines since 2015. However, two of the volumetric data showed alias data, which are sudden increments of the autologous bone flap volume >200% and dropped to a similar volume compared with the initial autologous volume. We could not find the reason for these phenomena; however, the only plausible explanation is the error during exporting of data to the DICOM server. As the volumetric data might not represent 100% exact volume of the autologous bone flap, we gave a sufficient margin of difference in defining BFR. Third, in the present study, the definitions of BFR in the clinical field are not the same. In the clinical field, determining BFR is necessary to screen patients who underwent CP and need reinforcement or replacement of the autologous bone flap. However, bone volume loss does not mean that the strength of the autologous bone f lap decreased. Therefore, the opinion of the clinicians could intervene on proceeding with additional surgical procedures. Although the clinical significance was unsatisfactory, the volumetric method was chosen to subjectively analyze the risk factors for BFR. Finally, the use of RF with HEA could be questionable, that is, whether it is the most suitable machine-learning technique. A previous study using RF with HEA showed a good explanation of the risk factors in a small cohort [11]. As RF with HEA implies randomness from multistage during the analysis, human alternation was nearly absent, which could affect the results [11]. RF with HEA also has an advantage over support vector machine and neural network on interpretation of its results [11]. The results of the RF with HEA do not represent the causality of the variable to the target, however, overlapped PDPs visualize the tendency of the effects of variables on the target. The results of RF with HEA could be straightforwardly interpreted if the overlapped PDPs are separated clearly. In current study, most of the results of the RF with HEA showed imperfect separation of the overlapped PDPs; therefore, we can only determine which variable is more potent in causing BFR but cannot determine how it influences BFR.
CONCLUSION
From the present study, BFR in patients who underwent CP with autologous bone flap might be inevitable. However, the degree of BFR may differ from one to another. Currently, personalized artificial bone flaps with PEEK or titanium alloy are widely available. Considering artificial bone flaps as implants for patients with large DC could be reasonable. Still, the risk factors for BFR are not clearly understood. Therefore, chronological analysis and pathophysiologic studies are needed.
Notes
Conflicts of interest
No potential conflict of interest relevant to this article was reported.
Informed consent
This type of study does not require informed consent.
Author contributions
Conceptualization : JC; Data curation : YS, JC; Formal analysis : YS, JC; Funding acquisition : JC; Methodology : YS, JC; Project administration : JC; Visualization : YS, JC; Writing - original draft : YS; Writing - review & editing : YS, JC
Data sharing
None
Preprint
None
Acknowledgements
The present research was supported by the research fund of Dankook University in 2021.