The resistance to treatment prompted us to perform the national PEPPI study (Pediatric
Ependymoma Photons Protons and Imaging). PEPPI collated clinical, imaging and dosimetry data
from children with intracranial ependymoma treated in France between 2000 and 2013. Since
then, patients are treated in the current prospective SIOP II Ependymoma program. Our
clinical results confirmed the classical clinical prognostic factors including radiotherapy
dose, and we showed that imaging biomarkers from T2/FLAIR, perfusion and diffusion MRI were
novel prognostic factors. We reported that relapse after RT occurs mainly locally within the
high-dose region. The prognostic value of dose and the high rate of relapse with standard
dose prompted us to perform an in silico dosimetry study of dose escalation comparing photons
and protons, confirming the feasibility of this approach. In PEPPI series, the prediction of
the site of relapse with advanced MR imaging was not possible due to small sample of imaging
data and the recent biomolecular classification was not available. Recently, DNA methylation
profiling provided a novel classification of ependymoma in molecular subgroups harbouring a
strong prognostic value.
EPENDYMOMICS project aims to determine the prognostic role of multimodal imaging, to identify
radioresistant clusters predictive of relapse and to analyze the link between radiomics
features and genomic markers, hence leading to a radiogenomics study. It will build on a
database with a larger number of patients called NETSPARE (Network to Structure and Share
Pediatric data to Accelerate Research on Ependymoma) to develop a radiomics approach, i.e.
predict clinical endpoints by relying on quantitative features extracted from medical images
using either handcrafted or automated features extraction algorithms. These features will be
exploited and combined with other available variables (clinical, genetic, etc.) as improved
decision support.
1. Data collection MRI: Diagnostic, FU until relapse (DICOM format)
- - Sequences: T1W, T2W, FLAIR, T1W with & without contrast enhancement,
diffusion-weighted imaging (DWI), perfusion-weighted imaging (PWI) Radiotherapy: CT
scan, RTDOSE, RTSS, RTPLAN (DICOM RT format) Clinical: Age, surgery, chemotherapy,
doses, late effects, relapse date Histology: Hematoxylin and Eosin (H&E)-stained
histopathology slides Molecular Biology: Methylation groups RELA, YAP, PFA, and PFB.
2. Data quality check and delineation of volume of interest to curate data before
post-processing and analysis.
To ensure preliminary robust image segmentation the Radiotherapy Structure Set will be
checked for each patient. The referring radiologist will confirm all imaging changes
after RT. An expert radiation oncologist will segment volumes of post-treatment
abnormalities in T1WI post contrast, in T2W/FLAIR imaging, as well as site of relapse
and missing OARs.
3. Imaging data post-processing and analysis of changes after radiotherapy We will process
the DWI and PWI with the Olea Sphere®3.0 software, a post-processing solution for MRI
and CT scanners. We will extract and generate Apparent Diffusion Coefficient maps from
DWI data calculated on a voxel-by-voxel basis and relative cerebral blood volume maps
calculated from PWI data with an oscillation-index singular value decomposition routine
and correction for T1-weighted leakage effects.
All the MRI data will be strictly co-registered with the T1WI-PC and planning CT.
4. Radiomics harmonization We will perform the increase of the level of harmonization of
images and/or extracted features on highly heterogeneous dataset due to its multiple
sources.
We will develop a GAN-based framework, translating heterogeneous images to match the
properties of a standard dataset, such as a template reference image, or alternatively,
one set of images chosen as a reference. We need to determine the relevant properties
within images (local or global metrics, texture, edges, contrast, signal-to-noise ratio,
etc.) and to ensure the ability of the framework to harmonize images without losing
their clinically relevant informative and content.
In the features space, numerous statistical approaches can be applied, such as
normalization or batch effect compensation. ComBat has been shown to outperform other
similar approaches and we will rely on Monte Carlo estimation for small samples. We will
investigate the combination of the batch-correction methods with unsupervised clustering
to deal with data presenting very high heterogeneity and a very small number of samples
per batch.
It might be beneficial and complementary to combine image-based and feature-based
harmonization methodologies for improving the results of multicenter radiomics studies.
We will evaluate the potential added benefit of both previously developed approaches in
improving the results. The goal of this task will be to establish whether the first or
the second approach (or the combination of both) is the most efficient, taking into
account not only the absolute improvement observed in the results, but also the
computing time and effort required to implement each approach in practice.
5. Biomolecular analysis Biological resources from PEPPI and SIOPII ependymoma cohorts of
patients will be included. First, the diagnosis of intracranial ependymomas will be
confirmed and then pathological features will be investigated.
WHO 2020 classification of ependymomas: the cIMPACT-NOW (Consortium to Inform Molecular
and Practical Approaches to CNS Tumor Taxonomy
- - Not Official WHO) decided to classify
ependymomas according to location and the underlying genetic alteration.
The current WHO
classification is under revision.
Biological studies required to classify intracranial ependymoma of the project:
Immunohistochemistry: p65-RelA, H3K27Me3. FISH analysis (classification and Copy Number
Variations). DNA-methylation profiling. RNAseq analysis.
6. Deep learning (DL) for molecular biomarkers of ependymoma For this purpose, the Zimmer
team has designed and trained deep convolutional neural networks based on the Inception
V3 architecture to distinguish ependymoma from glioma grades III and
- IV. Preliminary
results based on an imaging data set indicate a high accuracy of discrimination and
suggest the possibility to predict clinically relevant mutations from H&E images alone
(manuscript in preparation).
We will extend these DL approaches to predict resistance to therapy and metastasis. This
will be done by using the histopathology images alone either as input or in combination
with classifications based on molecular assays. At a technical level, we will use
state-of-the-art deep neural network architectures based on convolutional layers and
skip connections. To compensate for the moderate size of the training data set, we will
make heavy use of transfer learning. Here, we will make specific use of the TGCA
database, which contains images from very different cancers such as breast cancer, and
adapt successful models by retraining on a subset of the ependymoma data acquired in
this study. We also will adopt multi-scale image representations for a better
exploitation of the image information at cellular and tissue levels, use H&E specific
color normalization and methods for automated elimination of regions subject to imaging
artefacts. We also will incorporate the developed methods in Imjoy. By decoupling the
graphical user interface from the computational back end, it enables users to access
high-end computing capabilities from any workstation or laptop.
7. Statistical analysis Several signatures will be developed and validated : a radiomic
signature (RAD-Score), a radiomic and clinical signature (RADClin-Score), a molecular
signature (MOL-score) and a global signature (Radiomic, Molecular and clinical
signature).
- - RAD-Score associated with progression-free survival (PFS).
To avoid biases,
NETSPARE built from retrospective multicenter study including independent cohorts
is a training cohort. The validation cohort to test the performance of the radiomic
signature corresponds to the UK cohort of SIOP II Program (n=220). This division
allows external validation.
Training Cohort: the median PFS is estimated to be 5 years in literature. This corresponds to
a 3.5-year PFS of 62%. With 370 patients and a median follow-up of 3.5 years, we expect at
least 120 events. According to the rule of thumb recommending 10 events per variable of
interest, a multiparametric model combining 10 variables may be trained. An alternative
modeling strategy based on a penalized approach will be used to study the association between
imaging parameters and PFS. As imaging biomarker data far exceeds sample size and are
intercorrelated, penalized methods (previously used in different situations with spatial
data) will be used for the regularization and selection of variables, by encouraging the
grouping effect and select explanatory variables.
Validation cohort: 66 events are expected. With this number of events, it is possible to
detect with 80% power a hazard ratio of 0.5 between the low- and high-risk group (Logrank
test two-sided 0.05).
Demographic data: Continuous variables will be summarized by cohort using median, minimum,
maximum and number of available observations. Qualitative variables will be summarized using
counts, percentages and number of missing data. Comparison between cohort will be performed
using the chi-square test (or Fisher exact test if applicable) for qualitative variables and
the Mann-Whitney test for quantitative variables.
- - Radiomics signature development and validation: To study the association between imaging
features and PFS, we will use an alternative modeling strategy based on the penalized Cox
regression model.
Elastic Net method encourages grouping effect and selects explanatory
variables. We will perform a 10-fold cross-validation to select the best penalty parameter
lambda. The mixing parameter α, other parameter of the Elastic net method, will be set to a
default value of 0.5. The regression coefficients associated with each state of the different
variables will be used to determine the RAD score. As the outcome is time-dependent, the risk
score will be dichotomized using a time-dependent ROC curve to obtain a classifier (risk
groups: poor vs.#46;intermediate vs.#46;good prognosis). The optimal threshold has been determined to
obtain at 3 years a sensitivity of at least 80% and the best specificity.
An interval validation will be performed using a leave k-out cross-validation. We will then
apply the RAD-score signature to the validation cohort. Performances of the score will be
evaluated using different criteria: monotonicity of the prognostic stratification assessed
using Kaplan-Meier survival curves and Hazard Ratio (HR) estimation, discrimination evaluated
by Harrell's C-Index and the D statistics, quality of the model and the proportion of
explained variation estimated by the Akaike Information Criterion (AIC) and the RD²
statistic, respectively. The ability of prognostic score to identify patients at high risk of
relapse during the first 3 years will be evaluated by determining the sensitivity,
specificity and predictive value using a time-dependent ROC curve.
- - RADClin-Score development and validation: To incorporate clinical and imaging informations,
we will combine both low-dimensional clinical and high-dimensional imaging data in a global
prediction model.
First, we will completely ignore the imaging features. A Cox model will be
fitted to identify clinical covariates associated with PFS. We will build a clinical linear
predictor using a regression coefficient. Next, we will use Elastic Net with the clinical
linear predictor in offset to identify imaging features. Finally, we will compute the
RADClin-Score using a regression coefficient. We will plot a Venn diagram to compare feature
selection with the RADClin-Score and the RAD-Score. We will compare the model using the
different criteria previously presented.