An online survival analysis tool to rapidly assess the effect of 22,277 genes on breast cancer prognosis using microarray data of 1,809 patients.
read more
Citations
Cutoff Finder: A Comprehensive and Straightforward Web Application Enabling Rapid Biomarker Cutoff Optimization
Chromosomal instability drives metastasis through a cytosolic DNA response.
Single-cell analysis reveals a stem-cell program in human metastatic breast cancer cells.
Prolyl-4-hydroxylase α subunit 2 promotes breast cancer progression and metastasis by regulating collagen deposition
Implementing an online tool for genome-wide validation of survival-associated biomarkers in ovarian-cancer using microarray data from 1287 patients.
References
Nonparametric Estimation from Incomplete Observations
A Multigene Assay to Predict Recurrence of Tamoxifen-Treated, Node-Negative Breast Cancer
affy---analysis of Affymetrix GeneChip data at the probe level
Gene-expression profiles to predict distant metastasis of lymph-node-negative primary breast cancer.
Gene Expression and Benefit of Chemotherapy in Women With Node-Negative, Estrogen Receptor–Positive Breast Cancer
Related Papers (5)
Comprehensive molecular portraits of human breast tumours
Molecular portraits of human breast tumours
Frequently Asked Questions (14)
Q2. What are the future works mentioned in the paper "An online survival analysis tool to rapidly assess the effect of 22,277 genes on breast cancer prognosis using microarray data of 1,809 patients" ?
As their service performs the requested analysis in real time on the original data, the extension of the analysis ( e. g., the inclusion of additional samples or filtering for other clinical parameters ) will be easily feasible in the future. They suggested using an ESR1 mRNA cutoff value of 500 to identify ER positive status with an overall accuracy of 90 %. Therefore, the authors suggest the use of above prognostic genes as measured using microarrays. The integrative genomic analysis is still evolving ; thus future integration of additional forms of data such as sequence, location, or copy number variations might potentially add vital additional information which will enable us to deliver higher accuracy in prognosis prediction.
Q3. What is the advantage of using the median for splitting?
The advantage of the use of the median for splitting is the negligible effect of outliers, which—due to the high dynamic range of the microarrays—could seriously skew the results when using the mean.
Q4. What is the ER status of the patients?
Since gene expression arrays might be used to confirm ER status, the authors implemented an estimation of ER status based on gene expression data.
Q5. How many patients have free survival data?
1,593 patients have relapsed free survival data, 594 have overall survival data and 767 have distant metastasis free survival data.
Q6. What is the definition of a biomarker?
With the introduction of biomarkers such as estrogen receptor and HER2 in evaluating the clinical course of breast cancer, biomarker discovery has shifted toward a more molecular level with a large number of individual gene or protein expression levels being tested.
Q7. What is the importance of using nearly identical platforms?
The use of nearly identical platforms is important since different platforms for gene-expression profiling measure expression of the same gene with varying precision, on different relative scales, and with different dynamic ranges [11].
Q8. What is the benefit of the Kaplan–Meier curve?
An important benefit of the Kaplan–Meier curve is that the method takes into account ‘‘censored’’ data— losses from the cohort before the final outcome is observed (for instance, if a patient withdraws from a study).
Q9. How long is the median relapse free survival?
The median relapse free survival is 6.43 years, 968/ 1,231 patients are estrogen-receptor positive by histological or radioimmunoassay based evaluation, and 190/1,369 are lymph-node positive.
Q10. What is the sizing of the data?
As their service performs the requested analysis in real time on the original data, the extension of the analysis (e.g., the inclusion of additional samples or filtering for other clinical parameters) will be easily feasible in the future.
Q11. What is the way to assess the prognostic value of the markers?
In principle, a cutoff-free correlation analysis of gene expression and survival data is possible using Cox proportional hazard models.
Q12. What is the limitation of the approach?
The authors must note a limitation of their approach: the use of the median (or upper/lower quartile) sample for dividing the samples into high- and low-expression groups.
Q13. What was the MAS5 filter used to make the analysis?
a randomization algorithm-selected set of patients of similar, over-represented clinical characteristics were removed in making an additional filter for the analysis.
Q14. what is the eference of a nth es?
NA 15 9[3 3]T ota l9 68 (78 %) 19 0(1 5% )6 89 (43 %) 6.4 ±4 .11 98 /53 4/3 12 57 ±1 32 .2± 1.1 1,8 09P are nth eses :p erce nta ge of pat ien tsw ith inth ed atas etdetermination on the microarray.