Increase the Accuracy of Detection of Pathogenic Genes of Breast Cancer using a Graph-Based Approach to the Gene Prioritization Problem

Authors

  • Mohammed Thajeel Abdullah Karbala Technical Institute, Al-Furat Al-Awsat Technical University

DOI:

https://doi.org/10.31185/wjps.185

Abstract

Cancer is one of the most common causes of mortality today. This disease's complications impose many costs on the human community's health, care, and well-being sectors. Solving complex biological problems requires advanced computational methods, and bioinformatics was created to solve such complex problems with the active interaction of several fields of science. Bioinformatics is an interdisciplinary science combining biological sciences, computers, mathematics, and statistics. The issue investigated in this research deals with one of the challenging issues in bioinformatics, namely candidate gene prioritization in breast cancer. Gene prioritization means sorting genes based on their relevance to a specific disease, such as breast cancer. Finally, the genes are checked according to their importance in performing costly experiments. The proposed approach in this research is to present a method based on graph mining for prioritizing genes. The study conducted with ENDEAOUR and DIR methods was compared and evaluated. The evaluation results show that the designed method is more efficient than other compared methods.

References

T. Ching, D.S. Himmelstein, B.K. Beaulieu-Jones, A.A. Kalinin, B.T. Do, G.P. Way, E. Ferrero, P.M. Agapow, M. Zietz, M.M. Hoffman, and W. Xie, "Opportunities and ob-stacles for deep learning in biology and medicine". Journal of The Royal Society Inter-face, 15(141), 2018. p.20170387.

M.N. Weedon, L. Jackson, J.W. Harrison, K.S. Ruth, J. Tyrrell, A.T. Hattersley and C.F. Wright, "Use of SNP chips to detect rare pathogenic variants: retrospective", population based diagnostic evalua-tion. bmj, 372. 2021.

S. Erten and M. Koyutürk, "Role of centrality in network-based prioritization of disease genes". In Evolutionary Computation, Machine Learning and Data Mining in Bioin-formatics: 8th European Conference, EvoBIO 2010, Istanbul, Turkey, April 7-9, 2010. Proceedings 8 (pp. 13-25). Springer Berlin Heidelberg. 2010.

A.W. Kurian, K.C. Ward, A.S. Hamilton, D.M. Deapen, P. Abrahamse, I. Bondarenko, Y. Li, S.T. Hawley, M. Morrow, R. Jagsi, and S.J. Katz, "Uptake, results, and outcomes of germline multiple-gene sequencing after diagnosis of breast cancer". JAMA oncolo-gy, 4(8), 2018. pp.1066-1072.

Mu, W., Li, B., Wu, S., Chen, J., Sain, D., Xu, D., Black, M.H., Karam, R., Gillespie, K., Hagman, K.D.F. and Guidugli, L., 2019. Detection of structural variation using tar-get captured next-generation sequencing data for genetic diagnostic testing. Genetics in Medicine, 21(7), pp.1603-1610.

Nicolosi, P., Ledet, E., Yang, S., Michalski, S., Freschi, B., O’Leary, E., Esplin, E.D., Nussbaum, R.L. and Sartor, O., 2019. Prevalence of germline variants in prostate can-cer and implications for current genetic testing guidelines. JAMA oncology, 5(4), pp.523-528.

Feng, Y., Spezia, M., Huang, S., Yuan, C., Zeng, Z., Zhang, L., Ji, X., Liu, W., Huang, B., Luo, W. and Liu, B., 2018. Breast cancer development and progression: Risk fac-tors, cancer stem cells, signaling pathways, genomics, and molecular pathogene-sis. Genes & diseases, 5(2), pp.77-106.

Pan, X., Hu, X., Zhang, Y.H., Chen, L., Zhu, L., Wan, S., Huang, T. and Cai, Y.D., 2019. Identification of the copy number variant biomarkers for breast cancer sub-types. Molecular Genetics and Genomics, 294, pp.95-110.

Yadav, S. and Couch, F.J., 2019. Germline genetic testing for breast cancer risk: the past, present, and future. American Society of Clinical Oncology Educational Book, 39, pp.61-74.

Kurian, A.W., Ward, K.C., Abrahamse, P., Bondarenko, I., Hamilton, A.S., Deapen, D., Morrow, M., Berek, J.S., Hofer, T.P. and Katz, S.J., 2021. Time trends in receipt of germline genetic testing and results for women diagnosed with breast cancer or ovarian cancer, 2012-2019. Journal of Clinical Oncology, 39(15), p.1631.

Chen, L., Zeng, T., Pan, X., Zhang, Y.H., Huang, T. and Cai, Y.D., 2019. Identifying methylation pattern and genes associated with breast cancer subtypes. International journal of molecular sciences, 20(17), p.4269.

Landrith, T., Li, B., Cass, A.A., Conner, B.R., LaDuca, H., McKenna, D.B., Maxwell, K.N., Domchek, S., Morman, N.A., Heinlen, C. and Wham, D., 2020. Splicing profile by capture RNA-seq identifies pathogenic germline variants in tumor suppressor genes. NPJ precision oncology, 4(1), p.4.

Yadav, S., Hu, C., Hart, S.N., Boddicker, N., Polley, E.C., Na, J., Gnanaolivu, R., Lee, K.Y., Lindstrom, T., Armasu, S. and Fitz-Gibbon, P., 2020. Evaluation of germline ge-netic testing criteria in a hospital-based series of women with breast cancer. Journal of Clinical Oncology, 38(13), p.1409.

Federici, G. and Soddu, S., 2020. Variants of uncertain significance in the era of high-throughput genome sequencing: a lesson from breast and ovary cancers. Journal of Ex-perimental & Clinical Cancer Research, 39, pp.1-12.

Weitzel, J.N., Neuhausen, S.L., Adamson, A., Tao, S., Ricker, C., Maoz, A., Rosenblatt, M., Nehoray, B., Sand, S., Steele, L. and Unzeitig, G., 2019. Pathogenic and likely pathogenic variants in PALB2, CHEK2, and other known breast cancer susceptibility genes among 1054 BRCA‐negative Hispanics with breast cancer. Cancer, 125(16), pp.2829-2836.

Fine, R.S., Pers, T.H., Amariuta, T., Raychaudhuri, S. and Hirschhorn, J.N., 2019. Benchmarker: an unbiased, association-data-driven strategy to evaluate gene prioritiza-tion algorithms. The American Journal of Human Genetics, 104(6), pp.1025-1039.

Chen, J., Bardes, E.E., Aronow, B.J. and Jegga, A.G., 2009. ToppGene Suite for gene list enrichment analysis and candidate gene prioritization. Nucleic acids re-search, 37(suppl_2), pp.W305-W311.

Seelow, D., Schwarz, J.M. and Schuelke, M., 2008. GeneDistiller—distilling candidate genes from linkage intervals. PloS one, 3(12), p.e3874.

Nitsch, D., Tranchevent, L.C., Goncalves, J.P., Vogt, J.K., Madeira, S.C. and Moreau, Y., 2011. PINTA: a web server for network-based gene prioritization from expression data. Nucleic acids research, 39(suppl_2), pp.W334-W338.

Tranchevent, L.C., Ardeshirdavani, A., ElShal, S., Alcaide, D., Aerts, J., Auboeuf, D. and Moreau, Y., 2016. Candidate gene prioritization with Endeavour. Nucleic acids re-search, 44(W1), pp.W117-W121.

Trifu, S.C., Vlăduţi, A. and Trifu, A.I., 2020. Genetic aspects in schizophrenia. Recep-toral theories. Metabolic theories. Romanian journal of morphology and embryolo-gy, 61(1), p.25.

Azadifar, S. and Ahmadi, A., 2022. A novel candidate disease gene prioritization method using deep graph convolutional networks and semi-supervised learning. BMC bioinformatics, 23(1), p.422.

Hutz, J.E., Kraja, A.T., McLeod, H.L. and Province, M.A., 2008. CANDID: a flexible method for prioritizing candidate genes for complex human traits. Genetic Epidemiolo-gy: The Official Publication of the International Genetic Epidemiology Society, 32(8), pp.779-790.

Danis, D., Jacobsen, J.O., Carmody, L.C., Gargano, M.A., McMurry, J.A., Hegde, A., Haendel, M.A., Valentini, G., Smedley, D. and Robinson, P.N., 2021. Interpretable pri-oritization of splice variants in diagnostic next-generation sequencing. The American Journal of Human Genetics, 108(9), pp.1564-1577.

Downloads

Published

2023-09-30

Issue

Section

Computer

How to Cite

Mohammed Thajeel Abdullah. (2023). Increase the Accuracy of Detection of Pathogenic Genes of Breast Cancer using a Graph-Based Approach to the Gene Prioritization Problem. Wasit Journal for Pure Sciences , 2(3), 139-151. https://doi.org/10.31185/wjps.185