Breast cancer is the most common malignancy disease that affects female population and the number of affected people is the second most common leading cause of cancer deaths among all cancer types in the developing countries. Mammography is the most effective method for detection of early breast cancer to increase the survival rate. This paper presented the classification method for mammogram Image using the decision tree techniques. Three measures were used to evaluate performance in terms of accuracy, sensitivity, and privacy. The aim of the study is to determine the best decision tree classifier for medical datasets classification. The study emphasizes five phases; starting with collecting images, pre-processing (image cropping of ROI), features extracting, classification and end with testing and evaluating. Experimental results show that Random Forest has a better performance than ID3, J48.
Published in | American Journal of Data Mining and Knowledge Discovery (Volume 6, Issue 2) |
DOI | 10.11648/j.ajdmkd.20210602.13 |
Page(s) | 31-35 |
Creative Commons |
This is an Open Access article, distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution and reproduction in any medium or format, provided the original work is properly cited. |
Copyright |
Copyright © The Author(s), 2021. Published by Science Publishing Group |
Mammograms, Breast Cancer, Decision Tree, Early Detection, Image Classification
[1] | Pareek, A. and S. M. Arora, Breast cancer detection techniques using medical image processing. Breast cancer, 2017. 2 (3). |
[2] | S. Punitha, S. Ravi, et al., Breast Cancer Detection using Classification Techniques in Digital Mammography: International Science Press, I J C T A, 9 (7), 2016, pp. 3123-3134, ISSN: 0974- 5572. |
[3] | Curtis, C., et al., The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups. Nature, 2012. 486 (7403): p. 346-352. |
[4] | Perou, C. M., et al., Molecular portraits of human breast tumours. Nature, 2000. 406 (6797): p. 747-752. |
[5] | Tang, J., Rangayyan, R. M., Xu, J., El Naqa, I., & Yang, Y. (2009). Computer-aided detection and diagnosis of breast cancer with mammography: recent advances. Information Technology in Biomedicine, IEEE Transactions on, 13 (2), 23. |
[6] | Smith, R. A., V. Cokkinides, and H. J. Eyre, American Cancer Society guidelines for the early detection of cancer, 2006. CA: a cancer journal for clinicians, 2006. 56 (1): p. 11-25. |
[7] | Affar, M. A., Hybrid Texture based Classification of Brea Mammograms using Ad boost Classifier. International Journal of Advanced Computer Science and Applications, 2017. 8 (5). |
[8] | Choi, J. P., T. H. Han, and R. W. Park, A hybrid bayesian network model for predicting breast cancer prognosis. Journal of Korean Society of Medical Informatics, 2009. 15 (1): p. 49-57. |
[9] | Aswinikumarmohanty, Sukantakumar swain, Pratapkumarchampati, Sarojkumarlenka, “Image Mining for Mammogram Classification by Association Rule Using Statistical and GLCM features”, IJCSI International Journal of Computer Science Issues, Vol. 8, Issue 5, No 3, 2011. |
[10] | S. Aruna, Dr S. P. Rajagopalan and L. V. Nandakishore, 2011 Knowledge Based Analysis Of Various Statistical Tools In Detecting Breast Cancer. 2011. |
[11] | Y. J. Lee, O. L. M. W. H. W. Survival -Time Classification of Breast Cancer Patients. 2008 [cited 2017; Available from: http://www.cs.wisc.edu/dmi/annrev/rev0601/uj.ppt. |
[12] | Clark, A. F. The mini-MIAS database of mammograms. |
[13] | Usha, S. and S. Arumugam, Calcification Classification in Mammograms Using Decision Trees. World Academy of Science, Engineering and Technology, International Journal of Computer, Electrical, Automation, Control and Information Engineering, 2016. 9 (9): p. 2127-2131. |
[14] | E. Frank, M. Hall, and L. Trigg, "Weka: Waikato environment for knowledge analysis," The University of Waikato, Hamilton, New Zealand, 1999. |
[15] | Mariana R. Mendoza, Guilherme C. da Fonseca, Guilherme Loss- Morais, Ronnie Alves, RogerioMargis, Ana L. C. Bazzan, “Predicting Human MicroRNA Target Genes with a Random Forest Classifier”, plos, 2013. |
[16] | J. R. Quinlan, “Induction of decision tree”. Journal of Machine Learning 1, 1986, Pg. no: 81-106. |
[17] | Zhang, Y., Zhao, Y., A Comparison of BBN, AD Tree and MLP in separating Quasars from Large Survey Catalogues, ChJAA 7, 289- 296, 2007. |
[18] | S. Usha, S. Arumugam (2015). “Calcification Classification in Mammograms Using Decision Trees” International Journal of Computer, Electrical, Automation, Control and Information Engineering Vol: 9, No: 9, 201. |
APA Style
Mohamed Alhag Alobed, Ali Ahmed, Ashraf Osman Ibrahim. (2021). Classification of Breast Cancer Image Using Data Mining Techniques. American Journal of Data Mining and Knowledge Discovery, 6(2), 31-35. https://doi.org/10.11648/j.ajdmkd.20210602.13
ACS Style
Mohamed Alhag Alobed; Ali Ahmed; Ashraf Osman Ibrahim. Classification of Breast Cancer Image Using Data Mining Techniques. Am. J. Data Min. Knowl. Discov. 2021, 6(2), 31-35. doi: 10.11648/j.ajdmkd.20210602.13
AMA Style
Mohamed Alhag Alobed, Ali Ahmed, Ashraf Osman Ibrahim. Classification of Breast Cancer Image Using Data Mining Techniques. Am J Data Min Knowl Discov. 2021;6(2):31-35. doi: 10.11648/j.ajdmkd.20210602.13
@article{10.11648/j.ajdmkd.20210602.13, author = {Mohamed Alhag Alobed and Ali Ahmed and Ashraf Osman Ibrahim}, title = {Classification of Breast Cancer Image Using Data Mining Techniques}, journal = {American Journal of Data Mining and Knowledge Discovery}, volume = {6}, number = {2}, pages = {31-35}, doi = {10.11648/j.ajdmkd.20210602.13}, url = {https://doi.org/10.11648/j.ajdmkd.20210602.13}, eprint = {https://article.sciencepublishinggroup.com/pdf/10.11648.j.ajdmkd.20210602.13}, abstract = {Breast cancer is the most common malignancy disease that affects female population and the number of affected people is the second most common leading cause of cancer deaths among all cancer types in the developing countries. Mammography is the most effective method for detection of early breast cancer to increase the survival rate. This paper presented the classification method for mammogram Image using the decision tree techniques. Three measures were used to evaluate performance in terms of accuracy, sensitivity, and privacy. The aim of the study is to determine the best decision tree classifier for medical datasets classification. The study emphasizes five phases; starting with collecting images, pre-processing (image cropping of ROI), features extracting, classification and end with testing and evaluating. Experimental results show that Random Forest has a better performance than ID3, J48.}, year = {2021} }
TY - JOUR T1 - Classification of Breast Cancer Image Using Data Mining Techniques AU - Mohamed Alhag Alobed AU - Ali Ahmed AU - Ashraf Osman Ibrahim Y1 - 2021/11/25 PY - 2021 N1 - https://doi.org/10.11648/j.ajdmkd.20210602.13 DO - 10.11648/j.ajdmkd.20210602.13 T2 - American Journal of Data Mining and Knowledge Discovery JF - American Journal of Data Mining and Knowledge Discovery JO - American Journal of Data Mining and Knowledge Discovery SP - 31 EP - 35 PB - Science Publishing Group SN - 2578-7837 UR - https://doi.org/10.11648/j.ajdmkd.20210602.13 AB - Breast cancer is the most common malignancy disease that affects female population and the number of affected people is the second most common leading cause of cancer deaths among all cancer types in the developing countries. Mammography is the most effective method for detection of early breast cancer to increase the survival rate. This paper presented the classification method for mammogram Image using the decision tree techniques. Three measures were used to evaluate performance in terms of accuracy, sensitivity, and privacy. The aim of the study is to determine the best decision tree classifier for medical datasets classification. The study emphasizes five phases; starting with collecting images, pre-processing (image cropping of ROI), features extracting, classification and end with testing and evaluating. Experimental results show that Random Forest has a better performance than ID3, J48. VL - 6 IS - 2 ER -