Autores
Fócil Arias Carolina
Sidorov Grigori
Gelbukh Alexander
Sánchez Pérez Miguel Ángel
Título A Comparative Analysis of Learning Techniques for Cancer Risk Prediction based on Medical Textual Records
Tipo Revista
Sub-tipo Indefinido
Descripción Research in Computing Science
Resumen In this paper, we compare the performance of a variety of machine learning algorithms, including supervised Naïve Bayes, J48, SVM, Random Tree, Random Forest, and non-supervised KNN for determining the type of cancer a patient is suffering using medical textual records. We train these classifiers on different sets of features such as unigrams and bigrams of words, character n-grams using tf-idf weighting scheme and binary feature representation. We evaluated performance of the classifers in terms of accuracy, precision, recall, and F-measure. The obtained results show that Naïve Bayes and SVM achieve the best performance in this task.
Observaciones
Lugar Ciudad de México
País Mexico
No. de páginas 77-88
Vol. / Cap. v. 130
Inicio 2016-09-19
Fin
ISBN/ISSN