Autores
De Jesús Sánchez Sara
Aguirre Anaya Eleazar
Calvo Castro Francisco Hiram
Coyac Torres Jorge Enrique
Acosta Bermejo Raúl
Título ICIS: A Model for Context-Based Classification of Sensitive Personal Information
Tipo Congreso
Sub-tipo Memoria
Descripción 12th International Congress in Telematics and Computing, WITCOM 2023
Resumen Sensitive personal information is at risk of exposure by the institutions it is shared. Institutions are responsible for preserving the privacy of the personal data they hold, even more so, in the case of sensitive data. This paper shows the design of ICIS, a model that considers the context to identify 55 personal data types in unstructured texts of government type documents, regardless the size and type, and then classify each text segment as sensitive personal information, using natural language processing and machine learning techniques. ICIS not only indicates whether a text segment contains sensitive information or not, it also indicates personal data identified in each text segment, their location in the document and whether each text segment is classified as sensitive information. The main contributions of this work are both the identification of personal data and the classification of sensitive information based on the context, and the definition of sensitive personal information, in computational terms. © 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.
Observaciones DOI 10.1007/978-3-031-45316-8_28 Communications in Computer and Information Science, v. 1906
Lugar Puerto Vallarta
País Mexico
No. de páginas 445-459
Vol. / Cap. v. 1906 CCSI
Inicio 2023-11-13
Fin 2023-11-17
ISBN/ISSN 9783031453151