Resumen
In this paper we detailed a multinomial classification-based methodology that combines different algorithms (SVM and MLP) with document representations (Tf Idf vectorization and Doc2vec embedding) and: (i) can distinguish between crime-related news and not-crime related news and; (ii) allows the assignment of each crime-related news to its corresponding crime type. With a F1-score of 84% achieved by the MLP with Doc2vec approach, it can be concluded that it is possible to answer the question of how the crimes are committed (what types of crime are perpetrated) and, in this way, offer a thermometer to citizens about criminal activity in a given territory, as reported by news articles.
Idioma original | Inglés |
---|---|
Título de la publicación alojada | Lecture Notes in Networks and Systems |
Subtítulo de la publicación alojada | Proceedings of the 2019 Future of Information and Communication Conference |
Editores | Kohei Arai, Rahul Bhatia |
Lugar de publicación | Cham |
Editorial | Springer Verlag |
Páginas | 725-741 |
Número de páginas | 17 |
ISBN (versión digital) | 9783030123888 |
ISBN (versión impresa) | 9783030123871 |
DOI | |
Estado | Publicada - 2020 |
Serie de la publicación
Nombre | Lecture Notes in Networks and Systems |
---|---|
Volumen | 69 |
Nota bibliográfica
Publisher Copyright:© Springer Nature Switzerland AG 2020.