Abstract
In this paper we detail a multinomial classification methodology that combines different algorithms (SVM and MLP) with document representations (TF-IDF vectorization and Doc2Vec embedding) and that (i) distinguishes crime-related news from non-crime-related news and (ii) assigns each crime-related news item to its corresponding crime type. The MLP with Doc2Vec approach achieves an F1-score of 84%, which indicates that it is possible to answer the question of how crimes are committed (what types of crime are perpetrated) and, in this way, to offer citizens a thermometer of criminal activity in a given territory, as reported by news articles.
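The idea of treating "non-crime" as one more class next to the crime types, representing each article as a TF-IDF vector, and scoring the result with an F1 measure can be illustrated with a minimal pure-Python sketch. Everything in it is an assumption made for illustration: the toy sentences, the label names (`theft`, `homicide`, `non-crime`), the whitespace tokenizer, the smoothed IDF formula, and in particular the nearest-centroid classifier, which is a simple stand-in for the SVM and MLP models named in the abstract. Doc2Vec embeddings are not shown.

```python
import math
from collections import Counter, defaultdict

def tokenize(text):
    # Naive whitespace tokenizer (assumption; real news text needs more cleaning).
    return text.lower().split()

def fit_idf(docs):
    # Smoothed inverse document frequency for every term seen in training.
    n = len(docs)
    df = Counter(term for doc in docs for term in set(tokenize(doc)))
    return {term: math.log((1 + n) / (1 + count)) + 1.0 for term, count in df.items()}

def tfidf(doc, idf):
    # L2-normalised sparse TF-IDF vector; terms unseen in training are ignored.
    tf = Counter(t for t in tokenize(doc) if t in idf)
    vec = {t: c * idf[t] for t, c in tf.items()}
    norm = math.sqrt(sum(v * v for v in vec.values())) or 1.0
    return {t: v / norm for t, v in vec.items()}

def fit_centroids(docs, labels, idf):
    # Mean TF-IDF vector per class (stand-in for training an SVM or MLP).
    sums = defaultdict(lambda: defaultdict(float))
    counts = Counter(labels)
    for doc, label in zip(docs, labels):
        for t, v in tfidf(doc, idf).items():
            sums[label][t] += v
    return {lab: {t: v / counts[lab] for t, v in vec.items()} for lab, vec in sums.items()}

def predict(doc, idf, centroids):
    # Pick the class whose centroid has the largest dot product with the document.
    vec = tfidf(doc, idf)
    def score(label):
        centroid = centroids[label]
        return sum(v * centroid.get(t, 0.0) for t, v in vec.items())
    return max(sorted(centroids), key=score)

def macro_f1(y_true, y_pred):
    # Unweighted mean of the per-class F1 scores.
    scores = []
    for label in sorted(set(y_true)):
        tp = sum(t == label and p == label for t, p in zip(y_true, y_pred))
        fp = sum(t != label and p == label for t, p in zip(y_true, y_pred))
        fn = sum(t == label and p != label for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)

# Hypothetical toy corpus: two crime types plus a non-crime class.
train_docs = [
    "police arrested a man for stealing a car",
    "thieves robbed a jewelry store downtown",
    "a man was shot and killed last night",
    "detectives investigate the murder of a woman",
    "the city council approved a new budget",
    "local team wins the championship game",
]
train_labels = ["theft", "theft", "homicide", "homicide", "non-crime", "non-crime"]

idf = fit_idf(train_docs)
centroids = fit_centroids(train_docs, train_labels, idf)
train_preds = [predict(doc, idf, centroids) for doc in train_docs]
print(predict("a woman was killed", idf, centroids))
print(macro_f1(train_labels, train_preds))
```

A single multinomial model of this shape answers both questions at once: any prediction other than `non-crime` marks the article as crime-related and names its type. Scores on a six-sentence toy corpus say nothing about the 84% reported above.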
| Original language | English |
|---|---|
| Title of host publication | Advances in information and communication |
| Subtitle of host publication | Proceedings of the 2019 Future of Information and Communication Conference (FICC) |
| Editors | Kohei Arai, Rahul Bhatia |
| Place of Publication | Springer, Cham |
| Publisher | Springer Verlag |
| Pages | 725-741 |
| Number of pages | 17 |
| Volume | 1 |
| ISBN (Electronic) | 9783030123888 |
| ISBN (Print) | 9783030123871 |
| State | E-pub ahead of print - 2 Feb 2019 |
| Event | Future of Information and Communication Conference (FICC) 2019, San Francisco, United States. Duration: 14 Mar 2019 → 15 Mar 2019. https://saiconference.com/Conferences/FICC2019 |
Publication series
| Name | Lecture Notes in Networks and Systems |
|---|---|
| Volume | 69 |
Conference
| Conference | Future of Information and Communication Conference (FICC) 2019 |
|---|---|
| Country/Territory | United States |
| City | San Francisco |
| Period | 14/03/19 → 15/03/19 |
Bibliographical note
Publisher Copyright: © Springer Nature Switzerland AG 2020.
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
- SDG 16 Peace, Justice and Strong Institutions
- SDG 17 Partnerships for the Goals
Keywords
- Classification
- Crime analysis
- Text mining
- Word vectorization and embeddings
Fingerprint
Dive into the research topics of 'Crime alert! crime typification in news based on text mining'. Together they form a unique fingerprint.