Corrado Monti

Wikipedia _ [all topics]

2 papers found.

Cleansing Wikipedia Categories Using Centrality

Included in my PhD thesis.

Paolo Boldi and Corrado Monti.

Proceedings of the 25th International Conference Companion on World Wide Web, ACM 2016.

Link | PDF | GitHub

Collaborative knowledge systems evolve through messy, user-generated hierarchies. This study proposes a centrality-based pruning method that cleanses the Wikipedia category network by identifying structural redundancies and inconsistencies. By relying solely on endogenous information, it demonstrates how collective curation and algorithmic structure can be combined to improve the organization of open knowledge — work that later became the basis for a Wikipedia-based benchmark for graph neural networks, now widely used in research.

Learning Latent Category Matrix to Find Unexpected Relations in Wikipedia

Included in my PhD thesis.

Paolo Boldi and Corrado Monti.

Proceedings of the 8th ACM Conference on Web Science, (WebSci2016), ACM 2016.

Link | PDF | GitHub

Discovering non-obvious relations in knowledge systems requires models that go beyond surface similarity. This paper presents an online margin-based learning algorithm that infers a latent matrix of category interactions to uncover hidden connections within Wikipedia’s hyperlink structure. The method efficiently scales to large graphs, revealing how semantic structures and user-generated organization jointly shape information discovery.