Authors: Dora Kiesel, Patrick Riehmann, Fan Fan, Yamen Ajjour, Henning Wachsmuth, Benno Stein, Bernd Froehlich
Abstract: Topic modeling algorithms such as Latent Dirichlet Allocation (LDA) typically represent documents as a weighted combination of topics. Generalized barycentric coordinates are therefore a natural fit for visualizing a topic space. However, spatial positions in a planar barycentric coordinate system are ambiguous for more than three coordinates. Our document glyphs, combined with layout guidelines, help to reduce this positional ambiguity. As the number of documents grows, barycentric coordinate embeddings suffer from overplotting and visual clutter like other embeddings, possibly even more so because document positions are fully independent of each other. Our experiments with jittering, aggregating glyphs, and grids show potential to reduce these problems for barycentric and other layouts.
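
The positional ambiguity mentioned in the abstract can be illustrated with a minimal sketch. It assumes that topics are anchored at the vertices of a regular polygon and that a document is placed at the weighted average of those anchors; the abstract does not specify the layout, and the names `polygon_vertices`, `embed`, and `same_point` are hypothetical helpers introduced only for this illustration. With four topics, three different topic mixtures land on the same point, whereas with three topics distinct mixtures stay distinct.

```python
import math

def polygon_vertices(n, radius=1.0):
    """Place n topic anchors evenly on a circle (assumed layout, not from the paper)."""
    return [(radius * math.cos(2 * math.pi * k / n),
             radius * math.sin(2 * math.pi * k / n)) for k in range(n)]

def embed(weights, anchors):
    """Map a topic-weight vector to 2D as the weighted average of the anchors."""
    total = sum(weights)
    x = sum(w * ax for w, (ax, _) in zip(weights, anchors)) / total
    y = sum(w * ay for w, (_, ay) in zip(weights, anchors)) / total
    return (x, y)

def same_point(p, q, tol=1e-9):
    """Compare two 2D points up to floating-point tolerance."""
    return (math.isclose(p[0], q[0], abs_tol=tol)
            and math.isclose(p[1], q[1], abs_tol=tol))

# Four topics: anchors form a square, so opposite anchors cancel out.
square = polygon_vertices(4)
pos_a = embed([0.5, 0.0, 0.5, 0.0], square)      # topics 0 and 2 only
pos_b = embed([0.0, 0.5, 0.0, 0.5], square)      # topics 1 and 3 only
pos_c = embed([0.25, 0.25, 0.25, 0.25], square)  # uniform mixture

# Three topics: anchors form a triangle, where weights are unique per position.
triangle = polygon_vertices(3)
tri_a = embed([0.5, 0.5, 0.0], triangle)
tri_b = embed([0.5, 0.0, 0.5], triangle)

print(same_point(pos_a, pos_b), same_point(pos_a, pos_c))  # ambiguous for 4 topics
print(same_point(tri_a, tri_b))                            # distinct for 3 topics
```

Because all three four-topic documents coincide at the center of the square, position alone cannot tell them apart, which is the gap that additional visual encoding such as glyphs is meant to close.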