z-logo
open-access-imgOpen Access
Visualization in stylometry: Cluster analysis using networks
Author(s) -
Maciej Eder
Publication year - 2015
Publication title -
digital scholarship in the humanities
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.4
H-Index - 15
eISSN - 2055-768X
pISSN - 2055-7671
DOI - 10.1093/llc/fqv061
Subject(s) - stylometry , visualization , cluster analysis , computer science , hierarchical clustering , data mining , dendrogram , cluster (spacecraft) , network analysis , tree (set theory) , information visualization , artificial intelligence , pattern recognition (psychology) , mathematics , engineering , mathematical analysis , population , demography , sociology , genetic diversity , programming language , electrical engineering
The aim of this article is to discuss reliability issues of a few visual techniques used in stylometry, and to introduce a new method that enhances the explanatory power of visualization with a procedure of validation inspired by advanced statistical methods. A promising way of extending cluster analysis dendrograms with a self-validating procedure involves producing numerous particular ‘snapshots’, or dendrograms produced using different input parameters, and combining them all into the form of a consensus tree. Significantly better results, however, can be obtained using a new visualization technique, which combines the idea of nearest neighborhood derived from cluster analysis, the idea of hammering out a clustering consensus from bootstrap consensus trees, with the idea of mapping textual similarities onto a form of a network. Additionally, network analysis seems to be a good solution for large data sets.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom