What is hiercat?
hiercat is an automatic text classifier that uses the hierarchical structure of class labels to improve classification performance. The model it uses is that of Gaussier et al. [1]. It was originally developed as part of the AMSC 663/664 project courses at the University of Maryland, College Park [2].
Is it difficult to use?
The short answer is no. Some effort has been spent on making hiercat easy to compile and run. If you run into problems, email me, and I will try to help out.
Is it free?
hiercat is free, both as in speech and as in beer. It is released under the GNU General Public License (GPL).
Transcriptions of survivor testimonies are being produced using automatic speech recognition, and these transcriptions must then be categorized for efficient information retrieval. This project seeks to leverage the hierarchical properties of these categories (e.g., Berlin is in Germany, which is a location) to improve categorization.
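To make the idea of hierarchical categories concrete, here is a minimal Python sketch of one way such a label hierarchy could be stored and walked. It is only an illustration and not hiercat's actual code or model: the PARENT table and the ancestors function are hypothetical names, and the only fact taken from the description above is the chain Berlin, Germany, location.

```python
# Hypothetical parent table: each label maps to its parent category,
# and the root category maps to None. Only the chain
# Berlin -> Germany -> location comes from the project description.
PARENT = {
    "Berlin": "Germany",
    "Germany": "location",
    "location": None,
}


def ancestors(label, parent=PARENT):
    """Return the path from `label` up to the root, starting with `label`."""
    path = []
    while label is not None:
        path.append(label)
        label = parent[label]  # raises KeyError for an unknown label
    return path


if __name__ == "__main__":
    # A document labeled "Berlin" is implicitly also about every category on this path.
    print(ancestors("Berlin"))
```

A classifier that knows this path can, in principle, share evidence between a specific category and the broader categories above it, which is the kind of structure the project aims to exploit.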