This paper proposes a line of research for developing new ways of automatically characterizing groups of documents, whether clusters or categories. It builds on the work of many other researchers and summarizes the most problematic issues in category labelling in order to devise possible solutions. Several lines of action are described, along with future research directions and developments.
Author and article information
Rodrigo Sánchez Jiménez
Dpto. de Biblioteconomía y Documentación UCM.
Facultad de Ciencias de la Información
Avda. Complutense, s/n. (Moncloa)
28040 Madrid, SPAIN