Style is an integral part of natural language in written, spoken or machine generated forms. Humans have been dealing with style in language since the beginnings of language itself, but computers and machine processes have only recently begun to process natural language styles. Automatic processing of styles poses two interrelated challenges: classification and transformation. There have been recent advances in corpus classification, automatic clustering and authorship attribution along many dimensions but little work directly related to writing styles directly and even less in transformation. In this paper we examine relevant literature to define and operationalize a notion of “style” which we employ to designate style markers usable in classification machines. A measurable reading of these markers also helps guide style transformation algorithms. We demonstrate the concept by showing a detectable stylistic shift in a sample piece of text relative to a target corpus. We present ongoing work in building a comprehensive style recognition and transformation system and discuss our results.
Content
Author and article information
Contributors
Foaad Khosmood
Robert A. Levinson
Conference
Publication date:
October
2008
Publication date
(Print):
October
2008
Pages: 1-11
Affiliations
[0001]University of California Santa Cruz
Department of Computer Science, 1156 High St, Santa Cruz, CA 95064, USA