Natural Language Processing and Cognitive Science by Michael Zock, Mai Yanagimura, Elena Yagunova, Svetlana Volskaya, Jordi Turmo, Susanna Tron, Rocco Tripodi, Radu Topor, Ludovic Tanguy, David Suendermann-Oeft, Elizabeth Rosenfeld, Horacio Rodriguez, Livio Robaldo, Solange O. Rezende, Michaela Regneri, Lidia Pivovarova, Marcello Pelillo, Catherine Pelachaud, Thiago S. Pardo, Ali M. Naderi, Sachiyo Muranishi, Michael Muck, Martin Mory, Francois Morlane-Hondere, Tarek Mehrez, Wieslaw Lubaszewski, Patrick Lange, Shingo Kuroiwa, Manfred Klenner, Diane King, Mohamed Hamed Kholief, Alexei V. Ivanov, Yasuo Horiuchi, Nora Hollenstein, Myriam Hernandez Alvarez, Jiri Havelka, Nabil Hathout, Marcin Hareza, Jose M. Gomez, Emiliano Giovannetti, Nadine Glas, Daniela Gifu, Izabela Gatkowska, Jean-Gabriel Ganascia, Daisuke Furukawa, Richard Frost, Cécile Fabre, Ahmed Magdy Ezzeldin, Yasser El-Sonbaty, Luigi Di Caro, Matthew Crocker, Dan Cristea, Jan Curin, Conrado S. Merley, Pawel Chrzazcz, Jesus Calvillo, Mohamed Amine Boukhaled, Guido Boella, Jared Bernstein, Alessia Bellusci, Andrea Bellandi, Raimo Bakis, Eniafe Festus Ayetiran, Michael Amsler, Nabil Abdullah, César Aguilar, Olga Acosta, Rodolfo Delmonte, Bernadette Sharp

Mohamed Amine Boukhaled and Jean-Gabriel Ganascia

Using Function Words for Authorship Attribution: Bag-Of-Words vs. Sequential Rules

Abstract: Authorship attribution is the task of identifying the author of a given document. Various style markers have been proposed in the literature for this task, and frequencies of function words have been shown to be very reliable and effective. However, although they are state of the art, they rely on the invalid bag-of-words assumption, which stipulates that a text is a set of independent words. In this contribution, we present a comparative study of two different types of style markers based on function words for authorship attribution. ...
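To make the bag-of-words assumption concrete, the following is a minimal Python sketch, not the authors' implementation. The function-word list is a hypothetical, deliberately tiny one, and the names `tokenize`, `function_word_frequencies` and `function_word_sequence` are introduced here only for illustration. It shows that relative frequencies of function words are blind to word order, whereas the ordered sequence of function words still carries that information.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny function-word list (an assumption for
# illustration; it is not taken from the chapter).
FUNCTION_WORDS = ("the", "of", "and", "to", "a", "in", "that", "it")


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z']+", text.lower())


def function_word_frequencies(text):
    """Bag-of-words marker: relative frequency of each function word.

    Word order is discarded; only counts over the whole text are kept.
    """
    tokens = tokenize(text)
    counts = Counter(t for t in tokens if t in FUNCTION_WORDS)
    total = len(tokens) or 1  # avoid division by zero on empty input
    return {w: counts[w] / total for w in FUNCTION_WORDS}


def function_word_sequence(text):
    """Order-preserving view: the function words in the order they occur."""
    return [t for t in tokenize(text) if t in FUNCTION_WORDS]


if __name__ == "__main__":
    a = "the cat and the dog"
    b = "the dog the cat and"
    # Same bag of function words, different order.
    print(function_word_frequencies(a) == function_word_frequencies(b))
    print(function_word_sequence(a), function_word_sequence(b))
```

The two example strings contain the same words in a different order, so their frequency vectors coincide while their function-word sequences differ. Any order-sensitive marker would have to be built on something like the second representation; how the chapter actually derives its sequential rules is not shown in this excerpt.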
