LEXICOMETRIE version O
LEXICOMETRIE version O [Electronic resource]
Encoding format: Unknown markup
Title proper taken from AHDS Catalogue Form
I. Political discourses in French:
- Charles de Gaulle. 79 texts (titles: DG + rank number): all the TV-broadcast speeches and press conferences between June 1958 and March 1959 (201 927 words);
- Francois Mitterrand. 68 texts (titles: FM + year/month/day): all the TV-broadcast speeches and press conferences between July 1981 and March 1987 (305 215 words);
- Canadian Political Discourse. 47 texts (titles: Ctrone + year): all the "speeches of the Throne" by the Canadian Prime Ministers between 1945 and 2000 (147 267 words);
- Quebec Political Discourse. 48 texts (titles: Qtrone + year): all the "speeches of the Throne" by the Quebec Prime Ministers between 1945 and 2000 (204 212 words);
- French Political Discourse. 48 texts (titles: Fdecla + year): all the "declarations d'investiture" by French Prime Ministers between 1945 and 1997 (258 779 words).

II. French Theatre of the XVIIth century:
- Corneille. All 34 plays by Pierre Corneille. The title of each file begins with the number of the text in the electronic archives of the Institut National de la Langue Francaise (INaLF), followed by the main word in the title of the play. NB: the INaLF number is chronological. For example, "300Melite" is the first play by Corneille (553 190 words);
- Moliere. All 32 plays by Jean-Baptiste Poquelin-Moliere. Title: INaLF archive number + main word in the title of the play. NB: the INaLF number is chronological (364 963 words);
- Racine. All 12 plays by Jean Racine. Title: INaLF archive number + main word in the title of the play. NB: the INaLF number is chronological (166 626 words).

III. Corpus Brunet. 50 excerpts from 11 French authors (Balzac, Chateaubriand, Flaubert, Marivaux, Maupassant, Proust, Rousseau, Sand, Verne, Voltaire, Zola) chosen by Prof. Etienne Brunet for a "double blind" experiment in authorship attribution (436 748 words).

IV. Documentation. Articles and various papers concerning these corpora and/or the methods used by the authors are collected in a file entitled "Documentation".

V. Tools. Ten software packages for data analysis of these tagged corpora are contained in the file "Outils". Directions for use of each package are given in a file whose title ends with "…Lisez_moi".
For a complete list of related publications on this topic, see the file 'LabbeTravaux.pdf' (in the "Documentation" file).
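The title conventions described in this record (a corpus prefix followed by a rank number or date for the political texts, an INaLF number followed by a word for the plays) could be used to sort files by corpus. The following is only a minimal sketch in Python, not part of the resource. The function name `classify_title` is hypothetical. The sketch assumes that the part after each political prefix consists of digits only and that file extensions have already been removed; the record does not specify either point, so the patterns may need adjusting against the actual files.

```python
import re

# Prefix -> corpus label, as listed in the catalogue record.
PREFIXES = {
    "DG": "Charles de Gaulle",
    "FM": "Francois Mitterrand",
    "Ctrone": "Canadian Political Discourse",
    "Qtrone": "Quebec Political Discourse",
    "Fdecla": "French Political Discourse",
}

# ASSUMPTION: the rank number, date or year after the prefix is digits only.
PREFIXED = re.compile(r"^(DG|FM|Ctrone|Qtrone|Fdecla)(\d+)$")

# Theatre titles: INaLF archive number, then the main word of the play title.
THEATRE = re.compile(r"^(\d+)([^\W\d_]\w*)$")


def classify_title(title):
    """Return a dict describing the title, or None if no convention matches."""
    m = PREFIXED.match(title)
    if m:
        return {
            "group": "political",
            "corpus": PREFIXES[m.group(1)],
            "suffix": m.group(2),  # left uninterpreted: rank, date or year
        }
    m = THEATRE.match(title)
    if m:
        return {
            "group": "theatre",
            "inalf_number": int(m.group(1)),  # chronological per the record
            "word": m.group(2),
        }
    return None


if __name__ == "__main__":
    # "300Melite" is the example given in the record; the others are made up.
    for name in ["300Melite", "DG12", "Ctrone1945", "Documentation"]:
        print(name, "->", classify_title(name))
```

A theatre title does not name its playwright, so assigning a play to Corneille, Moliere or Racine would need additional information, such as the folder the file is stored in.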