Vous êtes ici : Accueil › Recherche
› RECHERCHER
Titre
Auteur
Mot-clé
Année avant
Année après
Plus d'options

› AFFINER

› RESULTATS DE LA RECHERCHE  
45 résultats › Tri  › Par 
collection_x
Source : Hypotheses.org

Enrichment by Elimination, or: How to turn HTML into simple TEI using Python

Schöch, Christof (23 mars 2014)

There are lots of full text repositories of literary works out there, be it the venerable Project Gutenberg (founded in 1971, when the internet was just a few dozen computers), a pioneer like Gallica (with increasing amounts of plain text in the 90-95% correct OCR range), or a crowdsourced efforts ...

Voir plus d'informations | Voir la fiche de la ressource

Source : Hypotheses.org

Europeana for Quantitative Literary History (Europeana Research Blog, Text Mining #3)

Schöch, Christof (3 déc. 2015)

Note: The following post first appeared on the Europeana Research Blog on November 30, 2015, in their "Text Mining" series which also includes posts by Ted Underwood and Gregor Wiedemann. Over the last several years, there has been an increasing interest in large-scale, computational, quantitative ...

Voir plus d'informations | Voir la fiche de la ressource

Source : Hypotheses.org

Bibliography on Text Analysis, particularly Stylometry and Topic Modelling

Schöch, Christof (26 sept. 2012)

[Update, 2012-01-28: The bibliography has changed its name, has received a new URL, and has been thoroughly reorganized. All stuff related to text analysis is now in the collection called, well, "Analysis". The tagging system has been simplified, too.] Release early, release often, and talk about it ...

Voir plus d'informations | Voir la fiche de la ressource

Source : Hypotheses.org

Follow-up on Simenon and Sentence-Length: Visualization and Hypothesis-Testing

Schöch, Christof (8 sept. 2016)

In a conversation about my recent post on sentence length in Georges Simenon's work, Fotis Jannidis said he thought the post was typical of quite a lot of recent work in digital literary studies in that it is exploratory rather than focused on hypothesis testing. I think this is true and that it is ...

Voir plus d'informations | Voir la fiche de la ressource

Source : Hypotheses.org

Stylometry and pastiche. A case study from French crime fiction

Schöch, Christof (4 janv. 2015)

Arsène Lupin. Source: http://commons.wikimedia.org/wiki/File:Lupin01.jpg. Image in the public domain. When I enthusiastically present literary scholars with the surprising accuracy stylometric methods display in many cases of authorship attribution (given appropriate conditions, such as sufficient ...

Voir plus d'informations | Voir la fiche de la ressource

Source : Hypotheses.org

The Geek's Quest, or: Mr Penumbra's 24-Hour Bookstore, by Robin Sloan (Computers and Literature in Fiction, 1)

Schöch, Christof (20 nov. 2012)

Sure, there have been a few traces of digital humanities and computer-supported literary scholarship in literary texts before, most notably in David Lodge's wonderful campus novels. There must be more, and maybe some of you can point them out to me. An early example comes from David Lodge's ...

Voir plus d'informations | Voir la fiche de la ressource

Source : Hypotheses.org

Busy times, and some light at the horizon

Schöch, Christof (17 mars 2013)

Lot's of things have been going on over these last weeks. Some of them were related to text analysis, others were rather related to job interviews, grant writing, teaching, real work, and travel, and I was just too busy to write anything here. Instead of a "real" post, this is just a way for me to ...

Voir plus d'informations | Voir la fiche de la ressource

Source : Hypotheses.org

Author or genre? Assessing the quality of cluster analysis graphs in two-dimensional classification problems

Schöch, Christof (29 oct. 2012)

One of the very fundamental issues in stylometric classification tasks is that the data under scrutiny is usually messy in some way. And I don't even mean dirty OCR here, which is a problem anyone even casually playing with Google's N-Gram Viewer and interested in pre-1800s texts will have noticed ( ...

Voir plus d'informations | Voir la fiche de la ressource

Source : Hypotheses.org

Topic Modeling with MALLET: Hyperparameter Optimization

Schöch, Christof (14 nov. 2016)

This is a short technical post about an interesting feature of Mallet which I have recently discovered or rather, whose (for me) unexpected effect on the topic models I have discovered: the parameter that controls the hyperparameter optimization interval in Mallet. ((Another recent discovery, by the ...

Voir plus d'informations | Voir la fiche de la ressource

Source : Hypotheses.org

How to Create Lemmatized (French) Text for Topic Modeling

Schöch, Christof (31 mai 2014)

"Gravure représantant Pierre Corneille." From Wikipedia; source: Bibliothèque nationale de France. http://commons.wikimedia.org/wiki/File:Gravure_Pierre_Corneille.jpg (public domain). It would not, some years ago, have occurred to me that anyone would want to reduce literary texts to the following ...

Voir plus d'informations | Voir la fiche de la ressource