You are looking at content from Sapping Attention, which was my primary blog from
2010 to 2015; I am republishing all items from there on this page, but for the foreseeable
future you should be able to read them in their original form at
sappingattention.blogspot.com. For current
posts, see here.
Posts with tag Howells
Back to all postsDunning Statistics on authors
Oct 07 2011
As promised, some quick thoughts broken off my post on Dunning Log-likelihood. There, I looked at _big_ corpuses–two history classes of about 20,000 books each. But I also wonder how we can use algorithmic comparison on a much smaller scale: particularly, at the level of individual authors or works. English dept. digital humanists tend to rely on small sets of well curated, TEI texts, but even the ugly wilds of machine OCR might be able to offer them some insights. (Sidenote–interesting post by Ted Underwood today on the mechanics of creating a middle group between these two poles).