大頭照

Jonathan Reeve
@JonathanReeve

Infrastructures for Cultural Analytics, Digital Humanities, Text Analysis, and NLP

0 位贊助者每週向 JonathanReeve 資助 US$0.00
捐助   付款卡 直接扣款 PayPal

描述

I build tools and infrastructures for analyzing, collecting, and manipulating texts, so that we can better understand books and other textual cultures. Some of my recent projects have included Macro-Etym, a tool for analyzing the etymologies of a text; Text-Matcher, a text reuse detection tool, good at identifying when a text quotes from another; Corpus-DB, an API for Project Gutenberg and other text repositories; and Chapterize, a tool for splitting a book into its chapters. I also lead the Open-Editions project, which aims to produce richly-annotated editions of classic works of literature, and the Git-Lit project, which publishes the British Library's digital books through GitHub.

I'm a PhD candidate in English and Comparative Literature at Columbia University, where I work in the Literary Modeling and Visualization Lab of the Group for Experimental Methods in the Humanities. Our group has no funding of its own, and my graduate student funding is very modest, so donations (of money, cryptocurrency, and/or code) are deeply appreciated.

Read more about my work here, on my website..

已連結的帳號

JonathanReeve 在其他平臺擁有以下帳號:

儲存庫

text-matcher 星號數 120 於 10 個月前更新

A simple text reuse detection CLI tool.

corpus-db 星號數 57 於 4 年前更新

A textual corpus database for the digital humanities.

late-style-PCA 星號數 10 於 4 年前更新

An attempt to experimentally test Edward Said's claims about late style using computational text analysis and principal component analysis.

chapterize 星號數 81 於 6 年前更新

A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books for computational text analysis.

chapter-experiments 星號數 0 於 6 年前更新

Quantitative analyses of novelistic chapters. Diachronic analyses of chapter lengths, numbers of chapters, linguistic patterns within chapters.

sentence-trees 星號數 1 於 7 年前更新

Experiments with sentences as trees.

character-attribution 星號數 2 於 7 年前更新

Probabilistic attribution of character voices in fiction.

allusion-detection 星號數 9 於 8 年前更新

Computational intertextuality detection in Python. Fuzzy string matching, approximate string matching.

記錄

JonathanReeve 於 5 年前加入。

每週收入(美元)

每週贊助人的數目

此頁面包含機器翻譯的文句,因尚未審核,可能有不準確之處。您可以協助翻譯