Gutenkarte is a geographic text browser, intended to help readers explore the spatial component of classic works of literature. Gutenkarte downloads public domain texts from Project Gutenberg, and then feeds them to MetaCarta’s GeoParser API, which extracts and returns all the geographic locations it can find. Gutenkarte stores these locations in a database, along with citations into the text itself, and offers an interface where the book can be browsed by chapter, by place, or all at once on an interactive map. Ultimately, Gutenkarte will offer the ability to annotate and correct the places in the database, so that the community will be able construct and share rich geographic views of Project Gutenberg’s enormous body of literary classics.
Below, the mapping of “A Tale of Two Cities” of Charles Dickens. It is possible to spot some little problems in the parsing algorithm used. See the “us” flag in Croatia …
Tags: data mining, map algorithms, maps, text data mining