<?xml version="1.0" encoding="UTF-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
    <title>the goggles do nothing</title>
    <link rel="alternate" type="text/html" href="http://www.thegogglesdonothing.com/" />
    <link rel="self" type="application/atom+xml" href="http://www.thegogglesdonothing.com/atom.xml" />
    <id>tag:www.thegogglesdonothing.com,2009-04-06:/1</id>
    <updated>2013-06-15T04:34:39Z</updated>
    
    <generator uri="http://www.sixapart.com/movabletype/">Movable Type 5.13-en</generator>

<entry>
    <title>NYPL Maps Hack 2013</title>
    <link rel="alternate" type="text/html" href="http://www.thegogglesdonothing.com/archives/2013/06/nypl_maps_hack_2013.shtml" />
    <id>tag:www.thegogglesdonothing.com,2013://1.782</id>

    <published>2013-06-05T14:43:57Z</published>
    <updated>2013-06-15T04:34:39Z</updated>

    <summary> On June 4-5 I had a chance to go down to the NYPL and participate in their &#8220;Mobilizing Historic Geodata&#8221; event. Organized by NYPLLabs, the hackathon brought together programmers, map librarians, and others with an interest in working with...</summary>
    <author>
        <name>Peter Leonard</name>
        <uri>http://www.thegogglesdonothing.com</uri>
    </author>
    
        <category term="photography" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.thegogglesdonothing.com/">
        <![CDATA[<p><a href="http://www.flickr.com/photos/peterl/8969232205/" title="NYPL Labs: Mobilizing Historical Geodata by peterl, on Flickr"><img src="http://farm8.staticflickr.com/7333/8969232205_cdfdd3cbaf_c.jpg" width="800" height="595" alt="NYPL Labs: Mobilizing Historical Geodata"></a></p>

<p>On June 4-5 I had a chance to go down to the NYPL and participate in their &#8220;Mobilizing Historic Geodata&#8221; event.</p>

<p><a href="http://www.flickr.com/photos/peterl/8952220361/" title="NYPL Labs: Mobilizing Historical Geodata by peterl, on Flickr"><img src="http://farm6.staticflickr.com/5443/8952220361_d0f0677ae3_c.jpg" width="800" height="432" alt="NYPL Labs: Mobilizing Historical Geodata"></a></p>

<p>Organized by <a href="http://www.nypl.org/collections/labs">NYPLLabs</a>, the hackathon brought together programmers, map librarians, and others with an interest in working with historical spatial data.</p>

<p><a href="http://www.flickr.com/photos/peterl/8969227149/" title="NYPL Labs: Mobilizing Historical Geodata by peterl, on Flickr"><img src="http://farm4.staticflickr.com/3773/8969227149_7f450a11ca_c.jpg" width="800" height="499" alt="NYPL Labs: Mobilizing Historical Geodata"></a></p>

<p>One of the most interesting parts of the event for me was working alongside some folks who were involved with Flickr in the mid-2000s.  Schulyer and Aaron gave some fascinating background on the ways that Flickr was able to take aggregate geolocation data for millions of photographs and create rough outlines of neighborhoods, as defined by people describing their own photos.</p>

<p><a href="http://www.flickr.com/photos/peterl/8952228251/" title="NYPL Labs: Mobilizing Historical Geodata by peterl, on Flickr"><img src="http://farm6.staticflickr.com/5441/8952228251_6d61d23502_c.jpg" width="800" height="605" alt="NYPL Labs: Mobilizing Historical Geodata"></a></p>

<p>We all worked on various projects, ranging from the nearly-complete to some preliminary proofs-of-concept.  One thing I threw together on the last day was a tool for finding pre-1921 street addresses in Detroit.  That city&#8217;s numbering system changed in 1921, which means that all references to house or building numbers before that year do not resolve to their proper location when using tools like Google Maps or OpenStreetMap.  Luckily, motivated folks have uploaded <a href="http://stevemorse.org/census/changes/RenumInfo.htm">information about the change</a>, as well as <a href="http://stevemorse.org/census/changes/DetroitAB.htm">detailed tables of old and new street numbers</a> drawn from circa-1920 Detroit phone books.  The tool I worked on visualizes these changes on a modern street map of Detroit, drawing a red highlight along the path of a street as it existed circa 1921. Markers along the path show where address ranges would have fallen on a block-by-block basis &#8212; and also expose former streets that have disappeared since the 1920s, such as the intersection of Abbot and Michigan Avenue, below:</p>

<p><a href="http://www.flickr.com/photos/peterl/8970415604/" title="Detroit pre-1921 Address Finder by peterl, on Flickr"><img src="http://farm9.staticflickr.com/8270/8970415604_d9c0d21763_c.jpg" width="800" height="694" alt="Detroit pre-1921 Address Finder"></a></p>
]]>
        

    </content>
</entry>

<entry>
    <title>Dog on a hike</title>
    <link rel="alternate" type="text/html" href="http://www.thegogglesdonothing.com/archives/2013/06/dog_on_a_hike.shtml" />
    <id>tag:www.thegogglesdonothing.com,2013://1.781</id>

    <published>2013-06-02T16:53:44Z</published>
    <updated>2013-06-03T16:54:41Z</updated>

    <summary>I went up to the top of Sleeping Giant State Park this Sunday, just north of Hamden. At the top, a CCC-built lookout tower provided shade to a dog who had just finished his own hike:...</summary>
    <author>
        <name>Peter Leonard</name>
        <uri>http://www.thegogglesdonothing.com</uri>
    </author>
    
        <category term="photography" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.thegogglesdonothing.com/">
        <![CDATA[<p>I went up to the top of Sleeping Giant State Park this Sunday, just north of Hamden. At the top, a CCC-built lookout tower provided shade to a dog who had just finished his own hike:</p>

<p><a href="http://www.flickr.com/photos/peterl/8929981180/" title="Dog at Sleeping Giant by peterl, on Flickr"><img src="http://farm8.staticflickr.com/7290/8929981180_13f0e79be0_c.jpg" width="800" height="534" alt="Dog at Sleeping Giant"></a></p>
]]>
        

    </content>
</entry>

<entry>
    <title>Michigan Wildfires of 1881</title>
    <link rel="alternate" type="text/html" href="http://www.thegogglesdonothing.com/archives/2013/05/michigan_wildfires_of_1881.shtml" />
    <id>tag:www.thegogglesdonothing.com,2013://1.780</id>

    <published>2013-05-31T18:51:49Z</published>
    <updated>2013-05-31T18:53:51Z</updated>

    <summary>An amazing view of the destruction from the fires that swept across the &#8220;thumb&#8221; of Michigan in 1881. Everything in the white area was burned, with only a few communities along the coast spared. The United States Army Signal Corps...</summary>
    <author>
        <name>Peter Leonard</name>
        <uri>http://www.thegogglesdonothing.com</uri>
    </author>
    
        <category term="photography" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.thegogglesdonothing.com/">
        <![CDATA[<p>An amazing view of the destruction from the fires that swept across the &#8220;thumb&#8221; of Michigan in 1881. Everything in the white area was burned, with only a few communities along the coast spared.</p>

<p><a href="http://www.flickr.com/photos/peterl/8904744394/" title="Michigan Wildfires of 1881 by peterl, on Flickr"><img src="http://farm3.staticflickr.com/2831/8904744394_f98c3a1d27_c.jpg" width="800" height="642" alt="Michigan Wildfires of 1881"></a></p>

<p>The United States Army Signal Corps prepared this map as part of their report in 1882.</p>
]]>
        

    </content>
</entry>

<entry>
    <title>Dodecatheon alpinum</title>
    <link rel="alternate" type="text/html" href="http://www.thegogglesdonothing.com/archives/2013/05/dodecatheon_alpinum.shtml" />
    <id>tag:www.thegogglesdonothing.com,2013://1.779</id>

    <published>2013-05-28T05:39:22Z</published>
    <updated>2013-05-28T05:40:04Z</updated>

    <summary></summary>
    <author>
        <name>Peter Leonard</name>
        <uri>http://www.thegogglesdonothing.com</uri>
    </author>
    
        <category term="photography" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.thegogglesdonothing.com/">
        <![CDATA[<p><a href="http://www.flickr.com/photos/peterl/8863320114/" title="Shooting Star (Dodecatheon alpinum) by peterl, on Flickr"><img src="http://farm4.staticflickr.com/3815/8863320114_b5f5b60009_b.jpg" width="715" height="1024" alt="Shooting Star (Dodecatheon alpinum)"></a></p>
]]>
        

    </content>
</entry>

<entry>
    <title>30-pin and Lightning connectors</title>
    <link rel="alternate" type="text/html" href="http://www.thegogglesdonothing.com/archives/2013/05/30-pin_and_lightning_connectors.shtml" />
    <id>tag:www.thegogglesdonothing.com,2013://1.778</id>

    <published>2013-05-26T23:10:25Z</published>
    <updated>2013-05-27T23:14:03Z</updated>

    <summary>Apple&#8217;s circa-2003 &#8220;30-Pin&#8221; iPod connector: &#8230;compared to the new 2012 &#8220;Lightning&#8221; connector:...</summary>
    <author>
        <name>Peter Leonard</name>
        <uri>http://www.thegogglesdonothing.com</uri>
    </author>
    
        <category term="photography" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.thegogglesdonothing.com/">
        <![CDATA[<p>Apple&#8217;s circa-2003 &#8220;30-Pin&#8221; iPod connector:</p>

<p><a href="http://www.flickr.com/photos/peterl/8843131461/" title="iPod 30-pin connector by peterl, on Flickr"><img src="http://farm3.staticflickr.com/2872/8843131461_6b7da2116e_c.jpg" width="800" height="534" alt="iPod 30-pin connector"></a></p>

<p>&#8230;compared to the new 2012 &#8220;Lightning&#8221; connector:</p>

<p><a href="http://www.flickr.com/photos/peterl/8843116423/" title="Lightning Connector by peterl, on Flickr"><img src="http://farm9.staticflickr.com/8418/8843116423_b740052964_c.jpg" width="800" height="637" alt="Lightning Connector"></a></p>
]]>
        

    </content>
</entry>

<entry>
    <title>Trip to New York</title>
    <link rel="alternate" type="text/html" href="http://www.thegogglesdonothing.com/archives/2013/04/trip_to_new_york.shtml" />
    <id>tag:www.thegogglesdonothing.com,2013://1.777</id>

    <published>2013-04-10T02:37:30Z</published>
    <updated>2013-04-19T02:43:45Z</updated>

    <summary> This weekend I headed down to Manhattan to attend an event on Columbia&#8217;s campus. This was my first time in Grand Central Station since the Apple Store had moved in. I took the opportunity to walk north from Grand...</summary>
    <author>
        <name>Peter Leonard</name>
        <uri>http://www.thegogglesdonothing.com</uri>
    </author>
    
        <category term="food" scheme="http://www.sixapart.com/ns/types#category" />
    
        <category term="photography" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.thegogglesdonothing.com/">
        <![CDATA[<p><a href="http://www.flickr.com/photos/peterl/8626882178/" title="Butler Library by peterl, on Flickr"><img src="http://farm9.staticflickr.com/8112/8626882178_1a18a8032a_c.jpg" width="800" height="527" alt="Butler Library"></a></p>

<p>This weekend I headed down to Manhattan to attend an event on Columbia&#8217;s campus.</p>

<p>This was my first time in Grand Central Station since the Apple Store had moved in.</p>

<p><a href="http://www.flickr.com/photos/peterl/8625771503/" title="Grand Central Station Apple Store by peterl, on Flickr"><img src="http://farm9.staticflickr.com/8545/8625771503_e90308b840_c.jpg" width="800" height="526" alt="Grand Central Station Apple Store"></a></p>

<p>I took the opportunity to walk north from Grand Central and shoot some scenes on the street.</p>

<p><a href="http://www.flickr.com/photos/peterl/8625772047/" title="Directions by peterl, on Flickr"><img src="http://farm9.staticflickr.com/8120/8625772047_e043ed21ce_c.jpg" width="800" height="531" alt="Directions"></a></p>

<p><a href="http://www.flickr.com/photos/peterl/8625771897/" title="Hot dog handoff by peterl, on Flickr"><img src="http://farm9.staticflickr.com/8539/8625771897_b36ac4b438_c.jpg" width="800" height="620" alt="Hot dog handoff"></a></p>

<p>The event was catered by <a href="http://www.aamanns-copenhagen.com">Aamanns</a>, a Danish deli which has just opened up a branch in NYC:</p>

<p><a href="http://www.flickr.com/photos/peterl/8626881818/" title="Aamans Copenhagen by peterl, on Flickr"><img src="http://farm9.staticflickr.com/8121/8626881818_d908e18ed3_c.jpg" width="800" height="523" alt="Aamans Copenhagen"></a></p>

<p><a href="http://www.flickr.com/photos/peterl/8626882016/" title="Aamans Copenhagen by peterl, on Flickr"><img src="http://farm9.staticflickr.com/8105/8626882016_818d5c9123_c.jpg" width="800" height="598" alt="Aamans Copenhagen"></a></p>
]]>
        

    </content>
</entry>

<entry>
    <title>IKEA Numerär counter desk</title>
    <link rel="alternate" type="text/html" href="http://www.thegogglesdonothing.com/archives/2013/03/ikea_numerär_counter_desk.shtml" />
    <id>tag:www.thegogglesdonothing.com,2013://1.776</id>

    <published>2013-04-01T02:39:11Z</published>
    <updated>2013-04-01T20:45:22Z</updated>

    <summary> A bunch of folks have discovered that IKEA&#8217;s Numerär solid wood kitchen counters work great as a desk when perched atop either table legs or small office drawer cabinets. I decided to give it a try, and &#8212; aside...</summary>
    <author>
        <name>Peter Leonard</name>
        <uri>http://www.thegogglesdonothing.com</uri>
    </author>
    
        <category term="photography" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.thegogglesdonothing.com/">
        <![CDATA[<p><a href="http://www.flickr.com/photos/peterl/8610463499/" title="IKEA Numerär counter desk by peterl, on Flickr"><img src="http://farm9.staticflickr.com/8107/8610463499_a9e524f270_c.jpg" width="800" height="600" alt="IKEA Numerär counter desk"></a></p>

<p>A bunch of folks have discovered that IKEA&#8217;s <strong>Numerär</strong> solid wood kitchen counters work great as a desk when perched atop either table legs or small office drawer cabinets.  I decided to give it a try, and &#8212; aside from the trauma of carrying an 83lb hunk of solid wood from the car &#8212; can verify it works:</p>

<p><a href="http://www.flickr.com/photos/peterl/8610463831/" title="IKEA Numerär counter desk by peterl, on Flickr"><img src="http://farm9.staticflickr.com/8113/8610463831_69834e62e0_c.jpg" width="800" height="480" alt="IKEA Numerär counter desk"></a></p>

<p>I didn&#8217;t bother to permanently attach the Numerär counter to the two legs (<strong>Vika Alex</strong> and <strong>Vika Annefors</strong>). The counter&#8217;s mass keeps it in place pretty well, and the friction from the (included) four rubber stops that sit on top of the two white Vika pieces means it won&#8217;t accidentally go anywhere.</p>
]]>
        

    </content>
</entry>

<entry>
    <title>68030 NeXT</title>
    <link rel="alternate" type="text/html" href="http://www.thegogglesdonothing.com/archives/2013/03/68030_next.shtml" />
    <id>tag:www.thegogglesdonothing.com,2013://1.775</id>

    <published>2013-03-27T15:24:35Z</published>
    <updated>2013-03-27T15:25:58Z</updated>

    <summary>Didn&#8217;t want to send this in the moving truck, so I drove it from Chicago to New Haven in my car:...</summary>
    <author>
        <name>Peter Leonard</name>
        <uri>http://www.thegogglesdonothing.com</uri>
    </author>
    
        <category term="photography" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.thegogglesdonothing.com/">
        <![CDATA[<p>Didn&#8217;t want to send this in the moving truck, so I drove it from Chicago to New Haven in my car:</p>

<p><a href="http://www.flickr.com/photos/peterl/8593503457/" title="68030 Next by peterl, on Flickr"><img src="http://farm9.staticflickr.com/8368/8593503457_73711f8ca5_c.jpg" width="800" height="406" alt="68030 Next"></a></p>
]]>
        

    </content>
</entry>

<entry>
    <title>DH at the University of Alabama</title>
    <link rel="alternate" type="text/html" href="http://www.thegogglesdonothing.com/archives/2013/03/dh_at_the_university_of_alabama.shtml" />
    <id>tag:www.thegogglesdonothing.com,2013://1.774</id>

    <published>2013-03-07T18:10:24Z</published>
    <updated>2013-03-12T16:55:14Z</updated>

    <summary> Back from a great trip to Tuscaloosa and the University of Alabama&#8217;s Digital Humanities Center. My first time on campus; the library is host to a great DH space and some interesting projects that touch on both local history...</summary>
    <author>
        <name>Peter Leonard</name>
        <uri>http://www.thegogglesdonothing.com</uri>
    </author>
    
        <category term="tech" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.thegogglesdonothing.com/">
        <![CDATA[<p><a href="http://www.flickr.com/photos/peterl/8549580978/" title="Alabama Digital Humanities Center by peterl, on Flickr"><img src="http://farm9.staticflickr.com/8520/8549580978_3df8884cda_c.jpg" width="800" height="600" alt="Alabama Digital Humanities Center"></a></p>

<p>Back from a great trip to Tuscaloosa and the University of Alabama&#8217;s <a href="http://www.lib.ua.edu/digitalhumanities">Digital Humanities Center</a>. My first time on campus; the library is host to a great DH space and some interesting projects that touch on both local history and broader topics. </p>

<p><a href="http://www.flickr.com/photos/peterl/8538668835/" title="University of Alabama library by peterl, on Flickr"><img src="http://farm9.staticflickr.com/8248/8538668835_1f001d7b61_o.jpg" width="612" height="612" alt="University of Alabama library"></a></p>
]]>
        

    </content>
</entry>

<entry>
    <title>Bond Chapel in Snow</title>
    <link rel="alternate" type="text/html" href="http://www.thegogglesdonothing.com/archives/2013/03/bond_chapel_in_snow.shtml" />
    <id>tag:www.thegogglesdonothing.com,2013://1.773</id>

    <published>2013-03-05T21:45:29Z</published>
    <updated>2013-03-05T21:48:12Z</updated>

    <summary>My last day on campus, the weather was well on its way to delivering several inches of snow. From the archway of the Divinity School, Bond Chapel:...</summary>
    <author>
        <name>Peter Leonard</name>
        <uri>http://www.thegogglesdonothing.com</uri>
    </author>
    
        <category term="photography" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.thegogglesdonothing.com/">
        <![CDATA[<p>My last day on campus, the weather was well on its way to delivering several inches of snow. From the archway of the Divinity School, Bond Chapel:</p>

<p><a href="http://www.flickr.com/photos/peterl/8531535649/" title="Bond Chapel by peterl, on Flickr"><img src="http://farm9.staticflickr.com/8249/8531535649_980884552b_b.jpg" width="768" height="1024" alt="Bond Chapel"></a></p>
]]>
        

    </content>
</entry>

<entry>
    <title>Visualizing Texts on Maps</title>
    <link rel="alternate" type="text/html" href="http://www.thegogglesdonothing.com/archives/2012/12/visualizing_texts_on_maps.shtml" />
    <id>tag:www.thegogglesdonothing.com,2012://1.772</id>

    <published>2012-12-04T01:37:09Z</published>
    <updated>2012-12-04T21:39:44Z</updated>

    <summary>Just a sneak preview of a website I&#8217;m building &#8212; the idea is to have an algorithm read through a bunch of magazine articles, find place names, and map those places onto a city or region of a country. Then,...</summary>
    <author>
        <name>Peter Leonard</name>
        <uri>http://www.thegogglesdonothing.com</uri>
    </author>
    
        <category term="tech" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.thegogglesdonothing.com/">
        <![CDATA[<p>Just a sneak preview of a website I&#8217;m building &#8212; the idea is to have an algorithm read through a bunch of magazine articles, find place names, and map those places onto a city or region of a country.  Then, when the user hovers over the place, a list of sentences appear on the right, showing the context for each occurrence. Each small red dot is a place mentioned in the text; larger blobs of color show concentrations of places.</p>

<p><img alt="gringopreview.jpg" src="http://www.thegogglesdonothing.com/archives/gringopreview.jpg" width="800" height="681" class="mt-image-none" style="" /></p>
]]>
        

    </content>
</entry>

<entry>
    <title>Hyde Park Transport, 1926</title>
    <link rel="alternate" type="text/html" href="http://www.thegogglesdonothing.com/archives/2012/11/hyde_park_transport_1926.shtml" />
    <id>tag:www.thegogglesdonothing.com,2012://1.771</id>

    <published>2012-11-21T03:47:54Z</published>
    <updated>2012-11-21T04:01:07Z</updated>

    <summary> I was curious what it would look like to take this Rand McNally map of Chicago public transportation networks circa 1926 and overlay it on Google&#8217;s modern aerial photography of the city. When I first came to Hyde Park...</summary>
    <author>
        <name>Peter Leonard</name>
        <uri>http://www.thegogglesdonothing.com</uri>
    </author>
    
        <category term="tech" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.thegogglesdonothing.com/">
        <![CDATA[<iframe width="800" height="600" src="http://www.youtube.com/embed/3OxJM6u0bCk?rel=0" frameborder="0" allowfullscreen></iframe>

<p>I was curious what it would look like to take this <a href="http://collections.lib.uwm.edu/cdm4/document.php?CISOROOT=/agdm&amp;CISOPTR=1328&amp;REC=1">Rand McNally map of Chicago public transportation networks circa 1926</a> and overlay it on Google&#8217;s modern aerial photography of the city.</p>

<p><img alt="1926el.jpg" src="http://www.thegogglesdonothing.com/archives/1926el.jpg" width="800" height="569" class="mt-image-none" style="" /></p>

<p>When I first came to Hyde Park in 1993, there was an El track along 63rd street. That part of the <a href="http://www.chicagoreader.com/chicago/losing-track/Content?oid=894837">Green Line was torn down by the time I left</a> in 1997.</p>

<p><img alt="1926el63rd.jpg" src="http://www.thegogglesdonothing.com/archives/1926el63rd.jpg" width="800" height="397" class="mt-image-none" style="" /></p>

<p>63rd is certainly lighter and more open, but the demolition of the overhead tracks has hardly spurred economic development. The colored lines of the 1926 map hover over a mostly-vacant streetscape of today.</p>

<p><img alt="1926el63rdclose.jpg" src="http://www.thegogglesdonothing.com/archives/1926el63rdclose.jpg" width="800" height="430" class="mt-image-none" style="" /></p>

<p>I did this experiment with ArcGIS (for rectification) and <a href="http://geoserver.org ">GeoServer</a> (for serving the map tiles into Google Earth.)</p>
]]>
        

    </content>
</entry>

<entry>
    <title>DHCS 2012</title>
    <link rel="alternate" type="text/html" href="http://www.thegogglesdonothing.com/archives/2012/11/dhcs_2012.shtml" />
    <id>tag:www.thegogglesdonothing.com,2012://1.770</id>

    <published>2012-11-19T21:36:02Z</published>
    <updated>2012-11-19T22:26:22Z</updated>

    <summary><![CDATA[We just wrapped up the 2012 Chicago Colloquium on Digital Humanities &amp; Computer Science (DHCS). Below is a picture from one of my favorite poster sessions, a team working on Classical Greek &amp; Latin textual applications for both iOS and...]]></summary>
    <author>
        <name>Peter Leonard</name>
        <uri>http://www.thegogglesdonothing.com</uri>
    </author>
    
        <category term="photography" scheme="http://www.sixapart.com/ns/types#category" />
    
        <category term="tech" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.thegogglesdonothing.com/">
        <![CDATA[<p>We just wrapped up the 2012 <a href="http://chicagocolloquium.org">Chicago Colloquium on Digital Humanities &amp; Computer Science</a> (DHCS). Below is a picture from one of my favorite poster sessions, a team working on Classical Greek &amp; Latin textual applications for both iOS and Android:</p>

<p><a href="http://www.flickr.com/photos/peterl/8200155753/" title="Parsing and Presentation of Digital Classical Texts on Mobile Platforms by peterl, on Flickr"><img src="http://farm9.staticflickr.com/8338/8200155753_2b13e7ccd3_c.jpg" width="800" height="617" alt="Parsing and Presentation of Digital Classical Texts on Mobile Platforms"></a></p>

<p>More pictures from the conference are online <a href="http://www.flickr.com/photos/peterl/sets/72157632053479754/detail/">here</a>.</p>
]]>
        

    </content>
</entry>

<entry>
    <title>Topic-Modeling a Quattrocento Tuscan Diary</title>
    <link rel="alternate" type="text/html" href="http://www.thegogglesdonothing.com/archives/2012/11/topic-modeling_a_quattrocento_tuscan_diary.shtml" />
    <id>tag:www.thegogglesdonothing.com,2012://1.769</id>

    <published>2012-11-11T23:26:19Z</published>
    <updated>2012-11-12T00:39:38Z</updated>

    <summary> I&#8217;ve been doing some work recently on the 15th-century diary of Luca Landucci, a Florentine apothecary who kept a detailed diary for over half a century. The text we&#8217;re working with has surely been normalized to a degree, but...</summary>
    <author>
        <name>Peter Leonard</name>
        <uri>http://www.thegogglesdonothing.com</uri>
    </author>
    
        <category term="school" scheme="http://www.sixapart.com/ns/types#category" />
    
        <category term="tech" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.thegogglesdonothing.com/">
        <![CDATA[<p><img alt="topic-diary.png" src="http://www.thegogglesdonothing.com/archives/topic-diary.png" width="322" height="266" class="mt-image-left" style="float: left; margin: 0 20px 20px 0;" /> I&#8217;ve been doing some work recently on the 15th-century diary of <a href="http://en.wikipedia.org/wiki/Luca_Landucci">Luca Landucci</a>, a Florentine apothecary who kept a detailed diary for over half a century.</p>

<p>The text we&#8217;re working with has surely been normalized to a degree, but still represents a departure from contemporary Italian. This has an impact on text preparation &#8212; the process of stoplisting, or stripping out less-meaningful words to leave behind really interesting terms and phrases for analysis.  Normally I just run some utilities that list the most-frequently-used words in a corpus (such as &#8220;the&#8221; in English) and use the results as a starting point for a stoplist of words to ignore.  But I noticed in the case of Landucci&#8217;s diary that he begins almost every entry with a sentence that contains the name of a month. Because these months are really more metadata than data &#8212; a kind of in-line datestamp &#8212; I figured it would make more sense to strip them out and so set about adding the Italian names for all twelve months into the stoplist.  But after re-running my analysis, I was confused to see the months still popping up. The answer, of course, was that Luca wrote in a kind of Tuscan dialect, which differs in small ways from modern Italian. I stoplisted &#8220;settembre&#8221;, but Luca&#8217;s &#8220;settenbre&#8221; made it through the filter intact, along with &#8220;giennaio&#8221; (modern &#8220;gennaio&#8221;). So one by-product of this project will be a custom stoplist for Luca&#8217;s orthography &#8212; perhaps broad enough to function on other Tuscan text of the 1400s, perhaps not.</p>

<p>Another part of the preparation process was to split up the diary into individual days. Since the text I received was not marked-up in any kind of TEI or XML, I had to figure out how to do this by hand.  Luckily, Luca was remarkably consistent, starting every entry with &#8220;E a dì 8 aprile 1498&#8230;&#8221;, &#8220;E a dì 16 di febraio 1495&#8221;, &#8220;E a dì 7 detto&#8221;, and the like. I sliced apart the diary into chunks based on this pattern, using the Gnu project&#8217;s CSPLIT function:</p>

<p><code>csplit -k -f landucci_chunks/$i -z -b _%05d landucci.txt '/E a dì /' '{*}'</code></p>

<p>This left me with about 1,600 individual entries, of relatively uniform size. There were, of course, outliers &#8212; one epic entry approached 12k, and several dozen entries were only a few words long &#8212; a kind of Renaissance Twitter stream. I might consider going back and removing the very large diary entry, or splitting into a few chunks, at a further stage. But in general, I was happy with the distribution of size of the individual entries. As with so much else human culture, the varying length of Luca&#8217;s writing demonstrates a power-law curve:</p>

<p><img alt="diary-curve.png" src="http://www.thegogglesdonothing.com/archives/diary-curve.png" width="800" height="244" class="mt-image-none" style="" /></p>

<p>Running <a href="http://mallet.cs.umass.edu">Mallet</a>&#8217;s topic modeling code on the resulting files was my next step. I chose twenty topics to begin with, since it seemed like a reasonable first guess. </p>

<p><img alt="diary-topics.png" src="http://www.thegogglesdonothing.com/archives/diary-topics.png" width="800" height="255" class="mt-image-none" style="" /></p>

<p>I was pleasantly surprised with the cohesion of the results &#8212; some interesting patterns included a topic about that most famous Florentine family, the Medici:</p>

<p><img alt="diary-medici.png" src="http://www.thegogglesdonothing.com/archives/diary-medici.png" width="800" height="222" class="mt-image-none" style="" /></p>

<p>Also nice to see was a topic I&#8217;ve (provisionally) labeled &#8220;Economics&#8221;, which goes into the intricacies of taxation, inflation and commodity pricing:</p>

<p><img alt="diary-economics.png" src="http://www.thegogglesdonothing.com/archives/diary-economics.png" width="800" height="248" class="mt-image-none" style="" /></p>

<p>This is just the first cut of the data &#8212; I want to refer to a printed edition and figure out how many entries their &#8220;should&#8221; be, to see if it&#8217;s anywhere near the 1,580 that my chunking algorithm produced. And it would be nice to assign a ISO-format date (like 1459-12-21) to each entry, so that we could graph topic saturation over time. This might let us see how certain topics, such as economics, waxed and waned as a matter of concern from Luca. But even at this early stage, I think this project reinforces the appropriateness of diaries as raw material for topic modeling (cf, of course, Cameron Blevins&#8217; fantastic <a href="http://historying.org/2010/04/01/topic-modeling-martha-ballards-diary/">Martha Ballard diary</a> project.)  Unlike novels and other forms of print culture, diaries are relatively easy to cut into logical pieces and &#8212; at least in the case of Martha Ballard and Luca Landucci &#8212; offer a fascinating glimpse of one writer chronicling events over a long period of time.</p>
]]>
        

    </content>
</entry>

<entry>
    <title>HathiTrust UnCamp</title>
    <link rel="alternate" type="text/html" href="http://www.thegogglesdonothing.com/archives/2012/09/hathitrust_uncamp.shtml" />
    <id>tag:www.thegogglesdonothing.com,2012://1.768</id>

    <published>2012-09-13T05:38:44Z</published>
    <updated>2012-09-15T05:55:29Z</updated>

    <summary> Back from a two-day workshop at Indiana University, run by the HathiTrust Research Center folks. HathiTrust is, loosely, a consortium of universities and research libraries in the US which gave volumes to be digitized in the Google Books project....</summary>
    <author>
        <name>Peter Leonard</name>
        <uri>http://www.thegogglesdonothing.com</uri>
    </author>
    
        <category term="tech" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.thegogglesdonothing.com/">
        <![CDATA[<p><a href="http://www.flickr.com/photos/peterl/7980017475/" title="&quot;Correction Rules!&quot; by peterl, on Flickr"><img src="http://farm9.staticflickr.com/8444/7980017475_2cd2290e32_c.jpg" width="800" height="320" alt="&quot;Correction Rules!&quot;"></a></p>

<p>Back from a two-day workshop at Indiana University, run by the HathiTrust <a href="http://www.hathitrust.org/htrc">Research Center</a> folks. <a href="http://www.hathitrust.org">HathiTrust</a> is, loosely, a consortium of universities and research libraries in the US which gave volumes to be digitized in the Google Books project. Though these volumes are all available on books.google.com (at least those out-of-copyright), the HathiTrust exists to ensure that duplicate copies are held by a consortium of all the libraries in perpetuity, in case Google isn&#8217;t around in a few decades.</p>

<p>The attraction to literary folks &#8212; or at least those of us with an interest in data mining &#8212; is obvious: tens of millions of books, all digitized in the space of a few years.  The tricky question has always been: how do we get access to them, and what kind of algorithms can we run on a corpus of this scale?</p>

<p><a href="http://www.flickr.com/photos/peterl/7980018668/" title="Corpus stats by peterl, on Flickr"><img src="http://farm9.staticflickr.com/8171/7980018668_3fe1f75308_c.jpg" width="800" height="530" alt="Corpus stats"></a></p>

<p>For small-scale projects in the past, many of us were content to build up infrastructure at our local institutions: a big server here, a metadata database there&#8230;  I set up such systems when I was at UCLA, to work on the 19th Century Nordic-language corpus. This works fine for several hundred or thousand books, but doesn&#8217;t make any sense for projects at the million scale. </p>

<p>So instead, the future of large-scale text mining may look something like this:</p>

<p><a href="http://www.flickr.com/photos/peterl/7980006690/" title="Running against the HathiTrust corpus by peterl, on Flickr"><img src="http://farm9.staticflickr.com/8321/7980006690_0f0a9c4e3e_c.jpg" width="800" height="629" alt="Running against the HathiTrust corpus"></a></p>

<p>The screenshot above shows me doing a word count of a bunch of Norwegian-language texts &#8212; the first item is actually an artifact of the beta-quality tokenizing code, a hyphen, but the rest are the words for <em>it</em>, <em>I</em>, <em>and</em>, <em>so</em>, etc.  This is actually a hybrid model, where a python script goes out and gets a zipped objects, decompresses them, and then does the word counting on a local machine.  Most users are likely to use a combination of such local analytics, coupled with large-scale (and somewhat less-frequently-run) examinations of large chunks of the collection.  That&#8217;s what this picture below shows off &#8212; visualizations from the Mellon-funded SEASR/MEANDRE toolkit.</p>

<p><a href="http://www.flickr.com/photos/peterl/7980003613/" title="Epic Datawall by peterl, on Flickr"><img src="http://farm9.staticflickr.com/8461/7980003613_3b2ff1b82e_c.jpg" width="800" height="531" alt="Epic Datawall"></a></p>

<p>But regardless of the implementation details, we&#8217;re at the cusp of being able to do truly interesting things with out-of-copyright works from the 19th and early 20th Centuries.  All we need is for groups like the HathiTrust to navigate a very treacherous landscape of eager literary folks and suspicious publishing industry lawyers. If they succeed, we could derive real insight into all the cultural output that&#8217;s been preserved for centuries, and digitized quite suddenly in the span of my own grad school career.</p>

<p><a href="http://www.flickr.com/photos/peterl/7980004506/" title="Occupy the Cyberinfrastructure Building by peterl, on Flickr"><img src="http://farm9.staticflickr.com/8038/7980004506_9144a6b610_c.jpg" width="800" height="442" alt="Occupy the Cyberinfrastructure Building"></a></p>
]]>
        

    </content>
</entry>

</feed>
