<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-3744828794262426172</id><updated>2011-11-08T22:44:38.999-08:00</updated><category term='archival'/><category term='Gordon Brown'/><category term='Iran election'/><category term='ethnic names'/><category term='KDD'/><category term='juxtapositions'/><category term='ethnicity detection'/><category term='Edison Chen'/><category term='NSF'/><category term='graduate students'/><category term='Lydia alumni'/><category term='Mousavi'/><category term='heatmaps'/><category term='SBIR'/><category term='Lori Drew'/><category term='political science'/><category term='Skiena'/><category term='sentiment analysis'/><category term='United Kingdom'/><category term='General Sentiment'/><category term='pubmed'/><category term='Ahmadinejad'/><category term='AIDS'/><title type='text'>The TextMap Blog</title><subtitle type='html'></subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://textmap.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3744828794262426172/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://textmap.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>Steven Skiena</name><uri>http://www.blogger.com/profile/16923380278093754963</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_i23iQcpe1a0/SgvUyvtLvzI/AAAAAAAAAAM/KCJQK4q1nqA/S220/skiena.jpg'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>11</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-3744828794262426172.post-1342075067361103998</id><published>2009-10-26T18:50:00.000-07:00</published><updated>2009-10-26T19:25:24.006-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='graduate students'/><category scheme='http://www.blogger.com/atom/ns#' term='Lydia alumni'/><title type='text'>Alumni Reunion</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_i23iQcpe1a0/SuZZ_41_m3I/AAAAAAAAADQ/TSvUXZkuems/s1600-h/lydia-banquet-09.jpg"&gt;&lt;img style="float:left; margin:0 10px 10px 0;cursor:pointer; cursor:hand;width: 400px; height: 204px;" src="http://3.bp.blogspot.com/_i23iQcpe1a0/SuZZ_41_m3I/AAAAAAAAADQ/TSvUXZkuems/s400/lydia-banquet-09.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5397100157575601010" /&gt;&lt;/a&gt;&lt;br /&gt;The Lydia/TextMap system was built in collaboration with my graduate students.   A lot of graduate students.  Indeed over thirty of them to date, all properly recognized on the &lt;a href="http://www.textmap.com/team.htm"&gt;team&lt;/a&gt; webpage.  I've grown quite close to them over the years, and we try to keep in touch through our annual Lydia Alumni Banquet in Manhattan.&lt;br /&gt;&lt;br /&gt;The 2009 banquet was this past weekend, and attracted a swarm of 18 loyal Lydia-oids.   I am proud to see that all are doing very well indeed, with careers progressing nicely despite the recession.   Most are somehow connected to the finance industry, including a growing number in hedge funds, but several others work in technology companies such as Google and Microsoft.    Several are starting families, with several engagements (Levon, Andrew, and Lohit) on top of recent weddings (Prachi, Namtrata, and Jai).   I hereby move that the first alumni child be named ``Lydia''. (or if the parents prefer, TextMap :-) )&lt;br /&gt;&lt;br /&gt;I also include a December 2008 photo of myself with three of the Lydia alums who could not attend this year's banquet, and look forward to seeing everyone at next year's banquet.&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_i23iQcpe1a0/SuZXcCow76I/AAAAAAAAADA/xhtFFCogevc/s1600-h/official-lab-india-photo.jpg"&gt;&lt;img style="float:right; margin:0 0 10px 10px;cursor:pointer; cursor:hand;width: 400px; height: 267px;" src="http://1.bp.blogspot.com/_i23iQcpe1a0/SuZXcCow76I/AAAAAAAAADA/xhtFFCogevc/s400/official-lab-india-photo.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5397097342705921954" /&gt;&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3744828794262426172-1342075067361103998?l=textmap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://textmap.blogspot.com/feeds/1342075067361103998/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://textmap.blogspot.com/2009/10/alumni-reunion.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3744828794262426172/posts/default/1342075067361103998'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3744828794262426172/posts/default/1342075067361103998'/><link rel='alternate' type='text/html' href='http://textmap.blogspot.com/2009/10/alumni-reunion.html' title='Alumni Reunion'/><author><name>Steven Skiena</name><uri>http://www.blogger.com/profile/16923380278093754963</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_i23iQcpe1a0/SgvUyvtLvzI/AAAAAAAAAAM/KCJQK4q1nqA/S220/skiena.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_i23iQcpe1a0/SuZZ_41_m3I/AAAAAAAAADQ/TSvUXZkuems/s72-c/lydia-banquet-09.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3744828794262426172.post-6296150309661510259</id><published>2009-07-02T20:34:00.000-07:00</published><updated>2009-07-02T21:31:23.007-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Skiena'/><category scheme='http://www.blogger.com/atom/ns#' term='KDD'/><category scheme='http://www.blogger.com/atom/ns#' term='ethnicity detection'/><category scheme='http://www.blogger.com/atom/ns#' term='ethnic names'/><title type='text'>Ethnicity detection and the origin of Skiena</title><content type='html'>Although trends apparent in single-entity time series are revealing, more subtle analysis is possible by aggregating the signals of   all the entities in a given group (say women, businessmen, Africans, etc.).  But we first need to identify which entities are members of the group we are interested in.&lt;br /&gt;&lt;br /&gt;This motivates our paper ``Name-Ethnicity Classification from Open Sources'', just presented at the &lt;a href="http://www.sigkdd.org/kdd2009/"&gt;15th ACM SIGKDD Conf. on Knowledge Discovery and Data Mining&lt;/a&gt; in Paris, France.   We developed a statistical classifier/HMM to map person names to likely ethnicities.    Given `Hu Jintai', we want to return `Chinese'.  Given `Dimitry Medvedev', we want to return `Russian' or at least `Eastern European'.  Given `Abdullah bin Abdul Aziz', we want to return `Muslim'.&lt;br /&gt;&lt;br /&gt;Our classifier is not perfect, but it gives us a tool to answer questions like ``How did news sentiment towards Muslims change in the wake of 9/11?'' or ``How do attitudes towards Hispanics vary across the U.S.?''.    Our results are quite interesting, and believe that entity-based ethnicity and nationality classification has many applications in social science research.&lt;br /&gt;My coauthors were  Anurag Ambekar, Charles Ward, Jahangir Mohammed, and Swapna Male.&lt;br /&gt;&lt;br /&gt;I encourage you to play with our ethnic name classifier at &lt;a href="http://www.textmap.com/ethnicity"&gt;http://www.textmap.com/ethnicity&lt;/a&gt; to see how it works.  &lt;br /&gt;&lt;br /&gt;This week I was thrilled to see the official 1920 and 1930 census pages for my grandparents.  The original Skiena (my grandfather Sol) arrived in the U.S. in 1911, but with no clear English spelling for his name.   Indeed, no less than &lt;span style="font-style:italic;"&gt;four&lt;/span&gt; distinct spellings are relevant for interpreting these census pages.  I list them with the primary and secondary ethnicities identified by our classifier to give you some idea of our purported roots:&lt;br /&gt;&lt;ul&gt;&lt;br /&gt;&lt;li&gt;Sheaner - Jewish (0.76) and British (0.22)&lt;br /&gt;&lt;li&gt;Skeaner - British (0.86) and Jewish (0.08)&lt;br /&gt;&lt;li&gt;Sciaaner - Hispanic (0.48) and Nordic (0.24)&lt;br /&gt;&lt;li&gt;Skiena - Eastern European (0.67) and British (0.21)&lt;br /&gt;&lt;/ul&gt;&lt;br /&gt;&lt;br /&gt;My Grandfather is from Russia, so Eastern European is indeed the correct answer.     For the record, this is a very hard case for the classifier; usually we are also given first names and deal with more common surnames.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3744828794262426172-6296150309661510259?l=textmap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://textmap.blogspot.com/feeds/6296150309661510259/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://textmap.blogspot.com/2009/07/ethnicity-detection-and-origin-of.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3744828794262426172/posts/default/6296150309661510259'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3744828794262426172/posts/default/6296150309661510259'/><link rel='alternate' type='text/html' href='http://textmap.blogspot.com/2009/07/ethnicity-detection-and-origin-of.html' title='Ethnicity detection and the origin of Skiena'/><author><name>Steven Skiena</name><uri>http://www.blogger.com/profile/16923380278093754963</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_i23iQcpe1a0/SgvUyvtLvzI/AAAAAAAAAAM/KCJQK4q1nqA/S220/skiena.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3744828794262426172.post-191190197592741279</id><published>2009-06-18T16:58:00.000-07:00</published><updated>2009-06-18T20:51:00.058-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Mousavi'/><category scheme='http://www.blogger.com/atom/ns#' term='Ahmadinejad'/><category scheme='http://www.blogger.com/atom/ns#' term='Iran election'/><title type='text'>Ahmadinejad Goes Down!</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_i23iQcpe1a0/SjsHWlfun1I/AAAAAAAAACw/hKPOz1qOyTE/s1600-h/Ahmadinejad.jpg"&gt;&lt;img style="float:right; margin:0 0 10px 10px;cursor:pointer; cursor:hand;width: 400px; height: 304px;" src="http://2.bp.blogspot.com/_i23iQcpe1a0/SjsHWlfun1I/AAAAAAAAACw/hKPOz1qOyTE/s400/Ahmadinejad.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5348877067036303186" /&gt;&lt;/a&gt;&lt;br /&gt;Like much of the world, I have been following the presidential election in Iran and its aftermath with great excitement.   The election was crudely stolen by the incumbent Ahmadinejad after surprising open campaign, but the people of Iran have bravely taken to the streets in support of Mousavi -- the real winner.    It is too early to tell who will prevail in this bare-knuckle power struggle, but you get probably guess who I am rooting for.&lt;br /&gt;&lt;br /&gt;The sentiment polarity graph tells the interesting story.   Ebbs and flows of the campaign are reflected before the vote, particularly Ahmadinejad's widely-panned debate performance on June 4 and the increasing sense that Mousavi could win.   The election on June 12 drew enormous turnout followed too quickly by the announcement of a landslide Ahmadinejad victory.   But within 24 hours, Mousavi's claim of fraud gains credence, and Ahmadinejad's sentiment (at least) goes down.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3744828794262426172-191190197592741279?l=textmap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://textmap.blogspot.com/feeds/191190197592741279/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://textmap.blogspot.com/2009/06/ahmadinejad-goes-down.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3744828794262426172/posts/default/191190197592741279'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3744828794262426172/posts/default/191190197592741279'/><link rel='alternate' type='text/html' href='http://textmap.blogspot.com/2009/06/ahmadinejad-goes-down.html' title='Ahmadinejad Goes Down!'/><author><name>Steven Skiena</name><uri>http://www.blogger.com/profile/16923380278093754963</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_i23iQcpe1a0/SgvUyvtLvzI/AAAAAAAAAAM/KCJQK4q1nqA/S220/skiena.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_i23iQcpe1a0/SjsHWlfun1I/AAAAAAAAACw/hKPOz1qOyTE/s72-c/Ahmadinejad.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3744828794262426172.post-1877757057740097569</id><published>2009-06-11T02:05:00.000-07:00</published><updated>2009-06-11T02:32:06.205-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='NSF'/><category scheme='http://www.blogger.com/atom/ns#' term='SBIR'/><category scheme='http://www.blogger.com/atom/ns#' term='General Sentiment'/><title type='text'>SBIR Award for General Sentiment</title><content type='html'>&lt;a href="http://www.generalsentiment.com"&gt;General Sentiment&lt;/a&gt;, the startup company which licenced Lydia technology from Stony Brook, has just received a $100,000 Small Business Innovative Research (SBIR) phase I grant from the National Science Foundation (NSF) entitled `` Identifying and Interpreting Trends through News/Blog Analysis''.&lt;br /&gt;&lt;br /&gt;Special thanks go to Barack Obama, as this award was funded under the American Recovery and Reinvestment Act of 2009 (ARRA).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3744828794262426172-1877757057740097569?l=textmap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://textmap.blogspot.com/feeds/1877757057740097569/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://textmap.blogspot.com/2009/06/sbir-award-for-general-sentiment.html#comment-form' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3744828794262426172/posts/default/1877757057740097569'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3744828794262426172/posts/default/1877757057740097569'/><link rel='alternate' type='text/html' href='http://textmap.blogspot.com/2009/06/sbir-award-for-general-sentiment.html' title='SBIR Award for General Sentiment'/><author><name>Steven Skiena</name><uri>http://www.blogger.com/profile/16923380278093754963</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_i23iQcpe1a0/SgvUyvtLvzI/AAAAAAAAAAM/KCJQK4q1nqA/S220/skiena.jpg'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3744828794262426172.post-6684543708751045924</id><published>2009-06-11T01:10:00.000-07:00</published><updated>2009-06-11T01:57:31.690-07:00</updated><title type='text'>Lydia at the Hadoop Summit!</title><content type='html'>My student Mikhail Bautin just presented his work on the Lydia processing architecture to over 700 people at the &lt;a href="http://developer.yahoo.com/events/hadoopsummit09/"&gt;2009 Hadoop Summit&lt;/a&gt; in Santa Clara, CA.  He found it to be a great conference (better he says than the more academic venues I've sent him to before).   There is enormous energy in the Hadoop world today as it becomes the primary system for web-type parallel processing and cloud computing in general.&lt;br /&gt;&lt;br /&gt;Hadoop is a distributed processing system inspired by Google's MapReduce paradigm.  Computations proceed in rounds of mapping (sending data packets to particular machines based on identification keys) and reduce (crunching these tuples down to a particular result).   Such problems arise frequently in Lydia.  For example, we can imagine mapping all the sentences in our news corpus keyed to the name of the entities within it, so we can then use reduce to count the number of occurrences of each entity and the other entities it is juxtaposed with.  Hadoop manages all the messy stuff of parallel processing, like load balancing and distributed data structures and the like.&lt;br /&gt;&lt;br /&gt;It is hard to overstate the importance that Hadoop has made to the Lydia project, efforts which are now rapidly bearing fruit.   Expect to hear me soon report on results from enormous blog depositories we have spidered for years yet never previously been able to analyze.  Further, we now regularly do large scale analysis &lt;span style="font-style:italic;"&gt;of our analysis&lt;br /&gt;&lt;/span&gt; using Hadoop, for example in studying trends across all entities across nationalities or ethnic groups.&lt;br /&gt;&lt;br /&gt;It is equally hard to overstate the efforts Mikhail has made getting us there with our system.  I can ask nothing more of my other students except that they try to "be like Mike".&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3744828794262426172-6684543708751045924?l=textmap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://textmap.blogspot.com/feeds/6684543708751045924/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://textmap.blogspot.com/2009/06/lydia-at-hadoop-summit.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3744828794262426172/posts/default/6684543708751045924'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3744828794262426172/posts/default/6684543708751045924'/><link rel='alternate' type='text/html' href='http://textmap.blogspot.com/2009/06/lydia-at-hadoop-summit.html' title='Lydia at the Hadoop Summit!'/><author><name>Steven Skiena</name><uri>http://www.blogger.com/profile/16923380278093754963</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_i23iQcpe1a0/SgvUyvtLvzI/AAAAAAAAAAM/KCJQK4q1nqA/S220/skiena.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3744828794262426172.post-7289186050503155246</id><published>2009-05-21T20:04:00.000-07:00</published><updated>2009-06-16T01:43:02.214-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='sentiment analysis'/><category scheme='http://www.blogger.com/atom/ns#' term='Lori Drew'/><title type='text'>The World's Worst Person?</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_i23iQcpe1a0/SjdKtegZVPI/AAAAAAAAACo/1FgQOOQVUuU/s1600-h/rouge2.jpg"&gt;&lt;img style="float:right; margin:0 0 10px 10px;cursor:pointer; cursor:hand;width: 300px; height: 270px;" src="http://4.bp.blogspot.com/_i23iQcpe1a0/SjdKtegZVPI/AAAAAAAAACo/1FgQOOQVUuU/s400/rouge2.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5347825227669263602" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_i23iQcpe1a0/SjdKtMGJJUI/AAAAAAAAACg/i9zM5WWiR6s/s1600-h/rouge1.jpg"&gt;&lt;img style="float:right; margin:0 0 10px 10px;cursor:pointer; cursor:hand;width: 300px; height: 270px;" src="http://3.bp.blogspot.com/_i23iQcpe1a0/SjdKtMGJJUI/AAAAAAAAACg/i9zM5WWiR6s/s400/rouge1.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5347825222727312706" /&gt;&lt;/a&gt;&lt;br /&gt;A certain fascination exists with identifying the public figure with the lowest overall sentiment ranking.  It tells us something about a given society to discover who the most demonized figure is, the person spoken about with the greatest anger or rancor.&lt;br /&gt;&lt;br /&gt;Although our system does not make it easy to extract people by negative sentiment, any active news reader should be able to construct a rogue gallery of evil and destructive people -- like mass murders Adolf Hitler and Osama bin Laden, or the swindler Bernard Madoff.  But as shown by these sentiment polarity charts, all their reported vileness pales next to Lori Drew.&lt;br /&gt;&lt;br /&gt;Who is Lori Drew?  She is a mother whose cyber-bullying drove a fragile 13-year old girl (a rival of her daughter) to suicide, through messages sent from a fake MySpace profile.  The public outrage over this pushes many buttons -- from the general hatred of bullying defenseless children, public attitudes against overzealous parenting, fear of social network sites, and more.  As of this writing the judge is trying to appropriately sentence her, a task complicated by the fact that she has not been convicted of anything stronger than violating the MySpace terms of service.&lt;br /&gt;&lt;br /&gt;Now don't get me wrong.  She is clearly guilty of such horrifying behavior that it is difficult to see how she can live with herself.  But it is obviously an overreaction to mention her in the company of Hitler and Bin Laden. &lt;br /&gt;&lt;br /&gt;Realize that sentiment analysis aims at capturing what the world is thinking, not what it necessarily should be thinking or that which is objectively true.   Lydia sentiment signals measure interesting social and cultural phenomena, but their proper interpretation requires an understanding of context and the nature of the underlying news sources.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3744828794262426172-7289186050503155246?l=textmap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://textmap.blogspot.com/feeds/7289186050503155246/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://textmap.blogspot.com/2009/05/worlds-worst-person.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3744828794262426172/posts/default/7289186050503155246'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3744828794262426172/posts/default/7289186050503155246'/><link rel='alternate' type='text/html' href='http://textmap.blogspot.com/2009/05/worlds-worst-person.html' title='The World&apos;s Worst Person?'/><author><name>Steven Skiena</name><uri>http://www.blogger.com/profile/16923380278093754963</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_i23iQcpe1a0/SgvUyvtLvzI/AAAAAAAAAAM/KCJQK4q1nqA/S220/skiena.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_i23iQcpe1a0/SjdKtegZVPI/AAAAAAAAACo/1FgQOOQVUuU/s72-c/rouge2.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3744828794262426172.post-910151531233314645</id><published>2009-05-21T06:34:00.000-07:00</published><updated>2009-05-21T07:57:57.969-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='juxtapositions'/><category scheme='http://www.blogger.com/atom/ns#' term='heatmaps'/><category scheme='http://www.blogger.com/atom/ns#' term='Edison Chen'/><title type='text'>Edison Chen and the Computer News Processing</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_i23iQcpe1a0/ShVb9psPfDI/AAAAAAAAABg/IferZVPGGeU/s1600-h/edison-chen-sentiment-timeseries.jpg"&gt;&lt;img style="float:left; margin:0 10px 10px 0;cursor:pointer; cursor:hand;width: 400px; height: 188px;" src="http://1.bp.blogspot.com/_i23iQcpe1a0/ShVb9psPfDI/AAAAAAAAABg/IferZVPGGeU/s400/edison-chen-sentiment-timeseries.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5338274048039156786" /&gt;&lt;/a&gt;&lt;br /&gt;One of the pleasures of my sabbatical year in Hong Kong has been reading the local English newspaper (The South China Morning Post) and getting exposed to a new universe of locally-interesting characters.    &lt;span style="font-style:italic;"&gt;Edison Chen and the Computer Technician&lt;/span&gt; has been my favorite story of the year.  Lydia news analysis provides very interesting insights into the story and by proxy the culture here in Hong Kong.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://en.wikipedia.org/wiki/Edison_Chen"&gt;Edison Chen&lt;/a&gt;, son of a local tycoon, became a Cantopop (Cantonese pop music) singer and general entertainment/media personality.  Think a male Paris Hilton, with a similar set of unseemly incidents involving various fights with people and, in one case, a taxi.  This explains his generally negative sentiment scores (shown above) up to January 2008, when he took his computer in for repairs and hit the big time.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_i23iQcpe1a0/ShVjflcYfPI/AAAAAAAAABo/7zDaBkwCzro/s1600-h/edison-chen-juxtapositions.jpg"&gt;&lt;img style="float:left; margin:0 10px 10px 0;cursor:pointer; cursor:hand;width: 374px; height: 398px;" src="http://2.bp.blogspot.com/_i23iQcpe1a0/ShVjflcYfPI/AAAAAAAAABo/7zDaBkwCzro/s400/edison-chen-juxtapositions.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5338282327595842802" /&gt;&lt;/a&gt;The computer technician (Ho Chun Sze) found a nice collection of sex photos of Mr. Chen with several female Cantopop stars, actresses, and models (Gillian Chung, Cecilia Cheung, Bobo Chan).  The technician showed them to his girlfriend, who showed them to somebody else, and then they ended up on the Internet.  All of these figures show up prominently as statistically juxtaposed with Edison Chen. In the wake of this scandal, Edison Chen's sentiment score suddenly turns &lt;span style="font-style:italic;"&gt;positive&lt;/span&gt;, resulting from respect for his healthy social life, approval of his apologetic behavior (including retiring from the Hong Kong scene to live quietly in Vancouver), and sympathy for the fact that he was ultimately blameless for the release of the photos.  &lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_i23iQcpe1a0/ShVpyre8x7I/AAAAAAAAABw/OPvc-cvuQx0/s1600-h/edison-chen-freq-map.jpg"&gt;&lt;img style="float:left; margin:0 10px 10px 0;cursor:pointer; cursor:hand;width: 400px; height: 288px;" src="http://2.bp.blogspot.com/_i23iQcpe1a0/ShVpyre8x7I/AAAAAAAAABw/OPvc-cvuQx0/s400/edison-chen-freq-map.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5338289252704503730" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_i23iQcpe1a0/ShVqJEDhsCI/AAAAAAAAAB4/SjLXdk0SxKo/s1600-h/edison-chen-sent-map.jpg"&gt;&lt;img style="float:right; margin:0 0 10px 10px;cursor:pointer; cursor:hand;width: 400px; height: 288px;" src="http://1.bp.blogspot.com/_i23iQcpe1a0/ShVqJEDhsCI/AAAAAAAAAB4/SjLXdk0SxKo/s400/edison-chen-sent-map.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5338289637257490466" /&gt;&lt;/a&gt;&lt;br /&gt;Perhaps most interesting of all are the international heatmaps displaying the spatial reference frequency (left) and sentiment (right).  The frequency map shows the most intense interest in China, with secondary interest in countries with significant Cantonese communities (Canada and Australia). Chen's Chinese name was the number 1 search term in China in 2008. The sentiment map shows a negative reputation in all countries &lt;span style="font-style:italic;"&gt;except&lt;/span&gt; China!  &lt;br /&gt;Indeed, Chen finished second to Barack Obama in the &lt;a href="http://www.news24.com/News24/Entertainment/Celebrities/0,,2-1225-2108_2447759,00.html"&gt;Hong Kong Person of 2008 poll&lt;/a&gt; by RTHK radio, with almost 30% of the vote.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3744828794262426172-910151531233314645?l=textmap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://textmap.blogspot.com/feeds/910151531233314645/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://textmap.blogspot.com/2009/05/edison-chen-and-computer-news.html#comment-form' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3744828794262426172/posts/default/910151531233314645'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3744828794262426172/posts/default/910151531233314645'/><link rel='alternate' type='text/html' href='http://textmap.blogspot.com/2009/05/edison-chen-and-computer-news.html' title='Edison Chen and the Computer News Processing'/><author><name>Steven Skiena</name><uri>http://www.blogger.com/profile/16923380278093754963</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_i23iQcpe1a0/SgvUyvtLvzI/AAAAAAAAAAM/KCJQK4q1nqA/S220/skiena.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_i23iQcpe1a0/ShVb9psPfDI/AAAAAAAAABg/IferZVPGGeU/s72-c/edison-chen-sentiment-timeseries.jpg' height='72' width='72'/><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3744828794262426172.post-3025373372239556537</id><published>2009-05-18T20:54:00.000-07:00</published><updated>2009-05-18T21:43:49.945-07:00</updated><title type='text'>Sentiment: United States vs. China</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_i23iQcpe1a0/ShIzWwA6fkI/AAAAAAAAABI/ORGMQncxluU/s1600-h/us-sentiment.jpg"&gt;&lt;img style="float:left; margin:0 10px 10px 0;cursor:pointer; cursor:hand;width: 400px; height: 90px;" src="http://2.bp.blogspot.com/_i23iQcpe1a0/ShIzWwA6fkI/AAAAAAAAABI/ORGMQncxluU/s400/us-sentiment.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5337384974326332994" /&gt;&lt;/a&gt;&lt;br /&gt;Today's graphs give me a very queasy feeling.   I decided to compare the sentiment in the United States vs. China, and let's say the results are not very good for the home team.&lt;br /&gt;&lt;br /&gt;In particular, United States sentiment has been highly negative since the beginnings of the dailies depository in November 2004, a rating I would like to attribute at least initially to the war in Iraq and the Bush administration in general.   Indeed, the months marking Obama's election (November 2008) and his inauguration (January 2009) represent peaks in sentiment despite the economic crisis. But what floored me was the sharp negative spike in April 2009.  I attribute this to kvetching about the long-term strength of the dollar, but regardless U.S. sentiment polarity hit a new dailies low during this month.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_i23iQcpe1a0/ShI4VlisGiI/AAAAAAAAABY/srUbpr44M90/s1600-h/china-sentiment.jpg"&gt;&lt;img style="float:left; margin:0 10px 10px 0;cursor:pointer; cursor:hand;width: 400px; height: 91px;" src="http://3.bp.blogspot.com/_i23iQcpe1a0/ShI4VlisGiI/AAAAAAAAABY/srUbpr44M90/s400/china-sentiment.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5337390451893475874" /&gt;&lt;/a&gt;&lt;br /&gt;By comparison, check out the sentiment graph for China over the same period.  It has been generally positive since November 2004, with the exception of the Sichuan Earthquake in Spring 2008.   The big positive spike is August 2008 results from the enormously successful Beijing Olympics, which also give a nice boost to the U.S. that month.  Negative sentiment from the world economic crisis rules the next several months, but the Chinese funk lifted in April as the U.S. continues to descend.&lt;br /&gt;&lt;br /&gt;Now these generally negative U.S. and positive China sentiment reflect the longer term time-series from the thirty-year archival depository.   Negative news always gets more play than positive news in U.S. newspapers, so it is the changes in sentiment which are more revealing than the absolute sign.   The biggest plunge in U.S. sentiment in this period occurred (appropriately) September 2001.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3744828794262426172-3025373372239556537?l=textmap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://textmap.blogspot.com/feeds/3025373372239556537/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://textmap.blogspot.com/2009/05/sentiment-united-states-vs-china.html#comment-form' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3744828794262426172/posts/default/3025373372239556537'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3744828794262426172/posts/default/3025373372239556537'/><link rel='alternate' type='text/html' href='http://textmap.blogspot.com/2009/05/sentiment-united-states-vs-china.html' title='Sentiment: United States vs. China'/><author><name>Steven Skiena</name><uri>http://www.blogger.com/profile/16923380278093754963</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_i23iQcpe1a0/SgvUyvtLvzI/AAAAAAAAAAM/KCJQK4q1nqA/S220/skiena.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_i23iQcpe1a0/ShIzWwA6fkI/AAAAAAAAABI/ORGMQncxluU/s72-c/us-sentiment.jpg' height='72' width='72'/><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3744828794262426172.post-1194550343020700943</id><published>2009-05-15T04:16:00.000-07:00</published><updated>2009-05-15T20:53:08.911-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Gordon Brown'/><category scheme='http://www.blogger.com/atom/ns#' term='United Kingdom'/><category scheme='http://www.blogger.com/atom/ns#' term='political science'/><title type='text'>British News Processing</title><content type='html'>I gave a demo today to Sean Carey, a professor of political science at the University of Sheffield.   He is interested in using our news analysis in cahoots with polling data from the British Election Study, to better understand how voters make their decision, and why.   They are gearing up for the next national election, likely to occur in Spring 2010.&lt;br /&gt;&lt;br /&gt;Since their study revolves around why British voters vote the way they do, they are only concerned with data from British newspapers.   The source set tab of TextMap Access makes it easy to create a source set from the dailies depository consisting of all newspapers from the United Kingdom.   Once this source set is named and registered, it will appear as an entry as a new depository ready for use in the frequency and sentiment tabs.&lt;br /&gt;&lt;br /&gt;One interesting discovery in playing with it was the fraction of references to a local entity like `Gordon Brown' that came from British sources.  The answer proved to be about 60%, which is quite impressive considering that less than 10% of our total spidered sources are from the United Kingdom.   But it makes sense that he would be what the local readership is interested in...&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_i23iQcpe1a0/Sg43A_upviI/AAAAAAAAABA/pXncYuZiyw8/s1600-h/juxtapositions-GB.jpg"&gt;&lt;img style="float:left; margin:0 10px 10px 0;cursor:pointer; cursor:hand;width: 400px; height: 252px;" src="http://1.bp.blogspot.com/_i23iQcpe1a0/Sg43A_upviI/AAAAAAAAABA/pXncYuZiyw8/s400/juxtapositions-GB.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5336263098727251490" /&gt;&lt;/a&gt;&lt;br /&gt;Particularly amusing was to look at the entities juxtaposed with Gordon Brown at different type scales. Tony Blair, who he served faithfully as Chancellor of the Exchequer, proves his strongest association over the full dailies depository (left column). The past year (center column) more strongly reflects his activities as Prime Minister, including interactions with world leaders (Obama, Sarkozy, Merkel).  The strongest associations over the past month (right) column reflect recent activities.  We were puzzled a bit by the strong association with Carol Ann Duffy, but a little reading revealed that Brown had just had appointed her as the first female Poet Laureate.&lt;br /&gt;&lt;br /&gt;One minor complication of British news processing is that the spelling and word usage is slightly different from what is used in the United States.   The lexical resources we employ both British and American spellings, and I expect that our NLP performance will be quite similar on British texts.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3744828794262426172-1194550343020700943?l=textmap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://textmap.blogspot.com/feeds/1194550343020700943/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://textmap.blogspot.com/2009/05/british-news-processing.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3744828794262426172/posts/default/1194550343020700943'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3744828794262426172/posts/default/1194550343020700943'/><link rel='alternate' type='text/html' href='http://textmap.blogspot.com/2009/05/british-news-processing.html' title='British News Processing'/><author><name>Steven Skiena</name><uri>http://www.blogger.com/profile/16923380278093754963</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_i23iQcpe1a0/SgvUyvtLvzI/AAAAAAAAAAM/KCJQK4q1nqA/S220/skiena.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_i23iQcpe1a0/Sg43A_upviI/AAAAAAAAABA/pXncYuZiyw8/s72-c/juxtapositions-GB.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3744828794262426172.post-4495578639565687862</id><published>2009-05-14T01:51:00.000-07:00</published><updated>2009-05-14T02:38:48.732-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='archival'/><category scheme='http://www.blogger.com/atom/ns#' term='pubmed'/><category scheme='http://www.blogger.com/atom/ns#' term='AIDS'/><title type='text'>Trends in Medline/Pubmed</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_i23iQcpe1a0/SgveWbaHl3I/AAAAAAAAAAw/5skGVJ740o4/s1600-h/breast-cancer.jpg"&gt;&lt;img style="float:left; margin:0 10px 10px 0;cursor:pointer; cursor:hand;width: 320px; height: 160px;" src="http://2.bp.blogspot.com/_i23iQcpe1a0/SgveWbaHl3I/AAAAAAAAAAw/5skGVJ740o4/s320/breast-cancer.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5335602660446279538" /&gt;&lt;/a&gt;&lt;br /&gt;Medline/Pubmed is a database of the abstracts from over 17 million published journal articles in the biomedical sciences.  Processing such a corpus is short work for the current version of the Lydia system, taking only a day or so on our 28-node cluster computer.  Our Medline depository provides a good example of how our news/blog processing system can be easily applied to any large-scale text corpus, with interesting results.  Here are two interesting discoveries from my explorations this afternoon. &lt;br /&gt;  &lt;br /&gt;One of the most frequent entities in this depository is cancer, and one of the most frequent cancers is breast cancer.  The figure above shows the pubmed frequency graph and rugplots for this disease.   The log-scale frequency graph shows the relentless exponential growth in research on breast cancer since the mid-1970's.  The regular, small scale bumps over the years reflect the periodicities with which journals are issues, such as quarterly or semiannually.&lt;br /&gt;&lt;br /&gt;The rug plot (shown below the time series) shows the distribution of articles identified as news, business, sports, entertainment, or other by our statistical classification methods.   Now these classifiers were tuned for news articles, and we do not expect too many sports/entertainment articles appearing in scientific journals (at least the ones I read).  But still the results are quite interesting.   There is a clear transition in the distribution starting in 1975, when the systematic collection of full text abstracts began.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_i23iQcpe1a0/Sgvl9sTGdkI/AAAAAAAAAA4/cMl4MrbVHls/s1600-h/archival-aids.jpg"&gt;&lt;img style="float:left; margin:0 10px 10px 0;cursor:pointer; cursor:hand;width: 320px; height: 75px;" src="http://2.bp.blogspot.com/_i23iQcpe1a0/Sgvl9sTGdkI/AAAAAAAAAA4/cMl4MrbVHls/s320/archival-aids.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5335611031576540738" /&gt;&lt;/a&gt;My other experiment involved a sentiment plot for AIDS since the beginnings of the epidemic around 1982.  Both the pubmed and archival depositories show the sentiment polarity of AIDS gradually but definitely drifting towards greater neutrality.  The scientific sentiment represented by pubmed has improved from -0.72 in March 1983 to -0.58 in December 2009.   The public sentiment about AIDS has risen from -0.78 to -0.48 over the same period.  Now AIDS remains a horrible, incurable disease, but it has become regarded more as a chronic condition which can be treated than a deadly plague -- and our sentiment metrics are sensitive enough to pick up on this trend.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3744828794262426172-4495578639565687862?l=textmap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://textmap.blogspot.com/feeds/4495578639565687862/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://textmap.blogspot.com/2009/05/trends-in-medlinepubmed.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3744828794262426172/posts/default/4495578639565687862'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3744828794262426172/posts/default/4495578639565687862'/><link rel='alternate' type='text/html' href='http://textmap.blogspot.com/2009/05/trends-in-medlinepubmed.html' title='Trends in Medline/Pubmed'/><author><name>Steven Skiena</name><uri>http://www.blogger.com/profile/16923380278093754963</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_i23iQcpe1a0/SgvUyvtLvzI/AAAAAAAAAAM/KCJQK4q1nqA/S220/skiena.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_i23iQcpe1a0/SgveWbaHl3I/AAAAAAAAAAw/5skGVJ740o4/s72-c/breast-cancer.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3744828794262426172.post-5657432705635832361</id><published>2009-05-13T23:43:00.000-07:00</published><updated>2009-05-14T00:43:27.954-07:00</updated><title type='text'>Welcome to the TextMap Blog!</title><content type='html'>Hello World!   This is the first posting of a blog on developments revolving around the Lydia / TextMap news and blog analysis project at Stony Brook University.    These will include:&lt;div&gt;&lt;ul&gt;&lt;li&gt;Descriptions of newly available functionality on the TextMap website&lt;/li&gt;&lt;li&gt;Interesting little discoveries on how the world works, derived from TextMap Access data.&lt;/li&gt;&lt;li&gt;Reports on social science research based on TextMap analysis&lt;/li&gt;&lt;li style="text-align: justify;"&gt;Publication announcements of Lydia-oriented research out of our lab.&lt;/li&gt;&lt;li style="text-align: justify;"&gt;Developments at &lt;a href="http://www.generalsentiment.com/"&gt;General Sentiment LLC&lt;/a&gt;, a startup company based on Lydia technology.&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;The time is right to start this blog, because a lot is now happening in the Lydia/TextMap world. Several interesting new analysis depositories (including a longer and more comprehensive newspaper corpus, PubMed abstracts, patents, and Supreme Court decisions) have just come on line as our infrastructure matures.  Our TextMap Access interface now provides instant access to this vast amount of data and analysis.   I am now spending (wasting?) substantial amounts of time playing with our data, so this blog is the perfect place to relate my discoveries.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Substantial collaborations relying on our analysis have already begun with political scientists, sociologists, and historians, but this is hopefully just the start of several beautiful friendships. Thanks for coming on board.  I look forward to having reading (and making) news together.&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3744828794262426172-5657432705635832361?l=textmap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://textmap.blogspot.com/feeds/5657432705635832361/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://textmap.blogspot.com/2009/05/welcome-to-textmap-blog.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3744828794262426172/posts/default/5657432705635832361'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3744828794262426172/posts/default/5657432705635832361'/><link rel='alternate' type='text/html' href='http://textmap.blogspot.com/2009/05/welcome-to-textmap-blog.html' title='Welcome to the TextMap Blog!'/><author><name>Steven Skiena</name><uri>http://www.blogger.com/profile/16923380278093754963</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_i23iQcpe1a0/SgvUyvtLvzI/AAAAAAAAAAM/KCJQK4q1nqA/S220/skiena.jpg'/></author><thr:total>0</thr:total></entry></feed>
