<?xml version="1.0" encoding="utf-8" ?><rss version="2.0" xml:base="http://transformingfreedom.net/category/technology/search-engine" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>Recordings by search engine</title>
    <link>http://transformingfreedom.net/category/technology/search-engine</link>
    <description></description>
    <language>en</language>
          <item>
    <title>Stephen Wolfram on Wolfram Alpha</title>
    <link>http://transformingfreedom.net/hyperaudio/stephen-wolfram-wolfram-alpha</link>
    <description>&lt;fieldset class=&quot;fieldgroup group-step1&quot;&gt;
&lt;div class=&quot;field-speaker&quot;&gt;

  
    
                        Speaker(s)          
          Stephen Wolfram
      
&lt;/div&gt;

&lt;div class=&quot;field-languages-spoken&quot;&gt;

  
    
                        Language spoken          
          English
      
&lt;/div&gt;

&lt;div class=&quot;field-recdate&quot;&gt;

  
    
                        Date of Recording          
          &lt;span class=&quot;date-display-single&quot;&gt;Wed, 2009-05-27&lt;/span&gt;
      
&lt;/div&gt;

&lt;div class=&quot;field-tags&quot;&gt;

      &lt;span&gt;Tags:&amp;nbsp;&lt;/span&gt;
  
    
          
          Wikipedia
      
          
          Wikipedia
      
          
          en
      
          
          Mathematica
      
          
          NKS
      
          
          NKS
      
          
          Radio Berkman
      
&lt;/div&gt;

&lt;div class=&quot;field-imagefield&quot;&gt;

      &lt;span&gt;Player-image:&amp;nbsp;&lt;/span&gt;
  
    
          
          &lt;img  class=&quot;imagefield imagefield-field_imagefield&quot; width=&quot;200&quot; height=&quot;150&quot; alt=&quot;&quot; src=&quot;http://transformingfreedom.net/files/images/Wolfram-Stephen-flickr-87907765_70f489df88.jpg?1275077759&quot; /&gt;
      
&lt;/div&gt;
&lt;/fieldset&gt;
&lt;fieldset class=&quot;fieldgroup group-step2&quot;&gt;
&lt;div class=&quot;field-teaser&quot;&gt;

  
    
          
          &lt;div class=&quot;teasertext&quot;&gt;
&lt;p&gt;David Weinberger interviews Stephen Wolfram  on his highly praised “computational knowledge engine” &lt;em&gt;Wolfram Alpha&lt;/em&gt; shortly before it was launched publicly for  Radio&amp;nbsp;Berkman.&lt;/p&gt;
&lt;p&gt;“[&amp;#8230;]asking  if we look at the world, the universe as it is, and you know,what are  the kind of underlying primitives, what are the computational,the simple  programs that can potentially drive all of this stuff, andWolfram Alpha  it’s sort of the realization that all this knowledge thatis out there  in this world&amp;nbsp;[&amp;#8230;]”&lt;/p&gt;
&lt;/div&gt;

      
&lt;/div&gt;


&lt;div class=&quot;field-license&quot;&gt;

      &lt;span&gt;License:&amp;nbsp;&lt;/span&gt;
  
    
          
          &lt;p&gt;Image &lt;span class=&quot;caps&quot;&gt;URL&lt;/span&gt;: &lt;a href=&quot;http://www.flickr.com/photos/hybernaut/87907765/&quot; title=&quot;http://www.flickr.com/photos/hybernaut/87907765/&quot;&gt;http://www.flickr.com/photos/hybernaut/87907765/&lt;/a&gt;&lt;br /&gt;License: &lt;a href=&quot;http://creativecommons.org/licenses/by-nc-sa/2.0/deed.en&quot;&gt;&lt;span class=&quot;caps&quot;&gt;CC&lt;/span&gt;  under Attribution-Noncommercial-Share Alike 2.0&amp;nbsp;Generic &lt;/a&gt;&lt;/p&gt;

      
&lt;/div&gt;
&lt;/fieldset&gt;
&lt;fieldset class=&quot;fieldgroup group-step3&quot;&gt;
&lt;div class=&quot;field-transcription&quot;&gt;

  
    
          
          &lt;p&gt;&lt;a class=&quot;timecode&quot; href=&quot;#at_00m00s&quot; name=&quot;at_00m00s&quot;&gt;00:00&lt;/a&gt; Hi  there. You are listening to a special release of Radio Berkman today.  For the first time we&amp;#8217;re releasing full audio from our most recent  interview. David Weinberger&amp;#8217;s interview with Stephen Wolfram, the  founder of the computational knowledge engine Wolfram Alpha, will still  appear as its own episode next week. But for now you can listen to the  full 55 minute interview.&amp;nbsp;Enjoy!&lt;/p&gt;
&lt;p&gt;David Weinberger: &lt;a class=&quot;timecode&quot; href=&quot;#at_00m23s&quot; name=&quot;at_00m23s&quot;&gt;00:23&lt;/a&gt; What is  Wolfram Alpha?&lt;br /&gt;Stephen Wolfram: So, our kind of best three-word  description is “it’s a computational knowledge engine”, so its purpose  is to take the knowledge that has been accumulated over the course of  years, and those parts of it that can be computed with, to be able to  compute with them and to be able to generate answers to specific  questions using a sort of corpus of existing knowledge, which is in the  data, the methods, the models and so on, that had been accumulated - put  them in computational form and be able to answer sort of any specific  question, where the answer can be&amp;nbsp;computed.&lt;/p&gt;
&lt;p&gt;&lt;span class=&quot;caps&quot;&gt;DW&lt;/span&gt;: &lt;a class=&quot;timecode&quot; href=&quot;#at_01m06s&quot; name=&quot;at_01m06s&quot;&gt;01:06&lt;/a&gt; What  reported typical question do you have in mind for this?&lt;br /&gt;&lt;span class=&quot;caps&quot;&gt;SW&lt;/span&gt;: So, there  can be sort of pure formal questions, that are like mathematical  questions, like “What’s the intercall of this or that?”. There could be  questions about the world, like “What’s the population of such and such a  country?”, “What was the weather like on a particular day in a  particular place?”. There can be things that sort of combine some pure  data with computation, like for example, “Where will a particular… What  will the tide be on a particular day at a particular place?”, where you  have to know some geographical data, some historical information about  tides, and then you have to compute the physics of the tides to work out  what will happen. And there are things which are both… where the  answers are both numbers and quantitative kinds of things, and where the  answers are more kind of textual or&amp;nbsp;symbolic.&lt;/p&gt;
&lt;p&gt;&lt;span class=&quot;caps&quot;&gt;DW&lt;/span&gt;: &lt;a class=&quot;timecode&quot; href=&quot;#at_01m59s&quot; name=&quot;at_01m59s&quot;&gt;01:59&lt;/a&gt; The answers  that the site gives are not simple one-word answers or numbers, they  give that too, but you are including a range of types of information  that somebody might want…&lt;br /&gt;&lt;span class=&quot;caps&quot;&gt;SW&lt;/span&gt;: &lt;a class=&quot;timecode&quot; href=&quot;#at_02m10s&quot; name=&quot;at_02m10s&quot;&gt;02:10&lt;/a&gt; Right, so the idea is: “What would an expert  do if you ask them this question, what would they actually tell you,  would they just tell you one answer, would they give you kind of some  background, would they give you some related answers?” Kind of the idea  is to generate a report that sort of gives you both, that gives you sort  of all versions of what you might immediately want to know about a  particular thing. So that would be typically some graphical  presentations, some tables of data, some… maybe some synthesized text,  these kinds of things. It’s sort of a big challenge to sort of do this  kind of automated presentation, where you try and take out and in some  cases where there might be a huge amount of stuff that you could  compute, the problem is, to sort of pick out those things, that are most  kind of immediately cognitively useful to the person who asked that&amp;nbsp;question.&lt;/p&gt;
&lt;p&gt;&lt;span class=&quot;caps&quot;&gt;DW&lt;/span&gt;: &lt;a class=&quot;timecode&quot; href=&quot;#at_03m03s&quot; name=&quot;at_03m03s&quot;&gt;03:03&lt;/a&gt; In the demonstration that I’ve seen, not a  lot of the answer text, the answers that are given, are links out, or  links to more information in fact. Is that on purpose or is that to  come?&lt;br /&gt;&lt;span class=&quot;caps&quot;&gt;SW&lt;/span&gt;: Well, so, it’s hard enough to get kind of within one kind  of coherent system, to sort of build up the things that we think are  worth presenting to people so to speak, without having to worry about  whether some external link goes to something that’s meaningful or not.  Generally what we are trying to do is to try and have this be kind of,  this is giving you kind of the report that you want, there is some  drill-down that you can have, which might be an infinite amount of  drill-down that’s possible from… So, for example, the typical thing  would be, there’s some table of results, and then there’s a little  button that says “more”, you press “more”, you press it once you get  more stuff, you press it again you get even more stuff, you can keep  pressing it sometimes forever, and that’s kind of the main way to kind  of get more information, is something sort of within this controlled  environment that we have set&amp;nbsp;up.&lt;/p&gt;
&lt;p&gt;Now, we also have a few links  out, but we also have a sidebar where we intend to put sort of related  links to things that people might find useful given the particular  questions that they are asking. And often those links will be to things  that are complementary to the kind of sort of direct factual material  that we are giving.&amp;nbsp; Things that, for example, a linking out to, I don’t  know, Wikipedia or something, where there is some narrative description  of some particular topic that kind of complements, the just the facts  kind of results that we’re giving on Wolfram&amp;nbsp;Alpha.&lt;/p&gt;
&lt;p&gt;&lt;span class=&quot;caps&quot;&gt;DW&lt;/span&gt;: &lt;a class=&quot;timecode&quot; href=&quot;#at_04m45s&quot; name=&quot;at_04m45s&quot;&gt;04:45&lt;/a&gt; There aren’t  also a lot of links internally to more of your own results. And so,  (again, just seeing the demonstration, I haven’t played with it and  besides it’s not alive yet) the visual sense seems to be: you’ve got  your answer, it’s a rich answer, there’s lots there, but this is the  definitive endpoint of your inquiry. As opposed to having internal links  that would say, “here is the answer and please explore more click, and  get more… results.&lt;br /&gt;&lt;span class=&quot;caps&quot;&gt;SW&lt;/span&gt;: Right, so the “more”-thing, there really is a  pretty… sort of a pretty elaborated world underneath all this  “more”-buttons and so on. I mean, there is really a lot to explore  there. If it’s a question of “go off and ask another related question,  or a question about some entity that happened to come up here, then yes  there are, there is a pop-up that comes up and there are links that pop  up that you can go and make queries about the entities that appear. I  would say there’s probably more that could be developed about that,  about the way that those pop-ups work and the way that one kind of goes  to explore more things. Our big emphasis has been: once we’re in an area  where we can actually could compute things, we can often compute a lot  of stuff, and the big issue is: what’s actually useful to show somebody,  sort of at the first level? And for the people who have been developing  all these wonderful computations it’s sometimes a little disappointing  when I say “Look, we can only put ten results here on the first thing  people see, and the rest has to be based on drill-down” because, you  know, it’s not useful to give 50 things on one page, nobody is going to  be able to pick out the thing that’s actually useful to them. But so  there’s a pretty rich world of sort of additional stuff that you can get  through just by pressing buttons and pulling down pull-downs for  alternate views of things and so&amp;nbsp;on.&amp;nbsp;&lt;/p&gt;
&lt;p&gt;&lt;span class=&quot;caps&quot;&gt;DW&lt;/span&gt;: &lt;a class=&quot;timecode&quot; href=&quot;#at_06m40s&quot; name=&quot;at_06m40s&quot;&gt;06:40&lt;/a&gt; So how does  it work? It’s pretty impressive. &lt;br /&gt;&lt;span class=&quot;caps&quot;&gt;SW&lt;/span&gt;: There are two key kind of  foundations that make Wolfram Alpha possible, and then there are a  variety of different sort of pieces of technology that have to be  assembled, to create the experience that we are setting up. So in terms  of the sort of the foundations, one of them is Mathematica and the other  is my &lt;span class=&quot;caps&quot;&gt;NKS&lt;/span&gt; – New Kind of Science direction. Mathematica is sort of the  underlying infrastructure, the language, the platform on which the site  is built. I’ve been working on developing Mathematica for 23 years, so  it’s a big thing that has had lots of developments and has had a nice  sort of accelerating pace of developments in recent years, particularly  as sort of all the pieces that we have assembled over the years that  really sort of fit together according to the principles that we have  laid down, sort of seem to be interacting in a nice way, that lets the  system sort of grow very rapidly, so kind of the one important sort of  answer to “How does it work?” is “It’s a big mathematical program.” It’s  quite big, it’s five or six million lines of mathematical code, and  most measurements one can make is perhaps ten, maybe twenty times more  succinct as a programming language than a typical kind of, you know, C,  Java type programming language. So that’s a lot of code but if one was  doing it without the Mathematica platform, I think it is not a  manageable kind of project to be doing at this&amp;nbsp;point.&amp;nbsp;&lt;/p&gt;
&lt;p&gt;&lt;span class=&quot;caps&quot;&gt;DW&lt;/span&gt;: &lt;a class=&quot;timecode&quot; href=&quot;#at_08m23s&quot; name=&quot;at_08m23s&quot;&gt;08:23&lt;/a&gt; So  Mathematica enables you to do mathematical computation but also symbolic  and logical computation as well?&lt;br /&gt;&lt;span class=&quot;caps&quot;&gt;SW&lt;/span&gt;:  Perhaps the biggest…  Mathematica was such a great name for Mathematica when Mathematica was  young, and the thing that one needed to know about what it did was: it  does mathematics on a computer, in a sense it’s been… I had sort of  hoped that the field of mathematics would have grown up to the point  where the kinds of things that Mathematica does are actually considered  mathematics. In a sense it’s a misnomer, and has been for a very long  time because Mathematica has grown far beyond the doing of mathematics,  it’s really a general environment for doing sort of anything that is  formal and technical and so on. It has been kind of a place where we can  fit together sort of all the algorithms that exist for practically  anything. And we have sort of maintained these kind of unifying  principles for the system that have been just wonderful in terms of  developing new capabilities, because it means, you know, we are adding  something that is supposed to be some kind of numerical process for  working out such or such a thing, but, because we have this kind of very  unified system, we are able in the inerts of that numerical thing, to  be calling on all sorts of, you know, computational geometry, or some  other, you know, kind of…, I don’t know, algebraic algorithm, or  anything like this. So it’s really, it’s… Mathematica itself kind of  outgrew the bounds of its name a long time&amp;nbsp;ago.&lt;/p&gt;
&lt;p&gt;&lt;a class=&quot;timecode&quot; href=&quot;#at_10m03s&quot; name=&quot;at_10m03s&quot;&gt;10:03&lt;/a&gt; What’s  happened with Wolfram Alpha, one of the things that… Well, people used  to say, they used to say “This Mathematica thing is obviously really  powerful…” and lots of people around the world, lots of sort of leaders  and &lt;span class=&quot;caps&quot;&gt;RND&lt;/span&gt; in lots of places, lots people being educated, people doing  educating bill use Mathematica a lot. But people said, “This is really a  powerful thing, you really should be able to, you know, why are you  even selling it to people, I mean, why don’t you just you keep it for  yourself and build something great with it?”. Wolfram Alpha is probably  the largest project that’s been done with Mathematica (I don’t  necessarily know everything that everybody has done all around the world  with Mathematica but ) &lt;a class=&quot;timecode&quot; href=&quot;#at_10m45s&quot; name=&quot;at_10m45s&quot;&gt;10:45&lt;/a&gt; what Mathematica provides for us is kind of a  uniform way of representing knowledge in symbolic form, and applying…  and we already have in Mathematica a zillion algorithms, for operating  on that kind of symbolic representation, whether it is in a purely  mathematical way, whether it’s in a sort proving theorems and some  logical way, whether it is representing things graphically in some way,  these kinds of things. So, that’s kind of the technical platform that  makes Wolfram Alpha I think conceivable. From a conceptual point of  view, I think, it’s sort of the &lt;span class=&quot;caps&quot;&gt;NKS&lt;/span&gt; New Kind of Science thing that makes  Wolfram Alpha kind of conceivable. I mean, I’ve kind of wondered, you  know, “Wolfram Alpha” is kind of the third big project I’ve tried to do  in my life, Mathematica and “&lt;span class=&quot;caps&quot;&gt;NKS&lt;/span&gt;” are the other two, I kind of have this  meta-theory, that most people have at most one big idea in their lives,  &lt;a class=&quot;timecode&quot; href=&quot;#at_11m46s&quot; name=&quot;at_11m46s&quot;&gt;11:46&lt;/a&gt; so if  I’ve got three big projects, they are really all the same idea, and they  are really all about kind of taking a sort of a rich set of things that  one is interesting in dealing with and understanding what&amp;nbsp; are the sort  of underlying primitives from which one can build up all those things,  whether it is in the computer-language of Mathematica having sort of  computational primitives that level with build-up programs, whether it  is in the case of &lt;span class=&quot;caps&quot;&gt;NKS&lt;/span&gt;, asking if we look at the world, the universe as  it is, and you know, what are the kind of underlying primitives, what  are the computational, the simple programs that can potentially drive  all of this stuff, and Wolfram Alpha it’s sort of the realization that  all this knowledge that is out there in this world, so there is a  feasibly small sort of set of frameworks, and algorithms and  computations from which one can sort of move out to deal with all this  different kind of&amp;nbsp;things.&lt;/p&gt;
&lt;p&gt;But for me, kind of the paradigm of  &lt;span class=&quot;caps&quot;&gt;NKS&lt;/span&gt;, which is sort of that idea that they really are for the world at  large, these kinds of small programs that are capable to be rich enough  in their behavior to sort of capture a lot of what goes on in the world,  the idea that there are these small programs is the thing that is sort  of from a paradigmatic point of view, makes it to me conceivable that  one can do something with Wolfram Alpha, and that it isn’t just too big,  and that one just doesn’t have to say (and sort of the traditional  kinds of sciences might say)&amp;nbsp; “It’s just too big, the only way to do  this is to have a billion people work on it and there just can’t be sort  of underlying sort of fairly small programmatic principles that they  will build this thing up”. So that for me, at least in my current…, I  have noticed that when I do these projects it takes me about a decade to  actually realize what the real points of what I did was. Sometimes  other people pointed out to me, in much less time than that, but, you  know, as I say, my meta-theory is, people have at most one big idea. So,  all these things are connected and exactly how kind of &lt;span class=&quot;caps&quot;&gt;NKS&lt;/span&gt;, the  paradigm of &lt;span class=&quot;caps&quot;&gt;NKS&lt;/span&gt; flows into Wolfram Alpha, I know some part of it,  there’s probably more to that flow than I yet&amp;nbsp;known.&lt;/p&gt;
&lt;p&gt;&lt;span class=&quot;caps&quot;&gt;DW&lt;/span&gt;: In fact I  am very interested in the relation of &lt;span class=&quot;caps&quot;&gt;NKS&lt;/span&gt;, of new kind of science and  Wolfram Alpha. &lt;a class=&quot;timecode&quot; href=&quot;#at_14m03s&quot; name=&quot;at_14m03s&quot;&gt;14:03&lt;/a&gt; Can you give an example, maybe, of where &lt;span class=&quot;caps&quot;&gt;NKS&lt;/span&gt;-thinking shows up in the  development of…?&lt;br /&gt;&lt;span class=&quot;caps&quot;&gt;SW&lt;/span&gt;: Right, so I mean, for example, a very  straightforward thing is linguistic analysis of inputs. So, we have… It  might seem that you…, it’s just too messy, too complicated, there is too  many possibilities. The fact is that what we and up with all this  fairly small simple rules that we often deduce from looking at some  giant corpus of information from the web and giant collection of ways  that people say things, we try and abstract these fairly simple rules,  and then we have to ask ourselves, you know, “So what consequences do  these rules have?” and you know, if you look at the folks who are  developing the linguistic analysis parts of Wolfram Alpha, for some  strange reason almost all these senior people working on linguistic  analysis for Wolfram Alpha have worked on &lt;span class=&quot;caps&quot;&gt;NKS&lt;/span&gt;. This is a strange  coincidence. Or maybe it isn’t such a coincidence because when you  actually look at work that’s being done, at these images, these  graphics, the job on the screen that look awfully like kind of evolution  of &lt;a class=&quot;timecode&quot; href=&quot;#at_15m10s&quot; name=&quot;at_15m10s&quot;&gt;15:10&lt;/a&gt;[?] or  some such other thing, there are all these blobs of color and so on,  that represent in this case various pieces of potential linguistic  analysis for phrases or whatever else it is. But it is very much sort of  an &lt;span class=&quot;caps&quot;&gt;NKS&lt;/span&gt; approach, if you take this fairly small underlying programs and  you say “What did this build up&amp;nbsp; into and does successfully manage to  correspond to something that is useful for sort of doing linguistic  analysis?” I mean in general in developing Wolfram Alpha and actually  also a lot in developing Mathematica these days, the algorithms that we  have were not engineered by people. There are things that were  essentially found in the computational universe of possible algorithms, I  mean it’s kind of like, I view it as being kind of like an algorithmic  version of mining, I mean, when we deal with sort of standard  engineering technology, you know, we’re used to going out into the world  and going and finding the iron ore and, you know, sort of putting it  together and making something, you know, making a steel thing out of it  or something like that. Now, what &lt;span class=&quot;caps&quot;&gt;NKS&lt;/span&gt; sort of suggests is that out in  the computational universe of possible programs, there’s lots of  interesting quite small programs that exist, and the question is: “Can  we go as humans and find programs that are useful to mine from that  collection of possible programs, that are useful to mine for the  particular technological purposes that we have?”. And that’s something  we’ve ended up doing a lot of, I mean a lot of the pieces of code that  exist in Mathematica and Wolfram Alpha are things that were sort of just  found in the computational universe and turned out to be really useful,  in the same way that people found that, you know, magnetite was really  useful, or liquid crystals were really useful in the material&amp;nbsp;world.&lt;/p&gt;
&lt;p&gt;&lt;span class=&quot;caps&quot;&gt;DW&lt;/span&gt;:  It’s not clear to me how you are using &lt;span class=&quot;caps&quot;&gt;NKS&lt;/span&gt; in the natural language  processing &lt;a class=&quot;timecode&quot; href=&quot;#at_17m00s&quot; name=&quot;at_17m00s&quot;&gt;17:00&lt;/a&gt; the &lt;span class=&quot;caps&quot;&gt;NKS&lt;/span&gt; in the &lt;span class=&quot;caps&quot;&gt;NLP&lt;/span&gt;. It looked like, just on the service of the sorts of  priories that you were entering when you were giving your examples, it  looked like you were doing…, that what was happening was the normal  thing for the sort of query-managing, that there are stock-words that  you take out because they are not interesting words and they were sort  of human… referred to human fluff-finders, and your discussion is sort  of fluff-words that you take out that aren’t necessary, and what you end  up with is a set of core-words where you can induce or deduce a  relationship, and one of your examples was… a query was &lt;span class=&quot;caps&quot;&gt;MSFT&lt;/span&gt;, that…  Microsoft’s stock symbol, - &lt;span class=&quot;caps&quot;&gt;SUN&lt;/span&gt;, in which case we got back interesting &lt;a href=&quot;http://www.wolframalpha.com/input/?i=SUN+MSFT&quot;&gt;data about both of  both companies&lt;/a&gt;, and so somewhere Wolfram Alpha decided that with a  stock-symbol like &lt;span class=&quot;caps&quot;&gt;MSFT&lt;/span&gt;, which it had noticed presumably has some data  base of stock-symbols, that, therefore is a pretty good assumption that  &lt;span class=&quot;caps&quot;&gt;SUN&lt;/span&gt; doesn’t refer to the solar entity but to another company. That’s  what I sort of thought was going on, based on nothing, is there an &lt;span class=&quot;caps&quot;&gt;NKS&lt;/span&gt;  piece of that is doing the parsing or is…&lt;br /&gt;&lt;span class=&quot;caps&quot;&gt;SW&lt;/span&gt;:  Well I think in that  particular case… let’s see what’s going on in that particular case…&lt;br /&gt;&lt;span class=&quot;caps&quot;&gt;DW&lt;/span&gt;:  Well take a… If that’s not a case where the &lt;span class=&quot;caps&quot;&gt;NKS&lt;/span&gt; supplies then take one&amp;nbsp;word…&lt;/p&gt;
&lt;p&gt;&lt;span class=&quot;caps&quot;&gt;SW&lt;/span&gt;: I’ve got to do more work on this because…, it’s funny:  when I talk to the folks who are working on linguistic analysis for  Wolfram Alpha, if you ask them, they’ll say “Yes we know lots on &lt;span class=&quot;caps&quot;&gt;NKS&lt;/span&gt;, we  worked on &lt;span class=&quot;caps&quot;&gt;NKS&lt;/span&gt; for a bunch of years right before working on Wolfram  Alpha” and I’ll say “How you are using &lt;span class=&quot;caps&quot;&gt;NKS&lt;/span&gt; on this”, and they’ll say  that, they’ll kind of scratch their heads and they’ll say “Well I’m not  completely sure.” And then I’ll ask them “Okay, how does this or that  work?” and they’ll start explaining “Well this is a simple algorithm, we  run it, it…, you know, produces all this stuff, and then, you know, you  look at the pictures that are sitting on their screens and so on, and  say “This is awfully like kind of the &lt;span class=&quot;caps&quot;&gt;NKS&lt;/span&gt; paradigm, a small program  trying to figure out what it does, trying to see in this particular case  whether what it does is useful for a particular&amp;nbsp;purpose”&lt;/p&gt;
&lt;p&gt;I think  we have to come up with much more slam-dunk examples of, you know, this  is the case where the way that this particular piece of parsing is done  is sort of a pure &lt;span class=&quot;caps&quot;&gt;NKS&lt;/span&gt;-play, but what’s kind of going on, I mean in  general, you know with parsing of inputs, you know, there are certain  rules, sometimes they are as simple as templates, sometimes they are  more grammatical kind of things that are more like the structure of  mathematical expression, or some, you know, human language expressions,  and sort of the point is, there are these sort of small fragments of  program that are sort of being applied over and over again, and  producing kind of the richness of interpretation that we end up actually  seeing and interpreting language, but we need some better slam-dunk&amp;nbsp;examples.&lt;/p&gt;
&lt;p&gt;&lt;span class=&quot;caps&quot;&gt;DW&lt;/span&gt;: &lt;a class=&quot;timecode&quot; href=&quot;#at_20m17s&quot; name=&quot;at_20m17s&quot;&gt;20:17&lt;/a&gt; So we’ve got Mathematica, we’ve got &lt;span class=&quot;caps&quot;&gt;NKS&lt;/span&gt;, and  there are other components to Wolfram Alpha. There is data for example…&lt;br /&gt;&lt;span class=&quot;caps&quot;&gt;SW&lt;/span&gt;:   You know, the way I see the system from a sort of technology point of  view, there are really kind of four pillars of the technology that we’ve  built: the first of them is kind of curated data, so, you know, we have  kind of… we have got a lot of data that we have organized, correlated,  cleaned up, made computable - not just static data, but also data that’s  kind of flowing in from all sorts of feeds and all this kind of thing.  We’ve kind of built this sort of industrial pipeline for curating data,  where we put a lot of effort into identifying the right sources, we get  the data in, we have a lot of automation and kind of trying to find  correlations, anomalisms and so on of the data, then we feed it to  actual sort of human data curators, we sort of expose it to domain  experts, because one thing I’ve discovered is that if there isn’t a  domain expert somewhere in the pipeline you are going to get the wrong  answer. And then, you know, the goal is to take sort of raw data from  the outside and get it to the point where it is really consistent and  computable with. So that’s kind of the first component, it’s kind of  this curated data&amp;nbsp;component.&lt;/p&gt;
&lt;p&gt;&lt;a class=&quot;timecode&quot; href=&quot;#at_21m34s&quot; name=&quot;at_21m34s&quot;&gt;21:34&lt;/a&gt;Another component is the, kind of the  computational algorithms that go inside. There’s the question of taking  the fruits of science and engineering and all sorts of forms of  analysis, and encoding them in a computational form&amp;nbsp; so that we can  actually use them, when people need to use them, apply them to data that  we have and so on. That has been a big effort of kind of just seeing  just what is… and just what the sciences and so on achieved, and how do  we make it computational and there’s just the big Mathematica program -  five or six million lines of mathematical code now, that encodes sort of  the things that one can compute about the world based on sciences and  so on. That’s kind of the second component of Wolfram&amp;nbsp;Alpha.&amp;nbsp;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;&lt;a class=&quot;timecode&quot; href=&quot;#at_22m18s&quot; name=&quot;at_22m18s&quot;&gt;22:18&lt;/a&gt; The third  component is sort of a linguistic analysis component of how do we take  these funny utterances that humans will feed to Wolfram Alpha and trying  to understand what we can compute from that. And in a sense that that  was a part of…, I mean every part of this project seemed to me like it  might be impossible, this is one of ones where I had sort of an argument  from history that this part would be impossible, which is that people  had been trying to do natural language understanding for a really long  time, and its successes are not that wide-spread, but&amp;nbsp; what I realized  at some key-moment was: what we are trying to do is kind of the opposite  what people have mostly been trying to do. What people mostly have been  trying to do is they will throw a thousand, a millions of pages of sort  of perfectly formed text, linguistic texts, and they ask their  computers “Go and understand all this narratives, go read this books and  tell me what they are about” -kind of thing. Computers have found it  really hard to do that. Our problem is kind of the opposite. There is a  certain sort of things that we know how to compute things with, and that  we represent in sort of symbolic form and if we can figure out, if we  can get something that is in that symbolic form, then we are often  running. &lt;a class=&quot;timecode&quot; href=&quot;#at_23m34s&quot; name=&quot;at_23m34s&quot;&gt;23:34&lt;/a&gt; So really the question is “Can we take this sort of human utterances and  map them to this symbolic forms that we happen to know about?” and then  we can go off and actually do our thing and compute things with them.  So it’s sort of the opposite from the traditional natural language  understanding problem. And it’s something where we have been, I mean I  was not sure whether we would sort of be squashed by the ambiguities and  vicissitudes of language. It has turned out to be surprisingly that all  these different notations and pieces of compact representation that  people use as specialists in particular areas don’t overlap as much as I  thought they did. And kind of a good test now, sometimes we’ll see  these things which we can’t deal with, and I’ll show them to a bunch of  people and no person will be able to figure out what on Earth this was  about. And one of the things we realized is, there may be some perfect  kind of almost mathematical theory of how language should be specified  but it sort of can’t work, because, as we are putting on all these  different kinds of specialist notations, if everyone was arbitrarily  extended in perfect way, they would inevitably collide with each-other.  And so we only get to where, you know, …if we can just do the part that  humans can understand - that’s really what we need to do. Because it’s  kind of the humans who are going to be entering the… you know, in this  sort of… at least this interface to Wolfram Alpha, it’s humans who are  doing the entering of things. So anyway, this sort of third component is  this linguistics analysis component and being able to deal with kind of  freeform linguistic&amp;nbsp;input.&lt;/p&gt;
&lt;p&gt;&lt;a class=&quot;timecode&quot; href=&quot;#at_25m00s&quot; name=&quot;at_25m00s&quot;&gt;25:00&lt;/a&gt; Fourth component is really how you present  the results. Because sometimes there’s just a zillion things you can  compute about something, and there’s the question of “Which actual  results do we present so that it is kind of cognitively useful to the  human who is reading it?”. And there’s sort of a collection of expert  knowledge, heuristics, algorithms, this sort of thing that we call  computational aesthetics which is kind of the question of “When you’re  going to present some piece of data in graphical form, what’s the kind  of way to do that that is most likely to be accessible to a person?”. So  this kind of the fourth component is this kind of automated  presentation component that is important. And one of the things I  realized about the system as we built it is: we couldn’t really shirk on  any one of these four things, we can’t… We need a lot of actual curated  data, otherwise there’s no kind of raw material to do anything with. We  need to be able to compute with it, it’s no good to be able to just say  “Look up the data”, because the chance that somebody just wants to just  look up the particular thing that you put in is not going to be very  great, we need to actually be able to compute from that. And we need to  have a way for people to interact with the system, you know, we need to  be able to… we’re not going to expect them to read the Mathematica  documentation now - even just for Mathematica, it’s 10,000 pages. So for  Wolfram Alpha we’re not going to expect people to read the whole  encyclopedia in order to be able to interact with us, so the only choice  is to have a zero documentation approach, where we’re just dealing with  sort of pure sort of language as people will produce it. And then  finally this presentation of information: we can generate all sorts of  wonderful stuff but if we can’t present it in a form that people can  absorb, that’s also not useful, so we kind of need all these four  components to actually build a system that can be&amp;nbsp;useful.&lt;/p&gt;
&lt;p&gt;&lt;span class=&quot;caps&quot;&gt;DW&lt;/span&gt;: &lt;a class=&quot;timecode&quot; href=&quot;#at_26m57s&quot; name=&quot;at_26m57s&quot;&gt;26:57&lt;/a&gt; You can  actually take one of the components - the data component, and do you  want it to split it in two and there’s a component of that that could  conceivably be counted as important, which is the metadata, the decision  about which information about the information you’re going to capture.  This metadata, which at least much of it is apparently in the form of  anthologies, of connected concepts, may have its own value…&lt;br /&gt;&lt;span class=&quot;caps&quot;&gt;SW&lt;/span&gt;:  -Right, right, well I mean I think&amp;#8230;&amp;nbsp; &lt;br /&gt;&lt;span class=&quot;caps&quot;&gt;DW&lt;/span&gt;: -And its own limitations,  by the way!&lt;br /&gt;&lt;span class=&quot;caps&quot;&gt;SW&lt;/span&gt;: -Right, we didn’t think of the assembly of data into  Wolfram Alpha as being an exercise in anthology, we really thought of  it as just “Let’s figure out how we compute things in all these  different domains”. Obviously, we’re educated folks so we know about,  kind of the idea of anthologies and the various efforts people have made  to sort of do semantic web concept things with anthologies, these kinds  of things, right? &lt;a class=&quot;timecode&quot; href=&quot;#at_27m54s&quot; name=&quot;at_27m54s&quot;&gt;27:54&lt;/a&gt; When you’re going to describe a kind of thing, there’s a question of  “What are the pieces of that description?”, “What are the…” you know, if  you’re going to describe I don’t know, you have some entity…&lt;br /&gt;&lt;span class=&quot;caps&quot;&gt;DW&lt;/span&gt;:  -Say, a country.&lt;br /&gt;&lt;span class=&quot;caps&quot;&gt;SW&lt;/span&gt;: -Right. So there’s a notion of a country there’s  a notion of that country has certain properties and certain attributes,  like what date were you talking about this country at, there’s  questions about you know, are there groups of countries that are kind  of, you know, classes, from which countries in &lt;span class=&quot;caps&quot;&gt;NATO&lt;/span&gt; or something like  that or countries in South America, these are all part of the  anthological structure of the knowledge about countries. And for us, we  have a pretty definite representation of that kind of knowledge about  countries, we need that because otherwise we can’t have it sort of  operate with other kinds of information, we need to be able to have, you  know… This is part of making Wolfram Alpha a sort of manageable thing,  is that there are global frameworks that operate across a bunch of  different domains. And for that we need a sort of organized anthology of  things. Now there are some tricks&amp;nbsp;of…&lt;/p&gt;
&lt;p&gt;&lt;a class=&quot;timecode&quot; href=&quot;#at_29m10s&quot; name=&quot;at_29m10s&quot;&gt;29:10&lt;/a&gt;One of the  big challenges with anthologies is: you have an anthology about  countries: how does it relate to the anthology about cities, how does it  relate to the anthology about this, that and the other, right? We were  approaching this from a very kind of practical engineering point of  view: just building anthologies across hundreds of domains, but we  realized that there were important ways to have these anthologies  communicate with each other, that one of them was actually a very sort  of science-oriented way, which is units of measure of things or physical  quantities that things&amp;nbsp;represent.&lt;/p&gt;
&lt;p&gt;&lt;a class=&quot;timecode&quot; href=&quot;#at_29m49s&quot; name=&quot;at_29m49s&quot;&gt;29:49&lt;/a&gt; So, for  instance, if you have two things that both have units or volume, there’s  a reasonable chance that these things are somehow related to each  other. Not always! There can be a thing where… (I’m not sure I can  generate it for length cubed in particular, but) it’s quite commonly the  case that for something which has sort of the same physical dimensions,  there may be three or four different kinds of concepts that that  relates to.&amp;nbsp; And so for example, a length could be a depth it could be  an altitude it could be a height, these are related concepts and often  they can be dealt with in a sort of an interoperable kind of way: you  can divide two of these things and it’s meaningful, you can perhaps add  them and it’s meaningful. But then there are some other kinds of  purposes for which you can’t combine them and for which you might want  to give some different-related information and so on. So it gets quite  subtle, but we have kind of a reasonable pragmatic way at least to deal  with sort of things like properties that are related and that can be  combined in particular ways, these types of things. &lt;a class=&quot;timecode&quot; href=&quot;#at_30m51s&quot; name=&quot;at_30m51s&quot;&gt;30:51&lt;/a&gt; I mean I  think that… (I would love it to be the case because I’m kind of a  theoretical scientist by life pursuit so to speak) it would be a great  if there was just a grand theory of all this stuff, if we could just  start from scratch as one of the sort of philosophers of all - whether  it was you know from the Roget’s to the Mortimer Adler’s to the whatever  whoever wants it to organize knowledge in a very sort of structured  way. Just sort of say “Top down, let’s just describe the anthology of  the world and let’s just then build a system based on the anthology of  the world that we’ve thereby described”. I don’t expect…, that isn’t  what we did and despite my great predilection for grand theories of  things, that seems really hard and then in some sense not useful,  because &lt;a class=&quot;timecode&quot; href=&quot;#at_31m47s&quot; name=&quot;at_31m47s&quot;&gt;31:47&lt;/a&gt; what we’re dealing with is sort of human-accessible knowledge (and here I  might get a little philosophical on things, but): there’s sort of the  question of “What knowledge is possible?” and “What knowledge is  human-related?”. So for instance from &lt;span class=&quot;caps&quot;&gt;NKS&lt;/span&gt; we’ve learnt that in the  computational universe of possible programs there’s lots of stuff out  there. We as humans have only explored some tiny corners of this. For  example in our mathematics, it is the case that you could invent the  particular mathematics that we’ve pursued as growing from some  particular axiom systems and so on - but we could just say “Let’s throw  it open, let’s look at the space of all possible axiom systems”,  gazillion of possible axiom systems of which the mathematics that we’ve  pursued is only ten of them for example, but there are gazillions of  others and they all have their properties and they all have particular  features. This stuff that has humanly been accessed is only this tiny  corner of this ten axiom system that we kind of grandfathered through  from the Babylonians&amp;nbsp;on.&lt;/p&gt;
&lt;p&gt;&lt;a class=&quot;timecode&quot; href=&quot;#at_32m56s&quot; name=&quot;at_32m56s&quot;&gt;32:56&lt;/a&gt;Now, what happens when we look at all the  other ones? What happens with all this knowledge that hasn’t been  humanly-accessed knowledge? Well one of the things that I learnt from  &lt;span class=&quot;caps&quot;&gt;NKS&lt;/span&gt; is a lot of it is knowledge you just can’t get in any kind of quick  way. That is, you go right into computational irreducibility and  undecidability and so on, there’s lots that you can’t compute about the  world. There’s a certain amount that you can compute that is the parts  that have formed the science that we’ve developed traditionally as  humans, the mathematics we’ve developed, things like that. There’s also a  lot out there that is not accessible in that way. Now the other thing  that is the case is: one problem is you can’t compute it. If you fall  off the parts that are in the language of &lt;span class=&quot;caps&quot;&gt;NKS&lt;/span&gt; “computationally  reduceable”, the parts for which theoretical science works well, if you  fall off those parts - and there’s a lot away from those parts - you  just can’t compute that much, and you just have to follow simulations to  see what will happen, if things can’t quickly work out what will  happen. &lt;a class=&quot;timecode&quot; href=&quot;#at_34m00s&quot; name=&quot;at_34m00s&quot;&gt;34:00&lt;/a&gt; There’s a second problem which is: you can’t linguistically describe  what you’re talking about. We as humans we’ve developed a certain  language, the concepts that we’ve put into our language are concepts  that we have done a lot with, that we have sort of become familiar with,  those are the things that we tend to give words to. And so, one of the  other limitations in a sense is sort of a philosophical level of  limitation, is that Wolfram Alpha - its interface is linguistic, and so  the things that once can easily describe to it are things for which  there is human language. And that sort of is a…, there’s a lot else out  there that is both very hard to compute with and for which there isn’t  sort of a human linguistic representation that’s convenient. I mean when  you see a picture of a [? &lt;a class=&quot;timecode&quot; href=&quot;#at_34m51s&quot; name=&quot;at_34m51s&quot;&gt;34:51&lt;/a&gt;] evolution and somebody says “Quick, describe  it!” we don’t have words to describe these things, it’s a long…, you’d  be writing…, you’d basically eventually just say “Well just look at the  picture, you know, see what it looks like: that!”. &lt;a class=&quot;timecode&quot; href=&quot;#at_35m08s&quot; name=&quot;at_35m08s&quot;&gt;35:08&lt;/a&gt; I think that  in some sense one of my sort of my flip, almost flip, rather  philosophical comments about Wolfram Alpha is this idea of computational  irreducibility, the idea that there’s lots of sophisticated computation  that goes on in the world and nature, that makes it sort of the  complexity of nature, it’s the flip side of that fact that makes Wolfram  Alpha possible, by which I mean that there’s&amp;nbsp; in a sense comparatively  little about which we can talk linguistically or about which we can  compute. In the space of all possible things there’s only this small  corner and that means that when we get to linguistically talk about  things, because there’s only this small corner we can talk about, it is  feasible to actually understand this broad range of language, because it  can only talk about this small corner of things. If we were able to  talk about everything, it would be much harder to understand with a  short description what we could possibly be talking about. I’m sorry  this is getting a very philosophical end of things. &lt;a class=&quot;timecode&quot; href=&quot;#at_36m11s&quot; name=&quot;at_36m11s&quot;&gt;36:11&lt;/a&gt;&lt;br /&gt;&lt;span class=&quot;caps&quot;&gt;DW&lt;/span&gt;: So I  want to ask you about… within that small corner of things that we can  talk about, the possible limitation, back on the metadata side: as many  have pointed out, metadata is politics: the decision about what counts,  what’s interesting to us and ought to be tracked about say, a country or  any other entity is itself a reflection of cultural and possibly  political interest. So if I were to ask Wolfram Alpha a question that I  think it&amp;#8217;s not designed for - such as how many people in the &lt;span class=&quot;caps&quot;&gt;U.S.&lt;/span&gt; have  been tortured, or give me a list of world religions, which is a very  straightforward sort of question, that one’s…, the first question  honestly is politically loaded, the second one is very straightforward  sort of question.&lt;br /&gt;&lt;span class=&quot;caps&quot;&gt;SW&lt;/span&gt;: Will scientology show up on it?&lt;br /&gt;&lt;span class=&quot;caps&quot;&gt;DW&lt;/span&gt;: It’s a  good question. (…) What’s the decision process, what do you do with  that?&lt;br /&gt;&lt;span class=&quot;caps&quot;&gt;SW&lt;/span&gt;: Right. &lt;br /&gt;&lt;span class=&quot;caps&quot;&gt;DW&lt;/span&gt;:  Inevitable decision… &lt;br /&gt;&lt;span class=&quot;caps&quot;&gt;SW&lt;/span&gt;: &lt;a class=&quot;timecode&quot; href=&quot;#at_37m06s&quot; name=&quot;at_37m06s&quot;&gt;37:06&lt;/a&gt; So what  we’re trying to do in this phase is a project. What we’ve done is to  take generally reliable public sources of data so to speak, and try and  make them computable. We’re not going out and individually surveying  people and saying “What is your religion?” or something like that. So  this question of, in the case of for example religions, we’ve actually  we’ve been agonizing about this particular one, and there was a big push  on part of some people on the project that we just shouldn’t put this  in because it’s too much of a mess, right? But then other people pointed  out “Look, it’s more useful to have something than to have nothing”. I  mean it is useful to know roughly how many Christians are there in the  world or are there in South America or something like that, that’s  useful information. Now, even if it’s slightly wrong because there might  be different definitions. “How many animists are there in America?” or  something, right? There might be different definitions of that, you  know, I sometimes think that after my work on &lt;span class=&quot;caps&quot;&gt;NKS&lt;/span&gt; that I should count  myself as an animist… but that’s a separate&amp;nbsp;discussion.&lt;/p&gt;
&lt;p&gt;&lt;span class=&quot;caps&quot;&gt;DW&lt;/span&gt;:  There’s a headline for you: “Wolfram becomes&amp;nbsp;animist”&lt;/p&gt;
&lt;p&gt;&lt;span class=&quot;caps&quot;&gt;SW&lt;/span&gt;:  (Laughs) Right. &lt;a class=&quot;timecode&quot; href=&quot;#at_38m33s&quot; name=&quot;at_38m33s&quot;&gt;38:33&lt;/a&gt; Our decision procedure is probably similar to the decision procedure of  folks who’ve tried to put together encyclopedias or any other kind of  authoritative reference work. We try and do the best we can, we talk to  experts, in our case we have a couple of constraints: we want to have  something… eventually we’re going to have a definite number for  something. Even though we may annotate it with a footnote, at some level  we’re going to have to make a decision, there is a definite number for  this thing. So in the case of religions you can look the &lt;span class=&quot;caps&quot;&gt;CIA&lt;/span&gt; fact book  will have some data on that, there are particular organizations that  collect data on that kind of thing. I don’t know in particular the case  of that example, I know that’s something that we’re actually actively  trying to unscramble right now. With luck, those different sources will  vaguely, at least close to agree. At least to the first or second  decimal place they’ll agree, the first and second significant figure  they’ll&amp;nbsp;agree.&lt;/p&gt;
&lt;p&gt;&lt;span class=&quot;caps&quot;&gt;DW&lt;/span&gt;: Let me give you a different kind of example  cause…, &lt;a class=&quot;timecode&quot; href=&quot;#at_39m37s&quot; name=&quot;at_39m37s&quot;&gt;39:37&lt;/a&gt; for example if I ask for a list of broadband penetration bionation, this  is hardly contested - what constitutes broadband. The &lt;span class=&quot;caps&quot;&gt;FCC&lt;/span&gt; has had one  way of counting broadband that is extremely ingenuous, but people will  use these numbers from your site to the extent to which your site  becomes the place where you go to get the answers that the experts would  have given you, the extent in which it succeeds is going to be used in  these debates. Does it concern you that some of the answers at least,  are in fact based upon good decisions but decisions nonetheless about  what constitutes good data and how they’re categorized, that&amp;nbsp;data?&lt;/p&gt;
&lt;p&gt;&lt;span class=&quot;caps&quot;&gt;SW&lt;/span&gt;:  &lt;a class=&quot;timecode&quot; href=&quot;#at_40m19s&quot; name=&quot;at_40m19s&quot;&gt;40:19&lt;/a&gt; We have  three escape valves for this issue. The first thing is: you type your  input and we’ll typically give what we call the input interpretation.  The first part of output will be the input interpretation. So if you say  something like, you know, “How much do school teachers get paid?” -  it’s a complicated question, right? The input interpretation will  probably say “school teachers… including this and that and the other,  excluding this…, medium wage”. So we’re saying what question we’re  actually answering, that might be, we hope is, close to any useful  variant of the question you asked. But it might not be in a question  where you might have had something different in mind, you might have had  the minimum wage or the maximum wage or something else. Now, typically  in our sort of report that we generate we will give what we know about  the range of wages or some such other thing but when we have to compute  from it if you say “How much do teachers get paid divided by doctors?”,  by the time you’re doing that we’ll give a definite answer for the  ration of medium wages and then we may give a bunch of additional  information about the data that went into it, so the first… in a sense  “escape” valve, first place where we get to put things through a more  precise filter is we have this input interpretation that says “This is  what we’re actually dealing with”. &lt;br /&gt;&lt;a class=&quot;timecode&quot; href=&quot;#at_41m51s&quot; name=&quot;at_41m51s&quot;&gt;41:51&lt;/a&gt; Another  thing is that when we give answers, we try to give pithy footnotes that  give an indication of what it is we’re actually assuming, I mean, a  typical example is some health kind of thing, where there’ll be a  classification, for your overweight for instance. Now, to many people  that’s a value judgement, to some other people it’s a precise medical  distinction. What we’re dealing with is more the precise medical  distinction than the value judgement, and so we’ll try to make a pithy  footnote cause we know if we put a giant essay there, nobody’s going to  read it. It sort of says, I don’t remember that particular one but it’ll  say, “based on &lt;span class=&quot;caps&quot;&gt;CDC&lt;/span&gt; or &lt;span class=&quot;caps&quot;&gt;WHO&lt;/span&gt; guidelines” or some such other thing. So  that’s how we try to deal with these things where there’s a distinction  that’s been made and it sort of, we try to go with whatever the  standard’s body has said and then say what the standard’s body&amp;nbsp;is.&lt;/p&gt;
&lt;p&gt;&lt;a class=&quot;timecode&quot; href=&quot;#at_42m54s&quot; name=&quot;at_42m54s&quot;&gt;42:54&lt;/a&gt; And then  the final thing that we get to do is, we sometimes get to put up a thing  saying “assuming you mean blah”, then we give a bunch of results, and  use “blah” instead. And so that’s another place where we get to if  people have that sort of a controversial thing. One that came up  recently that was a slightly different issue but there’s questions about  “Do you assume this country is part of that country?” and things like  that, well we can assume both cases. There were these funny cases where  we just had one recently that was sort of horrifying was the  pets-versus-food problem. So the issue is, if you type in “rabbit”, what  do you get? And for reasons of bad linguistic paralyzation one was  getting nutrition content. So what one realizes is that in the &lt;span class=&quot;caps&quot;&gt;U.S.&lt;/span&gt; this  is very upsetting to people, and in other countries it’s a different  story. So that’s a case where we get to use &lt;span class=&quot;caps&quot;&gt;GIP&lt;/span&gt; information and we’ll  make the decision in the &lt;span class=&quot;caps&quot;&gt;U.S.&lt;/span&gt; a rabbit is an animal with these  characteristics; I don’t know in the case of rabbits in particular but  let’s assume that they’re popular food in some other country and are not  usually kept as pets there, then the &lt;span class=&quot;caps&quot;&gt;GIP&lt;/span&gt; of that country will cause us  to flip around that assumption. &lt;a class=&quot;timecode&quot; href=&quot;#at_44m33s&quot; name=&quot;at_44m33s&quot;&gt;44:33&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;span class=&quot;caps&quot;&gt;DW&lt;/span&gt;: How open is this site? It’s  open to everybody to use, it’s a free site but the data and the metadata  and the algorithms are seemingly quite valuable, other people could  make use of it. What are the plans for making this…, for opening the  site&amp;nbsp;up?&lt;/p&gt;
&lt;p&gt;&lt;span class=&quot;caps&quot;&gt;SW&lt;/span&gt;: Well so the first thing is that there’s going to  be an &lt;span class=&quot;caps&quot;&gt;API&lt;/span&gt; for people to make different levels of…, have different sort  of kinds of access to the&amp;nbsp;site.&lt;/p&gt;
&lt;p&gt;&lt;span class=&quot;caps&quot;&gt;DW&lt;/span&gt;: So another computer  program can call yours and gets results back?&lt;br /&gt;&lt;span class=&quot;caps&quot;&gt;SW&lt;/span&gt;: Right. Or it could  be another website that is including some pods of output from Wolfram  Alpha on that site providing some input field with some specialized  input field that can go and use the Wolfram Alpha engine and get results  back or it could be some, you know, desktop application program that is  making calls to Wolfram Alpha. The thing one realizes is, it’s a living  breathing thing, it’s not very easy to take pieces of it and just say  to people “Here’s a big piece, go do something useful with it” because,  for example let’s take the data: the data changes all the time the data  is changing. Some data’s changing very rapidly on times it goes over  seconds some data changes on time goes over months - it’s not the case,  we can’t just say sort of “Dump the data out” because the data is really  just the seed for, in many many cases, the data is the seed for an&amp;nbsp;algorithm.&lt;/p&gt;
&lt;p&gt;&lt;span class=&quot;caps&quot;&gt;DW&lt;/span&gt;: &lt;a class=&quot;timecode&quot; href=&quot;#at_46m02s&quot; name=&quot;at_46m02s&quot;&gt;46:02&lt;/a&gt; It’s also frequently data that is proprietary  and licensed, another reason why you can’t simply dump it&amp;nbsp;out.&lt;/p&gt;
&lt;p&gt;&lt;span class=&quot;caps&quot;&gt;SW&lt;/span&gt;:  Right. The idea is that we will provide a resource for anybody who  wants to use what we have, we will provide a good way for them as humans  to use it, or as humans using programs to have their programs use it,  and the idea is to provide an efficient well kept-up industrially strong  kind of thing that people can go to get what they need from this. We  certainly were aware of the fact that there are computations that people  will want to do that will run longer than the site could possibly run  them for, and so the good news is that we have a great sort of vehicle  for running this sort of computations, and that’s Mathematica, and so  the issue is…, and Mathematica has for twenty years happily run on  millions of actual computers around the world on people’s desktops, and  so one of the things that I think we’ll see happening is that there’ll  be things where there’s sort of the Wolfram Alpha kind of central store  of data, sort of computational elements and so on, and then there’s the  sort of the fit client for it which is the Mathematica that’s able to  take pieces of what’s coming from Wolfram Alpha and run that stuff  locally. So this is sort of a way of again opening out and enabling more  people to get access to the capabilities that we’re&amp;nbsp;providing.&lt;/p&gt;
&lt;p&gt;&lt;span class=&quot;caps&quot;&gt;DW&lt;/span&gt;:  &lt;a class=&quot;timecode&quot; href=&quot;#at_47m52s&quot; name=&quot;at_47m52s&quot;&gt;47:52&lt;/a&gt; The  anthologies or scheme or whatever you want to call it, those themselves  and the fact that you have a way of representing say, a nation and  multiple other demands, that’s something that people working on the  semantic web are working quite hard doing. Is that…, you’re gonna make  that available as well?&lt;br /&gt;&lt;span class=&quot;caps&quot;&gt;SW&lt;/span&gt;: Probably. So one thing we hope to do is  to have nice organized, almost formalized process for people to  contribute data to our data repository. And in order to make that  possible we kind of have to say how is this state going to be  structured. And so I think we’ll probably have quite an effort for  particular domains in sort of exposing some of the structures that we’ve  already built and possibly people will propose other pieces of  structure that need to get added on to that, and so I think that that  will get exposed and if there’s good traffic in data being provided for  some access by people in Wolfram Alpha, if there’s good traffic to that,  I suspect that the anthology that we have will sort of become kind of a  useful standard for other people who are doing things where they try to  fit semantic data&amp;nbsp;together.&lt;/p&gt;
&lt;p&gt;&lt;span class=&quot;caps&quot;&gt;DW&lt;/span&gt;: &lt;a class=&quot;timecode&quot; href=&quot;#at_49m02s&quot; name=&quot;at_49m02s&quot;&gt;49:02&lt;/a&gt; A useful  free standard, a useful open standard? Will either of those&amp;nbsp;do?&lt;/p&gt;
&lt;p&gt;&lt;span class=&quot;caps&quot;&gt;SW&lt;/span&gt;:  I’m not sure I thought through all of these, my guess is that we will  want it to be an open standard because from our point of view the more  data that exists in the world that is structured in particularly a way  that is easy to deal with - the better everybody is, and the better we  are too. I mean in a sense our objective is to make this sort of good,  sort of definitive sort of place to go to get computational knowledge,  and there are many components to that, I mean the data is just one  component. And if more of the world would help in assembling the data  that would be great. I mean so far I have to say that we hire lots of  data curators and they do a great job, but we hope that there will be  quite a community of sort of volunteer data curators that will provide  sort of tools for volunteer data curators. But I’m a little bit less…  I’ll be very interested to see what happens. You know, Wikipedia has  been a fantastic success story of kind of the things assembled by crowds  so to speak, but I have this feeling that writing the great essay that  shows up on Wikipedia is a somehow generally more exciting activity  than…, I mean I really like lining up things and data but I have a  suspicion that that is more of a…, the nerdiness quotient or something  has to be higher. I’m proudly in that box so to speak, but I think it’s a  more professional judgement intensive kind of activity I think, because  you see, here’s the point: when you see an essay on Wikipedia, the very  texture of that essay sort of tells you what’s going on, it tells you  whether the person knows what they’re talking about, tells you whether…  it just gives you some sort of feeling for what’s going on. Data is in a  sense very cold, you know, it’s just “7.2” you know, “The number is  7.2”, doesn’t come attached to an essay, it’s just “7.2”. And what you  need, the kind of reliability that you need and the kind of people who  are going to make that number 7.2 as opposed to 7.8, you have to have  a…, at least my current view is that it’s important to have a somewhat  more controlled environment because you don’t get to know all of the  footnotes so to speak immediately, as you do in kind of the essay-type  setting. &lt;a class=&quot;timecode&quot; href=&quot;#at_51m59s&quot; name=&quot;at_51m59s&quot;&gt;51:59&lt;/a&gt; So right now my best mechanism is: you hire good data curators, we have  all these tests for data curators, we hire good people, they have to  have quite a bit of domain knowledge otherwise they get things wrong,  and it’s a somewhat more sort of organized and professionalized thing.  Now, I’m certainly hoping that we will be able to provide the tools and  the framework to have a very good community of volunteer and so on data  curators out there who can potentially help with what we’re doing, but  we fully expect that what they’re doing will have to sort of fit in to  this whole pipeline we’ve built and that will have to be doing our sort  of data ordered in processes and so on, and right now it just doesn’t  feel like the kind of thing for which it’s going to work to just say  “Ok&amp;nbsp; world, everybody just put the pieces you feel like&amp;nbsp;in”.&lt;/p&gt;
&lt;p&gt;&lt;span class=&quot;caps&quot;&gt;DW&lt;/span&gt;: &lt;a class=&quot;timecode&quot; href=&quot;#at_52m05s&quot; name=&quot;at_52m05s&quot;&gt;52:05&lt;/a&gt; In the  media we’ve already seen Wolfram Alpha referred to as a Google killer.  Who’re you trying to kill – anybody, with this?&lt;br /&gt;&lt;span class=&quot;caps&quot;&gt;SW&lt;/span&gt;: That’s not what  I’m doing… I build&amp;nbsp;something…&lt;/p&gt;
&lt;p&gt;&lt;span class=&quot;caps&quot;&gt;DW&lt;/span&gt;: Trying to kill Wikipedia,  trying to kill reference librarians?&lt;br /&gt;&lt;span class=&quot;caps&quot;&gt;SW&lt;/span&gt;: No, no, I’m… The thing I  like to do is I like to build stuff. This is a thing that I want to  build. I build stuff as a practical person, it has to fit in the  ecosystem of the world it has to support itself, things like that. If we  look at Google and Wikipedia these are both sort of great achievements,  and they’re both very complimentary to what we’re doing. We’re dealing  with the facts just the facts, Wikipedia is dealing with the narrative  and you might have noticed the side bar to Wolfram Alpha often pulls up  Wikipedia links with the rollovers that show sort of some little piece  of narrative from Wikipedia. That I think is a terrific kind of synergy  there and maybe we’ll get other kinds of encyclopedia-kinds of narrative  things that can sort of complement Wolfram Alpha. As far as search is  concerned, traditional web search again is a very complementary kind of  thing, I mean I use Wolfram Alpha lots, I also use search engines and  Google and so on lots. I use them for different kinds of things, I’ll  use Wolfram Alpha when I have some specific question that is where I’m  trying to compute something, but whereas I’ll use a search engine when  I’m looking for some fragmentary piece of information that isn’t  systematic at all, where there’s just something about&amp;#8230;David  Weinberger, that just happens to be out there, that I can kind of pick  up and read the narrative about&amp;nbsp;it.&lt;/p&gt;
&lt;p&gt;&lt;a class=&quot;timecode&quot; href=&quot;#at_55m01s&quot; name=&quot;at_55m01s&quot;&gt;55:01&lt;/a&gt;Perhaps I’m a  business idealist, ok?&amp;nbsp; I’ve been lucky enough to have a successful  business for twenty-something years and that I’ve been able to sort of  pursue with Mathematica and so on, something that I consider to be kind  of intellectually… that sort of high intellectual integrity that also  can be a very pleasantly successful business, and I certainly hope to do  the same kind of thing with Wolfram Alpha and I’m hoping for the best  possible synergy with folks who have complimentary kinds of things. I  mean I hope&amp;nbsp; we won’t be killing anybody I hope nobody will be trying to  kill us so to speak, I think that we’re doing something that I believe  can lead to a lot of positive progress in the world and I’m just keen to  see that&amp;nbsp;happen.&lt;/p&gt;
&lt;p&gt;David Weinberger: Thank you very much.&lt;br /&gt;Stephen  Wolfram: My&amp;nbsp;pleasure./p&amp;gt;&lt;/p&gt;

      
&lt;/div&gt;
&lt;/fieldset&gt;

&lt;div class=&quot;field-dl-file&quot;&gt;

      &lt;span&gt;Downloadfile:&amp;nbsp;&lt;/span&gt;
  
    
          
          &lt;div class=&quot;filefield-file clear-block&quot;&gt;&lt;img &quot;  alt=&quot;application/ogg icon&quot; src=&quot;http://transformingfreedom.net/sites/all/modules/filefield/icons/application-octet-stream.png&quot; /&gt;&lt;a href=&quot;/files/wofram-stephen--2009-05-28--wolfram-supreme-dweinberg--_radio_berkman--160kbps_0_0_0.ogg&quot; type=&quot;application/ogg; length=61444244&quot;&gt;stephen-wolfram-2009-05-27-stephen-wolfram-wolfram-alpha_tf.ogg&lt;/a&gt;&lt;/div&gt;
      
&lt;/div&gt;
</description>
     <comments>http://transformingfreedom.net/hyperaudio/stephen-wolfram-wolfram-alpha#comments</comments>
 <category domain="http://transformingfreedom.net/category/social-tags/computational-irreducibility">Computational irreducibility</category>
 <category domain="http://transformingfreedom.net/category/person/david-weinberger">David Weinberger</category>
 <category domain="http://transformingfreedom.net/category/languages-spoken/english">English</category>
 <category domain="http://transformingfreedom.net/category/tags/mathematica">Mathematica</category>
 <category domain="http://transformingfreedom.net/category/social-tags/mathematical-software">Mathematical software</category>
 <category domain="http://transformingfreedom.net/category/social-tags/mathematics">Mathematics</category>
 <category domain="http://transformingfreedom.net/category/social-tags/modal-logic">Modal logic</category>
 <category domain="http://transformingfreedom.net/category/technology/natural-language-processing">natural language processing</category>
 <category domain="http://transformingfreedom.net/category/social-tags/nks">NKS</category>
 <category domain="http://transformingfreedom.net/category/tags/radio-berkman">Radio Berkman</category>
 <category domain="http://transformingfreedom.net/category/technology/search-engine">search engine</category>
 <category domain="http://transformingfreedom.net/category/speaker/stephen-wolfram">Stephen Wolfram</category>
 <category domain="http://transformingfreedom.net/category/social-tags/wikipedia">Wikipedia</category>
 <category domain="http://transformingfreedom.net/category/social-tags/wolfram-alpha">Wolfram Alpha</category>
 <category domain="http://transformingfreedom.net/category/social-tags/wolfram-research">Wolfram Research</category>
 <enclosure url="http://transformingfreedom.net/files/wofram-stephen--2009-05-28--wolfram-supreme-dweinberg--_radio_berkman--160kbps_0_0_0.ogg" type="application/ogg" />
 <pubDate>Wed, 30 Sep 2009 17:02:02 +0000</pubDate>
 <dc:creator>Volker E.</dc:creator>
 <guid isPermaLink="false">241 at http://transformingfreedom.net</guid>
  </item>
  </channel>
</rss>