<?xml version="1.0" encoding="UTF-8" ?>
<?xml-stylesheet type="text/xsl" href="http://community.research.microsoft.com/utility/FeedStylesheets/rss.xsl" media="screen"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:wfw="http://wellformedweb.org/CommentAPI/"><channel><title>TechFest Live! : Silicon Valley</title><link>http://community.research.microsoft.com/blogs/techfestlive/archive/tags/Silicon+Valley/default.aspx</link><description>Tags: Silicon Valley</description><dc:language>en</dc:language><generator>CommunityServer 2008.5 SP1 (Build: 31106.3070)</generator><item><title>The Value of TechFest</title><link>http://community.research.microsoft.com/blogs/techfestlive/archive/2009/02/24/the-value-of-techfest.aspx</link><pubDate>Wed, 25 Feb 2009 01:57:00 GMT</pubDate><guid isPermaLink="false">eaca9afb-5ccf-4c08-b3f3-369c7e6f1a06:4678</guid><dc:creator>robk</dc:creator><slash:comments>0</slash:comments><wfw:commentRss xmlns:wfw="http://wellformedweb.org/CommentAPI/">http://community.research.microsoft.com/blogs/techfestlive/rsscomments.aspx?PostID=4678</wfw:commentRss><wfw:comment xmlns:wfw="http://wellformedweb.org/CommentAPI/">http://community.research.microsoft.com/blogs/techfestlive/commentapi.aspx?PostID=4678</wfw:comment><comments>http://community.research.microsoft.com/blogs/techfestlive/archive/2009/02/24/the-value-of-techfest.aspx#comments</comments><description>&lt;p&gt;I just ran into &lt;a href="http://research.microsoft.com/en-us/people/roylevin/"&gt;Roy Levin&lt;/a&gt;, Microsoft distinguished engineer and managing director of &lt;a href="http://research.microsoft.com/en-us/labs/siliconvalley/"&gt;Microsoft Research Silicon Valley&lt;/a&gt;, in the Rainier/St. Helens Room at the Microsoft Conference Center. We found ourselves marveling at Commute UX, one of the TechFest 2009 demos I&amp;#39;ll be posting about soon, and I asked him about what TechFest means for him and his lab.&lt;/p&gt;
&lt;p&gt;&amp;quot;The value of TechFest,&amp;quot; Levin said,&amp;nbsp;&amp;quot;is for people to get a feel for the breadth and the depth of the innovation that goes on in Microsoft Research. In the room we&amp;#39;re in right now, there is a sampler, really, of the larger event, and just wandering around this room, you see stuff from so many different areas of computer science and their applications to so many different areas of real life. For me, that&amp;#39;s where the real impact of this event is: just feeling-- in a very direct way, a very visceral way--how the work that we do can make the world better through computing.&amp;quot;&lt;/p&gt;
&lt;p&gt;Levin&amp;#39;s Silicon Valley lab specializes in research on distributed computing, and&amp;nbsp;that&amp;nbsp;means TechFest plays a particularly valuable role.&lt;/p&gt;
&lt;p&gt;&amp;quot;There are a couple of demos here, in this room, from our lab that focus on understanding and modifying the way that complicated distributed systems work, in data centers in particular,&amp;quot; he said. &amp;quot;The benefit, from my point of view, from participating in this event is that people get a glimpse at the problems, the very difficult problems, that we face in trying to harness the world of parallel computing and make it useful, in the form of cloud services,&amp;nbsp;trying to operate these very large services and provide things on a global scale.&amp;quot;&lt;/p&gt;
&lt;p&gt;Finally, Levin noted, there&amp;#39;s a special quality to the event that makes it loom large on researchers&amp;#39; annual calendars.&lt;/p&gt;
&lt;p&gt;&amp;quot;There&amp;#39;s a special place in my heart for TechFest,&amp;quot; he stated,&amp;nbsp;&amp;quot;because, obviously, it&amp;#39;s a hometown event, compared to going to a conference or some other meeting. It also is &amp;#39;hometown&amp;#39; in a different way, because members of Microsoft Research come in from all over the world. We get to meet with people that we don&amp;#39;t get to see the rest of the year. That&amp;#39;s really great.&amp;quot;&lt;/p&gt;
&lt;p&gt;&lt;a href="http://community.research.microsoft.com/cfs-file.ashx/__key/CommunityServer.Blogs.Components.WeblogFiles/techfestlive/Levin_2D00_Huasmall.jpg"&gt;&lt;img border="0" src="http://community.research.microsoft.com/resized-image.ashx/__size/550x0/__key/CommunityServer.Blogs.Components.WeblogFiles/techfestlive/Levin_2D00_Huasmall.jpg" alt="" /&gt;&lt;/a&gt;&lt;span style="font-size:xx-small;"&gt;&lt;br /&gt;Roy Levin (&lt;em&gt;left&lt;/em&gt;) and Xian-Sheng Hua of Microsoft Research Asia chat during&amp;nbsp;TechFest.&lt;/span&gt;&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://community.research.microsoft.com/aggbug.aspx?PostID=4678" width="1" height="1"&gt;</description><category domain="http://community.research.microsoft.com/blogs/techfestlive/archive/tags/Research/default.aspx">Research</category><category domain="http://community.research.microsoft.com/blogs/techfestlive/archive/tags/TechFest/default.aspx">TechFest</category><category domain="http://community.research.microsoft.com/blogs/techfestlive/archive/tags/Microsoft/default.aspx">Microsoft</category><category domain="http://community.research.microsoft.com/blogs/techfestlive/archive/tags/Roy+Levin/default.aspx">Roy Levin</category><category domain="http://community.research.microsoft.com/blogs/techfestlive/archive/tags/Silicon+Valley/default.aspx">Silicon Valley</category><category domain="http://community.research.microsoft.com/blogs/techfestlive/archive/tags/2009/default.aspx">2009</category><category domain="http://community.research.microsoft.com/blogs/techfestlive/archive/tags/Xian-Sheng+Hua/default.aspx">Xian-Sheng Hua</category><category domain="http://community.research.microsoft.com/blogs/techfestlive/archive/tags/distributed+computing/default.aspx">distributed computing</category></item><item><title>Improving Data-Center Performance</title><link>http://community.research.microsoft.com/blogs/techfestlive/archive/2009/02/24/improving-data-center-performance.aspx</link><pubDate>Tue, 24 Feb 2009 21:50:00 GMT</pubDate><guid isPermaLink="false">eaca9afb-5ccf-4c08-b3f3-369c7e6f1a06:4673</guid><dc:creator>robk</dc:creator><slash:comments>0</slash:comments><wfw:commentRss xmlns:wfw="http://wellformedweb.org/CommentAPI/">http://community.research.microsoft.com/blogs/techfestlive/rsscomments.aspx?PostID=4673</wfw:commentRss><wfw:comment xmlns:wfw="http://wellformedweb.org/CommentAPI/">http://community.research.microsoft.com/blogs/techfestlive/commentapi.aspx?PostID=4673</wfw:comment><comments>http://community.research.microsoft.com/blogs/techfestlive/archive/2009/02/24/improving-data-center-performance.aspx#comments</comments><description>&lt;p&gt;&lt;a href="http://community.research.microsoft.com/cfs-file.ashx/__key/CommunityServer.Blogs.Components.WeblogFiles/techfestlive/goldszmidt.JPG"&gt;&lt;img border="0" width="448" src="http://community.research.microsoft.com/resized-image.ashx/__size/550x0/__key/CommunityServer.Blogs.Components.WeblogFiles/techfestlive/goldszmidt.JPG" alt="Moises Goldszmidt displays demo" height="336" style="border:0;" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="http://research.microsoft.com/en-us/people/moises/"&gt;Moises Goldszmidt&lt;/a&gt; (&lt;em&gt;above&lt;/em&gt;), principal researcher at &lt;a href="http://research.microsoft.com/en-us/labs/siliconvalley/default.aspx"&gt;Microsoft Research Silicon Valley&lt;/a&gt;, is showing a pair of demos, in conjunction with lab colleague &lt;a href="http://research.microsoft.com/en-us/people/mbudiu/"&gt;Mihai Budiu&lt;/a&gt;, that examines performance in data centers.&lt;/p&gt;
&lt;p&gt;&amp;quot;The challenge,&amp;quot; Goldszmidt says, &amp;quot;is: How do I summarize&amp;nbsp;thousands of machines and hundreds of metrics and find&amp;nbsp;the key elements over that huge space that&amp;#39;s giving us surprises, such that I can let it retrieve that fingerprint? How do I do that automatically?&amp;quot;&lt;/p&gt;
&lt;p&gt;The demo is called Predicting Problems in the Data Center.&lt;/p&gt;
&lt;p&gt;&amp;quot;We are using very sophisticated machine-learning techniques,&amp;quot; Goldszmidt states, &amp;quot;that build automated models that are able to extract the main characteristics of each one of these crises.&amp;quot;&lt;/p&gt;
&lt;p&gt;The value of such work is readily apparent.&lt;/p&gt;
&lt;p&gt;&amp;quot;Eighty percent of the time, we&amp;#39;re predicting&amp;nbsp;one hour in advance&amp;nbsp;a set of actions we need to do to mitigate a problem,&amp;quot; he says, resulting in&amp;nbsp;&amp;quot;less downtime, less latency for our clients using our services. Our services are more efficient to run, because we don&amp;#39;t have to have that many people look at the problem.&amp;quot;&lt;/p&gt;
&lt;p&gt;&lt;a href="http://community.research.microsoft.com/cfs-file.ashx/__key/CommunityServer.Blogs.Components.WeblogFiles/techfestlive/budiusmall.jpg"&gt;&lt;img border="0" width="448" src="http://community.research.microsoft.com/resized-image.ashx/__size/550x0/__key/CommunityServer.Blogs.Components.WeblogFiles/techfestlive/budiusmall.jpg" alt="Mihai Budiu" height="336" style="border:0;float:left;" /&gt;&lt;/a&gt;The second demo in the booth is named Profiling the Performance of Distributed Systems. It features a colorful&amp;nbsp;analytics engine that enables the monitoring of&amp;nbsp;vast data centers.&lt;/p&gt;
&lt;p&gt;&amp;quot;Once you have a large cluster that you can run your services and applications on, it&amp;#39;s very hard to understand, if something goes wrong, what&amp;#39;s wrong,&amp;quot; explains Budiu (&lt;em&gt;left&lt;/em&gt;). &amp;quot;It could be a hardware problem. It could be your application has partitioned the data wrongly. It could be something in between, such as&amp;nbsp;the network&amp;nbsp;being down. &lt;/p&gt;
&lt;p&gt;&amp;quot;This is a tool which pulls a lot of metrics from the machines in&amp;nbsp;your cluster and allows you to easily visualize the data and find correlations in the data. &amp;nbsp;You can assign colors to&amp;nbsp;metrics and drag and drop the colors into other windows to see how the metrics correlate with other metrics.You can assign colors according to how many CPU cycles are utilized and immediately&amp;nbsp;drop the color to see where the high CPU cycles are being used, in which&amp;nbsp;machine.&amp;quot;&lt;/p&gt;
&lt;p&gt;In both demos, Goldszmidt declares, the objective is the same.&lt;/p&gt;
&lt;p&gt;&amp;quot;Reduce the time to understand what the heck is going on,&amp;quot; he says. &amp;quot;Build a better service. That&amp;#39;s the final goal.&amp;quot;&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://community.research.microsoft.com/aggbug.aspx?PostID=4673" width="1" height="1"&gt;</description><category domain="http://community.research.microsoft.com/blogs/techfestlive/archive/tags/Research/default.aspx">Research</category><category domain="http://community.research.microsoft.com/blogs/techfestlive/archive/tags/TechFest/default.aspx">TechFest</category><category domain="http://community.research.microsoft.com/blogs/techfestlive/archive/tags/Microsoft/default.aspx">Microsoft</category><category domain="http://community.research.microsoft.com/blogs/techfestlive/archive/tags/Silicon+Valley/default.aspx">Silicon Valley</category><category domain="http://community.research.microsoft.com/blogs/techfestlive/archive/tags/2009/default.aspx">2009</category><category domain="http://community.research.microsoft.com/blogs/techfestlive/archive/tags/distributed+systems/default.aspx">distributed systems</category><category domain="http://community.research.microsoft.com/blogs/techfestlive/archive/tags/Moises+Goldszmidt/default.aspx">Moises Goldszmidt</category><category domain="http://community.research.microsoft.com/blogs/techfestlive/archive/tags/Mihai+Budiu/default.aspx">Mihai Budiu</category></item><item><title>Privacy Integrated Queries</title><link>http://community.research.microsoft.com/blogs/techfestlive/archive/2008/03/06/privacy-integrated-queries.aspx</link><pubDate>Thu, 06 Mar 2008 21:57:00 GMT</pubDate><guid isPermaLink="false">eaca9afb-5ccf-4c08-b3f3-369c7e6f1a06:776</guid><dc:creator>robk</dc:creator><slash:comments>0</slash:comments><wfw:commentRss xmlns:wfw="http://wellformedweb.org/CommentAPI/">http://community.research.microsoft.com/blogs/techfestlive/rsscomments.aspx?PostID=776</wfw:commentRss><wfw:comment xmlns:wfw="http://wellformedweb.org/CommentAPI/">http://community.research.microsoft.com/blogs/techfestlive/commentapi.aspx?PostID=776</wfw:comment><comments>http://community.research.microsoft.com/blogs/techfestlive/archive/2008/03/06/privacy-integrated-queries.aspx#comments</comments><description>&lt;p&gt;Data abounds in the digital age. Bits by the billions are collected on a daily basis from a variety of sources: Web services, financial programs, governmental agencies. Those who specialize in data mining and analysis could spend a hundred lifetimes sifting through such data, looking for patterns and clues that could help explain and fine-tune 21st-century life.&lt;/p&gt;
&lt;p&gt;But they can&amp;#39;t.&lt;/p&gt;
&lt;p&gt;Much of that never-ending stream of data is privacy-protected--as it should be. Nobody wants their personal information accessible to any and all. Privacy is the cornerstone of the Internet era. Without it, society would devolve into digital chaos. But those same privacy protections deny experts access to all that massive, tantalizing data.&lt;/p&gt;
&lt;p&gt;&lt;a class="" title="Frank McSherry" href="http://research.microsoft.com/users/mcsherry/"&gt;Frank McSherry&lt;/a&gt; aims to change all that. McSherry, a researcher at Microsoft Research Silicon Valley, is demonstrating, along with colleague &lt;a class="" title="Cynthia Dwork" href="http://research.microsoft.com/users/dwork/"&gt;Cynthia Dwork&lt;/a&gt;, a principal researcher at the same lab, a project called Privacy Integrated Queries,&amp;nbsp;designed to enable the mining of huge data collections while not putting individuals&amp;#39; private information at risk.&lt;/p&gt;
&lt;p&gt;&amp;quot;We&amp;#39;re looking to put together a&amp;nbsp;privacy-preserving data-mining platform,&amp;quot; McSherry says,&amp;nbsp;&amp;quot;tools that analysts can use, even without privacy training, to interact with and mine data from sensitive data sets that they wouldn&amp;#39;t otherwise have access to. &lt;/p&gt;
&lt;p&gt;&amp;quot;Cynthia Dwork and I have had a lot of prior experience with this privacy-preserving data analysis over the last few years. There&amp;#39;s a lot of really formal mathematics behind it, but every time we do something new, we start from scratch in some sense: prove theorems, write papers--this is the model for convincing people things are private. We thought it would be smart to try to factor out the common technology we&amp;#39;ve been using in each of these results and package it&amp;nbsp;in a framework that people could use to put together their own analyses.&amp;quot;&lt;/p&gt;
&lt;p&gt;One scenario in which such a platform could play a useful role relates to recent troubles in the financial sector.&lt;/p&gt;
&lt;p&gt;&amp;quot;Some folks,&amp;quot; McSherry says,&amp;nbsp;&amp;quot;are really excited about finding what went&amp;nbsp;wrong in the subprime collapse. Unfortunately, all that data is locked up--all the mortgage information, who bought what, at what rates--sealed up for privacy reasons. People can&amp;#39;t sort out where the next collapse will be and how to counteract it, and that&amp;#39;s unfortunate, that privacy is getting in the way, in some sense, of a real common-good happening.&amp;nbsp;They didn&amp;#39;t want to know who had what mortgage, but what parts of the country are most at risk.&amp;quot;&lt;/p&gt;
&lt;p&gt;Perhaps that episode would have&amp;nbsp;developed differently if data analysts had access to the Privacy Integrated &lt;br /&gt;Queries technology devised by McSherry and Dwork.&lt;/p&gt;
&lt;p&gt;&amp;quot;We put together something that looks a lot like LINQ, Language Integrated Query, a sort of SQL-style, programmatic data access,&amp;quot; McSherry explains. &amp;quot;To the user, it&amp;#39;s basically indistinguishable. But under the covers, the privacy thing is going on, instincting about what you&amp;#39;re asking for and communicating back with the data center, trying to determine if this is OK and pushing a lot of formal mathematics around, making sure that, at the end of the day, you haven&amp;#39;t compromised privacy.&lt;br /&gt;&lt;br /&gt;&amp;quot;The goal was to try to make it transparent to the users, so they didn&amp;#39;t have to worry about what weird, funny machinations underneath are going on. They could just program against it as if it were LINQ.&amp;quot;&lt;/p&gt;
&lt;p&gt;But this latest project has one significant distinction that sets it apart from LINQ.&amp;nbsp;&lt;/p&gt;
&lt;p&gt;&amp;quot;Unlike LINQ,&amp;quot; McSherry says,&amp;nbsp;&amp;quot;you don&amp;#39;t get to just enumerate a data set. You have to stay one step removed. You explain what you&amp;#39;d like to do with the data set and how you&amp;#39;d like it to be aggregated, and the results come back to you, perturbed a little bit. You get a little bit of noise, and the noise introduces uncertainty about the answer, which turns into this formal notion of privacy.&amp;quot;&lt;/p&gt;
&lt;p&gt;Privacy Integrated Queries could prove beneficial in a number of settings. The medical field, for example, could benefit greatly if the data could be safely aggregated without disclosing&amp;nbsp;the associated personal information. Another potential usage could find takers within Microsoft.&lt;/p&gt;
&lt;p&gt;&amp;quot;We have all sorts of data on who searched for what,&amp;quot; McSherry notes. &amp;quot;It&amp;#39;s good stuff. Microsoft would love to collaborate with external researchers, people interested in Web research who don&amp;#39;t have access to the scale of data that we have, but we&amp;#39;re really concerned about privacy. That sort of stalls research, to some extent, for&amp;nbsp;Web researchers who don&amp;#39;t have anywhere to go. They can&amp;#39;t start up their own&amp;nbsp;Web-search engine. &lt;/p&gt;
&lt;p&gt;&amp;quot;With this sort of technology, we can really start the ball rolling on this type of work, work&amp;nbsp;that people just haven&amp;#39;t been able to do before.&amp;quot; &amp;nbsp;&lt;/p&gt;
&lt;p&gt;&lt;a href="http://community.research.microsoft.com/blogs/techfestlive/McSherry-Dwork4.jpg"&gt;&lt;img src="http://community.research.microsoft.com/blogs/techfestlive/McSherry-Dwork4.jpg" border="0" alt="" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Frank McSherry and Cynthia Dwork of Microsoft Research Silicon Valley in front of their TechFest poster on Privacy Integrated Queries.&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://community.research.microsoft.com/aggbug.aspx?PostID=776" width="1" height="1"&gt;</description><category domain="http://community.research.microsoft.com/blogs/techfestlive/archive/tags/2008/default.aspx">2008</category><category domain="http://community.research.microsoft.com/blogs/techfestlive/archive/tags/Research/default.aspx">Research</category><category domain="http://community.research.microsoft.com/blogs/techfestlive/archive/tags/TechFest/default.aspx">TechFest</category><category domain="http://community.research.microsoft.com/blogs/techfestlive/archive/tags/Microsoft/default.aspx">Microsoft</category><category domain="http://community.research.microsoft.com/blogs/techfestlive/archive/tags/Frank+McSherry/default.aspx">Frank McSherry</category><category domain="http://community.research.microsoft.com/blogs/techfestlive/archive/tags/privacy/default.aspx">privacy</category><category domain="http://community.research.microsoft.com/blogs/techfestlive/archive/tags/Silicon+Valley/default.aspx">Silicon Valley</category><category domain="http://community.research.microsoft.com/blogs/techfestlive/archive/tags/Cynthia+Dwork/default.aspx">Cynthia Dwork</category></item></channel></rss>