<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Opus Research &#187; voice search</title>
	<atom:link href="http://opusresearch.net/wordpress/tag/voice-search/feed/" rel="self" type="application/rss+xml" />
	<link>http://opusresearch.net/wordpress</link>
	<description>Analysis and Expertise on Voice Services and Conversational Commerce</description>
	<lastBuildDate>Wed, 08 Feb 2012 18:55:10 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>That Didn&#8217;t Take Long! Siri-based Comparison Shopping Adds Best Buy Catalog</title>
		<link>http://opusresearch.net/wordpress/2011/12/16/that-didnt-take-long-siri-based-comparison-shopping-adds-best-buy-catalog/</link>
		<comments>http://opusresearch.net/wordpress/2011/12/16/that-didnt-take-long-siri-based-comparison-shopping-adds-best-buy-catalog/#comments</comments>
		<pubDate>Sat, 17 Dec 2011 01:02:07 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Apple]]></category>
		<category><![CDATA[mobile commerce]]></category>
		<category><![CDATA[Siri]]></category>
		<category><![CDATA[voice search]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=5043</guid>
		<description><![CDATA[Spoken queries to Siri regarding electronic gadgets, appliances, games and computers will result in a display of responses that include the the SKUs (stock keeping units) in Best Buy's catalog]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2011/12/siriwolfram.jpg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2011/12/siriwolfram.jpg" alt="" title="siriwolfram" width="144" height="219" class="alignright size-full wp-image-5044" /></a>It&#8217;s quickly becoming apparent that Conversational Commerce and Recombinant Communications are inextricably intertwine. (Try saying that five times fast). We&#8217;re in month 2 of Siri&#8217;s beta release on Apple&#8217;s iPhone 4S and we&#8217;re already witnessing how the service will improve as a product of natural selection, gradual upgrades, augmentation and evolution. As a case in point, spoken queries to Siri regarding electronic gadgets, appliances, games and computers will result in a display of responses that include the the SKUs (stock keeping units) in Best Buy&#8217;s catalog. </p>
<p>You can find coverage of the phenomenon in dozens of tech publications today, but they seem to trace back to <a href="http://www.razorianfly.com/2011/12/16/siri-can-now-help-you-shop-at-best-buy/">this post</a> on the Apple-centric tech blog called RazorianFly.com. According to the post, the enhancement is very much the result of Wolfram Alpha (an answer-oriented search engine that is integrated into Siri&#8217;s search results) integrating with Best Buy&#8217;s product database through BestBuy.com&#8217;s API. Or, as more than one tech blog put it, <a href="http://www.webpronews.com/wolframalpha-brings-comparison-shopping-to-siri-2011-12">Siri now returns the same errors as a search on Wolfram Alpha</a>. </p>
<p>As Shaylin Clark at WebProNews explains in the post above, as a &#8220;computational engine&#8221; oriented toward asking questions, Wolfram Alpha can be quirky (he calls it &#8220;finicky&#8221;). But putting speech-based access to comparison shopping that includes Best Buy&#8217;s inventory marks progress, even if the results are not always optimal. The point is that end-users are gaining experience with the service. They are learning what it is good at and where it fails. </p>
<p>My empirical observation is that people are being much more patient with Siri than they had been with prior renditions of voice-based &#8220;assistants&#8221; (like Wildfire, HeyAnita or Webley). One reason is that the service is faster, better, more robust and capable of doing more things than its predecessors. There&#8217;s more knowledge in the databases that comprise its available knowledge (heck, it defaults to a search on Google, but it has maps, online music and Wolfram Alpha to bring to bear). It&#8217;s very early days and Siri is bound to get better. And it will inspire competing services from Google, Microsoft/Tellme, Amazon, Nuance, Vlingo and a handful of others. Each will add new features, functions, information and APIs to differentiate their services and deliver a better customer experience. </p>
<p>At this point Apple has taken a leadership position by coming to market with a service that&#8217;s instantiated as an embedded application that recognizes utterances accurately; determines context and meaning; and then has meaningful integrations with a broad range of knowledge bases so that it starts by recognizing intent and finishes by delivering relevant results. The truly exciting aspect to this is that the the services from Apple and its competitors will continue to evolve and get better.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2011/12/16/that-didnt-take-long-siri-based-comparison-shopping-adds-best-buy-catalog/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Novauris and SingTel Offer Mobile Voice Search in Singapore</title>
		<link>http://opusresearch.net/wordpress/2011/11/11/novauris-and-singtel-offer-mobile-voice-search-in-singapore/</link>
		<comments>http://opusresearch.net/wordpress/2011/11/11/novauris-and-singtel-offer-mobile-voice-search-in-singapore/#comments</comments>
		<pubDate>Fri, 11 Nov 2011 19:46:20 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[localization]]></category>
		<category><![CDATA[mobile commerce]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[Novauris]]></category>
		<category><![CDATA[voice search]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=4934</guid>
		<description><![CDATA[SingTel and Novauris launched a version of voice search that is tailored specifically for "the unique style of English" that is spoken on the nation-island of Singapore. ]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2011/11/singtel-logo1.jpg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2011/11/singtel-logo1.jpg" alt="" title="singtel-logo" width="144" height="57" class="alignright size-full wp-image-4937" /></a>Every press release or news item about speech-enabled mobile services now carries the sobriquet &#8220;Siri-like.&#8221; Its use is so frequent these days that it is highly likely that it will be added to the Oxford English Dictionary in the coming year. I mean this year the editors accepted the contemporary definitions of &#8220;OMG&#8221; and &#8220;LOL.&#8221; &#8217;nuff said.</p>
<p>The reason that the term &#8220;Siri-like&#8221; is appearing so often is that there has been an acceleration in the introduction and marketing of new speech-enabled services. One that&#8217;s worth noting is <a href="http://www.ereleases.com/pr/novauris-singtel-bring-local-voice-search-singapore-69256">SingTel deFIND</a>, a version of voice search that is tailored specifically for &#8220;the unique style of English&#8221; that is spoken on the nation-island of Singapore. </p>
<p>SingTel turned to UK-based Novauris to build the corpus of localized utterances. The two companies have successfully introduced an application that delivers details on local shops, restaurants or retailers when a user simply says the name. Users can also search by category (e.g. Filipino restaurant) and the application will use GPS and SingTel&#8217;s &#8220;InSing&#8221; database to show what businesses are nearby.</p>
<p>In the coming months, we expect to see the introduction of speech-enabled mobile search and assistance apps to accelerate and that means there are lots more opportunities for firms that specialize in this sort of localization. </p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2011/11/11/novauris-and-singtel-offer-mobile-voice-search-in-singapore/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Why Isn&#8217;t Yahoo Doing More with Vlingo?</title>
		<link>http://opusresearch.net/wordpress/2010/06/15/why-isnt-yahoo-doing-more-with-vlingo/</link>
		<comments>http://opusresearch.net/wordpress/2010/06/15/why-isnt-yahoo-doing-more-with-vlingo/#comments</comments>
		<pubDate>Tue, 15 Jun 2010 17:08:08 +0000</pubDate>
		<dc:creator>Greg Sterling</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[mobile search]]></category>
		<category><![CDATA[voice search]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=3033</guid>
		<description><![CDATA[
We met with Vlingo yesterday. The company is previewing some new functionality and we had an interesting and wide-ranging discussion about the product and strategy. My question is: why isn&#39;t Vlingo investor Yahoo! leveraging the heck out of it? There was some initial fanfare about voice search on Yahoo! Mobile but since then there&#39;s been [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2010/06/Vlingo_logo.png"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2010/06/Vlingo_logo.png" alt="" title="Vlingo_logo" width="198" height="80" class="alignright size-full wp-image-3034" /></a>
<p>We met with <a href="http://www.vlingo.com/">Vlingo</a> yesterday. The company is previewing some new functionality and we had an interesting and wide-ranging discussion about the product and strategy. My question is: why isn&#39;t Vlingo <a href="/news/user-experience/yahoo-upgrades-onesearch-shortcut-auto-locate-voice">investor Yahoo!</a> leveraging the heck out of it? There was some <a href="http://www.ysearchblog.com/2009/02/09/yahoo-onesearch-shortcut-now-with-auto-locate-improved-search-assist-and-windows-mobile-client/">initial fanfare</a> about voice search on Yahoo! Mobile but since then there&#39;s been little or nothing. </p>
<p>I think that Yahoo! should aggressively integrate Vlingo into its mobile app as a discovery tool for all types of content and not just Web search. It should also build a mobile version of Yahoo! Answers and think about Vlingo as a front end to that experience. </p>
<p>Alternatively our friends in Finland (Nokia) might consider buying a company like Vlingo and integrating it as a core component of its handsets &#8212; for smartphones and not-so-smart phones. </p>
<p>Voice will continue to improve as a tool for a broad array of uses on the handset. Apple obviously bought Siri, which is not about voice search per se, but about semantic understanding of voice queries and matching with transactional services. And Google continues to invest in voice and see it as a differentiator. </p>
<p>Microsoft of course had formidable voice assets but has not fully exploited or marketed them. Perhaps with the release of Windows (Mobile) 7 handsets we&#39;ll see a more prominent role for voice on the device. </p>
<p>But Yahoo!&#39;s got to get into the game more here.  </p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/06/15/why-isnt-yahoo-doing-more-with-vlingo/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google TV: &#8220;Put a Man on the Couch by Holiday Buying Season&#8221;</title>
		<link>http://opusresearch.net/wordpress/2010/05/20/google-tv-put-a-man-on-the-couch-by-holiday-buying-season/</link>
		<comments>http://opusresearch.net/wordpress/2010/05/20/google-tv-put-a-man-on-the-couch-by-holiday-buying-season/#comments</comments>
		<pubDate>Thu, 20 May 2010 18:27:25 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Android]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Recombinant Communications]]></category>
		<category><![CDATA[voice search]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=2896</guid>
		<description><![CDATA[The demo at Google I/O 2010 had lots of glitches, but the importance of Google TV for the developer community and for the TV-watching public cannot be overestimated.]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2010/05/Google-TV-logo.jpg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2010/05/Google-TV-logo-150x125.jpg" alt="" title="Google-TV-logo" width="150" height="125" class="alignright size-thumbnail wp-image-2900" /></a>The demo at Google I/O 2010 had lots of glitches, but the importance of Google TV for the developer community and for the TV-watching public cannot be overestimated. As &#8220;putting a man on the moon by the end of the decade&#8221; rallied the NASA engineers (developers) to accelerate development of new fuels, engines, computers and a broad spectrum of other technologies; putting Google users on the couch takes on some surprising dimensions &#8211; including social networking, voice search (and navigation), video processing (Flash on Intel Atom) and set-top boxes/remote controls.</p>
<p>The optimist in me wants all of the piece parts to work together seamlessly. The reality is bound to be quite different. As was demonstrated by interference in the conference venue, things are bound to go wrong (as the display of Nicolas Cage &#8220;animal sex diet&#8221; during the demo dramatized). But that did not blind the developer community (or indeed the CEO&#8217;s from Intel, Best Buy, Sony, Adobe and DishTV) of the transformative potential of adding &#8220;seamless&#8221; and simultaneous search to accompany the five hours of TV watching that people are ordinarily doing.</p>
<p>From the RC (Recombinant Communications) perspective, Google claims that development of new applications will benefit from its &#8220;open&#8221; approach. <a href="http://googleblog.blogspot.com/2010/05/announcing-google-tv-tv-meets-web-web.html">As described on the Google Developers Blog</a>, the software, OS guts and APIs are the household names of open computing: Android, Chrome and Ajax. The application development toolkits are not fully developed, but they start with the SDKs for Chrome and Android. </p>
<p>From a competitive point of view, Google is treading where Apple (with AppleTV) and Cisco (with its &#8220;ownage&#8221; of ScientificAtlanta and LinkSys for couchside products), along with Motorola, Philips, Tatung and the mainstream set-top box makers have already done their best to fragment the market. What Google brings to the game as differentiaters is the power of its advertising-based business model along with its experience (and existing functionality) in real-time search/social search, voice search and speech-to-text transcription/translation. </p>
<p>Each of these elements has the potential to spark the imagination of users, as well as the development community. But they are also fraught with risk. Speech-to-text transcription (much less translation) has already created fodder for all sorts of ridicule. Quality is bound to improve over time, but neither transcription nor translation will ever be perfect and much of the early energy should be dedicated to managing expectations.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/05/20/google-tv-put-a-man-on-the-couch-by-holiday-buying-season/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Windows (7) Phone: Where Is Voice?</title>
		<link>http://opusresearch.net/wordpress/2010/02/15/windows-7-phone-where-is-voice/</link>
		<comments>http://opusresearch.net/wordpress/2010/02/15/windows-7-phone-where-is-voice/#comments</comments>
		<pubDate>Mon, 15 Feb 2010 19:52:13 +0000</pubDate>
		<dc:creator>Greg Sterling</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[voice search]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=2372</guid>
		<description><![CDATA[I saw most of Steve Ballmer&#8217;s Windows 7 keynote at Mobile World Congress this morning. Most of the phone&#8217;s features were demonstrated and there are some cool ones. Mostly the phone looks different than other things in the market, even as it borrows certain elements from Apple and Google. 
However, one of Microsoft&#8217;s core mobile [...]]]></description>
			<content:encoded><![CDATA[<p>I saw most of Steve Ballmer&#8217;s Windows 7 keynote at Mobile World Congress this morning. Most of the phone&#8217;s features were demonstrated and there are some cool ones. Mostly the phone looks different than other things in the market, even as it borrows certain elements from Apple and Google. </p>
<p>However, one of Microsoft&#8217;s core mobile competencies &#8212; voice and voice search &#8212; was not on display in the demo. I would have expected deeper integration of Tellme&#8217;s technology into 7 and that this would be one of the differentiators or would-be differentiators for the company. Yet it was nowhere in evidence. </p>
<p>I&#8217;m sure that voice search will be available for Bing; but will it be available at a deeper level in the way Google is trying to integrate voice on the Nexus One? That wasn&#8217;t the case today and it&#8217;s not clear how and whether Microsoft will use voice as the devices roll out late this year. If the company is smart it will seek to voice-enable as much as possible on the device. </p>
<p>Here&#8217;s a quick demo of local search from MobileCrunch on the scene in Barcelona:  </p>
<p><a href="http://www.youtube.com/watch?v=1FIAVenNon0&amp;feature=player_embedded#" title="Picture 192 by sterlingtkg, on Flickr"><img src="http://farm3.static.flickr.com/2712/4359475977_9241d63dc7_o.png" alt="Picture 192" width="550" height="311" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/02/15/windows-7-phone-where-is-voice/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google&#8217;s Turn-by-Turn GPS has &#8220;Voice Guidance&#8221;</title>
		<link>http://opusresearch.net/wordpress/2009/10/28/googles-turn-by-turn-gps-has-voice-guidance/</link>
		<comments>http://opusresearch.net/wordpress/2009/10/28/googles-turn-by-turn-gps-has-voice-guidance/#comments</comments>
		<pubDate>Wed, 28 Oct 2009 21:49:33 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[mobile services]]></category>
		<category><![CDATA[voice search]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=1727</guid>
		<description><![CDATA[The "beta" version of Google Maps Navigation app gives instant legitimacy to voice-activated GPS, offered free of charge on Android phones.]]></description>
			<content:encoded><![CDATA[<p><object width="560" height="340"><param name="movie" value="http://www.youtube.com/v/tGXK4jKN_jY&#038;hl=en&#038;fs=1&#038;"></param><param name="allowFullScreen" value="true"></param><param name="allowscriptaccess" value="always"></param><embed src="http://www.youtube.com/v/tGXK4jKN_jY&#038;hl=en&#038;fs=1&#038;" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="560" height="340"></embed></object><br />
Don&#8217;t take my word for it, have a look at the video. You&#8217;ll understand pretty quickly why the stock prices of both Garmin and TomTom have taken their respective nose dives. </p>
<p>But bear in mind, this is just a &#8220;beta&#8221; version of a rendition of Google Maps for Android phones. To its credit, it appears very flexible and it enables the phone users to use their own terms to say where they want to go. But I don&#8217;t come to praise Google, just to give them credit for creating a showcase for the features and functions that other providers of navigation devices should offer. It provides instant legitimacy and credibility for functions that as recently as yesterday were considered &#8220;futuristic.&#8221;</p>
<p>It delivers on the promise of &#8220;Internet-based, GPS navigation.&#8221; In its &#8220;car mode&#8221; it comes close to being totally hands-free and, in the spirit of everything else that Google is doing in this space, it is offered free-of-charge to end users. </p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2009/10/28/googles-turn-by-turn-gps-has-voice-guidance/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Mobile Speech: Unlocking Personal Apps, Features and Functions</title>
		<link>http://opusresearch.net/wordpress/2009/09/16/mobile-speech-unlocking-personal-apps-features-and-functions/</link>
		<comments>http://opusresearch.net/wordpress/2009/09/16/mobile-speech-unlocking-personal-apps-features-and-functions/#comments</comments>
		<pubDate>Wed, 16 Sep 2009 17:45:11 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[Featured Research]]></category>
		<category><![CDATA[Reports]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[speech-enabled mobile services]]></category>
		<category><![CDATA[voice search]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=1455</guid>
		<description><![CDATA[
Featured Research
The forces of human nature, technological progress and regulatory stricture are converging to boost interest in “truly hands-free” mobile applications. Dozens of firms have responded with a broad variety of software, services and features that work remarkably well. Next step: to build a sustainable market model.
Featured Research Reports are available to registered users only. [...]]]></description>
			<content:encoded><![CDATA[<p><img src="http://opusresearch.net/wordpress/pdfreports/MobileSpeech09_091609.png" width="106" height="150" align='right'  HSPACE=10 vspace=10 border=1/><br />
<em>Featured Research</em><br />
The forces of human nature, technological progress and regulatory stricture are converging to boost interest in “truly hands-free” mobile applications. Dozens of firms have responded with a broad variety of software, services and features that work remarkably well. Next step: to build a sustainable market model.</p>
<p><em>Featured Research Reports are available to registered users only.</em> </p>
<p>For more information on becoming an Opus Research client, please contact Pete Headrick (<a href="mailto:pheadrick@opusresearch.net">pheadrick@opusresearch.net</a>).</p>
<p><a href="http://opusresearch.net/wordpress/pdfreports/SEMS2009_final_leadup.pdf"><strong>Click Here to View the Report Summary</strong></a></p>
<p><!--/hidethis--></p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2009/09/16/mobile-speech-unlocking-personal-apps-features-and-functions/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Orwellian Aspects to the App Store Strategy</title>
		<link>http://opusresearch.net/wordpress/2009/07/28/orwellian-aspects-to-the-app-store-strategy/</link>
		<comments>http://opusresearch.net/wordpress/2009/07/28/orwellian-aspects-to-the-app-store-strategy/#comments</comments>
		<pubDate>Tue, 28 Jul 2009 18:44:00 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Amazon.com]]></category>
		<category><![CDATA[Apple]]></category>
		<category><![CDATA[Google Voice]]></category>
		<category><![CDATA[Recombinant Telephony]]></category>
		<category><![CDATA[Speech recognition]]></category>
		<category><![CDATA[voice search]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=1090</guid>
		<description><![CDATA[When Apple purged the App Store of all things Google Voice, it revealed about itself what "deKindling" the unauthorized version of Orwell's "1984" did for Amazon.com.]]></description>
			<content:encoded><![CDATA[<p><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/07/iphone_apple_logo3-150x150.jpg" alt="iphone_apple_logo3" title="iphone_apple_logo3" width="150" height="150" class="alignright size-thumbnail wp-image-1094" />When Apple <a href="http://www.techcrunch.com/2009/07/27/apple-is-growing-rotten-to-the-core-and-its-likely-atts-fault/">purged the App Store of all things Google Voice</a>, it revealed about itself what &#8220;deKindling&#8221; the unauthorized version of Orwell&#8217;s &#8220;1984&#8243; did for Amazon.com. It gives high-visibility to practices that publishers of electronic content and applications like to carry out quietly, over-the-weekend and without recognition or attribution. The impact of such an approach is quite the opposite. It only takes one post in an alpha-blog (like TechCrunch, Slate or even BusinessWeek.com) to raise the ire of the mass of aggrieved parties or chronic complainers. </p>
<p><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/07/s-AMAZON-KINDLE-large-150x150.jpg" alt="s-AMAZON-KINDLE-large" title="s-AMAZON-KINDLE-large" width="150" height="150" class="alignright size-thumbnail wp-image-1095" />Amazon.com closed a chapter in this saga with a public apology from Jeff Bezos, its founder. I don&#8217;t know what the plural of &#8220;mea culpa&#8221; is (guessing &#8220;nostri culpa&#8221;), but the following excerpt captures the spirit of his statement:</p>
<blockquote><p>Our &#8220;solution&#8221; to the problem was stupid, thoughtless, and painfully out of line with our principles. It is wholly self-inflicted, and we deserve the criticism we&#8217;ve received. We will use the scar tissue from this painful mistake to help make better decisions going forward, ones that match our mission.</p></blockquote>
<p>Yet, deKindling &#8220;1984&#8243; was not an isolated incident. It had been routine for Amazon.com to delete unauthorized or pirated titles from Kindle subscribers at its discretion. It is a core capability of Digital Rights Management (DRM) schemes and had been applied to several of Ayn Rand&#8217;s titles among others. Giving Orwell&#8217;s classic the deKindling treatment crossed a threshold primarily because it was so &#8230; Orwellian.</p>
<p><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/07/Google_logo1-150x74.png" alt="Google_logo" title="Google_logo" width="150" height="74" class="alignleft size-thumbnail wp-image-1097" />News about Apple&#8217;s snub of Google Voice came hot on the heels of Amazon.com&#8217;s high profile departure from its principles. To be clear, the App Store staff&#8217;s approval process has long been something of a Black Box. Amid cries from the developer community to keep the process &#8220;open&#8221;, the best Apple can do is broadly distribute its specifications, access to its APIs and do considerable hand-holding during the development and approval process. From Apple&#8217;s point of view, the fact that iPhone owners downloaded more than 1.5 billion apps from the App Store in its first year of operation speaks for itself. Developers, with visions of dollar signs dancing in their heads, include the iPhone as their targeted smartphone and the inventory of apps just keeps growing.</p>
<p>Taking GV Mobile and an alternative app called VoiceCentral out of inventory is Apple&#8217;s right. I think they will discover that it&#8217;s bad policy and establishes a precedent that will have a chilling effect on development and delivery of rich phone applications &#8211; meaning ones that can be highly responsive to phone users&#8217; fast-changing needs. TechCrunch blames the Apple/AT&#038;T mind-meld. After all, many users of BlackBerry&#8217;s and Android-based phones are already using a native versions of Google Voice and discovering the power (and pratfalls) of advanced call management, voice processing and speech recognition rolled into a single app. iPhone owners are bound to want the same capabilities and my belief is that Apple should comply.</p>
<p>Update: Here&#8217;s a <a href="http://twitition.com/xh8jk">link</a> to a &#8220;Twitition&#8221; urging Apple to &#8220;unblock&#8221; the Google Voice app for the iPhone. [Caveat: As a rule, I don't use a service that requires 're-entry' of an existing account name and password]</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2009/07/28/orwellian-aspects-to-the-app-store-strategy/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>New Attention to Mobile Voice Control (Thanks Apple and Om!)</title>
		<link>http://opusresearch.net/wordpress/2009/07/08/new-attention-to-mobile-voice-control-thanks-apple-and-om/</link>
		<comments>http://opusresearch.net/wordpress/2009/07/08/new-attention-to-mobile-voice-control-thanks-apple-and-om/#comments</comments>
		<pubDate>Wed, 08 Jul 2009 22:43:04 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Apple]]></category>
		<category><![CDATA[mobile search]]></category>
		<category><![CDATA[Speech recognition]]></category>
		<category><![CDATA[voice search]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=960</guid>
		<description><![CDATA["How Speech Technologies Will Transform Mobile Use" is a topic that's near and dear to my heart; but at this point I would frame the issue as "How Mobile Use Will Transform Speech Technologies."]]></description>
			<content:encoded><![CDATA[<p><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/07/Picture-9-150x150.png" align='right' HSPACE=5 vspace=5/>Because I have been an analyst in the automated speech domain for almost 20 years, I&#8217;m all for the introduction of interest among a fresh crop of knowledgeable proponents of new speech-enabled or &#8220;voice command&#8221; services. It was no mean feat for Apple to include Voice Control as one of the top features for the iPhone 3GS. It made voice input for command/control and dictation a peer of improved video, cut-and-paste, global search and other &#8220;most wanted&#8221; services.</p>
<p>Now GigaOm features <a href="http://gigaom.com/2009/07/07/is-iphones-voice-control-the-sound-of-things-to-come/">the opinion of guest columnist Phil Hendrix</a>, Founder and Director or the Institute for Mobile Markets Research. Somewhere behind the new GigaOm Pro&#8217;s paywall is a report from immr that professes to explain &#8220;How Speech Technologies Will Transform Mobile Use.&#8221; It&#8217;s a topic that&#8217;s near and dear to my heart; but at this point I might want to frame it as &#8220;How Mobile Use Will Transform Speech Technologies.&#8221;</p>
<p>1) Make the option for voice input ubiquitous &#8211; We&#8217;re getting there. The iPhone 3GS is a high-visibility device, but preloaded or downloaded apps that support speech input are already on hundreds of millions of handsets around the world. In addition, dial-up services for voice-enabled search (like GOOG411 or Microsoft/Tellme&#8217;s Bing411) have the potential to be a speed-dial button away when they add new territories and languages. </p>
<p>2) Do a better job of managing user expectations of accuracy &#8211; speech rec will never be 100% accurate. Think of all the times that you don&#8217;t understand what a person is saying to you on the first try. Why should wireless users expect machines to be any better. It&#8217;s important upfront and by multiple channels (demonstrations, video tutorials, Peer2Peer discussions), to make it clear that systems make best efforts to capture what is being said, but may not always &#8220;understand&#8221; meaning. It may seem like a hard logic to follow, but it is vital for maintaining the technology&#8217;s potential and the caller&#8217;s low blood pressure.</p>
<p>3) Showcase and support new use cases and tutorials &#8211; I think that we&#8217;ve only begun to present wireless subscribers with the sorts of options that put the technology in its best light. People love to break these things. It was in this post, that I pointed to the <a href="http://opusresearch.net/wordpress/2009/06/30/nytimes-column-highlights-google-voices-transcription-shortcomings/">NYTimes technology blog </a>in which David Gallagher logged the many transcription mistakes boldly displayed by Google Voice&#8217;s voicemail-to-text transcription. </p>
<p>4) Never make spoken input the only option &#8211; Note that I said make the &#8220;option&#8221; ubiquitous. Under no circumstances should an application require spoken input. Google understands this. So does Nuance, often regarded as a roll-up of speech processing technologies, but really a purveyor of platforms that accept (and in many cases predict) user input in the form of text, spoken words and short codes.</p>
<p>My problem with the NYTimes column and other vehicles for criticism is this: they are largely &#8220;descriptive&#8221; and not &#8220;prescriptive.&#8221; Instead of showing how easily broken the new interface is, how about demonstrating what it does quite well. It captures utterances and, like a marginally good stenographer, presents a &#8220;rough draft&#8221; of what was said, along with some notation of where the input might be &#8220;iffy.&#8221; Speakers should never think of this as a finished product. At best it should provide a form of talking triage &#8211; showing the speaker, or recipient where the transcription may need a bit of work or sculpting. </p>
<p>At base it&#8217;s an opportunity to game the system. If people like to make a game of breaking the application (by reading Jabberwocky to Google Voice, for instance), it makes more sense to cast yourself in the role of hero and fix what&#8217;s broken in a poorly transcribed message. If voice control is to be &#8220;the sound of things to come&#8221; for mobile subscribers, it requires a much more concerted effort than the shallow effort to integrate with the iPhone or the one-and-done guessing game that Google Voice plays. Handsfree dialing and text input through a Bluetooth headset or earbud would be a nice start. Well-prompted and choreographed messaging &#8211; be it addressing and dictation of SMS or friendly prompts to help originators or recipients triage transcribed voice mail &#8211; would be a cool way to start. </p>
<p>Mobile voice is a big area of opportunity. IBM had been working on it for years. Nuance, Google, Vlingo, Microsoft/Tellme, Spinvox, PhoneTag, Yap and a few others are fielding some pretty impressive products and services. Adoption will be no accident and, even though its been years in the making, it is still early days in product and market development. In other words: The Customer Has Not Spoken. </p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2009/07/08/new-attention-to-mobile-voice-control-thanks-apple-and-om/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>IBM Labs Boosting &#8220;The Spoken Web&#8221;</title>
		<link>http://opusresearch.net/wordpress/2009/06/19/ibm-labs-boosting-the-spoken-web/</link>
		<comments>http://opusresearch.net/wordpress/2009/06/19/ibm-labs-boosting-the-spoken-web/#comments</comments>
		<pubDate>Fri, 19 Jun 2009 17:12:55 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[internet2go]]></category>
		<category><![CDATA[Nuance]]></category>
		<category><![CDATA[voice search]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=820</guid>
		<description><![CDATA[<a href="http://www.economist.com/research/articlesBySubject/displayStory.cfm?story_id=13855374&#38;subjectID=894408&#38;fsrc=nwl">This article in the Economist magazine</a>, entitled, "A Web of Sound: Talk About That", reminded me that the legend of using VoiceXML to speech-enable the World Wide Web is alive, well and targeting the greater good by making Web sites more accessible to the illiterate. The article's author credits Guruduth Banavar, the director of IBM’s India Research Laboratory, with undertaking a project to make it easier to develop so called "voice sites" which enable callers to navigate the Web and retrieve personal information.]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2009/07/IBM-Logo2.png"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/07/IBM-Logo2.png" alt="" title="IBM-Logo" width="125" height="60" class="alignright size-full wp-image-1055" /></a><a href="http://www.economist.com/research/articlesBySubject/displayStory.cfm?story_id=13855374&amp;subjectID=894408&amp;fsrc=nwl">This article in the Economist magazine</a>, entitled, &#8220;A Web of Sound: Talk About That&#8221;, reminded me that the legend of using VoiceXML to speech-enable the World Wide Web is alive, well and targeting the greater good by making Web sites more accessible to the illiterate. The article&#8217;s author credits Guruduth Banavar, the director of IBM’s India Research Laboratory, with undertaking a project to make it easier to develop so called &#8220;voice sites&#8221; which enable callers to navigate the Web and retrieve personal information.</p>
<p>The &#8220;spoken Web&#8221; conjured up in the article, will ride the coattails of the growth in wireless subscribers. However, unlike their commercial cousins, the speech-enabled contact centers, these &#8220;voice sites&#8221; run on relatively small servers and are designed to be local or personal in nature. The article&#8217;s author calls them &#8220;portals through which people can find out such things as when the mobile hospital will next visit their village, the price of rice in the local market and which wells they should use for irrigation.&#8221; To support speech-based browing, IBM is employing a new linking mechanism called the hyperspeech transfer protocol (HSTP), which is the spoken equivalent the the hypertext transfer protocol which drives the &#8220;http://&#8221; in a visual browser&#8217;s navigation bar.</p>
<p>The development efforts are laudable and are a testament to the pervasive, global movement to extend the power of the Internet to mobile devices. There&#8217;s more than a little irony in the fact that the effort is characterized as part of a research effort, rather than a marketing or product development initiative. IBM&#8217;s rich history with VoiceXML goes back a decade and a half. It was cultivated by a Speech Products Group in Boca Raton, FL, that for much of its life-span occupied the same building where the original IBM PC was conceived and prototyped in the early 1980s. The Speech Group was successively absorbed into the now-defunct &#8220;Pervasive Computing&#8221; business unit and then &#8220;mainstreamed&#8221; out of existence when its core product &#8211; called WebSphere Voice Server &#8211; migrated into the huge catalogue of WebSphere-branded middleware and application servers.</p>
<p>The coup de gras for IBM Speech took place last January when IBM licensed a good deal of its source code to rival speech processing company, Nuance. In an <a href="http://opusresearch.net/wordpress/2009/01/20/nuance-licenses-speech-code-from-ibm-impacts-on-the-speech-engine-landscape/">advisory</a> that we published at the time, we called it a variation of &#8220;Win-Win-Win&#8221; formula. IBM would get upfront money for its licensed technology, Nuance would have a richer code-based on which to build future products and services and customers would benefit by having better products from both companies. </p>
<p>Instead, it has been back-to-the-future (more accurately back-to-the-lab) for IBM voiceXML efforts. Meanwhile, Nuance and its rivals are vying to make a living by making it possible for wireless subscribers to speak commands, search terms, navigational instructions and messages into their wireless devices. As a result, we anticipate a rich set of commercial products to come out this year. Some of the new devices and services may be &#8220;powered by IBM&#8221;, but in most cases &#8211; as with the &#8220;voice sites&#8221; and the development of HSTP, they will have a decidedly &#8220;alpha&#8221; test feel to them.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2009/06/19/ibm-labs-boosting-the-spoken-web/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

