<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Opus Research &#187; mobile speech</title>
	<atom:link href="http://opusresearch.net/wordpress/tag/mobile-speech/feed/" rel="self" type="application/rss+xml" />
	<link>http://opusresearch.net/wordpress</link>
	<description>Analysis and Expertise on Voice Services and Conversational Commerce</description>
	<lastBuildDate>Wed, 08 Feb 2012 18:55:10 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Novauris and SingTel Offer Mobile Voice Search in Singapore</title>
		<link>http://opusresearch.net/wordpress/2011/11/11/novauris-and-singtel-offer-mobile-voice-search-in-singapore/</link>
		<comments>http://opusresearch.net/wordpress/2011/11/11/novauris-and-singtel-offer-mobile-voice-search-in-singapore/#comments</comments>
		<pubDate>Fri, 11 Nov 2011 19:46:20 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[localization]]></category>
		<category><![CDATA[mobile commerce]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[Novauris]]></category>
		<category><![CDATA[voice search]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=4934</guid>
		<description><![CDATA[SingTel and Novauris launched a version of voice search that is tailored specifically for "the unique style of English" that is spoken on the nation-island of Singapore. ]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2011/11/singtel-logo1.jpg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2011/11/singtel-logo1.jpg" alt="" title="singtel-logo" width="144" height="57" class="alignright size-full wp-image-4937" /></a>Every press release or news item about speech-enabled mobile services now carries the sobriquet &#8220;Siri-like.&#8221; Its use is so frequent these days that it is highly likely that it will be added to the Oxford English Dictionary in the coming year. I mean this year the editors accepted the contemporary definitions of &#8220;OMG&#8221; and &#8220;LOL.&#8221; &#8217;nuff said.</p>
<p>The reason that the term &#8220;Siri-like&#8221; is appearing so often is that there has been an acceleration in the introduction and marketing of new speech-enabled services. One that&#8217;s worth noting is <a href="http://www.ereleases.com/pr/novauris-singtel-bring-local-voice-search-singapore-69256">SingTel deFIND</a>, a version of voice search that is tailored specifically for &#8220;the unique style of English&#8221; that is spoken on the nation-island of Singapore. </p>
<p>SingTel turned to UK-based Novauris to build the corpus of localized utterances. The two companies have successfully introduced an application that delivers details on local shops, restaurants or retailers when a user simply says the name. Users can also search by category (e.g. Filipino restaurant) and the application will use GPS and SingTel&#8217;s &#8220;InSing&#8221; database to show what businesses are nearby.</p>
<p>In the coming months, we expect to see the introduction of speech-enabled mobile search and assistance apps to accelerate and that means there are lots more opportunities for firms that specialize in this sort of localization. </p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2011/11/11/novauris-and-singtel-offer-mobile-voice-search-in-singapore/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Nuance&#8217;s Steve Chambers to Deliver Keynote at Voice Biometrics Conference New York  April 3rd &amp; 4th, 2012</title>
		<link>http://opusresearch.net/wordpress/2011/11/09/nuances-steve-chambers-to-deliver-keynote-at-voice-biometrics-conference-new-york-april-3rd-4th-2012/</link>
		<comments>http://opusresearch.net/wordpress/2011/11/09/nuances-steve-chambers-to-deliver-keynote-at-voice-biometrics-conference-new-york-april-3rd-4th-2012/#comments</comments>
		<pubDate>Wed, 09 Nov 2011 18:21:40 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[Nuance Communications]]></category>
		<category><![CDATA[Voice biometrics]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=4913</guid>
		<description><![CDATA[We’re pleased to announce that Nuance’s President, Sales and Marketing, Steve Chambers, will deliver a keynote address during the Voice Biometrics Conference-NYC April 3-4, 2012]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2011/11/SGC2-Photo.jpg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2011/11/SGC2-Photo-150x150.jpg" alt="" title="SGC2 - Photo" width="150" height="150" class="alignleft size-thumbnail wp-image-4915" /></a>Planning for <a href="http://bit.ly/rvwX9J">Opus Research’s 2012 Voice Biometrics Conference New York on April 3rd-4th</a> is in full swing and we’re pleased to announce that Nuance’s President, Sales and Marketing, Steve Chambers, will deliver a keynote address during the event. He will share his perspectives on how speaker verification, accurate speech recognition and artificial intelligence will help enterprises to anticipate a user&#8217;s intent and establish a foundation of trusted communications. Bringing these technologies together will provide a balance of secure interactions and a positive user experiences across channels including, traditional phones, mobile devices and e-commerce Web-sites.</p>
<p><a href="http://bit.ly/uKWI9o ">Register Now &#8211; Discounted Offer Ends November 30th</a>:<br />
The first 30 people to register for Voice Biometric Conference New York can register for $299.00  That&#8217;s over half off the full conference rate of 699.00.  Use registration code, &#8216;earlyvbc&#8217;  to receive the special rate.  </p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2011/11/09/nuances-steve-chambers-to-deliver-keynote-at-voice-biometrics-conference-new-york-april-3rd-4th-2012/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Twilio Positioning SMS as a Pre-API for Siri Development Efforts</title>
		<link>http://opusresearch.net/wordpress/2011/11/03/twilio-positioning-sms-as-a-pre-api-for-siri-development-efforts/</link>
		<comments>http://opusresearch.net/wordpress/2011/11/03/twilio-positioning-sms-as-a-pre-api-for-siri-development-efforts/#comments</comments>
		<pubDate>Thu, 03 Nov 2011 18:32:40 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Apple]]></category>
		<category><![CDATA[Mobile Apps]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[Siri]]></category>
		<category><![CDATA[Twilio]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=4889</guid>
		<description><![CDATA[Twilio is encouraging developers to come up with interesting new applications for the iPhone 4S using the Twilio platform for SMS as a quasi-API.]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2011/09/twiliologo.jpg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2011/09/twiliologo.jpg" alt="" title="twiliologo" width="151" height="60" class="alignright size-full wp-image-4784" /></a>In <a href="http://www.twilio.com/contests/2011/10/siri-video-developer-contest.html">this post</a>, the peripatetic promoter of cloud-based phone hacks, Twilio, encourages developers to come up with interesting new applications for the iPhone 4S, taking advantage of speech-based assistant, <a href="http://opusresearch.net/wordpress/2011/10/04/siri-beta-assumes-primacy-on-iphone-4s-home-button/">Siri</a>. </p>
<p>Initiatives and contests like this one illustrate one more reason why Apple&#8217;s introduction of Siri is a signal event for mobile speech. The &#8220;application&#8221; (placed in quotes for reasons I will explain shortly) has its limits. In fact, it is not really an application in the traditional sense of the word. Like many of the downloadable &#8220;speech-enablers,&#8221; Siri defies categorization. There are &#8220;command and control&#8221; elements that fall in the category of &#8220;utility.&#8221; There are dictation and messaging components that make it a &#8220;communications&#8221; app. Finally, there are (or were) the links to 3rd party web sites that enabled Siri to transform the iPhone into a personal assistant. </p>
<p>The pros and cons of the speech-based mobile assistant were tossed around most recently when Google&#8217;s <a href="http://allthingsd.com/20111019/android-chief-says-your-phone-should-not-be-your-assistant/">Andy Rubin dismissed the idea at an AsiaD (an All Things D conference)</a>. The gist of his criticism was that he&#8217;d &#8220;been-there-done-that-and-it-failed,&#8221; with reference to two speech-enabled personal digital assistants. One was <a href="http://en.wikipedia.org/wiki/General_Magic">General Magic</a>, which was spun out of Apple Computer back in 1990 and had a few, high-visibility partnerships, including Sony, Motorola and AT&#038;T among others. </p>
<p>In hindsight, Rubin may see General Magic as a failure but, in fact, its engineers designed and developed a new operating system (Magic Cap) and scripting language (Telescript) that were precursors VoiceXML and efforts to create tools that support agile programming for speech-based, conversational interfaces. The technologies that started in General Magic live on in the automated speech offerings of GM OnStar. And somewhere among the intellectual property vault owned by Microsoft co-founder Paul Allen&#8217;s Vulcan Ventures are General Magic&#8217;s patents, which were bought at auction in 2002.</p>
<p>Rubin also made mention of Wildfire Communications, Inc., a company founded by Rich Miner, who is now a partner at Google Ventures. But Wildfire&#8217;s experience is quite different from General Magic. Founded in 1991, Wildfire built a very loyal following for its speech-enabled services which, at the time, were largely built around management of voice and telephony functions, like voicemail management, call origination, call answering and the like. In 2002, France Telecom&#8217;s Orange Wireless bought the company for $147 million and offered the service to its mobile constituency.</p>
<p>At the time, the service was well-received by mobile customers but, because it was originally engineered as an enterprise app, Orange realized that it would have to re-engineer the underlying technology platform in order to offer the service in sufficient scale. Instead, the telco opted to shutter the service in 2005. As <a href="http://www.theregister.co.uk/2005/07/05/orange_wildfire/">this article</a> by Tim Richardson in The Register explains, shutting down the service took longer than anticipated because of the protests of a loyal following of Wildfire users who, to this day, feel like Orange was too hasty in its decision to cease the offering.</p>
<p>Siri (as an Apple initiative) shares quite a few of the attributes of both General Magic and Wildfire that attracted the attention and imagination of developers. The big difference today is that modern technology around computing power and storage support offering the service economically at scale. In addition, even without a formal API, the creative energy of 3rd party developers can be applied to enhancing the service using tools and scripting languages that have evolved into agile environments since the days of Magic Cap and Telescript.</p>
<p>Google has reason to be dismissive of Siri because it is important to call into question its ability to provide answers to questions that used to be the sole domain of the Google Search box (and therefore a source of advertising supported revenue for Google). But it can equally be argued that Voice Search and Voice Actions on the Android platform will benefit from general acceptance of speech-enabled assistants, like Siri. We have to see whether and when Apple introduces Siri as a downloadable app that runs on other devices and how well it (re)integrates the service with popular destination sites like Yelp!, OpenTable, Fandango, etc. Today Vlingo and Nuance&#8217;s DragonGo! have an advantage when supporting mobile ecommerce.</p>
<p>Greg Sterling and I will be issuing a report on &#8220;Mobile Speech Applications and Services&#8221; in the coming month. In it we will assess current initiatives and provide our insights and perspectives on the ultimate impact on local search and conversational commerce.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2011/11/03/twilio-positioning-sms-as-a-pre-api-for-siri-development-efforts/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Siri Factor, Three Days On</title>
		<link>http://opusresearch.net/wordpress/2011/10/16/the-siri-factor-three-days-on/</link>
		<comments>http://opusresearch.net/wordpress/2011/10/16/the-siri-factor-three-days-on/#comments</comments>
		<pubDate>Sun, 16 Oct 2011 15:52:34 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Apple]]></category>
		<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[iPhone]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[Siri]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=4848</guid>
		<description><![CDATA[While most people are marveling at how well Siri recognizes and fulfills on many of their intentions, the inevitable criticism has begun.]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2011/10/Unknown.jpeg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2011/10/Unknown.jpeg" alt="" title="Siributton" width="120" height="120" class="alignright size-full wp-image-4849" /></a>While most people are marveling at how well Siri recognizes and fulfills on many of their intentions, the inevitable criticism has begun. In this post called<a href="http://www.talkingpointz.com/siriously-this-sucks"> &#8220;SIRIously This Sucks!</a>&#8221; Colin Berkshire, a guest contributor on Dave Michels&#8217; new blog, recites a litany of deficiencies in the new service. The gist is that Siri is good at doing tasks that its developers anticipated &#8211; like setting the alarm clock, dictating text messages, getting directions &#8211; but &#8221; if you stray much off the beaten path it is like playing twenty questions with a belligerent two year old.&#8221;</p>
<p>Some speech app developers have piled by noting that the service should do more &#8220;on the device.&#8221; It is crippled when the data link to the server is down (which happens quite a bit over AT&#038;T&#8217;s network &#8211; at least in SF). I would also note that Apple made no friends by discontinuing the Siri App for those (like me) who have it on their plain vanilla iPhone 4s with several more months on their contracts.</p>
<p>Sight unseen, I take the attitude that this is the reason Siri made the transition from approved app in the iTunes store to &#8220;beta&#8221; version of a native feature (meaning it ships pre-loaded and accessible through the &#8220;Home&#8221; button).</p>
<p>I take the attitude that this rendition of Siri is the worst one that the general public will encounter and that it can only get better. This started me thinking of computer graphics for the movies. Anyone who saw the first Star Wars was totally &#8220;wow&#8217;d!&#8221; and had little idea how much better it would get. Meanwhile, the producers of the film were already seeing all its faults and telling themselves that they were spending too much time on the stupid stuff like making sure that the strings holding up models of starfighters.</p>
<p>Even in the days of Pixar, they are ever-improving computer generated images in subtle ways that make for a better viewer experience. The animators of the first &#8220;Toy Story&#8221; told themselves that &#8220;this is the worst looking movie we will ever make.&#8221; And so it was.</p>
<p>We should weather the criticism of &#8220;SIRIously sucking&#8221; that I&#8217;ve seen. We can only hope that the data link between device and server gets more consistent because the marriage of AI and speech rec that is required to provide a consistently successful user experience depends on it. And we need a better way for the app to work when the data link is down.</p>
<p>I&#8217;m pretty sure that Apple and the Siri folks are already addressing these issues. </p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2011/10/16/the-siri-factor-three-days-on/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Siri (beta) Assumes Primacy on iPhone 4S Home Button</title>
		<link>http://opusresearch.net/wordpress/2011/10/04/siri-beta-assumes-primacy-on-iphone-4s-home-button/</link>
		<comments>http://opusresearch.net/wordpress/2011/10/04/siri-beta-assumes-primacy-on-iphone-4s-home-button/#comments</comments>
		<pubDate>Tue, 04 Oct 2011 21:53:07 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Apple iPhone]]></category>
		<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[Nuance]]></category>
		<category><![CDATA[Siri]]></category>
		<category><![CDATA[Vlingo]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=4819</guid>
		<description><![CDATA[Apple may have disappointed the world by not introducing an iPhone5, but the one area where the new device does not fall short is that it marks the re-emergence of Siri, which Apple acquired in April of 2010.]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2010/04/Siri_logo.jpg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2010/04/Siri_logo.jpg" alt="" title="Siri_logo" width="144" height="99" class="alignright size-full wp-image-2773" /></a>Thanks to a surfeit of hype surrounding the imminent introduction of the iPhone5, the whole world was watching as Apple introduced (drum roll, please!) the Apple 4S. The disappointment was palpable among the analysts who, based on &#8220;leaks&#8221; to the Wall Street Journal and elsewhere, were expecting a larger screen, better graphics and faster connections. But the one area where the new device does not fall short is that it marks the re-emergence of Siri, which Apple acquired in April of 2010.</p>
<p>On the iPhone 4S running iOS 5.0, holding the Home Button down for 10 seconds will start Siri. For many handsfree Bluetooth earbuds, holding the &#8220;activate&#8221; key down will do the same. Once invoked, Siri lets phone users speak to their phones in their own words. In the demo, Apple iOS engineer Scott Forstall showed three different ways to get Siri to provide local weather, ranging from &#8220;What&#8217;s the weather?&#8221; to &#8220;Do you think it will rain?&#8221; [Here's <a href="http://www.youtube.com/watch?v=L4D4kRbEdJw&#038;feature=player_embedded">the commercia</a>l - complete with Ray Charles in the background - that Apple will be running to support Siri]</p>
<p>On prior versions of the iPhone, holding the Home Button down for 10 seconds invokes &#8220;Voice Control&#8221; which enables phone owners to use their voice to control playback on their iPhone or initiate phone calls. Deep integration of Siri enables users to treat their phones as a voice-activated assistant with the ability to conduct Web searches, dictate and play back text messages and carry out voice commerce (by search Yelp! or OpenTable to choose restaurants and book seats), in addition to voice dialing and media control.</p>
<p>The introduction of a Siri-powered Voice Assistant represents the &#8220;re-emergence&#8221; of Siri because a Siri app is already available as a free download from the iTunes AppStore. iPhone owners who now launch the app are greeted with this message:</p>
<blockquote><p>I&#8217;ve been replaced! The new Siri is even smarter and better looking than me [sic], and waiting for you on the iPhone 4S. Ill be leaving for home Oct 15th</p></blockquote>
<p>This signals that Apple is destined to de-activate Siri in two weeks, when it may move to the home button on all iPhones. But, in the mean time, it enables iPhone owners to use &#8220;plain English&#8221; (or the language of their choice) to find restaurants, nearby movies by title and showtimes, events and businesses. They can ask Siri about the weather (as was demonstrated in the live demo at the iPhone 4S launch), order a taxi, request reminders and even originate Twitter posts.</p>
<p>Introduction of Siri for the iPhone 4S represented a step forward for the service. Activation with the Home Button simplifies service activation. Close integration with message origination is an important enhancement. Most importantly, Apple &#8211; because it failed to introduce the iPhone 5 &#8211; in effect made Siri, and natural language interaction over its mobile devices, the center of attention. The only other features that merited mention as bold moves forward were the dual mode (CDMA/GSM) aspects of the new phone &#8211; making it in effect a &#8216;world phone,&#8221; and supposedly higher data transmission speeds.</p>
<p>The only step back was the characterization of Siri as a &#8220;beta&#8221; product. Phil Schiller, head of product marketing for Apple, sounded almost apologetic as he introduced Siri &#8220;as a beta version,&#8221; as if, were it to fail, he could say, &#8220;I told you so.&#8221; That said, the demos worked as anticipated and the crowd seemed impressed. In addition to getting weather information in three different ways, Forstall used Siri to set the phone&#8217;s alarm clock, listened to and replied to a colleague&#8217;s text message and provided instructions to the iPod application.</p>
<p>The relationship of Siri (which is &#8220;powered by Nuance&#8221;) to<a href="http://opusresearch.net/wordpress/2011/09/13/understanding-the-intent-of-nuance-mobile-advantage/"> Nuance&#8217;s Dragon Go! </a>is something of a conundrum. Both let users speak to their phones to use Phil Schiller&#8217;s words, &#8220;the way they wanna talk.&#8221; But Siri is initiated from the Home Button. Vlingo is in the running as well. It has been true to the <a href="http://opusresearch.net/wordpress/2011/02/10/vlingos-reveals-its-2011-virtual-assistant-roadmap/">Virtual Assistant roadmap</a> that it outlined to us earlier this year. </p>
<p>Apple&#8217;s announcement of Siri should have a ripple effect as millions of iPhone users recognize that speech recognition works reliably and has been well integrated with the services that they regularly use. It feels like speech enabled mobile services have reached an important inflection point.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2011/10/04/siri-beta-assumes-primacy-on-iphone-4s-home-button/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Waze Talks (on both iPhone and Android)! Thanks to Nuance&#8217;s Vocalizer Network</title>
		<link>http://opusresearch.net/wordpress/2011/09/29/waze-talks-on-both-iphone-and-android-thanks-to-nuances-vocalizer-network/</link>
		<comments>http://opusresearch.net/wordpress/2011/09/29/waze-talks-on-both-iphone-and-android-thanks-to-nuances-vocalizer-network/#comments</comments>
		<pubDate>Thu, 29 Sep 2011 21:27:39 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[app development]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[navigation]]></category>
		<category><![CDATA[Nuance]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=4802</guid>
		<description><![CDATA[The engineers at Waze have steadily added to the app's voice-based features and functions. As noted in the company's "By the Waze" blog, TTS-based driving instructions have been a feature of the "Commuting Widget" available for download from the Android Marketplace. ]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2011/09/Wazescreenshot.png"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2011/09/Wazescreenshot.png" alt="" title="Wazescreenshot" width="227" height="341" class="alignright size-full wp-image-4805" /></a>In February (2011) we posted <a href="http://opusresearch.net/wordpress/2011/02/10/waze-adds-voice-features-to-crowdsourced-traffic-reporting-app/">this note</a> about speech enabling Waze, the crowdsourced traffic reporting app for smartphones. At the time, we noted that speech input and output would be a necessity for a truly &#8220;hands-free/eyes forward&#8221; navigational app for smartphones.</p>
<p>True to their word, the engineers at Waze have steadily added to the app&#8217;s voice-based features and functions. As noted in <a href="http://www.waze.com/blog/great-new-waze-version-features-tts-voice-guidance-android-commuting-widget/">this post</a> in the company&#8217;s &#8220;By the Waze&#8221; blog, TTS-based driving instructions have been a feature of the &#8220;Commuting Widget&#8221; available for download from the Android Marketplace. Then, on September 22, <a href="http://www.marketwatch.com/story/nuance-gives-voice-to-waze-app-for-iphone-and-android-2011-09-22">Nuance Communications let it be known</a> that the voice rendering of navigational advice in the Waze applications for both Android and iOS-based devices was using a cloud-based instantiation of its flagship text-to-speech rendering software called Vocalizer Network as invoked from Nuance Vocalizer Studio.</p>
<p>In a separate, but related story, Nuance also announced that it had retooled the packaging of the services and features of the Nuance Mobile Developer (NDEV-Mobile) program. As described in <a href="http://www.marketwatch.com/story/nuance-unveils-new-ndev-mobile-developer-program-complete-with-free-access-to-renowned-dragon-mobile-voice-technology-2011-09-27">this press release</a>, the new configuration offers third-party developers three distinct tiers to simplify how they use Nuance&#8217;s Dragon speech recognition on multiple mobile platoforms, including Windows Phone 7, in addition to iOS, Android and an http-based Web interface. Since its inception in January 2011, the program has registered over 4,000 mobile app developers and, according to the Nuance Press Release, it &#8220;has voice-enabled some of the market&#8217;s most popular apps, including Siri, Price Check by Amazon, Ask for iPhone, Merriam-Webster, Dictionary.com, RemoteLink from OnStar, SpeechTrans, Yellow Pages and AirYell from Avantar, iTranslate, Taskmind, SayHi Translate, Vocre, Bon&#8217;App&#8221; among others. </p>
<p>The ability to voice enable search, navigation and translation has captured the imagination and energy of mobile app developers. But the notion of knitting each function together to act as a personal agent is the ultimate enticement.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2011/09/29/waze-talks-on-both-iphone-and-android-thanks-to-nuances-vocalizer-network/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Butterfly Effect: Did Nuance&#8217;s Acquisition of Loquendo Kill HP&#8217;s WebOS?</title>
		<link>http://opusresearch.net/wordpress/2011/08/18/the-butterfly-effect-could-nuances-acquisition-of-loquendo-killed-hps-webos/</link>
		<comments>http://opusresearch.net/wordpress/2011/08/18/the-butterfly-effect-could-nuances-acquisition-of-loquendo-killed-hps-webos/#comments</comments>
		<pubDate>Thu, 18 Aug 2011 22:52:53 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[HP]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[WebOS]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=4740</guid>
		<description><![CDATA[For many followers of the tablet wars, the death of HP's TouchPad was a foregone conclusion. But I had high hopes.]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2011/08/Screen-shot-2011-08-18-at-3.40.52-PM.png"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2011/08/Screen-shot-2011-08-18-at-3.40.52-PM-e1313707820168.png" alt="" title="Screen shot 2011-08-18 at 3.40.52 PM" width="151" height="75" class="alignright size-full wp-image-4742" /></a>For many followers of the tablet wars, the death of HP&#8217;s TouchPad was a foregone conclusion. But I had high hopes. To follow the common meme there was no way that HP, an enterprise-facing company that sells system integration services and ink cartridges, was going to make a dent in Apple Computer&#8217;s sales of iPads and other iOS-based devices as they win the hearts and minds of everyday people (as opposed to those everyday people in their roles as enterprise employees). </p>
<p>On the negative side, the TouchPad has been victimized by a flurry of bad reviews and complaints about pricing. On the positive side, Google&#8217;s acquisition of Motorola Mobility fueled speculation that competing mobile device manufacturers like HTC, Samsung, LG and other Android adherents, might look to HP and WebOS to provide a competitive product. </p>
<p>To compete with Apple and Google (cum Moto) HP would have to be at parity or better with their basic offerings, in terms of basic functionality. Both the iPhone and Android phones are speech-enabled both from the home screen and the &#8220;search box.&#8221; Apple relies heavily on Nuance, while Google benefits from its own extensive investment in both speech recognition and natural language processing. Collectively, they provide a pleasing and effective user interface.</p>
<p>Speech recognition &#8211; from voice dialing to form filling and search &#8211; has been on the WebOS wishlist dating back to rev 2.0. Version 3.0.0 offered some semblance of voice dialing, but (though I admittedly have not had a demonstration unit to play with) there is no native speech recognition. There is only a short list of independent, embedded speech processing software providers. By acquiring SVOX and now Loquendo, Nuance has steadily reduced their ranks. With WebOS under fire from on so many other fronts (from short battery life to a paucity of applications), the inability to support spoken input is just one more factor in the negative column.</p>
<p>HP is in a major repositioning mode and that is likely to focus almost exclusively on the enterprise market. WebOS is in &#8220;hibernation&#8221; and &#8220;consumer&#8221; models for PCs and laptops are being removed from marketing plans. That said, I just got <a href="http://info.hp.com/w/webView?cid=17625958766&#038;mid=1014162004&#038;pid=1388796&#038;vid=13597&#038;ee=ZG1pbGxlckBvcHVzcmVzZWFyY2gubmV0&#038;si=&#038;mv=H&#038;bv=H&#038;oc=N&#038;sc=&#038;k=ySk9K">this email promotion</a> from HP (obviously sent by a direct mail robot) marketing the TouchPad and accessories. Obviously, it isn&#8217;t over til it&#8217;s over.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2011/08/18/the-butterfly-effect-could-nuances-acquisition-of-loquendo-killed-hps-webos/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Updated: No Difference Between Native and Captive: Apple To Leverage Both Siri and Nuance</title>
		<link>http://opusresearch.net/wordpress/2011/05/10/no-difference-between-native-and-captive-apple-to-leverage-both-siri-and-nuance/</link>
		<comments>http://opusresearch.net/wordpress/2011/05/10/no-difference-between-native-and-captive-apple-to-leverage-both-siri-and-nuance/#comments</comments>
		<pubDate>Tue, 10 May 2011 21:33:46 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Apple]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[multimodal]]></category>
		<category><![CDATA[Nuance]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=4450</guid>
		<description><![CDATA[With Microsoft plunking down $8.5 billion of its $36 billion in cash and near cash to buy Skype, a few analysts have started to take a closer look at the $25 billion in cash and short term investments on Apple's balance sheet and apparently concluded that it is contemplating a deal with Nuance Communications.]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2011/05/nuanceapplelogos.png"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2011/05/nuanceapplelogos.png" alt="" title="nuanceapplelogos" width="151" height="71" class="alignright size-full wp-image-4456" /></a>With Microsoft plunking down $8.5 billion of its $36 billion in cash and near cash to buy Skype, a few analysts have started to take a closer look at the $25 billion in cash and short term investments on Apple&#8217;s balance sheet and apparently concluded that it is contemplating a deal with Nuance Communications. The discussion started at about the time that Google, Microsoft and Facebook were said to be in a three-way auction for Skype, whose S-1 filing (in preparation for an initial public offering of common stock) reflected a net loss in 2010 on revenues of about $850 million. <a href="http://techcrunch.com/2011/05/06/apple-nuance-ios-siri/">MG Siegler wrote a piece in TechCrunch</a> that described yet another three-way relationship. This time the dynamics involve Apple as parent of mobile assistance service provider, Siri (which <a href="http://opusresearch.net/wordpress/2010/04/28/if-true-apples-purchase-of-siri-heralds-new-age-of-virtual-assistance/">Apple acquired almost exactly one year ago</a>), and long-time customer/partner of Nuance, which sources both the speech processing capabilities that power Voice Control in iOS platforms and a number of downloadable apps to support dictation and predictive input of text-based content.</p>
<p>Yesterday Siegler published <a href="http://techcrunch.com/2011/05/09/apple-nuance-data-center-deal/">this story</a>pinpointing Apple&#8217;s new data center in the hills of North Carolina as the locus where at least some of the servers will be running instantiations of both Siri and Nuance-based applications so that mobile and hybrid apps running on the new iOS can take optimize the interplay between speech-recognition or predictive texting, along with application logic and &#8220;artificial intelligence&#8221; to understand intent and deliver results. As Greg Sterling points out in <a href="http://www.internet2go.net/news/mobile-platforms/apple-discussions-nuance-broaden-speech-control-iphone">this post on Internet2Go</a>, Apple&#8217;s iOS-based experience has bit of catching up to do vis-a-vis Google&#8217;s Android-based devices. </p>
<p>As I noted in <a href="http://opusresearch.net/wordpress/2010/08/16/voice-actions-for-android-speechable-moments-from-google-spell-new-market-dynamics/">this pos</a>t in August 2010, Google&#8217;s &#8220;Voice Actions&#8221; conditioned both users and application developers to expect spoken utterances to be one of the input modalities across all applications. A month later I described <a href="http://opusresearch.net/wordpress/2010/09/10/google-has-home-field-advantage-on-the-android-home-page/">Google&#8217;s &#8220;home field advantage&#8221;</a> when it introduced the many ways that a set of widgets could be used in Android that, in essence, made speech processing &#8220;native&#8221; to the operating system and therefore, of consistent use starting with the Home Screen and spanning all applications (like search) and utilities (like texting or dictation). Indeed, at Google I/0, Vlingo is showing off the latest version of its &#8220;Virtual Assistant&#8221; for Android-based phones. On a Samsung Galaxy 2, Vlingo is showing off speech-based access and control to a multiplicity of functions directly from the home page, but [contrary to what I may have implied here before] the Vlingo app connects directly to ASR resources and applications in Vlingo&#8217;s cloud.</p>
<p>Apple has been signaling its intent to meet and exceed Google&#8217;s speech-based offerings for a number of years now. In doing so, it has formed a broad (but not highly publicized) relationship with Nuance as provider of speech processing and predictive input for a broad spectrum of products and services. Followers of Siri know that roughly a month before its formal product launch in February 2010, it switched from its long-time speech recognition vendor to Nuance. At the time, it was thought to be avoiding a lawsuit.</p>
<p>From the mobile user&#8217;s point of view, there is no difference between Native and Captive. Google may have been first to market with speech-enabled services that smooth over the speed bumps between siloed applications. Apple, with an assist from &#8220;native&#8221; implementations of Nuance-based technology mated with &#8220;captive&#8221; Siri&#8217;s formidable combination of application logic and dynamic, e-commerce oriented data flows will try to meet and exceed Google&#8217;s efforts to provide the most pleasing user experience for goal-oriented mobile subscribers. The approach, which has been underway for more than a year now, obviates the need for Apple to spend billions of dollars to buy Nuance, but it will require a long-term relationship akin to the three-year joint development agreement between Nuance and IBM. </p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2011/05/10/no-difference-between-native-and-captive-apple-to-leverage-both-siri-and-nuance/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google Translate App is Now Available for iPhone Users</title>
		<link>http://opusresearch.net/wordpress/2011/02/08/google-translate-app-is-now-available-for-iphone-users/</link>
		<comments>http://opusresearch.net/wordpress/2011/02/08/google-translate-app-is-now-available-for-iphone-users/#comments</comments>
		<pubDate>Wed, 09 Feb 2011 04:39:58 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[Mobile Translation]]></category>
		<category><![CDATA[Recombinant Communications]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=4097</guid>
		<description><![CDATA[Google officially launched its own speech-to-speech translation application for iPhone users to download from Apple's ITunes store.]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2009/11/Google_logo.jpg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/11/Google_logo.jpg" alt="" title="Google_logo" width="150" height="59" class="alignright size-full wp-image-1943" /></a>Today <a href="http://googletranslate.blogspot.com/2011/02/introducing-google-translate-app-for.html">Google officially launched its own speech-to-speech translation application for iPhone</a> users to download from Apple&#8217;s ITunes store. Did they wait for Verizon Wireless to start offering the iPhone? We&#8217;ll never know. Google&#8217;s offering is not the first of its kind. I, myself, (looking only at &#8220;free&#8221; apps for the iPhone) have loaded Trippo(tm) (from Cellictica), Arabic Buddy (Sakhr) and SnapTranslate (powered by Beyo). Each has a point of differentiation such as scanning and reading road signs in non-Arabic alphabets or offering high accuracy in Arabic languages. In the spirit of Recombinant Communications many of them already use the <a href="http://code.google.com/apis/language/translate/overview.html">Google Translate API</a> after using their own or a third-party&#8217;s speech recognition/transcription resources. Trippo, for instance, employs Nuance&#8217;s speech recognition and transcription (Dragon) as well as its text-to-speech synthesis software. </p>
<p>Google uses its own speech recognition/dictation resources as well as Google Translate to perform the transcription and translation operations. It is thought to use SVOX&#8217;s text-to-speech rendering software to recite the translated output, although its <a href="http://opusresearch.net/wordpress/2010/12/03/googles-latest-acquisition-brings-text-to-speech-luminaries-into-its-fold/">acquisition of Phonetic Arts</a> leads me to believe that the search giant is getting ready to incorporate its own text-to-speech rendering (probably with a human-sounding voice that conveys emotion.</p>
<p>Does Google&#8217;s entry spell the end for competing products? Not necessarily. As <a href="http://www.internet2go.net/news/europe/telenav-holding-its-own-vs-google-navigation">Greg Sterling noted in a post on Internet2Go</a>, the &#8220;free&#8221; Google Navigation application for Android phones barely made a dent in Telenav&#8217;s business. The take-away is that creativity and attention to a great user experience is rewarded in the marketplace. Google&#8217;s technologies for speech recognition, translation and synthesis will continue to be important resources for its own products and services, but other, product-oriented companies will continue to introduce services that successfully bond with customers, even if they compete directly with the technology that they are employing.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2011/02/08/google-translate-app-is-now-available-for-iphone-users/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Nuance Offers Vocon &#8220;App Module&#8221; for Personal Navigation</title>
		<link>http://opusresearch.net/wordpress/2011/02/07/nuance-automotive-offers-vocon-app-module-for-personal-navigation-devices/</link>
		<comments>http://opusresearch.net/wordpress/2011/02/07/nuance-automotive-offers-vocon-app-module-for-personal-navigation-devices/#comments</comments>
		<pubDate>Mon, 07 Feb 2011 18:09:15 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[automotive]]></category>
		<category><![CDATA[mobile applications]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[navigation]]></category>
		<category><![CDATA[Nuance Communications]]></category>
		<category><![CDATA[Recombinant Communications]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=4085</guid>
		<description><![CDATA[Nuance is making it easier for mobile application developers to add voice input and output to personal navigation applications by offering Vocon Navigation as a pre-packaged "App Module" to support "one-shot destination entry." ]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2009/04/nuance_logo.jpg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/04/nuance_logo.jpg" alt="" title="nuance_logo" width="117" height="75" class="alignright size-full wp-image-356" /></a>Nuance is making it easier for mobile application developers to add voice input and output to personal navigation applications by <a href="http://www.nuance.com/company/news-room/press-releases/NC_008248">offering Vocon Navigation as a pre-packaged &#8220;App Module&#8221;</a> to support &#8220;one-shot destination entry.&#8221;  Vocon is Nuance&#8217;s long-standing line of embedded speech processing technology. It has been successfully integrated into the &#8220;One-Shot Destination Entry&#8221; offerings integrated into the PNDs and navigation applications in models marketed by Audi, BMW, Ford and Mercedes-Benz. The Vocon Navigation App Module is designed to leverage the creative energy of a broader community of mobile application developers by packaging the core software with grammars, text-to-speech capabilities, dialog flow and links to the graphical user interface. </p>
<p>Several languages and dialects are supported, as are frameworks and software to support mobile, multi-modal interactions. You can keep up to date by referring to <a href="http://www.nuance.com/for-business/by-solution/automotive-products-and-solutions/index.htm">this descriptive page</a> on the Nuance Web site. </p>
<p>It is becoming increasingly evident that we are well into an era that takes us &#8220;Beyond the SDK&#8221; in the mobile world. While this offering does not fall into the category of a speech API, it does overlap with many of the initiatives that mobile API providers have launched to fire up the creative juices of the app developer community. As the makers of &#8220;tools,&#8221; &#8220;reusable code,&#8221; and &#8220;development environments well know, developers gravitate toward the well-understood, well-documented and familiar. Spoken input and output from mobile navigation devices and apps conform to those three criteria.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2011/02/07/nuance-automotive-offers-vocon-app-module-for-personal-navigation-devices/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

