<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Opus Research &#187; Nuance</title>
	<atom:link href="http://opusresearch.net/wordpress/tag/nuance/feed/" rel="self" type="application/rss+xml" />
	<link>http://opusresearch.net/wordpress</link>
	<description>Analysis and Expertise on Voice Services and Recombinant Communications</description>
	<lastBuildDate>Thu, 29 Jul 2010 23:57:39 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Action in the Mobile Voice Front</title>
		<link>http://opusresearch.net/wordpress/2010/07/15/action-in-the-mobile-voice-front/</link>
		<comments>http://opusresearch.net/wordpress/2010/07/15/action-in-the-mobile-voice-front/#comments</comments>
		<pubDate>Thu, 15 Jul 2010 18:40:36 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Apple]]></category>
		<category><![CDATA[Mobile Platforms]]></category>
		<category><![CDATA[mobile voice]]></category>
		<category><![CDATA[Nuance]]></category>
		<category><![CDATA[Vlingo]]></category>
		<category><![CDATA[voice user interface]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=3220</guid>
		<description><![CDATA[Mobile voice technology providers Apple, Vlingo and Nuance took actions that, to varying degrees, turn up the heat in the world of mobile voice. ]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2010/07/nipper.jpg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2010/07/nipper.jpg" alt="" title="nipper" width="121" height="88" class="alignright size-full wp-image-3229" /></a>Mobile voice technology providers Apple, Vlingo and Nuance took actions that, to varying degrees, turn up the heat in the world of mobile voice. For its part, Apple has been granted yet another patent for a major component of a hands-free, voice user interface (VUI). In <a href="http://patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO1&#038;Sect2=HITOFF&#038;d=PALL&#038;p=1&#038;u=/netahtml/PTO/srchnum.htm&#038;r=1&#038;f=G&#038;l=50&#038;s1=7,757,173.PN.&#038;OS=PN/7,757,173&#038;RS=PN/7,757,173">U.S. Patent Number 7,757,173</a> the inventor describes a dynamic or &#8220;updateable&#8221; voice menu. As described in the filing, the technology is designed to offer many of the context-sensitive attributes of a dynamic, graphical user interface for search and retrieval of &#8220;media&#8221;, like recorded music; but the filing notes that &#8220;songs&#8221; or &#8220;music&#8221; could be &#8220;generalized to any form of digital media, which can include sound files, picture data, movies, text files or any other types of media that can be digitally stored on a computer.&#8221;</p>
<p>Some of what Apple describes conceptually, Vlingo is putting into practice with the r<a href="http://">elease of its SuperDialer for Android</a> application. Greg Sterling writes about it <a href="http://www.internet2go.net/news/local-search/vlingo-wants-take-siris-place">here</a>, noting that it is designed to take on Siri for local, mobile search. Yet, with &#8220;SuperDialer&#8221; Vlingo is delivering an easy-to-understand use case for a voice-based front end to messaging resources, social networks, search and, ultimately transactions.</p>
<p>Nuance, for its part, reminds us that the automobile is destined to be the ultimate smart, mobile device. The companies have jointly <a href="http://www.nuance.com/news/pressreleases/2010/20100715_MyFordTouch.asp">expanded the range of speech-enabled features and functions it is offering in conjunction with Ford as part of &#8220;MyFord Touch&#8221;</a>. By adding more first-level commands and making the interface more dynamic and personal, the initiative is designed to make a person&#8217;s voice &#8220;the primary in-car communications interface.&#8221; </p>
<p>Establishing the primacy of a user&#8217;s voice for command and information entry in cars and on smartphones remains a tall order, but the speed at which solutions providers introduce new refinements is definitely accelerating.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/07/15/action-in-the-mobile-voice-front/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Nuance and MetaSwitch Offer &#8220;Carrier Optimized&#8221; Voicemail-To-Text</title>
		<link>http://opusresearch.net/wordpress/2010/06/22/nuance-and-metaswitch-offer-carrier-optimized-voicemail-to-text/</link>
		<comments>http://opusresearch.net/wordpress/2010/06/22/nuance-and-metaswitch-offer-carrier-optimized-voicemail-to-text/#comments</comments>
		<pubDate>Tue, 22 Jun 2010 18:22:03 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Carrier Services]]></category>
		<category><![CDATA[Nuance]]></category>
		<category><![CDATA[speech processing]]></category>
		<category><![CDATA[voicemail-to-text]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=3052</guid>
		<description><![CDATA[Today MetaSwitch and Nuance announced that ivoicemail-to-text software from Nuance is tightly integrated into MetaSwitches flagship CommPortal platform.]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2010/06/MetaswitchNetworks_logo.png"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2010/06/MetaswitchNetworks_logo.png" alt="" title="MetaswitchNetworks_logo" width="144" height="34" class="alignright size-full wp-image-3057" /></a>Today MetaSwitch and Nuance announced that voicemail-to-text software from Nuance is tightly integrated into MetaSwitches flagship CommPortal platform. Thus MetaSwitch continues its transformation from a sleepy softswitch provider with about $140 million in revenues to the &#8220;network application engine that can&#8221;, with ambitious growth targeted through the addition of new partners and capabilities. </p>
<p>While announcements in the domain of network devices often fall into the &#8220;telco plumbing&#8221; category, this one is significant because it signals that transcription and delivery of spoken messages will be more routinely integrated into the product offerings of the &#8220;multi-screen&#8221; (TV, PC and mobile phone) carriers. This <a href="http://www.metaswitch.com/news/metaswitch-enhances-unified-messaging-with-speech-to-text.aspx">press release </a>surrounding the Nuance partnership includes a testimonial from the AVP of Commercial Products at Frontier Communications, welcoming the enhancement to the carrier&#8217;s existing voicemail facilities. Indeed, the major purpose of the partnership is to make it easy for MetaSwitch&#8217;s existing customers to add a popular, revenue producing feature.</p>
<p>MetaSwitch claims about 500 customers in North America alone. The list includes OEMs like Cisco, Alcatel-Lucent, Motorola, but really focuses on all manner of carriers (with publicly announced customers listed <a href="http://www.metaswitch.com/company/carrier-customer-list.aspx">here</a>). </p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/06/22/nuance-and-metaswitch-offer-carrier-optimized-voicemail-to-text/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Nuance Brings Speech to ng Connect</title>
		<link>http://opusresearch.net/wordpress/2010/06/08/nuance-brings-speech-to-ng-connect/</link>
		<comments>http://opusresearch.net/wordpress/2010/06/08/nuance-brings-speech-to-ng-connect/#comments</comments>
		<pubDate>Wed, 09 Jun 2010 00:58:44 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Nuance]]></category>
		<category><![CDATA[Recombinant Communications]]></category>
		<category><![CDATA[Speech recognition]]></category>
		<category><![CDATA[speech-enabled mobile services]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=3000</guid>
		<description><![CDATA[With its focus squarely on the automotive market, Nuance announced that it has joined ng Connect, an intercompany, interdisciplinary wireless development initiative founded in 2009 by Alcatel-Lucent. ]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2010/06/NEW_ngConnect_color1.jpg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2010/06/NEW_ngConnect_color1.jpg" alt="" title="NEW_ngConnect_color" width="150" height="66" class="alignright size-full wp-image-3004" /></a>With its focus squarely on the automotive market, <a href="http://www.nuance.com/news/pressreleases/2010/20100608_globalNG.asp">Nuance announced that it has joined ng Connect</a>, an intercompany, interdisciplinary wireless development initiative founded in 2009 by Alcatel-Lucent. The consortium is a mixed bag of technology providers, that include (in addition to ALU) &#8220;name brand&#8221; IT infrastructure providers like HP, NCR, QNX Software and Gemalto along with mobile device makers like Samsung and Kyocera (which makes its own phones as well as Sanyo), joined by application and content providers like <a href="http://opusresearch.net/wordpress/2009/03/25/keep-an-eye-or-ear-out-for-rebelvox/">RebelVox</a>, Atlantic Recordings, BUZZMedia and about twenty others. </p>
<p>The focus of ng Connect is LTE, referring to the &#8220;Long Term Evolution&#8221; of 3G mobile, a term which, itself is trademarked by the 3GPP (3G Partnership Project). Both the 3GPP and ng Connect seek to develop applications and services that take advantage of the high throughput, low latency and plug-and-play nature of the underlying technologies. It is considered a stepping stone toward the 4G (and the as-yet unspecified 5G) environments which promise higher speeds and larger cell sizes. In joining ng Connect, Nuance is the lone provider of key speech processing technologies (Automated Speech Recognition and Text-To-Speech) as well as the predictive texting applications that can fulfill requirements for the new requirements for command, information input and dialog that users of mobile applications require.</p>
<p>Toyota, though its U.S. Sales Division, is the lone automobile manufacturer participating in ng Connect. Nuance has already had its speech processing technologies <a href="http://opusresearch.net/wordpress/2010/01/07/ford-turns-cars-into-open-platforms-for-recombinant-mobile-speech-apps/">integrated into the Ford SYNC system</a> (along with Microsoft and BSquare). Through its participation in ng Connect, Nuance hopes to benefit from sharing results of shared customer research, technology development and market conditioning and promotion. In the automotive domain, Loquendo is probably Nuance&#8217;s most direct competitor (with well-accepted solutions in &#8220;embedded&#8221; text-to-speech and speech recogntion, well-suited for both in-car systems and mobile phones).  </p>
<p>Yet there is no better place than a moving car to showcase the value of high-speed, wireless links and low-latency access to shared resources built around maps, games, social networks and associated data stores. Accurate speech processing is important, but so is elegant orchestration of real-time transactions, asynchronous and synchronous conversations and a broad variety of interactions. In the car Microsoft, with Bing Search, Maps and active Windows development community of developers are already pitted against Google with its voice search, maps and Android developers. Joining the ng Connect community immediately expands Nuance&#8217;s hooks into a broader community of technology providers that stand ready to cooperate to bring more services to market.</p>
<p>The track record for these types of consortia is spotty. While participants profess to promote an &#8220;open&#8221; or shared approach to solution development, participating companies have shown a pre-disposition to protect their brands and their perceived competitive advantage. Still, we wish them luck.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/06/08/nuance-brings-speech-to-ng-connect/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Nuance/IBM Five-Year Plan: R&amp;D Focused on Understanding</title>
		<link>http://opusresearch.net/wordpress/2010/05/24/the-nuanceibm-five-year-plan-rd-focused-on-understanding/</link>
		<comments>http://opusresearch.net/wordpress/2010/05/24/the-nuanceibm-five-year-plan-rd-focused-on-understanding/#comments</comments>
		<pubDate>Mon, 24 May 2010 18:44:20 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[Advisories]]></category>
		<category><![CDATA[Featured Research]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[Mobile Speech Apps]]></category>
		<category><![CDATA[Nuance]]></category>
		<category><![CDATA[speech processing]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=2917</guid>
		<description><![CDATA[
Featured Research
The R&#038;D relationship between IBM and Nuance has reached its third stage, now that the two companies have entered a five-year joint research initiative. Their collective objective is to get to the next phase in speech processing, where person-to-machine interactions are as natural as person-to-person.
Advisories are available to registered users only. 
For more information [...]]]></description>
			<content:encoded><![CDATA[<p><img src="http://opusresearch.net/wordpress/pdfreports/adv_TwitCC_Apr15.png" align='right' HSPACE=5 vspace=5 border=1/><br />
<em>Featured Research</em><br />
The R&#038;D relationship between IBM and Nuance has reached its third stage, now that the two companies have entered a five-year joint research initiative. Their collective objective is to get to the next phase in speech processing, where person-to-machine interactions are as natural as person-to-person.</p>
<p><em>Advisories are available to registered users only.</em> </p>
<p>For more information on becoming an Opus Research client, please contact Pete Headrick (<a href="mailto:pheadrick@opusresearch.net">pheadrick@opusresearch.net</a>).</p>
<p><!--/hidethis--></p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/05/24/the-nuanceibm-five-year-plan-rd-focused-on-understanding/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Nuance to Power Voice-to-Text Transcription on Cisco&#8217;s Unity Platform</title>
		<link>http://opusresearch.net/wordpress/2010/03/24/nuance-to-power-voice-to-text-transcription-on-ciscos-unity-platform/</link>
		<comments>http://opusresearch.net/wordpress/2010/03/24/nuance-to-power-voice-to-text-transcription-on-ciscos-unity-platform/#comments</comments>
		<pubDate>Wed, 24 Mar 2010 19:55:03 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Cisco]]></category>
		<category><![CDATA[Nuance]]></category>
		<category><![CDATA[Recombinant Communications]]></category>
		<category><![CDATA[voicemail-to-text]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=2601</guid>
		<description><![CDATA[Nuance Communications announced that Cisco is integrating its voice messaging transcription service into Cisco Unity, the IP-telephony giant's unified messaging platform.]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2009/08/NuanceLogo.png"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/08/NuanceLogo-150x107.png" alt="" title="NuanceLogo" width="150" height="107" class="alignright size-thumbnail wp-image-1194" /></a>Nuance Communications announced that Cisco is integrating its voice messaging transcription service into Cisco Unity, the IP-telephony giant&#8217;s unified messaging platform. The deal reflects Nuance&#8217;s stepped up focus on multi-modal messaging in the enterprise. As for Cisco, it will find that rendering voice messages as an email or text message will be an indispensable, and frequently-used component of its mobile enterprise strategy.</p>
<p>Cisco SpeechView with Nuance Voice-to-Text transcription will automatically transcribe all voicemail messages and deliver them as email, with attached audio files for review and the caller ID in the subject line. Cisco will package the service as part of its Unified Communications System 8.0. It will be marketed through authorized Cisco channels in North and South America and Europe SpeechView featureing Naunce Voice-to-Text messaging. Initially the service will support English, French, German, Italian, Brazilian Portuguese, and Spanish. The audio message remains in the user&#8217;s voicemail box and can be retrieved at any time. </p>
<p>The agreement with Cisco validates Nuance&#8217;s decision to focus its development and marketing efforts on enterprise customers and communications service providers. Thanks to the acquisition of Jott and SpinVox, as well as the internal development of its own &#8220;Voicemail-to-Text&#8221; service, the company has already forged relationships with AT&#038;T, Vonage, Bell Canada, Rogers Wireless, Virgin Mobile Canada, Optus, Vodafone, Vodacom, SaskTel, TELUS, Movistar and others. All services leverage the Dragon NaturallySpeaking dictation software, but may employ the human element that live transcribers bring, to provide the most accurate renderings possible. </p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/03/24/nuance-to-power-voice-to-text-transcription-on-ciscos-unity-platform/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Recombinant Communications Brings New Life to Text-to-Speech</title>
		<link>http://opusresearch.net/wordpress/2010/03/02/recombinant-communications-brings-new-life-to-text-to-speech/</link>
		<comments>http://opusresearch.net/wordpress/2010/03/02/recombinant-communications-brings-new-life-to-text-to-speech/#comments</comments>
		<pubDate>Wed, 03 Mar 2010 00:09:22 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[Advisories]]></category>
		<category><![CDATA[Featured Research]]></category>
		<category><![CDATA[Nuance]]></category>
		<category><![CDATA[Recombinant Communications]]></category>
		<category><![CDATA[Text-to-Speech]]></category>
		<category><![CDATA[Twitter]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=2472</guid>
		<description><![CDATA[
Featured Research
The advent of Recombinant Communications has the potential to breathe new life into some well-established voice processing technologies – including text-to-speech (TTS) rendering. New applications “read” Tweets, email and text messages easily. New platforms allow tuning of output to support specific voices or brands.
Advisories are available to registered users only. 
For more information on [...]]]></description>
			<content:encoded><![CDATA[<p><img src="http://opusresearch.net/wordpress/pdfreports/adv_RCandTTS_mar2.png" align='right' HSPACE=5 vspace=5 border=1/><br />
<em>Featured Research</em><br />
The advent of Recombinant Communications has the potential to breathe new life into some well-established voice processing technologies – including text-to-speech (TTS) rendering. New applications “read” Tweets, email and text messages easily. New platforms allow tuning of output to support specific voices or brands.</p>
<p><em>Advisories are available to registered users only.</em> </p>
<p>For more information on becoming an Opus Research client, please contact Pete Headrick (<a href="mailto:pheadrick@opusresearch.net">pheadrick@opusresearch.net</a>).</p>
<p><!--/hidethis--></p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/03/02/recombinant-communications-brings-new-life-to-text-to-speech/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Nuance Buys MacSpeech: Getting more Cozy with Apple and its Users</title>
		<link>http://opusresearch.net/wordpress/2010/02/16/nuance-buys-macspeech-getting-more-cozy-with-apple-and-its-users/</link>
		<comments>http://opusresearch.net/wordpress/2010/02/16/nuance-buys-macspeech-getting-more-cozy-with-apple-and-its-users/#comments</comments>
		<pubDate>Tue, 16 Feb 2010 16:14:03 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Apple]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[Nuance]]></category>
		<category><![CDATA[Speech recognition]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=2376</guid>
		<description><![CDATA[Nuance announced that it is acquiring MacSpeech, a company that has been developing speech recognition resources for Apple's MacIntosh computers since 1996. ]]></description>
			<content:encoded><![CDATA[<p><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/08/NuanceLogo-150x107.png" alt="NuanceLogo" title="NuanceLogo" width="150" height="107" class="alignright size-thumbnail wp-image-1194" />In this <a href="http://www.nuance.com/news/pressreleases/2010/20100216_macspeech.asp">press release</a>, Nuance announced that it is acquiring MacSpeech, a company that has been developing speech recognition resources for Apple&#8217;s MacIntosh computers since 1996. Because MacSpeech started licensing Dragon Dictate in 2008, this acquisition may amount to a mere formality. But Peter Mahoney, Dragon&#8217;s general manager (as well as senior vice president of Nuance) explains that the two companies will work more closely to bring a line of Dragon branded dictation software to market that is &#8220;100 percent Mac.&#8221;</p>
<p>Symbolically, the move signals that Nuance (especially with the Dragon brand) is emerging as the most-favored speech-processing technology across Apple&#8217;s broad spectrum of  computing and communications platforms, including personal computers, notebooks, laptops, iPhones, iPods and (we would supposed) the upcoming iPad. As we&#8217;ve all learned from the &#8220;Google phenomenon&#8221;, the more utterances that a company is able to collect on behalf of a broad customer base, the better the odds of accurate rendering of spoken words. Stated differently, deeper integration into Apple&#8217;s application environment should translate into a better user experience as those users help computers &#8220;learn&#8221; to understand what they are saying.</p>
<p>Google&#8217;s &#8220;cloud-based&#8221; approach to speech recognition makes it &#8220;platform agnostic&#8221;, therefore we will continue to see speech-enabled Google applications on iPhones and other connected platforms. Google will continue to use the voluminous amounts of spoken phrases to expand a portfolio of offerings that already includes search, transcription and translation. Nuance&#8217;s is responding by forging relationships aimed to make its speech-enabled applications work well on Mac&#8217;s and other Apple branded products, while at the same time supporting multimodal communications on a wide variety of mobile handsets. </p>
<p>Now, all eyes (or ears) should be on Microsoft, the only other &#8220;superpower&#8221; with the ability to define and refine speech-enabled user experience across platforms and modalities. With its own speech recognition &#8220;engine&#8221;, a dedicated speech app server farm called Tellme and some excellent &#8220;voice search&#8221; apps operating under the Bing brand, it has the potential to compete. But, as Greg Sterling noted in <a href="http://opusresearch.net/wordpress/2010/02/15/windows-7-phone-where-is-voice/">this post</a> about the introduction of its new mobile operating system, it has definitely bestowed second-class status to its voice enabled services.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/02/16/nuance-buys-macspeech-getting-more-cozy-with-apple-and-its-users/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Siri Debuts on iPhone: Speech-based Virtual Personal Assistant</title>
		<link>http://opusresearch.net/wordpress/2010/02/04/siri-debuts-on-iphone-speech-based-virtual-personal-assistant/</link>
		<comments>http://opusresearch.net/wordpress/2010/02/04/siri-debuts-on-iphone-speech-based-virtual-personal-assistant/#comments</comments>
		<pubDate>Fri, 05 Feb 2010 05:00:32 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[iPhone]]></category>
		<category><![CDATA[local commerce]]></category>
		<category><![CDATA[mobile services]]></category>
		<category><![CDATA[Nuance]]></category>
		<category><![CDATA[Recombinant Communications]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=2310</guid>
		<description><![CDATA[Today the App Store in Apple's iTunes site begins distributing Siri, a new app that transforms the iPhone into a "virtual personal assistant."]]></description>
			<content:encoded><![CDATA[<p>Today the App Store in Apple&#8217;s iTunes site begins distributing Siri, a new app that transforms the iPhone into a &#8220;virtual personal assistant.&#8221; I know, we&#8217;ve heard the term before, describing precursor services like Wildfire, HeyAnita or the product of General Magic. Yet, in all those cases, the principal roles of the Virtual Assistant was to handle scheduling, messaging and simple directory-based activities (call origination, incoming call handling and the like).</p>
<p><img src="http://opusresearch.net/wordpress/wp-content/uploads/2010/02/IMG_0185-150x150.jpg" alt="IMG_0185" title="IMG_0185" width="150" height="150" class="alignright size-thumbnail wp-image-2311" />Siri is set apart because it applies the depth of knowledge its founders and software specialists have built at SRI and elsewhere in creating a &#8220;cognitive assistant that learns and organizes&#8221; (CALO). Siri users benefit from a voluminous amount of pre-preprocessing and organization of information that has been carried out &#8220;in the cloud&#8221; on their behalf. </p>
<p>The image above illustrates Siri&#8217;s landing page. The illustrated topic areas serve as reminders of the sort of often-asked-for information which the service is tuned to handle. It also suggests phrases that users might try to get the information they want. Note that the suggestion below &#8220;Movies&#8221; is &#8220;PG-13 movies this afternoon&#8221;, illustrating that the &#8220;artificial intelligence&#8221; ingrained in the service is quite capable of knowing a movies rating and the meaning of &#8220;this afternoon&#8221; as well as the physical location of the originating user. And, given the precepts of CALO, responses get more accurate and useful as the system acquires more usage history.</p>
<p>I&#8217;ve had the service for a couple of days and here are my initial reactions. My overall experience has been quite positive. The quality of voice recognition (powered by the same &#8220;engine&#8221; that supports Dragon Dictation and Dragon Search on the iPhone) is quite good. It has been accurate both indoors and out. More importantly, the results are illustrated in a large white box for editing before submision. This form of spoken &#8220;utterance triage&#8221; is a must for speech enabled applications and will ultimately give users a chance to correct punctuation and capitalization, in addition to spelling.</p>
<p>Response time could feel a bit draggy (the general public hates latencies); but, on the positive side, the answers were qualitatively different from those of a general search engine (like the voice-activated Google app for the iPhone). Put simply, the service is more &#8220;domain aware.&#8221; It recognizes the differences in intent when a query is about a Taxi versus a movie and responds accordingly. The request for a &#8220;taxi service&#8221; is a great example. Google serves up links to various local taxi cab services in the area, including the phone numbers and a means to get directions.</p>
<p>Siri, by contrast, assumes that you want a taxi immediately and serves up a form, using Taxi Magic (powered by RideCharge), to book a ride, based on your location and a specified time. Before delivering the form, however, Siri serves up a number of comic book like dialog balloons with statements in plain English to tell you how it is processing your request. For example it might say &#8220;I found these taxis within walking distance&#8221; or suggest another way to interpret your utterance: &#8220;Get me a cab&#8221;, for instance. </p>
<p>In each of the chosen categories, the search engine is designed to accelerate the process of search and decisionmaking that culminates in a purchas or transaction. The company&#8217;s financial success will be predicated on supporting multiple transactions and taking a percentage fof the revenue generated. That&#8217;s another big difference between Google Voice Search and Siri. </p>
<p>Based on my experience, I encourage people to download and gain experience with Siri, just as it gains experience with you.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/02/04/siri-debuts-on-iphone-speech-based-virtual-personal-assistant/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>It&#8217;s Official: Nuance is Buying SpinVox for $66 Million in Cash, Plus Stock</title>
		<link>http://opusresearch.net/wordpress/2009/12/30/its-official-nuance-is-buying-spinvox-for-66-million-in-cash-plus-stock/</link>
		<comments>http://opusresearch.net/wordpress/2009/12/30/its-official-nuance-is-buying-spinvox-for-66-million-in-cash-plus-stock/#comments</comments>
		<pubDate>Thu, 31 Dec 2009 00:10:32 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Mobile Speech Apps]]></category>
		<category><![CDATA[Nuance]]></category>
		<category><![CDATA[Recombinant Communications]]></category>
		<category><![CDATA[SpinVox]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=2147</guid>
		<description><![CDATA[Nuance has acquired SpinVox to "accelarate its voice to text business." According to the release the transaction was worth $102 million with a third of that coming in Nuance common stock.]]></description>
			<content:encoded><![CDATA[<p><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/04/nuance_logo.jpg" alt="nuance_logo" title="nuance_logo" width="117" height="75" class="alignright size-full wp-image-356" />Greg Sterling provided the following commentary on Nuance&#8217;s purchase of SpinVox on the Internet2Go site:</p>
<p><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/12/spinvox_logo-150x45.png" alt="spinvox_logo" title="spinvox_logo" width="150" height="45" class="alignleft size-thumbnail wp-image-2154" />
<p>Nuance has acquired SpinVox to &quot;accelarate its voice to text business.&quot; According to the release the transaction was worth $102 million with a third of that coming in Nuance common stock. From the <a href="http://www.nuance.com/news/pressreleases/2009/20091230_acquireSpinVox.asp">release</a>:</p>
<blockquote><p>As the estimated number of operational voicemail boxes in the world has passed one billion, and consumer and corporate activity now generate over 150 billion voicemails a year, Nuance and SpinVox have experienced strong interest in voice-to-text automation. The two companies helped pioneer solutions that utilize speech recognition and transcription workflow solutions to convert voicemails into text that can be sent to users as SMS or email messages. This transaction marries innovative speech solutions and robust carrier-grade infrastructure to accelerate innovation, and deliver these voice-to-text services to global subscribers.</p>
</blockquote>
<p>And from the <a href="http://blog.spinvox.com/2009/12/30/speech-pioneers-nuance-and-spinvox-join-forces-to-advance-global-speech-technology-market/">SpinVox blog</a>:</p>
<blockquote><p>Okay, so what does this mean? Without question there is an accelerating demand from carriers, consumers and enterprises for robust speech-enabled services and automated voice-to-text platforms – in fact, SpinVox already services nearly 100 million users worldwide. With that in mind, Nuance will leverage SpinVox’s carrier-grade voice-to-text infrastructure, network product portfolio, multi-language support and experienced UK-based development teams to further drive and accelerate adoption of voice-to-text around the world.</p>
<p>This is great news for customers who will benefit from both the technology strength and superior product and services delivery. There will be more services, more applications, highly accurate voice-to-text transcription and the best delivery platform available – no matter where you are in the world! </p>
</blockquote>
<p>SpinVox a year ago in March <a href="http://www.spinvox.com/spinvox-secures-over-100-million-in-new-funding-round..html">obtained $100 million</a> in a massive funding round and more than $200 million in total. The company was <a href="http://www.theregister.co.uk/2009/07/29/spinvox_mechanical_turk/">rocked by scandal</a> when it appeared that most of the speech-to-text transcription was done by humans and not by machine, as had been claimed. </p>
<p>The company had an outstanding £30m ($48.2 million) loan that it was having difficulty repaying.  </p>
<p>While this is obviously not the outcome SpinVox envisioned a couple of years ago, Nuance picks up a valuable addition to its suite of enterprise and consumer voice applications, a range of existing clients and a large installed base of users.  </p>
<p>I had povided my thoughts on the SpinVox acquisition last week. I called it a coup for Nuance, but I also see it as an important development for incumbent telcos. Transcription of voice messages extends the life of existing voicemail platforms and is the missing link in the evolution messaging services that prove their value to end-users by going well-beyond simple &#8220;message waiting&#8221; notifications to the delivery of complete messages as email or SMS texts. This breaks down long-standing boundaries between voicemail and email, and transforms the telephones role in message origination.</p>
<p>SpinVox had taken considerable heat for its decision to involve live agents in the &#8220;disambiguation process&#8221; required for accurate rendering of spoken words. Yet, to me, this is yet another flavor of the high-tech-plus-high-touch combination that make real-time services truly useful. Both Nuance and SpinVox have placed a premium on accurately rendering voicemail messages. It was a tactical choice and a differentiator, especially against Google Voice (which is thought to be 100% automated). </p>
<p>As I say repeatedly (often with accurate rendering) 100% accuracy in transcribing voicemail is a pipe dream. Both human-aided and totally automated systems are notorious for their failures to recognize such things as street names and other proper nouns, and that situation is unlikely to improve. But this failure, in and of itself, creates the seeds for stronger bonds between people who send or receive messages from one another.</p>
<p>Admittedly, this is not like cracking &#8220;The Da Vinci Code&#8221;, but there are game like qualities to figuring out some of the messages that are received as spoken words and rendered as text messages. In most cases, the meaning comes across loud and clear. Besides, as is true with applications from Nuance (Voicemail2Text) and SpinVox (in many flavors) recipients can listen to MP3 files of the messages as attachments to the email or links to the SMS text. The value of text-based delivery is undeniable, as is the high probability that at least one of the words or phrases will be inaccurate [I'm going to address this phenomenon in a CATScan called "The end of 5 Nines... Hallalujah!"]. </p>
<p>Combine the factors mentioned above and you&#8217;ll understand why it is more important than ever for Nuance to expand the potential user base for its voicemail-to-text services. Google has gone full-speed ahead with notoriously inaccurate voicemail-to-text rendering deeply embedded in its Google Voice services. At this point, accuracy is not the issue (though it is important to be as accurate as possible); global reach is the objective. SpinVox&#8217;s contracts with global carriers is very important to both Nuance and the carriers, themselves, as they prepare to compete with Ma Google (the Search Giant as Telco).</p>
<p>It&#8217;s also a potential win for end-users. As Google has so often proven, accurate treatment (of search queries or spoken utterances) improves with volume. The combination of Nuance and SpinVox can create the critical mass of users required to result in a highly-accurate service while, at the same time, posing formidable competition to Google Voice.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2009/12/30/its-official-nuance-is-buying-spinvox-for-66-million-in-cash-plus-stock/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Update: Proposed &#8220;Dot Rev&#8221; of Dragon Dictation on the iPhone Will Address Privacy Concerns</title>
		<link>http://opusresearch.net/wordpress/2009/12/10/update-proposed-dot-rev-of-dragon-dictation-on-the-iphone-will-address-privacy-concerns/</link>
		<comments>http://opusresearch.net/wordpress/2009/12/10/update-proposed-dot-rev-of-dragon-dictation-on-the-iphone-will-address-privacy-concerns/#comments</comments>
		<pubDate>Thu, 10 Dec 2009 14:01:45 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Dragon]]></category>
		<category><![CDATA[Mobile Speech Apps]]></category>
		<category><![CDATA[Nuance]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=2020</guid>
		<description><![CDATA[One particular aspect of Nuance Communications Dragon Dictation for the iPhone has captured the imagination of the connected public, and not necessarily in a good way. In this blog post, Nuance&#8217;s Michael Thompson addresses the concerns of a group of people who question why, during installation, the new application copies and uploads all the names [...]]]></description>
			<content:encoded><![CDATA[<p><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/12/dragon_mobile_logo-150x38.png" alt="dragon_mobile_logo" title="dragon_mobile_logo" width="150" height="38" class="alignright size-thumbnail wp-image-2024" />One particular aspect of Nuance Communications Dragon Dictation for the iPhone has captured the imagination of the connected public, and not necessarily in a good way. In <a href="http://blog.dragonmobileapps.com/2009/12/what-dragon-dictation-for-iphone-does.html">this blog post</a>, Nuance&#8217;s Michael Thompson addresses the concerns of a group of people who question why, during installation, the new application copies and uploads all the names in an iPhone&#8217;s contact list. The thread of comments to the post start with concern over what Nuance intends to do with the names, but quickly branch out into a quite thorough (perhaps too thorough) critique of storage and protection of so-called &#8220;speech data.&#8221;</p>
<p>To be clear, Thompson assures the public that Nuance uploads the names for a single purpose: to improve the application&#8217;s ability to render the names inside a dictated message. From experience Nuance and its cohort of speech-to-text service providers are well aware of the difficulty of recognizing proper nouns. Therefore, the firm has opted to upload names only. They are not associated with other contact information or with the identity of the device and its owner.</p>
<p>Still, &#8220;privacy&#8221;, broadly defined remains a very sensitive matter, and a hot-button issue. Some of the specific concerns (such as the one from a &#8220;defense contractor&#8221; who needs to certify that his list of contracts is under his control or in a secure server) militate toward Nuance offering a simple &#8220;opt-out&#8221; option upon initiation of the app. We&#8217;ve learned that Nuance has already added that option for &#8220;version 1.1&#8243; which has already been proposed for expedited treatment by the AppStore gatekeepers. This &#8220;opt-out&#8221; out strategy is considered a short-term fix by Nuance. The size and scope of responses (albeit many are anonymous) is leading the company to &#8220;explore options that give users more control over what gets uploaded.&#8221; (quoting a post from Nuance Sr. Manager Nirmalya De).</p>
<p>I, personally, don&#8217;t believe that uploading contact names to improve recognition amounts to a &#8220;illegal disclosure&#8221;. At the same time, I applaud efforts to make mobile subscribers (and Web users in general) aware of the meta-data of their own creation that can be used to refine and improve services. Nuance has learned an important lesson: that wireless subscribers should control the information that they store on their mobile devices. But, in the mean time, the wireless public has made its general preference known: According to today&#8217;s <a href="http://blog.dragonmobileapps.com/2009/12/you-have-made-dragon-top-3-free-app.html">post on the Dragon Dictate Blog</a>, the app jetted to the upper echelons of the iTunes App Store&#8217;s list of downloads, achieving #3 among all free apps and #1 in the &#8220;business category.&#8221; </p>
<p>It is early days for adoption of speech enabled mobile serivces, but the public is clearly willing to test-drive (I should say &#8220;text-drive&#8221;) the latest application.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2009/12/10/update-proposed-dot-rev-of-dragon-dictation-on-the-iphone-will-address-privacy-concerns/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
