<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Opus Research &#187; user experience</title>
	<atom:link href="http://opusresearch.net/wordpress/tag/user-experience/feed/" rel="self" type="application/rss+xml" />
	<link>http://opusresearch.net/wordpress</link>
	<description>Analysis and Expertise on Voice Services and Conversational Commerce</description>
	<lastBuildDate>Wed, 23 May 2012 17:29:43 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>The Merger is the Message: What Acquisitions like Swype and GroupMe Really Mean</title>
		<link>http://opusresearch.net/wordpress/2011/10/07/the-merger-is-the-message-what-acquisitions-like-swype-and-groupme-really-mean/</link>
		<comments>http://opusresearch.net/wordpress/2011/10/07/the-merger-is-the-message-what-acquisitions-like-swype-and-groupme-really-mean/#comments</comments>
		<pubDate>Fri, 07 Oct 2011 19:13:14 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Mergers and Acquisitions]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[mobile services]]></category>
		<category><![CDATA[Nuance]]></category>
		<category><![CDATA[Skype]]></category>
		<category><![CDATA[user experience]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=4828</guid>
		<description><![CDATA[There's a new calculus afoot among "agile" application and software developers, especially those addressing the marketplace created by new mobile devices. It pays to "sell solutions," especially when your solution is for a very specific, well-recognized problem. 
]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2011/10/Unknown-1.jpeg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2011/10/Unknown-1.jpeg" alt="" title="Swypelogo" width="151" height="85" class="alignright size-full wp-image-4836" /></a>There&#8217;s a new calculus afoot among &#8220;agile&#8221; application and software developers, especially those addressing the marketplace created by new mobile devices. It pays to &#8220;sell solutions,&#8221; especially when your solution is for a very specific, well-recognized problem. </p>
<p>This lesson was driven home most recently when Nuance Communications acquired Swype, Inc., a small, company whose single product is software makes it simpler and faster to use your finger (or a stylus?) to enter text on a smartphone. According to this<a href="http://phx.corporate-ir.net/preview/phoenix.zhtml?c=110330&#038;p=irol-SECText&#038;TEXT=aHR0cDovL2lyLmludC53ZXN0bGF3YnVzaW5lc3MuY29tL2RvY3VtZW50L3YxLzAwMDA5NTAxMjMtMTEtMDg5MDE0L3htbA%3d%3d"> SEC filing</a> and <a href="http://internet2go.net/news/carriers/nuance-pays-100m-swype-why-exactly">this comment</a> by my associate Greg Sterling on Internet2Go, Nuance is committing something on the order of $100 million to merge Swype into its operations, presumably to add Swype capabilities to its Flex T9 interface.</p>
<p>In his post, Greg questions why Nuance needs to buy Swype, even as Flex T9 already outperforms Swype in some respects. He presents a list of possible answers to his question, including what I would call &#8220;the three P&#8217;s,&#8221; patents, positioning and personnel. But I would add a fourth: &#8220;Packaging.&#8221; Swype&#8217;s market proposition was simplicity itself: &#8220;Swype is a faster and easier way to input text on any screen.&#8221; That&#8217;s it. And it won over a following. </p>
<p>As for Nuance, it is following its time-tested modus operandi for constant improvement and refinement through acquisition. It&#8217;s been doing to bring specific refinements to its product portfolio automated speech processing &#8211; including voice biometrics, directory assistance/auto-attendant, text-to-speech rendering, dictation, medical transcription and other domains that benefit from solving specific problems. The core T9 technology came with the acquisition of Tegic from AOL-Time Warner in 2007 for $247 million. At the time I called it part of Nuance&#8217;s &#8220;Pay It Forward&#8221; strategy of constantly improving a mobile subscriber&#8217;s ability to take control of his or her device and the services provided through it.</p>
<p>For Nuance, it adds to its portfolio of technologies that support highly personalized mobile services. For Swype, it provides a windfall, liquidity event that satisfies its founders and investors. For those of us into pattern recognition, this acquisition is similar to the one that brought GroupMe &#8211; a year-and-a-half old, single-product company &#8211; into Microsoft/Skype&#8217;s pantheon of products and services. A journalist at the International Business Times-Australia accurately referred to GroupMe as &#8220;a super-small, zero-revenue tech company founded only last year at the Techcrunch Disrupt Hackathon.&#8221; </p>
<p>So why did GroupMe fetch a multi-million price (rumored to be between $65 and $78 million) from Microsoft? The resemblance to the Nuance/Swype deal is striking. First, GroupMe solves a known problem confronted by millions of mobile users as they try to form small groups on the fly. Or as GroupMe puts it on its Home Page: &#8220;Group Messaging from Any Phone.&#8221; Second, it&#8217;s not an &#8220;app,&#8221; it&#8217;s part of the user interface. It can be generalized across multiple modes and media. As we&#8217;ve learned with Google+ and, with some difficulty on Facebook or LinkedIn, we humans benefit from group-forming utilities. GroupMe&#8217;s core product may be the result of a mere eight hours at a TechCrunch Hackathon (using Twilio&#8217;s tools on Amazon Web Services EC2, incidentally) but it fulfilled a recognized Gap in Skype&#8217;s mobile user interface.</p>
<p>The acquisition enhances the Microsoft/Skype e-services genome. It&#8217;s a grand slam home run for GroupMe. From the mobile subscriber&#8217;s point of view, it improves his or her ability to take control of their device and the services it offers. See the pattern?</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2011/10/07/the-merger-is-the-message-what-acquisitions-like-swype-and-groupme-really-mean/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>VC Investment in Interactions Acknowledges that Investors Recognize a Problem</title>
		<link>http://opusresearch.net/wordpress/2011/07/28/vc-investment-in-interactions-acknowledges-that-investors-recognize-a-problem/</link>
		<comments>http://opusresearch.net/wordpress/2011/07/28/vc-investment-in-interactions-acknowledges-that-investors-recognize-a-problem/#comments</comments>
		<pubDate>Thu, 28 Jul 2011 19:13:57 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[customer care]]></category>
		<category><![CDATA[investments]]></category>
		<category><![CDATA[IVR]]></category>
		<category><![CDATA[user experience]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=4706</guid>
		<description><![CDATA[Interactions Corporation completed a financing round totaling $12 million, which is earmarked to support a "growing customer base, expand its marketing presence, and invest in new technology."]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2011/07/Interactionslogo.png"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2011/07/Interactionslogo.png" alt="" title="Interactionslogo" width="151" height="31" class="alignright size-full wp-image-4708" /></a>Interactions Corporation completed a financing round totaling $12 million, which is earmarked to support a &#8220;growing customer base, expand its marketing presence, and invest in new technology.&#8221; Leading the group of investors is Sigma Partners (with offices both in Boston and Menlo Park). Sigma has an interesting portfolio of high-tech investments that include voice analytics specialist CallMiner, as well Vlingo and Voxify. </p>
<p>Readers who are into pattern detection should see that these companies comprise small group of companies taking leadership in defining a better user experience that regards better speech recognition and language understanding as key to satisfactory fulfillment of each individual&#8217;s expectations.<br />
Other participants in Interactions funding are  North Hill Ventures, Cross Atlantic Capital Partners, and Updata Partners. </p>
<p>Interactions achieved notoriety of sorts (at least for the speech-enabled customer service crowd) by <a href="http://www.interactions.net/cms/content/consumer-study-finds-overwhelming-dissatisfaction-ivr">publishing survey results depicting &#8220;overwhelming dissatisfaction with IVR.&#8221; </a>The study, conducted by Liel Leibovitz, New York University Assistant Professor of Communications, and commissioned by Interactions, contained no real surprises. Respondents indicated that they found IVRs hard to use and that two-thirds prefer assistance by live agents. Interestingly 15% listed IVRs as their preferred channel for communications with their selected vendors.</p>
<p>The study validated the shortcomings of today&#8217;s IVRs, which Interactions and its investors has interpreted as a &#8220;experience gap&#8221; between live agents and IVRs. The products and services offered by Interactions seek to close the gap by providing what Opus Research has been calling &#8220;assisted self-service&#8221; &#8211; meaning that they introduce enough human input into the IVR experience to enable the system to recognize the intent of a call more quickly and respond most appropriately. This should lead to a higher success rate and correspondingly lower rates of dissatisfaction with the voice channel. </p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2011/07/28/vc-investment-in-interactions-acknowledges-that-investors-recognize-a-problem/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Nuiti Taking on the Challenges of a &#8220;Natural User Interface&#8221;</title>
		<link>http://opusresearch.net/wordpress/2011/04/26/nuiti-taking-on-the-challenges-of-a-natural-user-interface/</link>
		<comments>http://opusresearch.net/wordpress/2011/04/26/nuiti-taking-on-the-challenges-of-a-natural-user-interface/#comments</comments>
		<pubDate>Tue, 26 Apr 2011 22:59:50 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[hands-free services]]></category>
		<category><![CDATA[speech processing]]></category>
		<category><![CDATA[user experience]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=4353</guid>
		<description><![CDATA[An Israeli firm called Nuiti now offers a "platform" for Web site operators and Web developers to offer hands-free, "natural user interface" for their resources.]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2011/04/nuitilogo.png"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2011/04/nuitilogo.png" alt="" title="nuitilogo" width="151" height="45" class="alignright size-full wp-image-4354" /></a>An Israeli firm called Nuiti now offers a &#8220;platform&#8221; for Web site operators and Web developers to offer hands-free, &#8220;natural user interface&#8221; for their resources. As an example, Nuiti offers <a href="http://www.youtube.com/watch?v=8ON25G2wvYU">this video</a> depicting how e-cooking specialist, KitchenPC.com employs Nuiti&#8217;s FireSay InPage to provide truly hands-free access to its site. Users need only utter &#8220;Start Cooking,&#8221; to get the computers attention and begin text-to-speech-based read-outs of recipes and cooking instructions.</p>
<p>The company&#8217;s stock-in-trade is to apply its research and development energy toward &#8220;filling gaps&#8221; in the current speech processing technologies. CEO Ehud Halberstam had worked with the team of engineers at Microsoft who launched the Windows Phone 7 operating system and attendant services (including its voice user interface). Nir Segal, VP of Marketing and Business Development, managed the user experience development team for MediaMind Technologies, a global company whose core product is a platform for multichannel, rich digital advertising campaign management. </p>
<p>In a wide-ranging conversation this morning, we discussed how building a &#8220;delightful hands-free experience&#8221; should not necessarily depend on such &#8220;trigger words.&#8221; Indeed, as the Kinect system from Halberstam&#8217;s old employer Microsoft demonstrates, the addition gesture and video serves to improve the system&#8217;s ability to understand a user&#8217;s intent without resorting to the use of a keyboard or physical mouse.</p>
<p>Today, Nuiti employas core speech processing technology from Microsoft and have put together two families of solutions. FireSay InPage supports hands-free command and control of Web sites (a la KitchenPC.com). FireSay VoiceAds provides a voice based mechanism for delivery of advertising and promotional messages across multimodal channels.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2011/04/26/nuiti-taking-on-the-challenges-of-a-natural-user-interface/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>2011 Will See Stepped up Investment in &#8220;Speech And&#8230;&#8221;</title>
		<link>http://opusresearch.net/wordpress/2010/12/21/2011-will-see-stepped-up-investment-in-speech-and/</link>
		<comments>http://opusresearch.net/wordpress/2010/12/21/2011-will-see-stepped-up-investment-in-speech-and/#comments</comments>
		<pubDate>Tue, 21 Dec 2010 18:15:22 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Apple]]></category>
		<category><![CDATA[Mobile Speech Apps]]></category>
		<category><![CDATA[Speech recognition]]></category>
		<category><![CDATA[user experience]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=3889</guid>
		<description><![CDATA[Today Apple posted a job listing for an "iOS Speech SW Application Engineer." The job involves working with the iOS Applications Framework Team in "a fast paced environment with rapidly changing priorities." ]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2009/08/Apple_logo.png"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/08/Apple_logo.png" alt="" title="Apple_logo" width="121" height="137" class="alignright size-full wp-image-1314" /></a>Today Apple posted <a href="http://jobs.apple.com/index.ajs?BID=1&#038;method=mExternal.showJob&#038;RID=68870&#038;CurrentPage=1">a job listing</a> for an &#8220;iOS Speech SW Application Engineer.&#8221; The job involves working with the iOS Applications Framework Team in &#8220;a fast paced environment with rapidly changing priorities.&#8221; </p>
<p>Earlier this year I coined the term &#8220;No rest for the RESTful&#8221; to dramatize how new tools and API&#8217;s to support agile programming would accelerate application development and service delivery. As we enter 2011, the mantra appears to be &#8220;Speech is Sexy,&#8221; as evidenced not just by Apple&#8217;s &#8220;Help Wanted&#8221; posting, but by <a href="http://opusresearch.net/wordpress/2010/12/03/googles-latest-acquisition-brings-text-to-speech-luminaries-into-its-fold/">Google&#8217;s recent acquisition of Phonetic Arts</a> and rapid-fire refinement of mobile user interfaces from Nuance, Vlingo (powered by AT&#038;T&#8217;s Watson Engine), Google and Microsoft/Tellme.</p>
<p>The big difference in 2011 is that speech is getting more pervasive while, at the same time, it is being subsumed into multimodal user interfaces. Microsoft, for instance, continues to call speech recognition &#8220;foundational&#8221; to its user interface but, with the introduction of Kinect, already puts much more emphasis on accurate recognition of gestures. Last year Google&#8217;s Mike Cohen explained his objective of making speech as an alternative &#8220;every time&#8221; a keypad or keyboard is used on a mobile device.</p>
<p>2011 will be a year for smoothing out some of the rough spots in speech enabling the user experience. Candidates include better (more accurate) recognition, noise cancellation, more &#8220;human sounding&#8221; text-to-speech rendering, speech-to-speech translation, low-latency interaction with dynamic data &#8220;in the cloud,&#8221; and (to keep things safe, secure and personalized) voice biometrics-based authentication or ID proofing.</p>
<p>But there&#8217;s been a fundamental change. I used to write about the &#8220;Voice User Interface.&#8221; In the coming year attention will be on the &#8220;Mobile User Interface&#8221; that includes voice. That&#8217;s why it should not be a surprise to see that the the speech software engineer at Apple should be prepared to work on a &#8220;team&#8221; to accommodate the &#8220;rapidly changing priorities.&#8221; </p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/12/21/2011-will-see-stepped-up-investment-in-speech-and/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>New Version of Vlingo Adds &#8220;Action Bar&#8221; and Deep Links to OpenTable, Fandango and Kayak</title>
		<link>http://opusresearch.net/wordpress/2010/12/16/new-version-of-vlingo-adds-action-bar-and-deep-links-to-opentable-fandango-and-kayak/</link>
		<comments>http://opusresearch.net/wordpress/2010/12/16/new-version-of-vlingo-adds-action-bar-and-deep-links-to-opentable-fandango-and-kayak/#comments</comments>
		<pubDate>Fri, 17 Dec 2010 00:51:48 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[mobile search]]></category>
		<category><![CDATA[Opus Research]]></category>
		<category><![CDATA[Recombinant Communications]]></category>
		<category><![CDATA[speech-enabled mobile services]]></category>
		<category><![CDATA[user experience]]></category>
		<category><![CDATA[Vlingo]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=3884</guid>
		<description><![CDATA[Vlingo is starting to take on a serious resemblance to Siri with its latest release on Android-based phones. ]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2009/09/vlingo_logo.png"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/09/vlingo_logo.png" alt="" title="vlingo_logo" width="140" height="52" class="alignright size-full wp-image-1371" /></a>Vlingo is starting to take on a serious resemblance to Siri with its latest release on Android-based phones. As with its iPhone-based offering, there is voice-activated &#8220;deep linking&#8221; into the apps that support frequent transactions, like restaurant, theater and travel reservations. But the latest version for Android is much more multimodal because it encourages users to enter commands in text form through a newly designed Action Bar.</p>
<p>There&#8217;s a video demonstration embedded in <a href="http://blog.vlingo.com/vlingo-actionbar/">this blog post</a> on Vlingo&#8217;s site. As its author explains, the Action Bar uses the same &#8220;predictive intelligence&#8221; for both voice recognition and typed commands. This allows users &#8220;to initiate actions when it’s not as convenient speak to their phones – like on a noisy bus or quiet conference room.&#8221; </p>
<p>Well said, and much needed. As voice-enabled applications mature, developers have come to recognize that all mobile commerce apps must support multi-modal input. This positioning is especially important on Android-based devices where, <a href="http://opusresearch.net/wordpress/2010/08/16/voice-actions-for-android-speechable-moments-from-google-spell-new-market-dynamics/">as we noted back in August</a>, Google has made significant investment in rapid, seamless access to speech recognition for command and control of a multiplicity of apps, features and functions on the Android, as well as dictation for populating search boxes or messaging utilities.</p>
<p>Mobile subscribers are the beneficiaries of the efforts by Google, Vlingo, Apple (which owns Siri), Nuance and a handful of other firms who are goading each other on to improve the mobile user interface. Improving recognition accuracy, as well as predictivity (which I know is not a word) is the task at hand and these technology companies are continually raising the bar in what is much more than &#8220;hands-free to hands-free&#8221; combat.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/12/16/new-version-of-vlingo-adds-action-bar-and-deep-links-to-opentable-fandango-and-kayak/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google Taking Voice Search To the Next Level of Personalization</title>
		<link>http://opusresearch.net/wordpress/2010/12/14/google-taking-voice-search-to-the-next-level-of-personalization/</link>
		<comments>http://opusresearch.net/wordpress/2010/12/14/google-taking-voice-search-to-the-next-level-of-personalization/#comments</comments>
		<pubDate>Tue, 14 Dec 2010 18:51:48 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[user experience]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=3873</guid>
		<description><![CDATA[Google is starting to put some distance between the Voice Search app for Android-based smartphones and the same application as it is offered on alternatives.]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2009/08/Google_logo.png"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/08/Google_logo.png" alt="" title="Google_logo" width="160" height="74" class="alignright size-full wp-image-1265" /></a>Google is starting to put some distance between the Voice Search app for Android-based smartphones and the same application as it is offered on alternatives. The difference will be the ability of phones running Android version 2.2 to support a new way of building the models for recognizing their owners&#8217; utterances. Rather than matching what they say to the huge database of spoken words from other Google Voice Search users, Google will begin collecting utterances in a new way, so they can be associated with a specific user. This will promote greater accuracy, which translates (so to speak) into a better ability to recognize proper names more quickly.</p>
<p>The service is described in greater detail in <a href="http://googlemobile.blogspot.com/2010/12/voice-search-gets-personal.html">this blog post by Amir Mane the product manager along with Glen Shires from the Google Voice technical staff</a>. Google has made personalized search into an &#8220;opt in&#8221; feature of the new Google Voice Search app. Recognizing that there is a fine line between personalization and invasion of privacy, it has taken a further step of providing a mechanism for the personalized voice profiles to be &#8220;disassociated&#8221; from information in your Google account (which, for many people could span Gmail, shared documents, contact lists and other sensitive information).</p>
<p>The app is available from the Android Market, making personalized voice search a few clicks away. The allusion to &#8220;improved name recognition and speed&#8221; is almost nostalgic. For many years, Amir Mane&#8217;s name was synonymous with automated Directory Assistance, an enclave of the speech processing and information processing world that long-ago began to tackle the challenges of rapid recognition and response to the most challenging utterances &#8211; names of cities, towns, streets and people. </p>
<p>Google is about to find out how many people expect to see sufficient benefits from Personalized Voice Search&#8221; to justify their decision to &#8220;opt-in.&#8221; My suspicion is that the numbers will be fairly small at first because people generally have to be given incentive (preferably financial, but often merely gratifying) to &#8220;opt-in&#8221; to just about anything. The promise of better speech rec may not fill the bill. Regardless of the percentage, Google has a large enough sample of subscribers to learn who takes the step into personalized Voice Search.</p>
<p>After overcoming the opt-in hurdle, Google will learn which of its users are sophisticated enough to go deep into the administrative layers of Google Voice to &#8220;disassociate&#8221; their voice profiles from the rest of their Google account. My suspicion is that the number will be pretty low. After all, people are sharing their location, their check-ins and other activity streams routinely. The idea that some &#8220;bad actor&#8221; might benefit from associating audio files (or the metafiles derived from them) with other publicly available information seems pretty remote. But I&#8217;m always surprised at what self-described privacy advocates decide to address as communications, search and transaction processing technologies move forward.</p>
<p>My suggestion: If you have an Android phone running version Froyo or above, it will be worthwhile to upgrade to Personalized Voice Search. As Amir&#8217;s post notes, the improvements will be subtle at first, but they will be beneficial. Thanks to advancements in microphone technologies (like putting multiple microphones on devices to identify and cancel out background noise), as well as acoustic modeling and filtering mobile devices are getting much better at supporting person-to-machine conversations. Personalized Search moves along a different vector to try to provide a more accurate way to recognize spoken commands or search terms consistently.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/12/14/google-taking-voice-search-to-the-next-level-of-personalization/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Speech Enabled Mobile Services Start Removing Obstacles to Adoption</title>
		<link>http://opusresearch.net/wordpress/2010/10/18/speech-enabled-mobile-services-start-overcoming-obstacles-to-adoption/</link>
		<comments>http://opusresearch.net/wordpress/2010/10/18/speech-enabled-mobile-services-start-overcoming-obstacles-to-adoption/#comments</comments>
		<pubDate>Mon, 18 Oct 2010 22:16:46 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[Plantronics]]></category>
		<category><![CDATA[user experience]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=3570</guid>
		<description><![CDATA[Over on Internet2go.com, Greg Sterling posted some thoughts about Google Voice Search. ]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2010/10/voicesearchicon.jpeg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2010/10/voicesearchicon.jpeg" alt="" title="voicesearchicon" width="100" height="100" class="alignright size-full wp-image-3571" /></a>Over on Internet2go.com, <a href="http://www.internet2go.net/news/data-and-forecasts/google-voice-search-7-iphone-25-android">Greg Sterling posted some thoughts about Google Voice Search</a>. He observes that iPhone users are less than one-third as likely to use spoken words to initiate Google searches. Greg&#8217;s speculation is that &#8220;Google has more deeply integrated voice into the Android platform and may have trained people to use it more frequently accordingly.&#8221; </p>
<p>My take is a little different. Last month I wrote <a href="http://opusresearch.net/wordpress/2010/09/10/google-has-home-field-advantage-on-the-android-home-page/">this post</a> about Google&#8217;s &#8220;Home Field Advantage&#8221; as it provided a number of widgets to invoke its apps with the touch of a single button on the &#8220;home screen&#8221;. This is not about how &#8220;deeply integrated&#8221; voice is with the core platform or mobile operating system. It is, instead, a matter of limiting the number of clicks it takes to invoke speech enabled applications (or &#8220;Speechable Moments&#8221;, as I like to call them).</p>
<p>I&#8217;m not sure what the attrition rate phone users as they click through an icon on the some screen to invoke an application or feature. I remember that Web-based e-commerce experts used to say that you lose one-half of your customers each time they have to click on a link to get to a check-out. That was Amazon.com&#8217;s key motivation to launch &#8220;one-click check-out&#8221;. </p>
<p>The 3:1 ratio of speech enabled search on Droids compared with iPhones conforms to that rule. Users can&#8217;t invoke Voice Search unless and until they launch the Google App and then tilt the phone to its proper position. For Bing, voice search begins after opening an app and pressing a button. They don&#8217;t take a lot of effort, but they pose sufficient barriers to have a measurable effect. A number of speech enabled services (Greg mentions a few) are deeply integrated with the call-flows and API&#8217;s to support the a range of services, including search, text entry, Twitter and more. They would definitely get more use if they were easily or automatically invoked from the home screen or idle screen. </p>
<p>Speech apps work best when they appear to be the product of serendipity. I was recently given a new Plantronics M1100 &#8220;Savor&#8221; bluetooth headset to try. I was pleasantly surprised upon reception of the first incoming call. A synthesized voice (or stored recording) announced an &#8220;incoming call&#8221; and provided a prompt to the effect that I could say &#8220;answer&#8221; or &#8220;ignore&#8221; the event. I didn&#8217;t have to set up the feature or &#8220;train&#8221; the unit to recognize my instructions. Upon pressing the command button on the earpiece, the unit also calls &#8220;Bing 411&#8243; (Tellme&#8217;s Directory Assistance) in response to the command &#8220;Call information.&#8221;</p>
<p>By contrast, I&#8217;ve been told that the unit also supports Plantronics&#8217; a number of other speech-enabled services, with the help of its proprietary, hosted service called &#8220;Vocalyst&#8221;. Registered users can configure the system to support a reminder service (based on Evernote); e-mail review, reply and origination (supporting Gmail, Yahoo! and AOL); Twitter; and review and origination of text messages (on Blackberry&#8217;s and Android-based phones). One of these days I will configure Vocalyst but, in the mean time, I&#8217;m very happy with the information I can get from Bing 411 and in the normal course of using my wireless phone.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/10/18/speech-enabled-mobile-services-start-overcoming-obstacles-to-adoption/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Parakeet: Personal Triage for Mobile Speech</title>
		<link>http://opusresearch.net/wordpress/2010/09/03/parakeet-personal-triage-for-mobile-speech/</link>
		<comments>http://opusresearch.net/wordpress/2010/09/03/parakeet-personal-triage-for-mobile-speech/#comments</comments>
		<pubDate>Fri, 03 Sep 2010 21:40:12 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Mobile Speech Apps]]></category>
		<category><![CDATA[user experience]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=3398</guid>
		<description><![CDATA[The “Science and Technology” section of this week’s issue of The Economist a feature with the title: "Correct Me If I’m Wrong…”]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2010/09/parakeet_mic.jpg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2010/09/parakeet_mic.jpg" alt="" title="parakeet_mic" width="144" height="144" class="alignright size-full wp-image-3399" /></a>The “Science and Technology” section of this week’s issue of The Economist a feature with the title: <a href="http://www.economist.com/node/16909957?story_id=16909957">“Correct Me If I’m Wrong…”</a> (may require a subscription) In it, the author describes a new user interface called Parakeet, developed by Ola Kristensson and Keith Vertanen, at the University of Cambridge’s Computer Laboratory. Its purpose is to enable mobile subscribers to use the touch screen to correct or improve upon the first-pass results of speech-to-text conversion services.</p>
<p>Parakeet does not represent any technological leap forward. The program displays text versions of utterances for which the recognition engine has the highest confidence levels while it simultaneously displays several other renderings which carry lower degrees of confidence, sort of a range of so-called “nBest” candidates. Users can then indicate their actual utterances by using their fingers on the keyboard.</p>
<p>The originator of a spoken message performs first level triage by making sure that the message that is transmitted reflects what was spoken with great accuracy. I think this should be incorporated as an option for every mobile speech application.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/09/03/parakeet-personal-triage-for-mobile-speech/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>RedStart Systems Utter Command Enhances Dragon NaturallySpeaking</title>
		<link>http://opusresearch.net/wordpress/2009/05/04/redstart-systems-utter-command-enhances-dragon-naturallyspeaking/</link>
		<comments>http://opusresearch.net/wordpress/2009/05/04/redstart-systems-utter-command-enhances-dragon-naturallyspeaking/#comments</comments>
		<pubDate>Mon, 04 May 2009 22:06:40 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[automated dictation]]></category>
		<category><![CDATA[Dragon]]></category>
		<category><![CDATA[Nuance]]></category>
		<category><![CDATA[Opus Research]]></category>
		<category><![CDATA[user experience]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=520</guid>
		<description><![CDATA[<img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/05/dragonbox.png" alt="dragonbox" title="dragonbox" width="198" height="243" class="alignleft size-full wp-image-522" />I've found Twitter to be a great mechanism for staying up to date on my friends opinions of new products. Today Moshe Yudkowsky "tweeted" that "Redstart has one of the more impressive speech recognition-based systems to control a PC." So I had to take a look.]]></description>
			<content:encoded><![CDATA[<p><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/05/dragonbox.png" alt="dragonbox" title="dragonbox" width="198" height="243" class="alignleft size-full wp-image-522" />I&#8217;ve found Twitter to be a great mechanism for staying up to date on my friends opinions of new products. Today Moshe Yudkowsky &#8220;tweeted&#8221; that &#8220;Redstart has one of the more impressive speech recognition-based systems to control a PC.&#8221; So I had to take a look.</p>
<p>First of all, it is wonderful to find an instance where the abbreviation &#8220;UC&#8221; does not stand for &#8220;Unified Communications&#8221;. In the case of Redstart Systems, it refers to its flagship prduct &#8220;<a href="http://www.redstartsystems.com/uttercommand.html?session_id=http://www.redstartsystems.com/uttercommand.html">Utter Command</a>&#8220;. Redstart&#8217;s UC is a set of applets that interworks with Nuance&#8217;s Dragon NaturallySpeaking. Together they provide PC users with the ability to use their voices in ways that truly replace keyboards. They provide a consistent set of spoken commands to do such things as open programs, prepare the header and address information for an email (including cc:&#8217;s), go directly to a Web site, move a cursor, find keywords or phrases in any program or document and even change tracks in a media player while working in a different program. </p>
<p>The company&#8217;s developers based the design to solve problems that PC users encounter every day. For desktop use, NaturallySpeaking can be considered the speech-to-text rendering engine while Redstart enhances with a wrapper of concise commands. The $295 licensing fee may seem a bit steep, with the required Dragon NatruallySpeaking &#8220;Pro&#8221; already carrying a $625-$900 retail price. </p>
<p>In spite of the steep entry price, we see some real promise in the approach taken by Redstart. It uses Dragon as a &#8220;core&#8221; speech-to-text rendering engine, which it has proven to be very good at. It then acts as a developer/partner that creates a market for dictation augmentation. We are already witnessing similar partnering structures taking shape in the mobile world. As Igor Jablokov or Yap mentioned to us, in two <a href="http://opusresearch.net/wordpress/2009/05/04/a-rim-shot-for-yap-a-new-sdk-for-the-blackberry/">recent partnership announcements</a> (SHAPE Services GmbH and MyCaption.com) Yap is the speech-to-text rendering engine while the to partners deliver the workflow and scripting that provide a complete solution. </p>
<p>These constant changes, refinements and, ultimately, &#8220;mashups&#8221; are both the causes and the outcomes of the advent of &#8220;Recombinant Telephony&#8221;.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2009/05/04/redstart-systems-utter-command-enhances-dragon-naturallyspeaking/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Tellme and Microsoft: Two Years On</title>
		<link>http://opusresearch.net/wordpress/2009/04/29/tellme-and-microsoft-two-years-on/</link>
		<comments>http://opusresearch.net/wordpress/2009/04/29/tellme-and-microsoft-two-years-on/#comments</comments>
		<pubDate>Wed, 29 Apr 2009 18:28:18 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[Advisories]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[mobile search]]></category>
		<category><![CDATA[Tellme]]></category>
		<category><![CDATA[user experience]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=464</guid>
		<description><![CDATA[<img src="http://opusresearch.net/wordpress/pdfreports/tellMeMSFT_thumb.png" align='right'  HSPACE=10 vspace=10 border=1/>
<em>Featured Research</em>
Microsoft and Tellme celebrate the second anniversary of their merger with new features that leverage Tellme’s voice application experience and conform to the release cycle of Microsoft Windows and the Microsoft speech engine. The result is improvement in the user experience for voice self-service (VSS) and mobile search.

<em>Advisories are available to registered users only.</em> 

For more information on becoming an Opus Research client, please contact Pete Headrick (<a href="mailto:pheadrick@opusresearch.net">pheadrick@opusresearch.net</a>).]]></description>
			<content:encoded><![CDATA[<p><img src="http://opusresearch.net/wordpress/pdfreports/tellMeMSFT_thumb.png" align='right'  HSPACE=10 vspace=10 border=1/><br />
<em>Featured Research</em><br />
Microsoft and Tellme celebrate the second anniversary of their merger with new features that leverage Tellme’s voice application experience and conform to the release cycle of Microsoft Windows and the Microsoft speech engine. The result is improvement in the user experience for voice self-service (VSS) and mobile search.</p>
<p><em>Advisories are available to registered users only.</em> </p>
<p>For more information on becoming an Opus Research client, please contact Pete Headrick (<a href="mailto:pheadrick@opusresearch.net">pheadrick@opusresearch.net</a>).</p>
<p><!--/hidethis--></p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2009/04/29/tellme-and-microsoft-two-years-on/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

