<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Opus Research &#187; mobile search</title>
	<atom:link href="http://opusresearch.net/wordpress/tag/mobile-search/feed/" rel="self" type="application/rss+xml" />
	<link>http://opusresearch.net/wordpress</link>
	<description>Analysis and Expertise on Voice Services and Conversational Commerce</description>
	<lastBuildDate>Wed, 23 May 2012 17:29:43 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>New Version of Vlingo Adds &#8220;Action Bar&#8221; and Deep Links to OpenTable, Fandango and Kayak</title>
		<link>http://opusresearch.net/wordpress/2010/12/16/new-version-of-vlingo-adds-action-bar-and-deep-links-to-opentable-fandango-and-kayak/</link>
		<comments>http://opusresearch.net/wordpress/2010/12/16/new-version-of-vlingo-adds-action-bar-and-deep-links-to-opentable-fandango-and-kayak/#comments</comments>
		<pubDate>Fri, 17 Dec 2010 00:51:48 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[mobile search]]></category>
		<category><![CDATA[Opus Research]]></category>
		<category><![CDATA[Recombinant Communications]]></category>
		<category><![CDATA[speech-enabled mobile services]]></category>
		<category><![CDATA[user experience]]></category>
		<category><![CDATA[Vlingo]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=3884</guid>
		<description><![CDATA[Vlingo is starting to take on a serious resemblance to Siri with its latest release on Android-based phones. ]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2009/09/vlingo_logo.png"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/09/vlingo_logo.png" alt="" title="vlingo_logo" width="140" height="52" class="alignright size-full wp-image-1371" /></a>Vlingo is starting to take on a serious resemblance to Siri with its latest release on Android-based phones. As with its iPhone-based offering, there is voice-activated &#8220;deep linking&#8221; into the apps that support frequent transactions, like restaurant, theater and travel reservations. But the latest version for Android is much more multimodal because it encourages users to enter commands in text form through a newly designed Action Bar.</p>
<p>There&#8217;s a video demonstration embedded in <a href="http://blog.vlingo.com/vlingo-actionbar/">this blog post</a> on Vlingo&#8217;s site. As its author explains, the Action Bar uses the same &#8220;predictive intelligence&#8221; for both voice recognition and typed commands. This allows users &#8220;to initiate actions when it’s not as convenient speak to their phones – like on a noisy bus or quiet conference room.&#8221; </p>
<p>Well said, and much needed. As voice-enabled applications mature, developers have come to recognize that all mobile commerce apps must support multi-modal input. This positioning is especially important on Android-based devices where, <a href="http://opusresearch.net/wordpress/2010/08/16/voice-actions-for-android-speechable-moments-from-google-spell-new-market-dynamics/">as we noted back in August</a>, Google has made significant investment in rapid, seamless access to speech recognition for command and control of a multiplicity of apps, features and functions on the Android, as well as dictation for populating search boxes or messaging utilities.</p>
<p>Mobile subscribers are the beneficiaries of the efforts by Google, Vlingo, Apple (which owns Siri), Nuance and a handful of other firms who are goading each other on to improve the mobile user interface. Improving recognition accuracy, as well as predictivity (which I know is not a word) is the task at hand and these technology companies are continually raising the bar in what is much more than &#8220;hands-free to hands-free&#8221; combat.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/12/16/new-version-of-vlingo-adds-action-bar-and-deep-links-to-opentable-fandango-and-kayak/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Pre-holiday News Rush for Nuance</title>
		<link>http://opusresearch.net/wordpress/2010/11/23/pre-holiday-news-rush-for-nuance/</link>
		<comments>http://opusresearch.net/wordpress/2010/11/23/pre-holiday-news-rush-for-nuance/#comments</comments>
		<pubDate>Tue, 23 Nov 2010 18:27:13 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Amazon]]></category>
		<category><![CDATA[Apple]]></category>
		<category><![CDATA[Ask.com]]></category>
		<category><![CDATA[ASR]]></category>
		<category><![CDATA[mobile search]]></category>
		<category><![CDATA[Mobile Speech Apps]]></category>
		<category><![CDATA[TTS]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=3791</guid>
		<description><![CDATA[The week of Thanksgiving is usually characterized by a news lull, but Nuance has been an exception. ]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2010/11/1278079098.usr105634.jpg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2010/11/1278079098.usr105634.jpg" alt="" title="AppleVoiceControl" width="180" height="141" class="alignright size-full wp-image-3794" /></a>The week of Thanksgiving is usually characterized by a news lull, but Nuance has been an exception. Yesterday it announced financial results for last quarter which reflected more than 17% growth in top line revenues across all business units, driven 34% growth in &#8220;mobile and consumer&#8221; and a near doubling in &#8220;imaging&#8221; solutions. Then the financial community was momentarily tantalized by a rumor (traced to a video interview with Steve Wozniak of all people) that Apple &#8220;had bought&#8221; Nuance &#8211; a prospect that is not likely at this time.</p>
<p>But Nuance&#8217;s success, both in the marketplace and financial markets, is increasingly predicated on its successful support of new mobile, customer-facing apps. Corporate spending on automated speech is down, as reflected in a 4.5% decline in top line revenues among for the &#8220;enterprise&#8221; business unit. In spite of the fact that Nuance&#8217;s roster of clients includes a healthy mix of telecom and technology companies, like Acer, AT&#038;T, Comcast, Delta, Express Scripts, GM Onstar, IB System, Invomo, Metro PCS, Telekom Deutschland, Telstra, T-Mobile, and Vodafone.</p>
<p>The rumor of Nuance&#8217;s acquisition by Apple may be greatly exaggerated, but stories about successful integration of Nuance mobile speech processing into iPhone-based services is not. Today, just in time for the holiday shopping season, the company announced that its Dragon-branded technologies for &#8220;natural language&#8221; speech recognition, dictation and text-to-speech rendering is the foundation for a newly introduced <a href="http://www.nuance.com/company/news-room/press-releases/NC_007742">Price Check By Amazon</a> iPhone app. </p>
<p>The same technology provides a way for users <a href="http://www.nuance.com/company/news-room/press-releases/NC_007741">Ask.com for iPhone</a> to speak their queries. Ask.com, now a property of Barry Diller&#8217;s IAC, was one of the first Web-based resources designed to build communities of peers (or experts) who could answer a visitors questions. Through the iPhone, Ask.com users can get immediate responses while on-the-go.</p>
<p>That&#8217;s why Nuance&#8217;s future is not tied to any impending acquisition as much as it is predicated on a succession of partnerships and integrations. The Woz may have confused Nuance with Siri in the now famous &#8220;gaff&#8221;, but we must point out that Siri&#8217;s own speech recognition capability is &#8220;powered by Nuance.&#8221;</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/11/23/pre-holiday-news-rush-for-nuance/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google Adds Voice Search in Dutch</title>
		<link>http://opusresearch.net/wordpress/2010/11/05/google-adds-voice-search-in-dutch/</link>
		<comments>http://opusresearch.net/wordpress/2010/11/05/google-adds-voice-search-in-dutch/#comments</comments>
		<pubDate>Fri, 05 Nov 2010 18:53:10 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[mobile search]]></category>
		<category><![CDATA[Speech recognition]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=3679</guid>
		<description><![CDATA[Telecom Paper reports that Google Voice Search now supports Dutch on iPhones and Android-based mobile devices. ]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2009/08/Google_logo.png"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/08/Google_logo.png" alt="" title="Google_logo" width="160" height="74" class="alignright size-full wp-image-1265" /></a><a href="http://www.telecompaper.com/news/article.aspx?cid=766738">Telecom Paper reports</a> that Google Voice Search now supports Dutch on iPhones and Android-based mobile devices. The application accepts search terms, but not phrases at this point. </p>
<p>For those keeping score: Google Voice Search exists in English, Mandarin, Japanese, Korean, French, German, Italian, Spanish, Czech, Polish, Russian and Turkish. Speech recognition is available for Android 2.2 and 2.1. </p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/11/05/google-adds-voice-search-in-dutch/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Baidu Staffing up for Automated Voice Search</title>
		<link>http://opusresearch.net/wordpress/2010/07/08/baidu-staffing-up-for-automated-voice-search/</link>
		<comments>http://opusresearch.net/wordpress/2010/07/08/baidu-staffing-up-for-automated-voice-search/#comments</comments>
		<pubDate>Thu, 08 Jul 2010 21:54:34 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[mobile search]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=3178</guid>
		<description><![CDATA[Baidu, the giant search provider in China, has published hiring specifications for a full staff of speech recognition specialists.]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2010/07/Baidulogo.jpeg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2010/07/Baidulogo.jpeg" alt="" title="Baidulogo" width="125" height="94" class="alignright size-full wp-image-3181" /></a>Voice search has found success on all manner of smartphone with Google, Yahoo and Bing all offering users the option to speak their search terms. According to <a href="http://www.tradingmarkets.com/news/stock-alert/bidu_search-engine-titan-eyeing-voice-search-1028914.html">this report emanating from Beijing</a>, Baidu, the giant search provider in China, has published hiring specifications for a full staff of speech recognition specialists, including &#8220;a head for speech recognition technology, speech recognition engineers, senior speech recognition engineers and so on.&#8221;  </p>
<p>Baidu will need to introduce voice-based search just to stay at parity with Google (which is having problems of a different kind in its efforts to grow its Chinese operations). Industry analysts in China see Baidu&#8217;s voice search targeting mobile audiences and perhaps taking aim at delivery of music and entertainment. On July 2, it signed a deal with China Unicom to ensure that the carrier&#8217;s 3G phones ship with Baidu software pre-installed to facilitate search of text, images, MP3 files and news. China Unicom will introduce the iPhone later this month.</p>
<p>Intrepid readers will know that <a href="http://www.chinawebradar.com/1004/voice-matters-baidu-launches-voice-search-qq-voice-mail-follows.html">Baidu introduced &#8220;voice search&#8221;</a> involving live operators over two years ago. Based on that experience, it knows the mix of questions it can expect form phone users. Like Google, it is not shy about introducing new features and functions to its core of interactive services. Analysts believe that Baidu maintains a huge lead in market share, with about two-thirds of the search business compared to 26% for Google. Yet, as Opus Research&#8217;s Greg Sterling notes in <a href="http://searchengineland.com/google-releases-voice-search-in-china-for-nokia-s60s-28976">this post on Search Engine Land</a>, Google introduced its flavor of Chinese-language voice search in November 2009. As Greg points out, there are over 700 million mobile subscribers in China and that China Unicom has twice the number of subscribers as Verizon Wireless.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/07/08/baidu-staffing-up-for-automated-voice-search/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Why Isn&#8217;t Yahoo Doing More with Vlingo?</title>
		<link>http://opusresearch.net/wordpress/2010/06/15/why-isnt-yahoo-doing-more-with-vlingo/</link>
		<comments>http://opusresearch.net/wordpress/2010/06/15/why-isnt-yahoo-doing-more-with-vlingo/#comments</comments>
		<pubDate>Tue, 15 Jun 2010 17:08:08 +0000</pubDate>
		<dc:creator>Greg Sterling</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[mobile search]]></category>
		<category><![CDATA[voice search]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=3033</guid>
		<description><![CDATA[
We met with Vlingo yesterday. The company is previewing some new functionality and we had an interesting and wide-ranging discussion about the product and strategy. My question is: why isn&#39;t Vlingo investor Yahoo! leveraging the heck out of it? There was some initial fanfare about voice search on Yahoo! Mobile but since then there&#39;s been [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2010/06/Vlingo_logo.png"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2010/06/Vlingo_logo.png" alt="" title="Vlingo_logo" width="198" height="80" class="alignright size-full wp-image-3034" /></a>
<p>We met with <a href="http://www.vlingo.com/">Vlingo</a> yesterday. The company is previewing some new functionality and we had an interesting and wide-ranging discussion about the product and strategy. My question is: why isn&#39;t Vlingo <a href="/news/user-experience/yahoo-upgrades-onesearch-shortcut-auto-locate-voice">investor Yahoo!</a> leveraging the heck out of it? There was some <a href="http://www.ysearchblog.com/2009/02/09/yahoo-onesearch-shortcut-now-with-auto-locate-improved-search-assist-and-windows-mobile-client/">initial fanfare</a> about voice search on Yahoo! Mobile but since then there&#39;s been little or nothing. </p>
<p>I think that Yahoo! should aggressively integrate Vlingo into its mobile app as a discovery tool for all types of content and not just Web search. It should also build a mobile version of Yahoo! Answers and think about Vlingo as a front end to that experience. </p>
<p>Alternatively our friends in Finland (Nokia) might consider buying a company like Vlingo and integrating it as a core component of its handsets &#8212; for smartphones and not-so-smart phones. </p>
<p>Voice will continue to improve as a tool for a broad array of uses on the handset. Apple obviously bought Siri, which is not about voice search per se, but about semantic understanding of voice queries and matching with transactional services. And Google continues to invest in voice and see it as a differentiator. </p>
<p>Microsoft of course had formidable voice assets but has not fully exploited or marketed them. Perhaps with the release of Windows (Mobile) 7 handsets we&#39;ll see a more prominent role for voice on the device. </p>
<p>But Yahoo!&#39;s got to get into the game more here.  </p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/06/15/why-isnt-yahoo-doing-more-with-vlingo/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Nokia and Yahoo: A Quid Pro Quo, but Quo Vadis?</title>
		<link>http://opusresearch.net/wordpress/2010/05/24/nokia-and-yahoo-a-quid-pro-quo-but-quo-vadis/</link>
		<comments>http://opusresearch.net/wordpress/2010/05/24/nokia-and-yahoo-a-quid-pro-quo-but-quo-vadis/#comments</comments>
		<pubDate>Mon, 24 May 2010 22:35:20 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[mobile search]]></category>
		<category><![CDATA[Nokia]]></category>
		<category><![CDATA[Recombinant Communications]]></category>
		<category><![CDATA[Yahoo]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=2920</guid>
		<description><![CDATA[Nokia and Yahoo have formed an alliance which has three principal components.]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2010/05/197028-nokia-yahoo-mobile-partnership_original.jpg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2010/05/197028-nokia-yahoo-mobile-partnership_original-150x119.jpg" alt="" title="197028-nokia-yahoo-mobile-partnership_original" width="150" height="119" class="alignright size-thumbnail wp-image-2921" /></a>Nokia and Yahoo have formed an alliance which, according to this <a href="http://yhoo.client.shareholder.com/press/releasedetail.cfm?ReleaseID=472765">press release</a>, has three principal components. First, Yahoo will be the exclusive provider of email and chat services for subscribers to Nokia&#8217;s wireless internet service, which will be branded Ovi Mail/Ovi Chat powered by Yahoo!. Second, Yahoo! will make Nokia (thanks to its acquisition of NavTeQ three years ago) the exclusive source of maps and driving directions, &#8220;powered by Ovi.&#8221; Finally, the two companies will support &#8220;federated ID&#8221; across the two properties, so that subscribers can use a single user name and password to log on to both companies wireless services.</p>
<p>Selected services will launch in the second half of 2010, with global roll-out of a broad range of services extending into 2011. Both companies say they are trying to expand their global presence. Nokia&#8217;s smartphones have underperformed in the North American market, while Yahoo!&#8217;s mapping capabilities have been largely under-supported on a global basis.</p>
<p>Single sign on across the two sets of services seems like a very modest objective for the two companies. Nokia has been very active in the world of wireless identity management &#8211; through initiatives with Liberty Alliance among others &#8211; for nearly ten years, since the early days of WAP (the Wireless Access Protocol). Yahoo! has long offered single-sign on to multiple services by employing its <a href="http://developer.yahoo.com/openid/">OpenID offering</a>, including a hybrid implementation along with OAuth that enables users to be quickly and anonymously authenticated on multiple secure Web sites.</p>
<p>The initial offering is not impressive. Indeed, more than one analyst has characterized it as the effort of two former category leaders seeking to become relevant gain. Ovi Mail and Messaging had few users. Likewise, Yahoo! Maps has lost its battle against Google Maps (and, besides, it has long been &#8220;powered by NavTeQ&#8221;). As a matter of fact, the whole idea of granting &#8220;exclusives&#8221; is antithetical to the power of RC (Recombinant Communications) because it forecloses on the idea that members of the developer community might introduce an unexpected or much needed innovation. Single-sign on may make the user experience a bit more seamless, but the &#8220;exclusive&#8221; approach seems less likely to generate compelling content or services.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/05/24/nokia-and-yahoo-a-quid-pro-quo-but-quo-vadis/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Promptu Claims a &#8220;First&#8221;: Speech-to-SMS Service Offered by Telecom Italia Mobile</title>
		<link>http://opusresearch.net/wordpress/2009/07/30/promptu-claims-a-first-speech-to-sms-service-offered-by-telecom-italia-mobile/</link>
		<comments>http://opusresearch.net/wordpress/2009/07/30/promptu-claims-a-first-speech-to-sms-service-offered-by-telecom-italia-mobile/#comments</comments>
		<pubDate>Thu, 30 Jul 2009 16:59:56 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[mobile search]]></category>
		<category><![CDATA[Speech recognition]]></category>
		<category><![CDATA[speech-enabled mobile services]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=1122</guid>
		<description><![CDATA[Two firms long known for innovative, high-visibility speech applications are joining forces to enable mobile subscribers to dictate and originate SMS text messages.]]></description>
			<content:encoded><![CDATA[<p><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/07/Picture-43.png" alt="Picture 4" title="Picture 4" width="104" height="53" class="alignright size-full wp-image-1130" />Two firms long known for innovative, high-visibility speech applications are joining forces to enable mobile subscribers to dictate and originate SMS text messages. Telecom Italia Mobile (TIM) distinguished itself early by operating one of the first and certainly the largest multi-application, speech-enabled mobile voice portal. At the dawn of the 21st century, using technology from sister company Loquendo, TIM&#8217;s i-Portal enabled all of its 36 million or so domestic subscribers to dial a number and talk their way through a broad spectrum of voice information services, akin to those offered by U.S.-based Tellme through the toll-free 800-555-TELL. </p>
<p><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/07/Picture-33.png" alt="Picture 3" title="Picture 3" width="104" height="96" class="alignleft size-full wp-image-1129" />In 2002, the i-Portal offered access to news, weather, sports and entertainment and had the capability to handle high volumes of activity. At about the same time, Promptu, then called AgileTV, was getting off the ground as a early-stage venture at SRI. Its core product was to be a voice-activated remote control for all the screen-based services offered by broadband carriers (primarily Cable TV system operators). The devices allowed cable subscribers to tune their sets to a selected program in response to instructions like &#8220;Find the Knicks game&#8221; or &#8220;Tune into some comedy&#8221;. Both Comcast and TimeWarner cable conducted market trials of the speech-activated remotes in 2005, and the company raised $22 million in venture funding.</p>
<p>By 2006, the company changed its name to Promptu and broadened its service offering to include voice-activated mobile search, including location-based services. VC&#8217;s chipped in another $11.6 million. In early 2007, another $5.6 million underwrote a further incursion into the mobile search world, with Italy becoming a major geographic focus. By this time, the entry of Vlingo, Yap and expanded offerings from Nuance under the Nuance Voice Control (NVC) brand and a number of other technology providers signaled the development of a speech-enabled mobile services ecosystem. </p>
<p>Promptu leveraged new attention to speech enabled mobile services to forge relationships with a number of content providers. With the Official Airline Guide (OAG) it rolled out voice-activated access to WAP-based flight information, which was branded Flights2Go.com and offered for $2.99/mo by Verizon Wireless. In September 2008 it formally announced a speech-to-text dictation service at the CTIA Wireless event in San Francisco. At Demo 2009, the company showed ShoutOUT a voice-to-SMS application for the iPhone. To the best of our knowledge, each product in the U.S. has hit considerable speed bumps on its way to mass adoption. The cable remote does not appear to have come out of trial. <a href="http://www.oag.com/landwithaplan/flights2go">Promotion of the Flights2Go WAPlet</a> does not mention that it is speech-enabled and ShoutOUT is nowhere to be found in the the iTunes App Store.</p>
<p>Now our eyes and ears are focusing on Italy as we look for carrier-based involvement in the text-to-SMS business. In North America, with new attention paid to the safety issues surrounding text messaging in cars, there is more incentive than ever for voice-activated texting (and Twittering) to take hold. The focus now must turn to providing the best user experience possible for an easy-to-use service that provides for hands-free, heads-up entry of commands and content. Promptu has clearly not cracked this nut, and it will not be alone it its efforts to create and popularize speech enabled mobile services.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2009/07/30/promptu-claims-a-first-speech-to-sms-service-offered-by-telecom-italia-mobile/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>New Attention to Mobile Voice Control (Thanks Apple and Om!)</title>
		<link>http://opusresearch.net/wordpress/2009/07/08/new-attention-to-mobile-voice-control-thanks-apple-and-om/</link>
		<comments>http://opusresearch.net/wordpress/2009/07/08/new-attention-to-mobile-voice-control-thanks-apple-and-om/#comments</comments>
		<pubDate>Wed, 08 Jul 2009 22:43:04 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Apple]]></category>
		<category><![CDATA[mobile search]]></category>
		<category><![CDATA[Speech recognition]]></category>
		<category><![CDATA[voice search]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=960</guid>
		<description><![CDATA["How Speech Technologies Will Transform Mobile Use" is a topic that's near and dear to my heart; but at this point I would frame the issue as "How Mobile Use Will Transform Speech Technologies."]]></description>
			<content:encoded><![CDATA[<p><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/07/Picture-9-150x150.png" align='right' HSPACE=5 vspace=5/>Because I have been an analyst in the automated speech domain for almost 20 years, I&#8217;m all for the introduction of interest among a fresh crop of knowledgeable proponents of new speech-enabled or &#8220;voice command&#8221; services. It was no mean feat for Apple to include Voice Control as one of the top features for the iPhone 3GS. It made voice input for command/control and dictation a peer of improved video, cut-and-paste, global search and other &#8220;most wanted&#8221; services.</p>
<p>Now GigaOm features <a href="http://gigaom.com/2009/07/07/is-iphones-voice-control-the-sound-of-things-to-come/">the opinion of guest columnist Phil Hendrix</a>, Founder and Director or the Institute for Mobile Markets Research. Somewhere behind the new GigaOm Pro&#8217;s paywall is a report from immr that professes to explain &#8220;How Speech Technologies Will Transform Mobile Use.&#8221; It&#8217;s a topic that&#8217;s near and dear to my heart; but at this point I might want to frame it as &#8220;How Mobile Use Will Transform Speech Technologies.&#8221;</p>
<p>1) Make the option for voice input ubiquitous &#8211; We&#8217;re getting there. The iPhone 3GS is a high-visibility device, but preloaded or downloaded apps that support speech input are already on hundreds of millions of handsets around the world. In addition, dial-up services for voice-enabled search (like GOOG411 or Microsoft/Tellme&#8217;s Bing411) have the potential to be a speed-dial button away when they add new territories and languages. </p>
<p>2) Do a better job of managing user expectations of accuracy &#8211; speech rec will never be 100% accurate. Think of all the times that you don&#8217;t understand what a person is saying to you on the first try. Why should wireless users expect machines to be any better. It&#8217;s important upfront and by multiple channels (demonstrations, video tutorials, Peer2Peer discussions), to make it clear that systems make best efforts to capture what is being said, but may not always &#8220;understand&#8221; meaning. It may seem like a hard logic to follow, but it is vital for maintaining the technology&#8217;s potential and the caller&#8217;s low blood pressure.</p>
<p>3) Showcase and support new use cases and tutorials &#8211; I think that we&#8217;ve only begun to present wireless subscribers with the sorts of options that put the technology in its best light. People love to break these things. It was in this post, that I pointed to the <a href="http://opusresearch.net/wordpress/2009/06/30/nytimes-column-highlights-google-voices-transcription-shortcomings/">NYTimes technology blog </a>in which David Gallagher logged the many transcription mistakes boldly displayed by Google Voice&#8217;s voicemail-to-text transcription. </p>
<p>4) Never make spoken input the only option &#8211; Note that I said make the &#8220;option&#8221; ubiquitous. Under no circumstances should an application require spoken input. Google understands this. So does Nuance, often regarded as a roll-up of speech processing technologies, but really a purveyor of platforms that accept (and in many cases predict) user input in the form of text, spoken words and short codes.</p>
<p>My problem with the NYTimes column and other vehicles for criticism is this: they are largely &#8220;descriptive&#8221; and not &#8220;prescriptive.&#8221; Instead of showing how easily broken the new interface is, how about demonstrating what it does quite well. It captures utterances and, like a marginally good stenographer, presents a &#8220;rough draft&#8221; of what was said, along with some notation of where the input might be &#8220;iffy.&#8221; Speakers should never think of this as a finished product. At best it should provide a form of talking triage &#8211; showing the speaker, or recipient where the transcription may need a bit of work or sculpting. </p>
<p>At base it&#8217;s an opportunity to game the system. If people like to make a game of breaking the application (by reading Jabberwocky to Google Voice, for instance), it makes more sense to cast yourself in the role of hero and fix what&#8217;s broken in a poorly transcribed message. If voice control is to be &#8220;the sound of things to come&#8221; for mobile subscribers, it requires a much more concerted effort than the shallow effort to integrate with the iPhone or the one-and-done guessing game that Google Voice plays. Handsfree dialing and text input through a Bluetooth headset or earbud would be a nice start. Well-prompted and choreographed messaging &#8211; be it addressing and dictation of SMS or friendly prompts to help originators or recipients triage transcribed voice mail &#8211; would be a cool way to start. </p>
<p>Mobile voice is a big area of opportunity. IBM had been working on it for years. Nuance, Google, Vlingo, Microsoft/Tellme, Spinvox, PhoneTag, Yap and a few others are fielding some pretty impressive products and services. Adoption will be no accident and, even though its been years in the making, it is still early days in product and market development. In other words: The Customer Has Not Spoken. </p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2009/07/08/new-attention-to-mobile-voice-control-thanks-apple-and-om/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Voice Control for Mobile: Hands Free or Not Hands Free?</title>
		<link>http://opusresearch.net/wordpress/2009/06/08/voice-control-for-mobile-hands-free-or-not-hands-free/</link>
		<comments>http://opusresearch.net/wordpress/2009/06/08/voice-control-for-mobile-hands-free-or-not-hands-free/#comments</comments>
		<pubDate>Mon, 08 Jun 2009 19:40:41 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Apple iPhone]]></category>
		<category><![CDATA[mobile search]]></category>
		<category><![CDATA[Speech recognition]]></category>
		<category><![CDATA[wwdc]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=758</guid>
		<description><![CDATA[That is the Question. Whether it&#8217;s driven by state strictures surrounding drivers&#8217; safety or simply a matter of convenience, a growing number of mobile devices have added automated speech. The iPhone GS, which was demo&#8217;d at the Apple WorldWide Developers Conference (#wwdc for you hash taggers on Twitter) is no exception. 
Automated speech winnowed its [...]]]></description>
			<content:encoded><![CDATA[<p>That is the Question. Whether it&#8217;s driven by state strictures surrounding drivers&#8217; safety or simply a matter of convenience, a growing number of mobile devices have added automated speech. The iPhone GS, which was demo&#8217;d at the Apple WorldWide Developers Conference (#wwdc for you hash taggers on Twitter) is no exception. </p>
<p>Automated speech winnowed its way into the iPhone demo in two instances. First was the text-to-speech rendering of driving directions offered by TomTom, whose CTO put on a demo at WWDC. But the coup-de-gras was the unveiling of native &#8220;Voice Control&#8221;, a feature that delivers a voice-enabled user interface when iPhone users hold down the &#8220;Home&#8221; button. At that piont, they can use the spoken word to conduct voice-activated dialing, search their iTunes library or enter other commands to their iTunes application.</p>
<p><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/06/picture-5-150x150.png" alt="picture-5" title="picture-5" width="150" height="150" class="aligncenter size-thumbnail wp-image-759" /></p>
<p>There are already more than two dozen voice-enabled applications for the iPhone, ranging from stand-alone voice-activated dialing (from the resident contact list) to full-blown dictation of text messages, Tweets or FaceBook entries. All require some sort of manual input (pressing or holding down a button) to let the device know that it is accepting spoken input.</p>
<p><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/06/picture-6.png" alt="picture-6" title="picture-6" width="140" height="46" class="alignleft size-full wp-image-765" />In a move that has been a long time coming, Sensory Inc., has finally packaged its technology to support truly hands-free input to mobile devices. By adding resident &#8220;word spotting&#8221; technology to its firmware, Sensory enables the makers of Bluetooth devices (specifically wireless headsets, aftermarket &#8220;car kits&#8221;, and stereo headsets) to sell devices that let their users define phrases that &#8220;trigger&#8221; speech recognition. As Sensory&#8217;s CEO Todd Mozer observes, &#8220;&#8221;What&#8217;s been needed all along is a speech controlled technology that doesn&#8217;t require a button press to get started.&#8221;</p>
<p>We agree. The products will carry the &#8220;BlueGenie&#8221; brand. It will enable its users to say a simple wake-up phrase like &#8220;Hello BlueGenie&#8221; to signal the device to &#8220;listen&#8221; for an operational speech command. It is a new technology from Sensory that both works well in relatively noisy environments, but most importantly, it uses a &#8220;patent-pending&#8221; technology to minimize power consumption and avoid rapid battery drain.  </p>
<p>We&#8217;ll be watching to see how many headset makers, ODMs and OEMs contract to use the Sensory technology. At first it will be a nice differentiator, in the long-run we see it as an important feature to prevent driver distraction.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2009/06/08/voice-control-for-mobile-hands-free-or-not-hands-free/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Speech Rec Expected To Spur Use of NYC&#8217;s Trip Planner</title>
		<link>http://opusresearch.net/wordpress/2009/06/01/speech-rec-expected-to-spur-use-of-nycs-trip-planner/</link>
		<comments>http://opusresearch.net/wordpress/2009/06/01/speech-rec-expected-to-spur-use-of-nycs-trip-planner/#comments</comments>
		<pubDate>Mon, 01 Jun 2009 15:04:55 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[mobile search]]></category>
		<category><![CDATA[Nuance]]></category>
		<category><![CDATA[Speech recognition]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=717</guid>
		<description><![CDATA[<img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/06/mta-logo-150.jpg" alt="mta-logo-150" title="mta-logo-150" width="130" height="130" class="alignleft size-full wp-image-720" />The New York City Metropolitan Transit Authority (MTA) will deploy an automated speech recognition from Nuance to shorten wait times, with hope, expand the reach of its "Trip Planner" service to the thousands more New Yorkers dialing 718-330-1234.]]></description>
			<content:encoded><![CDATA[<p><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/06/mta-logo-150.jpg" alt="mta-logo-150" title="mta-logo-150" width="130" height="130" class="alignright size-full wp-image-720" />Extending accurate trip information to mobile commuters is a large area of opportunity for transit agencies around the world. Now the New York City Metropolitan Transit Authority (MTA) will deploy an automated speech recognition from Nuance to shorten wait times, with hope, expand the reach of its &#8220;Trip Planner&#8221; service to the thousands more New Yorkers dialing 718-330-1234.</p>
<p>Use of Trip Planner on the MTA&#8217;s Web site has increased dramatically over the past year. The agency reports that 18,045 people use Trip Planner &#8220;daily&#8221; in April 2009, a nearly 172% increase of the 6,640 daily users in 2008. At the same time, over 4,500 &#8220;customers&#8221; call Gotham&#8217;s Travel Information Center (TIC), on average each day. The purpose of Trip Planner Voice is to use speech automation to reduce these callers&#8217; wait times and, thus serve more callers. </p>
<p>It&#8217;s early, but the uptake has been impressive: Since a controled &#8220;beta&#8221; version of the service was introduced last September, over 95,000 callers have used the service. During that time, the number of daily users increased from 382 to 746 (which, by our estimates, represents an annualized increase of 127%).</p>
<p>The service is an integration of Nuance&#8217;s speech recognition with a traveler information platform from Trapeze Group. It is integrated on a call handling and information management platform from Aspect Software. The deployment (coordinated by Aspect), takes a classic &#8220;form filling&#8221; approach. It prompts each caller to say his or her point of origin and destination<br />
by address, intersection, or landmark and then provides travel instructions based bus, subway or Staten Island Railway schedule information. </p>
<p>Today, the service can handle 96 simultaneous users. It is offered in American English only, but a Spanish version is under development.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2009/06/01/speech-rec-expected-to-spur-use-of-nycs-trip-planner/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

