<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Opus Research &#187; mobile speech</title>
	<atom:link href="http://opusresearch.net/wordpress/tag/mobile-speech/feed/" rel="self" type="application/rss+xml" />
	<link>http://opusresearch.net/wordpress</link>
	<description>Analysis and Expertise on Voice Services and Recombinant Communications</description>
	<lastBuildDate>Thu, 29 Jul 2010 23:57:39 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>ARM&#8217;ed and Less Dangerous: Mobile Speech Made Easier</title>
		<link>http://opusresearch.net/wordpress/2010/07/26/armed-and-less-dangerous-mobile-speech-made-easier/</link>
		<comments>http://opusresearch.net/wordpress/2010/07/26/armed-and-less-dangerous-mobile-speech-made-easier/#comments</comments>
		<pubDate>Mon, 26 Jul 2010 21:13:02 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[Mobile Platforms]]></category>
		<category><![CDATA[mobile speech]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=3281</guid>
		<description><![CDATA[According to a guest blog post on ARM's Web site, the next generation of the mobile CPU maker's flagship chip will be better suited to perform audio (and voice) processing. ]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2010/07/Unknown.jpeg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2010/07/Unknown.jpeg" alt="" title="Arm Logo" width="126" height="70" class="alignright size-full wp-image-3284" /></a>According to <a href="http://">this &#8220;guest blog&#8221; post</a> on ARM&#8217;s Web site, the next generation of the mobile CPU maker&#8217;s flagship chip will be better suited to perform audio (and voice) processing. This development is especially interesting in light of <a href="http://www.arm.com/about/newsroom/microsoft-licenses-arm-architecture.php">joint announcement from ARM and Microsoft</a> that they are extending their collaborative development efforts in ways that are expected to have the greatest impact in support of &#8220;embedded&#8221; versions of the Windows OS and the latest version of Windows Phone. </p>
<p>Even though Microsoft and ARM have a  relationship that dates back to the mid-1990s, Microsoft Windows is, for good reason, most closely associated with Intel (WinTel being the predominant desktop combo in enterprise settings). Yet <a href="http://www.forbes.com/2010/07/24/intel-arm-ibm-technology-cio-network-microsoft.html">Ed Sperling gets it exactly right</a> when he points out that ARM performs better than the Intel ATOM processor when it comes to power management, in battery-powered mobile or embedded devices. </p>
<p>Taking the two announcements together, we can expect Windows Phone-based products to have sufficient battery life and do a better job of recognizing and rending spoken utterances than current models.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/07/26/armed-and-less-dangerous-mobile-speech-made-easier/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Microsoft&#8217;s Hawaii Project: Leveraging Its Cloud for Mobile</title>
		<link>http://opusresearch.net/wordpress/2010/07/12/microsofts-hawaii-project-leveraging-its-cloud-for-mobile/</link>
		<comments>http://opusresearch.net/wordpress/2010/07/12/microsofts-hawaii-project-leveraging-its-cloud-for-mobile/#comments</comments>
		<pubDate>Mon, 12 Jul 2010 18:44:32 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Azure]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[Mobile Platforms]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[Windows Mobile]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=3194</guid>
		<description><![CDATA[Last Friday, Microsoft Maven Mary-Jo Foley issued this report providing a few details about Microsoft's initiatives to marry its cloud-based resources with application development efforts involving Windows Mobile.]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2010/07/Msftmobile.jpeg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2010/07/Msftmobile.jpeg" alt="" title="Msftmobile" width="112" height="117" class="alignright size-full wp-image-3198" /></a>Last Friday, Microsoft Maven Mary-Jo Foley issued <a href="http://www.zdnet.com/blog/microsoft/microsofts-new-hawaiian-codenames-are-all-about-mobile/6773">this report</a>, unpacking details about Microsoft&#8217;s initiatives to marry its cloud-based resources with application development efforts involving Windows Mobile. The initiative, code named &#8220;Project Hawaii&#8221; is all about how Microsoft will encourage students &#8211; initially at the University of Southern California (USC), Univeristy of Wisconsin-Madison, and Duke University &#8211; to explore how to &#8220;use the cloud to enhance the user experience on mobile devices.&#8221;</p>
<p>Keeping with the Hawaiian theme, the program incorporates initiatives that have been conducted as part of Microsoft&#8217;s &#8220;Mobile Assistance Using Infrastructure&#8221; (MAUI) Project. Its purpose is to enable &#8220;a new class of cpu- and data-intensive applications that seamlessly augment the cognitive abilities of users by exploiting speech recognition, NLP, vision, machine learning, and augmented reality&#8221;. Based on experience with WinMobile 6 and 6.5, it determined that such a project is required to define how to overcome issues surrounding battery life (which it calls &#8220;energy limitations of handhelds) &#8220;by leaveraging nearby computing infrastructure&#8221;, which is believed to encompass resources in Microsoft&#8217;s storage and compute cloud (Azure) accessible over wireless networks (WiFi, WLAN, FemtoCell&#8230;)</p>
<p>In a nutshell, Microsoft says that the mobile &#8220;platform&#8221; runs on a  Windows Mobile 6.5 smartphone (with Windows Phone 7 &#8220;in planning&#8221;). The services that are invoked in Microsoft&#8217;s cloud include Bing Maps for mapping services, and Windows Live ID for user identification. More detail on MAUI can be found on the <a href="http://research.microsoft.com/en-us/projects/maui/">Microsoft Research Web site</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/07/12/microsofts-hawaii-project-leveraging-its-cloud-for-mobile/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Speechable Moments Showcase: Oranges&#8217; Voxcards and L4&#8217;s Connected TV</title>
		<link>http://opusresearch.net/wordpress/2010/06/09/speechable-moments-showcase-oranges-voxcards-and-l4s-connected-tv/</link>
		<comments>http://opusresearch.net/wordpress/2010/06/09/speechable-moments-showcase-oranges-voxcards-and-l4s-connected-tv/#comments</comments>
		<pubDate>Wed, 09 Jun 2010 22:52:37 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[Video]]></category>
		<category><![CDATA[voice command]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=3008</guid>
		<description><![CDATA[One of the roles of this blog is to highlight successful integration of speech processing and voice user interfaces into activities in an increasingly multimedia world.]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2010/06/DNA_art_2007.jpg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2010/06/DNA_art_2007.jpg" alt="" title="DNA_art_2007" width="150" height="150" class="alignright size-full wp-image-3010" /></a>One of the roles of this blog is to highlight successful integration of speech processing and voice user interfaces into activities in an increasingly multimedia world. My habit is to call them &#8220;speechable moments&#8221; and I believe that they will be the foundation for adoption of speech processing technologies, while dramatizing that such success is predicated on positive results and a gratifying user experience rather than just fulfilling on the promise of greater accuracy in recognition engines or lower latencies in response times from resources in the cloud.</p>
<p>Yesterday the &#8220;Inside Facebook&#8221; blog featured <a href="http://www.insidefacebook.com/2010/06/08/orange-has-a-growing-and-funny-text-to-speech-facebook-app/">this post</a> about a service called VoxCard, which enables the denizens of Facebook to festoon their friends&#8217; or their own walls with talking postcards. They start by adding the VoxCard application to their Facebook account. Then, albeit <em>en Francais</em>, they can choose a character from a gallery that includes of celebrity caricatures, popular icons or quirky personalities. They can then choose and customize a standards message or type in an original message from scratch. </p>
<p>Messages can be posted to the originators &#8220;wall&#8221; or to the wall of a friend. As alternatives they can be embedded in messages to any number of people in the Facebook member&#8217;s contact list. When the recipient clicks on the card, it recites the message in the character&#8217;s voice (with a French accent <em>bien sur</em>). According to the Inside Facebook post, there app is seeing about 200,000 &#8220;active monthly users,&#8221; a small fraction of the hundreds of millions of people on the popular social networking platform. Still, it is there for all to see, hear and play around with.</p>
<p>L4 Systems&#8217;s Connected TV is a bit different. It is not yet available for all to see, hear and play around with. Instead, the company will debut its technology in the fall, to coincide with the introduction of Google TV, Tivo Premiere and a forthcoming Yahoo! TV (which I have not seen mention of elsewhere). The Seattle area software company has determined that it is not too early to debut and demo its flagship software product which aims to deliver &#8220;an always-available visual and speech recognition user interface to interact with features in the living room and on the go.&#8221; </p>
<p>It plans a simultaneous release of two applications. L4 Connected TV and L4 Connected TV Mobile. Together they turn an iOS phone, Android or Blackberry into a voice navigation module for &#8220;Connected TV&#8221; products, including &#8220;Yahoo Connected TV&#8221; running on on Samsung, Sony, Vizio or LG sets; Google TV and TiVo. This will provide users with the ability to navigate an aggregation of content that includes social networks, local search and music. The sources, as noted in the company&#8217;s press release, will include Facebook, Twitter, Google Buzz, Blogger, Bing, and Apple iTunes or Amazon Music Store. A relatively static &#8220;demo&#8221; with screen shots can be viewed <a href="http://www.l4sys.com/L4/App.html">here</a>.</p>
<p>The fall season promises to bring a number of speechable moments to the TV-watching public. GoogleTV includes voice control in its offering already. Mobile dictation specialist Promptu got its start as AgileTV, a company whose charter was to provide voice-based control of the cable TV program guide and program selection process. The addition of L4 to the mix of technology providers is a tacit admission that navigating all the content, features and services offered through TV screens today and the future is getting complicated and voice commands should serve as the most efficient way to cut through the clutter.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/06/09/speechable-moments-showcase-oranges-voxcards-and-l4s-connected-tv/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Bing Mobile Turns to Tellme for Spoken Driving Directions</title>
		<link>http://opusresearch.net/wordpress/2010/05/11/bing-mobile-turns-to-tellme-for-spoken-driving-directions/</link>
		<comments>http://opusresearch.net/wordpress/2010/05/11/bing-mobile-turns-to-tellme-for-spoken-driving-directions/#comments</comments>
		<pubDate>Tue, 11 May 2010 18:14:25 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[navigation]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=2824</guid>
		<description><![CDATA[In a blog post, Microsoft's Justin Jed highlights two new features that ship with the latest version of the Bing App for Mobile.]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2009/10/Bing-logo.png"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/10/Bing-logo-150x65.png" alt="" title="Bing logo" width="150" height="65" class="alignright size-thumbnail wp-image-1697" /></a>In <a href="http://www.bing.com/community/blogs/search/archive/2010/05/10/updated-bing-app-for-windows-phone.aspx">this post</a>, Microsoft&#8217;s Justin Jed highlights two new features that ship with the latest version of the Bing App for Mobile, designed for phones running the Windows Mobile 6.x operating system. First is a new look for the apps &#8220;Home Page&#8221; which, like its counterparts running on desktops, laptops, iPhones and Androids, sports some stunning photography. But more important is the one-touch access to popular uses and features that emphasize local activities: Movies, Traffic, Maps, Local, Directions and Favorites.</p>
<p>Of greater interest from our point of view is the integration of Microsoft&#8217;s Tellme service into the Windows Mobile mix &#8211; this time as the foundation for voice-based turn-by-turn driving directions. It has been over a year since Greg Sterling posted <a href="http://www.internet2go.net/news/directory-assistance/tellme-offers-evolution-voice-search">this assessment</a> of Tellme Mobile and its ability to support voice-based search, messaging and call origination. As one of the originators of the whole &#8220;Voice Portal&#8221; concept, Tellme has had the ability to offer turn-by-turn directions for a number of years. </p>
<p>Starting with the first Droid and its &#8220;car mount&#8221; (introduced in October 2009), smartphone makers have had their sights set on the Personal Navigation Device (PND) market. Just last week, <a href="http://opusresearch.net/wordpress/2010/05/07/nokias-own-voice-for-ovi-maps-youtube-for-driving-directions/">Nokia stepped up competition</a> by creating a marketplace for &#8220;own voices&#8221; to provide turn-by-turn directions for its OVI Maps application. Making turn-by-turn directions part of the Bing Mobile application makes navigation an integral part of the local, mobile search experience and is a very nice touch. </p>
<p>Microsoft already incorporates directions (sans the &#8220;voice guide&#8221; feature) into the Bing app on iPhones and Androids. The new application is designed to work on the Sprint, T-Mobile, or AT&#038;T networks, and could emerge as an important differentiator in what is emerging as an increasingly competitive market.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/05/11/bing-mobile-turns-to-tellme-for-spoken-driving-directions/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>GM = Google Motors?</title>
		<link>http://opusresearch.net/wordpress/2010/05/10/gm-google-motors/</link>
		<comments>http://opusresearch.net/wordpress/2010/05/10/gm-google-motors/#comments</comments>
		<pubDate>Mon, 10 May 2010 19:29:47 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Android]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[Recombinant Communications]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=2812</guid>
		<description><![CDATA[After Ford showcased the full spectrum voice-enabled SYNC services on a sub-$16K Fiesta, GM appears prepared to counter with a broad variety of wireless mobile apps offered in conjunction with Google.]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2010/05/android_logo.jpg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2010/05/android_logo-150x113.jpg" alt="" title="android_logo" width="150" height="113" class="alignright size-thumbnail wp-image-2814" /></a>After Ford showcased the full spectrum SYNC services on a sub-$16K Fiesta (<a href="http://kara.allthingsd.com/20100426/inside-fords-app-happy-fiesta-but-no-manilow/">even taking Kara Swisher for a test sit</a>), GM appears prepared to counter with a broad variety of wireless mobile apps offered in conjunction with Google. In <a href="http://wot.motortrend.com/6646652/technology/gm-woos-google-collaboration-would-compete-with-ford-microsofts-sync/index.html">this article in Motor Trend</a> Todd Lassa lays out the basics of a relationship whereby the the &#8220;open&#8221; Android operating system would be licensed for use in GM automobiles.</p>
<p>Lassa asserts that the GM/Google relationship would place emphasis on a better phone-to-car interface, as opposed to the voice control and voice user interface that Microsoft&#8217;s Speech Application Group has played up. Thus GM&#8217;s approach will enable drivers to use their phones to do such things as start or turn off their cars, lock and unlock doors, and make other adjustments. It was not spelled out explicitly in the article, but given Google&#8217;s efforts to invoke automated speech recognition whenever a keyboard comes into play on a mobile device, it is highly likely that all of these functions can be voice controlled &#8211; making starting your car another &#8220;speechable moment&#8221;.</p>
<p>As for the supposition that Android in the car spells the end of OnStar, that is highly unlikely. Lassa notes that turn-by-turn directions through OnStar would become unnecessary because Android phones using Google Maps and a special mount have been successfully deployed for in-car navigation. But OnStar has been sold more as a safety feature and remote diagnostic service. The Android operating system in the car is more likely to augment, rather than compete with OnStar.</p>
<p>The prospects for more automobile-based Android apps is provocative. The car is destined to be the most fertile spawning ground for speech-based apps and the prospects for Android-oriented developers to define a range of &#8220;hands-on-the-wheel/eyes-facing-forward&#8221; capabilities and activities is very promising. Meanwhile, Ford remains ahead of the game with a well-defined, and now time tested, suite of voice control applications for frequent activities like carrying out phone conversations, messaging and controlling the car&#8217;s entertainment system.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/05/10/gm-google-motors/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Nokia&#8217;s &#8220;Own Voice for OVI Maps&#8221;: YouTube for Driving Directions?</title>
		<link>http://opusresearch.net/wordpress/2010/05/07/nokias-own-voice-for-ovi-maps-youtube-for-driving-directions/</link>
		<comments>http://opusresearch.net/wordpress/2010/05/07/nokias-own-voice-for-ovi-maps-youtube-for-driving-directions/#comments</comments>
		<pubDate>Fri, 07 May 2010 22:48:49 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[Nokia]]></category>
		<category><![CDATA[Personal Navigation Devices]]></category>
		<category><![CDATA[Recombinant Communications]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=2798</guid>
		<description><![CDATA[Mobile phones are the most personal of communications devices. A person's voice is one of the most personal of communications modes. Put them together and you may create the foundation for a new family of Mo' Personal Navigation Devices (MPNDs?)]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2010/05/ownvoicelogo.png"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2010/05/ownvoicelogo-150x50.png" alt="" title="ownvoicelogo" width="150" height="50" class="alignright size-thumbnail wp-image-2807" /></a>Mobile phones are the most personal of communications devices. A person&#8217;s voice is one of the most personal of communications modes. Put them together and you may create the foundation for a new family of Mo&#8217; Personal Navigation Devices (MPNDs?). Well, at least that&#8217;s the logic behind &#8220;Own Voice&#8221; a free, downloadable application for Nokia handsets that can run the latest rev of Ovi Maps (3.03) which feature voice output with turn-by-turn directions. See <a href="http://www.comms.ovi.com/m/p/ovi/01%20SITE/">this page</a> for a description of the service and a list of phones, along with a slightly annoying soundtrack. You can also follow instructions on the site to download the app along with Ovi Maps in order to try the service for yourself.</p>
<p>Nokia purchased the leading provider of digital maps, NavteQ, in October 2007 for more than $8 billion. It was a clear statement that it was prepared to position its smartphones as a more functional and affordable alternative to single-purpose personal navigation devices (PNDs) from TomTom, Garmin or Magellan, bearing in mind that these were companies that in many cases were customers of NavteQ and used its files to support their PNDs. </p>
<p>When Google and Verizon introduced an automobile mount for the Droid in late 2009, it not only intensified the belief that single-purpose PNDs&#8217; days are numbered, it was a direct challenge to Nokia and its OVI Maps service. Thus Own Voice is the product of heightened competition and it will serve as a test to see if the guiding voice emanating from a navigational system (as opposed to price, appearance, accuracy, ease of programming or some other factor) can emerge as a competitive advantage.</p>
<p>As the Web site indicates, it takes quite a few steps to initiate use of Own Voice. At a minimum you must own one of the listed phones, which include Nokia 5800, Nokia X6, Nokia E72, Nokia N97, Nokia N97 mini and all upcoming GPS-enabled smartphones. You must also have purchased and installed the OVI Maps app from the OVI Appstore. Own Voice is a free add-on to the OVI Maps Application. At that point subscribers are navigated through the process of recording 53 discreet spoken elements, including the ever-popular &#8220;You have arrived!&#8221;. Then &#8220;Voila&#8221;, they have created a personal navigation application that employs their own voice.</p>
<p>In this era of immediate gratification and real-time communications, it is ambitious and probably unrealistic to think that new users will spend the hour or so it might take to record 53 utterances to their own satisfaction. Just think of how many times it took for you to get &#8220;the right greeting&#8221; on your mobile phone&#8217;s voicemail service. Now think of doing that 52 more times. It is much more likely that they will opt to choose the pre-recorded voices that other Own Voice users have &#8220;shared&#8221; in the Own Voice marketplace. It will never be as big as &#8220;Charlie Bit My Finger on YouTube, but Nokia is creating the sort of social platform that can provide an ego boost for a creative, self-identifying voice talent. There were already scores of examples in dozens of languages on the site, which includes a column of &#8220;Featured&#8221; voices, as well as the obligatory &#8220;Most Popular&#8221; page to help people identify a short list of candidates for the voice they&#8217;d like to hear when they&#8217;re looking for directions in a strange town.</p>
<p>Nokia deserves credit for using a voice-based service to differentiate its mobile product. Own Voice for Ovi Maps&#8217; planners also deserves style points for creating yet another platform for someone&#8217;s creative work to &#8220;go viral&#8221; (though we&#8217;d be interested to know the size the &#8220;audience&#8221; made up of users of the latest rev of OVI maps on selected Nokia smartphones amounts to these days). It is just the sort of thing that may be daunting to casual users but challenging enough to attract a sufficient number of creative types to make for some very entertaining, and well differentiated, PND-based experiences.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/05/07/nokias-own-voice-for-ovi-maps-youtube-for-driving-directions/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>If True, Apple&#8217;s Purchase of Siri Heralds New Age of &#8220;Virtual Assistance&#8221;</title>
		<link>http://opusresearch.net/wordpress/2010/04/28/if-true-apples-purchase-of-siri-heralds-new-age-of-virtual-assistance/</link>
		<comments>http://opusresearch.net/wordpress/2010/04/28/if-true-apples-purchase-of-siri-heralds-new-age-of-virtual-assistance/#comments</comments>
		<pubDate>Wed, 28 Apr 2010 17:21:48 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Apple]]></category>
		<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[Recombinant Communications]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=2770</guid>
		<description><![CDATA[Today on Twitter [I've always wanted to start a column like that] Robert Scoble (@scobleizer) surmised that Apple Computer has acquired Siri.]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2010/04/Siri_logo.jpg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2010/04/Siri_logo.jpg" alt="" title="Siri_logo" width="144" height="99" class="alignright size-full wp-image-2773" /></a>Today on Twitter [I've always wanted to start a column like that] Robert Scoble (@scobleizer) surmised that Apple Computer has acquired Siri. As evidence, he points to the <a href="http://www.ftc.gov/bc/earlyterm/index.shtml">April 27 link on the U.S. Federal Trade Commissions &#8220;Hart Scott Rodino&#8221; Web page</a>. On it, both Apple Computer and Siri have been granted &#8220;early termination&#8221; of any waiting period associated with anti-trust concerns. Ergo: they must be merging.</p>
<p>Siri has already proven to be a popular application for iPhone, and has been deemed even better for the iPad where, with closer ties to the parent company&#8217;s internal labs, it would help define Apple&#8217;s tablet as the ultimate personal assistant. The founders of Siri had already been instrumental in bringing millions of dollars (and person months) invested at SRI&#8217;s Artificial Intelligence Center in the <a href="http://en.wikipedia.org/wiki/CALO">CALO/PAL</a> project (where the word pal refers to a &#8220;Personal Assistant that Learns). It was the largest and singularly most ambitious project to apply artificial intelligence to solve everyday, recurring problems.</p>
<p>By buying Siri and hiring the core staff, which includes founders Adam Cheyer, Dag Kittlaus and Tom Gruber. Cheyer served as program director for CALO. Kittlaus applied his extensive experience in mobile services at Telenor, Motorola and Screen3. Gruber&#8217;s insights into the practical application of artificial intelligence has been an important factor in shaping the services of LinkedIn, SocialText and PowerSet.</p>
<p>There is no confirmation from either Siri or Apple that the acquisition is pending. Yet the prospects for embedding a &#8220;personal assistant that learns&#8221; across Apple&#8217;s entire &#8220;iLine&#8221; (iPod, iPad and iPhone) as well as its line of PCs and cloud-based services (meaning the iTunes Store, etc.) is enticing. When we f<a href="http://opusresearch.net/wordpress/2010/02/04/siri-debuts-on-iphone-speech-based-virtual-personal-assistant/">irst wrote up Siri in February</a> we noted that it had forged relationships with Web sites that put an emphasis on &#8220;results&#8221;: Fandango for movie ticket buying or Open Table to book a table at a restaurant. The company&#8217;s revenue plan was to take a percentage of the transactions it supports. Clearly, it would take a while for a small company like Siri to build the subscriber base and transaction flow to make its revenue plans &#8220;lucrative.&#8221;</p>
<p>As part of Apple the company will find that it has a longer time to build up the revenue stream while, at the same time, it will be easier to scale the business using Apple&#8217;s existing resources. Clearly a win/win. </p>
<p>And let&#8217;s not forget that it is also a coup for Nuance Communications, which provides the voice recognition resources for Siri as well as for the &#8220;native&#8221; Voice Command apps on the iPhone. Congratulations to all.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/04/28/if-true-apples-purchase-of-siri-heralds-new-age-of-virtual-assistance/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>As Promised: Promptu Debuts on Android</title>
		<link>http://opusresearch.net/wordpress/2010/04/26/as-promised-promptu-debuts-on-android/</link>
		<comments>http://opusresearch.net/wordpress/2010/04/26/as-promised-promptu-debuts-on-android/#comments</comments>
		<pubDate>Mon, 26 Apr 2010 20:21:32 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Android]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[Promptu]]></category>
		<category><![CDATA[Recombinant Communications]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=2759</guid>
		<description><![CDATA[Today, according to several "unofficial tallies", the Android marketplace has exceeded 50,000 applications. According to this official announcement from mobile speech specialist Promptu, its flagship product is among them.]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2010/03/android.jpg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2010/03/android.jpg" alt="" title="android" width="108" height="108" class="alignright size-full wp-image-2575" /></a>Today, according to several &#8220;unofficial tallies&#8221;, the Android marketplace has exceeded 50,000 applications. According to <a href="http://www.prnewswire.com/news-releases/first-speech-to-text-messaging-app-for-android-91902209.html">this official announcement from mobile speech specialist Promptu</a>, its flagship product is among them. Promptu has been enjoying some success around the globe as a provider of network-based speech recognition to support transcription, text messaging and email messaging. Its core product debuted on the iPhone last December, as I reported <a href="http://opusresearch.net/wordpress/2009/12/17/promptus-shoutout-for-iphone-launches-puts-a-price-on-voice-to-sms-services/">here</a>. </p>
<p>In its press release, the company claims to be the first to offer &#8220;voice addressing&#8221; for text messages. The company claims high levels of accuracy because it is the only application to be &#8220;fully integrated with the Android SMS client and uses Smartword, Promptu&#8217;s editing tool that lets users see and select from likely word alternatives.&#8221; The application debuts today (April 26) for the Motorola Droid with features taht include: voice addressing, &#8220;full integration with the phone&#8217;s messaging service (including message exchanges threaded by recipient), thumbnail images displayed for contacts and &#8220;one-touch navigation to all recent incoming and outgoing messages.</p>
<p>As reported here, Promptu&#8217;s revenue model will test the value that users attach to transcription services. If it remains true to form it market bundles of &#8220;voice credits&#8221; in increments of 50 for $1.99; 150 for $3.99 or 250 for $4.99, as it does for iPhone users. This is in contrast to its closest functional rivals Vlingo (which has charged for a $17.99 for its application on the Blackberry) and Nuance Dragon Dictate, which has not yet announced a revenue model. Vlingo, by the way, has a &#8220;beta&#8221; of its application for the Android, which has been <a href="http://blog.vlingo.com/2010/03/android-users-free-vlingo-beta-now.html">available since mid-March</a>.</p>
<p>Along with Google, Microsoft and (to some extent) AT&#038;T, these are the initiatives that are shaping the way that spoken words will be integrated into the mobile messaging flow.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/04/26/as-promised-promptu-debuts-on-android/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Speechable Moments on the iPad? You Betcha. But Finding Them is a Challenge.</title>
		<link>http://opusresearch.net/wordpress/2010/04/02/speechable-moments-on-the-ipad-you-betcha-but-finding-them-is-a-challenge/</link>
		<comments>http://opusresearch.net/wordpress/2010/04/02/speechable-moments-on-the-ipad-you-betcha-but-finding-them-is-a-challenge/#comments</comments>
		<pubDate>Fri, 02 Apr 2010 17:50:37 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Apple]]></category>
		<category><![CDATA[iPad]]></category>
		<category><![CDATA[mobile speech]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=2651</guid>
		<description><![CDATA[For the past few weeks, I've been taking stock of Apple's carefully orchestrated, gradual disclosure of the iPad's pricing, capabilities and accessories.]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2010/04/ipad-thumb-300x296-1495.jpg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2010/04/ipad-thumb-300x296-1495-150x150.jpg" alt="" title="ipad-thumb-300x296-1495" width="150" height="150" class="alignright size-thumbnail wp-image-2652" /></a>For the past few weeks, I&#8217;ve been taking stock of Apple&#8217;s carefully orchestrated, gradual disclosure of the iPad&#8217;s <a href="http://store.apple.com/us/browse/home/shop_ipad/family/ipad?mco=OTY2ODA0NQ">pricing</a>, capabilities and <a href="http://www.apple.com/ipad/accessories/">accessories</a> on the Apple Web Site; the device&#8217;s native applications and use cases (as illustrated in this <a href="http://www.apple.com/ipad/guided-tours/">video tour</a>); as well as 3rd party apps available through the <a href="http://www.apple.com/ipad/app-store/">iTunes Store</a>.</p>
<p>Heck, Apple features <a href="http://www.apple.com/ipad/sdk/">access to its Software Developers Kit</a> as prominently as its &#8220;How to Order&#8221; link. So it is no wonder that the cottage industry that is &#8220;user generated content&#8221; will be bigger for the iPad than it is for the iPhone (well, at least that&#8217;s my prediction).</p>
<p>But can it talk? Thus far, it has been hard to discover the full spectrum of applications that run on the iPad. Apple claims there are already 150,000 iPad Apps. Yet search for &#8220;iPad&#8221; in the iTunes App store delivers a scant 2 pages. But wait! A search of the iTunes store for &#8220;iPad apps&#8221; that feature &#8220;Speak&#8221; yields a number of applications that don&#8217;t appear when the &#8220;Select All&#8221; option is invoked. These include: </p>
<p>Speak4It from AT&#038;T Interactive R&#038;D (listed under &#8220;Lifestyles&#8221;)<br />
Speak It! Text to Speech (listed under &#8220;Utilities&#8221;)<br />
iSpeech for the iPad (listed under &#8220;Education&#8221;)</p>
<p>Thanks to this <a href="http://gizmodo.com/5421216/dragon-dictation-worlds-most-popular-dictation-software-comes-to-app-store">article in Gizmodo</a>, I was clued into the fact that a version of the Dragon Dictation app that I have on my iPhone has been made available for the iPad. The availability was confirmed in <a href="http://www.nuance.com/news/pressreleases/2010/20100402_iPad.asp">this press release</a> from Nuance, which contains a link (which opens iTunes and delivers you to the iTunes Store where the app is tagged as &#8220;Business&#8221;. </p>
<p>Clearly, &#8220;speech enabling&#8221; apps defy categorization. I dare say that, before I shell out $69+tax for an iPad keyboard/dock or wireless keyboard, I will try my hand at dictating email using the free version of Dragon Dictation. </p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/04/02/speechable-moments-on-the-ipad-you-betcha-but-finding-them-is-a-challenge/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Vlingo Adds Hands-Free SafeReader to its Blackberry Apps</title>
		<link>http://opusresearch.net/wordpress/2010/03/24/vlingo-adds-hands-free-safereader-to-its-blackberry-apps/</link>
		<comments>http://opusresearch.net/wordpress/2010/03/24/vlingo-adds-hands-free-safereader-to-its-blackberry-apps/#comments</comments>
		<pubDate>Wed, 24 Mar 2010 19:11:33 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[RIM]]></category>
		<category><![CDATA[Vlingo]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=2596</guid>
		<description><![CDATA[One of the primary objectives of mobile speech apps makers ought to be making the user experience as eyes-forward-and-hands-free as possible. ]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2009/09/vlingo_logo.png"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/09/vlingo_logo.png" alt="" title="vlingo_logo" width="140" height="52" class="alignright size-full wp-image-1371" /></a>One of the primary objectives of mobile speech apps makers ought to be making the user experience as eyes-forward-and-hands-free as possible. While I haven&#8217;t had a chance to use it, I was very pleased to see <a href="http://multivu.prnewswire.com/mnr/vlingo/43086/">the announcement of a new service from Vlingo called SafeReader</a>. In essence, version 4.5 of the Vlingo app for Blackberry will recognize the spoken instruction &#8220;SafeReader On!&#8221; (exclamation point optional) and will respond with spoken renderings of incoming text (SMS) or email messages.</p>
<p>The Vlingo application on iPhones, as well as Blackberries, already support spoken origination of SMS, email and status updates for Facebook or Twitter. In my experience, it has been very good at distinguishing between commands (such as &#8220;Email Ted Stevens&#8230;&#8221; or &#8220;make an appointment&#8221;) and content, thus minimizing the need to do a lot of screen poking. In its marketing literature, Vlingo calls &#8220;the reverse functionality&#8221;, and sees it as a means to accomplishing the tasks that involve rendering written words as speech. </p>
<p>Vlingo&#8217;s Dave Grannan has long made it clear that the company doesn&#8217;t endorse the use of mobile devices while driving. Therefore they consider this a safe, hands-free way to retrieve &#8220;must-have&#8221; messages. </p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/03/24/vlingo-adds-hands-free-safereader-to-its-blackberry-apps/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
