<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Opus Research &#187; mobile speech</title>
	<atom:link href="http://opusresearch.net/wordpress/tag/mobile-speech/feed/" rel="self" type="application/rss+xml" />
	<link>http://opusresearch.net/wordpress</link>
	<description>Analysis and Expertise on Voice Services and Conversational Commerce</description>
	<lastBuildDate>Wed, 23 May 2012 17:29:43 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Automated Speech Improves Personal Productivity</title>
		<link>http://opusresearch.net/wordpress/2012/05/23/automated-speech-to-improve-personal-productivity/</link>
		<comments>http://opusresearch.net/wordpress/2012/05/23/automated-speech-to-improve-personal-productivity/#comments</comments>
		<pubDate>Wed, 23 May 2012 17:28:30 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[automated dictation]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[Nuance]]></category>
		<category><![CDATA[speech processing]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=5362</guid>
		<description><![CDATA[Voice-based productivity is attracting attention both on the investment and product introduction front. ]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2012/05/bighandlogo.png"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2012/05/bighandlogo-150x43.png" alt="" title="bighandlogo" width="150" height="43" class="alignright size-thumbnail wp-image-5367" /></a>Voice-based productivity is attracting attention both on the investment and product introduction front. Regarding the former, leveraged buyout specialist Bridgepoint Development Capital has <a href="http://www.virtual-strategy.com/2012/05/22/bridgepoint-development-capital-acquires-international-voice-productivity-software-group-">completed its acquisition of BigHand </a>in a transaction valued at 49 British Pounds Sterling (roughly $77 million U.S.). On the latter (more mobile front), Nuance Communications introduced Dragon Drive, part of the &#8220;Dragon Dictate&#8221; genome which, like Ford Sync, is designed to support voice command of the devices and communications resources that people use inside their cars.</p>
<p>Let&#8217;s start with the BigHand LBO: Bighand is a 16 year-old company that specializes in workflow management centered around the automated transcription and distribution of dictated documents files. It has been popular with law firms in the UK, North America, Netherlands and Australia. It has also had success in healthcare and other professional services organizations in the UK. It serves over 150,000 end-users in about 1,450 business organizations. With offices in Chicago, London, Sydney and Toronto, it plans to use the cash infusion from Bridgepoint to increase staff, grow its presence in the U.S. and to expand its Software as a Service (SaaS) delivery mechanisms targeting smaller businesses.</p>
<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2009/04/nuance_logo.jpg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/04/nuance_logo.jpg" alt="" title="nuance_logo" width="117" height="75" class="alignright size-full wp-image-356" /></a>On the automotive front, Nuance&#8217;s introduction of <a href="http://www.nuance.com/products/dragon-drive/index.htm">Dragon Drive! </a>, which offers its users a way to take command of the systems and services available in a &#8220;connected car.&#8221; Productivity in the car starts with local search; giving users the ability to say something like, &#8220;What good Italian restaurant is near me?&#8221;</p>
<p>Next up is control of music selection from a car&#8217;s storage, connected media player, or from the mobile network like Spotify. Drivers or passengers can also dictate messages for delivery via SMS/text or email and, finally, they can retrieve weather and traffic information in real-time by making &#8220;natural language requests&#8221; like &#8220;what&#8217;s the traffic like on Route 80 near New York City.&#8221;</p>
<p>Bottom Line: The instances of &#8220;Speechable Moments&#8221; are growing geometrically. This is the product of proliferation of automated speech processing (ASP) resources on devices and in automobiles, coupled with links to &#8220;workflow management&#8221; tools, natural language processing and databases &#8220;in the cloud.&#8221; As a result, individuals are becoming more comfortable and more skilled at using their voices to step up their personal productivity. This is destined to become self-reinforcing. Though a new entrant is often positioned as a &#8220;Siri-killer,&#8221; &#8220;the next Siri,&#8221; or &#8220;Siri for the enterprise,&#8221; each should be judged according to its ability to make things easier for a user, not its ability to replace Apple&#8217;s mobile assistant.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2012/05/23/automated-speech-to-improve-personal-productivity/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Bing 411&#8217;s Three-Year Run Ends June 1</title>
		<link>http://opusresearch.net/wordpress/2012/05/08/bing-411s-three-year-run-ends-june-1/</link>
		<comments>http://opusresearch.net/wordpress/2012/05/08/bing-411s-three-year-run-ends-june-1/#comments</comments>
		<pubDate>Tue, 08 May 2012 23:22:13 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Directory Assistance]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[mobile commerce]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[Voice Portals]]></category>
		<category><![CDATA[voice search]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=5318</guid>
		<description><![CDATA[With very little fanfare, Microsoft is ready to deep six Bing 411, the free voice search and directory assistance service that grew out of its acquisition of Tellme Networks (which took place in may 2007).]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2012/05/bing411.jpg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2012/05/bing411-150x96.jpg" alt="" title="bing411" width="150" height="96" class="alignright size-thumbnail wp-image-5321" /></a>With very little fanfare, Microsoft is ready to deep six Bing 411, the free voice search and directory assistance service that grew out of its acquisition of Tellme Networks (which took place in may 2007). Tellme&#8217;s 1-800-555-TELL (also slated for a June 1 cut-off), is the prototype for a conversational voice assistant. The toll-free number, which launched in 2001, was referred to as a &#8220;Voice Portal&#8221; attracted millions of calls for the evergreen pay-per-call topics: sports scores, weather, news headlines and the like. </p>
<p>By 2006, Tellme had graduated to become the platform for automated, &#8220;paid&#8221; DA services from AT&#038;T Wireless (first Cingular) and Verizon landlines. Carriers were able to charge $2.00 or more per call (which phone companies continue to do today). At the time, Tellme co-founder Mike McCue confessed to Business Week reporter Steve Hamm that &#8220;only about a million people&#8221; were using the free voice portal service.</p>
<p>When Microsoft bought Tellme, roughly a year later, execs in Redmond saw Tellme&#8217;s voice portal and DA platform as a way to take on Google in the local, mobile search marketplace. By the time the deal had consummated, Google countered by launching its own &#8220;free,&#8221; speech-enabled DA service, Goog411 and joined Jingle Networks and a small handful of other companies trying to coax callers into using their phones to conduct local searches and carry out business. </p>
<p>The idea of using your voice to transform the phone into a mobile assistant was on target, but very premature. After investing in both technology and market conditioning, Jingle Networks was purchased by Marchex in April 2011 and is now integrated in its mobile voice search product line. <a href="http://opusresearch.net/wordpress/2010/10/08/goog411-has-served-its-purpose/">Google shuttered Goog411</a> in October 2010. At the time it noted that smartphone users could conduct voice search using Google&#8217;s speech recognition in conjunction with Google Maps. </p>
<p>On June 1, Microsoft will follow in Google&#8217;s footsteps (or voiceprints?) by closing its hosted voice search platforms and directing people to &#8220;visit www.bing.com on [your] PC or mobile device.&#8221; There&#8217;s no mention of a speech-enabled option &#8211; just the end of Tellme-powered voice search as we know it. By contrast, when Google closed GOOG411, it let it be known that the service had helped it build a corpus of utterances to make its voice-search more accurate, and the microphone icon is a ubiquitous feature on all Android-based devices. </p>
<p>In the 20 months since Google closed GOOG411, Microsoft has busily gone about recasting Tellme from the innovative provider of Dialtone 2.0 to a cast-off that is now part of multi-channel customer care specialist 24[7] Inc. During that time, the mobile assets that were exemplified by Bing411 and the Tellme Voice Portal were largely neglected and, presently, will cease operating altogether.</p>
<p>Six years ago, we believed that wireless DA was a viable, and sure to be profitable, vehicle for mobile business search culminating in commerce. Today, it is more likely that mobile assistants, like Apple&#8217;s Siri, are more likely to take on that role. Siri is destined to have company in the very near future as solutions providers emerge from the ranks of knowledge management, artificial intelligence, speech analytics and other closely related disciplines. Indeed personal assistants, front-ending the Internet&#8217;s vast array of information, navigation and communications resources are successfully supplanting directory assistance, front-ending little more than an enhanced phone directory, as the go-to resource to support of local commerce.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2012/05/08/bing-411s-three-year-run-ends-june-1/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Sensory Adds Speaker ID to Wake-up Words</title>
		<link>http://opusresearch.net/wordpress/2012/05/02/sensory-adds-speaker-id-to-wake-up-words/</link>
		<comments>http://opusresearch.net/wordpress/2012/05/02/sensory-adds-speaker-id-to-wake-up-words/#comments</comments>
		<pubDate>Wed, 02 May 2012 17:43:02 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[personalization]]></category>
		<category><![CDATA[Sensory Inc.]]></category>
		<category><![CDATA[speaker authentication]]></category>
		<category><![CDATA[Voice biometrics]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=5311</guid>
		<description><![CDATA[Sensory, Inc., takes embedded speech processing another step forward by adding voice biometric-based speaker identification to its TrulyHandsfree(TM) voice activation technology. ]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2010/10/sensorylogo.gif"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2010/10/sensorylogo.gif" alt="" title="sensorylogo" width="144" height="77" class="alignright size-full wp-image-3632" /></a>Sensory, Inc., takes embedded speech processing another step forward by adding voice biometric-based speaker identification to its TrulyHandsfree(TM) voice activation technology. As a result, a wide variety of home and mobile electronic devices can be trained to &#8220;wake up&#8221; and respond to a specific person&#8217;s voice. For mobile professionals means higher levels of security when activating smartphones, laptops, desktops and tablets. There are equally important implications for owners and users of electronic devices in the home. As Sensory&#8217;s CEO Todd Mozer points out in this <a href="http://www.marketwire.com/printer_friendly?id=1651775">press release</a>, the new software will make it possible to train a digital video player (or set-top box) to recognize the voice a specific viewers in order to recommend appropriate &#8220;favorites,&#8221; or block unsuitable programs.</p>
<p><a href="http://opusresearch.net/wordpress/2011/07/28/sensorys-trulyhandsfree-voice-control-2-0-portends-new-apps-and-possibilities/">As we&#8217;ve noted before</a>, Sensory, Inc., has long been devoted to supporting  TrulyHandsfree(TM) voice control of devices and media, especially in the automotive setting, where wake up words need to be distinguished from road noise or other random sounds in the background. Voice activation has been available on the Samsung Galaxy S2 for roughly a year now. In order to reduce the power consumption required to have the device constantly listening for its wake-up words, Sensory has made sure that the technology for the trigger words are &#8220;deeply embedded&#8221; in the devices hardware. </p>
<p>According to Mozer, the company &#8220;always had speaker verification on our chips.&#8221; However, to support the new offer, the company switched to a new HMM (hidden Markov model) based biometric engine in order to provide higher accuracy and lower false acceptance or false rejection rates. Sensory is demonstrating an alpha version of the system running on an Android device and using the passphrase &#8220;Hello Blue Genie&#8221;. Mozer trained the smartphone by repeating the phrase three times. Then he was able to unlock and activate the phone by saying the phrase, whereupon it said &#8220;Greetings Todd Mozer.&#8221;  When i said the phrase after he had trained it, the phone woke up, but displayed the word &#8220;Rejected,&#8221; and I was unable to activate any of its features or services. </p>
<p>A beta version of the voice activation with speaker identification technology will be available under license from Sensory in the coming week. As the press release explains, details are available from info@sensoryinc.com</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2012/05/02/sensory-adds-speaker-id-to-wake-up-words/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Siri Doesn&#8217;t Suck; And It Will Only Get Better</title>
		<link>http://opusresearch.net/wordpress/2012/04/20/siri-doesnt-suck-and-it-will-only-get-better/</link>
		<comments>http://opusresearch.net/wordpress/2012/04/20/siri-doesnt-suck-and-it-will-only-get-better/#comments</comments>
		<pubDate>Fri, 20 Apr 2012 21:41:10 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Apple]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[Siri]]></category>
		<category><![CDATA[Virtual Assistants]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=5292</guid>
		<description><![CDATA[Contrary to what Henry Blodget says, Siri is succeeding.]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2010/04/Siri_logo.jpg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2010/04/Siri_logo.jpg" alt="" title="Siri_logo" width="144" height="99" class="alignright size-full wp-image-2773" /></a>Today in <a href="http://www.businessinsider.com/apple-siri-problems-2012-4">this post</a> on the Silicon Insider blog, Henry Blodget asserts, in the royal plural, that &#8220;we think it is highly unlikely that Apple would still be flogging Siri if Steve Jobs were still in charge of the company.&#8221; Blodget uses Apple&#8217;s handling of Apple&#8217;s cloud-based Mobile Me, which launched a couple of years ago, and totally confounded many long-time Apple fans, as a better path. He asserts that Jobs quickly apologized to the public (funny I don&#8217;t remember getting the note) and fired a few people and &#8211; lo and behold &#8211; today we have iCloud, as its replacement.</p>
<p>I certainly hope this doesn&#8217;t start a rash of &#8220;if Steve were alive&#8230;&#8221; or &#8220;WWSD&#8221; comments, though it&#8217;s already too late for that. More to the point, iCloud, and its derivatives like iTunes Music Match and other forms document or info sharing, still has major issues when it comes to usability and transparency. In my experience, it is erratic in its ability to update calendars, contact information and other shared info. It has also been incredibly opaque when it comes to set up and initiation (through settings on each device and in iTunes), as well as its inability to manage memory on my iPhone. As I blissfully thought I could now listen to music from my library &#8211; whether it is located on the device or &#8220;in iCloud&#8221; &#8211; I learned too late that every song with a cloud icon had to be downloaded to my device in order to be played. An error message that rivals Microsoft&#8217;s &#8220;Blue Screen of Death&#8221; when I sync my iPhone and a stern warning that I have exceeded memory limits on my device. </p>
<p>That sucks! And it&#8217;s largely because iCloud, unlike Siri, does not do a very good job of balancing between resources in the cloud and resources on the device. If you unwittingly ask it to download more content than your device can hold, it tries to do so, until it fails. </p>
<p>What&#8217;s more, iCloud, unlike Siri, doesn&#8217;t learn. In today&#8217;s wireless world the smartphone+cloud paradigm is taking hold. Both iCloud and Siri take advantage of that fact. But Siri does it much more intelligently, and transparently. Siri is designed to &#8220;understand&#8221; the phone&#8217;s owner. That is vital for both task completion and personalization. It is dynamic and improves overtime as the owner provides more data and spoken input.</p>
<p>iCloud may be the progeny of Mobile Me, but to many Apple owners, it&#8217;s just opaque &#8211; a sticker on the side of the iPhone or iPad package or a Web page that equates it to &#8220;Find My iPhone&#8221; or &#8220;Find My Friends.&#8221; I know that Apple has long-range plans to make iCloud the repository for all the data I originate from my iOS devices. But Apple has a long way to go in educating the public to change its behavior. &#8220;Imagine&#8230; a world without &#8216;File/SaveAs&#8217; menus.&#8221; That&#8217;s gonna be a tough one to sell.</p>
<p>Meanwhile, just seeing Siri refer to Samuel L. Jackson as &#8220;Sam,&#8221; got me to train my phone to call me &#8220;Dan&#8221; (hey, its better than &#8220;Rock God&#8221;). I think the commercials are incredibly effective, even if &#8220;your mileage may vary&#8221; when it comes to every instance of using Siri. I&#8217;ve also noticed that people who are not of Sam or Zooey&#8217;s generation (meaning younger) are very comfortable dictating text messages or emails, looking for local eateries and other usecases that Apple has already anticipated. Success breeds more use. More use leads to more accuracy. More accuracy leads to more success. Ergo: &#8220;Success Breeds Success.&#8221; Now ask &#8220;What Would Steve Do?&#8221;</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2012/04/20/siri-doesnt-suck-and-it-will-only-get-better/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Ford to World: &#8220;Bring It In!&#8221;</title>
		<link>http://opusresearch.net/wordpress/2012/04/18/ford-to-world-bring-it-in/</link>
		<comments>http://opusresearch.net/wordpress/2012/04/18/ford-to-world-bring-it-in/#comments</comments>
		<pubDate>Wed, 18 Apr 2012 16:48:16 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Ford]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[Sync]]></category>
		<category><![CDATA[voice command]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=5286</guid>
		<description><![CDATA[John Ellis, Ford's Global Technologist for Connected Services and Solution, summed it up nicely by saying "Before there was Siri, there was SYNC."]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2012/04/Fordbadge.jpg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2012/04/Fordbadge-150x101.jpg" alt="" title="Fordbadge" width="150" height="101" class="alignright size-thumbnail wp-image-5287" /></a>As I noted a couple of years ago <a href="http://opusresearch.net/wordpress/2010/01/07/ford-turns-cars-into-open-platforms-for-recombinant-mobile-speech-apps/">here</a>, Ford Motor Company has long taken a very &#8220;open&#8221; approach to promoting highly-personalized, voice-enabled applications in their automobiles. John Ellis, the auto maker&#8217;s Global Technologist for Connected Services and Solution, summed it up nicely by saying &#8220;Before there was Siri, there was SYNC,&#8221; referring to the &#8220;hands-free,&#8221; voice-activated on-board command-and-control system that Ford first introduced in 2007. </p>
<p>Ellis has been instrumental in developing and promoting Ford&#8217;s efforts to leverage both SYNC and its close sibling, SYNC Applink, as the mechanisms for individual drivers to use their favorite smartphone apps &#8211; hands-free and eyes-forward &#8211; through SYNC. Today, the list of featured apps includes Pandora, Stitcher, IHeartRadio, NPR News; but, through the Applink Developer Program, Ellis expects the list to continue to grow, thanks to development and marketing efforts by Ford and its partners. </p>
<p>Ford maintains its business model, which is to &#8220;sell more cars.&#8221; It is using SYNC as a differentiator for global initiatives to accomplish its sales goals by playing up how people can use their voice to personalize their in-car experience. In addition to encouraging more mobile app developers to make their wares compatible with SYNC, it has already made voice command work seamlessly in nine languages with emergency assistance in 34 languages.</p>
<p>The other initiatives that are of great importance is to strike a balance between the capabilities that are &#8220;embedded&#8221; in the car and resources that are in the cloud so that the services meet every driver&#8217;s requirement for reliability &#8211; regardless of location, velocity of travel or other circumstances. Whether it&#8217;s &#8220;brought-in&#8221; or &#8220;beamed in&#8221; it has to work every time.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2012/04/18/ford-to-world-bring-it-in/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Voxeo LBS Adds Location-Awareness to its Prophecy IVR Applications Arsenal</title>
		<link>http://opusresearch.net/wordpress/2012/03/27/voxeo-lbs-adds-location-awareness-to-its-prophesy-ivr-applications-arsenal/</link>
		<comments>http://opusresearch.net/wordpress/2012/03/27/voxeo-lbs-adds-location-awareness-to-its-prophesy-ivr-applications-arsenal/#comments</comments>
		<pubDate>Tue, 27 Mar 2012 17:32:17 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[hosted IVR]]></category>
		<category><![CDATA[Location Based Services]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[Voxeo]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=5241</guid>
		<description><![CDATA[Voxeo has introduced a new service that enables IVR applications to be "location aware."]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2009/08/logo_voxeo.gif"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/08/logo_voxeo.gif" alt="" title="logo_voxeo" width="74" height="80" class="alignright size-full wp-image-1281" /></a>Voxeo has introduced a new service that enables IVR applications to be &#8220;location aware.&#8221; Without the need to download an app, the new service enables location intelligence to be added to existing IVR, SMS or outbound notification services running on Voxeo&#8217;s Prophecy platform. </p>
<p>There are already many use cases for location aware mobile. The technology grew out of a decade of E911 initiatives, which sought to pinpoint the origin of mobile calls to police or firefighters. Today, mobile subscribers (and mobile app developers) are well-aware of the power of local intelligence and can bake location awareness into 511 calls for traffic conditions or spoken versions of services that &#8220;find-the-nearest&#8221; ATM, gas station, restaurant or restroom. There are also obvious uses for businesses managing mobile workers or service fleets.</p>
<p>In this <a href="http://www.voxeo.com/about/press_reader.jsp?date=032712_lbs.jsp">announcement</a>, Voxeo notes that it will serve as a single source of location information from a number of carriers. It also notes that it is taking an approach that complies with Best Practices and Guidelines for Location-Based Services provided by the Cellular Telephone and Internet Association (CTIA). That means that end-users must opt-in or &#8220;subscribe&#8221; to the service. Among Voxeo&#8217;s service delivery options are &#8220;single or double opt-in&#8221; as well as SMS based confirmations.</p>
<p>More detailed information and support, including a description of the API is available <a href="http://pages.voxeo.com/location-intelligence/">here</a>, including a phone number that demonstrates how the call-flow will work for subscription to LBS and retrieval of weather information. Developers will be pleased to find access to an API that is &#8220;fully compliant with industry standards as an extension of the Location RESTful API from GSMA’s OneAPI.&#8221; It is documented http://help.voxeo.com/helpserver/go/evolution/LBS.  </p>
<p>To encourage use by developers, Voxeo offers 1,000 free geolocation look-ups through the API as well as access to its developer support resources.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2012/03/27/voxeo-lbs-adds-location-awareness-to-its-prophesy-ivr-applications-arsenal/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Misunderestimating Siri</title>
		<link>http://opusresearch.net/wordpress/2012/03/02/misunderestimating-siri/</link>
		<comments>http://opusresearch.net/wordpress/2012/03/02/misunderestimating-siri/#comments</comments>
		<pubDate>Sat, 03 Mar 2012 01:01:43 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[AI]]></category>
		<category><![CDATA[Apple iPhone]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[Nuance]]></category>
		<category><![CDATA[Siri]]></category>
		<category><![CDATA[Speech Actions for Android]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=5196</guid>
		<description><![CDATA[Siri has become the benchmark against which competing mobile speech applications wish to be measured.]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2010/04/Siri_logo.jpg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2010/04/Siri_logo.jpg" alt="" title="Siri_logo" width="144" height="99" class="alignright size-full wp-image-2773" /></a>Several news items from the Mobile World Congress make it clear that Siri &#8211; the native, speech-enabled mobile assistant running on the iPhone 4S &#8211; is the benchmark against which competing mobile speech applications wish to be measured. It&#8217;s very good news for Apple (a company whose <a href="http://www.tgdaily.com/business-and-law-brief/61796-how-many-countries-are-worth-less-than-apple">$500+ billion market cap</a> exceeds the GDPs of Poland, Belgium, Sweden, Saudi Arabia, and Taiwan. </p>
<p>It started with a series of video press releases from Google&#8217;s soon-to-be device manufacturing subsidiary, Motorola Mobility. Under the headline, <a href="http://www.eweek.com/c/a/Mobile-and-Wireless/Google-Voice-Actions-Knocks-Out-Siri-on-Motorola-Android-Phones-794204/">&#8220;Google Voice Actions Knocks Out Siri on Motorola Android Phones&#8221;</a>, readers could see how Google Voice Actions on several models of Moto&#8217;s wireless phones carried out dictation, getting directions, posting a reminder and Web navigation tasks faster than the iPhone 4S. </p>
<p>One of the &#8220;knockouts&#8221; is rendered below:</p>
<p><iframe width="560" height="315" src="http://www.youtube.com/embed/FBYMDs_RFeI" frameborder="0" allowfullscreen></iframe></p>
<p><a href="http://opusresearch.net/wordpress/2010/08/16/voice-actions-for-android-speechable-moments-from-google-spell-new-market-dynamics/">I first wrote about Voice Actions for Android in August 2010</a> noting that &#8220;When we look back on the summer of 2010, the launch of Voice Actions for Android will be seen as a signal event. It goes a long way toward re-establishing the spoken word as the natural input for a phone (duh!).&#8221; That may have been true and it certainly rattled the chains of Microsoft (which spun off Tellme, but kept much of the IP that will power voice and gesture controlled devices) and Nuance (which, in addition to carrying out the automated speech processing for Siri, is in the process of acquiring Vlingo and continues to refine Dragon Go! and a number of initiatives that define a more Natural User Interface). </p>
<p>What the boxing match between Voice Actions for Android and Siri fails to capture is how real people actually use Siri. <a href="http://sirinotes.wordpress.com/2012/02/12/how-i-am-using-and-not-using-siri/">This post</a> provides details about one person&#8217;s daily experience. While the notorious &#8220;sample of one&#8221; is indicative of only one subscriber&#8217;s preferences, it is fascinating to see how usage starts in the morning with a query about the weather and crescendos into classic Q&#038;A exercises, giving a person a verbal way to ask someone a question before it recedes in memory. That&#8217;s the real-life use case (mostly because the idea of reciting text messages aloud fundamentally misunderstands the purpose of &#8220;passing electronic notes&#8221; in private and asking for directions aloud when you still have to touch the screen and look at a map defeats the purpose of &#8220;hands-free/eyes forward&#8221; navigation.)</p>
<p>But I have to say that <a href="http://www.prweb.com/releases/2012/2/prweb9228289.htm">this announcement</a>, coming out of MWC, caught my imagination. A joint venture between embedded ASR specialist Novauris and <a href="http://www.existor.com/company-overview">Existor Ltd</a>, a technology firm that applies artificial intelligence to the challenge of creating &#8220;conversational,&#8221; spoken interfaces. The two companies plan &#8220;to combine their expertise in artificial intelligence (AI), natural language processing and automatic speech recognition (ASR) to create mobile and wireless applications that allow people to interact conversationally – not just with smartphones and computers, but also with cars, refrigerators, TVs, toys or any device equipped with a suitable embedded processor.&#8221;</p>
<p>The ambition to make consumer electronic devices, appliances and automobiles more conversational is very compelling. Nonetheless, I believe the most interesting aspect of the Novauris/Existor output is the potential to remain conversational even in a disconnected state. Many Siri users have told me that their mobile assistant will duck a question now and then for no apparent reason. They speculate that access to the server &#8220;in the cloud&#8221; may have timed out. At that point Siri will ask for the user to try later or default to a Web search. In their announcement the two technology companies claim that their first output will be a series of speech-enabeled &#8220;chatbot apps,&#8221; called CleverApps which will run on smartphones without any need for access to &#8220;the cloud&#8221; over the data link. </p>
<p>In addition to solving the problem of getting an answer in a disconnected state, the new venture&#8217;s apps have the potential to reduce data traffic at a time when some wireless carriers are &#8220;throttling back&#8221; the speeds for heavy users and getting rid of &#8220;unlimited data plans.&#8221; With product announcements and vendor contests coming fast-and-furious, the ripple effect of Apple&#8217;s Siri launch is gathering momentum.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2012/03/02/misunderestimating-siri/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Novauris and SingTel Offer Mobile Voice Search in Singapore</title>
		<link>http://opusresearch.net/wordpress/2011/11/11/novauris-and-singtel-offer-mobile-voice-search-in-singapore/</link>
		<comments>http://opusresearch.net/wordpress/2011/11/11/novauris-and-singtel-offer-mobile-voice-search-in-singapore/#comments</comments>
		<pubDate>Fri, 11 Nov 2011 19:46:20 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[localization]]></category>
		<category><![CDATA[mobile commerce]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[Novauris]]></category>
		<category><![CDATA[voice search]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=4934</guid>
		<description><![CDATA[SingTel and Novauris launched a version of voice search that is tailored specifically for "the unique style of English" that is spoken on the nation-island of Singapore. ]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2011/11/singtel-logo1.jpg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2011/11/singtel-logo1.jpg" alt="" title="singtel-logo" width="144" height="57" class="alignright size-full wp-image-4937" /></a>Every press release or news item about speech-enabled mobile services now carries the sobriquet &#8220;Siri-like.&#8221; Its use is so frequent these days that it is highly likely that it will be added to the Oxford English Dictionary in the coming year. I mean this year the editors accepted the contemporary definitions of &#8220;OMG&#8221; and &#8220;LOL.&#8221; &#8217;nuff said.</p>
<p>The reason that the term &#8220;Siri-like&#8221; is appearing so often is that there has been an acceleration in the introduction and marketing of new speech-enabled services. One that&#8217;s worth noting is <a href="http://www.ereleases.com/pr/novauris-singtel-bring-local-voice-search-singapore-69256">SingTel deFIND</a>, a version of voice search that is tailored specifically for &#8220;the unique style of English&#8221; that is spoken on the nation-island of Singapore. </p>
<p>SingTel turned to UK-based Novauris to build the corpus of localized utterances. The two companies have successfully introduced an application that delivers details on local shops, restaurants or retailers when a user simply says the name. Users can also search by category (e.g. Filipino restaurant) and the application will use GPS and SingTel&#8217;s &#8220;InSing&#8221; database to show what businesses are nearby.</p>
<p>In the coming months, we expect to see the introduction of speech-enabled mobile search and assistance apps to accelerate and that means there are lots more opportunities for firms that specialize in this sort of localization. </p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2011/11/11/novauris-and-singtel-offer-mobile-voice-search-in-singapore/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Nuance&#8217;s Steve Chambers to Deliver Keynote at Voice Biometrics Conference New York  April 3rd &amp; 4th, 2012</title>
		<link>http://opusresearch.net/wordpress/2011/11/09/nuances-steve-chambers-to-deliver-keynote-at-voice-biometrics-conference-new-york-april-3rd-4th-2012/</link>
		<comments>http://opusresearch.net/wordpress/2011/11/09/nuances-steve-chambers-to-deliver-keynote-at-voice-biometrics-conference-new-york-april-3rd-4th-2012/#comments</comments>
		<pubDate>Wed, 09 Nov 2011 18:21:40 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[Nuance Communications]]></category>
		<category><![CDATA[Voice biometrics]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=4913</guid>
		<description><![CDATA[We’re pleased to announce that Nuance’s President, Sales and Marketing, Steve Chambers, will deliver a keynote address during the Voice Biometrics Conference-NYC April 3-4, 2012]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2011/11/SGC2-Photo.jpg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2011/11/SGC2-Photo-150x150.jpg" alt="" title="SGC2 - Photo" width="150" height="150" class="alignleft size-thumbnail wp-image-4915" /></a>Planning for <a href="http://bit.ly/rvwX9J">Opus Research’s 2012 Voice Biometrics Conference New York on April 3rd-4th</a> is in full swing and we’re pleased to announce that Nuance’s President, Sales and Marketing, Steve Chambers, will deliver a keynote address during the event. He will share his perspectives on how speaker verification, accurate speech recognition and artificial intelligence will help enterprises to anticipate a user&#8217;s intent and establish a foundation of trusted communications. Bringing these technologies together will provide a balance of secure interactions and a positive user experiences across channels including, traditional phones, mobile devices and e-commerce Web-sites.</p>
<p><a href="http://bit.ly/uKWI9o ">Register Now &#8211; Discounted Offer Ends November 30th</a>:<br />
The first 30 people to register for Voice Biometric Conference New York can register for $299.00  That&#8217;s over half off the full conference rate of 699.00.  Use registration code, &#8216;earlyvbc&#8217;  to receive the special rate.  </p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2011/11/09/nuances-steve-chambers-to-deliver-keynote-at-voice-biometrics-conference-new-york-april-3rd-4th-2012/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Twilio Positioning SMS as a Pre-API for Siri Development Efforts</title>
		<link>http://opusresearch.net/wordpress/2011/11/03/twilio-positioning-sms-as-a-pre-api-for-siri-development-efforts/</link>
		<comments>http://opusresearch.net/wordpress/2011/11/03/twilio-positioning-sms-as-a-pre-api-for-siri-development-efforts/#comments</comments>
		<pubDate>Thu, 03 Nov 2011 18:32:40 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Apple]]></category>
		<category><![CDATA[Mobile Apps]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[Siri]]></category>
		<category><![CDATA[Twilio]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=4889</guid>
		<description><![CDATA[Twilio is encouraging developers to come up with interesting new applications for the iPhone 4S using the Twilio platform for SMS as a quasi-API.]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2011/09/twiliologo.jpg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2011/09/twiliologo.jpg" alt="" title="twiliologo" width="151" height="60" class="alignright size-full wp-image-4784" /></a>In <a href="http://www.twilio.com/contests/2011/10/siri-video-developer-contest.html">this post</a>, the peripatetic promoter of cloud-based phone hacks, Twilio, encourages developers to come up with interesting new applications for the iPhone 4S, taking advantage of speech-based assistant, <a href="http://opusresearch.net/wordpress/2011/10/04/siri-beta-assumes-primacy-on-iphone-4s-home-button/">Siri</a>. </p>
<p>Initiatives and contests like this one illustrate one more reason why Apple&#8217;s introduction of Siri is a signal event for mobile speech. The &#8220;application&#8221; (placed in quotes for reasons I will explain shortly) has its limits. In fact, it is not really an application in the traditional sense of the word. Like many of the downloadable &#8220;speech-enablers,&#8221; Siri defies categorization. There are &#8220;command and control&#8221; elements that fall in the category of &#8220;utility.&#8221; There are dictation and messaging components that make it a &#8220;communications&#8221; app. Finally, there are (or were) the links to 3rd party web sites that enabled Siri to transform the iPhone into a personal assistant. </p>
<p>The pros and cons of the speech-based mobile assistant were tossed around most recently when Google&#8217;s <a href="http://allthingsd.com/20111019/android-chief-says-your-phone-should-not-be-your-assistant/">Andy Rubin dismissed the idea at an AsiaD (an All Things D conference)</a>. The gist of his criticism was that he&#8217;d &#8220;been-there-done-that-and-it-failed,&#8221; with reference to two speech-enabled personal digital assistants. One was <a href="http://en.wikipedia.org/wiki/General_Magic">General Magic</a>, which was spun out of Apple Computer back in 1990 and had a few, high-visibility partnerships, including Sony, Motorola and AT&#038;T among others. </p>
<p>In hindsight, Rubin may see General Magic as a failure but, in fact, its engineers designed and developed a new operating system (Magic Cap) and scripting language (Telescript) that were precursors VoiceXML and efforts to create tools that support agile programming for speech-based, conversational interfaces. The technologies that started in General Magic live on in the automated speech offerings of GM OnStar. And somewhere among the intellectual property vault owned by Microsoft co-founder Paul Allen&#8217;s Vulcan Ventures are General Magic&#8217;s patents, which were bought at auction in 2002.</p>
<p>Rubin also made mention of Wildfire Communications, Inc., a company founded by Rich Miner, who is now a partner at Google Ventures. But Wildfire&#8217;s experience is quite different from General Magic. Founded in 1991, Wildfire built a very loyal following for its speech-enabled services which, at the time, were largely built around management of voice and telephony functions, like voicemail management, call origination, call answering and the like. In 2002, France Telecom&#8217;s Orange Wireless bought the company for $147 million and offered the service to its mobile constituency.</p>
<p>At the time, the service was well-received by mobile customers but, because it was originally engineered as an enterprise app, Orange realized that it would have to re-engineer the underlying technology platform in order to offer the service in sufficient scale. Instead, the telco opted to shutter the service in 2005. As <a href="http://www.theregister.co.uk/2005/07/05/orange_wildfire/">this article</a> by Tim Richardson in The Register explains, shutting down the service took longer than anticipated because of the protests of a loyal following of Wildfire users who, to this day, feel like Orange was too hasty in its decision to cease the offering.</p>
<p>Siri (as an Apple initiative) shares quite a few of the attributes of both General Magic and Wildfire that attracted the attention and imagination of developers. The big difference today is that modern technology around computing power and storage support offering the service economically at scale. In addition, even without a formal API, the creative energy of 3rd party developers can be applied to enhancing the service using tools and scripting languages that have evolved into agile environments since the days of Magic Cap and Telescript.</p>
<p>Google has reason to be dismissive of Siri because it is important to call into question its ability to provide answers to questions that used to be the sole domain of the Google Search box (and therefore a source of advertising supported revenue for Google). But it can equally be argued that Voice Search and Voice Actions on the Android platform will benefit from general acceptance of speech-enabled assistants, like Siri. We have to see whether and when Apple introduces Siri as a downloadable app that runs on other devices and how well it (re)integrates the service with popular destination sites like Yelp!, OpenTable, Fandango, etc. Today Vlingo and Nuance&#8217;s DragonGo! have an advantage when supporting mobile ecommerce.</p>
<p>Greg Sterling and I will be issuing a report on &#8220;Mobile Speech Applications and Services&#8221; in the coming month. In it we will assess current initiatives and provide our insights and perspectives on the ultimate impact on local search and conversational commerce.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2011/11/03/twilio-positioning-sms-as-a-pre-api-for-siri-development-efforts/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

