<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Opus Research &#187; Speech recognition</title>
	<atom:link href="http://opusresearch.net/wordpress/tag/speech-recognition/feed/" rel="self" type="application/rss+xml" />
	<link>http://opusresearch.net/wordpress</link>
	<description>Analysis and Expertise on Voice Services and Conversational Commerce</description>
	<lastBuildDate>Wed, 08 Feb 2012 18:55:10 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Sensory&#8217;s TrulyHandsFree Voice Control 2.0 Portends New Apps and Possibilities</title>
		<link>http://opusresearch.net/wordpress/2011/07/28/sensorys-trulyhandsfree-voice-control-2-0-portends-new-apps-and-possibilities/</link>
		<comments>http://opusresearch.net/wordpress/2011/07/28/sensorys-trulyhandsfree-voice-control-2-0-portends-new-apps-and-possibilities/#comments</comments>
		<pubDate>Thu, 28 Jul 2011 23:31:12 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[hands-free services]]></category>
		<category><![CDATA[mobile services]]></category>
		<category><![CDATA[Speech recognition]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=4713</guid>
		<description><![CDATA[With the heartfelt question, "If it's 'hands-free," why do we still have to use our fingers?" Sensory, Inc. has launched its latest product, TrulyHandsFree Voice Control 2.0, predicated on "substantial improvements" in speech detection and recognition accuracy, resulting in expanded voice interaction capabilities. ]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2011/07/SensoryLogo.gif"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2011/07/SensoryLogo.gif" alt="" title="SensoryLogo" width="151" height="41" class="alignright size-full wp-image-4712" /></a>With the heartfelt question, &#8220;If it&#8217;s &#8216;hands-free,&#8221; why do we still have to use our fingers?&#8221; Sensory, Inc. plans to launch its latest product, TrulyHandsFree Voice Control 2.0, predicated on &#8220;substantial improvements&#8221; in speech detection and recognition accuracy, resulting in expanded voice interaction capabilities. Arrival at the Android Market is slated for August 2.</p>
<p>As Todd Mozer, CEO of Sensory, explained to us, &#8220;version 1.0 was primarily a trigger,&#8221; meaning that its primary use involved detection of a wake-up word or phrase to activate a device or initiate a predefined interaction (like dictating an email, SMS, Tweet or Facebook posting; or initiating a mobile search). TrulyHandsFree Voice Control 2.0 adds &#8220;multiphrase&#8221; technology, meaning that it will be able to recognize and respond to a broader variety of words or phrases in order to &#8220;wake up&#8221; and respond accordingly.</p>
<p>The ability to distinguish spoken words from background noise is a major advancement. Rev 2.0 couples speech detection with a model for phrase recognition which enables a mobile device or toy to, in Mozer&#8217;s words, &#8220;reject the wrong words and accept the right words&#8221; in order to respond accordingly. Sensory has been working on this, more resilient &#8220;hands free&#8221; approach for a number of years. It is significantly different from Rev 1.0 of Sensory&#8217;s product as well as competing products. The additional accuracy and coincides with equally impressive advances in text-to-speech rendering and device-based speech recognition (and &#8220;understanding&#8221;). As a result, Sensory expects this approach and user acceptance to be prevalent in a two year period in cars, the kitchen and kid&#8217;s room, running on devices and products from the likes of Samsung, Mattel and others.</p>
<p>The technology is initially available on Android-based phones, devices and browsers. The result will be a dramatic change in user expectation regarding what end-users can accomplish without pushing any buttons. </p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2011/07/28/sensorys-trulyhandsfree-voice-control-2-0-portends-new-apps-and-possibilities/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Nuance and Empirix Offer &#8220;Proactive Testing&#8221; for Natural Language Customer Interactions</title>
		<link>http://opusresearch.net/wordpress/2011/04/27/nuance-and-empirix-offer-proactive-testing-for-natural-language-customer-interactions/</link>
		<comments>http://opusresearch.net/wordpress/2011/04/27/nuance-and-empirix-offer-proactive-testing-for-natural-language-customer-interactions/#comments</comments>
		<pubDate>Wed, 27 Apr 2011 17:31:55 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Empirix]]></category>
		<category><![CDATA[Natural Language]]></category>
		<category><![CDATA[Nuance]]></category>
		<category><![CDATA[self-service]]></category>
		<category><![CDATA[Speech recognition]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=4363</guid>
		<description><![CDATA[Acknowledging the importance of testing before a speech-enabled customer care system "goes live," Nuance Communications and Empirix Inc. have expanded their global partnership.]]></description>
			<content:encoded><![CDATA[<p>Acknowledging the importance of testing before a speech-enabled customer care system &#8220;goes live,&#8221; <a href="http://www.empirix.com/news/press/2011/2011-04-27.asp">Nuance Communications and Empirix Inc. have expanded their global partnership.</a> Because Empirix offers its proactive load testing as a service, Nuance can offer its customers and prospects the ability to put new speech applications through rigorous testing prior to deployment. Even though speech-enabled applications are often only part of a company&#8217;s multichannel customer care offering, this type of testing has taken on increased importance because callers expect rapid, accurate response to their issues, regardless of the channel they employ.</p>
<p>As we noted <a href="http://opusresearch.net/wordpress/2011/03/07/nuance-fast-start-program-reduces-time-and-expense-of-natural-language-implementations/">here</a>, Nuance has been dedicated to simplifying and streamlining the implementation of applications that provide a &#8220;natural language&#8221; interface for users. Pre-testing can prove to be an important component to such engagements because they will enable the company to detect where latencies or system delays may occur and correct them before they become a source of customer frustration.</p>
<p>Empirix began <a href="http://opusresearch.net/wordpress/2009/06/24/empirix-offers-hammer-testing-as-a-service/">offering its testing-as-a-service in</a> mid-2009. It formed a similar <a href="http://opusresearch.net/wordpress/2010/10/13/empirix-expands-relationship-supporting-avaya-professional-services/">partnership with Avaya and its go-to-market partners</a> in October 2010. Opus Research sees this sort of testing and ongoing monitoring as vital for promoting the highest-quality user experience across a broad spectrum of self-service and &#8220;assisted self-service&#8221; (meaning those that involve live agents) interactions.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2011/04/27/nuance-and-empirix-offer-proactive-testing-for-natural-language-customer-interactions/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>VoiceBox &#8211; With Assist from SVOX &#8211; Simplifies Spoken Destination Entry on TomTom PNDs</title>
		<link>http://opusresearch.net/wordpress/2011/04/12/voicebox-with-assist-from-svox-simplifies-spoken-destination-entry-on-tomtom-pnds/</link>
		<comments>http://opusresearch.net/wordpress/2011/04/12/voicebox-with-assist-from-svox-simplifies-spoken-destination-entry-on-tomtom-pnds/#comments</comments>
		<pubDate>Tue, 12 Apr 2011 16:34:58 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[GPS]]></category>
		<category><![CDATA[Mobile Speech Apps]]></category>
		<category><![CDATA[Speech recognition]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=4311</guid>
		<description><![CDATA[Newly released versions of TomTom personal navigation devices (meaning the GO 2435, GO 2535 and GO LIVE 2535) as well as the VIA Series will enable their owners to use "natural language" to input destination.]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2011/04/TomTom-Via-1535-GPS-boxshot-thumb-240xauto-230.jpg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2011/04/TomTom-Via-1535-GPS-boxshot-thumb-240xauto-230.jpg" alt="" title="TomTom-Via-1535-GPS-boxshot-thumb-240xauto-230" width="151" height="110" class="alignright size-full wp-image-4312" /></a>Newly released versions of TomTom personal navigation devices (meaning the GO 2435, GO 2535 and GO LIVE 2535) as well as the VIA Series will enable their owners to use &#8220;natural language&#8221; to input destination. The example provided by TomTom on its Web site is, &#8220;Take me to 55 Main Street, Anywhere, USA,&#8221; which replaces the more cumbersome, long-time method of entering a city name, street name and then number. According to this <a href="http://www.businesswire.com/news/home/20110411006936/en/TomTom-Selects-VoiceBox-Latest-Products">release</a> VoiceBox Technologies, using the embedded speech recognition engine from SVOX AG, provides user interface for locations in the United States and Canada.</p>
<p>TomTom is, by no means, the only horse in the natural language race. Nuance Communications, for instance, has VoCon &#8220;One Shot Destination Entry&#8221; baked into the nav systems in prestige nameplates, including Audi, BMW, Ford and Mercedes-Benz. In February, <a href="http://www.nuance.com/company/news-room/press-releases/NC_008248">the company began offering a pre-packaged SDK (Software Development Kit</a>) called VoCon Navigation, which it calls a &#8220;ready-to-integrate pre-packaged voice application module for embedded one-shot destination entry.&#8221;</p>
<p>As we see state troopers and local law enforcement cracking down on distracted drivers, the simplification of destination entry for navigation systems is becoming a valuable attribute.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2011/04/12/voicebox-with-assist-from-svox-simplifies-spoken-destination-entry-on-tomtom-pnds/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Speech Rec Could &#8220;Go Viral&#8221; with Android</title>
		<link>http://opusresearch.net/wordpress/2011/04/08/speech-rec-could-go-viral-with-android/</link>
		<comments>http://opusresearch.net/wordpress/2011/04/08/speech-rec-could-go-viral-with-android/#comments</comments>
		<pubDate>Fri, 08 Apr 2011 20:02:16 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[mobile applications]]></category>
		<category><![CDATA[Speech recognition]]></category>
		<category><![CDATA[Transcription]]></category>
		<category><![CDATA[translation]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=4307</guid>
		<description><![CDATA[Google, under the guidance of Mike Cohen, is going to make sure that speech recognition is one of the input options for every activity through an Android-based device or any device running a Chrome-based browser. ]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2009/08/Google_logo.png"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/08/Google_logo.png" alt="" title="Google_logo" width="160" height="74" class="alignright size-full wp-image-1265" /></a>Over at Internet2Go, Greg Sterling quotes figures from Gartner forecasting that <a href="http://www.internet2go.net/news/data-and-forecasts/gartner-says-android-will-control-50-smartphone-market-2015">50% of the smartphone market will run on the Android OS by 2015</a>. That portends some very positive prospects for speech-enabled mobile services thanks largely to single-button access to speech command on Android phones. As Android&#8217;s footprint expands, I expect to see many more articles akin to <a href="http://www.slate.com/id/2290516/">this one</a> by Slate&#8217;s technology columnist <a href="http://www.facebook.com/pages/Farhad-Manjoo/99928665447">Farhad Manjoo</a>. </p>
<p>Google, under the guidance of Mike Cohen, is going to make sure that speech recognition is one of the input options for every activity through an Android-based device or any device running a Chrome-based browser. As reported in Google&#8217;s <a href="http://chrome.blogspot.com/2011/03/talking-to-your-computer-with-html5.html">Chrome blog</a>, the beta version of Chrome 11 takes advantage of the W3C&#8217;s (World Wide Web Consortium&#8217;s) <a href="http://lists.w3.org/Archives/Public/public-xg-htmlspeech/2011Feb/att-0020/api-draft.html">&#8220;Speech Input API&#8221; </a> in order to support speech-to-text transcription which, in turn, allows for spoken navigation, dictation and command entry.</p>
<p>As I&#8217;ve noted <a href="http://opusresearch.net/wordpress/2010/02/10/googles-approach-to-real-time-translation-a-matter-of-satisficing/">in previous posts</a>, the technology works &#8220;okay.&#8221; It is not 100% accurate and, like all speech recognition resources, it can appear to discriminate against women and individuals whose voices are in registers that correspond to a small sample of collected utterances. But speech-to-text transcription is the gateway technology into a slew of speech-enabled, mobile activities and we expect it to be ubiquitous and its use to grow geometrically as new devices, applications and use cases abound.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2011/04/08/speech-rec-could-go-viral-with-android/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Buy.com&#8217;s iPhone App Combines Speech Rec and Barcode Reading</title>
		<link>http://opusresearch.net/wordpress/2010/12/21/buy-coms-iphone-app-combines-speech-rec-and-barcode-reading/</link>
		<comments>http://opusresearch.net/wordpress/2010/12/21/buy-coms-iphone-app-combines-speech-rec-and-barcode-reading/#comments</comments>
		<pubDate>Tue, 21 Dec 2010 19:12:53 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Barcode Reading]]></category>
		<category><![CDATA[mobile commerce]]></category>
		<category><![CDATA[Mobile Speech Apps]]></category>
		<category><![CDATA[Speech recognition]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=3900</guid>
		<description><![CDATA[Internet retailer and bargain finder Buy.com reports that over 20,000 people have downloaded the latest version of its iPhone app, since the introduction of its "beta" version last month. ]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2010/12/Screen-shot-2010-12-21-at-11.06.43-AM.png"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2010/12/Screen-shot-2010-12-21-at-11.06.43-AM.png" alt="" title="Screen shot 2010-12-21 at 11.06.43 AM" width="144" height="132" class="alignright size-full wp-image-3901" /></a>Internet retailer and bargain finder Buy.com reports that over 20,000 people have downloaded the latest version of its iPhone app, since the introduction of its &#8220;beta&#8221; version last month. The new version is different from previous ones because it supports voice search, using <a href="http://www.memememobile.com/">MeMeMe&#8217;s mobile voice recognition technology</a> as well as the ability to search by scanning the ubiquitous UPC barcode printed on an item&#8217;s packaging. </p>
<p>Both input modalities provide quick, convenient ways for holiday shoppers to comparison shop, spanning the World Wide Web and the Real World. MeMeMe, as the verbiage on its Web site indicates, is on a mission to &#8220;voice-enable any mobile application or networked service.&#8221; That makes it direct competition to Nuance, Microsoft/Tellme, AT&#038;T/Watson, Loquendo and a few others. The Buy.com app for the iPhone is a good demonstration. It also shows how speech-based input is a peer to (but not a replacement for) keying in info or scanning barcodes. </p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/12/21/buy-coms-iphone-app-combines-speech-rec-and-barcode-reading/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>2011 Will See Stepped up Investment in &#8220;Speech And&#8230;&#8221;</title>
		<link>http://opusresearch.net/wordpress/2010/12/21/2011-will-see-stepped-up-investment-in-speech-and/</link>
		<comments>http://opusresearch.net/wordpress/2010/12/21/2011-will-see-stepped-up-investment-in-speech-and/#comments</comments>
		<pubDate>Tue, 21 Dec 2010 18:15:22 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Apple]]></category>
		<category><![CDATA[Mobile Speech Apps]]></category>
		<category><![CDATA[Speech recognition]]></category>
		<category><![CDATA[user experience]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=3889</guid>
		<description><![CDATA[Today Apple posted a job listing for an "iOS Speech SW Application Engineer." The job involves working with the iOS Applications Framework Team in "a fast paced environment with rapidly changing priorities." ]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2009/08/Apple_logo.png"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/08/Apple_logo.png" alt="" title="Apple_logo" width="121" height="137" class="alignright size-full wp-image-1314" /></a>Today Apple posted <a href="http://jobs.apple.com/index.ajs?BID=1&#038;method=mExternal.showJob&#038;RID=68870&#038;CurrentPage=1">a job listing</a> for an &#8220;iOS Speech SW Application Engineer.&#8221; The job involves working with the iOS Applications Framework Team in &#8220;a fast paced environment with rapidly changing priorities.&#8221; </p>
<p>Earlier this year I coined the term &#8220;No rest for the RESTful&#8221; to dramatize how new tools and API&#8217;s to support agile programming would accelerate application development and service delivery. As we enter 2011, the mantra appears to be &#8220;Speech is Sexy,&#8221; as evidenced not just by Apple&#8217;s &#8220;Help Wanted&#8221; posting, but by <a href="http://opusresearch.net/wordpress/2010/12/03/googles-latest-acquisition-brings-text-to-speech-luminaries-into-its-fold/">Google&#8217;s recent acquisition of Phonetic Arts</a> and rapid-fire refinement of mobile user interfaces from Nuance, Vlingo (powered by AT&#038;T&#8217;s Watson Engine), Google and Microsoft/Tellme.</p>
<p>The big difference in 2011 is that speech is getting more pervasive while, at the same time, it is being subsumed into multimodal user interfaces. Microsoft, for instance, continues to call speech recognition &#8220;foundational&#8221; to its user interface but, with the introduction of Kinect, already puts much more emphasis on accurate recognition of gestures. Last year Google&#8217;s Mike Cohen explained his objective of making speech as an alternative &#8220;every time&#8221; a keypad or keyboard is used on a mobile device.</p>
<p>2011 will be a year for smoothing out some of the rough spots in speech enabling the user experience. Candidates include better (more accurate) recognition, noise cancellation, more &#8220;human sounding&#8221; text-to-speech rendering, speech-to-speech translation, low-latency interaction with dynamic data &#8220;in the cloud,&#8221; and (to keep things safe, secure and personalized) voice biometrics-based authentication or ID proofing.</p>
<p>But there&#8217;s been a fundamental change. I used to write about the &#8220;Voice User Interface.&#8221; In the coming year attention will be on the &#8220;Mobile User Interface&#8221; that includes voice. That&#8217;s why it should not be a surprise to see that the the speech software engineer at Apple should be prepared to work on a &#8220;team&#8221; to accommodate the &#8220;rapidly changing priorities.&#8221; </p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/12/21/2011-will-see-stepped-up-investment-in-speech-and/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>SAIC Buys AppTek&#8217;s Assets; To Focus on Machine Translation for Gov&#8217;t Agencies</title>
		<link>http://opusresearch.net/wordpress/2010/11/11/saic-buys-appteks-assets-to-focus-on-machine-translation-for-govt-agencies/</link>
		<comments>http://opusresearch.net/wordpress/2010/11/11/saic-buys-appteks-assets-to-focus-on-machine-translation-for-govt-agencies/#comments</comments>
		<pubDate>Thu, 11 Nov 2010 23:07:28 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Machine Translation]]></category>
		<category><![CDATA[SAIC]]></category>
		<category><![CDATA[Speech recognition]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=3711</guid>
		<description><![CDATA[SAIC (Science Applications International Corporation) has purchased the assets of AppTek Partners, LLC; Applications Technology, Inc. and Media Mind for an undisclosed sum.]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2010/11/57955v1-max-250x250.png"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2010/11/57955v1-max-250x250.png" alt="" title="SAIC_logo" width="100" height="43" class="alignright size-full wp-image-3716" /></a>SAIC (Science Applications International Corporation) has purchased the assets of AppTek Partners, LLC; Applications Technology, Inc. and Media Mind for an undisclosed sum. Apptek, based in McLean Virginia, has built a base of software and intellectual property in the field of speech recognition and machine translation. It has taken a hybrid approach, initially pursuing a rules-based approach but adding a statistical engine in 2004 by acquiring a company called Aixplain.</p>
<p>AppTek brings bidirectional language pairs that include English with Arabic, Chinese, Dutch, Persian, French, German, Hebrew, Italian, Japanese, Korean, Polish, Portuguese, Russian, Spanish, Turkish and Ukrainian, and unidirectional pairs from Indonesian, Pashto, Tagalog and Urdu into English. SAIC expects its  acquired technology and assets to strengthen its efforts to deliver language services to the intelligence, defense and law enforcement communities. This complements SAIC&#8217;s portfolio of translation products that already spanned 70 languages. </p>
<p>SAIC is part of the Fortune 500 with over 45,000 employees. At various points in its evolution, it owned Network Solutions, which manages the DNS (Domain Name Server) that houses the majority of popular Web addresses (.com, .net, .org, .edu, for example). It has owned Bellcore and Telcordia &#8211; professional service providers that also manage the library of software and support materials that power all of the electronic switches and ancillary processors that conform to &#8220;Bell spec&#8221;. </p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/11/11/saic-buys-appteks-assets-to-focus-on-machine-translation-for-govt-agencies/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Vlingo Shares Vision for Voice-enhanced Activities</title>
		<link>http://opusresearch.net/wordpress/2010/11/08/vlingo-shares-vision-for-voice-enhanced-activities/</link>
		<comments>http://opusresearch.net/wordpress/2010/11/08/vlingo-shares-vision-for-voice-enhanced-activities/#comments</comments>
		<pubDate>Mon, 08 Nov 2010 22:39:56 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[mobile applications]]></category>
		<category><![CDATA[multimodal messaging]]></category>
		<category><![CDATA[Speech recognition]]></category>
		<category><![CDATA[Vlingo]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=3682</guid>
		<description><![CDATA[Hats off to the folks at Vlingo for rendering this vision piece in which cartoon characters depict a set of use cases for hands free, eyes forward speech-enhanced applications in your car, in your kitchen and elsewhere.]]></description>
			<content:encoded><![CDATA[<p>Hats off to the folks at Vlingo for rendering this vision piece in which cartoon characters depict a set of use cases for hands free, eyes forward speech-enhanced applications in your car, in your kitchen and elsewhere. We don&#8217;t have to rely on Jetsons&#8217; re-runs or the the next Star Trek feature to be reminded that, deep down, we expect our preferred devices (as well as our friends) to respond to what we say.</p>
<p><object width="640" height="390"><param name="movie" value="http://www.youtube.com/v/4amvRLG4un8&#038;hl=en_US&#038;feature=player_embedded&#038;version=3"></param><param name="allowFullScreen" value="true"></param><param name="allowScriptAccess" value="always"></param><embed src="http://www.youtube.com/v/4amvRLG4un8&#038;hl=en_US&#038;feature=player_embedded&#038;version=3" type="application/x-shockwave-flash" allowfullscreen="true" allowScriptAccess="always" width="640" height="390"></embed></object></p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/11/08/vlingo-shares-vision-for-voice-enhanced-activities/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google Adds Voice Search in Dutch</title>
		<link>http://opusresearch.net/wordpress/2010/11/05/google-adds-voice-search-in-dutch/</link>
		<comments>http://opusresearch.net/wordpress/2010/11/05/google-adds-voice-search-in-dutch/#comments</comments>
		<pubDate>Fri, 05 Nov 2010 18:53:10 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[mobile search]]></category>
		<category><![CDATA[Speech recognition]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=3679</guid>
		<description><![CDATA[Telecom Paper reports that Google Voice Search now supports Dutch on iPhones and Android-based mobile devices. ]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2009/08/Google_logo.png"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/08/Google_logo.png" alt="" title="Google_logo" width="160" height="74" class="alignright size-full wp-image-1265" /></a><a href="http://www.telecompaper.com/news/article.aspx?cid=766738">Telecom Paper reports</a> that Google Voice Search now supports Dutch on iPhones and Android-based mobile devices. The application accepts search terms, but not phrases at this point. </p>
<p>For those keeping score: Google Voice Search exists in English, Mandarin, Japanese, Korean, French, German, Italian, Spanish, Czech, Polish, Russian and Turkish. Speech recognition is available for Android 2.2 and 2.1. </p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/11/05/google-adds-voice-search-in-dutch/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Soft Launch for Sensory&#8217;s &#8220;Speech Triggers&#8221;</title>
		<link>http://opusresearch.net/wordpress/2010/10/25/soft-launch-for-sensorys-speech-triggers/</link>
		<comments>http://opusresearch.net/wordpress/2010/10/25/soft-launch-for-sensorys-speech-triggers/#comments</comments>
		<pubDate>Mon, 25 Oct 2010 23:32:52 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Android]]></category>
		<category><![CDATA[Bluetooth]]></category>
		<category><![CDATA[iPhone]]></category>
		<category><![CDATA[Mobile Speech Apps]]></category>
		<category><![CDATA[Safe Texting]]></category>
		<category><![CDATA[Speech recognition]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=3620</guid>
		<description><![CDATA[In June 2009, I blogged this post noting that Sensory Inc. had packaged "word spotting" capabilities into its firmware in order to support "truly hands free" speech recognition into a wide range of mobile, in car and, ultimately, ubiquitous appliances. ]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2010/10/sensorylogo.gif"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2010/10/sensorylogo.gif" alt="" title="sensorylogo" width="144" height="77" class="alignright size-full wp-image-3632" /></a>In June 2009, I blogged <a href="http://opusresearch.net/wordpress/2009/06/08/voice-control-for-mobile-hands-free-or-not-hands-free/">this post</a> noting that <a href="http://www.sensoryinc.com/">Sensory Inc.</a> had packaged &#8220;word spotting&#8221; capabilities into its firmware in order to support &#8220;truly hands free&#8221; speech recognition into a wide range of mobile, in car and, ultimately, ubiquitous appliances. A month later Mozer and I discussed the <a href="http://opusresearch.net/wordpress/2009/07/30/sensorys-ceo-moser-on-the-scids-speech-controlled-internet-devices/">prospects for a broad variety of &#8220;Speech Controlled Internet Devices&#8221;</a> (SCIDs) to proliferate as people discover the value of a hands-free way to activate, instruct or create content for a the growing number of &#8220;black boxes&#8221;, &#8220;gray boxes&#8221; or everyday appliances that are connected to the Internet (and related cloud-based applications, media or content). </p>
<p>Todd demonstrates the SCIDs concept in this video:</p>
<p><object width="640" height="390"><param name="movie" value="http://www.youtube.com/v/oRdN1iS8Rcw&#038;rel=0&#038;hl=en_US&#038;feature=player_embedded&#038;version=3"></param><param name="allowFullScreen" value="true"></param><param name="allowScriptAccess" value="always"></param><embed src="http://www.youtube.com/v/oRdN1iS8Rcw&#038;rel=0&#038;hl=en_US&#038;feature=player_embedded&#038;version=3" type="application/x-shockwave-flash" allowfullscreen="true" allowScriptAccess="always" width="600" height="365"></embed></object></p>
<p>Note that the embedded title for this YouTube video is &#8220;Truly Hands Free Trigger.wmv&#8221;. That&#8217;s because Sensory is now shipping Speech Trigger SDKs (software development kits) for mobile devices running either Apple&#8217;s iOS4 or the Android operating system. Vlingo, for one, has taken full advantage of the SDK and introduced a hands-free services called Vlingo InCar. As illustrated in the video below, the &#8220;trigger&#8221; occurs when a driver says &#8220;Hey Vlingo&#8221;. Then, in response to the prompt &#8220;What&#8217;s shakin&#8217;?&#8221; drivers can here the contents of a text message, reply, originate other messages and search for a local business to get spoken directions. </p>
<p><object width="640" height="390"><param name="movie" value="http://www.youtube.com/v/Vqs8XfXxgz4&#038;hl=en_US&#038;feature=player_embedded&#038;version=3"></param><param name="allowFullScreen" value="true"></param><param name="allowScriptAccess" value="always"></param><embed src="http://www.youtube.com/v/Vqs8XfXxgz4&#038;hl=en_US&#038;feature=player_embedded&#038;version=3" type="application/x-shockwave-flash" allowfullscreen="true" allowScriptAccess="always" width="600" height="365"></embed></object></p>
<p>The service is optimized for the Android 2.2 operating system (&#8220;Foyo&#8221;) and uses Bluetooth for connectivity. Users of the existing Vlingo app, can also originate tweets, update status on Facebook, open and control 3rd party applications and the other activities that Vlingo supports. After a two week &#8220;beta&#8221; when the service was available only to Spring subscribers, Vlingo expanded to &#8220;all Android users&#8221; on October 19.</p>
<p>For those of you keeping track at home, the announcement of Vlingo InCar and the release of the &#8220;Trigger SDK&#8221; was almost simultaneous with the launch of StartTalking from<br />
a Boston-based technology venture called AdelaVoice. As you can see in the video below, AdelaVoice laid claim to being the first and only mobile application that provides a hands-free, eyes-on-the-road way to send and receive text messages. It is a well-understood problem and one that is attracting significant investment dollars, as carriers and application providers work to support safe spoken input of text.</p>
<p><a href="#" onclick="return(false);" rel="videoGUID=257917baw1f19t49ecH83d1Lc2c40eee8c18&#038;" style="display:block;width:640px;height:360px" id="257917baw1f19t49ecH83d1Lc2c40eee8c18" width="600" height="335">StartTalking Overview</a> <script type="text/javascript" src="http://360.sorensonmedia.com/257917baw1f19t49ecH83d1Lc2c40eee8c18/embedv2.js"></script></p>
<p>Todd Mozer tells me that the same firmware that supports &#8220;trigger words&#8221; is being used in high-end, Bluetooth based earpieces from Plantronics to support a hands-free &#8220;answer or ignore&#8221; function for treating incoming calls. The phone can grab Caller ID information, render it as speech and then prompt the driver to react with &#8220;You can say &#8216;answer&#8217; or &#8216;ignore&#8217;.&#8221; It&#8217;s brilliantly simple, and much safer than looking around and reaching down to find the phone. In both cases, the objective of the firmware is to provide the most pleasant and efficient user experience possible.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/10/25/soft-launch-for-sensorys-speech-triggers/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

