<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Opus Research &#187; Speech recognition</title>
	<atom:link href="http://opusresearch.net/wordpress/tag/speech-recognition/feed/" rel="self" type="application/rss+xml" />
	<link>http://opusresearch.net/wordpress</link>
	<description>Analysis and Expertise on Voice Services and Recombinant Communications</description>
	<lastBuildDate>Thu, 29 Jul 2010 23:57:39 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Nuance Brings Speech to ng Connect</title>
		<link>http://opusresearch.net/wordpress/2010/06/08/nuance-brings-speech-to-ng-connect/</link>
		<comments>http://opusresearch.net/wordpress/2010/06/08/nuance-brings-speech-to-ng-connect/#comments</comments>
		<pubDate>Wed, 09 Jun 2010 00:58:44 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Nuance]]></category>
		<category><![CDATA[Recombinant Communications]]></category>
		<category><![CDATA[Speech recognition]]></category>
		<category><![CDATA[speech-enabled mobile services]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=3000</guid>
		<description><![CDATA[With its focus squarely on the automotive market, Nuance announced that it has joined ng Connect, an intercompany, interdisciplinary wireless development initiative founded in 2009 by Alcatel-Lucent. ]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2010/06/NEW_ngConnect_color1.jpg"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2010/06/NEW_ngConnect_color1.jpg" alt="" title="NEW_ngConnect_color" width="150" height="66" class="alignright size-full wp-image-3004" /></a>With its focus squarely on the automotive market, <a href="http://www.nuance.com/news/pressreleases/2010/20100608_globalNG.asp">Nuance announced that it has joined ng Connect</a>, an intercompany, interdisciplinary wireless development initiative founded in 2009 by Alcatel-Lucent. The consortium is a mixed bag of technology providers, that include (in addition to ALU) &#8220;name brand&#8221; IT infrastructure providers like HP, NCR, QNX Software and Gemalto along with mobile device makers like Samsung and Kyocera (which makes its own phones as well as Sanyo), joined by application and content providers like <a href="http://opusresearch.net/wordpress/2009/03/25/keep-an-eye-or-ear-out-for-rebelvox/">RebelVox</a>, Atlantic Recordings, BUZZMedia and about twenty others. </p>
<p>The focus of ng Connect is LTE, referring to the &#8220;Long Term Evolution&#8221; of 3G mobile, a term which, itself is trademarked by the 3GPP (3G Partnership Project). Both the 3GPP and ng Connect seek to develop applications and services that take advantage of the high throughput, low latency and plug-and-play nature of the underlying technologies. It is considered a stepping stone toward the 4G (and the as-yet unspecified 5G) environments which promise higher speeds and larger cell sizes. In joining ng Connect, Nuance is the lone provider of key speech processing technologies (Automated Speech Recognition and Text-To-Speech) as well as the predictive texting applications that can fulfill requirements for the new requirements for command, information input and dialog that users of mobile applications require.</p>
<p>Toyota, though its U.S. Sales Division, is the lone automobile manufacturer participating in ng Connect. Nuance has already had its speech processing technologies <a href="http://opusresearch.net/wordpress/2010/01/07/ford-turns-cars-into-open-platforms-for-recombinant-mobile-speech-apps/">integrated into the Ford SYNC system</a> (along with Microsoft and BSquare). Through its participation in ng Connect, Nuance hopes to benefit from sharing results of shared customer research, technology development and market conditioning and promotion. In the automotive domain, Loquendo is probably Nuance&#8217;s most direct competitor (with well-accepted solutions in &#8220;embedded&#8221; text-to-speech and speech recogntion, well-suited for both in-car systems and mobile phones).  </p>
<p>Yet there is no better place than a moving car to showcase the value of high-speed, wireless links and low-latency access to shared resources built around maps, games, social networks and associated data stores. Accurate speech processing is important, but so is elegant orchestration of real-time transactions, asynchronous and synchronous conversations and a broad variety of interactions. In the car Microsoft, with Bing Search, Maps and active Windows development community of developers are already pitted against Google with its voice search, maps and Android developers. Joining the ng Connect community immediately expands Nuance&#8217;s hooks into a broader community of technology providers that stand ready to cooperate to bring more services to market.</p>
<p>The track record for these types of consortia is spotty. While participants profess to promote an &#8220;open&#8221; or shared approach to solution development, participating companies have shown a pre-disposition to protect their brands and their perceived competitive advantage. Still, we wish them luck.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/06/08/nuance-brings-speech-to-ng-connect/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Nuance Buys MacSpeech: Getting more Cozy with Apple and its Users</title>
		<link>http://opusresearch.net/wordpress/2010/02/16/nuance-buys-macspeech-getting-more-cozy-with-apple-and-its-users/</link>
		<comments>http://opusresearch.net/wordpress/2010/02/16/nuance-buys-macspeech-getting-more-cozy-with-apple-and-its-users/#comments</comments>
		<pubDate>Tue, 16 Feb 2010 16:14:03 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Apple]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[Nuance]]></category>
		<category><![CDATA[Speech recognition]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=2376</guid>
		<description><![CDATA[Nuance announced that it is acquiring MacSpeech, a company that has been developing speech recognition resources for Apple's MacIntosh computers since 1996. ]]></description>
			<content:encoded><![CDATA[<p><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/08/NuanceLogo-150x107.png" alt="NuanceLogo" title="NuanceLogo" width="150" height="107" class="alignright size-thumbnail wp-image-1194" />In this <a href="http://www.nuance.com/news/pressreleases/2010/20100216_macspeech.asp">press release</a>, Nuance announced that it is acquiring MacSpeech, a company that has been developing speech recognition resources for Apple&#8217;s MacIntosh computers since 1996. Because MacSpeech started licensing Dragon Dictate in 2008, this acquisition may amount to a mere formality. But Peter Mahoney, Dragon&#8217;s general manager (as well as senior vice president of Nuance) explains that the two companies will work more closely to bring a line of Dragon branded dictation software to market that is &#8220;100 percent Mac.&#8221;</p>
<p>Symbolically, the move signals that Nuance (especially with the Dragon brand) is emerging as the most-favored speech-processing technology across Apple&#8217;s broad spectrum of  computing and communications platforms, including personal computers, notebooks, laptops, iPhones, iPods and (we would supposed) the upcoming iPad. As we&#8217;ve all learned from the &#8220;Google phenomenon&#8221;, the more utterances that a company is able to collect on behalf of a broad customer base, the better the odds of accurate rendering of spoken words. Stated differently, deeper integration into Apple&#8217;s application environment should translate into a better user experience as those users help computers &#8220;learn&#8221; to understand what they are saying.</p>
<p>Google&#8217;s &#8220;cloud-based&#8221; approach to speech recognition makes it &#8220;platform agnostic&#8221;, therefore we will continue to see speech-enabled Google applications on iPhones and other connected platforms. Google will continue to use the voluminous amounts of spoken phrases to expand a portfolio of offerings that already includes search, transcription and translation. Nuance&#8217;s is responding by forging relationships aimed to make its speech-enabled applications work well on Mac&#8217;s and other Apple branded products, while at the same time supporting multimodal communications on a wide variety of mobile handsets. </p>
<p>Now, all eyes (or ears) should be on Microsoft, the only other &#8220;superpower&#8221; with the ability to define and refine speech-enabled user experience across platforms and modalities. With its own speech recognition &#8220;engine&#8221;, a dedicated speech app server farm called Tellme and some excellent &#8220;voice search&#8221; apps operating under the Bing brand, it has the potential to compete. But, as Greg Sterling noted in <a href="http://opusresearch.net/wordpress/2010/02/15/windows-7-phone-where-is-voice/">this post</a> about the introduction of its new mobile operating system, it has definitely bestowed second-class status to its voice enabled services.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2010/02/16/nuance-buys-macspeech-getting-more-cozy-with-apple-and-its-users/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Nuance Voicemail-to-Text Powers AT&amp;T&#8217;s Transcription Service</title>
		<link>http://opusresearch.net/wordpress/2009/11/30/nuance-voicemail-to-text-powers-atts-transcription-service/</link>
		<comments>http://opusresearch.net/wordpress/2009/11/30/nuance-voicemail-to-text-powers-atts-transcription-service/#comments</comments>
		<pubDate>Mon, 30 Nov 2009 19:43:42 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[AT&T]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[Nuance]]></category>
		<category><![CDATA[Speech recognition]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=1954</guid>
		<description><![CDATA[It's official. The platform for AT&#038;T's Voicemail to Text service is Nuance Communications. As described on the AT&#038;T Mobility Web site, the carrier charges $9.99 per month for the service, which enables subscribers to receive transcriptions of voicemail messages as email and/or text messages in addition to the traditional dial-in-and-retrieve method. ]]></description>
			<content:encoded><![CDATA[<p><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/11/ATTMobility-150x150.jpg" alt="AT&amp;TMobility" title="AT&amp;TMobility" width="150" height="150" class="alignleft size-thumbnail wp-image-1956" />It&#8217;s official. The platform for AT&#038;T&#8217;s Voicemail to Text service is Nuance Communications. As described on <a href="http://www.wireless.att.com/learn/basics/choosing-features-services/voicemail-to-text.jsp">this page</a> on the AT&#038;T Wireless Web site, the carrier charges $9.99 per month for the service, which enables subscribers to receive transcriptions of voicemail messages as email and/or text messages in addition to the traditional dial-in-and-retrieve method. </p>
<p><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/08/NuanceLogo-150x107.png" alt="NuanceLogo" title="NuanceLogo" width="150" height="107" class="alignright size-thumbnail wp-image-1194" />This is something of a coup for Nuance in a hotly contested marketplace that spans the fully-automated service integrated into Google Voice to the various levels of human assisted services from DiTech Network&#8217;s PhoneTag and SpinVox, in addition to Nuance. </p>
<p>Back in September  <a href="http://opusresearch.net/wordpress/2009/09/16/att-takes-stock-of-mobile-speech-invests-in-vlingo/">AT&#038;T made an equity investment in Vlingo</a> which might have given that speech-to-text specialist an inside track on transcription. However, Vlingo has applied its expertise toward spoken input for mobile applications &#8211; including search, text messaging and voice commands, and has not invested specifically in voicemail transcription and delivery. </p>
<p>I have been trialing Nuance&#8217;s Voicemail to Text for several months, and have it configured to deliver transcribed messages as email with the media file attached. I can attest to the fact that the quality of transcription is accurate enough to make it quite valuable for me. The service will now benefit from tighter integration with AT&#038;T Mobility&#8217;s other enhanced messaging features and functions including delivery of the messages as SMS text.</p>
<p>Carriers have an important role to play in promoting speech-enabled services to mobile subscribers. Thus far they have been a promiscuous bunch, hedging their bets among providers of voice dialing, voice search, dictation and command. This multi-layered approach will prevail for the foreseeable future as mobile subscribers will ultimately vote with their pocketbooks to determine which services and pricing methodologies make the most sense (or cents) in the long run.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2009/11/30/nuance-voicemail-to-text-powers-atts-transcription-service/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Rumblings From the Asterisk Crowd: $99/channel Speech Rec from Vestec</title>
		<link>http://opusresearch.net/wordpress/2009/10/01/rumblings-from-the-asterisk-crowd-99channel-speech-rec-from-vestec/</link>
		<comments>http://opusresearch.net/wordpress/2009/10/01/rumblings-from-the-asterisk-crowd-99channel-speech-rec-from-vestec/#comments</comments>
		<pubDate>Thu, 01 Oct 2009 15:44:42 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Recombinant Telephony]]></category>
		<category><![CDATA[Speech recognition]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=1561</guid>
		<description><![CDATA[Digium partners with Vestec to offer low-cost speech recogntion engine for Asterisk.]]></description>
			<content:encoded><![CDATA[<p><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/10/Digium_logo.png" alt="Digium_logo" title="Digium_logo" width="148" height="58" class="alignright size-full wp-image-1562" />I&#8217;m a little late to report this, but it happened below the Opus Research Radar. Around September 17, Digium began to market a low-cost speech recognition solution from <a href="http://www.vestec.ca/aboutus">Vestec, Inc</a>. based in Waterloo, Ontario, Canada. Vestec&#8217;s corporate charter, from its Web site, is &#8220;to commoditize, standardize, and demystify speech recognition through use of unique artificial intelligence (AI) technologies.&#8221; We&#8217;ve heard the &#8220;make speech affordable&#8221; message before from the likes of Lumenvox and Voxeo, as well as a number of &#8220;open source&#8221; community members around Sphynx.</p>
<p>In Digium, Vestec has found a partner that is promoting its speech recognition engine to the thrifty folks among the Asterisk community. We&#8217;ll be watching to see how well speech recognition becomes ingrained in the Asterisk fabric. We do note that commoditization of speech is but the first step of the longer journey of creating a good user experience. </p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2009/10/01/rumblings-from-the-asterisk-crowd-99channel-speech-rec-from-vestec/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>VoltDelta Amps Up With New Branding and OnDemand Offer</title>
		<link>http://opusresearch.net/wordpress/2009/08/24/voltdelta-amps-up-with-new-branding-and-ondemand-offer/</link>
		<comments>http://opusresearch.net/wordpress/2009/08/24/voltdelta-amps-up-with-new-branding-and-ondemand-offer/#comments</comments>
		<pubDate>Tue, 25 Aug 2009 03:29:36 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Recombinant Telephony]]></category>
		<category><![CDATA[Speech recognition]]></category>
		<category><![CDATA[Voice Self Service]]></category>
		<category><![CDATA[VoltDelta]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=1287</guid>
		<description><![CDATA[VoltDelta OnDemand Solutions shows that Volt Information Sciences has done a super job of assimilating a succession of recent acquisitions. ]]></description>
			<content:encoded><![CDATA[<p><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/08/Picture-15.png" alt="Picture 15" title="Picture 15" width="134" height="44" class="alignright size-full wp-image-1289" />There&#8217;s a longer story to come, but we were treated to a briefing from VoltDelta OnDemand Solutions that shows that the company (or its parent Volt Information Sciences) has done a super job of assimilating a succession of acquisitions over the last few years. After a couple of years of touting its &#8220;telco grade&#8221; hosting services, new marketing and sales management personnel have assembled its capabilities into a coherent set of solutions that closely adhere to Opus Research&#8217;s vision of Recombinant Telephony services.</p>
<p>Leveraging the capabilities of a platform that includes Nortel&#8217;s installed base of enhanced directory assistance query processing and agent platforms (acquired in 2004), plus similar platforms developed by Varetis Solutions GMBH (acquired in 2006) and the listings aggregation capabilities of LSSIData (obtained in 2007), the company has the foundation for unassailable leadership in enhanced directory assistance, handling over 3.4 billion queries each year. But DA is, admittedly a slow growth or no-growth domain as Web-based search engines add location-based queries that include addresses, phone numbers and other relevant data.</p>
<p>Enhanced DA is just a teaser demo for the variety of services that the VoltDelta OnDemand can offer hosted services customers. It is a full-blown platform for virtual contact centers that includes speech-enabled interactive voice response (IVR), enhanced call routing and back end data processing that rivals anything else we&#8217;ve seen in the marketplace. Yet, thus far, the company has maintained a very low profile, relegating itself to the &#8220;highly reliable&#8221; or &#8220;telco quality&#8221; platform offering that puts it on a par with Microsoft&#8217;s Tellme or Nuance On Demand, with less of a commitment to standards than Voxeo.</p>
<p>VoltDelta builds differentiation on its ready-made incorporation of live agents into the call flow and workflow. As a leading provider of Directory Assistance, it has built significant credibility surrounding the handling of call volumes and enhanced IVR. Now its challenge is to expand its offer into selected verticals, including more telecommunications, financial services, healthcare and government offered services like the ubiquitous 511-based access to local information.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2009/08/24/voltdelta-amps-up-with-new-branding-and-ondemand-offer/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Voxeo and Nu Echo Collaborate to Speed Grammar Development</title>
		<link>http://opusresearch.net/wordpress/2009/08/24/voxeo-and-nu-echo-collaborate-to-speed-grammar-development/</link>
		<comments>http://opusresearch.net/wordpress/2009/08/24/voxeo-and-nu-echo-collaborate-to-speed-grammar-development/#comments</comments>
		<pubDate>Mon, 24 Aug 2009 12:30:45 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Recombinant Telephony]]></category>
		<category><![CDATA[Speech recognition]]></category>
		<category><![CDATA[Voice Self Service]]></category>
		<category><![CDATA[Voxeo]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=1278</guid>
		<description><![CDATA[One of the first announcement out of SpeechTEK comes from a pair of companies striving to promote faster development of multi-modal self-service systems that involve specialized grammars. ]]></description>
			<content:encoded><![CDATA[<p><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/08/logo_voxeo.gif" alt="logo_voxeo" title="logo_voxeo" width="74" height="80" class="alignright size-full wp-image-1281" />One of the first announcement out of SpeechTEK comes from a pair of companies striving to promote faster development of multi-modal self-service systems that involve specialized grammars. The announcement more closely links the products and services of Voxeo (including VoiceObjects and IMified) with the Eclipse-based grammar development tools incorporated in Nu Echo&#8217;s NuGram IDE Basic Edition.</p>
<p>The announcement includes a glowing testimonial from the German wireless service provider T-Mobile, which has benefited from a solution that incorporates NuGram as a &#8220;plug-in&#8221; in Voxeo/VoiceObject&#8217;s Eclipse-based development environment. In the spirit of Recombinant Telephony, Voxeo and Nu Echo executives see a broad potential for this standards-based approach.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2009/08/24/voxeo-and-nu-echo-collaborate-to-speed-grammar-development/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Microsoft&#8217;s Tellme Sees Positive Signs for Mobile Speech</title>
		<link>http://opusresearch.net/wordpress/2009/08/10/microsofts-tellme-sees-positive-signs-for-mobile-speech/</link>
		<comments>http://opusresearch.net/wordpress/2009/08/10/microsofts-tellme-sees-positive-signs-for-mobile-speech/#comments</comments>
		<pubDate>Mon, 10 Aug 2009 23:34:04 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[mobile speech]]></category>
		<category><![CDATA[Opus Research]]></category>
		<category><![CDATA[Speech recognition]]></category>
		<category><![CDATA[Tellme]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=1198</guid>
		<description><![CDATA[Tellme had some interesting research results regarding social, mobile multitasking. Not surprisingly, it reveals that the group of mobile subscribers who have smartphones have their phones at ready and in use when they undertake their daily activities.]]></description>
			<content:encoded><![CDATA[<p><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/08/Picture-31.png" alt="Picture 3" title="Picture 3" width="137" height="61" class="alignright size-full wp-image-1199" />I&#8217;m not sure how we missed this. The <a href="http://www.geek.com/articles/mobile/mobile-phone-etiquette-is-changing-according-to-study-20090810/">Geek.com blog reports</a> today that Microsoft&#8217;s speech hosting subsidiary Tellme had some interesting research results regarding social, mobile multitasking. Not surprisingly, it reveals that the group of mobile subscribers who have smartphones have their phones at ready and in use when they undertake their daily activities. Eighty-eight percent (88%) say they use their phones when shopping. Seventy-eight percent (78%) use them when &#8220;walking between places&#8221;. In a slight drop-off, those who are willing to admit it (68%) say that they consult their smartphones while visiting with friends. </p>
<p>Being a speech technology company, Tellme polled its subjects to gauge their level of comfort with entering commands or search terms by talking aloud. While some indicated discomfort in certain settings (Geek.com cited &#8220;a restaurant), 71% of the overall sample indicated that they would be comfortable with voice command. The further breakdown indicated high levels of comfort while walking (93%), exercising (92%), and shopping or running errands (87%). Tellme issued this <a href="http://www.tellme.com/about/media_center/release/20090729">press release</a> with these findings on July 29.  </p>
<p>There&#8217;s no question that the mobile public is taking more interest in mobile speech applications. Later this month Opus Research will be issuing a new Research Report on mobile speech applications complete with a market assessment and forecast, trends analysis and portrait of the many companies that make up the service development and delivery ecosystem, including core speech processing technology providers, application developers, device makers, platform operators and wireless carriers. As I conduct interviews for this document, it is clear that interest is heating up.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2009/08/10/microsofts-tellme-sees-positive-signs-for-mobile-speech/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Melodis Raises $4 Million for Its Music Search App and Service</title>
		<link>http://opusresearch.net/wordpress/2009/08/04/melodis-raises-4-million-for-its-music-search-app-and-service/</link>
		<comments>http://opusresearch.net/wordpress/2009/08/04/melodis-raises-4-million-for-its-music-search-app-and-service/#comments</comments>
		<pubDate>Tue, 04 Aug 2009 22:49:42 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[mobile services]]></category>
		<category><![CDATA[Speech enabled search]]></category>
		<category><![CDATA[Speech recognition]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=1156</guid>
		<description><![CDATA[Melodis, developer of the Midomi mobile music identification application, has raised $4 million in funding]]></description>
			<content:encoded><![CDATA[<p><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/08/Picture-1.png" alt="Picture 1" title="Picture 1" width="124" height="43" class="alignright size-full wp-image-1157" />As further testimonial to the fact that the investment community is warming up to &#8220;voice search,&#8221; Melodis, developer of the Midomi mobile music identification application, has raised $4 million in funding from Walden Venture Capital&#8217;s Larry Marcus and an unnamed mobile device manufacturer. The Midomi app is comparable to the freely downloadable Shazam (from Shazam Entertainment Ltd.) but it adds a very interesting twist. Earlier this year, Melodis struck a deal with musical instrument maker Yamaha that entitles it to use Yamaha&#8217;s &#8220;singing synthesis software&#8221; called Vocaloid. The combined technologies move mobile voice search into a new realm whereby Midomi can identify songs when a person does his or her best job of humming the tune. </p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2009/08/04/melodis-raises-4-million-for-its-music-search-app-and-service/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Sensory&#8217;s CEO Mozer On the SCIDs (Speech-Controlled Internet Devices)</title>
		<link>http://opusresearch.net/wordpress/2009/07/30/sensorys-ceo-moser-on-the-scids-speech-controlled-internet-devices/</link>
		<comments>http://opusresearch.net/wordpress/2009/07/30/sensorys-ceo-moser-on-the-scids-speech-controlled-internet-devices/#comments</comments>
		<pubDate>Fri, 31 Jul 2009 00:37:03 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[mobile services]]></category>
		<category><![CDATA[Speech recognition]]></category>
		<category><![CDATA[speech-enabled mobile services]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=1138</guid>
		<description><![CDATA[We completed a very interesting phone briefing with Todd Moser, CEO of, a company that specializes in affordable, embedded speech processing technologies.]]></description>
			<content:encoded><![CDATA[<p>We completed a very interesting phone briefing with Todd Mozer, CEO of <a href="http://www.sensoryinc.com/">Sensory Inc</a>., a company that specializes in affordable, embedded speech processing technologies. He provided a preview of a new chip that will be generally available later this year, so news surrounding it is still under embargo. Yet, in the course of his company presentation, we began discussing the prospects surrounding SCID&#8217;s, &#8220;Speech-Controlled Internet Device,&#8221; a broad category that spans household appliances, consumer electronics, automotive products and just about any personal, electronic device that could benefit from well-tuned mechanisms for voice-activation, instruction and content creation.</p>
<p>Mozer noted that Pat Gelsinger, the sr. vice president at Intel who manages its largest operating group had noted this forecast in a recent speech: “By 2015 there will be over 15 Billion devices connected to the Internet”. Gelsinger was referring to networked toasters, ovens, refrigerators, clocks, home entertainment units, lamps and sensors that are capable of sending and retrieving information from servers anywhere on the Internet. Most importantly, from Mozers point of view, a significant number of these devices are destined to be speech-enabled. </p>
<p>As users of speech-enabled navigational devices well-know, the combination of voice-input and data connectivity is powerful. Moser provided us with a demonstration of the <a href="http://www.brookstone.com/alarm-clocks-and-clock-radios_moshi-voice-activated-digital-alarm-clock.html?bkeid=search%7Cgoogle%7Cbidword%7Cmoshi_clock&#038;gclid=CPiWw8vU_psCFRwDagodeFFc-w">Moshi Alarm Clock</a>, an unconnected precursor to a SCID. After using the magic words &#8220;Hello Moshi&#8221; to wake up the device, spoken commands can be used to set the clock, set the alarm, ask for a spoken readout of the time or the temperature.</p>
<p>Connecting the Moshi Clock to the internet through WiFi, Bluetooth or WiMax (in some areas) would make it possible for Moshi to transform itself into a World Clock or provide temperatures from around the world. But there are lots of other use cases for SCIDS and Sensory is playing an important role by providing relatively inexpensive chips to handle so-called natural-language input. One of the immediate opportunities is arises from the need for &#8220;truly hands-free&#8221; ways to enter commands or dictate content to mobile devices. </p>
<p>In a soon-to-be-released report, I&#8217;ll be updating our survey and trends analysis of speech-enabled mobile services. Software to support natural-feeling voice input, especially for music playback and search, is getting much more affordable. Text-to-speech rendering is getting very good and very affordable. Speech-to-text transcription remains a bit elusive, but the ability to interpret &#8220;natural language&#8221; commands &#8211; within understood contexts and use cases &#8211; is getting very good. You can follow our coverage of developments here and you may find <a href="http://sensoryinc.com/blog/">Todd Mozer&#8217;s posts in the Sensory Inc. blog</a> quite interesting as well. </p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2009/07/30/sensorys-ceo-moser-on-the-scids-speech-controlled-internet-devices/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Promptu Claims a &#8220;First&#8221;: Speech-to-SMS Service Offered by Telecom Italia Mobile</title>
		<link>http://opusresearch.net/wordpress/2009/07/30/promptu-claims-a-first-speech-to-sms-service-offered-by-telecom-italia-mobile/</link>
		<comments>http://opusresearch.net/wordpress/2009/07/30/promptu-claims-a-first-speech-to-sms-service-offered-by-telecom-italia-mobile/#comments</comments>
		<pubDate>Thu, 30 Jul 2009 16:59:56 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[mobile search]]></category>
		<category><![CDATA[Speech recognition]]></category>
		<category><![CDATA[speech-enabled mobile services]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=1122</guid>
		<description><![CDATA[Two firms long known for innovative, high-visibility speech applications are joining forces to enable mobile subscribers to dictate and originate SMS text messages.]]></description>
			<content:encoded><![CDATA[<p><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/07/Picture-43.png" alt="Picture 4" title="Picture 4" width="104" height="53" class="alignright size-full wp-image-1130" />Two firms long known for innovative, high-visibility speech applications are joining forces to enable mobile subscribers to dictate and originate SMS text messages. Telecom Italia Mobile (TIM) distinguished itself early by operating one of the first and certainly the largest multi-application, speech-enabled mobile voice portal. At the dawn of the 21st century, using technology from sister company Loquendo, TIM&#8217;s i-Portal enabled all of its 36 million or so domestic subscribers to dial a number and talk their way through a broad spectrum of voice information services, akin to those offered by U.S.-based Tellme through the toll-free 800-555-TELL. </p>
<p><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/07/Picture-33.png" alt="Picture 3" title="Picture 3" width="104" height="96" class="alignleft size-full wp-image-1129" />In 2002, the i-Portal offered access to news, weather, sports and entertainment and had the capability to handle high volumes of activity. At about the same time, Promptu, then called AgileTV, was getting off the ground as a early-stage venture at SRI. Its core product was to be a voice-activated remote control for all the screen-based services offered by broadband carriers (primarily Cable TV system operators). The devices allowed cable subscribers to tune their sets to a selected program in response to instructions like &#8220;Find the Knicks game&#8221; or &#8220;Tune into some comedy&#8221;. Both Comcast and TimeWarner cable conducted market trials of the speech-activated remotes in 2005, and the company raised $22 million in venture funding.</p>
<p>By 2006, the company changed its name to Promptu and broadened its service offering to include voice-activated mobile search, including location-based services. VC&#8217;s chipped in another $11.6 million. In early 2007, another $5.6 million underwrote a further incursion into the mobile search world, with Italy becoming a major geographic focus. By this time, the entry of Vlingo, Yap and expanded offerings from Nuance under the Nuance Voice Control (NVC) brand and a number of other technology providers signaled the development of a speech-enabled mobile services ecosystem. </p>
<p>Promptu leveraged new attention to speech enabled mobile services to forge relationships with a number of content providers. With the Official Airline Guide (OAG) it rolled out voice-activated access to WAP-based flight information, which was branded Flights2Go.com and offered for $2.99/mo by Verizon Wireless. In September 2008 it formally announced a speech-to-text dictation service at the CTIA Wireless event in San Francisco. At Demo 2009, the company showed ShoutOUT a voice-to-SMS application for the iPhone. To the best of our knowledge, each product in the U.S. has hit considerable speed bumps on its way to mass adoption. The cable remote does not appear to have come out of trial. <a href="http://www.oag.com/landwithaplan/flights2go">Promotion of the Flights2Go WAPlet</a> does not mention that it is speech-enabled and ShoutOUT is nowhere to be found in the the iTunes App Store.</p>
<p>Now our eyes and ears are focusing on Italy as we look for carrier-based involvement in the text-to-SMS business. In North America, with new attention paid to the safety issues surrounding text messaging in cars, there is more incentive than ever for voice-activated texting (and Twittering) to take hold. The focus now must turn to providing the best user experience possible for an easy-to-use service that provides for hands-free, heads-up entry of commands and content. Promptu has clearly not cracked this nut, and it will not be alone it its efforts to create and popularize speech enabled mobile services.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2009/07/30/promptu-claims-a-first-speech-to-sms-service-offered-by-telecom-italia-mobile/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
