<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Opus Research &#187; Natural Language Understanding</title>
	<atom:link href="http://opusresearch.net/wordpress/tag/natural-language-understanding/feed/" rel="self" type="application/rss+xml" />
	<link>http://opusresearch.net/wordpress</link>
	<description>Analysis and Expertise on Voice Services and Conversational Commerce</description>
	<lastBuildDate>Wed, 23 May 2012 17:29:43 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>&#8220;Watson Come Here!&#8221; Now There&#8217;s a Conversational Speech API from AT&amp;T</title>
		<link>http://opusresearch.net/wordpress/2012/04/24/watson-come-here-now-theres-a-conversational-speech-api-from-att/</link>
		<comments>http://opusresearch.net/wordpress/2012/04/24/watson-come-here-now-theres-a-conversational-speech-api-from-att/#comments</comments>
		<pubDate>Wed, 25 Apr 2012 00:01:57 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[AT&T]]></category>
		<category><![CDATA[Automated Speech]]></category>
		<category><![CDATA[Mobile]]></category>
		<category><![CDATA[Natural Language Understanding]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=5296</guid>
		<description><![CDATA[Congratulations to AT&#038;T for making some waves among the growing pool of applications developers looking for ways to add conversational speech to their offerings.]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2012/04/ATTLogo.png"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2012/04/ATTLogo-150x50.png" alt="" title="AT&amp;*TLogo" width="150" height="50" class="alignright size-thumbnail wp-image-5303" /></a>Congratulations to AT&#038;T for making some waves among the growing pool of applications developers looking for ways to add conversational speech to their offerings. With relatively little fanfare (<a href="http://www.att.com/gen/press-room?pid=22694">this</a> being the closest thing to a press release), AT&#038;T let it be known that it would release new APIs (application programming interfaces) in June to make it easier for developers to integrate the goodness of AT&#038;T Watson(SM) speech processing technology, rather than some of the alternatives that are emerging in the marketplace. As AT&#038;T likes to explain it, &#8220;This technology reflects an investment of more than one million hours of research and development in speech technologies, leading to more than 600 U.S. patents and patent applications.&#8221;</p>
<p>AT&#038;T&#8217;s John Donovan provides more detail on the future offering <a href="http://www.attinnovationspace.com/innovation/story/a7782318">here</a>. In essence, when the APIs are released in June, developers can look forward to making it easier for the people who use their apps to use their voice (and probably their mobile phones) to carry out general web searches, find local businesses, engage in question and answer discussions and originate messages, including voice mail and text messages. Anticipating tighter coupling with Web-enabled TVs, there will also be a pre-built interface to AT&#038;T&#8217;s U-verse® electronic programming guide. </p>
<p>Here&#8217;s AT&#038;T&#8217;s YouTube video providing background on Watson and &#8220;how it will work&#8221; both for application developers and users:</p>
<p><iframe width="560" height="315" src="http://www.youtube.com/embed/rqleRoRdzII" frameborder="0" allowfullscreen></iframe></p>
<p>In short, AT&#038;T will be encouraging the developer community to take advantage of the market conditioning that Apple has undertaken by promoting the Siri Virtual Assistant through mass media advertising. But Apple, with Siri, is promoting the &#8220;closed garden&#8221; approach that has made the iPhone so successful. AT&#038;T is making it clear that it&#8217;s engine, and the one million person hours of R&#038;D will be there for the taking (though the mention of 600 patents) looks like a veiled warning that some metering of the use is inevitable.</p>
<p>Watson is already in use for voice search on iPhones and Android-based devices. An app called Speak4It is a mashup that includes AT&#038;T Watson, and has been available since mid-2010. In addition, the AT&#038;T Translator App, described <a href="http://www.att.com/gen/press-room?pid=22688">here</a>, has been a showcase for Watson technologies&#8217; ability to support 7 different languages.</p>
<p>While many in the business and tech press think AT&#038;T&#8217;s new offer portends a &#8220;smack-down with Siri,&#8221; that is not necessarily true. What we are destined to see is geometric growth in the energy put forth by developers to bring take advantage of voice as the &#8220;natural interface&#8221; for phones and other mobile/personal devices. Apple will continue to use its experience with Siri to refine the service offerings are Siri-enabled. Brisk sales of the iPhone 4S (which is almost identical to the iPhone 4, save for the inclusion of Siri), attest to its economic value. Apple&#8217;s implementation of Siri as a &#8220;beta&#8221; feature of the phone, give it sole access to the &#8220;Home&#8221; button as the mechanism to summon Siri (as well as the native accelerometer as a mechanism to support the &#8220;Raise to Speak&#8221; function).</p>
<p>Meanwhile, developers can look forward to more tools for using natural language in specific domains. AT&#038;T has deemed general search, Q&#038;A, business search and control of TVs to be worthy of early attention. Our belief is that positive experience in these domains will lead to the development of more <a href="http://opusresearch.net/wordpress/2010/08/16/voice-actions-for-android-speechable-moments-from-google-spell-new-market-dynamics/">&#8220;speechable moments,</a>&#8221; as I started calling them back in mid-2010. Today there are more than a dozen firms &#8211; coming out of a diverse development community that, in addition to speech processing, spans customer care analytics, automotive electronics, telematics and pure academic research &#8211; that have formidable software platforms optimized for understanding the semantic structure, logical content and practical context of spoken phrases. </p>
<p>At this stage in the markets maturity, we&#8217;re happy to see AT&#038;T entering the market with tools and resources to support the developer community. </p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2012/04/24/watson-come-here-now-theres-a-conversational-speech-api-from-att/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>From QA &amp; WFO to AI &amp; VA: VPI Introduces VirtualSource</title>
		<link>http://opusresearch.net/wordpress/2012/04/17/from-qa-wfo-to-ai-va-vpi-introduces-virtualsource/</link>
		<comments>http://opusresearch.net/wordpress/2012/04/17/from-qa-wfo-to-ai-va-vpi-introduces-virtualsource/#comments</comments>
		<pubDate>Tue, 17 Apr 2012 21:31:36 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Contact Center]]></category>
		<category><![CDATA[customer care]]></category>
		<category><![CDATA[Natural Language Understanding]]></category>
		<category><![CDATA[self-service]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=5265</guid>
		<description><![CDATA[Long time call recording, quality assurance (QA) and Workforce Optimization (WFO) specialist VPI (aka Voice Print International) launched a new, cloud-based virtual assistant (VA) service called VirtualSource. ]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2012/04/VPIlogo.png"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2012/04/VPIlogo-150x49.png" alt="" title="VPIlogo" width="150" height="49" class="alignright size-thumbnail wp-image-5266" /></a>In a transition that is not trivial, long-time call recording, quality assurance (QA) and Workforce Optimization (WFO) specialist VPI (aka Voice Print International) launched a new, cloud-based <a href="http://www.vpi-corp.com/news_fullstory.asp?article_id=779">virtual assistant (VA) service called VPI VirtualSource</a>. The service is positioned as a &#8220;scalable workforce in the cloud,&#8221; not as a hosted IVR, according to Patrick Botz, VPI&#8217;s vice president of marketing. It employees Nuance automated speech recognition with statistical language modeling in combination with VPI&#8217;s proprietary &#8220;intelligence engine&#8221; (referred to as &#8220;the brain&#8221;) to support conversational, phone-based interactions.</p>
<p>The company, which has 1,500 customers, is now rolling out the service to organizations of all sizes and in all industry sectors. VPI&#8217;s value proposition is to &#8220;do all the work&#8221; of setting up an agent. The start-up process takes 4-6 weeks and accounts for about 3-4 &#8220;person days&#8221; all told. That includes feeding it the live recordings, suggested scripts and other tuning information. Unlike a typical speech IVR system, VirtualSource does not use or require companies to maintain logic trees. Instead,  VPI uses about 10-20 recorded conversations (preferably with successful outcomes) and augments it with information, rules and logic from training manuals, call-flow diagrams or interviews with contact center managers or supervisors. Performance should improve with time, as the system carries out more and more conversations. </p>
<p>VPI customers can run the virtual agents side-by-side with live customer service reps. Comparisons of the two can be used to tune the virtual agent or to determine which tasks should are most eligible for automated handling. Visit <a href="http://www.vpi-corp.com/VirtualSource/">here</a> (scroll down) to listen to sample conversations. At this point, the company uses a slightly robotic aspect to the voice rendering, which allows the virtual agent to adapt more rapidly to dynamic conversations. </p>
<p>The &#8220;brain&#8221; or &#8220;intelligence engine&#8221; integrates important cognitive functions including &#8220;short- and long-term memory, natural language understanding, reasoning, learning mechanisms, goal-directed behavior and meta-cognition.&#8221; It&#8217;s service cloud is PCI-compliant and Botz believes that companies will see value in using virtual agents to receive payment instructions securely (less prone to human error or theft.)</p>
<p>VPI bills on a per use basis at $0.25 per minute (including tuning) with a monthly minimum and a one-year term agreement.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2012/04/17/from-qa-wfo-to-ai-va-vpi-introduces-virtualsource/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Nuance &#8220;Fast Start&#8221; Program: Reduces Time and Expense of Natural Language Implementations</title>
		<link>http://opusresearch.net/wordpress/2011/03/07/nuance-fast-start-program-reduces-time-and-expense-of-natural-language-implementations/</link>
		<comments>http://opusresearch.net/wordpress/2011/03/07/nuance-fast-start-program-reduces-time-and-expense-of-natural-language-implementations/#comments</comments>
		<pubDate>Tue, 08 Mar 2011 00:15:35 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Automated Speech Recognition]]></category>
		<category><![CDATA[Contact Center]]></category>
		<category><![CDATA[Natural Language Understanding]]></category>
		<category><![CDATA[Nuance Communications]]></category>
		<category><![CDATA[Opus Research]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=4211</guid>
		<description><![CDATA[A new program from Nuance Communications, informally called "Call Steering Fast Start," will introduce shorter deployment times and lower costs for firms that can benefit from taking a constrained approach to speech-based Call Steering. ]]></description>
			<content:encoded><![CDATA[<p><a href="http://opusresearch.net/wordpress/wp-content/uploads/2009/08/NuanceLogo.png"><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/08/NuanceLogo.png" alt="" title="NuanceLogo" width="166" height="107" class="alignright size-full wp-image-1194" /></a>The Experience Curve lives! Perhaps more so than ever in this age of machine learning and artificial intelligence, made all-too-apparent when IBM&#8217;s Watson was able to defeat the all-time money winners on Jeopardy! But how, you may ask, will this benefit (rather than humble) mere mortals. I was just briefed by Jeff Foley, Senior Manager of Solutions Marketing, at Nuance Communications and he described a new program, informally called &#8220;Call Steering Fast Start,&#8221; which will introduce shorter deployment times and lower costs for firms that can benefit from taking a constrained approach to speech-based Call Steering. </p>
<p>The program serves as an antidote to the reticence that IT and customer care professionals have shown (especially during the economic downturn) toward launching the lengthy design, deployment and debugging processes associated with natural language understanding (NLU) and enhanced call processing. The pitch is quite simple. Nuance first shows that callers benefit from shorter wait times and fewer call misdirections when a well-designed Call Steering app is deployed. Then they demonstrate how companies can benefit from shorter wait times and greater cost certainty when they adhere to a fixed scope deployment approach.</p>
<p>The term &#8220;fixed scope&#8221; is well chosen. Based on years of experience with Call Steering implementations among large global companies across multiple vertical industries, Nuance understands the application characteristics and communications architectures that will benefit from Fast Start. It has distilled those learnings into a &#8220;configuration file&#8221; which provides a foundation and design framework for getting started. Customers and clients are not being offered a &#8220;crippled app&#8221; or proof-of-concept, instead they are deploying the same code that has worked in full scale deployments. </p>
<p>The constraints have to do with the demographics and architecture of targeted companies. Based on experience in the field, Nuance recognizes that Fast Start will work most effectively for firms limited to 80-100 call destinations and 5-10 instances that require serious interpretation of &#8220;disambiguation&#8221; of a caller&#8217;s intent. In those cases, implementations can be completed in as few as 90 days and the total start up costs are a significant discount from the &#8220;full service&#8221; approach.</p>
<p>The offer seems very attractive to healthcare providers and utilities where companies have high standards for customer satisfaction but often lack the call volumes that justify investment in NLU. As Foley explained, Nuance had observed a threshold in the 10-12 million calls per year range (with some of the largest systems handling over 100 million calls each year). Fast Start can make Call Steering attractive when annual call volumes are less than 4 million. The clear message coming from Nuance is that &#8220;Natural Language Call Steering&#8221; is not too expensive for mid-sized businesses.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2011/03/07/nuance-fast-start-program-reduces-time-and-expense-of-natural-language-implementations/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Dial Directions Acquired by Sakhr Software; Launches Mobile Arabic Translator</title>
		<link>http://opusresearch.net/wordpress/2009/06/30/dial-directions-acquired-by-sakhr-software-launches-mobile-arabic-translator/</link>
		<comments>http://opusresearch.net/wordpress/2009/06/30/dial-directions-acquired-by-sakhr-software-launches-mobile-arabic-translator/#comments</comments>
		<pubDate>Tue, 30 Jun 2009 16:35:23 +0000</pubDate>
		<dc:creator>Dan Miller</dc:creator>
				<category><![CDATA[CAT Scans]]></category>
		<category><![CDATA[Mobile Translation]]></category>
		<category><![CDATA[Natural Language Understanding]]></category>
		<category><![CDATA[Recombinant Telephony]]></category>

		<guid isPermaLink="false">http://opusresearch.net/wordpress/?p=892</guid>
		<description><![CDATA[Dial Directions has been acquired by Arabic Natural Language Understanding specialist Sakhr Software and introduced a mobile translator application for iPhones and Blackberries.]]></description>
			<content:encoded><![CDATA[<p><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/06/Picture-12.png" alt="Picture 1" title="Picture 1" width="131" height="32" class="alignleft size-full wp-image-894" />There&#8217;s an interesting endgame to the Dial Directions story, as the pioneering voice search company <a href="http://www.businesswire.com/portal/site/home/permalink/?ndmViewId=news_view&#038;newsId=20090629006246&#038;newsLang=en">announced today</a> that it has been acquired by Sakhr Software, a company that specializes in natural language processing in Arabic. The resulting company will have 200 employees. Dial Directions&#8217; CEO Adeeb Shanaa will have the same title at Sakhr, while the cheif executive of Sakhr, Fahad Al Sharekh, is elevated to chairman of the combined company, with the objective of building more business alliances and partnerships.</p>
<p><img src="http://opusresearch.net/wordpress/wp-content/uploads/2009/06/DialDirections1.jpg" align='right'  HSPACE=10 vspace=10/>One of the early fruits of the partnership is an application for the Apple iPhone and RIM Blackberry that performs real time translation between spoken English and Arabic. <a href="http://www.youtube.com/watch?v=rW9m9230LnA">This YouTube</a> video provides a demonstration. Mobile translation of this sort is something akin to the Holy Grail of natural language voice processing. Offering the function as a Smartphone app is a real coup. It benefits from nearly twenty years invested by Sakhr in building a knowledge base of Arabic phrases to support optical character recognition (OCR), machine translation (MT), data mining and search. </p>
<p>Sakhr&#8217;s long-standing expertise in the field attracted the attention of BBN Technologies, which incorporated the technology into its work on the U.S. Defense Advanced Research Projects Agency&#8217;s (DARPA&#8217;s) GALE Project (referring to &#8220;Global Autonomous Language Exploitation&#8221; efforts to capture and recognize huge amounts of spoken words and text in a variety of languages. IBM and SRI have been vying with BBN over the past few years to make breakthrough advances in &#8220;natural language understanding&#8221; across a multitude of languages, dialects and modalities.</p>
<p>The mobile application is not related to GALE, but it is an impressive demonstration of Sakhr&#8217;s underlying technology. It is the product of a partnership between Dial Directions and Sakhr that dates back to 2008 when the two companies formed a developmental partnership to address &#8220;language application technology for mobile, cloud-computing environments.&#8221; The result was a &#8220;first of its kind open speech-to-speech mobile translation application for the U.S. government and business customers.&#8221;</p>
<p>The video demonstration is, indeed, impressive. The challenges surrounding transcription (even in a single language) are formidable. As the company brings the application to the marketplace it will need to manage user expectations surrounding &#8220;accuracy&#8221; &#8211; which is a very tough nut to crack in the transcription world where mis-recognizing a single word can distort the meaning of an entire utterance. In the commercial marketplace, that is the proverbial, technological elephant in the room.</p>
]]></content:encoded>
			<wfw:commentRss>http://opusresearch.net/wordpress/2009/06/30/dial-directions-acquired-by-sakhr-software-launches-mobile-arabic-translator/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

