Opus Research is a long-standing and unrepentant booster of speech-enablement. We stand for judicious use of advanced speech recognition (ASR), text-to-speech conversion (TTS) and all the attendant hardware, software, professional services and tooling (soaking up something on the order of $28 billion in 2005) directed at a single goal. That is to take existing self-service infrastructure – on a Web site, in front of a corporate directory ‘autoattendant’, baked into a contact center or hosted environment – and “make it talk.â€
“Make it Talk†should not be confused with efforts to “Make Speech Mainstream.†If the years between 1999 and 2005 have taught us anything it is that speech is seldom, if ever, the mainstream. It is better thought of as a tributary; one of many paths into the frequently used channels of electronic commerce. It is safe to predict that the year 2005 is emphatically not going to be The Year of Speech Enablement. Rather, it is Yet Another Year of Intelligent Migrations (YAYIM): from contact center to IT-driven customer service, from siloed to integrated solutions, and from switched networks to IP-Telephony.
Always at the Threshold
Since the dawn of this millennium, ‘some other IT project’ has been perched ahead of speech recognition in the backlog of mission critical IT projects. Y2K conformance was the must-do change in enterprise IT in 1999. Then came the World Wide Web and a series of e-commerce mandates to suck the air out of the prospects of adding speech recognition to self service resources.
The bursting of the dot.com bubble, coupled with a general depression in the telecom sector, kept a heavy boot on speech-enablement’s air hose during the next couple of years. Thus we entered 2004 with great hope for conversational access technologies to accelerate and improve upon ways for people to carry out their interaction with agents, IVRs, auto-attendants or hosted server farms that comprise the underpinnings of phone-based commerce.
The technologies work better than ever. What’s missing is a clear set of guidelines – a migration path – from what-is-now to what-oughtta-be. In point of fact, there’s no Yellow Brick Road through the forest, underbrush and poppy fields between now and the speech-enabled future. Even the term of a “Wizard of Oz†has been hijacked to describe an application development technique where live people merely simulate the processes that could ultimately be mechanized by a well-designed ASR system.
The Silver Lining
In the ensuing 12 months, the IT staff is going to have its hands full with all sorts of fire drills. “Compliance†and “Security†issues reign supreme with IT these days. In the U.S. compliance to the security and disclosure strictures of Sarbanes-Oxley has IT managers rushing to install new utilities to beef up the ability to control and monitor access to enterprise data. IT analysts see the expense of SOx compliance exceeding that of Y2K hardening in terms of dollars spent. That alone could set the cause of speech enablement back another 12 months. The silver lining is a boomlet expected for deployment of speech-enabled automated systems for PIN discovery and reset.
The inexorable migration to Voice over IP (VoIP) conversion is also a major mandate, especially among multi-national, multi-site enterprises. The efforts of Cisco, Avaya, Alcatel and the horde of telecom infrastructure vendors has accelerated enterprise deployment of IP-based networks for traditional voice conversations, as well as impromptu conferencing and a few other IP-enabled enhanced services. VoIP conversion does not, in-and-of-itself, bring ASR or TTS into play. Nonetheless, the definition and deployment of ‘VoIP+Speech’ solutions will bring tens of millions of dollars to vendors or teams of vendors that demonstrate how an enterprise can use automated speech in conjunction with IP-telephony to leverage components of self-service infrastructure that are already bought-and-paid-for.
Common ground is defined by company-wide mandates to hit well-specified business objectives. Significant technology ‘wins’ used to come at the expense of ‘seats’ in corporate contact centers, in the name of saving money on customer service. 2003 marked the sea change from cost savings to revenue enhancement. 2004 added the idea of using technologies to promote customer retention and upselling. The underlying mandate has been to “use what we bought already†and transcend traditional divisions between and among departments.
The Integrators Dilemma
…or should we say “opportunityâ€. Enterprise decisionmakers have multiple legacy systems to deal with, often with different operating systems and solution sets in their call center, their front office, their Web sites and their back office. Each vendor has a migration or upgrade strategy. Each vendor’s system has internal personnel responsible for care, feeding and perpetuating existing solutions.
In this context, incumbency is less important than the ability to combine multiple incumbent systems into a service delivery environment that is highly responsive to an enterprise’s stakeholders. That’s the historic role of system integrators, which means there are important roles to be played by the giants of the system integration business: IBM (Global Services and Business Consulting Services), Accenture, Siemens Consulting, EDS, CapGemini and a select others stand out as the movers in this respect. Of these, IBM, Siemens and EDS have the longest legacy in contact center operations and automation.
In the 2003-2004 timeframe many large enterprises in financial services, travel, healthcare and other verticals looked to hosted service providers – Convergys, Tellme and the hosted services subsidiaries of major telcos – to ‘front-end’ their self-service resources. In some cases this amounted to off-loading the technology headaches associated with getting geographically dispersed IVR and live agent resources to interact with one another in real-time.
The growth of VoIP+Speech within business enterprises, premises-based solutions – meaning those legacy systems or their replacements – are assuming increased importance. IBM, Microsoft, Cisco, Avaya and their partners have an increasingly vital role to play – not at the exclusion of hosted or managed options, but in conjunction with network resources. This should skew many of the early ‘wins’ toward vendors with toolsets that include business workflows, process automation, telephony routing that recognize that a single phone call from a customer or vendor already implicates resources or personnel that can be anywhere around the globe and residing on any vendor’s system.
In 2005, solutions that involve speech will address the immediate need to break down silos on a call-by-call basis, in real time. Improvement does not start with “an IVR upgrade or replacement†to support speech recognition. It starts with mapping VoIP+Speech to business objectives. Solutions providers who speak to top management about return on existing assets and have the tools to map solutions to stated business objectives and enterprise workflows will be the winners; and, when speech is deployed as one of many channels into self-service resources that are designed to enhance the user experience, the caller is the real winner.
Categories: Articles