Ford Motor Co. claims to have tamed voice recognition with the latest version of its Sync technology by upping the number of commands a hundredfold while simplifying their grammar.
Sync technology is used to connect mobile devices, such as smartphones, to an in-vehicle network that is controlled with voice commands. The improved Sync now allows any command to be directly spoken and recognized in natural language, whereas with most other voice recognition systems, a speaker must identify the pertinent menu of commands before issuing the desired command.
"We have flattened the hierarchy of the grammar," said electrical engineer Jim Buczkowski, a Henry Ford Technical Fellow and director of global electrical and electronics systems engineering for Ford. “Now there are many more commands at the first level. For instance, instead of saying 'phone' first, then saying who you want to call, now you can just say, 'Call Jim.'"
Ford Sync was co-developed with Microsoft on the Windows Embedded Automotive software platform and uses voice recognition algorithms licensed from Nuance Communications Inc. The Sync technology was recently integrated with MyFord Touch, which uses a dash-mounted resistive touchscreen to control vehicle functions.
"We work with many automobile makers, but the Ford Sync system is probably our most sophisticated implementation to date," said Brian Radloff, director of automotive solutions at Nuance. "For instance, on the destination entry side you can now speak an entire address in a single utterance– such as 100 Main St., Detroit, Michigan– instead of having to speak the street number, street name, city and state separately.
"Sync also uses our latest algorithms for on-the-fly adaptation, so its accuracy increases the more you use it."
Most functions—from tuning the radio to asking for directions or even calling 911—can now be controlled by touch or voice. The second-generation Sync also recognizes thousands of aliases that make access more convenient for drivers by not requiring them to remember specific commands. For instance, saying "warmer," "increase temperature" or "temperature up" will all bump up the thermometer reading in the cabin. Simplifications to Sync also help drivers control their own mobile devices with voice.
For instance, when you first buy a Sync-equipped vehicle it will automatically index the names and addresses on your smartphone as well as the names of the bands and individual tracks on your music player. Within a few minutes, those indexed entries can be accessed by voice commands.
Further, the system will convert text messages to audio and read them using Nuance's text-to-speech engine. Drivers can "text back" via voice control by choosing one of up to 20 predefined text responses.
Ford has also been busy persuading developers of such popular Web applications as MapQuest and Google Maps to add "Send to Sync" buttons to their Web sites. Once you find a destination using the mapping program on your PC, all you need to do is hit the "Send to Sync" button and fill in your cell phone number. Then, when you enter your Ford vehicle, the map you selected in MapQuest or Goggle Maps will automatically be uploaded to the dashboard display.
Ford’s "Traffic, Directions and Information" integrates its voice command structure with GPS and Bluetooth connectivity to provide personalized traffic reports; turnbyturn directions; and instant access to news, sports and weather. Destination-related queries can be made, such as "Find a hotel."
Ford provides in-vehicle WiFi service for any driver who plugs a 2G, 3G or 4G modem into a special USB slot. Unlike GM’s OnStar system, which includes a cell phone connection on the vehicle, Ford's Sync depends on the connectivity link provided by the user’s cell phone.
Ford is also allowing third-party developers to target its in-vehicle network with the AppLink application programming interface (API), which lets apps add unique capabilities that can be controlled via voice commands.
"For instance, Pandora’s app has added our API so that you can use Sync to stream music from the cloud in your car with-voice commands," said Buczkowski. Several other apps are already Sync compatible, such as the Stitcher app for radio news and the OpenBeak app for Twitter, with more slated to add AppLink compatibility this year.
The voice recognition technology is currently fluent in English, French and Spanish, and Ford plans to expand Sync’s vocabulary to 21 languages next year. Sync has been installed in more than 3 million Ford vehicles and will be made available on 80 percent of the company’s production vehicles over the next four years, according to Buczkowski.
I can see now that we have apps everywhere!
On the mobile phones, now on the car! Yesterday I was reading about the vision Qualcomm's CEO has of the future, that is a world where silicon chips are found in things as different as running shoes, clothing, and the like and now I see that big companies like Microsoft and Ford are also stepping on the same path. On the other hand, the US might be missing more engineers that they would like but I was reading also about the fact that events like the IBM machine who won Jeopardy (did it won? or almost) pulled attention from the youngsters and now Computer Science is gaining adepts. I think the future will be filled with electronics, computing and robots. Cool! What do you think?
You are completely correct, Luis. Many are embedding electronics into shoes, clothing, accessories and other "all-Internet" devices around us. We will be connected to everybody everywhere. We just need to solve the signal and EMI interference problems, and a few other issues. Microsoft and Ford are only a couple of bigwigs that are bringing the next stage of natural connectivity to drivers. Many more are embedding intelligence in cars. Check out the upcoming webinar on the subject: http://www.eetimes.com/electrical-engineers/education-training/webinars/4216833/Designing-Intelligence-into-the-Car
apps everywhere, I think not. apps, in the apple sense are about over. connectivity is not a passing fad, though it's not going to be easy or fast. consider protocols, especially authorization and authentication. having your shoes and clothing report to you sounds great, but how do you ensure they're not merely broadcasting? oh, no - my pants have been hacked! trivial protocols like BT pairing can't be extended to that kind of scale.
consider also issues of privacy - what are the legal ramifications when our _stuff_ constantly has us under surveillance. do you want your underwear's activity log to be subpoena-able? if you configure your stuff to forget in some time window, how do you know the firmware is really, truely forgetting?
David Patterson, known for his pioneering research that led to RAID, clusters and more, is part of a team at UC Berkeley that recently made its RISC-V processor architecture an open source hardware offering. We talk with Patterson and one of his colleagues behind the effort about the opportunities they see, what new kinds of designs they hope to enable and what it means for today’s commercial processor giants such as Intel, ARM and Imagination Technologies.