• Join over 1.2 million students every month
  • Accelerate your learning by 29%
  • Unlimited access from just £6.99 per month

This essay is developed based on attempts to summarize the current state of speech and pen input technology and to identify its strengths, limitations and lastly, report on the key multimodal research challenge.

Extracts from this document...


“As applications generally become more complex, a single modality does not permit the user to interact affectively across all tasks and environments. A multi-modal interface offers the user freedom to use a combination of modalities or to switch to a better-suited modality, depending on the specifics of the task environment.”

This essay is developed based on attempted to summarize the current state of speech and pen input technology and to identify its strengths, limitations and lastly, report on the key multimodal research challenge.

        Multimodal technology can be useful in many different environments such as multi-modal interaction for people with disabilities, multi-modal interaction for distributed applications, multimodal systems is emerging in which the user will be able to employ natural communication modalities, includingvoice, hand and pen-based gesture, eye-tracking, body-movement.

        Multimodality allows taking benefits in an optimal way of the human communication capacities. Multimodal interface aim at integrating several communication means in a harmonious way and thus make computer behavior close to human communication paradigms, and multimodal is very easy to learn and use

        Major evolution in new input technologies and algorithms, hardware speed, distributed computing and spoken language, and spoken language technology in particular all have supported the emergence of more transparent and natural communication with this new class of multimodal system. (Designing the user interface for multimodal speech and pen-based gesture applications, 2002, p422).

...read more.


        Pen input technology have advantage of allow users to engage in more powerfully expressive and transparent information-seeking dialogues in human language technology form. Speech is the preferred medium for subject, verb, and object expression. Compare with speech-only interaction to speech and pen interaction for visual-spatial tasks, multimodal pen or voice interaction can result in 10 percent faster in completion time, 36 percent fewer task-critical errors, shorter and simpler linguistic constructions, 90 to 100 percent user preference to interact this way, and 50 percent fewer spontaneous disfluencies.

        Compare to unimodal recognition-based interface, multimodal interface design has particular advantageous feature which is can support superior error recovery. There are both user-centered and system-centered reasons why multimodal system facilitates error recovery. First, in a multimodal interface users intuitively pick the mode that is less error-prone. Second, in a multimodal interface user language is often simplified. Third, users intuitively switch modes after an error, so the same problem is not repeated. Fourth, users report less subjective frustration with errors when interacting multimodally, even when errors are as common as in a unimodal interface. Lastly, a well-designed multimodal architecture can support mutual disambiguation.

        While there are a lot of large individual have different way to communicate or interact with the computer, a multimodal interface allow users to control or to make their on selection how to communicate or interact with the computer.

...read more.


        In conclusion, interest in multimodal interface design growing largely by the goal of supporting more transparent, flexible, efficient, ease use, and powerfully expressive means of human-computer interaction. Multimodal interface is important nowadays not only very useful for difference ages, skill level, or even for disabilities people, but also in dealing with business environment. With multimodal interface system business environment will be more efficiently for example word processing using the speech recognition or pen input technology.

        However, there are several limitation for the multimodal system, which is  speech and pen input systems are not cost effective in other word still relatively expensive, both in terms of software, additional hardware needed and memory requirements, some care is needed before deciding that speech and pen input will benefit a particular user. And multimodal interface system needs to adapt so that their robustness can be enhance. Therefore there are two candidates for system adaptations are user-centered and environmental parameters.

Reference List

  • Bolt, R.A, (1980). Put-that-there: Voice and gestures at the graphics interface, Computer graphics, 14, 3, 262-270
  • Cohen, P.R., McGee, D., Oviatt, S., Wu, L., Clow, J., King, R., Julier, S., and Rosenblum, L., (1999). Multimodal interaction for 2D and 3D environments, IEEE Computer Graphics and Applications, 19, 4, 10 -13, IEEE Press
  • Landay, J., Larson, J., and Ferro, D., (2002). Designing the user interface for multimodal speech and pen-based gesture applications: State-of-the-art systems and future research directions, In Carroll, J.M. (Ed), Human-Computer interaction in the new millenium, New York: ACM Press, Addison-Wesley.
  • http://www.lobby7.com/press_121001.htm

...read more.

This student written piece of work is one of many that can be found in our University Degree Computer Science section.

Found what you're looking for?

  • Start learning 29% faster today
  • 150,000+ documents available
  • Just £6.99 a month

Not the one? Search for your essay title...
  • Join over 1.2 million students every month
  • Accelerate your learning by 29%
  • Unlimited access from just £6.99 per month

See related essaysSee related essays

Related University Degree Computer Science essays

  1. Information systems development literature review. Since the 1960s Methodologies, Frameworks, Approaches and CASE ...

    This paper is useful as a wide range of topics are discussed. Strengths: A detailed description of the Object-Oriented methodology is provided with supporting evidence from literature research findings. The literature was comprehensible and the distinction between the different methodologies was evident.

  2. Lifecycle Management Of Information Technology Project In Construction

    for work within their traditional ?cope)-yet none of the?e exi?ting ?y?tem? capture? all of multi-dimen?ional and integrated nature of propo?ed approach. Mo?t traditional tool? would become more efficient, and ?ome would increa?e in functionality, becau?e of ability to ?hare project information through third era ICT (?uch a? IFC-ba?ed data exchange).

  1. Develop a Puzzle Website for users of three different age groups, Kids, Teenagers and ...

    for accessing a database. Another new tool we used in the web design along with Front Page was Dream Weaver as it had a few features on it that Front page didn't have like template design features. Other changes involved implementing JavaScript mainly for validation purposes and to implement Snake

  2. The project explains various algorithms that are exercised to recognize the characters present on ...

    or EDTV (progressive) signals. 4.3 DM6437 Functional Overview Functional block diagram of DM6437 is shown in Figure 4.2. Only Video Processing Subsystem is explained briefly in this project. Figure 4.2 Functional Block Diagram of DM6437 [6] The DM6437 device includes a Video Processing Subsystem (VPSS)

  1. Geometric Brownian Motion. The aim of this project is to gain an understanding ...

    This process does not use the historical stock prices as they are not useful. [2] 2.5 Wiener process A wiener process is a specific type of Markov process where the random variable z is drawn from a normal distribution with a mean of 0 and a standard deviation of 1.

  2. An Introduction to the IEEE 802.11p WAVE standard

    APPLICATIONS There will be a range of applications provided by vehicular communication networks. Many of these will depend on what vendors and manufacturers deem as worthwhile implementations, most likely from an economic point of view. However, the primary application of WAVE, and the chief reason for its development, will address issues of motorist safety.

  1. Methods and technology used in Computer Forensics

    One of the most famous civil actions in history was a consolidation of actions against Microsoft Corporation in the case, United States v Microsoft. Amongst the allegations against the software giant was numerous breaches of antitrust laws. Included in these were allegations that then Microsoft executive, Paul Maritz, now CEO

  2. Network report for Middlesex University. The current network design is a star topology with ...

    Each IP subnet is restricted to one wiring-closet switch. This design features no spanning-tree loops and no VLAN trunking to the wiring closet. Each gigabit uplink is a native routed interface on the Layer 3 switches in the distribution layer. Load Balancing: For Load balancing one approach is that the Distribution-layer switch on the left is designated the HSRP

  • Over 160,000 pieces
    of student written work
  • Annotated by
    experienced teachers
  • Ideas and feedback to
    improve your own work