Next: Computer Education Laboratory Up: Department of Computer Previous: Computer Communications Laboratory

Multimedia Devices Laboratory

/ Qiangfu Zhao / Associate Professor
/ Jintae Lee / Assistant Professor
/ Ruck Thawonmas / Assistant Professor

The multimedia devices presently available add sound, still pictures, and moving pictures to traditional computer displays which show only text and diagrams. The addition of these few functions, however, offers remarkable convenience to the user of such multimedia devices. For this reason, it is widely believed that the impact of these devices on human society in the near future will be extremely great.

The properties desirable for multimedia devices can be visualized on a graph with two axes. The horizontal axis represents a device's capacity for depth and variety of communication and the vertical axis represents the capacity for naturalness and high fidelity. Ideally, multimedia devices should be far along both axes if they are to offer the most comfortable and efficient human interfaces. Unfortunately, progress along these dimensions has not gone far enough. Prolonged use of computers still fatigues the body and fails to match the depth and quality of communication that can exist between humans in face-to-face encounters.

For example, human beings can communicate with great ease and variety through the spoken word. Computers can't. In spite of the fact that speech synthesis by computers is almost perfect, speech recognition has not reached the level of practical application that natural speech can achieve. Furthermore speech recognition algorithms are overly sensitive to existing noise. The area of speech recognition is still in the primitive stages and needs to evolve further to make multimedia devices more effective.

Future multimedia systems should be able to accept and recognize natural information. While traditional multimedia systems provide more and more information, future systems should be able to select, and provide useful and important information. One project in this laboratory is to study how to build such kind of intelligent multimedia systems. Our short-term goal is to build a system that can recognize 3D characters and documents. The long-term goal is to construct a system that can recognize nature images and speeches.

Sign language is another means of human communication that computers cannot participate in as easily as those who are forced to use it because of hearing impairment. If a computer could be made to communicate via this medium, far more people could access intellectual services and actively participate in multimedia that have been unable to do so thus far. At present, pioneer research to incorporate this dimension of communication as far as synthesis and recognition technology has just begun in our laboratory.

As for the dimension of high fidelity, the digitalization of high definition TV is one example of progress that has been made along the vertical axis. Signal processing technology is indispensable for the generation, processing, and recognition of visual, auditory and control signals. Furthermore, multimedia devices are preferable to work in real-time. These requirements can be fulfilled not only by high computing processor speeds and large memories, but also by fast algorithms. Therefore, the study of signal processing is fundamental in our laboratory.

Refereed Journal Papers

Qiangfu Zhao, Stable on-line evolutionary learning of NN-MLP. IEEE Trans. Neural Networks, vol.8, no.6, p.1371-1378, 1997.
Lee, J., Sign Language and Engineering Assistance. Journal of the Society of Biomechanisms, vol.20, no.4, p.181--184, 1996.
Lee, J., Physically-based Modeling of Brush Painting. Computer Networks and ISDN Systems, vol.29, p.1571--1576, 1997.

Refereed Proceeding Papers

Qiangfu Zhao, A Co-Evolutionary Algorithm for Neural Network Learning. Proc. IEEE Inter. Conf. on Neural Networks, p.432-437, IEEE, IEEE, June 1997.
Qiangfu Zhao, Evolutionary design of decomposable systems based on EditEr. Proc. Inter. Symp. on System Life, p.127-132, Japan Society of Mechanical Engineers, JSME, July 1997.
Qiangfu Zhao, A society model for co-operative co-evolutionary algorithms. Proc. Inter. Conf. on Neural Information Processing, p.444-447, APNNA, Springer, Nov. 1997.
Qiangfu Zhao, EditEr: A Combination of IEA and CEA. Proc. IEEE Inter. Conf. on Evolutionary Computation, p.641-645, IEEE, Apr. 1997.
Lee, J., Creating Soft Painting Tools Inside the Computer. 1997 IASTED International Conference SOFTWARE ENGINEERING, Hamza, M. H., p.114--118, IASTED, IASTED Press, Nov. 1997.

Grants

Jintae Lee, Science and Technology Agency of Japan, Research on automatic translation of Japanese text into Japanese sign language, Dec. 1997.

Others

Tsuyoshi Ito, A flexible model for character recognition. The Univ. of Aizu, 1997. Thesis Advisor: Q. F. Zhao.
Fumihito Matsuno, A fixed model for character recognition. The Univ. of Aizu, 1997. Thesis Advisor: Q. F. Zhao.
Teppei Suzuki, A study on the effect of preprocessing on character recognition. The Univ. of Aizu, 1997. Thesis Advisor: Q. F. Zhao.
Koichi Sakamoto, A study on image morphing. The Univ. of Aizu, 1997. Thesis Advisor: Q. F. Zhao.
Masakazu Ichimura, Neural network learning based on GA. The Univ. of Aizu, 1997. Thesis Advisor: Q. F. Zhao.
Tanaka, T., Dynamic Animation of Fingerspelling. The Univ. of Aizu, 1997. Thesis Advisor: J. T. Lee.
Takahashi, T., A Study on Dictionary Structure for Sign Language Animation. The Univ. of Aizu, 1997. Thesis Advisor: J. T. Lee.
Ito, Y., Automatic Figure Positioning for Sign Motion Rotoscoping. The Univ. of Aizu, 1997. Thesis Advisor: J. T. Lee.

Next: Computer Education Laboratory Up: Department of Computer Previous: Computer Communications Laboratory

www@u-aizu.ac.jp
December 1998