Next: Computer Education Laboratory Up: Department of Computer Previous: Computer Communications Laboratory

Multimedia Devices Laboratory

/ Qiangfu Zhao / Associate Professor
/ Jintae Lee / Assistant Professor
/ Jie Huang / Assistant Professor
/ Ruck Thawonmas / Assistant Professor

The multimedia devices presently available add sound, still pictures, and moving pictures to traditional computer displays which show only text and diagrams. The addition of these few functions, however, offers remarkable convenience to the user of such multimedia devices. For this reason, it is widely believed that the impact of these devices on human society in the near future will be extremely great.

The properties desirable for multimedia devices can be visualized on a graph with two axes. The horizontal axis represents a device's capacity for depth and variety of communication and the vertical axis represents the capacity for naturalness and high fidelity. Ideally, multimedia devices should be far along both axes if they are to offer the most comfortable and efficient human interfaces. Unfortunately, progress along these dimensions has not gone far enough. Prolonged use of computers still fatigues the body and fails to match the depth and quality of communication that can exist between humans in face-to-face encounters.

For example, human beings can communicate with great ease and variety through the spoken word. Computers can't. In spite of the fact that speech synthesis by computers is almost perfect, speech recognition has not reached the level of practical application that natural speech can achieve. Furthermore speech recognition algorithms are overly sensitive to existing noise. The area of speech recognition is still in the primitive stages and needs to evolve further to make multimedia devices more effective.

Future multimedia systems should be able to accept and recognize natural information. While traditional multimedia systems provide more and more information, future systems should be able to select, and provide useful and important information. One project in this laboratory is to study how to build such kind of intelligent multimedia systems. Our short-term goal is to build a system that can recognize 3D characters and documents. The long-term goal is to construct a system that can recognize nature images and speeches.

Sign language is another means of human communication that computers cannot participate in as easily as those who are forced to use it because of hearing impairment. If a computer could be made to communicate via this medium, far more people could access intellectual services and actively participate in multimedia that have been unable to do so thus far. At present, pioneer research to incorporate this dimension of communication as far as synthesis and recognition technology has just begun in our laboratory.

As for the dimension of high fidelity, the digitalization of high definition TV is one example of progress that has been made along the vertical axis. Signal processing technology is indispensable for the generation, processing, and recognition of visual, auditory and control signals. Furthermore, multimedia devices are preferable to work in real-time. These requirements can be fulfilled not only by high computing processor speeds and large memories, but also by fast algorithms. Therefore, the study of signal processing is fundamental in our laboratory.

Refereed Journal Papers

Qiangfu Zhao, Co-evolutionary learning of neural networks. International Journal of Intelligent and Fuzzy Systems, 1998, vol.6, No.1, pp.83-90.
Compared with the conventional approaches, the evolutionary algorithms (EAs) are more efficient for system design in the sense that EAs can provide higher opportunity for obtaining the global optimal solution. However, in most existing EAs, an individual corresponds directly to a possible solution, and a large amount of computations are required for designing large-scaled systems. To solve this problem, this paper proposes a co-evolutionary algorithm (CEA). The basic idea is divide and conquer: divide the system into many small homogeneous modules, define an individual as a module, find many good individuals using existing EAs, and put them together again to form the whole system. To make the study more concrete, we focus the discussion on the evolutionary learning of neural networks for pattern recognition. Experimental results are provided to show the procedure and the performance of the CEA.
Lee, J., Techniques of Sign Motion Synthesis by Computer Animation. Japanese Journal of Medical Electronics and Biological Engineering, 1998. vol.12, No.8, pp.52--58.
Lee, J., Categorization of Visual Languages. The Journal of Three Dimensional Images, 1998, vol.12, No.3, pp.13--16.
Huang, J., Ohnishi, N. and Sugie, N., Echo Avoidance in a Computaional Model of the Precedence Effect. Speech Communication, 1999. vol.27, No.3-4, pp.223--233.
Huang, J., Supaongprapa, T. and Terakura, I. and Wang, F. and Ohnishi, N. and Sugie, N., A Model Based Sound Localization System and Its Application to Robot Navigation. Robotics and Autonomous Systems, 1999. vol.27, No.4, pp.199-209.

Refereed Proceeding Papers

T. Suzuki and Q. F. Zhao, On-line evolutionary learning of NN-MLP. Third Int. Symp. on Artificial Life and Robotics, pp.740-743, International Society for Artificial Life and Robotics, ISAROB, Jan.1998.
To design the nearest neighbor based multilayer perceptron (NN-MLP) efficiently, we have proposed a non-genetic evolutionary algorithm called the R4-rule. For off-line learning, the R4-rule can produce the smallest or nearly smallest networks with high generalization ability by iteratively performing four basic operations: recognition, remembrance, reduction and review. For on-line learning, we have introduced two parameters: the review limitation beta and the reduction rate gamma. These two parameters are very useful for stabilizing the algorithm, provided that their values are chosen properly. In this paper, we propose a method for finding good parameters based on a simple genetic algorithm (SGA). The efficiency of combining the genetic algorithm with the non-genetic R4-rule is illustrated using experimental results.
Q. F. Zhao, A general framework for cooperative co-evolutionary algorithms: a society model. IEEE Inter. Conf. on Evolutionary Computation, pp.57-62, IEEE, May 1998.
Compared with the conventional algorithms, the evolutionary algorithms (EAs) are usually more efficient for system design because they can provide higher opportunity for obtaining the global optimal solution. However, the EAs cannot be used directly to design large-scale systems because a large amount of computations are required. To solve this problem, many approaches have been proposed in the literature. The cooperative co-evolutionary algorithms (CCEA) is possibly one of the most efficient approaches. The basic idea of most CCEAs is divide-and-conquer: divide the system into many modules, define an individual as a candidate of a module, assign a population to each module, find good individuals within each population, and put them together again to form the whole system. In this paper, we generalize our earlier studies, and introduce a society model for the study of CCEAs. Based on the society model, we will formulate existing CCEAs in a general framework. We will also provide several case studies, all of which are interesting topics for future researches.
M. Shirasaka, Q. F. Zhao, O. Hammami, K. Kuroda and K. Saito, Automatic design of binary decision trees based genetic programming. Second Asia-Pacific Conf. on Simulated Evolution and Learning, Evolutionary Programming Society, EPS, Nov.1998.
It is known that decision trees are very efficient for pattern recognition. If we design a decision tree first, and then map it into a neural network, the network so obtained will have far fewer connections than that designed directly by using conventional (say, BP) learning algorithms. This is because, during the design of a decision tree, useful features can be selected automatically, and then be used in an efficient way. In practice, however, decision trees cannot be used easily because the optimal design of decision trees is an NP-complete problem. In this paper, we apply the genetic programming (GP) to design of decision trees. To make the discussion more concrete, we will focus our discussion on a character recognition problem. The efficiency of the GP is verified through experimental results.
Lee, J., Specifying High Dimensional Visual Languages with Generalized Logic Grammars. 1998 International Conference SOFTWARE ENGINEERING, pp.65--68, IASTED, IASTED Press, Oct.1998.
Lee, J., Categorization of Visual Languages. First International Conference on Human and Computer, pp.43--46, The University of Aizu, Sep.1998.
Huang, J., Ohnishi, N. and Sugie, N., Spatial Localization of Sound Sources: Azimuth and Elevation Estimation. Proc. Instrum. Meas. Technol. Conf., IEEE, pp.330--333, St. Paul, May 1998.
Huang, J., Ohnishi, N., Guo, X. and Sugie, N., A Computational Model of Echo Avoidance in Human Audition. Proc. Int. Conf. Neural Networks and Brain, Beijing, CNNC, pp.61--66, Oct. 1998.

Grants

Jintae Lee. Ministry of Science and Technology, Special Coordination Funds for Participation of Aged and Disabled Persons, 10 million yen, 1998.

Academic Activities

Qiangfu Zhao. Programming committee member of IEEE ICEC'98, May 1998.
Qiangfu Zhao. Organizer and Chair of special invited session at ICEC'98, May 1998.
Qiangfu Zhao. Reviewer of ICONIP'98, October 1998.
Qiangfu Zhao. Programming committee member of SEAL'98, November 1998.
Jintae Lee. Session chair of 1998 International Conference SOFTWARE ENGINEERING. Oct.1998.

Others

M. Shirasaka, Automatic design of binary decision trees based on GP. Master Thesis, The Univ. of Aizu, 1998. Thesis Advisor: Q. F. Zhao.
T. Suzuki, On line evolutionary learning of neural networks based on IEA and GA. Master Thesis, The Univ. of Aizu, 1998. Thesis Advisor: Q. F. Zhao.
A. Oshimo, Implementation of a neuron processor using VHDL. The Univ. of Aizu, 1998. Thesis Advisor: Q. F. Zhao.
F. Okuya, Implementation of a high speed circuit for weighted sum using VHDL. The Univ. of Aizu, 1998. Thesis Advisor: Q. F. Zhao.
M. Ishii, Implementation of GA using JAVA and its application to neural network learning. The Univ. of Aizu, 1998. Thesis Advisor: Q. F. Zhao.
M. Takamura, DSP Board PCI/C44: development and application. The Univ. of Aizu, 1998. Thesis Advisor: Q. F. Zhao.
T. Yanagiya, Evolutionary design of cellular automata for associative memory. The Univ. of Aizu, 1998. Thesis Advisor: Q. F. Zhao.
Tezuka, M., Modeling of Geographical Shapes in Nature Using Fractals. The Univ. of Aizu, 1998. Thesis Advisor: J. T. Lee.
Obata, Y., Linguistic Comparison of Japanese Sign Language and Pidgin Sign Japanese. The Univ. of Aizu, 1998. Thesis Advisor: J. T. Lee.
Tadataka, O., Evaluation of PC-based Interactive Rotoscoping System. The Univ. of Aizu, 1998. Thesis Advisor: J. T. Lee.
Takahashi, Y., Face Modeling Considering Level of Expression Detail. The Univ. of Aizu, 1998. Thesis Advisor: J. T. Lee.
Takahashi, K., Muscle-Based Facial Animation. The Univ. of Aizu, 1998. Thesis Advisor: J. T. Lee.

Next: Computer Education Laboratory Up: Department of Computer Previous: Computer Communications Laboratory

www@u-aizu.ac.jp
November 1999