Next: Computer Solid State Up: Department of Computer Previous: Department of Computer

Computer Architecture Laboratory

/ Tuneo Ikedo / Professor
/ Michael Kishinevsky / Professor
/ Robert H. Fujii / Associate Professor
/ A. Y. Kondratyev / Associate Professor
/ Yamin Li / Associate Professor
/ Omar Hammami / Assistant Professor
/ Jianhua Ma / Visiting Researcher
/ Wanming Chu / Research Associate

Computer Architectrue Laboratory is organized with 7 faculty members and one visiting researcher. The followings re the summary of each members.

Prof. T. Ikedo:
Multimedia Processor Architecture: Multimedia processor has been developed since 1995, which implements font, graphics, audio, video processing in a single chip of 10 million gates. we focused on the architecture design of computer graphics renderer in fine grain processing in 1996 and 30Gflop reconfigurable multimedia accelerator (coarse grain processing). The development of reconfigurable accelerator is a succession of Aizu supercomputer project. 4 graduate students and 8 undergraduate students join in this research.
Profs. Y. Li and W. Chu:
1. Research: (1) Parallel Multithreaded Architecture We have examined various design issues of the parallel multithreaded architecture. A prototype of parallel multithreaded processor was designed. We are working on the performance evaluation in order to enhance the the computation capabilities of the processor. (2) Computer Arithmetic Algorithms and Implementations We have examined latency, throughput and complexity for various computer arithmetic algorithms and developed two new algorithms. The hardware designs are also finished and prototyped with FPGAs.
2. Teaching: We taught Computer Architecture (Li and Chu), Computer Organization I and Organization II (Li), and the laboratory of Logic Circuit Design (Chu). We also developed and updated two processor design projects: Sim2 processor design for the laboratory of Computer Architecture and Aizup pipeline processor design and implementation on FPGA for the laboratory of Computer Organization I. All of the teaching notes and laboratories handouts are prepared and available at Li's homepage, http://www.u-aizu.ac.jp/~yamin/.
Profs. M. Kishinevsky and A. Kondratyev:
Design Automation of Concurrenct and Asynchronous Systems: We have reached further progress in research in automation of asynchronous design, embedded reactive systems and models of concurrency. In a few research directions we continued close cooperation with Prof. A. Taubin (Computer Education Lab.) and with research groups in Cadence Berkeley Labs. (USA), Politecnico di Torino (Italy), University Politecnica de Catalunya (Barcelona, Spain) and University of Newcastle upon Tyne (England).
Research results: 1. Embedded systems and models of concurrency: (1). We have developed a new model, called Place Chart Nets (PCN). It allows the modelling of both asynchronicity and exception handling (preemption). PCNs specify a system behavior using partial orders. PCNs have a notion of hierarchy, which is determined by preemption. PCNs is a non-trivial generalization of classical PNs, in the sense that (1) for the finite (bounded) case modeling a PCN may require an exponentially larger PN, (2) for the infinite (unbounded) case a class of PCN languages properly includes a class of PN languages and (3) k-boundedness of PCNs is decidable. (2). We developed a method for synthesis of PCNs starting from labeled transition systems. We considered applications of PCNs for design of embedded reactive systems in hardware/software codesign framework. 2. Technology mapping of speed-independent circuits: We have reached significant progress in solving a problem of technology mapping for speed-independent circuits based on two different techniques: (1). Algebraic factorization: the proposed method performs both combinational (inserting new gates) and sequential (inserting new memory elements) decomposition of complex gates in a given standard cell library, while preserving original behaviour and speed-independence. (2). Boolean decomposition: the proposed method iteratively performs Boolean decomposition of each complex gate using Boolean relations, as opposed to the less powerful algebraic factorization approach used in previous methods. After logic decomposition, the overall library matching and optimization is carried out. 3. Testing of path delay faults: We provided effective procedures to solve the initialization and the test pattern generation problems for the path delay fault testing of asynchronous circuits. Experimental results shows that a high level of path delay fault testability can be achieved with partial scan.
We have presented a few Invited tutorials at the Summer School, International Conference, and Fujitsu Labs.
Teaching: we taught logic design, computer architecture in the undegradute school and "Synthesis and Optimization of Digital Circuits" in the graduate school. We have lead graduation reserch projects.
Prof. Robert H. Fujii:
This year's activities have included the design of a microcontroller controlled motorized wheelchair prototype, the design of a fuzzy logic controller using Verilog, and the design of a high speed pipeline control unit using Verilog. Implementation of the motorized wheelchair prototype into a real system which can be ridden by people with disabilities is being carried out. Analog circuit designs of various modules will be forthcoming.
Prof. Jianhua Ma:
My researches in 1996 are devoted to improving algorithms of the Truga001 graphics chip and modeling multimedia hyperworld. A new circuit combined Phong shading with bump mapping has been developed. Several bump-mapped shading pictures have been produced to test the developed algorithm and circuit. The interpolation method has been proposed to improve bump-mapped shading effects. Texture mapping and video mapping performance of the Truga001 has been simulated and evaluated. The hyperworld is an integration of various interaction worlds in different time, space and reality. Our research starts from basic features of the hyperworld, composition and models of an one-to-many interaction system. We are now focusing on developing the prototype of an educational hyperworld system, called Cheer.

Refereed Journal Papers

Ikedo T., Aizu Supercomputer Project. ISMM Trans. Computers and Applications, 1997. accepted.
It is proposed new architecture of massive parallel computing for virtual reality system and visual programming language.
Yamin Li and Wanming Chu, A MODEL FOR PREDICTING UTILIZATION OF MULTIPLE PIPELINES IN MTMP ARCHITECTURE. International Journal of MODELLING & SIMULATION, accepted, 1997.
The conventional single-threaded multiple-pipelined processor is not capable of using multiple pipelines efficiently, and so the processor performance suffers. This paper investigates a multiple-threaded multiple-pipelined (MTMP) processor architecture that tries to issue multiple instructions from multiple instruction threads in every clock cycle. For the performance evaluation, the paper proposes a modified analytic model that provides a quick prediction of utilization of pipelines. Unlike previous analytic models of multiple-threaded architecture, the model presented here concerns the utilization of multiple pipelines. It deals not only with pipeline dependencies but also with structure conflicts. The model can be used for turning processor parameters when a MTMP is designed.
Fujino Shinji and Fujii Robert, VLSI Design of a RISC Processor Using Parthenon and Verilog Hardware Description Languages. Tenth Parthenon Research Conference, p. 59-62, 1997.
The design of a 32-bit RISC processor carried out using Verilog-Synergy and PARTHENON is described. Semantic, procedural, and optimization differences are analyzed and utilized to come up with designs which take advantage of the individual strengths of the two hardware description languages.
Tsuneo Ikedo and Jianhua Ma, A Scalable Rendering Processor: the Trug001. IEEE Computer Graphics and Applications, accepted, 1997.
The Truga001 was developed as a graphics processor for generating virtual reality images in real time. It consists of multiple processors embedded in a single chip with each processor interconnected with an MIMD structure. In this sense, it is not a single-chip graphics processor of conventional architecture which mainly performs graphics functions based on a microprogram or user-defined software. The Truga001 performs the various types of rendering in parallel. In the chip, there are 12 processors and 7 special graphics hardware modules.
Tosiyasu L. Kunii, Jianhua Ma and Runhe Huang, Towards Direct Mapping between Visual Information Worlds and Real Worlds. Visual Information Systems, accepted, 1997.
Without supports of effective modeling in visual worlds, even if the visual information can be displayed in real worlds, it is difficult to recognize and extract its features and to confirm or verify the identities and the characteristics. Direct mappings target at refining and abstracting multimedia information by cognitive technology and differential technology to efficiently improve our human performance and actively control the real worlds we live. An assemblability discriminating method and an assembling sequence generating method named SYDEM is explained by giving an example of CIM assembly process. The three other examples of an effective guide-map generation method, of a hierarchical description of surfaces, and of a conceptual visual human algorithm for skiing, are given to show the drastic efficiency and exactness increase.
Tosiyasu L. Kunii, Jianhua Ma, Runhe Huang and Takao Maeda, Japan: Computer Graphics Research Activities. Computer Graphics, SIGGRAPH Quarterly, vol. 30, No. 2, p. 28-31, 1996.
Computer Graphics has been a very active research area in Japan. With the developments of CAD/CAM, visualization and entertainment applications, especially, recent advanced technologies multimedia and virtual reality, computer graphics research and its applications have become hotter topics. There are unique pure academic researches at universities. Most of the research and development at companies have been traditionally emphasizing engineering and applied research. This article reviews the ``state-of-the-art" of the Computer Graphics industry and research activities in Japan.
A. Yakovlev, M. Kishinevsky, A. Kondratyev, L. Lavagno, and M. Pietkiewicz-Koutny, On the models for asynchronous circuit behaviour with OR causality. Formal Methods in System Design, vol. 9, No. 3, p. 189-234, 1996.
Petri nets and Change Diagrams provide adequate modelling and circuit synthesis tools for the various OR causality types, yet they do not always bring the specifier to a unique decision about which modelling construct must be used for which type. We present a unified descriptive tool, called Causal Logic Net, which is graphically based on Petri net but has an explicit logic causality annotation for transitions. It is aimed as the least possible generalisation of Petri nets and Change Diagrams. The signal-transition interpretation of this tool is analogous to, but more powerful than, the well-known Signal Transition Graph. A number of examples demonstrate the usefulness of this model in the synthesis of asynchronous control circuits.
Jordi Cortadella, Michael Kishinevsky, Alex Kondratyev, Luciano Lavagno and Alex Yakovlev, Petrify: a tool for manipulating concurrent specifications and synthesis of asynchronous controllers. IEICE Transactions on Information and Systems, vol. E80-D, No. 3, p. 315-325, 1997.
Petrify is a tool for (1) manipulating concurrent specifications and (2) synthesis and optimization of asynchronous control circuits. Given a Petri Net (PN), a Signal Transition Graph (STG), or a Transition System (TS) it (1) generates another PN or SG which is simpler than the original description and (2) produces an optimized net-list of an asynchronous controller in the target gate library while preserving the specified input-output behavior. An ability of back-annotating to the specification level helps the designer to control the design process.

Refereed Proceeding Papers

Ikedo T., A Multimedia VR(Virtual Reality) System. IEEE Multimedia System'96, International Conference on Multimedia Computing and System , Stevens S. M., p. 4-11, IEEE, IEEE computer Society press. Hiroshima, June 1996.
The paper proposed the new architecture for virtual reality system which can cope with diversed data types and processing of multimedia system.
Ikedo T., Yamada J., Nonoyama Y., Kimura J., and Yoshida M. An Architecture based on the Memory Mapped Node Addressing in Reconfigurable Interconnection Network. The Second Aizu International Symposium on Parallel Algoritms/Architecture Synthesis, 1997. Mirenkov N., p. 50-57, U of Aizu, IEEE and ACM, IEEE computer Society press. Aizu, March 1997.
MMU and router architecture with node addressing mechanism are described. Reconfigurable architecture and node mapping system are available with the highest transmission efficiency in pseudo complete graph IN.
Ikedo T., A Realtime Video-image Mapping using Polygon Rendering Techniques. IEEE Multimedia System'97, International Conference on Multimedia Computing and system, N. D. Georganas, p. 127-134, IEEE, IEEE computer Society press. Canada, June 1997.
A video mapping technique is described where a video camera image is mapped onto the animated polygon surface in real time. The hardware architecture for video capturing, image caching, antialiasing used sinc filtering are proposed.
Yamin Li and Wanming Chu, Aizup -- A Pipelined Processor Design and Implementation on XILINX FPGA Chip. Proceedings of IEEE Symposium on FPGAs for Custom Computing Machines (FCCM'96), Kenneth L. Pocek and Jeffrey Arnold, p. 98--106, IEEE Computer Society. IEEE Computer Society Press, Napa, California, April 1996.
This paper describes a pipelined processor (named Aizup) design and implementation for the exercise of Computer Architecture/Organization Education at the University of Aizu. The Aizup pipeline has four stages and deals with data dependency and control dependency. The Aizup was designed at Cadence environment and implemented on Xilinx XC4006PC84 FPGA chip. We ask students to design the processor, to perform functional simulations, to implement the design on the chip, and to measure the chip with Logic Analyzer. The exercise course is helpful to students to understand the operations of pipelined processors and to master the design methodologies and the use of measuring instruments.
Yamin Li and Wanming Chu, A New Non-Restoring Square Root Algorithm and Its VLSI Implementations. Proceedings of International Conference on Computer Design (ICCD'96) . Bob Werner, p. 538--544, IEEE Computer Society, IEEE Computer Society Press. Austin, Texas, October 1996.
In this paper, we present a new non-restoring square root algorithm that is very efficient to implement. The new algorithm presented here has the following features unlike other square root algorithms. First, the focus of the ``non-restoring'' is on the ``partial remainder'', not on ``each bit of the square root'', with each iteration. Second, it only requires one traditional adder/subtractor in each iteration, i.e., it does not require other hardware components, such as seed generators, multipliers, or even multiplexors. Third, it generates the correct resulting value even in the last bit position. Next, based on the resulting value of the last bit, a precise remainder can be obtained immediately without any correction or addition operation. And finally, it can be implemented at very fast clock rate because of the very simple operations at each iteration. We illustrate two VLSI implementations of the new algorithm. One is a fully pipelined high-performance implementation that can accept a new square-root instruction each clock cycle with each pipeline stage requiring a minimum number of gate counts. The other is a low-cost implementation that uses only a single adder/subtractor for iterative operation.
Fujii Robert, Microelectronic Systems Design Educational Challenge. 1997 IEEE Computer Society International Conference on Microelectronic Systems Education, 1997.
Microelectronic systems design education at Japanese Universities and the support provided by industry and government are described; innovative governmental initiatives and unique University microelectronics programs are also described.
Fujii Robert and Honda Yoshihiko, VLSI Computer System for Motorized Wheelchair. 12th Japanese Conference on Advancement of Rehabilitation Technology, 1997.
A prototype motorized wheelchair controlled by a VLSI microcontroller and various sensors was designed and built. Its performance on various terrains and courses was analyzed.
Jianhua Ma and Runhe Huang, Improving Human Interaction with a Hyperworld. Proceedings of 1996 Pacific Workshop on Distributed Multimedia Systems, p. 46--50, The Hong Kong University of Science and Technology, June 1996.
A person interacts with various worlds in two ways: the one-to-one interaction with a single world and the one-to-many interaction with multi-worlds. Most of the current researches on improving human interaction with the world are limited to the one-to-one interaction, i.e. the interaction of a person with each individual world. For example, telepresence and teleoperation technologies can overcome some of the time, space and other physical constraints when a person interacts with the physical world. CSCW (Computer Support Cooperated Work) and CSCL (Computer Support Cooperated Learning) are changing the interaction among people. Hypermedia and WWW (World Wide Web) are making the interaction of human with the information world more natural. Since relations among worlds in the multi-worlds are nonlinear and can be expressed by a set of links, such multi-worlds as a whole is called a hyperworld. This paper focuses on giving an outline of a future hyperworld as a system and presents some problems of developing such a system and proposes potential solutions.
Runhe Huang, Jianhua Ma and Eiju Tsubo. Communication Network Design via a Genetic Algorithm Based Learning Algorithm. Proceedings of the IASTED International Conference on Artificial Intelligence, Expert Systems and Neural Networks, p. 15--18, IASTED/ISMM, IASTED-ACTA PRESS, August 1996.
In this paper, we focus on developing a genetic algorithm based learning algorithm for a communication network design to evolve solutions that minimize total link cost, and subject to more constraints like the network routing, diameter and two-connected survivability rather than just the survivability considered in some research papers. The implementation results of the genetic algorithm for searching two disjoint paths corresponding to each requirement and performance of the genetic algorithm based learning algorithm for a simulated communication network design in terms of cost are reported.
Runhe Huang and Jianhua Ma, A Distributed Genetic Algorithm over A Transputer Based Parallel Machine for Survivable Network Designs. Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, p. 1202--1211, CSREA, August 1996.
In this paper, a distributed genetic algorithm is proposed and it is emphasized that how the distributed genetic algorithm is implemented over a transputer based parallel machine called ParsyTec Gcel-1/64 by using virtual torus topology for a communication network design that minimizes total communication link cost, and subjects to more constraints like the network routing, diameter and two-connectivity rather than just the survivability. The implementation results of the genetic algorithm for searching two disjoint paths corresponding to each requirement and performances of the distributed genetic algorithm for a simulated communication network design in terms of computing time and link cost are reported.
Tsuneo Ikedo and Jianhua Ma, Pixel Cache Architecture with FIFO Implemented within An ASIC. Proceedings of the Ninth Annual IEEE International ASIC Conference, J. D. Meindl, p. 19-22, IEEE, September 1996.
Implementation technology for 3D pixel cache and performance evaluation of a graphics processor Truga001 embedded 12 processors within a single chip are described. The chip can render 4 million vectors/s (10 pixels/vector) or 1.2 million triangle polygons/s (100 pixels/polygon) with Phong shading, texture mapping and hidden surface removal. A pixel-array configured with 8(x) x 4(y) x 24-bit(intensity) x 24-bit(z) can be accessed with frame buffer at 180ns due to the 3D bus-architecture between chip and frame buffer. The chip was designed with Toshiba TC180C CMOS of 400,000 gates.
Jianhua Ma and Runhe Huang, Modeling Interface with a Multimedia Hyperworld. Proceedings of 12th Human Interface Symposium, p. 219--224, Human Interface Society of Japan, October 1996.
There are two kinds of modeling topics in a hyperworld system: modeling direct mapping between multimedia information worlds and real worlds, and modeling human interface with a hyperworld. Most of the current researches on improving human interaction with the worlds are limited to the one-to-one interaction, i.e., the interaction of a person with each individual world. The study of the paper is devoted to one-to-many interaction features with the hyperworld, composition of a one-to-many interaction system, and its associated reference model so as to lay foundations for further study and system development.
Jianhua Ma and Runhe Huang, Parallel Implementation of A Learning Algorithm for Communication Network Design. The proceedings of International Conference on Applied Informatics. p. 149--152, February 1997.
The most critical step in applying a genetic algorithm to a survivable communication network design is to choose a way to represent a solution to the problem. In this paper, we present a genetic algorithm in which the routing, diameter, and 2 connectivity constraints can be easily and successfully encoded in a chromosome representation. A parallel implementation of the genetic algorithm based learning algorithm in the level of requirements is proposed and implemented in a transputer based parallel machine.
Runhe Huang and Jianhua Ma, A Study on A Hyperworld System of One-to-Many Interaction. The proceedings of International Conference on Applied Informatics, p. 255--258, February 1997.
World integration, beyond media integration, means integrating various interaction worlds of different time, space and reality into one system. Such world integration, called a hyperworld, originates from matching with an one-to-many interaction way between a person and the worlds. There have been lots of models, mechanisms and standards for multimedia integration, however, there are no correspondents for the world integration. This paper focuses on basic features of the hyperworld, composition and models of an one-to-many interaction system, and a case study of a telemedicine hyperworld system. The composition of a hyperworld system has three levels: a level of each world generation and interaction system, a level of world management supported by a hyperworld reference model, and a level of hyperinterface.
Tsuneo Ikedo, Y. Okuyama and Jianhua Ma, Hardware Implementation of Shading Models in an Application Specific Integrated Circuit. Proceedings of the International Conference on Shape Modeling and Applications, p. 150-159, The University of Aizu, Computer Society Press, March 1997.
The Truga001 is a single chip rendering processor with 12 embedded many graphics functions. In the design of the Phong and bump-mapped shading circuit, we used angular parameters for defining surface and light-source normals instead of vector. This enables the circuit-scale less than 10,000 gates/circuit. The chip is fabricated with a 940,000-gate standard cell, 0.3um CMOS in a TCP/BGA package. This paper describes the hardware architecture and ts implementation technologies of the Phong and bump-mapped shading in an ASIC.
Runhe Huang, Jianhua Ma, Tosiyasu L. Kunii and Eiju Tsuboi, Parallel Genetic Algorithms for Communication Network Design. Proceedings of the Second Aizu International Symposium on Parallel Algorithms/Architectures Synthesis , p. 370--377, The University of Aizu, Computer Society Press, March 1997.
This paper is devoted to developing a genetic algorithm for a communication network design that minimizes total link cost, and subjects to some constraints like diameter and two-connectivity. Two parallel genetic algorithms on the level of partitioning requirements and the level of dividing population are proposed and implemented over a transputer based parallel network with various virtual network topologies. The ring-ring topology gives the best performance for the parallel genetic algorithm on the level of partitioning requirements, and the torus topology is the most suitable topology for the parallel genetic algorithm on the level of dividing population.
Tsuneo Ikedo and Jianhua Ma, An Advanced Graphics Chip with Bump-mapped Phong Shading. Proceedings of the Computer Graphics International Conference, p. 156-165, Limburgs University, Computer Society Press. June 1997.
To overcome the complexity of the circuit design for the calculation of the conventional Phong shading model and bump mapping, a vector (such as the normal or unit vectors of a surface, a light source, and other objects) is defined by the two angle parameters of the horizontal and vertical components with the eye-point axes in the Truga system. The new algorithms of Phong and bump shading based on the angle parameters are proposed. One chip can perform 1.2 million polygons (100-pixel 3-D triangle)/second while simultaneously applying Phong shading, bump and texture mapping, hidden surface removal. The system can also be unlimitedly scaled up with parallel interconnections of multiple The Truga001 submodules.
A. Kondratyev, M. Kishinevsky, A. Taubin and S. Ten, A Structural Approach for the Analysis of Petri Nets by Reduced Unfoldings. Proc. of International Conference on Application and Theory of Petri Nets, p. 346-365, Springer, Osaka, June 1996.
The new criterion significantly reduces the size of an unfolding obtained by a PN. The properties of PNs for analysis can be various: boundedness, safety, persistency etc. A practical example of the suggested approach is given in an application to asynchronous design.
Taubin A., Kondratyev, A. and Kishinevsky, M., Deadlock Prevention Using Petri Net Unfoldings. CESA'96 IMACS Multiconference. Symposium on Discrete Events and Manufacturing Systems. Proc., July 1996. p. 426-431, IMACS, Lille, France.
A deadlock prevention procedure first detects deadlocks using the unfolding, then reduces the unfolding to a deadlock-free sub-unfolding, and finally folds the deadlock-free acyclic net into a cyclic net. For the class of reinitialized processes which are very common for the manufacture systems the obtained cyclic net is live equivalent to the initial Petri Net. The live equivalence implies that each trace of the transformed net has an equivalent feasible trace in the initial net and all live (infinite) traces of the initial net are also present in the final net. As a side effect our method constructs ordering relations between places and transitions and checks boundedness, safety, persistency and hazards in the initial Petri Net specification.
M. Kishinevsky, J. Cortadella, A. Kondratyev, L. Lavagno, A. Taubin and A. Yakovlev, Coupling Asynchrony and interrupts: Place Chart Nets and their Synthesis. Proc. of International Conference on Application and Theory of Petri Nets, p. 20, Springer, Toulouse, June 1997.
A model, called Place Chart Nets (PCN), is presented. It allows the modelling ofboth asynchronicity and exception handling (preemption). Contrary to State Charts and other reactive models, which are inherently synchronous, PCNs specify a system behavior using partial orders. Contrary to Petri nets, PCNs have a notion of hierarchy. Contrary to other hierarchical models based on Petri net extensions, the hierarchy in PCNs is determined by preemption. We show that PCNs are a non-trivial generalization of classical PNs, in the sense that (1) for the finite (bounded) case modeling a PCN may require an exponentially larger PN, (2) for the infinite (unbounded) case a class of PCN languages properly includes a class of PN languages and (3) k-boundedness of a place is decidable. Then we present a method for synthesis of safe PCNs from transition systems, which generalizes the theory of regions previously developed for PNs.
M. Kishinevsky, A. Kondratyev, L. Lavagno, A. Saldanha and A. Taubin, Delay Fault Testing of Asynchronous Sequential Circuits. Proc. of International Workshop on Logic Synthesis, ACM Sigda, Granlibakken Resort, California, May 1997.
Asynchronous circuits operate correctly only under timing assumptions that must be tested. Asynchronous nets are a subclass in which feedback is allowed only inside sequential elements, and can be obtained from asynchronous circuits by partial scan. We show a reduction from delay fault testing of asynchronous nets to stuck-at testing on combinational circuits.
J. Cortadella, M. Kishinevsky, A. Kondratyev, L. Lavagno and A. Yakovlev, Methodology and tools for state encoding in asynchronous circuit synthesis. Proc. of the 33d Design Automation Conference, ACM, IEEE, June 1996. p. 63-66.
This paper proposes a state encoding method for asynchronous circuits based on the theory of regions. A region in a Transition System is a set of states that ``behave uniformly'' with respect to a given transition (value change of an observable signal), and is analogue to a place in a Petri net. Regions are tightly connected with a set of properties that must be preserved across the state encoding process, namely: (1) trace equivalence between the original and the encoded specification, and (2) implementability as a speed-independent circuit. We build on a theoretical body of work that has shown the significance of regions for such property-preserving transformations, and describe a set of algorithms aimed at efficiently solving the encoding problem. The algorithms have been implemented in a software tool called petrify.
A. Kondratyev, J. Cortadella, M. Kishinevsky, L. Lavagno and A. Yakovlev, Technology mapping for speed-independent circuits: decomposition and resynthesis. Proc. of Third International Symposium on Advanced research in Asynchronous Circuits and Systems, IEEE Computer Society Press, 1997. pp. 24--253.
This paper presents theory and practical implementation of a method for multi-level logic synthesis of speed-independent circuits. An initial circuit implementation is assumed to satisfy the monotonous cover conditions but is technology independent. The proposed method performs both combinational (inserting new gates) and sequential (inserting new memory elements) decomposition of complex gates in a given standard cell library, while preserving original behaviour and speed-independence. The algorithm applies known efficient algebraic factorization techniques from combinational multi-level logic synthesis, but achieves also boolean simplification and sequential decomposition. The method allows sharing of decomposed logic.
J. Cortadella, M. Kishinevsky, A. Kondratyev, L. Lavagno and A. Yakovlev, Technology mapping of speed-independent circuits based on combinational decomposition and resynthesis. Proc. of the European Design and Test Conference, IEEE, March 1997.
This paper formulates and describes a solution to the problem of sequential multi-level logic synthesis of asynchronous speed-independent circuits. The starting point is a technology-independent speed-independent circuit obtained using, e.g., the monotonous cover conditions. We describe an algorithm for the factorization of this circuit aimed at implementing it in a given standard cell library, while preserving speed-independence. The algorithm exploits known efficient factorization techniques from combinational multi-level logic synthesis, but achieves also boolean simplification. Experimental results show a significant improvement in terms of number and complexity of solvable circuits with respect to existing methods.
A. Kondratyev, M. Kishinevsky, A. Taubin, and S. Ten. Verification of asynchronous systems based on petri net unfoldings. In Proc. of IEICE Concurrent Systems Technology Conference, CST-96, pages 17--23, Aizu, May 1996.
A. Taubin, A. Kondratyev, and M. Kishinevsky. Deadlock prevention by petri net transformations. In Proc. of IEICE Concurrent Systems Technology Conference, CST-96, pages 25--32, Aizu, May 1996.
E. Pastor, J. Cortadella, O. Roig, and A. Kondratyev. Structural methods for the synthesis of speed-independent circuits. In Proc. of the European Design and Test Conference, pages 340--347. IEEE, March 1996.
M. Kishinevsky, J. Cortadella, A. Kondratyev, L. Lavagno, and A. Yakovlev. Synthesis of general petri nets. In Proc. of IEICE Concurrent Systems Technology Conference, CST-96, volume Vol. 96 (CST96-5), pages 33--39. IEICE, Japan, May 1996.
A. Taubin, A. Kondratyev, and M. Kishinevsky. Application of petri net unfoldings to asynchronous design. In Proc. of the Computational Cybernetics and Simulation Conference. IEEE, October 1997.
J. Cortadella, M. Kishinevsky, A. Kondratyev, L. Lavagno, Enric Pastor, and A. Yakovlev. Decomposition and technology mapping of speed-independent circuits using boolean relations. In Proc. of the International Conference on Computer-Aided Design. IEEE, November 1997.
M. Kishinevsky, A. Kondratyev, L. Lavagno, Alexander Saldanha, and A. Taubin. Partial scan delay fault testing of asynchronous circuits. In Proc. of the International Conference on Computer-Aided Design. IEEE, November 1997.

Unrefereed Papers

A. Kondratyev, M. Kishinevsky, A. Taubin and S. Ten. Verification of asynchronous systems based on Petri Net unfoldings. Proc. of IEICE Concurrent Systems Technology Conference, CST-96, p. 17-23, Aizu, May 1996.
A. Taubin, A. Kondratyev, M. Kishinevsky, Deadlock Prevention by Petri Net Transformations. Proc. of IEICE Concurrent Systems Technology Conference, CST-96, p. 25-32, Aizu, May 1996.
M. Kishinevsky, J. Cortadella, A. Kondratyev, L. Lavagno and A. Yakovlev, Synthesis of General Petri Nets. Proc. of IEICE Concurrent Systems Technology Conference, CST-96-5, No. 57, p. 33-39, IEICE, Japan, May 1996.

Patents

Tuneo Ikedo, Computer Graphics Circuit. International Patent(15-country), March 1996. Number: International Patent PCT/JP96/01837(1996), published.
Tuneo Ikedo, Gaseous Object Renderer. Japan, July 1996, Number: 8-204046, published.
Tuneo Ikedo, Texture Mapping Circuit, Japan. June 1996, Number: 8-204047, published.
Tuneo Ikedo, Bump-map Shading Circuit, Japan. March 1996, Number: 8-223011, published.
Tuneo Ikedo, Spot-Lighting Circuit, Japan. July 1996, Number: 8-245398, published.
Tuneo Ikedo, Computer Graphics Circuit, USA. July 1996, Number: PCT/JP96/01837, published.

Grants

Michael Kishinevsky, ACiD Esprit European grants for special meeting on asynchronous design and concurrency and for giving a tutorial at the Summer school in Asynchronous Design. 1996.

Academic Activities

Yamin Li, 1996-1997, Refereeing papers for International Journal of MODELLING & SIMULATION.
Yamin Li, 1997, Refereeing papers for the Second Aizu International Symposiun on Parallel Algorithms/Architecture Synthesis.
Jianhua Ma, 1996, Section Chairman for IASTED 15th International Conference on Applied Informatics.
Jianhua Ma, The IPPS'97 Workshop on Parallel Processing and Multimedia. 1997, Referee of the IPPS'97 Workshop on Parallel Processing and Multimedia.
Jianhua Ma, The International Conference AE'97. 1997. Referee of the International Conference AE'97.
Jianhua Ma, The International Conference on Evolution Computing. 1997. Referee of the International Conference on Evolution Computing.
Michael Kishinevsky, IEEE, 1996. IEEE Senior member from December 1996. Refereeing papers for Journal of VLSI Signal Processing; ICCAD96; ICCAD97; Async97; IEE Transactions; Euro-DAC97; EDTC97; etc.
Michael Kishinevsky, IEICE, 1996. Serving at the Editorial board of the Special Issue on Asynchronous Circuit and System Design of IEICE Japanese and English Journals Information and System Society.
Michael Kishinevsky, Invited tutorial at the Summer school in Asynchronous design, Lyngby, Denmark, August 1997; Invited tutorial at Fujitsu Kawasaki Labs, 1997.
Michael Kishinevsky, Invited seminars at the University Politecnica Barcelona, Spain; Politecnica di Torino, Italy; Technical University of Denmark, 1996.
Michael Kishinevsky, Invited seminars University Passau, Germany; IRISA, Renes Cedex, France; ACiD-WG Workshop on Asynchronous Synthesis, Groningen, The Netherlands, 1996.

Next: Computer Solid State Up: Department of Computer Previous: Department of Computer

www@u-aizu.ac.jp
October 1997