High Performance = Innovative Computer Systems + Efficient Algorithms

HNI Research Report Series: December 2002, 2001, 2000, 1999, 1998, 1997, 1994, 1993.

Theoretical Computer Science: Algorithms and Complexity
Prof. Dr. math. Friedhelm Meyer auf der Heide

 

  1. Abstract
  2. Focus in Research and Education
  3. Central research positions
    1. Parallel Algorithms on Dynamic, Heterogeneous PC Clusters
    2. Mobile ad hoc Networks
    3. Data Distribution in Storage Networks
    4. Routing and Data Management in Networks of Limited Bandwidth
    5. Algorithms for Large Dynamic Geometric Networks
    6. Algorithms and Computer Graphics - Data Structures for Walkthrough Problems
  4. Appendix
    1. Publications
    2. Fairs/Conferences/Seminars
    3. Prizes/Awards
    4. Additional Functions
    5. Projects
    6. Industrial Cooperation
 


1. Abstract

High computing performance can only be achieved by a combination of powerful computer systems and algorithms that solve the given application problems as efficiently as possible. Therefore, the development of efficient algorithms has established itself as a classical branch of computer science. In our research area, we concentrate on solutions where current technological possibilities such as computer networks used as high-performance computers pose new challenges for algorithm development.


2. Focus in research and education

Modern computer systems enable expanding application areas: Parallel computer networks can deal with extremely complex algorithmic problems; the Internet realizes global exchange of information and the interconnected computers may possibly serve as one giant parallel computing device; wireless communication systems allow flexible communication between mobile stations; graphics applications with hardware support display real-time navigation in complex virtual scenes. A special challenge is given by computing systems consisting of heterogenous components (e.g. Different powerful processors, storage devices or communication capabilities) with structural changes within time. In this year our focus turned on the algorithmic challenges imposed by the realisation and efficient usage of such heterogenous, dynamic systems.

Parallel computer networks can potentially supply unlimited computing power. However, the efficient use of these networks is an extremely complex problem. We provide users with a programming environment, the PUB-libarary, that is easy to handle and guides them towards the development of efficient algorithms. In order to support such an environment, efficient implementations of basic routines for communication and synchronization are needed. Meanwhile our PUB-library is used by an international audience, valueing an efficient and straight-forward environment for parallel computing. The latest extension of the PUB-library takes into account the special problems of heterogenous local area networks (LAN) and makes efficient use of the idle-times of the participating computers.

To be able to navigate in a virtual 3-dimensional space, and to give a realistic optical impression of the changing scene, enormous demands are imposed on the underlying data structures that handle the scene. Above all, we have to guarantee real time processing in order to guarantee a realistic impression of the scene. Our work on the development of such new types of data structures are incorporated into our prototype walk-through system ParSIWal.

Dynamic networks,i.e., networks whose nodes change their (geometric, geographic) position over time, play a major role in many areas: They can, e.g., be used as data structures for moving objects in Computer Graphics, as well as models for wireless mobile communication networks. We systematically model various kinds of dynamic networks, design algorithms, and apply them to the above mentioned Computer Graphics and communication problems.

The algorithmic work described above has shown us that using randomized procedures can produce amazing gains in efficiency. We are therefore systematically studying the potential of randomized algorithms and using, or developing, probability theory for analyzing such algorithms.

Our research is closely linked to our teaching. Our courses deal with methods and concepts of the development and analysis of efficient algorithms. We also run project groups and support diploma thesis that apply our theoretical insights in order to design efficient algorithms and libraries.


3. Central research positions

A PC cluster as parallel computer

3.1 Parallel Algorithms on Dynamic, Heterogeneous PC Clusters

In order to satisfy the increasing demand of computing performance in many scientific and engineering areas, parallel computer systems are used. As a matter of fact, programming and maintenance of such systems is much more difficult than that of a sequential computer where we have the versatile, universally successful von Neumann model for which we write our sequential programs.

A prominent reason for this difficulty is the variety of different hardware systems which makes it hard to write portable parallel programs or maintain large parallel software systems. Hence, an abstract model is needed that includes all of the important performance-relevant features and hides hardware specific solutions - a so-called bridging model, that means a "parallel analogue" to the von Neumann model. We extended Valiant's Bulk Synchronous Parallel (BSP) bridging model and implement and evaluate the PUB library (Paderborn University BSP) that enables the programmer to write portable efficient parallel programms that can be run on a variety of different parallel computers.

Interconnecting different, but "usual" PCs by fast networks allows to employ such heterogeneous PC Clusters as parallel computers. Our PUB library has been ported to a PC cluster under the operating system Linux. The migration of processes has been realised, that means that a single PC that is heavily loaded by many processes (even from other users) can send some of its jobs to less loaded Pcs. Due to the fact that such a network may consist of single machines with quite different performance and that in such a network the load and, hence, the available computing power, of the single machines can vary quite considerably during the runtime, new algorithmic challenges emerge in the context of load balancing and of scheduling.

As PC Clusters are considerably less stable than classical parallel computers, for it is quite probable that from time to time single machines are switched off or are installed newly in the network, we enhance the PUB library by functionalities that ensure a stable behaviour of the overall system.

Dr. rer. nat. Rolf Wanka
Phone: +49 (0)52 51 60 64 34
Fax:   +49 (0)52 51 60 64 82
Email: wanka@upb.de
URL:   www.upb.de/cs/wanka.html

3.2 Mobile ad hoc Networks

Simulation of ad hoc networks (SAHNE) Mobile electronic devices (PDA, laptop, mobile phones) are connected to wired networks, e.g. the Internet, by centralized protocols (e.g. GPRS, UMTS). In such hierarchically structured networks an increase in the number of participants results in reduced quality of service.

Mobile ad hoc networks (MANET) rely on a different, namely decentralised, approach. Without a fixed infrastructure a MANET imposes less restrictions for mobility and increasing network density even improves network properties like connectivity and data throughput.

Our research concentrates on the analysis and implementation of suitable algorithms for network management and routing in MANETs. In cooperation with the system and circuit technology working group we implement a prototypical MANET based on the mini-robot Khepera.

In this first period of our project we model, analyze and simulate mobile ad hoc networks. On the physical level we allow variable transmission power and for each station sectorized parallel communication. One result of our work is the development of meaningful complexity measures for the efficiency of the routing in such a wireless network: Dilation, Energy and Congestion. We provide distributed algorithms which build up basic networks, which are optimized with respect to these measures. However, in general it is not possible to construct a protocol, that optimizes more than one measure at the same time. Two of these parameters, energy and dilation, are even incompatible.

On the positive side it turns out that there exists a sparse general-purpose network which enables effieient routing with respect to all these complexity measures (one at a time). Our simulations show that such a communication network as well as a sectorized version can be realized on an embedded, distributed system like a herd of Khepera robots.

PD Dr. rer. nat. Christian Schindelhauer
Phone: +49 (0)52 51 60 64 52
Fax:   +49 (0)52 51 60 64 82
Email: schindel@upb.de
URL:   www.upb.de/cs/schindel.html

SGI(TM) TP9400 hard disk system with 1 Terabyte storage capacity based on RAID-technology  

3.3 Data Distribution in Storage Networks

Current storage systems face the difficult task to manage constantly growing data entities very flexible. One way of meeting capacity and performance requirements is the usage of dedicated storage networks. We design randomized algorithms which not only distribute data blocks evenly but furthermore support diffent disk sizes and which are adaptive to system changes.

Recently, we were able to improve our heterogeneous distribution strategy. We could show that our two new approaches, namely SHARE and SIEVE, are close to an optimal solution. Because such techniques are of high practical interest in modern storage systems we used the SHARE strategy to implement a storage virtulization prototype. It was realized under Linux and consists of a device driver and a graphical management tool. With this virtualization we are capable of decreasing the management of large storage systems and providing transparent and vendor indepentent storage solution.

Kay Salzwedel
Phone: +49 (0)5251 606458
Fax:   +49 (0)5251 606482
Email: nkz@upb.de
URL:   www.upb.de/cs/salzwedel.html

3.4 Routing and Data Management in Networks of Limited Bandwidth

Desing of the PreSto-memory solution In large parallel and distributed systems the bandwidth of the interconnection network is usually the major bottleneck for the performance of distributed applications. In these systems a data management strategy has to ensure that the communication overhead caused by remote accesses to shared data objects is as low as possible. Therefore it is important to distribute the load evenly among all network resources and not necessarily evenly among the hard disks, as in the storage network scenario.

We have developed data management and routing strategies that obtain the optimal communication load up to a polylogarithmic factor in general topology networks. The routing strategy has the remarkable property that it is oblivious, i.e., the routing paths are chosen completely independent from the current traffic in the network. Hence, these strategies can be implemented very efficiently due to their simple structure.

Harald Räcke
Phone: +49 (0)5251 606457
Fax:   +49 (0)5251 606482
Email: harry@upb.de
URL:   www.upb.de/cs/raecke.html

3.5 Algorithms for Large Dynamic Geometric Networks

Rapidly decreasing hardware prices and the increase in storage capacity by orders of magnitude in the recent years are the reason for larger and larger data sets that have to be processed by database systems and data mining tools. Cisco, for example, uses the NetFlow software to gather information about the network flow that is routed through their servers. One flow data entry contains information about the IP-adress, source and destination, number of routed packets and number bytes. At a single WorldNet gateway router the amount of information gathered each day is roughly 10 GB.

Another example for huge data sets can be found in telecommunication industry. AT&T, for example, maintains call-detail statistics which contain the called number, the caller's phone number, time, date, and length of the call, as well as data about the billing. Overall, AT&T serves about 200-300 million calls per day and the size of the gathered call-detail data is more than 7 GB. It is often impossible to process data sets of this size with standard linear time algorithms and thus there is a lot of research in new techniques to deal with huge data sets.

One of these techniques is the so called 'Property Testing'. The goal of property testing is to decide whether a given huge object has a (global) property or is far away from any object that has the property. This task should be achieved by only looking at a small sample of the object. We introduced a general framework to analyze property testing algorithms with one-sided error. We analyzed property testers for clustering problems, graph and hypergraph coloring, and a reversal distance property using our framework.

Smoothed Analysis Related to property testing are the so-called sublinear approximation algorithms. The goal of a sublinear approximation algorithm is to find an approximate solution to an optimization problem in sublinear time, that is, without looking at the whole input. In this area we developed an algorithm that uses a sublinear number of range queries (e.g., orthogonal range queries, simplex range queries) to approximate the weight of the Euclidean minimum spanning tree of a point set. Such range queries are supported by many fundamental data structures.

We also proposed to use 'smoothed analysis' instead of 'worst case analysis' as a measure for the complexity of motion. We consider a set of moving objects and we want to maintain a combinatorial structure uniquely defined by the position of these objects. In such a scenario one can measure the complexity of maintaining this particular structure by analyzing the maximum number of changes that occurs in the structure under linear motion. Smoothed analysis means to analyze the worst case when the input is subject to small random perturbations. Since practical inputs often do not behave exactly like the worst case (e.g., when the worst case occurs in lower dimensions) we think that smoothed analysis is better suited for the analysis of the complexity of motion.

Using this method we analyzed the smoothed number of combinatorial changes that occur when the smallest bounding rectangle of a point set is maintained under continuous motion.

Dipl. Inf. Christian Sohler
Phone: +49 (0)5251 606427
Fax:   +49 (0)5251 606482
Email: csohler@upb.de
URL:   www.upb.de/cs/csohler.html

3.6 Algorithms and Computer Graphics - Data Structures for Walkthrough Problems

Real-time navigation in highly detailed virtual scenes is an extreme challenge for virtual reality systems. Real-time navigation allows the visitor of the scene an intuitive understanding of the virtual scene. There are two problems: On the one side it is not easy to handle scenes of such sizes and on the other side you have to develop an efficient rendering algorithm.

Happy Buddha model: large sample size Happy Buddha model: medium sample size Happy Buddha model: small sample size The main focus of our research consists of the development of virtual reality systems for highly complex scenes that cannot completely be stored in main memory:

The Randomized Sample Tree
We developed a new data structure for rendering highly complex virtual environments of arbitrary topology. The special feature of our approach is that it allows an interactive navigation in very large scenes (30 GB/400 million polygons in our benchmark scenes) that cannot be stored in main memory, but only on a local or remote hard disk. Furthermore, it allows interactive rendering of substantially more complex scenes by instantiating objects.

For the computation of an approximate image of the scene, a sampling technique is used. In the preprocessing, a so-called sample tree is built whose nodes contain randomly selected polygons from the scene. This tree only uses space that is linear in the number of polygons. In order to produce an image of the scene, the tree is traversed and polygons stored in the visited nodes are rendered. During the interactive walkthrough, parts of the sample tree are loaded from local or remote hard disk.

We implemented our algorithm in a prototypical walkthrough system. Analysis and experiments show that the quality of our images is comparable to images computed by the conventional z-buffer algorithm regardless of the scene topology.
Landscape model: 400,000,000 polygons

Realtime Navigation using JPEG Compression
Furthermore, we are looking for rendering techniques that are completely independent of the scene complexity and the topology: Our system renders images for discrete viewpoints and viewing directions in a preprocessing step and store them on an external volume. During navigation each image can be displayed within a very short time by loading it from the volume. For acceleration, our prefetching strategy loads possibly needed images for the next few frames if the viewer takes a break. The measurements show that we achieve interactive frame rates, whereby the difference between the minimal and maximal display time is very small. Our system works well with scenes modelled by polygons, but also digital photos can easily be used for describing a 3D scene.

Jens Krokowski
Phone: +49 (0)5251 606491
Fax:   +49 (0)5251 606482
Email: kroko@upb.de
URL:   www.upb.de/cs/kroko.html


4. Appendix

4.1 Publications

  1. Volbert, K.: A Simulation Environment for Ad Hoc Networks Using Sector Subdivision, in Proc. of the 10th Euromicro Workshop on Parallel, Distributed and Network-based Processing, (PDP'2002), 2002.
  2. Bonorden, O.; Meyer auf der Heide, F.; Wanka, R.: Composition of Efficient Nested BSP Algorithms: Minimum Spanning Tree Computation as an Instructive Example, in Int. Conf. on Parallel and Distributed Processing Techniques and Applications (PDPTA'2002), 2002.
  3. Wanka, R.: Any Load-Balancing Regimen for Evolving Tree Computations on Circulant Graphs is Asymptotically Optimal, in Proceedings of Workshop on Graph-Theoretic Concepts in Computer Science (WG 2002), 2002.
  4. Meyer auf der Heide, F.; Schindelhauer, C.; Volbert, K.; Grünewald, M.: Congestion, Energy and Delay in Radio Networks, In Proc. of the Fourteenth ACM Symposium on Parallel Algorithms and Architectures (SPAA 2002), pages 230-237, 2002.
  5. Grünewald, M.; Lukovszki, T.; Schindelhauer, C.; Volbert, K.: Distributed Maintenance of Resource Efficient Wireless Network Topologies, In Proceedings of the 8th International Euro-Par 2002 (distinguished paper) pages 935-946, 2002.
  6. Räcke, H.: Minimizing Congestion in General Networks, to appear at Proceedings of the 43rd Annual IEEE Symposium on Foundations of Computer Science (FOCS'02).
  7. Räcke, H.;, Sohler, C.; Westermann, M.: Online Scheduling for Sorting Buffers, to appear at Proceedings of the 10th European Symposium on Algorithms (ESA'02), pages 820-839, 2002.
  8. Adler, M., Räcke, H.; Sivadasan N.; Sohler, C.; Vöcking, B.: Randomized Pursuit-Evasion in Graphs, in Proc. of the 29th International Colloquium on Automata, Languages and Programming (ICALP'02), pages 902-912, 2002.
  9. Czumaj, A.; Sohler, C.: Abstract Combinatorial Programs and Efficient Property Testers, to appear at Proceedings of the 43rd Annual IEEE Symposium on Foundations of Computer Science (FOCS'02).
  10. Klein, J.; Krokowski, J.; Fischer, M.; Wanka, R.; Meyer auf der Heide, F.: The Randomized Sample Tree: A Data Structure for Externally Stored Virtual Environments, In Proceedings of ACM Symposium on Virtual Reality Software and Technology (VRST 2002), Hong Kong, 2002.
  11. Cuntz, M.; Klein, J.; Krokowski, J.: Realtime Navigation in Highly Complex 3D-Scenes Using JPEG Compression, to appear at Proceedings of 4. GI-Informatiktage 2002, Bad Schussenried, Germany.
  12. Mueck, B.; Dangelmaier, W.;Fischer, M.; Klemisch, W.: Bi-directional Coupling of Simulation Tools with a Walkthrough System, in Proc. 13th Conference on Simulation and Visualization 2002, pages 71-84, 2002.
  13. Brinkmann, A.; Salzwedel, K.; Scheideler, C.: Compact, Adaptive Placement Schemes for Non-Uniform Distribution Requirements, in Proceedings of the 14th ACM Annual Symp. on Parallel Algorithms and Architectures (SPAA 2002), pages 53-62, 2002.
  14. Damerow, V.; Finschi, L.; Ziegler, M.: Point Location Algorithms of Minimum Size, in Proceedings of the 14th Canadian Conference on Computational Geometry (CCCG'02), pages 5-9, 2002.
  15. Brattka, V., Ziegler, M.: Computability of Linear Equations, in Foundations of Information Technology, Proceedings of the 2nd IFIP International Conference on Theoretical Computer Science (TCS2002@Montreal), pages 95-106, 2002.
  16. Ziegler, M.: Computability on Regular Subsets of Euclidean Space, in Mathematical Logic Quarterly (MLQ), Vol.48 (2002) Supplement 1 on Dagstuhl Seminar on Computability and Complexity in Analysis, pages 157-181, 2002.
  17. Bodlaender, H.L.; Broersma, H.; Fomin, F.V.; Pyatkin, A.V.; Woeginger, G.J.: Radio labeling with pre-assigned frequencies, to appear at Proc. 10th European Symposium on Algorithms (ESA'02), 2002.
  18. Fomin, F.V.; Matamala, M.; Rapaport, I.: The complexity of approximating the oriented diameter of chordal graphs, to appear at Proc. W'shop on Graph-Theoretic Concepts in Computer Science (WG2002), 2002.
  19. Broersma, H.; Fomin, F..; Nesetril, J.; Woeginger, G.J.: More about subcolorings, to appear at Proc. W'shop on Graph-Theoretic Concepts in Computer Science (WG2002), 2002.
  20. Bodlaender, H.L.; Fomin, F.V.: Tree decompositions with small cost, Technical Report UU-CS-2002-01, Institute for Information and Computing Sciences, Utrecht University (2002), to appear at Eighth Scandinavian Workshop on Algorithm Theory (SWAT2002), 2002.
  21. Broersma, H.; Fomin, F.V.; Kratochvil, J.; Woeginger, G.J.: Planar graph coloring with forbidden subgraphs: Why trees and paths are dangerous, to appear at Eighth Scandinavian Workshop on Algorithm Theory (SWAT2002), 2002.
  22. Fomin, F. V.: Pathwidth of planar and line graphs, to appear at Graphs and Combinatorics, 2002.
  23. Fomin, F. V.; Kratsch, D.; Novelli, J.-C.: Approximating minimum cocolourings, to appear at Information Processing Letters, 2002.
  24. Fomin, F.V.; Thilikos, D.: On the Monotonicity of Games Generated by Symmetric Submodular Functions, to appear at special issue of Discrete Applied Mathematics on submodularity, 2002.
  25. Fomin, F.V.; Kratsch, D.; Müller, H.: On a domination search number, to appear at Discrete Applied Mathematics, 2002.
  26. Fomin, F.V., Golovach, P.A.: Interval degree and bandwidth of a graph, to appear at Discrete Appled Mathematics, 2002.
  27. Bodlaender, H.; Fomin, F.V.: Approximation of pathwidth of outerplanar graphs, Journal on Algorithms 43, pages 190-200, 2002.
  28. Fomin, F.V.; Lingas, A.: Approximation algorithms for time time-dependent orienteering, in Inform. Proc. Letters 83(2), pages 57-62, 2002.
  29. Krick, C.; Meyer auf der Heide, F.; Räcke, H.; Vöcking, B.; Westermann, M.: Data Management in Networks: Experimental Evaluation of a Provably Good Strategy, Theory of Computing Systems, 35, pages 217-245, 2002.
  30. Czumaj, A.; Ergun, F.; Fortnow, L.; Magen, A.; Newman, I.; Rubinfeld, R.; Sohler, C.: Sublinear Approximation of Euclidean Minimum Spanning Tree, to appear at Proceedings of the 14th ACM-SIAM Symposium on Discrete Algorithms (SODA'03), 2003.

4.2 Patents

"Procedures for randomized data storage in storage networks", André Brinkmann, Friedhelm Meyer auf der Heide, and Ulrich Rückert (pending)

4.3 Prizes/Awards

Harald Räcke received the Machtey award for best student paper and the best paper award for his work "Minimizing Congestion in General Networks" at the 43th Annual IEEE Symposium on Foundations of Computer science, Vancouver, Canada, November 2002. This symposium and the ACM Symposium on Theory of Computing are the two world-wide most importants conferences on theoretical computer science.

The paper of Matthias Grünewald, Tamas Lukovszki, Christian Schindelhauer und Klaus Volbert "Distributed Maintenance of Resource Efficient Wireless Network Topologies" was rewarded as one of two "Distinguished Papers" at the Euro-Par-Conference.

4.4 Appointments/Chairs

Friedhelm Meyer auf der Heide declined a call to a professorship at the University of Augsburg.
Program comittee memberships: Director of DFG Collaborative Research Centre SFB 376, "Massively Parallel Computing: Algorithms, Design Methods, Applications"
Director of DFG Graduate College "Parallel Computer Networks in Production Technique"
Chairman of the Special Interest Group on Parallel and Distributed Algorithms of the "Gesellschaft für Informatik (GI)"
Elected reviewer of the "Deutsche Forschungsgesellschaft (DFG)"
Member of the Fachbeirat of the Max-Planck-Institute for Computer Science at Saarbrücken
Member of the Senate of the University of Paderborn

Rolf Wanka:
Secretary of the Special Interest Group on Parallel and Distributed Algorithms of the German Computer Society
Foreign Relationship Officer of the Paderborn Computer Science.

4.5 Projects

4.6 Industrial Cooperation

In cooperation with the Infineon Technologies AG (Munich), the BMBF project GigaNetIC aims at developing superfast low-loss digital MOS circuit technologies and systems for communication and network applications. The main focus of our activities is on basic techniques for system-on-a-chip architectures with special emphasis on communication protocols. Other participants in Paderborn are the working groups of Ulrich Rückert and Uwe Kastens.