Abstract: Owing to their excellent price-performance ratio, clusters built from commodity nodes have been broadly adopted as platforms for parallel processing. Among them, clusters of standard PCs interconnected with high-speed system area networks (SANs) are especially attractive and widely established. At the same time, developments in interconnection technologies have formed the basis for the rise of Non-Uniform Memory Access (NUMA) architectures, i.e. systems with physically distributed memories but a global address space that allows efficient, though non-uniform, access to any memory location in the system. Such systems, especially when offered as non-cache-coherent NUMA for loosely coupled commodity architectures, can be implemented in a straightforward manner without major hardware effort. They form a favorable architectural tradeoff by combining the scalability and cost-effectiveness of standard clusters with shared memory support close to that of symmetric multiprocessors. The non-uniform memory access characteristic, however, introduces a distinction between local and remote memory, with different access latencies: a remote memory access can take up to an order of magnitude longer than a local one. As a result, many shared memory applications initially fail to achieve good parallel speedup on NUMA-like architectures because of excessive remote memory accesses. This thesis targets such inefficiency problems of NUMA-based shared memory programs. For this purpose, a comprehensive and integrated tool environment has been built that aims to improve the data locality of running applications by combining individual frameworks that enable both program tuning and runtime manipulation. This environment comprises a low-level data acquisition system, a distributed tool middleware, and a set of performance tools.
Based on the hardware monitoring facility, which is capable of observing all memory transactions performed on the interconnection fabric, the data acquisition system provides information about an application’s memory access behavior, as well as information about, e.g., synchronization primitives and the address mapping necessary for data placement. This information is then aggregated across the distributed system and made accessible to the tools through an established on-line monitoring interface specification serving as middleware. The tools further process the acquired performance data and use it to steer program execution so as to optimize the runtime data layout. Currently, two such tools have been implemented: a Data Layout Visualizer (DLV) and an Adaptive Runtime System (ARS). DLV is used to present the monitoring data in an easy-

Data locality optimization of shared memory programs on NUMA architectures using an integrated tool environment
