News:

· This year, Guri Sohi will give in Medea the keynote:

“So what are we going to do with multicore processors?".

· The schedule is available. Each author has 16 minutes for presentation, plus 4 minutes for Q&A.

· Selected Papers will appear on Vol. 3, Issue N. 2 of Transactions on HIPEAC.

Scope and Motivation

MEDEA aims to continue the high level of interest of the previous editions held with PACT Conference since 2000.

Due to the ever-increasing gap between CPU and memory speed, there is always great interest in evaluating and proposing processor, multiprocessor, CMP, multi-core, SoC and system architectures dealing with the "memory wall" and wire-delay problems. At the same time, a modular high-level design is becoming more and more attracting in order to reduce design costs.

In this scenario, design solutions and their corresponding performance are shaped by the combined pressure of a) technological opportunities and limitations, b) features and organization of system architecture and c) critical requirements of specific application domains. Evaluating and controlling the effects on the memory subsystem (e.g. caches, interconnection, bus, memory, coherence) of any architectural proposal is extremely important both from the performance (e.g. bandwidth, latency, predictability) and power (e.g. static, dynamic, manageability) points of view.

In particular, the emerging trend of single-chip multi-core solutions, will push towards new design principles for memory hierarchy and interconnection networks, especially when the design is aimed to build systems with a high number of cores (many-core instead of multi-core systems), which aim to scale performance and power efficiency in a variety of application domains.

From a slightly different point of view, the mutual interaction between the application behavior and the system on which it executes, is responsible of the figures of merit of the memory subsystem and, therefore, pushes towards specific solutions. In addition, it can suggest specific compile/link time tunings for adapting the application to the features of the target architecture.

In the overall picture, power consumption requirements are increasingly important cross-cutting issues and raise specific challenges.

Typical architectural choices of interest include, but are not limited to, single processor, chip and board multiprocessors, SoC, traditional and tiled/clustered architectures, multithreaded or VLIW architectures with emphasis on single-chip design, massive parallelism designs, heterogeneous architectures, architectures equipped with application-domain accelerators as well as endowed with reconfigurable modules. Application domains encompass embedded (e.g. multimedia, mobile, automotive, automation, medical), commercial (e.g. Web, DB, multimedia), networking applications, security, etc. The emerging network on chip infrastructure may suggest new solutions and issues.

MEDEA Workshop wants to continue to be a forum for academic and industrial people to meet, discuss and exchange their ideas, experience and solutions in the design and evaluation of architectures for embedded, commercial and general/special purpose systems taking into account memory issues, both directly and indirectly.

Proceedings of the Workshop will be published under ACM ISBN, and appear also in the ACM Digital Library. Accepted papers will be considered for publication, in an extended version, for the March 2008 special issue of Transactions on HIPEAC (www.hipeac.net/journal).

The format of the workshop includes the presentation of selected papers and discussion after each presentation.

Topics of Interest:

- Memory hierarchy design, analysis, tuning for embedded, general and special purpose systems

- On-chip Multiprocessors and System On Chip architectures, development tools and applications
    o Issues in memory hierarchy design of scalable single chip systems
    o Memory hierarchy issues for heterogeneous and accelerator-based systems
    o Solutions for embedded, DSP, commercial, scientific and technical workloads
    o Inter-Chip and Intra-Chip memory latency tolerant and reduction techniques

- Cache coherence and memory models

- Compile/link time optimization techniques

- Bus/Interconnection Architectures

- Network On Chip

- Low-Power/Wire Delay design of memory hierarchies

- Processor and System Architectures

- Exploitation of application parallelism (e.g.: ILP, TLP, DLP)

- Academic/industrial experience in high performance, embedded systems and memory design

Organizing and Steering Committee

Sandro Bartolini, http://www.dii.unisi.it/~bartolini/

University of Siena, Italy

Pierfrancesco Foglia, foglia@iet.unipi.it

University of Pisa, Italy

Roberto Giorgi, http://www.dii.unisi.it/~giorgi/

University of Siena, Italy

Cosimo Antonio Prete, prete@iet.unipi.it

University of Pisa, Italy

Program Committee

Erik Altman, IBM T.J. Watson Research Center, NY, USA

Alessio Bechini, University of Pisa, Pisa, Italy

Koen Bertels, TUDELFT, Delft, NL

John Cavazos,University of Edinburgh,Edinburgh,UK

Bjoern Franke, University of Edinburgh, Edinburgh, UK

Georgi Gaydadjiev, TUDELFT, Delft, NL

Ali Hurson, Penn State University, University Park, PA, USA

Timothy Jones, University of Edinburgh, Edinburgh, UK

David Kaeli, Northeastern University, Boston, MA, USA

Krishna Kavy, University of North Texas, Denton, TX; USA

Stefanos Kaxiras, University of Patras, Greece

Stephen Keckler, University of Texas at Austin, TX, USA

Hiroaki Kobayashi, Tohoku University, Sendai, Japan

David M. Koppelman, Lousiana State University, Baton Rouge, LA, USA

Enrico Martinelli, University of Siena, Siena, Italy

Avi Mendelson, Intel, Haifa, Israel

Alexander Milencovich, University of Alabama, Huntsville, AL, USA

Yale Patt, University of Texas at Austin, TX, USA

Alex Ramirez, Universitat Politecnica de Catalunya, Barcelona, Spain

Toshinori Sato, Kyushu University, Iizuka, Japan

Per Stenström, Chalmers University of Technology, Sweden

Theo Ungerer, University of Augsburg, DE

Mateo Valero, Universitat Politecnica de Catalunya, Barcelona, Spain

Wei Zhang, Southern Illinois University, Carbondale, IL, USA

Information for Authors

The papers should be 6-8 pages in length. The abstracts and papers should be submitted in PDF format by email to Pierfrancesco Foglia and Sandro Bartolini.

Paper should be written in standard ACM SIG Proceedings Template. Hard copy (postal) submissions will not be accepted.

Please email submissions by July 23, 2007. You receive an acknowledgment of your submission by the following week. Authors will be notified of acceptance or rejection by August 14, 2007 and the final papers are due by August 25, 2007.

To speed-up the reviewing process, we encourage also submission of abstract by July 16, 2007.

All submissions will be refereed, and Proceedings with ACM ISBN will be printed and distributed at the workshop.

Important Dates


July 16, 2007	Abstract Submission (not mandatory)
July 23, 2007	Paper Submission Deadline
August 14, 2007	Acceptance Notification
August 25, 2007	Final Papers Due
September 16, 2007	Workshop will start

Registration and accommodation

Attendees are requested to go to the hosting PACT'07 conference to perform the registration to Medea-2007 and make room reservations in the conference hotels. We suggest that you make your room reservation in advance and register before the advance registration deadline to benefit from combined PACT'07/Medea'07 registration fees.

Final Program

· Keynote: Guri Sohi, "So what are we going to do with multicore processors?"

· Improving the Accuracy of Snoop Filtering Using Stream Registers

Valentina Salapura, Matthias Blumric, Alan Gara

· Improving Disk Bandwidth-Bound Applications Through Main Memory Compression

Vicenç Beltran, Jordi Torres, Eduard Ayguadé

· An On-Chip Cache Design for Vector Processors

Akihiro Musa, Yoshiei Sato, Ryusuke Egawa, Hiroyuki Takizawa, Koki Okabe, Hiroaki Kobayashi

· Parallelization Schemes for Memory Optimization on the Cell Processor: A Case Study of Image Processing Algorithm

Tarik Saidani, Stéphane Piskorski, Lionel Lacassagne, Samir Bouaziz

· A Power-Aware Shared Cache Mechanism Based on Locality Assessment of Memory Reference for CMPs

Isao Kotera, Ryusuke Egawa, Hiroyuki Takizawa, Hiroaki Kobayashi

· Multithreaded Software Transactional Memory and OpenMP

Miloš Milovanović, Roger Ferrer, Vladimir Gajinov, Osman S. Unsal, Adrian Cristal, Eduard Ayguadé, Mateo Valero

· Reducing leakage in power-saving capable caches for embedded systems by using a filter cache

P. Bennati, R. Giorgi

· Building a Large Instruction Window Through ROB Compression

Fernando Latorre, Grigorios Magklis, José González, Pedro Chaparro, Antonio González

· Analysis of Static and Dynamic Energy Consumption in NUCA Caches: Initial Results

A. Bardine, P. Foglia, G. Gabrielli, C. A. Prete

· Characterization of Apache web server with Specweb2005

Ana Bosque, Pablo Ibañez, Víctor Viñals, Per Stenström, Jose M. Llabería

· Code-Size Conscious Pipelining of Imperfectly Nested Loops

Mohammed Fellahi, Albert Cohen, Sid Touati

· Broadcast Filtering-Aware Task Assignment Techniques for Low-Power MPSoCs

Chun-Mok Chung and Jihong Kim

· Adaptive Management of Cache Block Replication for High-Performance CMP

Tomonobu Mihara, Koji Inoue, Kazuaki Murakami

· Data Prefetching and Address Pre-Calculation through Instruction Pre-Execution with Two-Step Physical Register Deallocation

Akihiro Yamamoto, Yusuke Tanaka, Hideki Ando, Toshio Shimada

· The STAPL pArray

Gabriel Tanase, Mauro Bianco, Nancy M. Amato, Lawrence Rauchwerger

Schedule

8-30 - Workshop begins: welcome message

8:40 – 10:00 Session I: CMP/Parallel Systems:

Parallelization Schemes for Memory Optimization on the Cell Processor: A Case Study of Image Processing Algorithm

Tarik Saidani¹, Stéphane Piskorski², Lionel Lacassagne¹, Samir Bouaziz¹

¹Institut d’Electronique Fondamentale, Orsay, France

²Laboratoire de Recherche en Informatique, Orsay, France

An On-Chip Cache Design for Vector Processors

Akihiro Musa, Yoshiei Sato, Ryusuke Egawa, Hiroyuki Takizawa, Koki Okabe, Hiroaki Kobayashi

Tohoku University, Sendai, Japan

Improving the Accuracy of Snoop Filtering Using Stream Registers

Valentina Salapura, Matthias Blumric, Alan Gara

IBM Thomas J. Watson Research Center, New York, USA

Adaptive Management of Cache Block Replication for High-Performance CMP

Tomonobu Mihara, Koji Inoue, Kazuaki Murakami

Kyushu University, Fukuoka, Japan

10:00 - 10:30 Morning Break

10:30 - 12:10 Session II: Interaction between Processor and Memory, Compiler, OS and Applications

Data Prefetching and Address Pre-Calculation through Instruction

Pre-Execution with Two-Step Physical Register Deallocation

Akihiro Yamamoto, Yusuke Tanaka, Hideki Ando, Toshio Shimada

Nagoya University, Nagoya, Japan

Building a Large Instruction Window Through ROB Compression

Fernando Latorre, Grigorios Magklis, José González, Pedro Chaparro, Antonio González

Intel Labs, UPC, Barcelona, Spain

Code-Size Conscious Pipelining of Imperfectly Nested Loops

Mohammed Fellahi¹, Albert Cohen¹, Sid Touati²

¹INRIA Futurs, Orsay, France

²PRiSM, University of Versailles,Versailles, France

Improving Disk Bandwidth-Bound Applications Through Main Memory Compression

Vicenç Beltran, Jordi Torres, Eduard Ayguadé

Technical University of Catalunya, Barcelona, Spain

Characterization of Apache web server with Specweb2005

Ana Bosque¹, Pablo Ibañez², Víctor Viñals², Per Stenström³, Jose M. Llabería¹

¹Universitat Politècnica de Catalunya, Barcelona, Spain

²Universidad de Zaragoza, Zaragoza, Spain

³Chalmers University of Technology, Goteborg, Sweden

12:10 - 13:45 Lunch at ARO Hotel

13:45 - 14:25 Session III: Software Issues

The STAPL pArray

Gabriel Tanase, Mauro Bianco, Nancy M. Amato, Lawrence Rauchwerger

Texas A&M University, College Station, TX, USA

Multithreaded Software Transactional Memory and OpenMP

Miloš Milovanović, Roger Ferrer, Vladimir Gajinov, Osman S. Unsal, Adrian Cristal, Eduard Ayguadé, Mateo Valero

Barcelona Supercomputing Center, Barcelona, Spain

14:25 - 15:15 Keynote: Guri Sohi. “So what are we going to do with multicore processors?”

15:15 - 15:45 Afternoon Break

15:45 - 17:15 Session IV: Managing Power and Energy

Broadcast Filtering-Aware Task Assignment Techniques for Low-Power MPSoCs

Chun-Mok Chung, Jihong Kim

Seoul National University, Seoul, Korea

Reducing leakage in power-saving capable caches for embedded systems by using a filter cache

P. Bennati, R. Giorgi,

University of Siena, Siena, Italy

Analysis of Static and Dynamic Energy Consumption in NUCA Caches: Initial Results

A. Bardine, P. Foglia, G. Gabrielli, C. A. Prete

University of Pisa, Pisa, Italy

A Power-Aware Shared Cache Mechanism Based on Locality Assessment of Memory Reference for CMPs

Isao Kotera, Ryusuke Egawa, Hiroyuki Takizawa, Hiroaki Kobayashi

Tohoku University, Sendai, Japan