News:

·        This year, Guri Sohi will give in Medea the keynote:

“So what are we going to do with multicore processors?".

·        The schedule is available. Each author has 16 minutes for presentation, plus 4 minutes for Q&A.

·        Selected Papers will appear on Vol. 3, Issue N. 2 of Transactions on HIPEAC.

 

 

Scope and Motivation  

MEDEA aims to continue the high level of interest of the previous editions held with PACT Conference since 2000.

Due to the ever-increasing gap between CPU and memory speed, there is always great interest in evaluating and proposing processor, multiprocessor, CMP, multi-core, SoC and system architectures dealing with the "memory wall" and wire-delay problems. At the same time, a modular high-level design is becoming more and more attracting in order to reduce design costs.

In this scenario, design solutions and their corresponding performance are shaped by the combined pressure of a) technological opportunities and limitations, b) features and organization of system architecture and c) critical requirements of specific application domains. Evaluating and controlling the effects on the memory subsystem (e.g. caches, interconnection, bus, memory, coherence) of any architectural proposal  is extremely important both from the performance (e.g. bandwidth, latency, predictability) and power (e.g. static, dynamic, manageability) points of view.

In particular, the emerging trend of single-chip multi-core solutions, will push towards new design principles for memory hierarchy and interconnection networks, especially when the design is aimed to build systems with a high number of cores (many-core instead of multi-core systems), which aim to scale performance and power efficiency in a variety of application domains.

From a slightly different point of view, the mutual interaction between the application behavior and the system on which it executes, is responsible of the figures of merit of the memory subsystem and, therefore, pushes towards specific solutions. In addition, it can suggest specific compile/link time tunings for adapting the application to the features of the target architecture.

In the overall picture, power consumption requirements are increasingly important cross-cutting issues and raise specific challenges.

Typical architectural choices of interest include, but are not limited to, single processor, chip and board multiprocessors, SoC, traditional and tiled/clustered architectures, multithreaded or VLIW architectures with emphasis on single-chip design, massive parallelism designs, heterogeneous architectures, architectures equipped with application-domain accelerators as well as endowed with reconfigurable modules. Application domains encompass embedded (e.g. multimedia, mobile, automotive, automation, medical), commercial (e.g. Web, DB, multimedia), networking applications, security, etc. The emerging network on chip infrastructure may suggest new solutions and issues.

MEDEA Workshop wants to continue to be a forum for academic and industrial people to meet, discuss and exchange their ideas, experience and solutions in the design and evaluation of architectures for embedded, commercial and general/special purpose systems taking into account memory issues, both directly and indirectly.

Proceedings of the Workshop will be published under ACM ISBN, and appear also in the ACM Digital Library. Accepted papers will be considered for publication, in an extended version, for the March 2008 special issue of Transactions on HIPEAC (www.hipeac.net/journal).

 

The format of the workshop includes the presentation of selected papers and discussion after each presentation.

 


Topics of Interest:

- Memory hierarchy design, analysis, tuning for embedded, general and special purpose systems

- On-chip Multiprocessors and System On Chip architectures, development tools and applications
    o Issues in memory hierarchy design of scalable single chip systems
    o Memory hierarchy issues for heterogeneous and accelerator-based systems
    o Solutions for embedded, DSP, commercial, scientific and technical workloads
    o Inter-Chip and Intra-Chip memory latency tolerant and reduction techniques

- Cache coherence and memory models

- Compile/link time optimization techniques

- Bus/Interconnection Architectures

- Network On Chip

- Low-Power/Wire Delay design of memory hierarchies

- Processor and System Architectures

- Exploitation of application parallelism (e.g.: ILP, TLP, DLP)

- Academic/industrial experience in high performance, embedded systems and memory design


Organizing and Steering Committee

 

Sandro Bartolinihttp://www.dii.unisi.it/~bartolini/

University of Siena, Italy 

Pierfrancesco Foglia, foglia@iet.unipi.it

University of Pisa, Italy 

Roberto Giorgi, http://www.dii.unisi.it/~giorgi/

University of Siena, Italy 

Cosimo Antonio Prete, prete@iet.unipi.it

University of Pisa, Italy

 


Program Committee

 

Erik Altman, IBM T.J. Watson Research Center, NY, USA

Alessio Bechini, University of Pisa, Pisa, Italy

Koen Bertels, TUDELFT, Delft, NL

John Cavazos,University of Edinburgh,Edinburgh,UK

Bjoern Franke, University of Edinburgh, Edinburgh, UK

Georgi Gaydadjiev, TUDELFT, Delft, NL

Ali Hurson, Penn State University, University Park, PA, USA

Timothy Jones, University of Edinburgh, Edinburgh, UK

David Kaeli, Northeastern University, Boston, MA, USA

Krishna Kavy, University of North Texas, Denton, TX; USA

Stefanos Kaxiras, University of Patras, Greece

Stephen Keckler, University of Texas at Austin, TX, USA

Hiroaki Kobayashi, Tohoku University, Sendai, Japan

David M. Koppelman, Lousiana State University, Baton Rouge, LA, USA

Enrico Martinelli, University of Siena, Siena, Italy

Avi Mendelson, Intel, Haifa, Israel

Alexander Milencovich, University of Alabama, Huntsville, AL, USA

Yale Patt, University of Texas at Austin, TX, USA

Alex Ramirez, Universitat Politecnica de Catalunya, Barcelona, Spain

Toshinori Sato, Kyushu University, Iizuka, Japan

Per Stenström, Chalmers University of Technology, Sweden

Theo Ungerer, University of Augsburg, DE

Mateo Valero, Universitat Politecnica de Catalunya, Barcelona, Spain

Wei Zhang, Southern Illinois University, Carbondale, IL, USA

 

 


Information for Authors 

The papers should be 6-8 pages in length. The abstracts and papers should be submitted in PDF format by email to  Pierfrancesco Foglia and Sandro Bartolini

Paper should be written in standard ACM SIG Proceedings Template. Hard copy (postal) submissions will not be accepted. 

Please email submissions by July 23, 2007. You receive an acknowledgment of your submission by the following week. Authors will be notified of acceptance or rejection by August 14, 2007 and the final papers are due by August 25, 2007

To speed-up the reviewing process, we encourage also submission of abstract by July 16, 2007

All submissions will be refereed, and Proceedings with ACM ISBN will be printed and distributed at the workshop. 


Important Dates

 

 

July 16, 2007

Abstract Submission (not mandatory) 

July 23, 2007

Paper Submission Deadline

August 14, 2007

Acceptance Notification

August 25, 2007

Final Papers Due

September 16, 2007

Workshop will start


Registration and accommodation

Attendees are requested to go to the hosting PACT'07 conference to perform the registration to Medea-2007 and make room reservations in the conference hotels. We suggest that you make your room reservation in advance and register before the advance registration deadline to benefit from combined PACT'07/Medea'07 registration fees.


Final Program

·        Keynote: Guri Sohi, "So what are we going to do with multicore processors?"

 

 

·        Improving the Accuracy of Snoop Filtering Using Stream Registers

Valentina Salapura, Matthias Blumric, Alan Gara

 

 

·        Improving Disk Bandwidth-Bound Applications Through Main Memory Compression

Vicenç Beltran, Jordi Torres, Eduard Ayguadé

 

 

·        An On-Chip Cache Design for Vector Processors

Akihiro Musa, Yoshiei Sato, Ryusuke Egawa, Hiroyuki Takizawa, Koki Okabe, Hiroaki Kobayashi

 

 

·        Parallelization Schemes for Memory Optimization on the Cell Processor: A Case Study of Image Processing Algorithm

Tarik Saidani, Stéphane Piskorski, Lionel Lacassagne, Samir Bouaziz

 

 

·        A Power-Aware Shared Cache Mechanism Based on Locality Assessment of Memory Reference for CMPs

Isao Kotera, Ryusuke Egawa, Hiroyuki Takizawa, Hiroaki Kobayashi

 

 

·        Multithreaded Software Transactional Memory and OpenMP

Miloš Milovanović, Roger Ferrer, Vladimir Gajinov, Osman S. Unsal, Adrian Cristal, Eduard Ayguadé, Mateo Valero

 

 

·        Reducing leakage in power-saving capable caches for embedded systems by using a filter cache

P. Bennati, R. Giorgi

 

 

·        Building a Large Instruction Window Through ROB Compression

Fernando Latorre, Grigorios Magklis, José González, Pedro Chaparro, Antonio González

 

 

·        Analysis of Static and Dynamic Energy Consumption in NUCA Caches: Initial Results

A. Bardine, P. Foglia, G. Gabrielli, C. A. Prete

 

 

·        Characterization of Apache web server with Specweb2005

Ana Bosque, Pablo Ibañez, Víctor Viñals, Per Stenström, Jose M. Llabería

 

 

·        Code-Size Conscious Pipelining of Imperfectly Nested Loops

Mohammed Fellahi, Albert Cohen, Sid Touati

 

 

·        Broadcast Filtering-Aware Task Assignment Techniques for Low-Power MPSoCs

Chun-Mok Chung and Jihong Kim

 

 

·        Adaptive Management of Cache Block Replication for High-Performance CMP

Tomonobu Mihara, Koji Inoue, Kazuaki Murakami

 

 

·        Data Prefetching and Address Pre-Calculation through Instruction Pre-Execution with Two-Step Physical Register Deallocation

Akihiro Yamamoto, Yusuke Tanaka, Hideki Ando, Toshio Shimada

 

 

·        The STAPL pArray

Gabriel Tanase, Mauro Bianco, Nancy M. Amato, Lawrence Rauchwerger

 


Schedule

8-30 -  Workshop begins: welcome message

 

 

8:40 – 10:00  Session I: CMP/Parallel Systems:

 

Parallelization Schemes for Memory Optimization on the Cell Processor: A Case Study of Image Processing Algorithm
Tarik Saidani1, Stéphane Piskorski2, Lionel Lacassagne1, Samir Bouaziz1
1Institut d’Electronique Fondamentale, Orsay, France
2Laboratoire de Recherche en Informatique, Orsay, France

 

An On-Chip Cache Design for Vector Processors
Akihiro Musa, Yoshiei Sato, Ryusuke Egawa, Hiroyuki Takizawa, Koki Okabe, Hiroaki Kobayashi
Tohoku University, Sendai, Japan
 
Improving the Accuracy of Snoop Filtering Using Stream Registers
Valentina Salapura, Matthias Blumric, Alan Gara
IBM Thomas J. Watson Research Center, New York, USA
 
Adaptive Management of Cache Block Replication for High-Performance CMP
Tomonobu Mihara, Koji Inoue, Kazuaki Murakami
Kyushu University, Fukuoka, Japan

 

 

10:00 - 10:30 Morning Break

 

 

10:30 - 12:10  Session II: Interaction between Processor and Memory, Compiler, OS and Applications

 

Data Prefetching and Address Pre-Calculation through Instruction 
Pre-Execution with Two-Step Physical Register Deallocation
Akihiro Yamamoto, Yusuke Tanaka, Hideki Ando, Toshio Shimada
Nagoya University, Nagoya, Japan
 
Building a Large Instruction Window Through ROB Compression
Fernando Latorre, Grigorios Magklis, José González, Pedro Chaparro, Antonio González
Intel Labs, UPC, Barcelona, Spain

 

Code-Size Conscious Pipelining of Imperfectly Nested Loops 
Mohammed Fellahi1, Albert Cohen1, Sid Touati2
1INRIA Futurs, Orsay, France
2PRiSM, University of Versailles,Versailles, France
 
Improving Disk Bandwidth-Bound Applications Through Main Memory Compression
Vicenç Beltran, Jordi Torres, Eduard Ayguadé
Technical University of Catalunya, Barcelona, Spain

 

Characterization of Apache web server with Specweb2005
Ana Bosque1, Pablo Ibañez2, Víctor Viñals2, Per Stenström3, Jose M. Llabería1
1Universitat Politècnica de Catalunya, Barcelona, Spain
2Universidad de Zaragoza, Zaragoza, Spain
3Chalmers University of Technology, Goteborg, Sweden

 

 

12:10 - 13:45  Lunch at ARO Hotel

 

 

13:45 - 14:25 Session III: Software Issues

 

The STAPL pArray
Gabriel Tanase, Mauro Bianco, Nancy M. Amato, Lawrence Rauchwerger
Texas A&M University, College Station, TX, USA
 
Multithreaded Software Transactional Memory and OpenMP
Miloš Milovanović, Roger Ferrer, Vladimir Gajinov, Osman S. Unsal, Adrian Cristal, Eduard Ayguadé, Mateo Valero 
Barcelona Supercomputing Center, Barcelona, Spain

 

 

14:25 - 15:15 Keynote: Guri Sohi. So what are we going to do with multicore processors?

 

 

15:15 - 15:45  Afternoon Break

 

 

15:45 - 17:15 Session IV: Managing Power and Energy  

 
Broadcast Filtering-Aware Task Assignment Techniques for Low-Power MPSoCs
Chun-Mok Chung, Jihong Kim
Seoul National University, Seoul, Korea
 
Reducing leakage in power-saving capable caches for embedded systems by using a filter cache
P. Bennati, R. Giorgi,
University of Siena, Siena, Italy
 
Analysis of Static and Dynamic Energy Consumption in NUCA Caches: Initial Results
A. Bardine, P. Foglia, G. Gabrielli, C. A. Prete
University of Pisa, Pisa, Italy
 
A Power-Aware Shared Cache Mechanism Based on Locality Assessment of Memory Reference for CMPs
Isao Kotera, Ryusuke Egawa, Hiroyuki Takizawa, Hiroaki Kobayashi
Tohoku University, Sendai, Japan