|
Marcin Żukowski
|
contact
vectorwise
research
publications
talks
varia
Contact
To contact me, please write to marcin at vectorwise dot com.
VectorWise
In 2008 the MonetDB/X100 project described further on this page
has been converted into VectorWise,
a company which I currently manage.
VectorWise cooperates with Ingres,
the leading open-source database company,
on the Ingres
VectorWise product.
Research
Between 2001 and 2008 I was a member of the MonetDB development team.
My initial work with MonetDB concentrated on parallel query execution,
which resulted in my master's thesis
(pdf).
From 2003 I was working on the MonetDB/X100 project under the supervision of
Peter Boncz and
Martin Kersten.
This project resulted in my PhD thesis and was a base for the VectorWise company
MonetDB/X100 [DEBull05, VLDB05]
introduced a novel query processing architecture, based
on the concept of "vectorized in-cache processing"
[CIDR05].
Comparing to the traditional tuple-at-a-time processing model, it
significantly reduces the interpretation overheads and makes most of
the CPU time go into the actual data processing. Additionally, this
architecture allows efficient use of modern processing hardware
[CIDR05].
The project resulted in various innovations in the area of query execution,
including high-efficiency hash-tables [DAMON06],
novel approach to in-flight data storage [DAMON08],
and a new method of handling updates [SIGMOD10]
In-memory, X100 can provide even ca. 1GB/s data
processing throughput on a single CPU.
This high performance calls for high-bandwidth storage facility.
Here, X100 uses a column-based storage, that introduces
two bandwidth-improving techniques [BNCOD05].
"Lightweight in-cache compression" [ICDE06] allows trading some
processor time for an increased perceived disk bandwidth.
"Cooperative scans" [VLDB07] dynamically schedule query activity and I/O operations
to amortize the cost of disk access among multiple queries. As a
result, X100 has been shown to provide very high performance both for
in-memory [CIDR05] and disk-based [ICDE06]
database tasks, as well as in large-scale information retrieval scenarios
[TREC06, CIDR07, VLDBJ08].
Additionally, the ideas behind MonetDB/X100 were influential for a prototype query execution
kernel for STI Cell processors [DAMON07].
Publications
-
"Balancing Vectorized Query Execution with Bandwidth-Optimized Storage"
Marcin Zukowski
PhD Thesis, University of Amsterdam, Amsterdam, The Netherlands, September 2009.
-
"Positional Update Handling in Column Stores">
Sandor Heman, Marcin Zukowski, Niels Nes, Lefteris Sidirourgos, Peter Boncz
SIGMOD 2010, Indianapolis, USA, June 2010
-
"DSM vs. NSM: CPU Performance Tradeoffs
in Block-Oriented Query Processing"
Marcin Zukowski, Niels Nes, Peter Boncz.
DAMON 2008 (collocated with SIGMOD 2008),
Vancouver, Canada, June 2008. Best paper award.
-
"Flexible and Efficient IR using Array Databases"
Roberto Cornacchia, Sandor Heman, Marcin Zukowski, Arjen de Vries, Peter Boncz.
VLDB Journal, Vol.17, No.1, January 2008
-
"Cooperative Scans: Dynamic Bandwidth Sharing in a DBMS"
Marcin Zukowski, Sandor Heman, Niels Nes, Peter Boncz.
VLDB 2007, Vienna, Austria, September 2007
-
"Vectorized Data Processing on the Cell Broadband Engine"
Sandor Heman, Niels Nes, Marcin Zukowski, Peter Boncz.
DAMON 2007 (collocated with SIGMOD 2007),
Beijing, China, June 2007. Best paper award.
-
"Efficient and Flexible Information Retrieval Using MonetDB/X100"
Sandor Heman, Marcin Zukowski, Arjen de Vries, Peter Boncz.
CIDR 2007, Asilomar, USA, January 2007
-
"MonetDB/X100 at the 2006 TREC TeraByte Track"
Sandor Heman, Marcin Zukowski, Arjen de Vries, Peter Boncz.
TREC 2006, Gaithersburg, USA, November 2006
-
"Architecture-Conscious Hashing"
Marcin Zukowski, Sandor Heman, Peter Boncz.
DAMON 2006 (collocated with SIGMOD 2006),
Chicago, USA, June 2006
-
"Super-Scalar RAM-CPU Cache Compression"
Marcin Zukowski, Sandor Heman, Niels Nes, Peter Boncz.
ICDE 2006, Atlanta, USA, April 2006.
-
"Hardware Conscious DBMS Architecture for Data-Intensive Applications"
Marcin Zukowski.
VLDB 2005 PhD Workshop, Trondheim, Norway, August 2005.
-
"Improving I/O Bandwidth for Data-Intensive Applications"
Marcin Zukowski.
BNCOD 2005 Doctoral Consortium, Sunderland, UK, July
2005.
-
"MonetDB/X100: A DBMS In The CPU Cache"
Marcin Zukowski, Peter Boncz, Niels Nes, Sandor Heman.
Published in IEEE Data Engineering Bulletin, June 2005, Vol. 28 No.
2.
-
"MonetDB/X100: Hyper-Pipelining Query Execution"
Peter Boncz, Marcin Zukowski, Niels Nes.
CIDR 2005, Asilomar, CA, USA, January
2005.
Talks
- 2009.12.01
"VectorWise - RapidMiner Demo Case: Accelerate-the-Future"
OSBI 2009, Schloss Romrod, Germany
- 2006.10.06
"MonetDB/X100 - Vectorized in-cache database kernel"
Google, Mountain View, USA
- 2006.01.01 -
"Architecture-Conscious Hashing"
Internal presentation, CWI, Amsterdam, NL
- 2005.12.08 -
"Applying I/O prefetching to improve DML performance (or My Microsoft
Adventure)"
Internal presentation, CWI, Amsterdam, NL
- 2004.12.04 -
"MonetDB/X100"
Dutch-Belgium DataBase Day,
Antwerp, Belgium
- 2004.10.14 -
"x100 query engine"
Second MultimediaN Workshop on Ambient Multimedia Databases.
University of Twente, Enschede, NL
- 2004.08.12 -
"DBMS Classics: Volcano + Volcano, MonetDB and x100 comparison"
internal presentation, CWI, Amsterdam, NL
- 2004.06.03 -
"x100 - state of the project"
internal presentation, CWI, Amsterdam, NL
- 2002.08.15 -
"Parallel Query Execution in Monet on SMP Machines"
Master's thesis presentation, CWI, Amsterdam, NL
Varia