Upcoming Talks
Recent Talks
- Apr 25 — Brown Big Data Symp.
- Mar 27 — Microsoft Research
- Mar 25 — Intel Labs
- Mar 21 — Carnegie Mellon
- Mar 18 — Univ. of Virginia
- Mar 14 — Univ. of Michigan
- Mar 11 — Harvard
- Mar 7 — Univ. of Wisconsin
- Mar 5 — Northwestern
- Feb 28 — Cloudera
- Feb 28 — Facebook (Infrastructure)
- Feb 27 — Basho
- Feb 27 — Quantcast
- Feb 27 — MemSQL
- Feb 25 — IBM Research
- Feb 22 — Univ. of Chicago
- Feb 19 — Univ. of Massachusetts
- Feb 6 — VoltDB
- Jan 29 — ISTC Research Retreat
Contact Info
- Office: CIT 317
- Email: pavlo@cs.brown.edu
- Twitter: @andy_pavlo
- GitHub: apavlo
Project Links
I am a Ph.D. Candidate in the Brown Data Management Research Group under Stan Zdonik and Michael Stonebraker. My research is centered on developing optimization techniques for high-performance NewSQL database management systems (DBMS) for data-intensive applications. I am the lead graduate student working on the H-Store distributed DBMS. Its design has been commercialized as VoltDB. In my salad days, I was a systems programmer for the Condor Project at the University of Wisconsin-Madison under Miron Livny.
Job Search Materials:
Publications & Papers:
Benchmarking OLTP/Web Databases in the Cloud: The OLTP-Bench Framework
Carlo A. Curino, Djellel E. Difallah, Andrew Pavlo, and Philippe Cudre-Mauroux
CloudDB '12, pages. 17—20, October 2012.
Skew-Aware Automatic Database Partitioning in Shared-Nothing, Parallel OLTP Systems
Andrew Pavlo, Carlo Curino, and Stanley Zdonik
Proceedings of SIGMOD, pages. 61—72, May 2012. [INFO]
On Predictive Modeling for Optimizing Transaction Execution in Parallel OLTP Systems
Andrew Pavlo, Evan P. C. Jones, and Stanley Zdonik
Proceedings of the VLDB Endowment, vol. 5, pages. 85—96, October 2011.
MapReduce and Parallel DBMSs: Friends or Foes?
Michael Stonebraker, Daniel Abadi, David. J. DeWitt, Samuel Madden, Erik Paulson, Andrew Pavlo, and Alexander Rasin
Communications of the ACM, vol. 53, iss. 1, pages. 64—71, January 2010. [INFO]
A Comparison of Approaches to Large-Scale Data Analysis
Andrew Pavlo, Erik Paulson, Alexander Rasin, Daniel J. Abadi, David J. DeWitt, Samuel Madden, and Michael Stonebraker
Proceedings of SIGMOD, pages. 165—178, June 2009. [INFO]
H-Store: A High-Performance, Distributed Main Memory Transaction Processing System
Robert Kallman, Hideaki Kimura, Jonathan Natkins, Andrew Pavlo, Alexander Rasin, Stanley Zdonik, Evan P.C. Jones, Yang Zhang, Samuel Madden, Michael Stonebraker, John Hugg, Daniel J. Abadi
Proceedings of the VLDB Endowment, vol. 1, iss. 2, pages. 1496—1499, 2008.
Pegasus and DAGMan from Concept to Execution: Mapping Scientific Workflows onto Today's Cyberinfrastructure
Ewa Deelman, Miron Livny, Gaurang Mehta, Andrew Pavlo, Gurmeet Singh, Mei-Hui Su, Karan Vahi, and R. Kent Wenger
High Performance Computing and Grids in Action, volume 16, pages. 56—74. March 2008. [INFO]
The NMI Build & Test Laboratory: Continuous Integration Framework for Distributed Computing Software
Andrew Pavlo, Peter Couvares, Rebekah Gietzel, Anatoly Karp, Ian D. Alderman, Miron Livny, and Charles Bacon
Proceedings of LISA, pages. 263—273, December 2006.
Smoother Transitions Between Breadth-First-Spanning-Tree-based Drawings
Christopher Homan and Andrew Pavlo and Jonathan Schull
Proceedings of Graph Drawing, pages. 442—445, September 2006.
Miscellaneous Writings:
Tastes Great, Less Filling: Low-Impact OLAP MapReduce Queries on High-Performance OLTP Systems
Xin Jia, Andrew Pavlo, and Stanley Zdonik
Tiny Transactions on Computer Science, vol 1, August 2012. [INFO]
Not Your Traditional Data Management - HPTS Conference Report
Andrew Pavlo
USENIX ;login:, pages. 76—78, February 2012. [INFO]
Next Generation Database Systems @ Brown CS
Andrew Pavlo and Ugur Cetintemel
Conduit Magazine, Fall 2011. [INFO]
Graffiti Networks: A Subversive, Internet-Scale File Sharing Model
Andrew Pavlo and Ning Shi
CoRR, January 2009. [INFO]
The Penurious Scorned
Andrew Pavlo
City of Madison Parking Division Appeal Letter, November 2006.
A Parent-Centered Radial Layout Algorithm for Interactive Graph Visualization and Animation
Andrew Pavlo, Christopher Homan, and Jonathan Schull
CoRR, June 2006.
Interactive, Tree-based Graph Visualization
Andrew Pavlo
Master's Thesis, Rochester Institute of Technology, 2006. [INFO]
Invited Talks:
Collected Data Sets:
For my various projects, I often find that most of the data I need is not readily available. I am therefore sharing data sets for various information that I have collected from either screen scraping or using Amazon's Mechanical Turk.
Stock Market Historical Data (1970 to 2006)
Screen Scraping - Multiple Sites
Penny Stocks Market Data (Intraday 2008 to 2011)
Screen Scraping - Yahoo! Finance
Fortune 500 & 1000 Contact Information
Amazon Mechanical Turk
List of Open MediaWiki Sites
Screen Scraping - Google
BitTorrent User Trace Information
Tracker Scraping - The Pirate Bay