Verdict is built upon the theories of approximate query processing (AQP) and our novel architecture of AQP-as-a-middleware. Verdict’s huge speedups are possible because, even from a small fraction of the entire data, we can reliably estimate many important statistics of the entire data. Verdict exploits that the values of many aggregate functions that commonly appear in analytic queries can be expressed using those statistics of the entire data, which can be estimated using samples.
Our research in approximate query processing has produced many research papers at premier database conferences.
- Barzan Mozafari. “Approximate Query Engines: Commercial Challenges and Research Opportunities.” SIGMOD 2017 Keynote.
- Yongjoo Park, Ahmad Shahab Tajik, Michael Cafarella, Barzan Mozafari. “Database Learning: Toward a Database that Becomes Smarter Every Time.” SIGMOD 2017.
- Yongjoo Park. “Active Database Learning” CIDR 2017.
- Yongjoo Park, Michael Cafarella, Barzan Mozafari. “Neighbor-Sensitive Hashing.” VLDB 2016.
- Yongjoo Park, Michael Cafarella, Barzan Mozafari. “Visualization-Aware Sampling for Very Large Databases.” ICDE 2016.
- Barzan Mozafari, and Ning Niu. “A Handbook for Building an Approximate Query Engine.” IEEE Data Engineering Bulletin, 2015.
- Barzan Mozafari. “Verdict: A System for Stochastic Query Planning.” CIDR 2015.
- Kai Zeng, Shi Gao, Barzan Mozafari and Carlo Zaniolo. “The Analytical Bootstrap: a New Method for Fast Error Estimation in Approximate Query Processing.” SIGMOD 2014.
- Sameer Agarwal, Henry Milner, Ariel Kleiner, Ameet Talwalkar, Michael Jordan, Samuel Madden, Barzan Mozafari and Ion Stoica. “Knowing When You’re Wrong: Building Fast and Reliable Approximate Query Processing Systems.” SIGMOD 2014.
- Kai Zeng, Shi Gao, Jiaqi Gu, Barzan Mozafari and Carlo Zaniolo. “ABS: a System for Scalable Approximate Queries with Accuracy Guarantees.” SIGMOD 2014.
- Sameer Agarwal, Barzan Mozafari, Aurojit Panda, Henry Milner, Samuel Madden, and Ion Stoica. “BlinkDB: Queries with Bounded Errors and Bounded Response Times on Very Large Data.” EuroSys 2013.
- Sameer Agarwal, Aurojit Panda, Barzan Mozafari, Anand P. Iyer, Samuel Madden, and Ion Stoica. “Blink and It’s Done: Interactive Queries on Very Large Data” PVLDB 2012.