Poster Session 1 and a reception will take place on Tuesday, September 1 at 17:00 in the Kohala Ballroom. This session contains the posters of the following papers:
- Shared Execution of Recurring Workloads in MapReduce, by Chuan Lei, Zhongfang Zhuang, Elke Rundensteiner, Mohamed Eltabakh
- A Performance Study of Big Data on Small Nodes, by Dumitrel Loghin, Bogdan Tudor, Hao Zhang, Beng Chin Ooi, Yong Meng Teo
- Understanding the Causes of Consistency Anomalies in Apache Cassandra, by Hua Fan, Aditya Ramaraju, Marlon McKenzie, Wojciech Golab, Bernard Wong
- Compaction management in distributed key-value datastores, by Muhammad Yousuf Ahmad, Bettina Kemme
- Fuzzy Joins in MapReduce: An Experimental Study, by Ben Kimmett, Venkatesh Srinivasan, Alex Thomo
- Sharing Buffer Pool Memory in Multi-Tenant Relational Database-as-a-Service, by Vivek Narasayya, Ishai Menache, Mohit Singh, Feng Li, Manoj Syamala, Surajit Chaudhuri
- Optimal Probabilistic Cache Stampede Prevention, by Andrea Vattani, Flavio Chierichetti, Keegan Lowenstein
- Indexing Highly Dynamic Hierarchical Data, by Jan Finis, Robert Brunel, Alfons Kemper, Thomas Neumann, Norman May, Franz Faerber
- BF-Tree: Approximate Tree Indexing, by Manos Athanassoulis, Anastasia Ailamaki
- SRS: Solving c-Approximate Nearest Neighbor Queries in High Dimensional Euclidean Space with a Tiny Index, by Yifang Sun, Wei Wang, Jianbin Qin, Ying Zhang, Xuemin Lin
- Rare Time Series Motif Discovery from Unbounded Streams, by Nurjahan Begum, Eamonn Keogh
- Beyond Itemsets: Mining Frequent Featuresets over Structured Items, by Saravanan Thirumuruganathan, Habibur Rahman, Sofiane Abbar, Gautam Das
- Mining Revenue-Maximizing Bundling Configuration, by Loc Do, Hady W. Lauw, Ke Wang
- ALID: Scalable Dominant Cluster Detection, by Lingyang Chu, Shuhui Wang, Siyuan Liu, Qingming Huang, Jian Pei
- Leveraging Graph Dimensions in Online Graph Search, by Yuanyuan Zhu, Jeffrey Xu Yu, Lu Qin
- Event Pattern Matching over Graph Streams, by Chunyao Song, Tingjian Ge, Cindy Chen, Jie Wang
- An Efficient Similarity Search Framework for SimRank over Large Dynamic Graphs, by Yingxia Shao, Bin Cui, Lei Chen, Mingming Liu, Xing Xie
- Growing a Graph Matching from a Handful of Seeds, by Ehsan Kazemi, Seyed Hamed Hassani, Matthias Grossglauser
- Association Rules with Graph Patterns, by Wenfei Fan, Xin Wang, Yinghui Wu, Jingbo Xu
- Efficient Top-K SimRank-based Similarity Join, by Wenbo Tao, Minghe Yu, Guoliang Li
- MOCgraph: Scalable Distributed Graph Processing Using Message Online Computing, by Chang Zhou, jun Gao, Binbin Sun, Jeffrey Xu Yu
- The More the Merrier: Efficient Multi-Source Graph Traversal, by Manuel Then, Moritz Kaufmann, Fernando Chirigati, Tuan-Anh Hoang-Vu, Kien Pham, Alfons Kemper, Thomas Neumann, Huy Vo
- Efficient Partial-Pairs SimRank Search on Large Networks, by Weiren Yu, Julie McCann
- Exploiting Vertex Relationships in Speeding up Subgraph Isomorphism over Large Graphs, by Xuguang Ren, Junhu Wang
- Preference-aware Integration of Temporal Data, by Bogdan Alexe, Mary Roth, Wang-Chiew Tan
- Optimizing the Chase: Scalable Data Integration under Constraints, by George Konstantinidis, Jose-Luis Ambite
- Supervised Meta-blocking, by George Papadakis, George Papastefanatos, Georgia Koutrika
- Enriching Data Imputation with Extensive Similarity Neighbors, by Shaoxu Song, Aoqian Zhang, Lei Chen, Jianmin Wang
- Answering Why-not Questions on Reverse Top-k Queries, by Yunjun Gao, Qing Liu, Gang Chen, Baihua Zheng, Linlin Zhou
- SnapToQuery: Providing Interactive Feedback during Exploratory Query Specification, by Lilong Jiang, Arnab Nandi
- Constructing an Interactive Natural Language Interface for Relational Databases, by Fei Li, H. V. Jagadish
- A Natural Language Interface for Querying General and Individual Knowledge, by Yael Amsterdamer, Anna Kukliansky, Tova Milo
- Possible and Certain SQL Keys, by Henning Kohler, Sebastian Link, Xiaofang Zhou
- D2P: Distance-Based Differential Privacy in Recommenders, by Rachid Guerraoui, Anne-Marie Kermarrec, Rhicheek Patra, Mahsa Taziki
- 35. Show Me the Money: Dynamic Recommendations for Revenue Maximization , by Wei Lu, Shanshan Chen, Keqian Li, Laks V. S. Lakshmanan
- Finish Them!: Pricing Algorithms for Human Computation, by Yihan Gao, Aditya Parameswaran
- TransactiveDB: Tapping into Collective Human Memories, by Michele Catasta, Alberto Tonon, Djellel Eddine Difallah, Gianluca Demartini, Karl Aberer, Philippe Cudre-Mauroux
- 38. Worker Skill Estimation in Team-Based Tasks , by Habibur Rahman, Saravanan Thirumuruganathan, Senjuti Basu Roy, Sihem Amer-Yahia, Gautam Das
- Scalable Subgraph Enumeration in MapReduce, by Longbin Lai, Lu Qin, Xuemin Lin, Lijun Chang
- 40. FrogWild! — Fast PageRank Approximations on Graph Engines , by Ioannis Mitliagkas, Michael Borokhovich, Alexandros Dimakis, Constantine Caramanis
- Pregel Algorithms for Graph Connectivity Problems with Performance Guarantees, by Da Yan, James Cheng, Kai Xing, Yi Lu, Wilfred Ng, Yingyi Bu
- Blogel: A Block-Centric Framework for Distributed Computation on Real-World Graphs, by Da Yan, James Cheng, Yi Lu, Wilfred Ng
- LogGP: A Log-based Dynamic Graph Partitioning Method, by Ning Xu, Lei Chen, Bin Cui
- Coordination Avoidance in Database Systems, by Peter Bailis, Alan Fekete, Michael Franklin, Ali Ghodsi, Joseph Hellerstein, Ion Stoica
- A Scalable Search Engine for Mass Storage Smart Objects, by Nicolas Anciaux, Saliha Lallali, Iulian Sandu Popa, Philippe Pucheral
- Schema Management for Document Stores, by Lanjun Wang, Oktie Hassanzadeh, Shuo Zhang, Juwei Shi, Limei Jiao, Jia Zou, Chen Wang
- Supporting Scalable Analytics with Latency Constraints, by Boduo Li, Yanlei Diao, Prashant Shenoy
- Principles of Dataset Versioning: Exploring the Recreation/Storage Tradeoff, by Souvik Bhattacherjee, Amit Chavan, Silu Huang, Amol Deshpande, Aditya Parameswaran
- Inferring Continuous Dynamic Social Influence and Personal Preference for Temporal Behavior Prediction, by Jun Zhang, Chaokun Wang, Jianmin Wang, Jeffrey Xu Yu
- Influential Community Search in Large Networks, by Rong-Hua LI, Lu Qin, Jeffrey Xu Yu, Rui Mao
- Linearized and Single-Pass Belief Propagation, by Wolfgang Gatterbauer, Stephan Gunnemann, Danai Koutra, Christos Faloutsos
- Online Topic-Aware Influence Maximization, by Shuo Chen, Ju Fan, Guoliang Li, Jianhua Feng, Kian-Lee Tan, Jinhui Tang
- Walk, Not Wait: Faster Sampling Over Online Social Networks, by Azade Nazi, Zhuojie Zhou, Saravanan Thirumuruganathan, Nan Zhang, Gautam Das
- Work-Efficient Parallel Skyline Computation for the GPU, by Kenneth Bogh, Sean Chester, Ira Assent
- Memory-Efficient Hash Joins, by R. Barber, G. Lohman, I. Pandis, V. Raman, R. Sidle, G. Attaluri, N. Chainani, S. Lightstone, D. Sharpe
- MRCSI: Compressing and Searching String Collections with Multiple References, by Sebastian Wandelt, Ulf Leser
- Trill: A High-Performance Incremental Query Processor for Diverse Analytics, by Badrish Chandramouli, Jonathan Goldstein, Mike Barnett, Robert DeLine, John Platt, James Terwilliger, John Wernsing
- Rapid Sampling for Visualizations with Ordering Guarantees, by Albert Kim, Eric Blais, Aditya Parameswaran, Piotr Indyk, Sam Madden, Ronitt Rubinfeld
- Argonaut: Macrotask Crowdsourcing for Complex Data Processing, by Adam Marcus, Lydia Gu, Daniel Haas, Jason Ansel
- FIT to monitor feed quality, by Tamraparni Dasu, Vladislav Shkapenyuk, Divesh Srivastava, Deborah Swayne
- ConfSeer: Leveraging Customer Support Knowledge Bases for Automated Misconfiguration Detection, by Rahul Potharaju, Navendu Jain
- Gobblin: Unifying Data Ingestion for Hadoop, by Lin Qiao, Kapil Surlaker, Shirshanka Das, Chavdar Botev, Yinan Li, Sahil Takiar, Henry Cai, Narasimha Veeramreddy, Min Tu, Ziyang Liu, Ying Dai
- Schema-Agnostic Indexing with Azure DocumentDB, by Dharma Shukla, Shireesh Thota, Karthik Raman, Madhan Gajendran, Ankur Shah, Sergii Ziuzin, Krishnan Sundaram, Anna Wawrzyniak, Samer Boshra, Mohamed Nassar, Michael Koltachev, Sudipta Sengupta, Justin Levandoski, David Lomet
- Scaling Spark in the Real World, by Michael Armbrust, Tathagata Das, Aaron Davidson, Ali Ghodsi, Andrew Or, Josh Rosen, Ion Stoica, Patrick Wendell, Reynold Xin, Matei Zaharia
- JetScope: Reliable and Interactive Analytics at Cloud Scale, by Eric Boutin, Jaliya Ekanayake, Anna Korsun, Jingren Zhou
- Towards Scalable Real-time Analytics: An Architecture for Scale-out of OLxP Workloads, by Jeffrey Pound, Anil Goel, Nathan Auch, Franz Faerber, Francis Gropengiesser, Christian Mathis, Thomas Bodner, Wolfgang Lehner, Scott MacLean, Peter Bumbulis
- Real-Time Analytical Processing with SQL Server, by Paul Larson, Adrian Birka, Eric Hanson, Weiyun Huang, Michal Novakiewicz, Vassilis Papadimos
- The Dataflow Model: A Practical Approach to Balancing Correctness, Latency, and Cost in Massive-Scale, Unbounded, Out-of-Order Data Processing, by Tyler Akidau, Robert Bradshaw, Craig Chambers, Slava Chernyak, Rafael Fernandez-Moctezuma, Reuven Lax, Sam McVeety, Daniel Mills, Frances Perry, Eric Schmidt, Sam Whittle
- Live Programming Support in the LogicBlox System, by Todd Green, Dan Olteanu , Geoffrey Washburn
- Indexing and Selecting Hierarchical Business Logic, by Anja Gruenheid, Alessandra Loro, Donald Kossman, Damien Profeta, Philippe Beaudequin
- Distributed Architecture of Oracle Database In-memory, by Niloy Mukherjee, Shasank Chavan, Maria Colgan, Dinesh Das, Mike Gleeson, Sanket Hase, Allison Holloway, Hui Jin, Jesse Kamp, Kartk Kulkarni, Tirthankar Lahiri, Juan Loaiza, Vineet Marwah, Andy Witkowski, Jiaqi Yan, Mohamed Zait
- Gorilla: Facebook’s Fast, Scalable, In-Memory Time Series Database, by Justin Teller, Scott Franklin, Tuomas Pelkonen, Paul Cavallaro
- Query Optimization in Oracle 12c Database In-Memory, by Dinesh Das, Jiaqi Yan, Mohamed Zait, Satya Valluri, Nirav Vyas, Ramarajan Krishnamachari, Prashant Gaharwar, Jesse Kamp, Niloy Mukherjee
- Building a Replicated Logging System with Apache Kafka, by Guozhang Wang, Joel Koshy, Sriram Subramanian, Kartik Paramasivam, Mammad Zadeh, Neha Narkhede, Jun Rao, Jay Kreps, Joe Stein
- Optimization of Common Table Expressions in MPP Database Systems, by Amr El-Helw, Venkatesh Raghavan, Mohamed Soliman, George Caragea, Zhongxian Gu, Michalis Petropoulos
- One Trillion Edges: Graph Processing at Facebook-Scale, by Avery Ching, Dionysios Logothetis, Sergey Edunov, Maja Kabiljo, Sambavi Muthukrishnan
- Differential Privacy in Telco Big Data Platform, by Xueyang Hu, Mingxuan Yuan, Jianguo Yao, Yu Deng, Lei Chen, Haibing Guan, Jia Zeng
- Efficient Evaluation of Object-Centric Exploration Queries for Visualization, by You Wu, Boulos Harb, Jun Yang, Cong Yu
- ACME: A Parallel Cloud-Oriented System for Extracting Frequent Patterns from a Very Long Sequence, by Majed Sahli
- Efficient Distributed Subgraph Similarity Matching, by Ye Yuan
- Efficient k-Closest Pair Queries in General Metric Spaces, by Yunjun Gao
- Task-Assignment Optimization in Knowledge Intensive Crowdsourcing, by Senjuti Basu Roy
- Data Profiling ñ A Survey, by Felix Naumann
- Data Generation for Testing and Grading SQL Queries, by Bikash Chandra/S Sudarshan