NoSQL Database Design - Data Modelling and Query Answering

Research areas

Temporary Supervisor

Dr Qing Wang

Description

The current computing paradigm is shifting with huge, growing data on the web. This raises questions about limitations of traditional database technologies in big data computing such as scalable performance and flexible schema. In the past several years, the “NoSQL” community emerges and rapidly expands, targeting to solve various large-scale data storage, analysis and retrieval tasks in a non-relational database environment. This NoSQL movement has led to the development of a variety of NoSQL databases, some of which were proposed by influential Web 2.0 companies such as Amazon and Google. In general, NoSQL databases can be categorized into: 1) key-value data stores such as MemcacheDB, Redis and Voldemort; 2) document-oriented data stores such as MongoDB, CouchDB and Riak; 3) column-oriented data stores such as Apache’s Hbase, Cassandra and Google’s Bigtable; 4) graph databases such as Neo4j, VertexDB and AllegroGraph.

Goals

The goal of this project is to investigate design principles of NoSQL databases, and the related issues on data modelling and query answering.

Requirements

Students should have an understanding of relational databases and some programming experience.

Background Literature

(1) Bigtable: a distributed storage system for structured data, Fay Chang, Jeffrey Dean & et al, OSDI’06. (2) Dynamo: Amazons Highly Available Key-value Store, SOSP 2007. (3) Will NoSQL databases live up to their promise? Neal Leavitt, Computer, 43(2):12–14, 2010. (4) The MongoDb website: http://mongodb.org, including source code and documentation.

Updated:  1 June 2019/Responsible Officer:  Dean, CECS/Page Contact:  CECS Marketing