SG-ColBase: A Relational Column Database Kernel

Tong Zhou; Tingzhen Liu

doi:10.20944/preprints202211.0220.v2

Submitted:

12 November 2022

Posted:

14 November 2022

You are already at the latest version

Abstract

At present, diversified and highly concurrent businesses in the Internet industry often require heterogeneous databases formed by multiple databases to meet the needs. This report introduces database kernel SG-ColBase we developed. After achieving read and write concurrency control, data rollback, atomic log writing, and downtime data redo to ensure complete transaction support. The parallelism of database kernel execution is extended through field level locks and snapshot reads. Use the Bloom filter, resource cache pool, memory pool, skip list, non blocking log cache, and asynchronous data writing mechanism to improve the overall execution efficiency of the system. In terms of data storage, column storage, logical key and LSM-tree are introduced. While improving the data compression ratio and reducing data gaps, all disk data operations are written in incremental order. With the characteristics of asynchronous batch operation, the data writing speed is greatly improved. Thanks to the continuous feature of vertical data brought by column storage, the disk scanning brought by vertical traversal is reduced, which is a qualitative leap in efficiency compared with traditional relational databases in the big data analysis scenario. SG-ColBase can reduce the use of heterogeneous databases in business and improve R&D efficiency.

Keywords:

Relational Database

;

Columnar Storage

;

Bloom Filter

;

Skip List

;

Field Level Lock

;

Read Write Concurrency

;

OLTP

;

OLAP

;

LSM-Tree

;

Token Bucket Algorithm

Subject:

Computer Science and Mathematics - Information Systems

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

SG-ColBase: A Relational Column Database Kernel

Abstract

Keywords:

Subject:

MDPI Initiatives

Important Links

Subscribe