Condensation vs. database systems add graphs from presentation slides
Data is often stored in a database. Usually used on top of file systems, database systems provide a higher level of abstraction as compared to plain files. They take care of data serialization, and keep indices.
In Condensation, a higher abstraction layer is provided by documents.
Immutable vs. mutable
Database systems are mutable data systems, i.e. records and values can be modified in-place. Mutable data is simple and straightforward to use, but more difficult to synchronize, replicate, and distribute.
Condensation is an immutable storage system. Once a piece of data is written, it cannot be modified any more. Any change results in a new version of the data. Old versions are garbage collected when they are not needed any more. This allows for simple synchronization, replication, and distribution.
Conflict-free merging vs. locking
Database systems use locking (in time or space) to execute transactions. On centralized systems, locking requires careful design to work correctly and efficiently. On distributed systems, locking is difficult, and imposes a lot of constraints.
Condensation does not use locking, but allows multiple versions of the data to coexist. Different versions can be merged at any time.
Distributed vs. centralized
Classic database systems are centralized. Even if a database is running on multiple CPUs or servers, it provides a consistent state of the data to all clients. Clients store all their data in the central database.
Condensation is distributed. Every user stores its own data. If the network is down, users may see and edit different versions of some shared data, which will get synchronized as soon as the network is back up.
Client-centric vs. server-centric
In a database system, the bulk of the work is done by the server. Clients merely send queries for execution to the server. As a consequence, the server has access to all data.
In Condensation, the bulk of the work is done by the clients. Any processing step, including indexing, querying, serializing and encrypting the data, is carried out by the client. The server merely stores encrypted pieces of data.
Summary
Condensation | Database (typical) | |
---|---|---|
Server-client protocol | get object, put object, list, add, remove | query language (e.g. SQL) or CRUD functions |
Mutability | immutable | mutable |
Main data structure | object tree | table (collection) with tuples (rows, records, documents), relational |
Data serialization | managed by the client | managed by the server |
Indices | managed by the client | managed by the server |
Distributed | yes, by design | no |
Transactions | inherent, through atomic box additions; executed by the client | good support, but complex; executed by the server |
Replication (one-way) | inherent, efficient | easy; limited efficiency |
Synchronization (two-way) | fairly easy | difficult |
Versioning | inherent, easy | difficult |