da-streamingledger :: da-streamingledger-sdk

Serializable ACID transactions on streaming data

License

License

Categories

Categories

Data
GroupId

GroupId

com.data-artisans.streamingledger
ArtifactId

ArtifactId

da-streamingledger-sdk
Last Version

Last Version

1.0.0
Release Date

Release Date

Type

Type

jar
Description

Description

da-streamingledger :: da-streamingledger-sdk
Serializable ACID transactions on streaming data

Download da-streamingledger-sdk

How to add to project

<!-- https://jarcasting.com/artifacts/com.data-artisans.streamingledger/da-streamingledger-sdk/ -->
<dependency>
    <groupId>com.data-artisans.streamingledger</groupId>
    <artifactId>da-streamingledger-sdk</artifactId>
    <version>1.0.0</version>
</dependency>
// https://jarcasting.com/artifacts/com.data-artisans.streamingledger/da-streamingledger-sdk/
implementation 'com.data-artisans.streamingledger:da-streamingledger-sdk:1.0.0'
// https://jarcasting.com/artifacts/com.data-artisans.streamingledger/da-streamingledger-sdk/
implementation ("com.data-artisans.streamingledger:da-streamingledger-sdk:1.0.0")
'com.data-artisans.streamingledger:da-streamingledger-sdk:jar:1.0.0'
<dependency org="com.data-artisans.streamingledger" name="da-streamingledger-sdk" rev="1.0.0">
  <artifact name="da-streamingledger-sdk" type="jar" />
</dependency>
@Grapes(
@Grab(group='com.data-artisans.streamingledger', module='da-streamingledger-sdk', version='1.0.0')
)
libraryDependencies += "com.data-artisans.streamingledger" % "da-streamingledger-sdk" % "1.0.0"
[com.data-artisans.streamingledger/da-streamingledger-sdk "1.0.0"]

Dependencies

compile (2)

Group / Artifact Type Version
org.slf4j : slf4j-api jar 1.7.7
com.google.code.findbugs : jsr305 jar 1.3.9

provided (1)

Group / Artifact Type Version
org.apache.flink : flink-streaming-java_2.11 jar 1.6.0

test (5)

Group / Artifact Type Version
org.apache.flink : flink-test-utils-junit jar 1.6.0
org.apache.flink : flink-core test-jar 1.6.0
junit : junit jar 4.12
org.slf4j : slf4j-log4j12 jar 1.7.7
log4j : log4j jar 1.2.17

Project Modules

There are no modules declared in this project.

data Artisans Streaming Ledger

Serializable ACID transactions on streaming data

data Artisans Streaming Ledger is a library on top of Apache Flink, for processing event streams across multiple shared states/tables with Serializable ACID Semantics.

Instead of operating on a single key in a single operator at a time (like in vanilla Apache Flink and other stream processors) data Artisans Streaming Ledger allows you to define a set of states, connect streams of events that drive the transactions, and apply flexible business logic that operates transactionally across those states.

This repository contains the following maven modules:

  • da-streamingledger-sdk - The SDK needed to define a streaming ledger application.
  • da-streamingledger-runtime-serial - A simplistic serial runner, to experiment with the SDK.
  • da-streamingledger-examples - Streaming ledger example programs.

A parallel runner exists as part of the dA platform, you can learn more about the dA platform here: dA Platform

Example

Let's create a simple ledger of user accounts. An account in the ledger is identified by a String key, and has a Long value (its balance).

We start by defining the streaming ledger scope. All state definitions and transaction functions would be bound to this named scope "Account Ledger".

    StreamingLedger ledger = StreamingLedger.create("Account Ledger");

Next, we define the accounts state.

    StreamingLedger.State<String, Long> accounts = ledger.declareState("accounts")
      .withKeyType(String.class)
      .withValueType(Long.class);

Next, let's assume we have a DataStream of TransactionEvents, with the following schema:

final class TransactionEvent {

    private final String sourceAccountId;

    private final String targetAccountId;

    private final long accountTransfer;

    ...

    public String getSourceAccountId() {
        return sourceAccountId;
    }

    public String getTargetAccountId() {
        return targetAccountId;
    }

    public long getAccountTransfer() {
        return accountTransfer;
    }

    ...
}

And we would like to transfer money from the source account to the target account, in response to an incoming TransactionEvent.

    DataStream<TransactionEvent> transactions = ...

    ledger.usingStream(transactions, "transaction stream")
    	.apply(new TxnHandler())
        .on(accounts, TransactionEvent::getSourceAccountId, "source-account", READ_WRITE)
        .on(accounts, TransactionEvent::getTargetAccountId, "target-account", READ_WRITE);

Where TxnHandler is a TransactionProcessFunction defined as:

class TxnHandler extends TransactionProcessFunction<TransactionEvent, Void> {

        @ProcessTransaction
        public void process(
                TransactionEvent txn,
                Context<Void> transactionContext,
                @State("source-account") StateAccess<Long> sourceAccount,
                @State("target-account") StateAccess<Long> targetAccount) {

            final long sourceAccountBalance = sourceAccount.read();
            final long targetAccountBalance = targetAccount.read();

            // check the preconditions
            if (sourceAccountBalance > txn.getAccountTransfer()) {

                // compute the new balances
                long newSourceBalance = sourceAccountBalance - txn.getAccountTransfer();
                long newTargetBalance = targetAccountBalance + txn.getAccountTransfer();

                // write back the updated values
                sourceAccount.write(newSourceBalance);
                targetAccount.write(newTargetBalance);
            }
        }
    }

Note that TxnHandler will be executed with the following guaranties:

  • Atomicity: The transaction applies all changes in an atomic manner. Either all of the modifications that the transaction function performs on the rows happen, or none.

  • Consistency: The transaction brings the tables from one consistent state into another consistent state.

  • Isolation: Each transaction executes as if it were the only transaction operating on the tables. Databases know different isolation levels with different guarantees. data Artisans Streaming Ledger here offers the best class: serializability.

  • Durability: The changes made by a transaction are durable and are not lost. Durability is ensured in the same way as in other Flink applications – through persistent sources and checkpoints. In the asynchronous nature of stream processing, durability of a result can only be assumed after a checkpoint.

A more complete example can be found here

Building from source

prerequisites:

  • git
  • Maven
  • At least Java 8
git clone https://github.com/dataArtisans/da-streamingledger.git
cd da-streamingledger
mvn clean install

data Artisans Streaming Ledger is now available at your local .m2 repository.

Obtaining from Maven Central

Just add the following dependency to start experimenting with the SDK

<dependency>
  <groupId>com.data-artisans.streamingledger</groupId>
  <artifactId>da-streamingledger-sdk</artifactId>
  <version>1.0.0</version>
</dependency>
<dependency>
  <groupId>com.data-artisans.streamingledger</groupId>
  <artifactId>da-streamingledger-runtime-serial</artifactId>
  <version>1.0.0</version>
</dependency>

License

The code in this repository is under the Apache license, see license

com.data-artisans.streamingledger

Versions

Version
1.0.0