arc-dataquality-udf-plugin


License: MIT
Categories: Data
GroupId: ai.tripl
ArtifactId: arc-dataquality-udf-plugin_2.11
Last Version: 1.3.0
Type: jar
Description: arc-dataquality-udf-plugin
Project URL: https://arc.tripl.ai
Project Organization: ai.tripl
Source Code Management: https://github.com/tripl-ai/arc-dataquality-udf-plugin

How to add to project

Maven:

<!-- https://jarcasting.com/artifacts/ai.tripl/arc-dataquality-udf-plugin_2.11/ -->
<dependency>
    <groupId>ai.tripl</groupId>
    <artifactId>arc-dataquality-udf-plugin_2.11</artifactId>
    <version>1.3.0</version>
</dependency>

Gradle (Groovy DSL):

// https://jarcasting.com/artifacts/ai.tripl/arc-dataquality-udf-plugin_2.11/
implementation 'ai.tripl:arc-dataquality-udf-plugin_2.11:1.3.0'

Gradle (Kotlin DSL):

// https://jarcasting.com/artifacts/ai.tripl/arc-dataquality-udf-plugin_2.11/
implementation ("ai.tripl:arc-dataquality-udf-plugin_2.11:1.3.0")

Buildr:

'ai.tripl:arc-dataquality-udf-plugin_2.11:jar:1.3.0'

Ivy:

<dependency org="ai.tripl" name="arc-dataquality-udf-plugin_2.11" rev="1.3.0">
  <artifact name="arc-dataquality-udf-plugin_2.11" type="jar" />
</dependency>

Grape:

@Grapes(
@Grab(group='ai.tripl', module='arc-dataquality-udf-plugin_2.11', version='1.3.0')
)

SBT:

libraryDependencies += "ai.tripl" % "arc-dataquality-udf-plugin_2.11" % "1.3.0"

Leiningen:

[ai.tripl/arc-dataquality-udf-plugin_2.11 "1.3.0"]

Dependencies

compile (2)

Group / Artifact                                Type  Version
org.scala-lang : scala-library                  jar   2.11.12
com.googlecode.libphonenumber : libphonenumber  jar   8.12.1

provided (2)

Group / Artifact                                Type  Version
ai.tripl : arc_2.11                             jar   2.10.0
org.apache.spark : spark-sql_2.11               jar   2.4.5

test (1)

Group / Artifact                                Type  Version
org.scalatest : scalatest_2.11                  jar   3.0.7

Project Modules

There are no modules declared in this project.

arc-dataquality-udf-plugin defines a set of data quality and validation user-defined functions.

The intention of this plugin is to provide a standard suite of data quality functions that can be used by many people and companies. Feel free to raise pull requests against this repository with reusable functions that may be beneficial to others.

User Defined Functions

  • is_valid_phonenumber(numberToParse, defaultRegion) returns a boolean indicating whether the phone number appears valid for that region, e.g. is_valid_phonenumber('61499000000', 'AU').
  • format_phonenumber(numberToParse, defaultRegion) returns the phone number as an E.164-formatted string, e.g. format_phonenumber('61499000000', 'AU') returns +61499000000.
  • is_valid_abn(abn) returns a boolean indicating whether the Australian Business Number (ABN) passes the inbuilt checksum function.
  • is_valid_acn(acn) returns a boolean indicating whether the Australian Company Number (ACN) passes the inbuilt checksum function.
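
To make the ABN and ACN checks concrete, here is a minimal sketch of the publicly documented check-digit rules for both numbers. It is written in Python for illustration only and is not the plugin's source; the function names abn_checksum_ok and acn_checksum_ok are hypothetical, and the whitespace handling is an assumption rather than documented plugin behaviour.

```python
# Illustrative only: hypothetical helper names, not the plugin's implementation.

ABN_WEIGHTS = (10, 1, 3, 5, 7, 9, 11, 13, 15, 17, 19)
ACN_WEIGHTS = (8, 7, 6, 5, 4, 3, 2, 1)


def _digits(value, length):
    """Return the digits of value as ints, or None if it is not exactly
    `length` ASCII digits once whitespace is removed (assumed handling)."""
    compact = "".join(value.split())
    if len(compact) != length or not (compact.isascii() and compact.isdigit()):
        return None
    return [int(c) for c in compact]


def abn_checksum_ok(abn):
    """ABN rule: subtract 1 from the first digit, weight all 11 digits,
    and require the weighted sum to be divisible by 89."""
    digits = _digits(abn, 11)
    if digits is None:
        return False
    digits[0] -= 1
    total = sum(d * w for d, w in zip(digits, ABN_WEIGHTS))
    return total % 89 == 0


def acn_checksum_ok(acn):
    """ACN rule: weight the first 8 digits, take the sum modulo 10, and
    compare its complement (10 maps to 0) with the 9th digit."""
    digits = _digits(acn, 9)
    if digits is None:
        return False
    total = sum(d * w for d, w in zip(digits[:8], ACN_WEIGHTS))
    return (10 - total % 10) % 10 == digits[8]
```

With these rules, the sample numbers 51 824 753 556 (ABN) and 004 085 616 (ACN) pass, while changing the final digit of either makes the check fail. A checksum pass only shows the number is well formed; it does not confirm that the number has been issued to a registered entity.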

License

Arc is released under the MIT License.

is_valid_phonenumber and format_phonenumber use libphonenumber (https://github.com/google/libphonenumber), which is released under the Apache 2.0 license.

Versions

Version
1.3.0
1.2.0
1.1.0
1.0.0