Indic Dictionary tools

A collection of Scala classes for basic dictionary and wiki manipulation, contributed by the open source Sanskrit NLP project and friends. Some notable facilities:

  • Deal with babylon dictionaries.
  • Use bots to write to wiki projects (wiktionary, wikisource etc.).
  • Iterate through the output of OCR programs such as SanskritOCR.

Contributions and suggestions are invited at https://github.com/sanskrit-coders/stardict-sanskrit. (Sister projects there may also be of interest.)

License: Apache License 2.0
GroupId: com.github.sanskrit-coders
ArtifactId: dict-tools
Last Version: 1.0
Release Date:
Type: jar
Project URL: https://github.com/sanskrit-coders/stardict-sanskrit
Source Code Management: https://github.com/sanskrit-coders/stardict-sanskrit

Download dict-tools

How to add to project

Maven:

<!-- https://jarcasting.com/artifacts/com.github.sanskrit-coders/dict-tools/ -->
<dependency>
    <groupId>com.github.sanskrit-coders</groupId>
    <artifactId>dict-tools</artifactId>
    <version>1.0</version>
</dependency>

Gradle (Groovy DSL):

// https://jarcasting.com/artifacts/com.github.sanskrit-coders/dict-tools/
implementation 'com.github.sanskrit-coders:dict-tools:1.0'

Gradle (Kotlin DSL):

// https://jarcasting.com/artifacts/com.github.sanskrit-coders/dict-tools/
implementation ("com.github.sanskrit-coders:dict-tools:1.0")

Buildr:

'com.github.sanskrit-coders:dict-tools:jar:1.0'

Ivy:

<dependency org="com.github.sanskrit-coders" name="dict-tools" rev="1.0">
  <artifact name="dict-tools" type="jar" />
</dependency>

Grape:

@Grapes(
@Grab(group='com.github.sanskrit-coders', module='dict-tools', version='1.0')
)

sbt:

libraryDependencies += "com.github.sanskrit-coders" % "dict-tools" % "1.0"

Leiningen:

[com.github.sanskrit-coders/dict-tools "1.0"]

Dependencies

compile (6)

Group / Artifact                                     Type  Version
org.scala-lang : scala-library                       jar   2.11.8
com.github.sanskrit-coders : indic-transliteration   jar   1.1
ch.qos.logback : logback-classic                     jar   0.9.29
ch.qos.logback : logback-core                        jar   0.9.29
net.sourceforge : jwbf                               jar   3.0.0
jstl : jstl                                          jar   1.2

Project Modules

There are no modules declared in this project.

Dictionary user instructions

Grab the latest files to use with your stardict-compatible program.

Options:

Dictionary program recommendations and download help (esp for mobile devices)

Organization

  • Dictionaries are stored in multiple repositories under indic-dict, for example stardict-sanskrit-vyAkaraNa, stardict-kannada etc.
  • Some external pre-built dictionaries are also hosted on archive.org (as in the case of stardict-english).

Builds

  • Dictionaries are built with either GitHub Actions or Travis CI and deployed to the gh-pages branch of the repository. For example:
  • Within the output branch, for each dictionary collection, the following folders are produced:
    • tars with compressed (xyz.tar.gz) dictionary files for use with stardict clients.
    • slob files for use with aard2 dictionary clients.
    • per-headword text files for use with HTTP calls.
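
As a rough illustration of using the tarballs listed above with a stardict client, the sketch below unpacks one downloaded archive into a dictionary folder. The function name `install_dict_tarball` and the default destination `~/.stardict/dic` are assumptions made for this example; check where your own client expects its dictionaries.

```shell
#!/bin/sh
# Unpack one stardict tarball (xyz.tar.gz) into a dictionary folder.
# Usage: install_dict_tarball ARCHIVE [DEST_DIR]
# DEST_DIR defaults to ~/.stardict/dic, which is an assumption; adjust for your client.
install_dict_tarball() {
    archive="$1"
    dest="${2:-$HOME/.stardict/dic}"
    if [ ! -f "$archive" ]; then
        echo "no such archive: $archive" >&2
        return 1
    fi
    mkdir -p "$dest" || return 1
    # -x extract, -z gunzip, -f archive file, -C target directory
    tar -xzf "$archive" -C "$dest"
}
```

For instance, `install_dict_tarball xyz.tar.gz` would place the archive's contents under the default folder, and a second argument overrides that folder.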

Development

Cologne to babylon converter

https://github.com/sanskrit-lexicon/cologne-stardict - make_babylon.py is the workhorse.

Latest stardict binaries

Sometimes the latest stardict binaries may be required.

  • Extract the latest stardict tools package (https://sourceforge.net/projects/stardict-4/files/3.0.1/stardict-tools-3.0.1.tar.bz2/download) into the ~/stardict/tools directory. Then cd into ~/stardict/tools and build it: run ./configure and make as described in the INSTALL file in that directory, but don't remove the compiled binaries from the src directory.
    • A shortcut if you are running Linux on a 64 bit computer: Just git clone --depth 1 https://github.com/sanskrit-coders/stardict in your home directory.
  • Install stardict-tools-git
    • Replaces outdated step: git clone git@github.com:sanskrit-coders/stardict.git. Then build it with ./configure, make etc.
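
The manual route in the first bullet can be sketched as a shell function. The function name `build_stardict_tools`, the use of `curl`, the `TOOLS_DIR` variable, and the `--strip-components=1` flag (which assumes the archive contains a single top-level directory) are assumptions made for illustration and are not part of the original instructions.

```shell
#!/bin/sh
# Sketch: fetch, unpack and build the stardict tools in ~/stardict/tools.
TOOLS_URL="https://sourceforge.net/projects/stardict-4/files/3.0.1/stardict-tools-3.0.1.tar.bz2/download"
TOOLS_DIR="${TOOLS_DIR:-$HOME/stardict/tools}"

build_stardict_tools() {
    mkdir -p "$TOOLS_DIR" || return 1
    archive="$TOOLS_DIR/stardict-tools-3.0.1.tar.bz2"
    # -L follows the download redirect; -f makes curl fail on HTTP errors.
    curl -fL -o "$archive" "$TOOLS_URL" || return 1
    # Assumption: the archive has one top-level directory, which is stripped here.
    tar -xjf "$archive" -C "$TOOLS_DIR" --strip-components=1 || return 1
    # Build in place. No cleanup step is run afterwards, so the compiled
    # binaries stay in the src directory.
    (cd "$TOOLS_DIR" && ./configure && make)
}
```

Nothing runs until `build_stardict_tools` is called, so the file can be sourced and inspected first. Setting `TOOLS_DIR` before the call builds in a different location.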

Other notes

Recipe to convert decompiled en-head dictionaries from ajita to sa-head dictionaries: ^.+\t(.+?). should be replaced by \1\t

Dictionary popularity metrics

  • For properly released dictionaries, one can get stats as shown here.
com.github.sanskrit-coders

Sanskrit Open Source Programmers
