Debian Science Project
Summary
Linguistics
데비안 과학 언어학 패키지

이 메타패키지는 데비안 퓨어 블렌드 "데비안 과학"의 일부이며 언어학과 관련된 패키지를 설치합니다.

Description

For a better overview of the project's availability as a Debian package, each head row has a color code according to this scheme:

If you discover a project which looks like a good candidate for Debian Science to you, or if you have prepared an unofficial Debian package, please do not hesitate to send a description of that project to the Debian Science mailing list

Links to other tasks

Debian Science Linguistics packages

Official Debian packages with high relevance

apertium
Shallow-transfer machine 변역 엔진
Versions of package apertium
ReleaseVersionArchitectures
jessie3.1.0-2amd64,armel,armhf,i386
buster3.5.2-1amd64,arm64,armhf,i386
bullseye3.7.1-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bookworm3.8.3-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
trixie3.9.4-1amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
sid3.9.4-1amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
stretch3.4.0~r61013-5amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
Debtags of package apertium:
fieldlinguistics
roleprogram
Popcon: 7 users (22 upd.)*
Versions and Archs
License: DFSG free
Git

오픈 소스 shallow-transfer machine 번역 엔진인, Apertium은 초기에 동족 언어 쌍을 목표로했습니다.

음성 처리를 위한 유한 상태 변환기, 품사 태깅을 위한 숨겨진 마르코프 모델, 구조 전달을 위한 상태 기반 청킹을 사용합니다.

시스템은 interNOSTRUM (스페인-카탈로니아어, http://www.internostrum.com/welcome.php) 및 Traductor 대학 (스페인-포루투갈 어, http://traductor.universia.net)과 같은 d'Alacant 대학 Transducens 그룹 에서 이미 개발한 시스템을 기반으로 합니다.

올바를 형식으로 필요한 언어 데이타를 제공하기만 한다면 다양한 동족 언어 쌍 을 위한 기계 번역 시스템 구축을 위해 Apertiumd을 사용하는 것이 가능할 겁니다.

Screenshots of package apertium
apertium-eval-translator
Evaluate machine translation output against reference
Versions of package apertium-eval-translator
ReleaseVersionArchitectures
sid1.2.1-3all
bullseye1.2.1-2all
trixie1.2.1-3all
bookworm1.2.1-3all
Popcon: 2 users (1 upd.)*
Versions and Archs
License: DFSG free
Git

This package contails Perl scripts to evaluate Apertium-based machine translation output against reference: WER, PER, TER, BLEU.

apertium-lex-tools
Constraint-based lexical selection module
Versions of package apertium-lex-tools
ReleaseVersionArchitectures
sid0.4.2-2amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
buster0.2.1-1amd64,arm64,armhf,i386
bullseye0.2.7-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
stretch0.1.1~r66150-1amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
bookworm0.4.2-2amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
trixie0.4.2-2amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
Popcon: 6 users (16 upd.)*
Versions and Archs
License: DFSG free
Git

Module for compiling lexical selection rules and processing them in the pipeline.

artha
Handy off-line thesaurus based on WordNet
Versions of package artha
ReleaseVersionArchitectures
stretch1.0.3-1amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
jessie1.0.3-1amd64,armel,armhf,i386
buster1.0.3-3amd64,arm64,armhf,i386
bullseye1.0.5-2amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bookworm1.0.5-3amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
trixie1.0.5-3amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
sid1.0.5-3amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
Debtags of package artha:
fieldlinguistics
interfacex11
roleprogram
uitoolkitgtk
uselearning
x11application
Popcon: 29 users (15 upd.)*
Versions and Archs
License: DFSG free
Git

Artha is a off-line English thesaurus with distinct features like:

  • hot-key press word look-up (select text on any window and press a preset hot-key for look-up)
  • regular expressions based search (broaden search using wild-cards like *, ?, etc.)
  • passive desktop notifications (of word definitions for uninterrupted work-flow)
  • spelling suggestions (when the exact spelling is vague/not known)

Once launched, it monitors for a preset hot-key combination. When some text is selected on any window and the hot-key is pressed, it pops-up with the word looked-up. Should the user prefer passive notifications, this can be done by enabling the notifications option.

When the term looked for is vague/not known, then either the search can be broadened with the use of regular expressions (*, ?, etc.) in the search string or spelling suggestions when a term is incorrect.

For regular expressions based search to work, wordnet-sense-index package is required.

Screenshots of package artha
cg3
Tools for using the 3rd edition of Constraint Grammar (CG-3)
Versions of package cg3
ReleaseVersionArchitectures
buster1.1.7-1amd64,arm64,armhf,i386
sid1.4.6-1amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
stretch0.9.9~r11624-1amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
trixie1.4.6-1amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
bullseye1.3.2-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bookworm1.3.9-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
Popcon: 8 users (13 upd.)*
Versions and Archs
License: DFSG free
Git

Constraint Grammar compiler and applicator for the 3rd edition of CG that is developed and maintained by VISL SDU and GrammarSoft ApS.

CG-3 can be used for disambiguation of morphology, syntax, semantics, etc; dependency markup, target language lemma choice for MT, QA systems, and much more. The core idea is that you choose what to do based on the whole available context, as opposed to n-grams.

See https://visl.sdu.dk/cg3.html for more documentation

collatinus
lemmatisation of latin text
Maintainer: Georges Khaznadar
Versions of package collatinus
ReleaseVersionArchitectures
stretch-backports11-1~bpo9+1amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
sid12.2-2amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
jessie10.2-2amd64,armel,armhf,i386
buster11-1amd64,arm64,armhf,i386
bookworm12.1-2amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
stretch10.2-2amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
trixie12.2-2amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
bullseye11-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
Debtags of package collatinus:
fieldlinguistics
interfacex11
roledummy, program
scopeapplication
uitoolkitgtk
uselearning
x11application
Popcon: 1 users (1 upd.)*
Versions and Archs
License: DFSG free
Git

Collatinus can be used to lemmatise latin texts, i.e. extract words and make a lexicon which indicates for each word its canonic form, and how the form actually found in the text was derived from it, for instance by declining it. Example : rosam gives : rosa-rosae -- acc. sing. Collatinus provides a nice graphic front-end to each operation.

Collatinus-nouus (stands for Collatinus, new generation) replaces every previous version of Collatinus.

This package provides a documentation in HTML format.

Screenshots of package collatinus
dimbl
분산형 메모리 기반 학습자
Versions of package dimbl
ReleaseVersionArchitectures
trixie0.17-2amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
stretch0.15-1amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
bookworm0.15-2.1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bullseye0.15-2.1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
buster0.15-2.1amd64,arm64,armhf,i386
sid0.17-2amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
jessie0.12-2amd64,armel,armhf,i386
Debtags of package dimbl:
roleprogram
Popcon: 1 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Dimbl은 TiMBL에서 k-nearest 이웃 분류자를 둘러싼 레퍼로, 멀티 CPU 머신에서 병렬 분류를 제공합니다. Dimbl은 원래 학습 집합을 분할하고, 학습 하위 집합마 다 별도의 TiMBL 분류자를 만들고, 그리고 분류된 인스턴스별로 가장 가까운 이 웃 집합을 병합합니다.

Dimbl의 기능은 다음과 같습니다:

  • 모든 명령행 옵션을 유지하면서 TiMBL 주위를 깔끔하게 정리;
  • 멀티, 듀오, 또는 쿼드 코어로 무엇을 할지 알고 있음;
  • 병렬 프로그램을 위한 OpenMP 사양을 사용함;
  • 표준 TiMBL과 비교해서 초고속의 속도를 얻을 수 있음.

Dimbl은 ILK 연구소 그룹 (네덜란드에 Tilburg 대학교)의 제품입니다.

메모리 기반 학습 기술을 사용해서 자연어 처리에 대한 과학적 연구를 한다면 Dimbl은 유용할 것입니다.

fasttext
Efficient learning of word representations and sentence classification library
Versions of package fasttext
ReleaseVersionArchitectures
bullseye0.9.2-3amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
trixie0.9.2+ds-7amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
sid0.9.2+ds-7amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
bookworm0.9.2+ds-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
Popcon: 6 users (9 upd.)*
Versions and Archs
License: DFSG free
Git

fastText is a library for efficient learning of word representations and sentence classification, which refers subword information to enrich word vectors.

frog
tagger and parser for natural languages (runtime)
Versions of package frog
ReleaseVersionArchitectures
trixie0.32-2amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
sid0.32-2amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
buster0.15-1amd64,arm64,armhf,i386
stretch0.13.7-1amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
jessie0.12.17-7.1amd64,armel,armhf,i386
bullseye0.20-2amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bookworm0.20-2amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
upstream0.33
Popcon: 2 users (4 upd.)*
Newer upstream!
License: DFSG free
Git

Memory-Based Learning (MBL) is a machine-learning method applicable to a wide range of tasks in Natural Language Processing (NLP).

Frog is a modular system integrating a morphosyntactic tagger, lemmatizer, morphological analyzer, and dependency parser for natural languages. It is based upon it's predecessor TADPOLE (TAgger, Dependency Parser, and mOrphoLogical analyzEr). Using Memory-Based Learning techniques, frog tokenizes, tags, lemmatizes, and morphologically segments word tokens in incoming UTF-8 text files, and assigns a dependency graph to each sentence. Frog is particularly targeted at the increasing need for fast, automatic NLP systems applicable to very large (multi-million to billion word) document collections that are becoming available due to the progressive digitization of both new and old textual data. Up to now, frog has only been tested and used using corpora of Dutch natural language (see the frogdata package for samples).

Frog is a product of the Centre of Language and Speech Technology at Radboud University Nijmegen, it subsumes previous work by the ILK Research Group (Tilburg University, The Netherlands) and the CLiPS Research Centre (University of Antwerp, Belgium). It is currently maintained at the KNAW Humanities Cluster.

If you do scientific research in NLP, Frog will likely be of use to you.

giella-sme
Giellatekno single language data for North Saami
Versions of package giella-sme
ReleaseVersionArchitectures
buster0.0.20150917~r121176-3all
stretch0.0.20150917~r121176-1all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Giellatekno language resources for North Saami.

hfst
Helsinki Finite-State Transducer Technology
Versions of package hfst
ReleaseVersionArchitectures
buster3.15.0-1.1~deb10u1amd64,arm64,armhf,i386
bullseye3.15.1-2amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
stretch3.10.0~r2798-3amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
sid3.16.0-5amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
bookworm3.16.0-5amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
Popcon: 6 users (7 upd.)*
Versions and Archs
License: DFSG free
Git

The Helsinki Finite-State Transducer software is intended for the implementation of morphological analysers and other tools which are based on weighted and unweighted finite-state transducer technology.

hfst-ospell
Spell checker library and tool based on HFST
Versions of package hfst-ospell
ReleaseVersionArchitectures
stretch0.4.0~r4643-4amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
bullseye0.5.2-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bookworm0.5.3-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
buster0.5.0-2amd64,arm64,armhf,i386
sid0.5.4-1amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
trixie0.5.4-1amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
Popcon: 0 users (4 upd.)*
Versions and Archs
License: DFSG free
Git

Minimal HFST optimized lookup format based spell checker library and a demonstrational implementation of command line based spell checker.

irstlm
IRST 언어 모델링 툴킷
Versions of package irstlm
ReleaseVersionArchitectures
stretch6.00.05-2amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
sid6.00.05-4.1amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
trixie6.00.05-4.1amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
bookworm6.00.05-3amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bullseye6.00.05-2amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
buster6.00.05-2amd64,arm64,armhf,i386
Popcon: 1 users (3 upd.)*
Versions and Archs
License: DFSG free
Git

IRST 언어 모델링 툴킷은 데이타에서 언어 모델을 학습하는데 사용될 수 있습니 다. 생성된 n-gram 모델들은 ARPA 언어 모델 형식을 지원하는 모든 시스템에서 사용할 있습니다.

이 패키지는 명령행 도구를 제공합니다.

libcld2-dev
Compact Language Detector 2, development package
Versions of package libcld2-dev
ReleaseVersionArchitectures
trixie0.0.0-git20150806-9amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64
bookworm0.0.0-git20150806-9amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el
buster0.0.0-git20150806-6amd64,arm64,armhf,i386
stretch0.0.0-git20150806-5amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el
sid0.0.0-git20150806-9amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64
bullseye0.0.0-git20150806-9amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el
Popcon: 1 users (9 upd.)*
Versions and Archs
License: DFSG free
Git

Detects over 80 languages in UTF-8 text, based largely on groups of four letters. Also tables for 160+ language version.

This is the development package.

link-grammar
Carnegie Mellon University의 링크 문법 파서
Maintainer: Jonas Smedegaard
Versions of package link-grammar
ReleaseVersionArchitectures
jessie4.7.4-2amd64,armel,armhf,i386
stretch5.3.14-1amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
buster5.5.1-6amd64,arm64,armhf,i386
bullseye5.8.1-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bookworm5.12.0~dfsg-2amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
trixie5.12.5~dfsg-1amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
sid5.12.5~dfsg-1amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
Debtags of package link-grammar:
fieldlinguistics
interfacecommandline
roleprogram
usechecking
works-withdictionary
Popcon: 6 users (7 upd.)*
Versions and Archs
License: DFSG free
Git

Sleator, D. and Temperley, D. "Parsing English with a Link Grammar" (1991),에서 저자는 "링크 문법"이라고 하는 새로운 형식에 문법 시스템을 정의했습니다. 각 단어의 지역적 요구 사항이 충족되고, 링크가 교차하지 않으며, 단어들이 연결된 그래프를 형성하는 방식으로 단어 사이에 "링크"를 그리는 방법이 있다면 단어 시퀀스는 링크 문법의 언어로 되어 있습니다. 저자는 영어 문법을 이러한 시스템으로 인코딩하고 이 문법을 사용해서 영어를 파싱하기 위해 이 프로그램을 개발했습니다.

link 문법은 자연어 문서에서 정보 검색 또는 추출을 위한 언어 파싱에 사용할 수 있습니다. 또한 문법 검사기로도 사용할 수 있습니다.

이 패키지는 사용자 실행가능한 바이너리를 포함합니다.

Screenshots of package link-grammar
lttoolbox
Apertium 어휘 처리 모듈 및 도구
Versions of package lttoolbox
ReleaseVersionArchitectures
trixie3.7.6-1amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
sid3.7.6-1amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
jessie3.1.0-1.2amd64,armel,armhf,i386
bullseye3.5.3-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bookworm3.7.1-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
buster3.5.0-3amd64,arm64,armhf,i386
stretch3.3.3~r68466-2amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
Debtags of package lttoolbox:
fieldlinguistics
roleprogram
Popcon: 7 users (22 upd.)*
Versions and Archs
License: DFSG free
Git

lttoolbox는 규칙 기반 및 하이브리드 장비 변환 시스템을 구축하기 위한 플랫폼 인, Apertium에서 사용되는 자연어 처리를 위한 증강된 문자 변환 도구를 포함합 니다. 소프트웨어는 자연어 처리 어플리케이션용 형태소 분석기 및 생성기를 만 드는데 유용합니다.

mbt
메모리 기반 tagger-generator 및 tagger
Versions of package mbt
ReleaseVersionArchitectures
buster3.4-1amd64,arm64,armhf,i386
stretch3.2.16-1amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
jessie3.2.10-4amd64,armel,armhf,i386
sid3.10-3amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
trixie3.10-3amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
bookworm3.6-3amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bullseye3.6-3amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
Debtags of package mbt:
fieldlinguistics
roleprogram
Popcon: 1 users (2 upd.)*
Versions and Archs
License: DFSG free
Git

MBT는 하나의 메모리 기반 tagger-generator 및 tagger 입니다. tagger-generator 파트는 태깅된 시퀀스의 트레이닝 세트를 기반으로 시퀀스 tagger를 생성할 수 있습니다; tagger 파트는 새로운 시퀀스에 태그를 지정할 수 있습니다. 예를 들어, MBT는 자연어 처리를 위해 품사 tagger 또는 chunker를 생성하는데 사용할 수 있습니다. 특징들:

  • Tagger 생성: 태그가 있는 텍스트 입력, tagger 제거,
  • 선택적 피드백 루프: 이전 태그 결정을 다음 결정 입력으로 다시 피드백,
  • 쉽게 사용자 정의가 가능한 기능 표현; 사용자 제공 기능 통합,
  • 알려진 단어와 알려지지 않은 단어에 대한 별도의 하위 태그 자동 생성,
  • TiMBL 전체 알고리즘 매개 변수를 사용할 수 있습니다.

MBT는 언어 및 음성 기술 센터 (네덜란드 Radboud 대학 Nijmegen), ILK 연구 그룹 (네덜란드 Tilburg 대학), CLiPS 연구센터 (벨기에 Antwerp 대학)의 제품입니다.

자연어 처리에 대한 과학적 연구를 수행한다면, MBT는 매우 유용할 것 입니다.

mbtserver
Server extensions for the MBT tagger
Versions of package mbtserver
ReleaseVersionArchitectures
sid0.16-2amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
bullseye0.14-2amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
jessie0.7-3amd64,armel,armhf,i386
buster0.12-1amd64,arm64,armhf,i386
stretch0.11-1amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
bookworm0.14-2amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
trixie0.16-2amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

MbtServer extends Mbt with a server layer, running as a TCP server. Mbt is a memory-based tagger-generator and tagger for natural language processing. MbtServer provides the possibility to access a trained tagger from multiple sessions. It also allows one to run and access different taggers in parallel.

MbtServer is a product of the Centre for Language and Speech Technology (Radboud University, Nijmegen, The Netherlands), the ILK Research Group (Tilburg University, The Netherlands) and the CLiPS Research Centre (University of Antwerp, Belgium).

If you do scientific research in natural language processing, MbtServer will likely be of use to you.

opennlp
wrapper for Apache OpenNLP natural language text processing toolkit
Versions of package opennlp
ReleaseVersionArchitectures
sid2.4.0-1all
trixie2.4.0-1all
bullseye1.9.3-1all
bookworm2.1.0-1all
Popcon: 1 users (2 upd.)*
Versions and Archs
License: DFSG free
Git

The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text. It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution. These tasks are usually required to build more advanced text processing services. OpenNLP also included maximum entropy and perceptron based machine learning.

This package contains the command line wrapper.

python3-pynlpl
PyNLPl is a library for Natural Language Processing (Python 3 version)
Versions of package python3-pynlpl
ReleaseVersionArchitectures
trixie1.2.9-1all
bullseye1.2.9-1all
buster1.1.2-1all
stretch1.1.2-1all
sid1.2.9-1all
bookworm1.2.9-1all
Popcon: 2 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language models. It also contains complex data types and algorithms. Moreover, it includes parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL) and clients to interface with various NLP specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).

This is the Python 3 version.

python3-thinc
Practical Machine Learning for NLP in Python
Versions of package python3-thinc
ReleaseVersionArchitectures
buster6.12.1-1amd64,arm64,armhf,i386
bookworm8.1.7-1amd64,arm64,armhf,i386,mips64el,s390x
sid9.0.0-2amd64,arm64,armhf,i386,mips64el,riscv64,s390x
Popcon: 0 users (1 upd.)*
Versions and Archs
License: DFSG free
Git

Thinc is the machine learning library powering spaCy https://spacy.io. It features a battle-tested linear model designed for large sparse learning problems, and a flexible neural network model under development for spaCy v2.0 https://spacy.io/usage/v2.

Thinc is a practical toolkit for implementing models that follow the "Embed, encode, attend, predict" architecture. It's designed to be easy to install, efficient for CPU usage and optimised for NLP and deep learning with text – in particular, hierarchically structured input and variable-length sequences.

r-cran-lexrankr
extractive summarization of text with the LexRank algorithm
Versions of package r-cran-lexrankr
ReleaseVersionArchitectures
buster0.5.0-2amd64,arm64,armhf,i386
sid0.5.2-8amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
trixie0.5.2-8amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
bookworm0.5.2-8amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bullseye0.5.2-2amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
Popcon: 3 users (4 upd.)*
Versions and Archs
License: DFSG free
Git

An R implementation of the LexRank algorithm implementing stochastic graph-based method for computing relative importance of textual units for Natural Language Processing. The technique on the problem of Text Summarization (TS) is tested. Extractive TS relies on the concept of sentence salience to identify the most important sentences in a document or set of documents. Salience is typically defined in terms of the presence of particular important words or in terms of similarity to a centroid pseudo-sentence.

Please cite: Güneş Erkan and Dragomir R. Radev: LexRank: Graph-based Lexical Centrality as Salience in Text Summarization. (eprint) Journal of Artific Intelligence Research 22:457-479 (2004)
r-cran-snowballc
Snowball stemmers based on the C libstemmer UTF-8 library
Versions of package r-cran-snowballc
ReleaseVersionArchitectures
trixie0.7.1-1amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
bookworm0.7.0-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
sid0.7.1-1amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
buster0.6.0-1amd64,arm64,armhf,i386
bullseye0.7.0-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
Popcon: 10 users (4 upd.)*
Versions and Archs
License: DFSG free
Git

An R interface to the C libstemmer library that implements Porter's word stemming algorithm for collapsing words to a common root to aid comparison of vocabulary. Currently supported languages are Danish, Dutch, English, Finnish, French, German, Hungarian, Italian, Norwegian, Portuguese, Romanian, Russian, Spanish, Swedish and Turkish.

sentencepiece
Unsupervised text tokenizer and detokenizer
Versions of package sentencepiece
ReleaseVersionArchitectures
bullseye0.1.95-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
sid0.2.0-1amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
bookworm0.1.97-3amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
trixie0.2.0-1amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
Popcon: 2 users (1 upd.)*
Versions and Archs
License: DFSG free
Git

SentencePiece is an unsupervised text tokenizer/detokenizer mainly designed for Neural Network-based text generation systems where the vocabulary size is predetermined prior to the neural model training.

timbl
틸부르프 메모리 기반 학습자
Versions of package timbl
ReleaseVersionArchitectures
jessie6.4.4-4amd64,armel,armhf,i386
trixie6.9-2amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
sid6.9-2amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
bullseye6.5-3amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
buster6.4.13-1amd64,arm64,armhf,i386
stretch6.4.8-1amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
bookworm6.5-3amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
Debtags of package timbl:
roleprogram
Popcon: 1 users (2 upd.)*
Versions and Archs
License: DFSG free
Git

Memory-Based Learning (메모리 기반 학습/MBL)은 자연어 처리 (NLP)의 다양한 작업에 적용할 수 있는 기계 학습 방법입니다.

틸부르프 메모리 기반 학습자, TiMBL,은 NLP 연구를 위한 도구이며, 예제로 분류 작업을 배운 다른 많은 도메인을 위한 도구입니다. k-근접 이웃 분류자의 효과적 구현입니다.

TiMBL의 기능:

  • k-근접 이웃 분류자의 빠른 의사 결정 트리 기반 구현;
  • IB1 및 IB2, IGTree, TRIBL, 및 TRIBL2 알고리즘 구현;
  • 유사성 지표: 중복, MVDM, Jeffrey Divergence, Dot 제품, 코사인;
  • 기능 가중치 지표: 정보 획득, 게인 비율, 카이 제곱, 공유 분산;
  • 거리 가중치 지표: 역, 역 선형, 기하급수적 감쇠;
  • 가장 근접한 이웃 세트를 검사하는 매우 자세한 옵션;
  • 서버 기능 및 광범위한 API;
  • 빠른 leave-one-out 테스팅 및 내부 교차 확인;
  • 그리고 사용자 정의 예제 가중치 처리.

TiMBL은 언어 및 음성 기술 센터 (Radboud University, Nijmegen, The Netherlands), ILK 연구소 그룹 (Tilburg University, The Netherlands) 및 CLiPS 연구소 센터 (University of Antwerp, Belgium)의 제품입니다.

NLP에서 과학 연구를 한다면, timbl은 당신에게 유용할 것 입니다.

timblserver
Timbl용 서버 확장
Versions of package timblserver
ReleaseVersionArchitectures
bookworm1.14-3amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
sid1.18-3amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
trixie1.18-3amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
bullseye1.14-3amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
buster1.12-1amd64,arm64,armhf,i386
stretch1.11-1amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
jessie1.7-4amd64,armel,armhf,i386
Debtags of package timblserver:
roleprogram
Popcon: 1 users (1 upd.)*
Versions and Archs
License: DFSG free
Git

timblserver는 TiMBL 래퍼입니다; TiMBL에 서버 기능을 추가합니다. 이를 통해 TiMBL은 선택적으로 HTTP를 통해 TCP 서버로 여러 실험을 실행할 수 있습니다.

Tilburg Memory Based Learner, TiMBL은 자연어 처리 연구 및 분류 작업이 예제로 부터 학습되는 다른 많은 도메인을 위한 도구입니다.

TimblServer는 ILK 연구 그룹(네덜란드 Tilburg 대학)과 CLiPS 연구 센터 (벨기에 Antwerp 대학)의 제품입니다.

NLP에서 과학적 연구를 한다면, TimblServer는 유용할 것 입니다.

ucto
유니코드 토크나이저
Versions of package ucto
ReleaseVersionArchitectures
buster0.14-2amd64,arm64,armhf,i386
bullseye0.21.1-2amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bookworm0.21.1-2amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
trixie0.30-3amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
sid0.30-3amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
stretch0.9.6-1amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
jessie0.5.3-3.1amd64,armel,armhf,i386
upstream0.34
Debtags of package ucto:
roleprogram
Popcon: 5 users (4 upd.)*
Newer upstream!
License: DFSG free
Git

Ucto는 UTF-8로 인코드된 텍스트 파일을 토큰화 (예, 구두점과 단어 구분, 문장 분할, n-gram 생성) 할 수 있으며, 인덱싱, 품사 태깅, 또는 기계 번역 같은 추 가 처리에 적합한 텍스트를 만들기 위한 몇가지 기본 전처리 단계를 제공합니다.

이 패키지는 명령행 도구 자체를 제공합니다.

Ucto는 Maarten van Gompel 과 Ko van der Sloot이 개발하였습니다. Ucto에서의 작업은 NWO, 네덜란드 과학 연구 기관, 암묵적 언어학 프로젝트, CLARIN-NL 프로 그램, 및 CLARIAH 프로젝트에 의해 지원되었습니다.

Ucto는 언어 및 음성 기술 센터 (Radboud University Nijmegen) 및 이전 ILK 연 구 그룹 (Tilburg University, The Netherlands)의 제품입니다.

자연어 처리에 대한 과학적 연구처럼, UTF-8로 인코드된 파일의 기계 구문 분석 에 관심이 있다면, ucto는 당신에게 유용할 것입니다.

uctodata
Data files for Ucto
Versions of package uctodata
ReleaseVersionArchitectures
bullseye0.8-2all
stretch0.4-1all
sid0.9.1-1all
trixie0.9.1-1all
buster0.8-2all
bookworm0.8-2all
upstream0.11
Popcon: 6 users (3 upd.)*
Newer upstream!
License: DFSG free
Git

Ucto can tokenize UTF-8 encoded text files (i.e. separate words from punctuation, split sentences, generate n-grams), and offers several other basic preprocessing steps that make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation.

This package provides necessary language-specific datafiles for running Ucto.

Ucto was written by Maarten van Gompel and Ko van der Sloot. Work on Ucto was funded by NWO, the Netherlands Organisation for Scientific Research, under the Implicit Linguistics project, the CLARIN-NL program, and the CLARIAH project.

Ucto is a product of the Centre of Language and Speech Technology (Radboud University Nijmegen), and previously the ILK Research Group (Tilburg University, The Netherlands).

wordnet
영어의 전자 어휘 데이타베이스
Versions of package wordnet
ReleaseVersionArchitectures
bookworm3.0-37amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bullseye3.0-36amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
stretch3.0-33amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
sid3.0-38amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
trixie3.0-38amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
jessie3.0-33amd64,armel,armhf,i386
buster3.0-35amd64,arm64,armhf,i386
Debtags of package wordnet:
fieldlinguistics
interfacex11
roleprogram
scopeapplication
uitoolkittk
usechecking
works-withdictionary
x11application
Popcon: 80 users (86 upd.)*
Versions and Archs
License: DFSG free
Git

WordNet(C)는 현재 어휘 기억의 심리 언어학적 이론에 영감을 받아 개발된 온라 인 어휘 참조 시스템입니다. 영어 명사, 동사, 형용사, 부사는 각각 하나의 기본 어휘 개념을 나타내는 동의어 집합으로 구성됩니다. 서로 다른 관계가 동의어 집 합을 연결합니다.

WordNet은 George A. Miller 교수(프로젝트 수석 연구원)의 지도하에 프린스턴 대학에 인지 과학 연구소에서 개발되었습니다.

WordNet은 전산 언어학, 텍스트 분석, 많은 관련 분야에서 연구자가 이용할 수 있는 가장 중요한 자원으로 간주됩니다.

WordNet의 바이너리와 맨페이지, 그리고 일반 맨페이지.

Please cite: George A. Miller: WordNet: A Lexical Database for English. Communications of the ACM 38(11):39-41 (1995)

Official Debian packages with lower relevance

apertium-af-nl
Transitional dummy package for apertium-afr-nld
Versions of package apertium-af-nl
ReleaseVersionArchitectures
bullseye0.3.0-2all
stretch0.2.0~r58256-1all
buster0.2.0~r58256-2all
sid0.3.0-3all
trixie0.3.0-3all
bookworm0.3.0-3all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

This is a transitional dummy package. It can safely be removed.

apertium-apy
Apertium APY service
Versions of package apertium-apy
ReleaseVersionArchitectures
trixie0.11.7-2.2all
sid0.11.7-2.2all
bookworm0.11.7-2.1all
bullseye0.11.7-2all
stretch0.9.1~r343-2all
buster0.11.4-2all
upstream0.12.1
Popcon: 3 users (0 upd.)*
Newer upstream!
License: DFSG free
Git

This package contains Apertium APY which is simple Apertium API written in Python 3 meant as a drop-in replacement for ScaleMT.

apertium-arg
Apertium single language data for Aragonese
Versions of package apertium-arg
ReleaseVersionArchitectures
stretch0.1.2~r65494-1all
buster0.1.2~r65494-2all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for Aragonese

apertium-arg-cat
Apertium translation data for the Aragonese-Catalan pair
Versions of package apertium-arg-cat
ReleaseVersionArchitectures
bookworm0.2.0-3all
buster0.1.0~r64925-2all
trixie0.3.0-2all
sid0.3.0-2all
stretch0.1.0~r64925-1all
bullseye0.2.0-2all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for translating between the Aragonese and Catalan languages.

apertium-bel
Apertium single language data for Belarusian
Versions of package apertium-bel
ReleaseVersionArchitectures
buster0.1.0~r81357-2all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for Belarusian.

apertium-bel-rus
Apertium translation data for the Belarusian-Russian pair
Versions of package apertium-bel-rus
ReleaseVersionArchitectures
bookworm0.2.1-2all
bullseye0.2.1-1all
trixie0.2.1-2all
sid0.2.1-2all
buster0.2.0~r81186-2all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for translating between the Belarusian and Russian languages.

apertium-br-fr
Apertium linguistic data to translate between Breton and French
Versions of package apertium-br-fr
ReleaseVersionArchitectures
stretch0.5.0~r61325-2all
trixie0.5.1-1all
sid0.5.1-1all
buster0.5.0~r61325-3all
bookworm0.5.1-1all
bullseye0.5.1-1all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

This is a linguistic package for the Apertium shallow-transfer machine translation system. The package can be used to translate between Breton and French.

apertium-ca-it
Transitional dummy package for apertium-cat-ita
Versions of package apertium-ca-it
ReleaseVersionArchitectures
bookworm0.2.2-1all
sid1.1.0-1all
trixie1.1.0-1all
buster0.1.1~r57554-2all
stretch0.1.1~r57554-1all
bullseye0.2.1-3all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

This is a transitional dummy package. It can safely be removed.

apertium-cat
Apertium single language data for Catalan
Versions of package apertium-cat
ReleaseVersionArchitectures
stretch1.0.0~r65787-1all
buster2.6.0-1all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for Catalan.

apertium-cat-srd
Apertium translation data for the Catalan-Sardinian pair
Versions of package apertium-cat-srd
ReleaseVersionArchitectures
buster1.0.0~r82995-2all
sid1.2.0-1all
bullseye1.1.0-1all
bookworm1.1.0-2all
trixie1.2.0-1all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for translating between the Catalan and Sardinian languages.

apertium-crh
Apertium single language data for Crimean Tatar
Versions of package apertium-crh
ReleaseVersionArchitectures
buster0.2.0~r83161-2all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for Crimean Tatar

apertium-crh-tur
Apertium translation data for the Crimean Tatar-Turkish pair
Versions of package apertium-crh-tur
ReleaseVersionArchitectures
buster0.3.0~r83159-2all
bullseye0.3.0-1all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for translating between the Crimean Tatar and Turkish languages.

apertium-cy-en
Apertium translation data for the Welsh-English pair
Versions of package apertium-cy-en
ReleaseVersionArchitectures
bullseye0.1.1~r57554-7all
buster0.1.1~r57554-4all
stretch0.1.1~r57554-3all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for translating between the Welsh and English languages.

apertium-dan
Apertium single language data for Danish
Versions of package apertium-dan
ReleaseVersionArchitectures
buster0.5.0~r67099-2all
stretch0.5.0~r67099-1all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for Danish.

apertium-dan-nor
Apertium translation data for the Danish-Norwegian pair
Versions of package apertium-dan-nor
ReleaseVersionArchitectures
bullseye1.4.1-2all
stretch1.3.0~r67099-1all
bookworm1.5.0-2all
trixie1.5.0-2all
sid1.5.0-2all
buster1.3.0~r67099-2all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for translating from the Danish to the Norwegian Nynorsk/Norwegian Bokmål variants and from Danish to Norwegian Nynorsk.

apertium-en-ca
Transitional dummy package for apertium-eng-cat
Versions of package apertium-en-ca
ReleaseVersionArchitectures
jessie0.8.9-1amd64,armel,armhf,i386
stretch0.9.3~r61328-1all
trixie1.0.1-5all
sid1.0.1-5all
bookworm1.0.1-5all
bullseye1.0.1-4all
buster0.9.3~r61328-2all
Debtags of package apertium-en-ca:
culturecatalan
fieldlinguistics
roleapp-data
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

This is a transitional dummy package. It can safely be removed.

apertium-en-es
Transitional dummy package for apertium-eng-spa
Versions of package apertium-en-es
ReleaseVersionArchitectures
trixie0.8.1-2all
stretch0.8.0~r57502-2all
buster0.8.0~r57502-4all
bullseye0.8.0~r57502-5all
bookworm0.8.1-2all
sid0.8.1-2all
jessie0.6.0-1.1amd64,armel,armhf,i386
Debtags of package apertium-en-es:
culturespanish
fieldlinguistics
roleapp-data
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

This is a transitional dummy package. It can safely be removed.

apertium-en-gl
Apertium translation data for the English-Galician pair
Versions of package apertium-en-gl
ReleaseVersionArchitectures
bullseye0.5.2~r57551-3all
sid0.5.4-2all
stretch0.5.2~r57551-1all
buster0.5.2~r57551-2all
bookworm0.5.4-1all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for translating between the English and Galician languages.

apertium-eo-ca
Apertium translation data for the Esperanto-Catalan pair
Versions of package apertium-eo-ca
ReleaseVersionArchitectures
jessie0.9.0-1.1amd64,armel,armhf,i386
trixie0.9.2-1all
bookworm0.9.2-1all
sid0.9.2-1all
buster0.9.1~r60655-3all
stretch0.9.1~r60655-1all
bullseye0.9.2-1all
Debtags of package apertium-eo-ca:
culturecatalan, esperanto
fieldlinguistics
roleapp-data
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for translating between the Esperanto and Catalan languages.

apertium-eo-en
Apertium linguistic data to translate between Esperanto and English
Versions of package apertium-eo-en
ReleaseVersionArchitectures
bookworm1.0.2-1all
sid1.0.2-1all
bullseye1.0.0~r63833-3all
buster1.0.0~r63833-2all
stretch1.0.0~r63833-1all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

This is a linguistic package for the Apertium shallow-transfer machine translation system. The package can be used to translate between Esperanto and English.

apertium-eo-es
Apertium translation data for the Esperanto-Spanish pair
Versions of package apertium-eo-es
ReleaseVersionArchitectures
trixie0.9.2-1all
stretch0.9.1~r60655-1all
jessie0.9.0-1.1amd64,armel,armhf,i386
bookworm0.9.2-1all
sid0.9.2-1all
buster0.9.1~r60655-3all
bullseye0.9.1~r60655-4all
Debtags of package apertium-eo-es:
cultureesperanto, spanish
fieldlinguistics
roleapp-data
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for translating between the Esperanto and Spanish languages.

apertium-eo-fr
Apertium translation data for the Esperanto-French pair
Versions of package apertium-eo-fr
ReleaseVersionArchitectures
bullseye0.9.1-1all
buster0.9.0~r57551-2all
stretch0.9.0~r57551-1all
sid0.9.1-1all
trixie0.9.1-1all
bookworm0.9.1-1all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for translating between the Esperanto and French languages.

apertium-es-ast
Transitional dummy package for apertium-spa-ast
Versions of package apertium-es-ast
ReleaseVersionArchitectures
stretch1.1.0~r51165-1all
trixie1.1.1-2all
bookworm1.1.1-2all
buster1.1.0~r51165-2all
sid1.1.1-2all
bullseye1.1.0~r51165-3all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

This is a transitional dummy package. It can safely be removed.

apertium-es-ca
Transitional dummy package for apertium-spa-cat
Versions of package apertium-es-ca
ReleaseVersionArchitectures
stretch1.2.1+svn~57448-4all
bookworm2.2.0-3all
sid2.2.0-3all
buster2.1.0~r79717-2all
bullseye2.2.0-2all
jessie1.1.0-1.1amd64,armel,armhf,i386
trixie2.2.0-3all
Debtags of package apertium-es-ca:
culturecatalan, spanish
fieldlinguistics
roleapp-data
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

This is a transitional dummy package. It can safely be removed.

apertium-es-gl
Apertium translation data for the Spanish-Galician pair
Versions of package apertium-es-gl
ReleaseVersionArchitectures
trixie1.0.9-3all
bullseye1.0.8~r57542-4all
bookworm1.0.9-3all
jessie1.0.7-1amd64,armel,armhf,i386
buster1.0.8~r57542-3all
sid1.0.9-3all
stretch1.0.8~r57542-2all
Debtags of package apertium-es-gl:
culturegalician, spanish
fieldlinguistics
roleapp-data
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for translating between the Spanish and Galician languages.

apertium-es-it
Transitional dummy package for apertium-spa-ita
Versions of package apertium-es-it
ReleaseVersionArchitectures
buster0.2.0~r78826-2all
sid0.2.1-3all
stretch0.1.0~r51165-1all
bullseye0.2.0~r78826-2.1all
trixie0.2.1-3all
bookworm0.2.1-3all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

This is a transitional dummy package. It can safely be removed.

apertium-es-pt
Apertium translation data for the Spanish-Portuguese pair
Versions of package apertium-es-pt
ReleaseVersionArchitectures
buster1.1.5+svn~57507-4all
bullseye1.1.5+svn~57507-5all
bookworm1.1.6-1all
trixie1.1.6-1all
sid1.1.6-1all
jessie1.0.3-2.1amd64,armel,armhf,i386
stretch1.1.5+svn~57507-3all
Debtags of package apertium-es-pt:
cultureesperanto, portuguese, spanish
fieldlinguistics
roleapp-data
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for translating between the Spanish and Portuguese languages.

apertium-es-ro
Apertium translation data for the Spanish-Romanian pair
Versions of package apertium-es-ro
ReleaseVersionArchitectures
sid0.7.5-1all
jessie0.7.1-2.1amd64,armel,armhf,i386
stretch0.7.3~r57551-2all
buster0.7.3~r57551-3all
bullseye0.7.3~r57551-4all
bookworm0.7.5-1all
Debtags of package apertium-es-ro:
cultureromanian, spanish
fieldlinguistics
roleapp-data
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for translating between the Spanish and Romanian languages.

apertium-eu-en
Apertium translation data for the Basque-English pair
Versions of package apertium-eu-en
ReleaseVersionArchitectures
stretch0.3.1~r56205-1all
bullseye0.3.1~r56205-3all
buster0.3.1~r56205-2all
bookworm0.3.3-1all
sid0.3.3-1all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for translating between the Basque and English languages.

apertium-eu-es
Apertium translation data for the Basque-Spanish pair
Versions of package apertium-eu-es
ReleaseVersionArchitectures
bullseye0.3.3~r56159-4all
sid0.3.4-1all
jessie0.3.1-1amd64,armel,armhf,i386
bookworm0.3.4-1all
trixie0.3.4-1all
stretch0.3.3~r56159-2all
buster0.3.3~r56159-3all
Debtags of package apertium-eu-es:
culturebasque, spanish
fieldlinguistics
roleapp-data
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for translating between the Basque and Spanish languages.

apertium-fr-ca
??? missing short description for package apertium-fr-ca :-(
Versions of package apertium-fr-ca
ReleaseVersionArchitectures
jessie1.0.2-1amd64,armel,armhf,i386
stretch1.1.0~r64309-1all
Debtags of package apertium-fr-ca:
culturecatalan, french
fieldlinguistics
roleapp-data
Popcon: users ( upd.)*
Versions and Archs
License: DFSG free
Git
apertium-fr-es
Apertium translation data for the French-Spanish pair
Versions of package apertium-fr-es
ReleaseVersionArchitectures
stretch0.9.2~r61322-2all
jessie0.9.0-1amd64,armel,armhf,i386
bookworm0.9.4-1all
trixie0.9.4-1all
sid0.9.4-1all
bullseye0.9.2~r61322-4all
buster0.9.2~r61322-3all
Debtags of package apertium-fr-es:
culturefrench, spanish
fieldlinguistics
roleapp-data
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for translating between the French and Spanish languages.

apertium-fra
Apertium single language data for French
Versions of package apertium-fra
ReleaseVersionArchitectures
buster1.5.0-1all
stretch1.0.0~r65786-1all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for French.

apertium-fra-cat
Apertium translation data for the French-Catalan pair
Versions of package apertium-fra-cat
ReleaseVersionArchitectures
trixie1.10.0-1all
buster1.5.0-1all
stretch1.1.0~r64309-1all
bullseye1.9.0-1all
bookworm1.10.0-1all
sid1.10.0-1all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for translating between the French and Catalan languages.

apertium-hbs
Apertium single language data for Serbo-Croatian
Versions of package apertium-hbs
ReleaseVersionArchitectures
stretch0.5.0~r68212-2all
buster0.5.0~r68212-3all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for Serbo-Croatian.

apertium-hbs-eng
Apertium translation data for the Serbo-Croatian - English pair
Versions of package apertium-hbs-eng
ReleaseVersionArchitectures
bullseye0.5.1-1all
sid0.5.1-2all
stretch0.1.0~r57598-1all
buster0.1.0~r57598-2all
bookworm0.5.1-2all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for translating between the Serbo-Croatian and English languages.

apertium-hbs-mkd
Apertium translation data for the Serbo-Croatian-Macedonian pair
Versions of package apertium-hbs-mkd
ReleaseVersionArchitectures
stretch0.1.0~r57554-1all
bullseye0.1.0~r76450-4all
buster0.1.0~r76450-2.1all
bookworm0.1.1-1all
trixie0.1.1-1all
sid0.1.1-1all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for translating between the Serbo-Croatian and Macedonian languages.

apertium-hbs-slv
Apertium translation data for the Serbo-Croatian-Slovenian pair
Versions of package apertium-hbs-slv
ReleaseVersionArchitectures
bullseye0.5.1-1all
buster0.1.0~r59294-2all
trixie0.5.1-2all
bookworm0.5.1-2all
sid0.5.1-2all
stretch0.1.0~r59294-1all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for translating between the Serbo-Croatian and Slovenian languages.

apertium-hin
Apertium single language data for Hindi
Versions of package apertium-hin
ReleaseVersionArchitectures
sid0.1.0~r59158-4all
buster0.1.0~r59158-2all
stretch0.1.0~r59158-1all
bullseye0.1.0~r59158-2.1all
bookworm0.1.0~r59158-4all
upstream0.1.0
Popcon: 0 users (0 upd.)*
Newer upstream!
License: DFSG free
Git

Data package providing Apertium language resources for Hindi.

apertium-id-ms
Transitional dummy package for apertium-ind-zlm
Versions of package apertium-id-ms
ReleaseVersionArchitectures
stretch0.1.1~r57551-1all
buster0.1.1~r57551-2all
bullseye0.1.2-3all
bookworm0.1.2-3all
trixie0.1.2-3all
sid0.1.2-3all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

This is a transitional dummy package. It can safely be removed.

apertium-is-sv
Transitional dummy package for apertium-isl-swe
Versions of package apertium-is-sv
ReleaseVersionArchitectures
bullseye0.1.0~r76450-3all
sid0.1.1-2all
trixie0.1.1-2all
bookworm0.1.1-2all
buster0.1.0~r76450-2all
stretch0.1.0~r56030-1all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

This is a transitional dummy package. It can safely be removed.

apertium-isl
Apertium single language data for Icelandic
Versions of package apertium-isl
ReleaseVersionArchitectures
buster0.1.0~r65494-2all
stretch0.1.0~r65494-1all
bullseye0.1.0~r65494-2.1all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for Icelandic.

apertium-isl-eng
Apertium translation data for the Icelandic-English pair
Versions of package apertium-isl-eng
ReleaseVersionArchitectures
buster0.1.0~r66083-2all
bookworm0.1.2-1all
sid0.1.2-1all
stretch0.1.0~r66083-1all
bullseye0.1.0~r66083-3all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for translating between the Icelandic and English languages.

apertium-ita
Apertium single language data for Italian
Versions of package apertium-ita
ReleaseVersionArchitectures
stretch0.9.0~r72553-1all
bullseye0.10.0~r82237-2.1all
buster0.10.0~r82237-2all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for Italian.

apertium-kaz
Apertium single language data for Kazakh
Versions of package apertium-kaz
ReleaseVersionArchitectures
buster0.1.0~r61338-2all
stretch0.1.0~r61338-1all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for Kazakh

apertium-kaz-tat
Apertium translation data for the Kazakh-Tatar pair
Versions of package apertium-kaz-tat
ReleaseVersionArchitectures
buster0.2.1~r57554-2all
stretch0.2.1~r57554-1all
bullseye0.2.1-1all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for translating between the Kazakh and Tatar languages.

apertium-mk-bg
Transitional dummy package for apertium-mkd-bul
Versions of package apertium-mk-bg
ReleaseVersionArchitectures
buster0.2.0~r49489-2all
bullseye0.2.0~r49489-3all
bookworm0.2.1-2all
trixie0.2.1-2all
sid0.2.1-2all
stretch0.2.0~r49489-1all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

This is a transitional dummy package. It can safely be removed.

apertium-mk-en
Transitional dummy package for apertium-mkd-eng
Versions of package apertium-mk-en
ReleaseVersionArchitectures
buster0.1.1~r57554-2all
stretch0.1.1~r57554-1all
bullseye0.1.1~r57554-3all
bookworm0.1.3-2all
trixie0.1.3-2all
sid0.1.3-2all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

This is a transitional dummy package. It can safely be removed.

apertium-mlt-ara
Apertium translation data for the Maltese-Arabic pair
Versions of package apertium-mlt-ara
ReleaseVersionArchitectures
bullseye0.2.0~r62623-2.1all
stretch0.2.0~r62623-1all
buster0.2.0~r62623-2all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for translating between the Maltese and Arabic languages.

apertium-nno
Apertium single language data for Norwegian Nynorsk
Versions of package apertium-nno
ReleaseVersionArchitectures
buster0.9.0~r69513-3all
stretch0.9.0~r69513-1all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for Norwegian Nynorsk.

apertium-nno-nob
Apertium translation data for the Norwegian Nynorsk-Norwegian Bokmål pair
Versions of package apertium-nno-nob
ReleaseVersionArchitectures
sid1.5.0-1all
bookworm1.5.0-1all
trixie1.5.0-1all
stretch1.1.0~r66076-1all
buster1.1.0~r66076-2all
bullseye1.3.0-1all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for translating between the Norwegian Nynorsk and Norwegian Bokmål languages.

apertium-nob
Apertium single language data for Norwegian Bokmål
Versions of package apertium-nob
ReleaseVersionArchitectures
stretch0.9.0~r69513-1all
buster0.9.0~r69513-2all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for Norwegian Bokmål.

apertium-oc-ca
Apertium translation data for the Occitan-Catalan pair
Versions of package apertium-oc-ca
ReleaseVersionArchitectures
trixie1.0.7-1all
stretch1.0.6~r57551-2all
bookworm1.0.7-1all
jessie1.0.5-1.1amd64,armel,armhf,i386
buster1.0.6~r57551-3all
bullseye1.0.6~r57551-4all
sid1.0.7-1all
Debtags of package apertium-oc-ca:
culturecatalan
fieldlinguistics
roleapp-data
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for translating between the Occitan and Catalan languages.

apertium-oc-es
Apertium translation data for the Occitan-Spanish pair
Versions of package apertium-oc-es
ReleaseVersionArchitectures
buster1.0.6~r57551-3all
bookworm1.0.8-1all
stretch1.0.6~r57551-2all
sid1.0.8-1all
jessie1.0.5-1.1amd64,armel,armhf,i386
bullseye1.0.6~r57551-4all
Debtags of package apertium-oc-es:
culturespanish
fieldlinguistics
roleapp-data
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for translating between the Occitan and Spanish languages.

apertium-oci
Apertium single language data for Occitan
Versions of package apertium-oci
ReleaseVersionArchitectures
buster0.1.0-1all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for Occitan.

apertium-pol
Apertium single language data for Polish
Versions of package apertium-pol
ReleaseVersionArchitectures
buster0.1.1-1all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for Polish.

apertium-pol-szl
Apertium translation data for the Polish-Silesian pair
Versions of package apertium-pol-szl
ReleaseVersionArchitectures
sid0.2.1-3all
bookworm0.2.1-3all
trixie0.2.1-3all
bullseye0.2.1-2all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for translating between the Polish and Silesian languages.

apertium-pt-ca
Transitional dummy package for apertium-por-cat
Versions of package apertium-pt-ca
ReleaseVersionArchitectures
buster0.8.2+svn~57507-4all
stretch0.8.2+svn~57507-3all
bookworm0.10.1-2all
sid0.10.1-2all
trixie0.10.1-2all
jessie0.8.1-1amd64,armel,armhf,i386
bullseye0.10.0-1all
Debtags of package apertium-pt-ca:
culturecatalan, portuguese
fieldlinguistics
roleapp-data
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

This is a transitional dummy package. It can safely be removed.

apertium-pt-gl
포르투갈어-갈리시아어 간 번역에 대한 아페르티움 변환 데이터
Versions of package apertium-pt-gl
ReleaseVersionArchitectures
bookworm0.9.3-1all
jessie0.9.1-1amd64,armel,armhf,i386
stretch0.9.2~r57551-2all
buster0.9.2~r57551-3all
bullseye0.9.2~r57551-4all
trixie0.9.3-1all
sid0.9.3-1all
Debtags of package apertium-pt-gl:
culturegalician, portuguese
fieldlinguistics
roleapp-data
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

포르투갈어와 갈라시아어 사이에 번역을 위한 아페르티움 언어 리소스를 제공하는 데이터 패키지.

apertium-rus
Apertium single language data for Russian
Versions of package apertium-rus
ReleaseVersionArchitectures
buster0.2.0~r82706-1all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for Russian

apertium-separable
Reordering separable/discontiguous multiwords
Versions of package apertium-separable
ReleaseVersionArchitectures
sid0.6.1-1amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
buster0.3.2-1amd64,arm64,armhf,i386
bullseye0.3.6-2amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bookworm0.6.1-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
trixie0.6.1-1amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
Popcon: 4 users (3 upd.)*
Versions and Archs
License: DFSG free
Git

Apertium module for reordering separable/discontiguous multiwords.

apertium-sme-nob
Apertium translation data for the Northern Sami-Norwegian Bokmål pair
Versions of package apertium-sme-nob
ReleaseVersionArchitectures
buster0.6.0~r61921-2all
stretch0.6.0~r61921-1all
bullseye0.6.1+ds.1-2all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for translating between the Northern Sami and Norwegian Bokmål languages.

apertium-spa
Apertium single language data for Spanish
Versions of package apertium-spa
ReleaseVersionArchitectures
stretch0.1.0~r65494-1all
bullseye1.1.0~r79716-2.1all
buster1.1.0~r79716-2all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for Spanish.

apertium-spa-arg
Apertium translation data for the Spanish-Aragonese pair
Versions of package apertium-spa-arg
ReleaseVersionArchitectures
trixie0.6.0-2all
bookworm0.5.0-2all
sid0.6.0-2all
bullseye0.5.0-1all
buster0.4.0~r64399-2all
stretch0.4.0~r64399-1all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for translating between the Spanish and Aragonese languages.

apertium-srd
Apertium single language data for Sardinian
Versions of package apertium-srd
ReleaseVersionArchitectures
buster1.2.0~r82994-2all
stretch0.9.0~r72792-1all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for Sardinian.

apertium-srd-ita
Apertium translation data for the Sardinian-Italian pair
Versions of package apertium-srd-ita
ReleaseVersionArchitectures
sid1.3.0-1all
bookworm1.1.0-2all
buster0.9.5~r82237-2all
stretch0.9.0~r72554-1all
trixie1.3.0-1all
bullseye1.1.0-1all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for translating between the Sardinian and Italian languages.

apertium-swe
Apertium single language data for Swedish
Versions of package apertium-swe
ReleaseVersionArchitectures
stretch0.7.0~r69513-1all
buster0.7.0~r69513-2all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for Swedish.

apertium-swe-dan
Apertium translation data for the Swedish-Danish pair
Versions of package apertium-swe-dan
ReleaseVersionArchitectures
sid0.8.1-3all
trixie0.8.1-3all
bookworm0.8.1-3all
bullseye0.8.1-2all
stretch0.7.0~r66063-1all
buster0.7.0~r66063-2all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for translating between the Swedish and Danish languages.

apertium-swe-nor
Apertium translation data for the Swedish-Norwegian pair
Versions of package apertium-swe-nor
ReleaseVersionArchitectures
bookworm0.4.0-1all
bullseye0.3.1-1all
sid0.4.0-1all
trixie0.4.0-1all
buster0.2.0~r69544-2all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for translating between the Swedish and Norwegian languages.

apertium-szl
Apertium single language data for Silesian
Versions of package apertium-szl
ReleaseVersionArchitectures
buster0.1.0-1all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for Silesian.

apertium-tat
Apertium single language data for Tatar
Versions of package apertium-tat
ReleaseVersionArchitectures
stretch0.1.0~r60887-1all
buster0.1.0~r60887-2all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for Tatar

apertium-tur
Apertium single language data for Turkish
Versions of package apertium-tur
ReleaseVersionArchitectures
buster0.2.0~r83161-2all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for Turkish.

apertium-ukr
Apertium single language data for Ukrainian
Versions of package apertium-ukr
ReleaseVersionArchitectures
buster0.1.0~r82563-2all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Data package providing Apertium language resources for Ukrainian.

apertium-urd
Apertium single language data for Urdu
Versions of package apertium-urd
ReleaseVersionArchitectures
bullseye0.1.0~r61311-2.1all
bookworm0.1.0~r61311-3all
sid0.1.0~r61311-3all
stretch0.1.0~r61311-1all
buster0.1.0~r61311-2all
upstream0.1.0
Popcon: 0 users (0 upd.)*
Newer upstream!
License: DFSG free
Git

Data package providing Apertium language resources for Urdu.

apertium-urd-hin
Apertium translation data for the Urdu-Hindi pair
Versions of package apertium-urd-hin
ReleaseVersionArchitectures
sid0.1.0~r64379-4all
stretch0.1.0~r64379-1all
bookworm0.1.0~r64379-4all
bullseye0.1.0~r64379-2.1all
buster0.1.0~r64379-2all
upstream0.1.0
Popcon: 0 users (0 upd.)*
Newer upstream!
License: DFSG free
Git

Data package providing Apertium language resources for translating between the Urdu and Hindi languages.

frogdata
Data files for Frog
Versions of package frogdata
ReleaseVersionArchitectures
stretch0.13-1all
trixie0.22-1all
bookworm0.18-2all
buster0.16-1all
bullseye0.18-1all
jessie0.4-1all
sid0.22-1all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Frog is a modular system integrating a morphosyntactic tagger, lemmatizer, morphological analyzer, and dependency parser for the Dutch language.

This package provided necessary datafiles for running Frog.

Frog is a product of the Centre for Language and Speech Technology (Radboud University, Nijmegen) and prior to that of ILK Research Group (Tilburg University, The Netherlands) and the CLiPS Research Centre (University of Antwerp, Belgium). It is currently maintained at the KNAW Humanities Cluster.

libapache-opennlp-java
machine learning based toolkit for the processing of natural language text
Versions of package libapache-opennlp-java
ReleaseVersionArchitectures
bookworm2.1.0-1all
bullseye1.9.3-1all
trixie2.4.0-1all
sid2.4.0-1all
Popcon: 1 users (2 upd.)*
Versions and Archs
License: DFSG free
Git

The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text. It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution. These tasks are usually required to build more advanced text processing services. OpenNLP also included maximum entropy and perceptron based machine learning.

libcg3-dev
Headers and shared files to develop using the CG-3 library
Versions of package libcg3-dev
ReleaseVersionArchitectures
stretch0.9.9~r11624-1amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
bullseye1.3.2-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
sid1.4.6-1amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
bookworm1.3.9-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
trixie1.4.6-1amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
buster1.1.7-1amd64,arm64,armhf,i386
Popcon: 0 users (1 upd.)*
Versions and Archs
License: DFSG free
Git

Development files to use the CG-3 API.

It is recommended to instrument the CLI tools instead of using this API.

See https://visl.sdu.dk/cg3.html for more documentation

libfasttext-dev
Header files of fastText
Versions of package libfasttext-dev
ReleaseVersionArchitectures
bookworm0.9.2+ds-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bullseye0.9.2-3amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
sid0.9.2+ds-7amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
trixie0.9.2+ds-7amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
Popcon: 0 users (2 upd.)*
Versions and Archs
License: DFSG free
Git

fastText is a library for efficient learning of word representations and sentence classification, which refers subword information to enrich word vectors. This package contains header files for development.

libfolia-dev
Implementation of the FoLiA document format (C++ headers)
Versions of package libfolia-dev
ReleaseVersionArchitectures
bullseye2.4-2amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
stretch1.6-2amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
buster1.15-1amd64,arm64,armhf,i386
sid2.17-2amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
trixie2.17-2amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
bookworm2.4-2amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
jessie0.10-4.2amd64,armel,armhf,i386
upstream2.20
Debtags of package libfolia-dev:
devellibrary
roledevel-lib
Popcon: 0 users (1 upd.)*
Newer upstream!
License: DFSG free
Git

FoLiA is an XML-based format for Linguistic Annotation suitable for representing written language resources such as corpora. Its goal is to unify a variety of linguistic annotations in one single rich format, without committing to any particular standard annotation set. Instead, it seeks to accommodate any desired system or tagset, and so offer maximum flexibility. This makes FoLiA language independent. see https://proycon.github.io/folia for more information.

libfolia is a product of the Centre of Language and Speech Technology, Radboud University Nijmegen (The Netherlands), it was previously developed at the ILK Research Group, Tilburg University. Work on libfolia is funded by NWO, the Netherlands Organisation for Scientific Research, in the scope of projects like CLARIN-NL and CLARIAH.

This package provides the FoLiA header files required to compile C++ programs that use libfolia and implements FoLiA v2.5.1.

libmbt-dev
memory-based tagger-generator and tagger - development
Versions of package libmbt-dev
ReleaseVersionArchitectures
sid3.10-3amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
buster3.4-1amd64,arm64,armhf,i386
bullseye3.6-3amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bookworm3.6-3amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
trixie3.10-3amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
Debtags of package libmbt-dev:
devellibrary
roledevel-lib
Popcon: 0 users (1 upd.)*
Versions and Archs
License: DFSG free
Git

MBT is a memory-based tagger-generator and tagger in one. The tagger-generator part can generate a sequence tagger on the basis of a training set of tagged sequences; the tagger part can tag new sequences. MBT can, for instance, be used to generate part-of-speech taggers or chunkers for natural language processing.

MBT is a product of the Centre of Language and Speech Technology (Radboud University Nijmegen, The Netherlands), the ILK Research Group (Tilburg University, The Netherlands) and the CLiPS Research Centre (University of Antwerp, Belgium).

If you do scientific research in natural language processing, MBT will likely be of use to you.

This package provides the header files required to compile C++ programs that use libmbt.

libopennlp-maxent-java
OpenNLP Maximum Entropy Package
Versions of package libopennlp-maxent-java
ReleaseVersionArchitectures
trixie3.0.0+ds-2all
bookworm3.0.0+ds-2all
bullseye3.0.0+ds-2all
sid3.0.0+ds-2all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Maximum entropy is a powerful method for constructing statistical models of classification tasks, such as part of speech tagging in Natural Language Processing. Several example applications using maxent can be found in the OpenNLP Tools Library.

libsentencepiece-dev
Header files of SentencePiece
Versions of package libsentencepiece-dev
ReleaseVersionArchitectures
bookworm0.1.97-3amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
sid0.2.0-1amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
bullseye0.1.95-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
trixie0.2.0-1amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

SentencePiece is an unsupervised text tokenizer/detokenizer mainly designed for Neural Network-based text generation systems where the vocabulary size is predetermined prior to the neural model training.

libticcutils-dev
utility functions used in the context of Natural Language Processing (headers)
Versions of package libticcutils-dev
ReleaseVersionArchitectures
bookworm0.24-2amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
sid0.34-2amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
trixie0.34-2amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
bullseye0.24-2amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
buster0.20-1amd64,arm64,armhf,i386
upstream0.35
Debtags of package libticcutils-dev:
devellibrary
roledevel-lib
Popcon: 0 users (0 upd.)*
Newer upstream!
License: DFSG free
Git

The TiCC utils C++ library contains useful functions and other goodies for general use in TiMBL and other parts of the TiCC software stack and beyond.

TiCC utils is a product of the Tilburg centre for Cognition and Communication (Tilburg University, The Netherlands). If you do scientific research in Natural Language Processing, TiCC software will likely be of use to you.

This package provides the header files required to compile C++ programs that use libticcutils.

libticcutils2-dev
??? missing short description for package libticcutils2-dev :-(
Versions of package libticcutils2-dev
ReleaseVersionArchitectures
jessie0.4-5.1amd64,armel,armhf,i386
stretch0.14-1amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
upstream0.35
Debtags of package libticcutils2-dev:
devellibrary
roledevel-lib
Popcon: 1 users (0 upd.)*
Newer upstream!
License: DFSG free
Git
libtimbl-dev
Tilburg Memory Based Learner - development
Versions of package libtimbl-dev
ReleaseVersionArchitectures
bullseye6.5-3amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
buster6.4.13-1amd64,arm64,armhf,i386
bookworm6.5-3amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
trixie6.9-2amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
sid6.9-2amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
Debtags of package libtimbl-dev:
devellibrary
roledevel-lib
Popcon: 0 users (1 upd.)*
Versions and Archs
License: DFSG free
Git

The Tilburg Memory Based Learner, TiMBL, is a tool for Natural Language Processing research, and for many other domains where classification tasks are learned from examples. It is an efficient implementation of k-nearest neighbor classifier.

TiMBL is a product of the Centre of Language and Speech Technology (Radboud University, Nijmegen, The Netherlands), the ILK Research Group (Tilburg University, The Netherlands) and the CLiPS Research Centre (University of Antwerp, Belgium).

This package provides the TiMBL header files required to compile C++ programs that use TiMBL.

libtimblserver-dev
Server extensions for Timbl - development
Versions of package libtimblserver-dev
ReleaseVersionArchitectures
bullseye1.14-3amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
trixie1.18-3amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
sid1.18-3amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
buster1.12-1amd64,arm64,armhf,i386
bookworm1.14-3amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
Debtags of package libtimblserver-dev:
devellibrary
roledevel-lib
Popcon: users ( upd.)*
Versions and Archs
License: DFSG free
Git

timblserver is a TiMBL wrapper; it adds server functionality to TiMBL. It allows TiMBL to run multiple experiments as a TCP server, optionally via HTTP.

The Tilburg Memory Based Learner, TiMBL, is a tool for Natural Language Processing research, and for many other domains where classification tasks are learned from examples.

TimblServer is a product of the ILK Research Group (Tilburg University, The Netherlands) and the CLiPS Research Centre (University of Antwerp, Belgium).

This package provides the header files required to compile C++ programs that use timblserver.

libucto-dev
Unicode Tokenizer - development
Versions of package libucto-dev
ReleaseVersionArchitectures
jessie0.5.3-3.1amd64,armel,armhf,i386
buster0.14-2amd64,arm64,armhf,i386
bullseye0.21.1-2amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
stretch0.9.6-1amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
bookworm0.21.1-2amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
trixie0.30-3amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
sid0.30-3amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
upstream0.34
Debtags of package libucto-dev:
devellibrary
roledevel-lib
Popcon: 0 users (1 upd.)*
Newer upstream!
License: DFSG free
Git

Ucto can tokenize UTF-8 encoded text files (i.e. separate words from punctuation, split sentences, generate n-grams), and offers several other basic preprocessing steps that make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation.

This package provides C++ headers for the programming library.

Ucto was written by Maarten van Gompel and Ko van der Sloot. Work on Ucto was funded by NWO, the Netherlands Organisation for Scientific Research, under the Implicit Linguistics project, the CLARIN-NL program, and the CLARIAH project.

Ucto is a product of the Centre of Language and Speech Technology (Radboud University Nijmegen), the KNAW Humanities Cluster, and previously the ILK Research Group (Tilburg University, The Netherlands).

If you are interested in machine parsing of UTF-8 encoded text files, e.g. to do scientific research in natural language processing, ucto will likely be of use to you.

python3-fasttext
fastText binding for Python3
Versions of package python3-fasttext
ReleaseVersionArchitectures
trixie0.9.2+ds-7amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
bookworm0.9.2+ds-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bullseye0.9.2-3amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
sid0.9.2+ds-7amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
Popcon: 3 users (5 upd.)*
Versions and Archs
License: DFSG free
Git

fastText is a library for efficient learning of word representations and sentence classification, which refers subword information to enrich word vectors.

python3-fasttext is its binding for Python3.

python3-gensim
Python framework for fast Vector Space Modelling
Versions of package python3-gensim
ReleaseVersionArchitectures
bookworm4.2.0+dfsg-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
sid4.3.3+dfsg-2amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
Popcon: 9 users (2 upd.)*
Versions and Archs
License: DFSG free
Git

Gensim is a Python library for topic modelling, document indexing and similarity retrieval with large corpora. The target audience is the natural language processing (NLP) and information retrieval (IR) community.

python3-nltk
Python3 libraries for natural language processing
Versions of package python3-nltk
ReleaseVersionArchitectures
bookworm3.8-1all
jessie3.0.0-1all
sid3.9.1-2all
trixie3.9.1-2all
stretch3.2.1-2all
bullseye3.5-1all
buster3.4-1all
Popcon: 4213 users (327 upd.)*
Versions and Archs
License: DFSG free
Git

The Natural Language Toolkit (NLTK) is a leading platform for building Python programs to work with human language data. It provides easy-to-use interfaces to over 50 corpora and lexical resources such as WordNet, along with a suite of text processing libraries for classification, tokenization, stemming, tagging, parsing, and semantic reasoning.

This package contains the modules for Python3.

Please cite: Steven Bird, Ewan Klein and Edward Loper: (2009)
python3-sentencepiece
SentencePiece binding for Python3
Versions of package python3-sentencepiece
ReleaseVersionArchitectures
bookworm0.1.97-3amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
trixie0.2.0-1amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
sid0.2.0-1amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
bullseye0.1.95-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
Popcon: 2 users (4 upd.)*
Versions and Archs
License: DFSG free
Git

SentencePiece is an unsupervised text tokenizer/detokenizer mainly designed for Neural Network-based text generation systems where the vocabulary size is predetermined prior to the neural model training.

python3-sentencepiece is its binding for Python3.

python3-snowballstemmer
Pure Python Snowball stemming library
Maintainer: Stefano Rivera
Versions of package python3-snowballstemmer
ReleaseVersionArchitectures
sid2.2.0-4all
trixie2.2.0-4all
stretch1.2.1-1all
bullseye2.1.0-1all
bookworm2.2.0-2all
buster1.2.1-1all
Popcon: 2106 users (720 upd.)*
Versions and Archs
License: DFSG free
Git

Snowball provides access to efficient algorithms for calculating a "stemmed" form of a word. This is a form with most of the common morphological endings removed; hopefully representing a common linguistic base form. This is most useful in building search engines and information retrieval software; for example, a search with stemming enabled should be able to find a document containing "cycling" given the query "cycles".

Snowball provides algorithms for several (mainly European) languages. It also provides access to the classic Porter stemming algorithm for English: although this has been superseded by an improved algorithm, the original algorithm may be of interest to information retrieval researchers wishing to reproduce results of earlier experiments.

This package contains the pure Python module that implements Snowball algorithms. When python3-stemmer package (which contains the C extension) is installed, it uses that extension instead of the pure Python code.

python3-streamparser
Python library to parse Apertium stream format
Versions of package python3-streamparser
ReleaseVersionArchitectures
sid5.0.2-2all
buster5.0.2-1all
bookworm5.0.2-2all
trixie5.0.2-2all
bullseye5.0.2-2all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

This package provides Python 3 library, streamparser, to parse Apertium stream format.

r-cran-nlp
Natural Language Processing Infrastructure for R
Versions of package r-cran-nlp
ReleaseVersionArchitectures
bookworm0.2-1-1all
stretch0.1-9-1all
bullseye0.2-1-1all
buster0.2-0-1all
sid0.2-1-1all
stretch-backports0.2-0-1~bpo9+1all
trixie0.2-1-1all
upstream0.3-0
Popcon: 18 users (3 upd.)*
Newer upstream!
License: DFSG free
Git

Basic classes and methods for Natural Language Processing in R.

r-cran-tm
Text Mining functionality for R
Versions of package r-cran-tm
ReleaseVersionArchitectures
stretch-backports0.7-6-1~bpo9+1amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
bookworm0.7-11-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
sid0.7-14-1amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
bullseye0.7-8-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
stretch0.6-2-3amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
trixie0.7-14-1amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
buster0.7-6-1amd64,arm64,armhf,i386
Popcon: 18 users (3 upd.)*
Versions and Archs
License: DFSG free
Git

A framework for text mining applications within R.

tfdocgen
TiLP framework documentation generator
Versions of package tfdocgen
ReleaseVersionArchitectures
bookworm1.0-4amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
trixie1.0-4amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
sid1.0-4amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
buster1.0-2amd64,arm64,armhf,i386
bullseye1.0-3amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
stretch1.0-1amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
jessie1.0-1amd64,armel,armhf,i386
Debtags of package tfdocgen:
develdocsystem
roleprogram
Popcon: users ( upd.)*
Versions and Archs
License: DFSG free
Git

The tfdocgen program is a program used by the libti2 libraries to generate their HTML documentation from sources and misc files. You don't need this package unless you want to develop on the libti2 libraries.

Debian packages in experimental

sequitur-g2p
Grapheme to Phoneme conversion tool
Maintainer: Giulio Paci
Versions of package sequitur-g2p
ReleaseVersionArchitectures
experimental0+r1668.r3-1amd64,arm64,armel,armhf,i386,mips64el,ppc64el,s390x
Popcon: users ( upd.)*
Versions and Archs
License: DFSG free
Git

Sequitur G2P is a data-driven grapheme-to-phoneme converter. It can be applied to any monotonous sequence translation problem, provided the source and target alphabets are small (less than 255 symbols). Data-driven means that you need to train it with example pronunciations. Training takes a pronunciation dictionary and creates a model file. The model file can then be used to transcribe words that where not in the dictionary.

Packaging has started and developers might try the packaging code in VCS

spacy
Industrial-strength Natural Language Processing (NLP)
Versions of package spacy
ReleaseVersionArchitectures
VCS2.2.3-1all
Versions and Archs
License: MIT
Debian package not available
Git
Version: 2.2.3-1

spaCy is a library for advanced Natural Language Processing in Python and Cython. It’s built on the very latest research, and was designed from day one to be used in real products. spaCy comes with pre-trained statistical models and word vectors, and currently supports tokenization for 30+ languages. It features the fastest syntactic parser in the world, convolutional neural network models for tagging, parsing and named entity recognition and easy deep learning integration.

travatar
tree based machine translation toolkit
Versions of package travatar
ReleaseVersionArchitectures
VCS0.1.0+git20131221-1all
Versions and Archs
License: LGPL-3.0+
Debian package not available
Git
Version: 0.1.0+git20131221-1

Travatar is tree based statistical machine translation system containing Tree-to-String (T2S) and Forest-to-String (F2S).

Tree based translation uses syntax trees of natural language and it's particularly effective for language pairs that require a large amount of reordering, such as English-Japanese translation.

No known packages available but some record of interest (WNPP bug)

python3-timbl - wnpp
Python bindings for the Tilburg Memory Based Learner (Timbl)
Responsible: Maarten van Gompel
License: unknown
Debian package not available

python-timbl is a Python extension module wrapping the full TiMBL C++ programming interface. With this module, all functionality exposed through the C++ interface is also available to Python scripts. Being able to access the API from Python greatly facilitates prototyping TiMBL-based applications.

TiMBL is an open source software package implementing several memory-based learning algorithms, among which IB1-IG, an implementation of k-nearest neighbor classification with feature weighting suitable for symbolic feature spaces, and IGTree, a decision-tree approximation of IB1-IG. All implemented algorithms have in common that they store some representation of the training set explicitly in memory. During testing, new cases are classified by extrapolation from the most similar stored cases.

The Python module offers both a high-level as well as a low-level interface, the former is very Pythonic and easy to use while the latter offers the full API.

No known packages available

wnsqlbuilder
SQL version of WordNet 3.0
License: GPL
Debian package not available

WordNet SQL Builder is a Java utility to generate SQL database from WordNet standard database as released by the WordNet Project (Princeton University)

Features

  • Support for MySql and PostGreSQL.
  • Complete port (however, orphaned morphological forms are dropped, and so are VerbNet/XWordNet data that cannot be linked to WordNet entries).
  • Incremental build support.
  • Retains synset index as primary key allowing easy reference to wordnet original database
  • Includes support for WordNet 3.0
  • Includes support for WordNet 2.0 to 2.1, 2.1 to 3.0, 2.0 to 3.0 sense maps
  • Includes support for VerbNet 2.3
  • Includes support for XWordNet 2.0-1.1
  • Ready-to-use database (see wnsqldatabase package in download section) including
  • WordNet 3.0
  • WordNet 2.0 to 2.1, 2.1 to 3.0, 2.0 to 3.0 sense maps
  • VerbNet 2.3
  • XWordNet 2.0-1.1
  • British National Corpus statistical data (for commonly used-words)
*Popularitycontest results: number of people who use this package regularly (number of people who upgraded this package recently) out of 246355