Apertium linguistic data for Catalan
Catalan: apertium-cat

This is an Apertium monolingual language package for Catalan. You can use this language package for:

  • Morphological analysis of Catalan
  • Morphological generation of Catalan
  • Part-of-speech tagging of Catalan

Requirements

You will need the following software installed:

  • lttoolbox (>= 3.7.1)
  • apertium (>= 3.8.3)
  • vislcg3 (>= 1.3.9)

If this does not make any sense, we recommend you look at: https://apertium.org
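
The minimum versions listed above can be compared from a shell. The following is a minimal sketch, not part of this package: `version_ge` is a hypothetical helper name, and it assumes that `sort -V` (version sort, as provided by GNU coreutils) is available.

```shell
# version_ge A B: succeed if version A is greater than or equal to version B.
# Hypothetical helper; relies on `sort -V` (version sort) from GNU coreutils.
version_ge() {
    [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)" = "$2" ]
}

# Example with literal version strings; replace the first argument with
# whatever version your system reports for lttoolbox.
if version_ge "3.8.3" "3.7.1"; then
    echo "new enough"
else
    echo "too old"
fi
```

On systems where the tools ship pkg-config metadata, the installed version can probably be obtained with something like `pkg-config --modversion lttoolbox`; treat that as an assumption about your installation rather than a guarantee.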

Compiling

Once the requirements are installed, you should be able to run:

$ ./autogen.sh
$ make

If you're doing development, you don't have to install the data; you can use it directly from this directory.

If you are installing this language package as a prerequisite for an Apertium translation pair, then do (typically as root / with sudo):

# make install

You can give a --prefix to ./autogen.sh to install as a non-root user, but make sure to use the same prefix when installing the translation pair and any other language packages.

If any of this doesn't make sense or doesn't work, see https://wiki.apertium.org/wiki/Install_language_data_by_compiling

Testing

If you are in the source directory after running make, the following commands should work:

$ echo "la casa" | apertium -d . cat-morph
^la/la<n><m><sp>/el<det><def><f><sg>/el<prn><pro><p3><f><sg>/prpers<prn><pro><p3><f><sg>$ ^casa/casa<n><f><sg>/casar<vblex><pri><p3><sg>/casar<vblex><imp><p2><sg>$^./.<sent>$

$ echo "la casa" | apertium -d . cat-tagger
^el<det><def><f><sg>$ ^casa<n><f><sg>$^.<sent>$
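
In the output above, each lexical unit is delimited by `^` and `$`, with the tags in angle brackets after the lemma. For quick inspection, the tagger output can be split into one unit per line. This is a rough sketch only: `split_units` is a hypothetical helper, it assumes a `grep` that supports `-o` (GNU and BSD grep do), and it does not handle escaped characters such as `\^` or `\$` in the stream.

```shell
# split_units: read Apertium stream-format text on stdin and print one
# lexical unit per line as "lemma <tags>". Hypothetical helper; it ignores
# escaped characters such as \^ and \$.
split_units() {
    grep -o '\^[^$]*[$]' | sed -e 's/^\^//' -e 's/[$]$//' -e 's/</ </'
}

# The sample string is the cat-tagger output shown above.
printf '%s\n' '^el<det><def><f><sg>$ ^casa<n><f><sg>$^.<sent>$' | split_units
```

In the source directory, the same function could be placed at the end of the pipeline, for example `echo "la casa" | apertium -d . cat-tagger | split_units`.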

Tagger model training

To train the tagger model, do one of the following:

Supervised training:

$ make -f tagger.supervised.make

Unsupervised training:

$ make -f tagger.unsupervised.make

For details on the corpora used in training, check the corpora information.

For more information, see https://wiki.apertium.org/wiki/Tagger_training

Files and data

For more information

Help and support

If you need help using this language package or its data, you can contact:

See also the file AUTHORS, included in this distribution.