go-whisper-api/config/xlm-roberta-model.yaml
admin b5c083e06f
Some checks failed
CodeQL / Analyze (go) (push) Successful in 6m28s
Docker Image / build-docker (push) Failing after 13m26s
Lint and Testing / lint (push) Successful in 11m17s
Lint and Testing / test (push) Successful in 11m17s
Lint and Testing / golangci (push) Successful in 2m40s
first commit
2026-06-04 18:10:52 +07:00

82 lines
871 B
YAML
Raw Permalink Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

# Metadata for Salama1429/xlm-roberta_punctuation_fullstop_truecase (ONNX punctuation).
# Install into the model directory:
# cp config/xlm-roberta-model.yaml models/punctuation/xlm-roberta/config.yaml
# or: make install-xlm-punctuation-config
languages:
- af
- am
- ar
- bg
- bn
- de
- el
- en
- es
- et
- fa
- fi
- fr
- gu
- hi
- hr
- hu
- id
- is
- it
- ja
- kk
- kn
- ko
- ky
- lt
- lv
- mk
- ml
- mr
- nl
- or
- pa
- pl
- ps
- pt
- ro
- ru
- rw
- so
- sr
- sw
- ta
- te
- tr
- uk
- zh
max_length: 256
pre_labels:
- "<NULL>"
- "¿"
post_labels:
- "<NULL>"
- "<ACRONYM>"
- "."
- ","
- "?"
- ""
- ""
- "。"
- "、"
- "・"
- "।"
- "؟"
- "،"
- ";"
- "።"
- "፣"
- "፧"
null_token: "<NULL>"
acronym_token: "<ACRONYM>"