Updated Jan 28, 2021 - Python
annotated-corpus
Here are 3 public repositories matching this topic...
An annotated corpus of Turkish–English intra-word code-switching collected from Reddit and annotated using the TREN application. The corpus includes token-level language labels, Leipzig-style morphological glossing and structured source metadata.
Updated Feb 14, 2026
Unified, canonical, open corpus of Biblical Hebrew, Greek, and English texts with morpheme-level linguistic annotation and cross-language alignment for research and computational analysis.
Updated Mar 15, 2026 - Python