Tokenizer-free, span-aware encoder architecture inspired by X-bar theory. Jointly learns segmentation and representation using pointer networks and compositional spans.

1 Open Issue (Need Help) | Last updated: Jul 18, 2025

Open Issues Need Help


AI Summary: Create a comprehensive unit test script for `jsonl2vocab.py` in the `ox-bar` project's `tests` directory. The tests should cover the script's full functionality and verify that it correctly builds a vocabulary from JSONL data.
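As a rough illustration of what such tests might look like, here is a minimal sketch using the standard `unittest` module. The real interface of `jsonl2vocab.py` is not described here, so `build_vocab`, its `field` and `min_count` parameters, and the whitespace splitting are all hypothetical stand-ins defined inline, not the project's actual API.

```python
import json
import unittest
from collections import Counter


def build_vocab(lines, field="text", min_count=1):
    """Hypothetical stand-in for jsonl2vocab.py (assumed behavior, not the real script).

    Counts whitespace-separated tokens found in `field` of each JSONL record.
    """
    counts = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue  # ignore blank lines between records
        record = json.loads(line)  # raises json.JSONDecodeError on malformed input
        counts.update(str(record.get(field, "")).split())
    # Keep only tokens that occur at least `min_count` times.
    return {tok: n for tok, n in counts.items() if n >= min_count}


class BuildVocabTest(unittest.TestCase):
    def test_counts_tokens_across_records(self):
        lines = ['{"text": "a b a"}', '{"text": "b c"}']
        self.assertEqual(build_vocab(lines), {"a": 2, "b": 2, "c": 1})

    def test_skips_blank_lines_and_missing_field(self):
        lines = ["", '{"other": "x"}', '{"text": "x"}']
        self.assertEqual(build_vocab(lines), {"x": 1})

    def test_min_count_filters_rare_tokens(self):
        lines = ['{"text": "a a b"}']
        self.assertEqual(build_vocab(lines, min_count=2), {"a": 2})

    def test_malformed_json_raises(self):
        with self.assertRaises(json.JSONDecodeError):
            build_vocab(["{not json}"])
```

Saved under `tests/`, a file like this could be run with `python -m unittest`. In the actual project the stand-in function would be replaced by an import from the real script.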

Complexity: 3/5
help wanted


Python