
like_a_tensor t1_j7r4mdx wrote

There's been some work on getting models to operate directly at the byte level. An example: https://arxiv.org/abs/2105.13626
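For anyone unsure what "byte level" means here, a minimal sketch, assuming the model's vocabulary is just the 256 possible UTF-8 byte values. The function names are hypothetical and not taken from the linked paper, and real byte-level models typically reserve a few extra IDs for special tokens, which is left out:

```python
def to_byte_ids(text: str) -> list[int]:
    """Map text to its UTF-8 byte values (each in 0..255)."""
    return list(text.encode("utf-8"))


def from_byte_ids(ids: list[int]) -> str:
    """Invert to_byte_ids: turn byte values back into text."""
    return bytes(ids).decode("utf-8")


ids = to_byte_ids("they")
print(ids)                 # [116, 104, 101, 121]
print(from_byte_ids(ids))  # they
```

The input gets split into smaller units, but no information about what a word refers to is added.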

1

MrOfficialCandy OP t1_j7r85li wrote

That doesn't help at all. Reading tokens at the byte level does not stop the word "they" or "it" from being vague in the context of a sentence.

1

like_a_tensor t1_j7rbdno wrote

Sounds like you want something like a logical representation of sentences. Reducing sentences to first-order logic might be what you're looking for. There are also AMRs (Abstract Meaning Representations). The problem with AMRs is that they need to be built, which is non-trivial for machines and time-consuming for humans.
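To make the AMR idea concrete, here is a rough sketch using the textbook example "The boy wants to go", usually written as `(w / want-01 :ARG0 (b / boy) :ARG1 (g / go-01 :ARG0 b))`. The triple layout and the helper names below are my own assumptions for illustration, not the API of any AMR library:

```python
# AMR for "The boy wants to go" as (source, role, target) triples.
# Variable b appears twice, so the graph states outright that the one
# who wants and the one who goes are the same entity.
triples = [
    ("w", ":instance", "want-01"),
    ("b", ":instance", "boy"),
    ("g", ":instance", "go-01"),
    ("w", ":ARG0", "b"),
    ("w", ":ARG1", "g"),
    ("g", ":ARG0", "b"),
]


def concept_of(var, triples):
    """Return the concept a variable is an instance of, or None."""
    for source, role, target in triples:
        if source == var and role == ":instance":
            return target
    return None


def args_of(var, role, triples):
    """Return the concepts filling a given role of a variable."""
    return [
        concept_of(target, triples)
        for source, role_name, target in triples
        if source == var and role_name == role
    ]


print(args_of("w", ":ARG0", triples))  # ['boy']
print(args_of("g", ":ARG0", triples))  # ['boy']
```

The reused variable is what removes the ambiguity a pronoun would leave open. The hard part is still producing the graph from raw text in the first place.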

1