Submitted by Emergency_Apricot_77 t3_zmd6l8 in MachineLearning
Been experimenting with language models a lot lately and wondering whether human-generated text (i.e. "natural" text) is really supposed to be maximally likely according to language models, even after training. For example, has anyone compared the likelihood of human-translated text to the likelihood of machine-translated text according to a language model like GPT-3?
Are there any works that do this already? Does this idea even make sense to begin with?
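For concreteness, here's a rough sketch of the comparison I have in mind, using GPT-2 through HuggingFace transformers as a stand-in for GPT-3 (whose weights aren't public). The two example texts are made-up placeholders:

    # Sketch: score two candidate translations under GPT-2
    # (an open stand-in for GPT-3).
    import torch
    from transformers import GPT2LMHeadModel, GPT2TokenizerFast

    tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
    model = GPT2LMHeadModel.from_pretrained("gpt2")
    model.eval()

    def avg_log_likelihood(text: str) -> float:
        """Average per-token log-likelihood of `text` under the model."""
        ids = tokenizer(text, return_tensors="pt").input_ids
        with torch.no_grad():
            # With labels=input_ids the model returns the mean
            # cross-entropy over predicted tokens; negating it gives
            # the average log-likelihood per token.
            loss = model(ids, labels=ids).loss
        return -loss.item()

    # Hypothetical example texts, just to show the comparison.
    human_text = "The cat sat quietly by the window, watching the rain."
    machine_text = "The cat sat silently at the window and watched the rain."

    print("human   :", avg_log_likelihood(human_text))
    print("machine :", avg_log_likelihood(machine_text))

Whichever text gets the higher (less negative) average log-likelihood is the one the model finds more probable, which is the quantity I'm asking about.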
breezedeus t1_j0alnlc wrote
Actually, no, it's not. If human text really were maximally likely under the model, people would know what you were going to say before you even said it.
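One quick way to see this: greedy (argmax) decoding produces the model's locally most likely continuation at every step, and it almost never reproduces what a person actually wrote. A minimal sketch with GPT-2 (my choice of model; the prompt is made up):

    # Greedy decoding picks the single most likely next token at
    # each step. If human text were maximally likely, it would have
    # to coincide with this output -- it doesn't.
    import torch
    from transformers import GPT2LMHeadModel, GPT2TokenizerFast

    tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
    model = GPT2LMHeadModel.from_pretrained("gpt2")
    model.eval()

    prompt = "The weather today is"  # hypothetical prompt
    ids = tokenizer(prompt, return_tensors="pt").input_ids
    with torch.no_grad():
        greedy = model.generate(
            ids,
            max_new_tokens=20,
            do_sample=False,  # greedy / argmax decoding
            pad_token_id=tokenizer.eos_token_id,
        )
    print(tokenizer.decode(greedy[0]))

The greedy output tends to be bland and repetitive, which is exactly why human text sits at a reasonably high, but not maximal, likelihood under the model.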