Viewing a single comment thread. View all comments

currentscurrents t1_jdu3vwq wrote

Yeah, but they're hand-crafted algorithms and produce code that's hard to read.

10

ultraminxx t1_jdu7uz8 wrote

that said, it might be also a good approach to preprocess the input with a classical algorithm and then train a model on refactoring that decompiled code, so it becomes more readable

10

currentscurrents t1_jdvxga6 wrote

Possibly! But it also seems like a good sequence-to-sequence translation problem, just line up the two streams of tokens and let the model figure it out.

2

s0n0fagun t1_jdu975r wrote

That depends on the language/compiler used. Java and C# have decompilers that turn out great code.

2

currentscurrents t1_jdvxu6g wrote

Those languages don't compile to machine code, they compile to a special bytecode that runs in a VM.

2