currentscurrents t1_jdu3vwq wrote on March 27, 2023 at 5:11 AM

Reply to comment by Smallpaul in [D] Can we train a decompiler? by vintergroena

Yeah, but they're hand-crafted algorithms and produce code that's hard to read.

ultraminxx t1_jdu7uz8 wrote on March 27, 2023 at 5:59 AM

that said, it might be also a good approach to preprocess the input with a classical algorithm and then train a model on refactoring that decompiled code, so it becomes more readable

currentscurrents t1_jdvxga6 wrote on March 27, 2023 at 4:18 PM

Possibly! But it also seems like a good sequence-to-sequence translation problem, just line up the two streams of tokens and let the model figure it out.

s0n0fagun t1_jdu975r wrote on March 27, 2023 at 6:17 AM

That depends on the language/compiler used. Java and C# have decompilers that turn out great code.

currentscurrents t1_jdvxu6g wrote on March 27, 2023 at 4:21 PM

Those languages don't compile to machine code, they compile to a special bytecode that runs in a VM.