Loquzofaricoalaphar
Loquzofaricoalaphar OP t1_j5klsf9 wrote
Loquzofaricoalaphar OP t1_j5kljft wrote
Reply to comment by sothatsit in [D] With more compute could it be easy to quickly un Mask all the people on Reddit by using text correlations to non masked publicly available text data? by Loquzofaricoalaphar
That’s awesome, thanks for sharing, boss.
Loquzofaricoalaphar OP t1_j5hf96p wrote
Reply to comment by neanderthal_math in [D] With more compute could it be easy to quickly un Mask all the people on Reddit by using text correlations to non masked publicly available text data? by Loquzofaricoalaphar
Thanks, that’s a very interesting resource.
Loquzofaricoalaphar OP t1_j5h6s4z wrote
Reply to comment by PredictorX1 in [D] With more compute could it be easy to quickly un Mask all the people on Reddit by using text correlations to non masked publicly available text data? by Loquzofaricoalaphar
That is interesting to think about. I’m biased to think text patterns have lots of variables and are fairly unique. Perhaps it’s more of a modeling problem than a compute problem to analyze it at scale and not get mush.
Loquzofaricoalaphar OP t1_j5h5kq4 wrote
Reply to comment by [deleted] in [D] With more compute could it be easy to quickly un Mask all the people on Reddit by using text correlations to non masked publicly available text data? by Loquzofaricoalaphar
Perhaps it could return the top 10 most likely authors of the account. Some writing patterns and grammatical errors might be pretty unique, and the more posts an account has, the more unique it gets, right?
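To make the "top 10 most likely authors" idea concrete, here is a rough sketch, assuming you already have a labeled writing sample for each candidate. It compares character trigram counts with cosine similarity, which is just one simple stylometric baseline and not a claim about how any real system works. The function names (`char_ngrams`, `cosine`, `rank_candidates`) are made up for illustration, and the scores are similarities, not calibrated probabilities.

```python
from collections import Counter
from math import sqrt


def char_ngrams(text, n=3):
    """Count overlapping character n-grams in lowercased, whitespace-normalized text."""
    text = " ".join(text.lower().split())
    return Counter(text[i:i + n] for i in range(len(text) - n + 1))


def cosine(a, b):
    """Cosine similarity between two n-gram count vectors (0.0 if either is empty)."""
    dot = sum(count * b[gram] for gram, count in a.items() if gram in b)
    norm_a = sqrt(sum(c * c for c in a.values()))
    norm_b = sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_candidates(unknown_text, known_samples, top_k=10):
    """Return up to top_k (name, similarity) pairs, most similar first.

    known_samples is a hypothetical dict mapping a candidate's name to
    a sample of text known to be written by that person.
    """
    unknown = char_ngrams(unknown_text)
    scores = [
        (name, cosine(unknown, char_ngrams(sample)))
        for name, sample in known_samples.items()
    ]
    scores.sort(key=lambda pair: pair[1], reverse=True)
    return scores[:top_k]
```

You would call something like `rank_candidates(account_text, {"alice": "...", "bob": "..."})` and get back a ranked list. With short or few posts the scores would probably be close together, which is the "mush" problem; more text per account should separate them better.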
Loquzofaricoalaphar OP t1_j5h59id wrote
Reply to comment by PredictorX1 in [D] With more compute could it be easy to quickly un Mask all the people on Reddit by using text correlations to non masked publicly available text data? by Loquzofaricoalaphar
So, like, if you fed it samples from the 200 people you were looking at and then fed it Reddit? Perhaps all of Reddit would be tricky, because some people might not have public text, and it would be difficult to label all the text on Facebook, LinkedIn, etc.
Loquzofaricoalaphar OP t1_j5kmiqg wrote
Reply to comment by MrEloi in [D] With more compute could it be easy to quickly un Mask all the people on Reddit by using text correlations to non masked publicly available text data? by Loquzofaricoalaphar
Yes, this is the sort of thing I am thinking about. Some percentage of people have very distinct styles; however, with Ted it might have been the content that gave it away.
Yes, I am familiar with amiunique and all the variables of the browser.
I wonder if this way of identifying people is ever used when Google or others get subpoenaed and hand over stuff. It seems it would be more accurate than an IP address at pinning down the individual through correlations; however, I wonder whether it is accepted by, or holds up in, a court of law.