Comments

junkboxraider t1_jegcvli wrote

Wonder whether they included the Elon+1000 and Can'tBlockHim mods in this version?

41

ZestyData t1_jegdmzo wrote

Putting aside the political undertones behind many people's desire to publish "the algorithm", this is a phenomenal piece of educational content for ML professionals.

Here we have a world-class complex recommendation & ranking system laid bare for all to read into and develop upon. This is a veritable gold mine of an educational resource.

631

mikiex t1_jegkzo3 wrote

If it's anything like their algorithm that shows me the tweets from a trending topic, I wouldn't want it.

13

Long_Educational t1_jegm6fp wrote

There is too much money at stake for there not to be additional invisible weights that Twitter can tweak behind the scenes.

For example, I would imagine a 2 billion dollar stake by the Saudis would purchase huge influence. This goes for anyone else that Elon "hangs" with during the Olympics, the Super Bowl, or the FIFA World Cup.

6

Necessary-Meringue-1 t1_jegshy4 wrote

It's a pretty cool resource to get to look at an enterprise recommendation algorithm like that.

An aside, if you want a chuckle, search the term "Elon" in the repo:
https://github.com/twitter/the-algorithm/search?q=elon
https://github.com/twitter/the-algorithm/search?q=elon&type=issues

[edit 1]
since it's gone now, here's the backup provided by u/MjrK: https://i.imgur.com/jxqaByA.png
[edit 2] lol
https://github.com/twitter/the-algorithm/commit/ec83d01dcaebf369444d75ed04b3625a0a645eb9#diff-a58270fa1b8b745cd0bd311bed9cd24c983de80f96e7bd445e16e88b61e492b8L225

100

LoaderD t1_jegsuar wrote

> Here we have a world-class complex recommendation

...You know this is Twitter's recommender system, right? All the tweets I interact with are ML related, from very 'left' people like Jeremy Howard.

My recommender system could legit be:

if interested_in_finance_or_ML:
    recommend_alt_right_hate_speech_accounts()
    recommend_crypto_scam_ads()

24

MjrK t1_jegtjqj wrote

We don't know, and likely we won't.

Unless perhaps someone internal checks and leaks important missing details later on...

But for now, it does seem robust enough to be reflective of what they have probably been using up to some recent point, though that's still just speculation.

10

ZestyData t1_jeh12gm wrote

Idk man, as a fairly well-seasoned MLE I find their general architecture and the scale of their combined models to be fascinating in and of itself.

Twitter sucks ass - but this is a beautiful piece of ML Engineering.

8

ZestyData t1_jeh198p wrote

This quite obviously isn't the repo used by twitter.

It is a pretty large and well put together documentation epic & consolidation of multiple microservices.

Whether the content is 100% reflective of what's deployed is completely unclear. But it's not "fake", that's for sure; it's genuinely too many man-years of work to not be, in essence, real.

52

midnitte t1_jeh1w7t wrote

I wonder if this is an effort to save face after the source code leak

3

midasp t1_jeh2awl wrote

It's kinda nice to see PageRank is still being used as one of the components of the algorithm

48

codingwoman_ t1_jeh2iw5 wrote

Well, the devil is in the details. Don't miss the fun part in the commit messages :)

> Please note we have force-pushed a new initial commit in order to remove some publicly-available Twitter user information. Note that this process may be required in the future.

40

FinancialElephant t1_jeh33j9 wrote

Most infrastructure code, like computer vision code, device drivers, etc., has little or no cultural relevance.

I don't think it makes any sense to prioritize them when things like twitter have much more direct cultural impact. It would be great if my network card driver was open source, but does it really matter? Is it worth prioritizing? Will it likely have any cultural relevance? To most people the answer to all these questions is no.

−6

master3243 t1_jeh48sn wrote

I don't take any CEO's words at face value without considering the monetary values and incentives behind that tongue.

A large project like this being open-sourced, even if it's a very old or heavily stripped down version, is always a great thing for the community.

21