Submitted by Super-Martingale t3_y4w0sw in MachineLearning
CremeEmotional6561 t1_ishxulj wrote
The same problem exists with music artists. One solution is not to repeat the work that has already been done by others.
-
Scrape https://www.chartsurfer.de/archiv/artist-a.html for a list of all artist names.
-
Scrape https://www.discogs.com/de/artist/28795-Prince for a list of alias names for each artist.
-
Do just simple Levensthein for spelling errors and prompt the user if in doubt.
Super-Martingale OP t1_isk4gcc wrote
Where can we get "alias names" for the universe of US companies?
Viewing a single comment thread. View all comments