Submitted by mems_m t3_113lu1q in MachineLearning
suflaj t1_j8qwt5d wrote
People usually create datasets when they work on something new. I don't know why you would think that, just because a dataset exists, you can't do anything with it or that you even need to outperform anything.
mems_m OP t1_j8qwyhk wrote
They want us to find an existing dataset because of the short time we have, and novelty is a big part of the assessment.
suflaj t1_j8qx0qv wrote
As I've said, there is no reason you can't do something novel with that; you just can't do what someone else has already done with it.
mems_m OP t1_j8qx61s wrote
The thing is that almost everything I can do has already been done on the public datasets I find.
suflaj t1_j8qxasd wrote
That's more of an issue with how you're searching. You mention sentiment analysis, for example, but it is a problem that has been considered solved for years. There is no novelty you could add here besides a bigger model.
Obviously you need to stop looking at what people have done, and start looking at what they skipped or did poorly in the process of doing it. One such thing is tokenization of text. You can't tell me that it's all figured out.
timelyparadox t1_j8qxcss wrote
Yes, and finding small novel things to do is a big part of how you show you are worth a master's degree.
redflexer t1_j8ryw02 wrote
Actually, I find this notion harmful. I expect senior PhD students to be able to assess whether an idea in their field is novel, feasible, and in the right scope given fixed resources. I would never expect that from Master's students. That does not mean, of course, that students can’t have great ideas, but it’s not mandatory for a degree.
mems_m OP t1_j8qx1eg wrote
Novelty could be in the data or in the methods applied.