chief167 t1_jbk7gw0 wrote on March 9, 2023 at 5:14 PM

Reply to comment by Wodecki in Meet The World's Cleanest Fully Electric Car That Removes Carbon Dioxide From The Air by Anderson069

Haha lol, so just once a year, think a bit ahead so you save one grocery trip, and you're already better off

chief167 t1_jbjx5jb wrote on March 9, 2023 at 4:09 PM

Reply to comment by Doctor_Amazo in Meet The World's Cleanest Fully Electric Car That Removes Carbon Dioxide From The Air by Anderson069

Yeah, doesn't an ICE exhaust that in like 30 miles?

chief167 t1_j9ku5mq wrote on February 22, 2023 at 6:37 PM

Reply to comment by activatedgeek in [D] "Deep learning is the only thing that currently works at scale" by GraciousReformer

I don't think it implies that all datasets are equally likely. I think it only implies that given all possible datasets, there is no best approach to modelling them. All possible != All are equally likely

But I don't have my book with me, and I do t trust the internet since it seems to lead to random blogposts instead of the original paper (Wikipedia gave a 404 in the footnotes)

chief167 t1_j9kt5ho wrote on February 22, 2023 at 6:31 PM

Reply to comment by Featureless_Bug in [D] "Deep learning is the only thing that currently works at scale" by GraciousReformer

We use gradient boosting at quite a big scale. Not LLM big, but still big. It's just not NLP or CV at all. It's for fraud detection in large transactional tabular datasets. And it outperforms basically all neural network, shallow or deep, approaches.

chief167 t1_j9jev01 wrote on February 22, 2023 at 12:03 PM

Reply to [D] "Deep learning is the only thing that currently works at scale" by GraciousReformer

Define scale

Language models? Sure. Images? Sure. Huge amounts of transaction data to search for fraud? Xgboost all the way lol.

Church no free lunch theorem: there is no single approach best for every possible problem. Djeezes I hate it when marketing takes over. You learn this principle in the first chapter of literally every data course

chief167 t1_j7kl5jd wrote on February 7, 2023 at 2:01 PM

Reply to comment by keepthepace in [N] Google: An Important Next Step On Our AI Journey by EducationalCicada

Yeah OpenAI was founded to be... Well... open.

It's the most closed ai company in existence probably

chief167 t1_j7kl03z wrote on February 7, 2023 at 1:59 PM

Reply to comment by ---AI--- in [N] Google: An Important Next Step On Our AI Journey by EducationalCicada

GPT is largely built on Google research

chief167 t1_j7kkx9g wrote on February 7, 2023 at 1:59 PM

Reply to comment by WokeAssBaller in [N] Google: An Important Next Step On Our AI Journey by EducationalCicada

It's smart by Google to wait until Microsoft burns the 10 billion, then easily surpass it.

The hype is so painful at the moment, non technical people and sales idiots are way overselling chatgpt.

chief167 t1_j4n7wdv wrote on January 16, 2023 at 10:19 PM

Reply to [D] Tim Dettmers' GPU advice blog updated for 4000 series by init__27

A little bit dangerous, because the A100 is a beast when you look at performance/kWh. If you run a lot of heavy workloads, it's your best option, but on this chart it looks like the worst

TCO != Purchase price

chief167 t1_j46k3aa wrote on January 13, 2023 at 2:55 PM

Reply to comment by MustachedLobster in [D] Has ML become synonymous with AI? by Valachio

Slam is pure Mechatronics which I don't consider ml.

chief167 t1_j45eo3z wrote on January 13, 2023 at 7:39 AM

Reply to comment by wind_dude in [D] Has ML become synonymous with AI? by Valachio

AI is basically decision making. Giving information, how does a machine learn from its environment, take decisions, without human oversight. How does a machine adapt itself with more experience.

ML is just a way to create models.

For example the SLAM algorithm is an important algorithm in AI, because it allows robots to map their environment. However, this is not ML at all.

Another example of AI is knowledge graphs, like the earliest chess engines. A perfect chess AI can be made without any machine learning at all.

It's important to keep making the distinction.

chief167 t1_j45e8ru wrote on January 13, 2023 at 7:34 AM

Reply to comment by redditsucks1337 in [D] Has ML become synonymous with AI? by Valachio

I view ML as a way to create an accurate model, and AI as a way to have a model take decisions in a complex environment without humans in the loop.

There is a large overlap, but you can easily have one without the other.

chief167 t1_j3qzdfl wrote on January 10, 2023 at 2:01 PM

Reply to comment by [deleted] in [D] Found very similar paper to my submitted paper on Arxiv by [deleted]

arxiv is really planting your flag on the topic. I dont trust it because its not peer reviewed, but its definitely a good way to call dibbs

chief167 t1_j3qeduk wrote on January 10, 2023 at 10:25 AM

Reply to comment by aigyfkkq in [D] What is the most complete reference on the history of neural networks? by gbfar

> there’s some commentary on those early papers in the Neurocomputing 1 and 2 collections

could you link or give a doi or something?

chief167 t1_j3borij wrote on January 7, 2023 at 11:58 AM

Reply to [D] I recently quit my job to start a ML company. Would really appreciate feedback on what we're working on. by jrmylee

First thought: decide for yourself who your target audience is

If you hope to sell this to companies, or even start-ups, be prepared for a lot of questions around data governance, security, ....

Second: do you have an idea how many users you need for break even and how the infrastructure needs to scale to cope with that? Gpu's aren't cheap of course, neither is electricity or cloud providers

chief167 t1_j0funra wrote on December 16, 2022 at 9:36 AM

Reply to Neural networks and machine learning for data science in business [D] by lordgriefter

xgboost or lightgbm are the goto for classic tabular business data

chief167 t1_j027p7v wrote on December 13, 2022 at 3:34 PM

Reply to comment by maxafrass in [Discussion] Amazon's AutoML vs. open source statistical methods by fedegarzar

Don't waste your time. Check datarobot (and H2O is the closest competition).

Everybody else plainly sucks at automl, sorry to put it so bluntly but it's true

I am a happy customer of them, and it took a mountain of effort to convince our it teams to move away from Microsoft and databricks etc..., But the results were just in another ballpark, so we had a strong business case

chief167 t1_j027csj wrote on December 13, 2022 at 3:32 PM

Reply to [Discussion] Amazon's AutoML vs. open source statistical methods by fedegarzar

Honestly, if you want decent automl results, you should only consider datarobot. Everything else is noticeably worse

We are a customer of them and it's a game changer. Yes it's expensive and not aimed at hobbyists, and it's like super expensive. But it's good

If I find the time, I shall upload this dataset into our system and check the results. Remind me later if I forget

chief167 t1_isuculx wrote on October 18, 2022 at 7:36 PM

Reply to comment by SkinnyJoshPeck in [D] How frustrating are the ML interviews these days!!! TOP 3% interview joke by Mogady

Ok yeah well that's stupid. Because I am actually in favour of column names instead of indexes. Indexes are pain in the ass when your incoming dataframe changes, it creates an implicit dependency.

But your last line is my point. You shouldn't be concerned about MLops stuff, but if your models is already in the right framework, it saves soooo much time

chief167 t1_istrptp wrote on October 18, 2022 at 5:19 PM

Reply to comment by SkinnyJoshPeck in [D] How frustrating are the ML interviews these days!!! TOP 3% interview joke by Mogady

As someone who sometimes has to hire people, perhaps this is the issue:

Imagine how difficult it is for big companies to get a MLOps framework going, with all the red tape and scattered IT systems. It was very painful where I work. In the end we got something working using a python platform that really needs you to use pandas and sklearn type interfaces.

Let's hypothetically say you are a great data scientist using R, or Sas or MATLAB or ... If I don't have a lot of options I'd hire you and put you on a training program for our framework. But if I have multiple decent candidates, and some don't require retraining, yeah imma gonna pick one of them. I am not spending 2 months trying to get compliance and cybersec to approve your docker container with R code in it, if I can have a similar model in our pre-approved workflow.

chief167 t1_is9lmou wrote on October 14, 2022 at 8:52 AM

Reply to comment by lifesthateasy in [D] Modern MLOps architecture info sources by lifesthateasy

Would you ask a doctor for a few online resources on your medical treatment? You are completely disrespectful towards the architecture profession if you think a few online resources are all you need to get started, and that you'll just fix it in the future by hiring an architect to clean up your mess.

chief167 t1_is9lh2u wrote on October 14, 2022 at 8:50 AM

Reply to [D] Modern MLOps architecture info sources by lifesthateasy

I will give you free advice, for once: don't trust any of the online simple architecture articles and take free advice. It will cost you much more in the end

Designing an architecture for your company is a multi week project, with lots of nuances and decisions. There is no best option, and especially without your business context it's literally impossible to recommend something good. From your question, it's clear your architecture expertise is very low

Do yourself a favour and get an architect, preferably from a dedicated local data specific consulting company and not a big box all-rounder like Accenture or TCS. Expect day rates of 1200-1500 if it's a short term project.

What were you expecting, someone to say 'just use Google cloud', and just go with it?

Don't fool yourself, if you are not ready to build an architecture, don't. Youll have to start over next year. You were hired for the wrong job then